BULL Documentation
ou juste avant la balise de fermeture -->

 

 

 CD ROM Annuaire d'Entreprises France prospect (avec ou sans emails) : REMISE DE 10 % Avec le code réduction AUDEN872

10% de réduction sur vos envois d'emailing --> CLIQUEZ ICI

Retour à l'accueil, cliquez ici

ou juste avant la balise de fermeture --> DOCUMENTATION BULL http://support.bull.com/documentation/byproduct/soldoc_popupList?k=servers/escala&fromsso=y

http://support.bull.com/documentation/byproduct/soldoc_popupList?k=servers/ns&fromsso=y

[   ] 86A101FH01.pdf 16-Dec-2011 19:33 15M [   ] 86A102ET05.pdf 16-Dec-2011 17:07 6.7M [   ] 86A102FA01.pdf 16-Dec-2011 20:45 81M [   ] 86A102FH01.pdf 16-Dec-2011 19:37 16M [   ] 86A114FG01.pdf 16-Dec-2011 19:28 20M [   ] 86A116EF01.pdf 16-Dec-2011 19:55 20M [   ] 86A117EF01.pdf 16-Dec-2011 20:03 11M [   ] 86A121EG01.pdf 16-Dec-2011 18:26 11M [   ] 86A121EM01.pdf 16-Dec-2011 16:49 9.8M [   ] 86A125EM01.pdf 16-Dec-2011 16:52 5.5M [   ] 86A129FG01.pdf 16-Dec-2011 19:14 14M [   ] 86A134EM00.pdf 16-Dec-2011 17:14 6.7M [   ] 86A134ER00.pdf 16-Dec-2011 17:04 5.8M [   ] 86A138EG01.pdf 16-Dec-2011 19:07 13M [   ] 86A141EM06.pdf 16-Dec-2011 16:49 9.5M [   ] 86A142EG00.pdf 16-Dec-2011 18:33 12M [   ] 86A150FD06.pdf 16-Dec-2011 17:18 3.8M [   ] 86A154FF03.pdf 16-Dec-2011 18:14 45M [   ] 86A156EG01.pdf 16-Dec-2011 18:38 11M [   ] 86A169FG02.pdf 16-Dec-2011 18:50 10M [   ] 86A171FA10.pdf 16-Dec-2011 17:16 4.7M [   ] 86A172EG01.pdf 16-Dec-2011 17:01 5.9M [   ] 86A182EF03.pdf 16-Dec-2011 18:04 26M [   ] 86A185ER00.pdf 16-Dec-2011 17:11 6.7M [   ] 86A189EF02.pdf 16-Dec-2011 16:57 5.0M [   ] 86A190FF03.pdf 16-Dec-2011 19:44 16M [   ] 86A194EF01.pdf 16-Dec-2011 18:20 11M [   ] 86A194EM01.pdf 16-Dec-2011 17:03 11M [   ] 86A196KX00.pdf 16-Dec-2011 20:04 3.3M [   ] 86A197KX00.pdf 16-Dec-2011 20:05 4.7M [   ] 86A198KX00.pdf 16-Dec-2011 20:02 4.6M [   ] 86A235ER00.pdf 16-Dec-2011 20:50 34M [   ] 86A254FA04.pdf 16-Dec-2011 17:19 1.4M [   ] 86A255FA04.pdf 16-Dec-2011 17:17 2.5M [   ] 86A256FA04.pdf 16-Dec-2011 17:19 3.4M [   ] 86A257FA04.pdf 16-Dec-2011 17:18 831K [   ] 86A258FA05.pdf 16-Dec-2011 17:18 464K [   ] 86A259FA04.pdf 16-Dec-2011 17:18 1.7M [   ] 86F102ET05.pdf 16-Dec-2011 17:07 8.5M [   ] 86F102FA01.pdf 16-Dec-2011 20:39 68M [   ] 86F114FG01.pdf 16-Dec-2011 19:07 33M [   ] 86F129FG01.pdf 16-Dec-2011 18:45 14M [   ] 86F134EM00.pdf 16-Dec-2011 17:10 6.2M [   ] 86F141EM06.pdf 16-Dec-2011 16:54 9.7M [   ] 86F150FD06.pdf 16-Dec-2011 17:16 3.8M [   ] 86F189FF03.pdf 16-Dec-2011 19:45 24M [   ] 86F190FF03.pdf 16-Dec-2011 19:26 28M [   ] 86F194EM01.pdf 16-Dec-2011 16:58 11M [   ] 86X116EM00.pdf 16-Dec-2011 19:00 108M Bull ESCALA Expandable Storage Plus 2104 Models DS4 and TS4 Translated Safety Notices 86 X1 16EM 00 ORDER REFERENCE Bull ESCALA Expandable Storage Plus 2104 Models DS4 and TS4 Translated Safety Notices 86 X1 16EM 00 ORDER REFERENCE Bull ESCALA Expandable Storage Plus 2104 Models DS4 and TS4 Translated Safety Notices Hardware November 2003 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 X1 16EM 00 ORDER REFERENCE The following copyright notice protects this book under the Copyright laws of the United States of America and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull S.A. 1992, 2003 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. AIX is a registered trademark of International Business Machines Corporation, and is being used under licence. UNIX is a registered trademark in the United States of America and other countries licensed exclusively through the Open Group. Linux is a registered trademark of Linus Torvalds. The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material. Contents About this document . . . . . . . . . . . . . . . . . . . . . . v Who should read this document . . . . . . . . . . . . . . . . . . . v Additional information . . . . . . . . . . . . . . . . . . . . . . . v 2104 Model DS4 and Model TS4 library . . . . . . . . . . . . . . . v Related documents . . . . . . . . . . . . . . . . . . . . . . . v Web sites . . . . . . . . . . . . . . . . . . . . . . . . . . vi How to send your comments . . . . . . . . . . . . . . . . . . . vi Expandable Storage Plus 2104 Model DS4 and Model TS4 translations . . . 1 Bulgarian . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Brazilian-Portuguese . . . . . . . . . . . . . . . . . . . . . . . 24 Chinese, simplified . . . . . . . . . . . . . . . . . . . . . . . 47 Chinese, traditional . . . . . . . . . . . . . . . . . . . . . . . 70 Croatian . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 Czech . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116 Danish . . . . . . . . . . . . . . . . . . . . . . . . . . . 139 Dutch . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162 English . . . . . . . . . . . . . . . . . . . . . . . . . . . 185 Finnish . . . . . . . . . . . . . . . . . . . . . . . . . . . 208 French . . . . . . . . . . . . . . . . . . . . . . . . . . . 231 German . . . . . . . . . . . . . . . . . . . . . . . . . . . 254 Greek . . . . . . . . . . . . . . . . . . . . . . . . . . . . 277 Hebrew . . . . . . . . . . . . . . . . . . . . . . . . . . . 300 Hungarian . . . . . . . . . . . . . . . . . . . . . . . . . . 323 Japanese . . . . . . . . . . . . . . . . . . . . . . . . . . 346 Korean . . . . . . . . . . . . . . . . . . . . . . . . . . . 369 Lithuanian . . . . . . . . . . . . . . . . . . . . . . . . . . 392 Macedonia . . . . . . . . . . . . . . . . . . . . . . . . . . 415 Norwegian . . . . . . . . . . . . . . . . . . . . . . . . . . 438 Polish . . . . . . . . . . . . . . . . . . . . . . . . . . . . 461 Portuguese . . . . . . . . . . . . . . . . . . . . . . . . . . 484 Romanian . . . . . . . . . . . . . . . . . . . . . . . . . . 507 Russian . . . . . . . . . . . . . . . . . . . . . . . . . . . 530 Slovak . . . . . . . . . . . . . . . . . . . . . . . . . . . 553 Slovenia . . . . . . . . . . . . . . . . . . . . . . . . . . . 576 Spanish . . . . . . . . . . . . . . . . . . . . . . . . . . . 599 Swedish . . . . . . . . . . . . . . . . . . . . . . . . . . . 622 Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . 645 Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . 646 Electronic emission statements . . . . . . . . . . . . . . . . . . 646 Federal Communications Commission (FCC) statement . . . . . . . . 646 Industry Canada compliance statement . . . . . . . . . . . . . . 646 Chinese Class A warning statement . . . . . . . . . . . . . . . . 646 European Community compliance statement . . . . . . . . . . . . 647 Germany compliance statement . . . . . . . . . . . . . . . . . 647 Japanese Voluntary Control Council for Interference (VCCI) class 1 statement . . . . . . . . . . . . . . . . . . . . . . . . 648 Korean Government Ministry of Communication (MOC) statement . . . . 648 Taiwan class A compliance statement . . . . . . . . . . . . . . . 648 © Copyright IBM Corp. 2003 iii iv Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices About this document This publication contains translated safety notices and safety inspection procedures for the Expandable Storage Plus 2104 Model DS4 and Model TS4 (hereafter referred to as the 2104 Model DS4 and Model TS4). Important: The installation of this product is the responsibility of the customer. Who should read this document This publication is intended for people who operate a system that connects one or more 2104 Model DS4s and Model TS4s to it. Throughout this publication, the term service representative applies to the person that has been authorized by your organization to maintain your 2104 Model DS4 and Model TS4. Additional information This section contains the following information: v A list of the documents in the 2104 Model DS4 and Model TS4 library v A list of the related documents v The available Web sites v Information about how to send your comments 2104 Model DS4 and Model TS4 library The following documents contain information related to this product: v Expandable Storage Plus 2104 Model DS4 and Model TS4 Hardware Technical Information, 86 A1 19EM v Expandable Storage Plus 2104 Model DS4 and Model TS4 Operator’s Guide, 86 A1 17EM v Expandable Storage Plus 2104 Model DS4 Installation Guide, 86 A1 14EM v Expandable Storage Plus 2104 Model TS4 Installation Guide, 86 A1 15EM v Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide, 86 A1 18EM v Expandable Storage Plus 2104 Model DS4 and Model TS4 Translated Safety Notices, 86 X1 16EM Related documents The following documents contain information related to this product: v The operator’s guide for your system v The user’s guide for your using system SCSI attachment (for example, your SCSI adapter) v The site and hardware planning information for your system v The problem solving guide and reference for your system v The 7014 Model T00 and T42 Rack Installation and Service Guide, 86 A1 94KX The following documents contain information related to the Expandable Storage Plus disk enclosures that attach to the RISC systems: © Copyright IBM Corp. 2003 v v Diagnostic Information for Multiple Bus Systems, 86 A1 26HX v Site Preparation Guide for Rack Systems, 86 A1 30PX v Adapters, Devices and Cable Information for Multiple Bus Systems, 86 A1 27HX vi Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations Following are the translated safety notices and environmental notices for the 2104 Model DS4 and Model TS4. © Copyright IBM Corp. 2003 1 Bulgarian 2 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 3 4 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 5 6 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 7 8 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 9 10 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 11 12 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 13 14 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 15 16 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 17 18 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 19 20 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 21 22 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 23 Brazilian-Portuguese 24 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 25 26 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 27 28 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 29 30 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 31 32 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 33 34 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 35 36 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 37 38 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 39 40 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 41 42 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 43 44 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 45 46 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Chinese, simplified Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 47 48 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 49 50 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 51 52 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 53 54 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 55 56 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 57 58 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 59 60 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 61 62 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 63 64 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 65 66 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 67 68 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 69 Chinese, traditional 70 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 71 72 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 73 74 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 75 76 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 77 78 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 79 80 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 81 82 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 83 84 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 85 86 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 87 88 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 89 90 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 91 92 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Croatian Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 93 94 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 95 96 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 97 98 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 99 100 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 101 102 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 103 104 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 105 106 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 107 108 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 109 110 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 111 112 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 113 114 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 115 Czech 116 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 117 118 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 119 120 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 121 122 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 123 124 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 125 126 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 127 128 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 129 130 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 131 132 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 133 134 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 135 136 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 137 138 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Danish Sikkerhedsmæssige og miljømæssige forskrifter Oversatte sikkerhedsforskrifter SCDAN901 Dette afsnit indeholder følgende oplysninger: Oversatte sikkerhedsforskrifter Retningslinjer for sikkerhedstjek af dette produkt Miljømæssige forskrifter Denne vejledning indeholder udelukkende sikkerhedsforskrifter. Sikkerhedsforskrifter mærket advarer om forhold, der kan være livsfarlige eller forårsage alvorlige ulykker. Sikkerhedsforskrifter mærket advarer om forhold, der kan være farlige. Sikkerhedsforskrifter mærket advarer om forhold eller procedurer, der kan forårsage skade på maskiner, udstyr, programmer eller data. Sikkerhedsforskrifterne er opstillet i numerisk rækkefølge efter deres id, der vises i parentes efter hver forskrift. Nedenfor vises eksempler på id-nummeret på sikkerhedsforskrifter mærket Fare! og Pas på!. Fare! Pas på! Bemærk Fare! En forkert forbundet stikkontakt kan give livsfarlig elektrisk spænding på metaldele i systemet eller på enheder, der er tilsluttet systemet. Det er kundens ansvar at sikre, at stikkontakten er korrekt forbundet til jord. (1) Pas på! Placér ikke hænder eller værktøj i den åbning, hvor ventilatoren har været monteret. (1) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 139 Statisk elektricitet Retningslinjer for sikkerhedstjek Tjek af rack SCDAN902 Bemærk: Tag følgende forholdsregler for at undgå statisk elektricitet, når du håndterer udskiftelige enheder (FRU) og andre computerdele: Begræns dine bevægelser. Bevægelse kan skabe statisk elektricitet omkring dig. Vær altid forsigtig, når du rører ved computerdele. Hold på kanten af kortene eller på metaldækslet. Rør aldrig ved et udækket kredsløb. Personer, der ikke har en antistatisk beskyttelsesstrop om håndleddet, skal undgå at røre ved computerdele. Gør følgende, inden du installerer en ny del: Sørg for, at transportposen med delen rører en ikke-malet metaldel på 2104 Model DS4, 2104 Model TS4 eller værtssystemet i mindst 2 sekunder. Herved reduceres den statiske elektricitet i pakken og i din krop. Tag delen ud af emballagen, og installér den om muligt direkte i 2104 Model DS4 eller Model TS4, uden først at lægge den fra dig. Gør følgende, hvis du skal lægge delen fra dig: 1. Placér den antistatiske emballage, delen blev leveret i, på en plan flade. 2. Læg delen oven på den antistatiske emballage. Læg ikke delen direkte på en metalflade. Vejledningen beskriver, hvordan du udfører et sikkerhedstjek af 2104 Model DS4 eller Model TS4. Der er oplysninger om udførelse af sikkerhedstjek af racket i installationsvejledningen til racket. Pas på! 2104 Model DS4 og Model TS4 er designet, så de kan installeres af kunden, og certificerede som "kundeinstallation". Kontrollér, at det system eller det rack, som 2104 Model DS4 eller Model TS4 skal installeres i, også er designet og de er certificeret til kundeinstallation. Er det ikke tilfældet, skal 2104 Model DS4 eller Model TS4 installeres af en servicetekniker. (22) 140 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Kontrol af 2104 Model DS4 eller Model TS4 Udvendigt maskintjek Indvendigt maskintjek SCDAN903 Foretag følgende indvendige kontrol af maskinen: 1. Kontrollér, om der er foretaget ikke-IBM-ændringer af maskinen. Hvis der er foretaget sådanne ændringer, skal du bestille formularen "Non-IBM Alteration Attachment Survey" nummer R009 via IBM's bestillingssystem - Publications Ordering System. Udfyld formularen, og send den. 2. Kontrollér maskinen indvendigt for følgende: Løse metaldele eller andre forurenende stoffer Tegn på vand eller anden væske Ild Røgskade 3. Kontrollér, om der er tydelige mekaniske problemer, f.eks. løse komponenter. 4. Kontrollér ikke-beskyttede kabler og stik for slid, revner eller klemskader. Foretag følgende udvendige kontrol af maskinen: 1. Kontrollér kabinettet for evt. skader (løse, afbrækkede eller skarpe kanter) 2. Kontrollér netledningerne, og tjek, at isoleringen ikke er slidt eller beskadiget. 3. Kontrollér, om der er foretaget åbenlyse ikke-standardændringer. Brug sund fornuft ved vurderingen af sikkerheden i forbindelse med sådanne ændringer. 4. Kontrollér, at alle udvendige dæksler er på plads og ikke er beskadiget. 5. Kontrollér, at alle låse og hængsler er i orden. 6. Kontrollér, at netledningen ikke er beskadiget. 7. Kontrollér, at ledningerne ikke er slidte, beskadigede eller sidder i klemme. 8. Tjek ventilator- og strømforsyningsenhederne. Kontrollér, at skruer eller nitter i dækslet til strømforsyningsenheden ikke er fjernet, og at de ikke sidder forkert. 9. Kontrollér, at det eksterne signalkabel ikke er beskadiget. 10. Kontrollér, at der ikke er nogen skarpe kanter på dækslet, at det ikke er beskadiget, og at der ikke er foretaget ændringer, som gør, at man kan komme til at berøre enhedens indvendige dele. 11. Afhjælp eventuelle problemer. Udfør nedenstående sikkerhedstjek for at klarlægge eventuelle farlige forhold. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 141 Kontrol af sikkerhedsmærkater SCDAN904 Kontrollér følgende i forbindelse med mærkaterne: 1. Kontrollér, at der er sikkerhedsmærkater på 2104 Model DS4 eller Model TS4. Se Figur 1. 2. Kontrollér, at advarselsmærkaten angående netledningen er placeret på strømforsyningen i 2104 Model DS4 eller Model TS4. Se Figur 2. SC000274 Figur 1. 2104 Model DS4- eller Model TS4-mærkat Figur 2. Advarselsmærkat angående netledning SC000325 Pas på! Enheden kan have to netledninger. Tag begge ledninger ud for at fjerne al strøm til maskinen. (1) 142 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN905 3. Kontrollér, at advarselsmærkaten angående strømforsyningsdækslet er placeret på strømforsyningen til 2104 Model DS4 eller Model TS4. Se Figur 3. 4. Kontrollér, at advarselsmærkaten angående sikringen er placeret på strømforsyningen til 2104 Model DS4 eller Model TS4. Se Figur 4. Figur 3. Advarselsmærkat angående strømforsyningsdæksel Figur 4. Advarselsmærkat angående sikring SC000326 SC000327 Pas på! Fjern ikke dækslet. Der er ingen dele, der kan repareres. (2) Pas på! Dobbeltpolet/neutral sikring. (3) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 143 SCDAN905A 5. Kontrollér, at mærkaten med vægtangivelse er placeret på 2104 Model DS4. Se Figur 5. Pas på! Denne enhed vejer over 32 kg. Der er flere oplysninger i manualerne. (4) Figur 5. 2104 Model DS4 - mærkat med vægtangivelse SC000220 Der er flere oplysninger I manualerne 32 kg Bemærk: Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner dem. Når du monterer diskene igen, skal du montere hver disk i samme bås, som du fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data. > 18 kg > 32 kg Pas på! Hvis der er installeret alt, hvad der er muligt i en 2104 Model DS4, det vil sige 14 diske og 2 strømforsyninger, er den samlede vægt 32 kg. Du skal fjerne mindst 6 diske fra 2104 Model DS4, før du løfter den eller installerer den i et rack eller i 2104 Model TS4. Derved bliver den samlede vægt mindre end 32 kg, og 2104 Model DS4 kan håndteres af to personer. Hvis vægten ikke reduceres, kan det resultere i personskade. 144 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Pas på! Enheden vejer mellem 32 kg og 55 kg. (5) Figur 6. 2104 Model TS4 - mærkat med vægtangivelse 32 - 55 kg SC000219 6. Kontrollér, at mærkaten med vægtangivelse er placeret på 2104 Model TS4. Se Figur 6. Pas på! Det medfølgende løfteværktøj skal bruges ved flytning og montering af produktet, når alt ekstraudstyr er installeret i det. Der skal 3 personer til at flytte det på forsvarlig måde. Hvis løfteværktøjet ikke er ved hånden, skal vægten reduceres til 32 kg eller derunder, ved at du fjerner alle de tunge komponenter (diske og strømforsyninger) fra produktet. Der skal så kun 2 personer til at flytte eller montere produktet. (6) Bemærk: Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner dem. Når du monterer diskene igen, skal du montere hver disk i samme bås, som du fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data. > 18 kg > 32 kg SCDAN906 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 145 SCDAN907 7. Kontrollér, at den mærkat, der angiver standbytilstand, er placeret på 2104 Model DS4 eller Model TS4. Se Figur 7. Pas på! Standbytilstand angives ved symbolet til højre for jævnstrøm direkte over kontakten, SW1. Når du placerer SW1 i højre position direkte under standbysymbolet, slukker du ikke for vekselstrømmen til enheden. (7) Figur 7. Mærkat angående standbytilstand SC000326 146 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN908 Dette afsnit beskriver, hvordan du kontrollerer jordforbindelsen til 2104 Model DS4 og Model TS4. Kontrol af jordforbindelse til 2104 Model DS4 Gør følgende for at kontrollere jordforbindelsen til 2104 Model DS4: 3. Bemærk: Elektriske kredsløb kan blive beskadiget, hvis de eksterne SCSI-kabler tilsluttes 2104 Model DS4, mens du tjekker jordforbindelsen. Kontrollér, at der ikke er tilsluttet eksterne SCSI-kabler til stikkene . 4. Kontrollér jordforbindelsen til 2104 Model DS4: a. Kig på selve de stik, som du fjernede fra strømfordelingsenheden eller backupbatterienheden i trin 2, og kontrollér, at det stikben, der giver jordforbindelsen, er i god stand. b. Brug et ohmmeter, når du forbinder den ene leder til 2104 Model DS4-rammen og den anden leder til det tredje stikben til jordforbindelse foran på de stik, som du fjernede i trin 2. Der skal være 0,1 ohm eller derunder mellem stikbenet til jordforbindelse på stikket og 2104 Model DS4-rammen. Figur 8. Strømforsyningskabler og SCSI-stik på 2104 Model DS4 Kontrol af jordforbindelse til 2104 Model DS4 og Model TS4 1. Sørg for, at der ikke er strøm til racket. Der er flere oplysninger i vejledningen til racket eller værtssystemet. 2. Kontrollér, at netledningerne er tilsluttet hver enkelt ventilator- og strømforsyningsenhed. Se Figur 8. Kontrollér også, at netledningerne er taget ud af strømfordelingsenheden eller backupbatterienheden i racket. Der er flere oplysninger i vejledningen til racket eller værtssystemet. 2 1 SC000210 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 147 SCDAN909 Fortsæt med trin 5, hvis du udfører service på en 2104 Model DS4 (220 volt). Fortsæt med tirn 7, hvis du udfører service på en 2104 Model DS4 (-48 volt). 5. Kontrollér, at der er forbindelse mellem 2104 Model DS4-rammen og stikbenet til jord på netledningen på hver enkelt ventilator- og strømforsyningsenhed. Se Figur 9. 6. Fortsæt med trin 9. 7. Kontrollér, at der er forbindelse mellem 2104 Model DS4-rammen og stikbenene 1 og 7 til jord på hvert hovednetledningsstik. Se Figur 10. 8. Fortsæt med trin 9. 9. Udskift en ventilator- og strømforsyningsenhed, hvis forbindelserne ikke er i orden. Der er flere oplysninger om ventilator- og strømforsyningsenheder i . Kontrollér jordforbindelsen igen. Hvis forbindelserne er i orden i en ventilatorog strømforsyningsenhed, kan problemet skyldes netledningen eller værtssystemets jordforbindelse. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Figur 9. Stikben til jord (220 V 2104 Model DS4 ) Figur 10. Stikben til jord (-48 V 2104 Model DS4) 1 SC000329 Kontrollen af jordforbindelsen er afsluttet, hvis jordforbindelsen er i orden. Fjern netledningerne fra ventilator- og strømforsyningsenheden i 2104 Model DS4, hvis jordforbindelsen ikke er i orden. SC000330 Ben 6 Ben 1 Ben 12 Ben 7 148 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN910 Fortsæt med trin 10, hvis du udfører service på en 2104 Model DS4 (220 volt). Læs i installationsvejledningen til racket eller værtssystemet for at isolere fejlen, hvis du udfører service på en 2104 Model DS4 (-48 volt). Kontrollér jordforbindelsen igen. 10. Kontrollér forbindelserne i netledningen. Hvis forbindelserne i netledningen ikke er i orden, skal du udskifte den med en ny, og derefter kontrollere jordforbindelsen igen. Installationsvejledningen til racket eller værtssystemet indeholder oplysninger om, hvordan du isolerer fejlen, hvis forbindelsen i netledningen er i orden. Gør følgende for at kontrollere jordforbindelsen til 2104 Model TS4: 2. Kontrollér, at den anden ende af netledningerne ikke er tilsluttet stikkontakten. Tag om nødvendigt ledningerne ud. Kontrol af jordforbindelse til 2104 Model TS4 1. Kontrollér, at der er tilsluttet en netledning i hvert stik 1 . Se Figur 11. Figur 11. Netledninger og SCSI-stik (2104 Model TS4) SC000230 1 2 3 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 149 SCDAN911 3. Elektriske kredsløb kan beskadiges, hvis de eksterne SCSI-kabler tilsluttes 2104 Model TS4, mens du tjekker jordforbindelsen. Kontrollér, at der ikke er tilsluttet eksterne SCSI-kabler til SCSI-stikkene . Se Figur 11. 4. Kontrollér, at der er forbindelse mellem 2104 Model TS4-rammen og stikbenet til jord på hver netledning. a. Tjek netledningerne, og kontrollér, at stikbenet til jord er i god stand. b. Brug et ohmmeter til at kontrollere, at forbindelsen mellem stikbenet til jord og jordforbindelsen til rammen er på 0,1 ohm eller derunder. 5. Kontrollen af jordforbindelsen er afsluttet, hvis forbindelsen er i orden. 6. Tag netledningerne ud af stikkontakterne, hvis forbindelsen til jord ikke er i orden. a. Kontrollér forbindelserne i hver netledning. b. Udskift en netledning, hvis den er fejlbehæftet. c. Kontrollér, at der er forbindelse mellem 2104 Model TS4-rammen og stikbenet til jord på netledningen til hver enkelt ventilator- og strømforsyningsenhed. Se Figur 12. Bemærk: d Udskift en ventilator- og strømforsyningsenhed, hvis forbindelsen ikke er i orden. Der er flere oplysninger i i afsnittet om ventilator- og strømforsyningsenheder. Fortsæt med "Kontrol af jordforbindelse til 2104 Model TS4", og udfør trin 1 til og med 6d for at udføre kontrollen af jordforbindelsen igen. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Figur 12. Stikben til jord (2104 Model TS4) 1 2 3 SC000331 150 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN912 Figur 13. Fjernelse af strøm fra 2104 Model DS4 Bemærk: Bemærk: Medmindre du har en særlig grund til at gøre det, skal du ikke fjerne strømmen fra værtssystemet eller fra 2104 Model DS4, medmindre du får besked på at gøre det i de vejledninger, du følger. Gør følgende for at fjerne strømmen fra 2104 Model DS4: 1. Kontrollér med kunden, at al kommunikation mellem 2104 Model DS4 og værtssystemet er standset. 2. Indstil kontakten dc on/standby på hver ventilator- og strømforsyningsenhed på standby. Se Figur 13. a. Nogle 2104 Model DS4-enheder har en ventilator- og strømforsyningsenhed og en ventilatorenhed. Ventilatorenheden har ingen dc on/standbykontakt. b. På en ventilator- og strømforsyningsenhed kan CHK-lampen lyse, selvom dc on/standbykontakten er sat til standby. Fjern strømmen fra 2104 Model DS4 eller Model TS4 1 Fjern strømmen fra 2104 Model DS4 SC000206 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 151 SCDAN913 3. Fare! Når du udfører næste trin, skal du fjerne netledningerne. Ledningerne er strømførende, når strømfordelingsenheden i racket eller UPSenheden (Uninterruptable Power Supply) er tændt. (1) 1 1 2 2 Bemærk: Bemærk: Medmindre du har en særlig grund til at gøre det, skal du ikke fjerne strømmen fra værtssystemet eller fra 2104 Model TS4, medmindre du får besked på at gøre det i de vejledninger, du følger. Gør følgende for at fjerne strømmen fra 2104 Model TS4: 1. Kontrollér med kunden, at al kommunikation mellem 2104 Model TS4 og værtssystemet er standset. 2. Indstil dc on/standbykontakten på hver ventilator- og strømforsyningsenhed bag på 2104 Model TS4 på standby. Se Figur 14. a. Nogle 2104 Model TS4-enheder har en ventilator- og strømforsyningsenhed og en ventilatorenhed. Ventilatorenheden har ingen dc on/standbykontakt. b. På en ventilator- og strømforsyningsenhed kan CHK-lampen lyse, selvom dc on/standbykontakten er sat til standby. Fjern strømmen fra 2104 Model TS4 Fjern netledningerne fra bagsiden af 2104 Model DS4. Se Figur 13. 4. Gør følgende for at slutte strømmen til 2104 Model DS4 igen: a. Montér netledningerne igen. b. Indstil dc on/standbykontakten på hver ventilator- og strømforsyningsenhed til on. Du kan konfigurere motorstartsekvensen på 2104 Model DS4. Der er flere oplysninger i afsnittene om kontakterne til automatisk start af disk og til forsinket motorstart i . Bemærk: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide 152 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN914 1 2 2 3. Fjern netledningerne fra bagsiden af 2104 Model TS4. Se Figur 14. 4. Gør følgende for at slutte strømmen til 2104 Model TS4 igen: a. Montér netledningerne igen. b. Indstil dc on/standbykontakten på hver ventilator- og strømforsyningsenhed på on. Du kan konfigurere motorstartsekvensen på 2104 Model TS4. Der er flere oplysninger i afsnittene om kontakterne til automatisk start af disk og til forsinket motorstart i . Bemærk: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Figur 14. Fjern strømmen fra 2104 Model TS4 SC000207 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 153 SCDAN915 Sikkerhedsforskrifter mærket Fare! (DANGER) Fare! Fare! Fare! Fare! Fare! Når du udfører næste trin, skal du fjerne netledningerne. Ledningerne er strømførende, når strømfordelingsenheden i racket eller UPS-enheden (Uninterruptable Power Supply) er tændt. (1) Sæt ikke en netledning i ventilator- og strømforsyningsenheden, før enheden er helt på plads, og fingerskruerne er skruet helt i. (3) Kabler til skærmterminaler, printere, telefoner eller overspændingsafledere til kommunikationsudstyr må ikke afmonteres i tordenvejr. (5) En forkert forbundet stikkontakt kan give livsfarlig elektrisk spænding på metaldele i systemet eller på udstyr, der er tilsluttet systemet. Det er kundens ansvar at sikre, at stikkontakten er korrekt forbundet til jord. Kabler til skærmterminaler, printere, telefoner eller overspændingsafledere til kommunikationsudstyr må ikke afmonteres i tordenvejr. (4) Forsøg ikke at åbne dækslerne til ventilator- og strømforsyningsenheden. (2) 154 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN916 Sikkerhedsforskrifter mærket Pas på! (CAUTION) Pas på! Enheden kan have to netledninger. Tag begge ledninger ud for at fjerne al strøm til maskinen. (1) Pas på! Fjern ikke dækslet. Der er ingen dele, der kan repareres. (2) Pas på! Dobbeltpolet/neutral sikring. (3) Figur 15. Advarselsmærkat angående netledning Figur 16. Advarselsmærkat angående dæksel til strømforsyning Figur 17. Advarselsmærket angående sikring SC000327 SC000326 SC000325 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 155 SCDAN916A Pas på! Enheden vejer over 32 kg. Der er flere oplysninger i manualerne. (4) Figur 18. 2104 Model DS4 - mærkat med vægtangivelse SC000220 Der er flere oplysninger i manualerne 32 kg Bemærk: Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner dem. Når du monterer diskene igen, skal du montere hver disk i samme bås, som du fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data. > 18 kg > 32 kg Pas på! Hvis der er installeret alt, hvad der er muligt i en 2104 Model DS4, det vil sige 14 diske og 2 strømforsyninger, er den samlede vægt 32 kg. Du skal fjerne mindst 6 diske fra 2104 Model DS4, før du løfter den eller monterer den i et rack eller i 2104 Model TS4. Derved bliver den samlede vægt under 32 kg, og 2104 Model DS4 kan håndteres af to personer. Hvis vægten ikke reduceres, kan det resultere i personskade. 156 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Pas på! Enheden vejer mellem 32 kg og 55 kg. (5) Pas på! Det medfølgende løfteværktøj skal bruges ved flytning eller montering af produktet, når alt ekstraudstyr er installeret i produktet. Der skal 3 personer til at flytte det på forsvarlig måde. Hvis løfteværktøjet ikke er ved hånden, skal vægten reduceres til 32 kg eller derunder, ved at du fjerner alle de tunge komponenter (diske og strømforsyninger) fra produktet. Der skal så kun 2 personer til at flytte eller montere produktet. (6) Figur 19. 2104 Model TS4 - mærkat med vægtangivelse SC000219 Bemærk: Hvis der er lagret data på diskene, skal du mærke diskene, før du fjerner dem. Når du monterer diskene igen, skal du montere hver disk i samme bås, som du fjernede den fra. Hvis du ikke gør det, kan du risikere at miste data. > 18 kg > 32 kg 32 - 55 kg SCDAN917 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 157 SCDAN918 Pas på! Standbytilstand angives ved symbolet til højre for jævnstrøm direkte over kontakten, SW1. Når du placerer SW1 i højre position direkte under standbysymbolet, slukker du ikke for vekselstrømmen til enheden. (7) Pas på! Støttepladen skal fastgøres korrekt foran i bunden af racket for at forhindre det i at vælte, når du fjerner 2104 Model DS4 fra racket. Du må ikke fjerne eller installere enheder, medmindre der er monteret en støtteplade på racket. (8) Pas på! Støttepladen skal fastgøres korrekt foran i bunden af racket for at forhindre det i at vælte, når du monterer 2104 Model DS4 i racket. Du må ikke montere eller fjerne enheder, medmindre der er monteret en støtteplade på racket. (9) Figur 20. Angivelse af standbytilstand SC000328 158 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN319 Pas på! Placér ikke hænder eller værktøj i den åbning, hvor ventilatoren har været monteret. (10) Pas på! Placér ikke hænder eller værktøj i den åbning, hvor ventilator- og strømforsyningsenheden har været monteret. (11) Pas på! Placér ikke hænder eller værktøj i den åbning, hvor SCSI-interfacekortet har været monteret. (12) Pas på! Dette produkt er forsynet med en 3-polet netledning og et trebensstik. Af sikkerhedshensyn skal netledningen altid bruges sammen med en korrekt jordforbundet stikkontakt. (13) Pas på! Rør ikke stikkontakten eller dens dækplade med andet end testpindene (probe), før du har udført dette sikkerhedstjek. (14) Pas på! Hvis aflæsningen ikke er uendelig, må du ikke fortsætte. Få det elektriske system bragt i orden, inden du fortsætter. Strømafbryderen må ikke slås til, før ovennævnte trin er udført tilfredsstillende. (15) Pas på! Brug ikke håndtagene på ventilatorenheden eller ventilator- og strømforsyningsenheden til at løfte 2104 Model DS4. Håndtagene er ikke konstrueret til at bære enhedens vægt. (16) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 159 SCDAN920 Pas på! Når du skubber enheden helt ind, flyttes grebet automatisk mod dets låste position. Pas på, at dine fingre ikke kommer i klemme mellem grebet og enheden. (17) Pas på! Sørg for, at hovednetledningen er fjernet fra den fejlbehæftede ventilator- og strømforsyningsenhed, før du fortsætter .(18) Pas på! Placér ikke hænder eller værktøj i åbningen oven over ventilator- og strømforsyningsenheden. (19) Pas på! Placér ikke hænder eller værktøj i åbningen mellem strømforsyningsenhederne. (20) Pas på! Du skal også overholde sikkerhedsforskrifterne for rackmodellerne T00 og T42, før du begynder at fjerne rammeenheden, hvis 2104 Model DS4 er installeret i en Model T00 eller T42. Sikkerhedsforskrifterne findes i afsnittet om systeminstallation i , SA38-0577 eller på webstedet: www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html (21) Pas på! 2104 Model DS4 og Model TS4 er designet, så de kan installeres af kunden, og de er certificerede som "kundeinstallation". Kontrollér, at det system eller det rack, som 2104 Model DS4 eller Model TS4 skal installeres i, også er designet og certificeret til kundeinstallation. Er det ikke tilfældet, skal 2104 Model DS4 eller Model TS4 installeres af en servicetekniker. (22) 7014 Model T00 and T42 Rack Installation and Service Guide 160 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCDAN921 Pas på! Placér ikke hænder eller værktøj i den åbning, hvor kortet har været monteret. (23) Pas på! Der skal tre personer til at løfte 2104 Model TS4. Forsøg ikke at løfte 2104 Model TS4 alene. (24) Pas på! Brug ikke håndtagene på ventilatorenheden eller ventilator- og strømforsyningsenheden til at løfte 2104 Model TS4. Håndtagene er ikke konstrueret til at bære enhedens vægt. (25) 2104 Model DS4 er designet, så den kan installeres af kunden, og den er certificeret som "kundeinstallation". Kontrollér, at det system eller det rack, som 2104 Model DS4 skal installeres i, også er designet og certificeret til kundeinstallation. Er det ikke tilfældet, skal 2104 Model DS4 installeres af en servicetekniker. (26) Pas på! Pas på! 2104 Model TS4 er designet, så den kan installeres af kunden, og den er certificeret som "kundeinstallation". Kontrollér, at det system, som 2104 Model TS4 skal installeres i, også er designet og certificeret til kundeinstallation. Er det ikke tilfældet, skal 2104 Model TS4 installeres af en servicetekniker. (27) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 161 Dutch 162 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 163 164 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 165 166 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 167 168 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 169 170 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 171 172 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 173 174 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 175 176 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 177 178 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 179 180 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 181 182 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 183 184 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices English Safety and environmental notices Safety notices and their translations SC000901 This section contains the following information: Safety notices and their translations Safety inspection procedures for this product Environmental notices and statements Safety notices are printed throughout this document. A notice warns you of conditions or procedures that can result in death or severe personal injury. A warns you of conditions or procedures that can cause personal injury that is neither lethal nor extremely hazardous. An notice warns you of conditions or procedures that can cause damage to machines, equipment, programs, or data. The notices are listed in numeric order based on their IDs, which are displayed in parentheses, at the end of each notice. See the following examples of danger and caution notices for the location of the ID number. Danger Caution Attention DANGER An electrical outlet that is not correctly wired could place a hazardous voltage on the metal parts of the system or the products that attach to the system. It is the customer’s responsibility to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. (1) CAUTION: Do not insert hands or tools into the opening of the empty space that contained the fan assembly. (1) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 185 Electrostatic discharge (ESD) Safety inspection procedures Inspecting the rack SC000902 Attention When you handle field-replaceable units (FRUs) and other computer parts, take these precautions to avoid static damage: Limit your movement. Movement can cause static electricity to build up around you. Always touch the computer parts carefully. Hold the cards by their edges or metal cover. Never touch any exposed circuits. Prevent people who are not correctly grounded from touching computer parts. Before you install a new part, touch the static-protective package that contains the part against an unpainted metal part of the 2104 Model DS4, 2104 Model TS4, or host system for at least 2 seconds. This reduces the static electricity in the package and in your body. Remove the part from its package and, if possible, install it directly into the 2104 Model DS4 or Model TS4 without putting the part down. If you must put the part down, follow these steps: 1. Place the static-protective package that contained the part onto a smooth, level surface. 2. Place the part onto the static-protective package. Do not place the part directly onto any metal surface. The safety inspection procedures describe how to inspect the 2104 Model DS4 or Model TS4. Refer to your rack installation manual for information about performing a safety inspection on the rack. CAUTION: The 2104 Model DS4 and Model TS4 are designed to be installed by the customer and are certified as customer setup. Make sure that the system or rack into which the 2104 Model DS4 or Model TS4 will be installed is also designed and certified for customer setup. If they are not, then the 2104 Model DS4 or Model TS4 must be installed by a CE. (22) 186 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Inspecting the 2104 Model DS4 or Model TS4 External machine checks Internal machine checks SC000903 Perform the following internal machine checks: 1. Check for any non-IBM changes that might have been made to the machine. If any are present, obtain the “Non-IBM Alteration Attachment Survey” form, number R009, from the IBM branch office. Complete the form, and return it to the branch office. 2. Check the condition of the inside of the machine for: Metal or other contaminants Indications of water or other fluid Fire Smoke damage 3. Check for any obvious mechanical problems, such as loose components. 4. Check any exposed cables and connectors for wear, cracks, or pinching. Perform the following external machine checks: 1. Check the chassis for damage (loose, broken, or sharp edges). 2. Check the power cables and ensure that the insulation is not worn or damaged. 3. Check for any obvious nonstandard changes. Use good judgment about the safety of any such changes. 4. Verify that all external covers are present and are not damaged. 5. Ensure that all latches and hinges are in correct operating condition. 6. Check the power cable for damage. 7. Check for worn, damaged, or pinched cables. 8. Inspect the fan-and-power-supply assemblies. Check that the fasteners in the cover of the power-supply unit (screws or rivets) have not been removed or disturbed. 9. Check the external signal cable for damage. 10. Check the cover for sharp edges, damage, or alterations that expose the internal parts of the device. 11. Correct any problems that you find. Perform the following safety checks to identify unsafe conditions. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 187 Safety label checks SC000904 Perform the following safety label checks: 1. Verify that the label is installed on the 2104 Model DS4 or Model TS4. See Figure 1. 2. Verify that the linecord caution label is installed on the 2104 Model DS4 or Model TS4 power supply. See Figure 2. SC000274 Figure 1. 2104 Model DS4 or Model TS4 label Figure 2. Linecord caution label SC000325 CAUTION: This unit might have two linecords. To remove all power, disconnect both linecords. (1) 188 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000905 3. Verify that the power supply cover caution label is installed on the 2104 Model DS4 or Model TS4 power supply. See Figure 3. 4. Verify that the fusing caution label is installed on the 2104 Model DS4 or Model TS4 power supply. See Figure 4. Figure 3. Power supply cover caution label Figure 4. Fusing caution label SC000326 SC000327 CAUTION: Do not remove cover, do not service, no serviceable parts. (2) CAUTION: Double Pole/Neutral Fusing. (3) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 189 SC000905A 5. Verify that the weight label is installed on the 2104 Model DS4. See Figure 5. CAUTION: This unit weighs over 32 Kg (70.5 lbs). Refer to manuals. (4) Figure 5. 2104 Model DS4 weight label SC000220 Refer to manuals 32 Kg (70.5 lbs) Attention: If you have data stored on the drives, label the drives before you remove them. When you replace the drives, install each one in the same drive bay from which you removed it. Failure to do so could result in a loss of data. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) CAUTION: If a 2104 Model DS4 is fully loaded with 14 drives and two power supplies, its total weight exceeds 32 Kg (70.5 lbs). You must remove at least six drives from the 2104 Model DS4 before you lift it or install it into a rack or 2104 Model TS4. This reduces the total weight to less than 32 Kg (70.5 lbs) and the can then be safely handled by two people. Failure to do so can result in injury. 2104 Model DS4 190 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices CAUTION: This unit weighs between 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5) Figure 6. 2104 Model TS4 weight label 32 - 55 Kg (70.5 - 121.2 lbs) SC000219 6. Verify that the weight label is installed on the 2104 Model TS4. See Figure 6. CAUTION: ) The provided cardboard lifting tool is required for moving, installing, and relocating the product when fully populated. Three people are required to safely move the product. Failure to do so might result in injury. In case the lifting tool is not readily available, you must reduce the weight to 32 Kg or less by removing all of the heavy components (disk drives and power supplies) from the product. Then only two people are required to move, install, and relocate the product. (6 Attention: If you have data stored on the drives, label the drives before you remove them. When you replace the drives, install each one in the same drive bay from which you removed it. Failure to do so could result in a loss of data. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) SC000906 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 191 SC000907 7. Verify that the standby condition label is installed on the 2104 Model DS4 or Model TS4. See Figure 7. CAUTION: A standby condition is indicated by the symbol to the right of the DC directly above the switch, SW1. When you toggle SW1 to the right position directly under the standby symbol, the ac-power to the unit is not shut off. (7) Figure 7. Standby condition label SC000326 192 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000908 This section contains instructions for checking the grounding of the 2104 Model DS4 or Model TS4. Checking the grounding of the 2104 Model DS4 Perform the following steps to check the grounding of the 2104 Model DS4: 3. Some electrical circuits could be damaged if the external SCSI cables are connected to the 2104 Model DS4 while the grounding check is being done. Ensure that no external SCSI cables are plugged at the connectors . 4. Check the grounding of the 2104 Model DS4: Attention: a. At the connector ends of the power plugs that you unplugged from the power distribution unit or battery-backup unit in step 2, do a visual inspection and ensure that the third wire ground pin is in good condition. b. Use an ohm meter to connect one lead to the 2104 Model DS4 chassis and the other lead to the third wire ground pin at the face of the power connectors that you unplugged in step 2. You should read 0.1 ohm o Model DS4 r less between the ground pin on the power connector and the 2104 chassis. Figure 8. Power cables and SCSI connectors on the 2104 Model DS4 Checking the grounding of the 2104 Model DS4 or Model TS4 1. Ensure that all power is removed from the rack. See the installation document for the rack or host system. 2. Ensure that the power cables are plugged into each fan-and-power-supply assembly. See Figure 8. Also ensure that the other ends of the power cables are unplugged from the power distribution unit or battery-backup unit in the rack. See the installation document for the rack or host system. 2 1 SC000210 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 193 SC000909 If you are servicing a 220-volt 2104 Model DS4, go to step 5. If you are servicing a -48 volt 2104 Model DS4, go to step 7. 5. Check for continuity between the chassis of the 2104 Model DS4 and the ground pin of the power connector on each fan-and-power-supply assembly. See Figure 9. 6. Go to step 9. 7. Check for continuity between the chassis of the 2104 Model DS4 and the ground pins 1 and 7 of each mainline-power connector. See Figure 10. 8. Go to step 9. 9. If any fan-and-power-supply assembly has no continuity, exchange that assembly for a new one. See the section about fan and power supply assemblies in the . Perform the complete grounding check again. If each fan-and-power-supply assembly has continuity, you might have a problem with the power cable or with the grounding of the host system. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Figure 9. Ground pin (220 V 2104 Model DS4 ) Figure 10. Ground pins ( -48 V 2104 Model DS4) 1 SC000329 If the grounding is correct, the grounding check is complete. If the grounding is not correct, unplug the power cables from the fan-and-power-supply assemblies in the 2104 Model DS4. SC000330 Pin 6 Pin 1 Pin 12 Pin 7 194 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000910 If you are servicing a 220-volt 2104 Model DS4, go to step 10. If you are servicing a -48 volt 2104 Model DS4, see the installation document for the rack or host system to isolate the fault. Perform the complete grounding check again. 10. Check the power cable for continuity. If the power cable does not have continuity, exchange it for a new one, then perform the complete grounding check again. If the power cable does have continuity, see the installation document for the rack or host system to isolate the fault. Perform the following steps to check the grounding of the 2104 Model TS4: 2. Ensure that the other ends of the power cables are not plugged into electrical power outlets. Unplug the cables if necessary. Checking the grounding of the 2104 Model TS4 1. Ensure that a power cable is plugged into each power socket . See Figure 11. 1 Figure 11. Power cables and SCSI connectors (2104 Model TS4) SC000230 1 2 3 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 195 SC000911 3. : Some electrical circuits could be damaged if the external SCSI cables are connected to the 2104 Model TS4 while the grounding check is being done. Ensure that no external SCSI cables are connected at the SCSI connectors . See Figure 11. 4. Check for continuity between the chassis of the 2104 Model TS4 and the ground pin of each power cable. a. Check the power cables and ensure that the third-wire ground connector is in good condition. b. Use a meter to check that the third-wire ground continuity is 0.1 ohm or less between the external ground pin and the chassis ground. 5. If the continuity is good, the grounding check is complete. 6. If the grounding is not correct, unplug the power cables from the mainline power connectors. a. Check each power cable for continuity. b. If either power cable is failing, exchange it for a new one. c. Check for continuity between the chassis of the 2104 Model TS4 and the ground pin of the power connector on each fan-and-power-supply assembly. See Figure 12. Attention d If either fan-and-power-supply assembly does not have continuity, exchange that assembly for a new one. (See the section about fan and power supply assemblies in the Go to “Checking the grounding of the 2104 Model TS4” and perform step 1 through step 6d to complete the grounding check again. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide.) Figure 12. Ground pin (2104 Model TS4) 1 2 3 SC000331 196 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000912 Figure 13. Removing the power from a 2104 Model DS4 Unless you have a particular reason to do so, do not remove power from the host system or from the 2104 Model DS4 unless the instructions that you are following tell you to. Perform the following steps to remove the power from a 2104 Model DS4: 1. Verify with the customer that all operations between the 2104 Model DS4 and the host system have stopped. 2. Set the dc on/standby switch of each fan-and-power-supply assembly to standby. See Figure 13. a. Some 2104 Model DS4s have a fan-and-power-supply assembly and a fan assembly. The fan assembly has no dc on/standby switch. b. A fan-and-power-supply assembly might have its CHK light on although its dc on/standby switch is set to standby. Note: Notes: Removing power from the 2104 Model DS4 or Model TS4 1 Removing power from a 2104 Model DS4 SC000206 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 197 SC000913 3. DANGER In the following step you are going to remove the power cables. These cables are live if the rack power distribution unit or uninterruptible power supply (UPS) unit is still switched on. (1) 1 1 2 2 Note: Notes: Unless you have a particular reason to do so, do not remove power from the host system or from the 2104 Model TS4 unless the instructions that you are following tell you to. Perform the following steps to remove the power from a 2104 Model TS4: 1. Verify with the customer that all operations between the 2104 Model TS4 and the host system have stopped. 2. At the back of the 2104 Model TS4, set the dc on/standby switch of each fan-and-power-supply assembly to standby. See Figure 14. a. Some 2104 Model TS4s have a fan-and-power-supply assembly and a fan assembly. The fan assembly has no dc on/standby switch. b. A fan-and-power-supply assembly might have its CHK light on although its dc on/standby switch is set to standby. Removing power from a 2104 Model TS4 Remove the power cables from the back of the 2104 Model DS4. See Figure 13. 4. Perform the following steps to return power to the 2104 Model DS4: a. Reinstall the power cables . b. Set the dc on/standby switch of each fan-and-power-supply assembly to on. You can configure the motor-start sequencing of the 2104 Model DS4. For more information, see the sections about the drive autostart switch and the delay motor start mode switch in the . Note: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide 198 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000914 1 2 2 3. Remove the power cables from the electrical power outlet and from the back of the 2104 Model TS4. See Figure 14. 4. Perform the following steps to return power to the 2104 Model TS4: a. Reinstall the power cables . b. Set the dc on/standby switch of each fan-and-power-supply assembly to on. You can configure the motor-start sequencing of the 2104 Model TS4. For more information, see the sections about the drive autostart switch and the delay motor start mode switch in the Note: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide. Figure 14. Removing power from a 2104 Model TS4 SC000207 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 199 SC000915 Danger notices DANGER DANGER DANGER DANGER DANGER In the following step you are going to remove the power cables. These cables are live if the rack power distribution unit or uninterruptible power supply (UPS) unit is still switched on. (1) Do not plug a power cable into the fan-and-power-supply assembly until the assembly is fully home and its thumbscrews are fully tightened. (3) During an electrical storm, do not disconnect cables for display stations, printers, telephones, or station protectors for communication lines. (5) An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to that system. It is the customer’s responsibility to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. During an electrical storm, do not disconnect cables for display stations, printers, telephones, or station protectors for communication lines. (4) Do not try to open the covers of the fan-and-power-supply assembly. (2) 200 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000916 Caution notices CAUTION: This unit might have two linecords. To remove all power, disconnect both linecords. (1) CAUTION: Do not remove cover, do not service, no serviceable parts. (2) CAUTION: Double Pole/Neutral Fusing. (3) Figure 15. Linecord caution label Figure 16. Power supply cover caution label Figure 17. Fusing caution label SC000327 SC000326 SC000325 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 201 SC000916A CAUTION: This unit weighs over 32 Kg (70.5 lbs). Refer to manuals. (4) Figure 18. 2104 Model DS4 weight label SC000220 Refer to manuals 32 Kg (70.5 lbs) Attention: If you have data stored on the drives, label the drives before you remove them. When you replace the drives, install each one in the same drive bay from which you removed it. Failure to do so could result in a loss of data. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) CAUTION: If a 2104 Model DS4 is fully loaded with 14 drives and two power supplies, its total weight exceeds 32 Kg (70.5 lbs). You must remove at least six drives from the 2104 Model DS4 before you lift it or install it into a rack or 2104 Model TS4. This reduces the total weight to less than 32 Kg (70.5 lbs) and the can then be safely handled by two people. Failure to do so can result in injury. 2104 Model DS4 202 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices CAUTION: This unit weighs between 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5) CAUTION: (6) The provided cardboard lifting tool is required for moving, installing, and relocating the product when fully populated. Three people are required to safely move the product. Failure to do so might result in injury. In case the lifting tool is not readily available, you must reduce the weight to 32 Kg or less by removing all of the heavy components (disk drives and power supplies) from the product. Then only two people are required to move, install, and relocate the product. Figure 19. 2104 Model TS4 weight label SC000219 Attention: If you have data stored on the drives, label the drives before you remove them. When you replace the drives, install each one in the same drive bay from which you removed it. Failure to do so could result in a loss of data. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) 32 - 55 Kg (70.5 - 121.2 lbs) SC000917 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 203 SC000918 CAUTION: A standby condition is indicated by the symbol to the right of the DC directly above the switch, SW1. When you toggle SW1 to the right position directly under the standby symbol, the ac-power to the unit is not shut off. (7) CAUTION: The stabilizer must be correctly attached to the bottom front of the rack to prevent the rack from tipping forward while the 2104 Model DS4 is being removed from the rack. Do not pull out or install any unit if a stabilizer is not attached to the rack. (8) CAUTION: The stabilizer must be correctly attached to the bottom front of the rack to prevent the rack from tipping forward while the 2104 Model DS4 is being installed into the rack. Do not remove or install any unit if a stabilizer is not attached to the rack. (9) Figure 20. Location of standby condition SC000328 204 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000319 CAUTION: Do not insert hands or tools into the empty space that contained the fan assembly. (10) CAUTION: Do not insert hands or tools into the empty space that contained the fan-and-power-supply assembly. (11) CAUTION: Do not insert hands or tools into the empty space that contained the SCSI interface card assembly. (12) CAUTION: This product is equipped with a 3-wire power cable and plug for the user’s safety. Use this power cable in conjunction with a correctly grounded electrical outlet to avoid an electrical shock. (13) CAUTION: Do not touch the power outlet or the power outlet face plate with anything other than test probes before you complete this safety check. (14) CAUTION: If the reading is not infinity, do not proceed. Make the necessary corrections to the wiring before you continue. Do not switch on the branch circuit CB until you satisfactorily complete the previous steps. (15) CAUTION: Do not use the handles of the fan or fan-and-power-supply assemblies to carry the 2104 Model DS4. These handles are not intended to support the weight of the unit. (16) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 205 SC000920 CAUTION: As you push the assembly fully home, the lever automatically moves toward its closed position. Ensure that your fingers do not become pinched between the lever and the assembly. (17) CAUTION: Ensure that the mainline power cable has been removed from the failing fan-and-power supply before you continue. (18) CAUTION: Do not insert hands or tools into the empty space above the fan-and-power supply assembly. (19) CAUTION: Do not insert hands or tools into the empty space between the power supply assemblies. (20) CAUTION: If the 2104 Model DS4 is installed in a Model T00 or T42 rack, you must also observe the safety notices for those racks before you start to remove the frame assembly. You can find the safety notices in the section about system installation in the SA38-0577, or at the following Web site: www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html (21) CAUTION: The 2104 Model DS4 and Model TS4 are designed to be installed by the customer and are certified as customer setup. Make sure that the system or rack into which the 2104 Model DS4 or Model TS4 will be installed is also designed and certified for customer setup. If they a the 2104 Model DS4 or Model TS4 must be installed by a CE. (22) 7014 Model T00 and T42 Rack Installation and Service Guide, re not, then 206 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000921 CAUTION: Do not insert hands or tools into the space that contained the card assembly. (23) CAUTION: It takes three people to lift the 2104 Model TS4. Do not attempt to lift the 2104 Model TS4 by yourself. Do not attempt to lift it without help from two other people. (24) CAUTION: Do not use the handles of the fan or fan-and-power-supply assemblies to carry the 2104 Model TS4. These handles are not intended to support the weight of the unit. (25) The 2104 Model DS4 is designed to be installed by the customer and is certified as customer setup. Make sure that the system or rack into which the 2104 Model DS4 will be installed is also designed and certified for customer setup. If they are not, then the 2104 Model DS4 must be installed by a CE. (26) CAUTION: CAUTION: The 2104 Model TS4 is designed to be installed by the customer and is certified as customer setup. Make sure that the system into which the 2104 Model TS4 will be installed is also designed and certified for customer setup. If it is not, then the 2104 Model TS4 must be installed by a CE. (27) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 207 Finnish 208 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 209 210 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 211 212 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 213 214 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 215 216 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 217 218 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 219 220 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 221 222 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 223 224 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 225 226 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 227 228 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 229 230 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices French Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 231 232 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 233 234 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 235 236 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 237 238 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 239 240 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 241 242 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 243 244 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 245 246 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 247 248 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 249 250 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 251 252 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 253 German Sicherheitshinweise und Hinweise zum Umweltschutz Sicherheitshinweise und ihre Übersetzungen SCGER901 Dieser Abschnitt enthält die folgenden Informationen: Sicherheitshinweise und ihre Übersetzungen Prozeduren zur Sicherheitsüberprüfung für dieses Produkt Hinweise zur Wiederverwertbarkeit und Entsorgung Dieses Buch enthält Sicherheitshinweise. Die Nichtbeachtung der Vorschriften in den mit gekennzeichneten Sicherheitshinweisen kann zu schweren Verletzungen oder sogar zum Tod führen. Die Nichtbeachtung der Vorschriften in den mit gekennzeichneten Sicherheitshinweisen kann zu leichten Verletzungen bzw. zu Schäden an Hardware oder Software führen. Die Sicherheitshinweise sind in numerischer Reihenfolge anhand ihrer IDs aufgeführt, die am Ende der einzelnen Sicherheitshinweise in Klammern stehen. Die folgenden Beispiele zeigen die Position der ID-Nummern in den Sicherheitshinweisen. Vorsicht Achtung Vorsicht Bei nicht ordnungsgemäß angeschlossener Netzsteckdose können an offen liegenden Metallteilen des Systems oder an angeschlossenen Einheiten gefährliche Berührungsspannungen auftreten. Für den ordnungsgemäßen Zustand der Steckdose ist der Betreiber verantwortlich. (1) Achtung: Keine Hände oderWerkzeuge in die Öffnung des freien Bereichs einführen, in dem sich die Ventilatorbaugruppe befand. (1) 254 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Elektrostatische Entladung (ESE) Prozeduren zur Sicherheitsüberprüfung Rack überprüfen SCGER902 Achtung: Bei der Handhabung von durch den Kundendienst austauschbaren Funktionseinheiten (FRUs) oder anderen Computerteilen die folgenden Vorsichtsmaßnahmen ergreifen, um Schäden durch statische Entladung zu vermeiden: Bewegungen einschränken. Durch Bewegung kann statische Aufladung aufgebaut werden. Computerteile vorsichtig anfassen. Karten an den Rändern oder an der Metallabdeckung anfassen. Niemals offen liegende Schaltkreise berühren. Vermeiden, dass Personen, die nicht korrekt geerdet sind, Computerteile berühren. Vor der Installation eines neuen Teils die antistatische Verpackung, in der sich das Teil befindet, mit einem unlackierten Metallteil des 2104 Modells DS4 oder Ts4 oder des Hostsystems mindestens zwei Sekunden lang berühren. Hierdurch wird die statische Elektrizität abgeleitet. Das Teil aus der Verpackung nehmen und (wenn möglich) direkt in das 2104 Modell DS4 oder TS4 installieren, ohne das Teil abzusetzen. Muss das Teil abgesetzt werden, wie folgt vorgehen: 1. Die antistatische Verpackung, in der sich das Teil befindet, auf eine glatte, ebene Oberfläche legen. 2. Das Teil auf die antistatische Verpackung und nicht direkt auf eine Metalloberfläche legen. In den Prozeduren zur Sicherheitsüberprüfung wird beschrieben, wie das 2104 Modell DS4 oder TS4 überprüft wird. Informationen über die Ausführung einer Sicherheitsüberprüfung für das Rack enthält das Installationshandbuch für das Rack. Achtung: Die 2104 Modelle DS4 und TS4 sind für die Installation durch Kunden konzipiert und zertifiziert. Achten Sie darauf, dass das System oder Rack, in das das 2104 Modell DS4 oder TS4 eingebaut wird, ebenfalls für die Installation durch den Kunden konzipiert und zertifiziert ist. Ist dies nicht der Fall, muss das 2104 Modell DS4 oder TS4 vom Kundendienst installiert werden. (22) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 255 2104 Modell DS4 oder TS4 überprüfen Externe Maschinenprüfungen Interne Maschinenprüfungen SCGER903 Führen Sie die folgenden internen Maschinenüberprüfungen aus: 1. Überprüfen Sie, ob Änderungen an der Maschine vorgenommen wurden, die nicht von IBM ausgeführt wurden. Ist dies der Fall, besorgen Sie sich das Formular für nicht von IBM ausgeführte Änderungen (Non-IBM Alteration Attachment Survey, Formnummer R009) bei der zuständigen Geschäftsstelle. Füllen Sie das Formular aus und geben Sie es an die Geschäftsstelle zurück. 2. Überprüfen Sie das Innere der Maschine auf: Metallteile oder andere Verschmutzungen Anzeichen von Wasser oder Feuchtigkeit Brandschäden Rauchschäden 3. Überprüfen Sie, ob offensichtliche mechanische Probleme wie z. B. lose Komponenten vorliegen. 4. Überprüfen Sie offen liegende Kabel auf Abnutzung oder Beschädigung. Führen Sie die folgenden externen Maschinenüberprüfungen aus: 1. Überprüfen Sie das Gehäuse auf Beschädigungen (lockere, abgebrochene oder scharfe Kanten). 2. Überprüfen Sie die Netzkabel. Die Isolierung darf nicht abgenutzt oder beschädigt sein. 3. Überprüfen Sie, ob vom Standard abweichende Änderungen vorgenommen wurden. Überprüfen Sie die Sicherheit solcher Änderungen mit besonderer Sorgfalt. 4. Überprüfen Sie, ob alle externen Abdeckungen vorhanden und unbeschädigt sind. 5. Überprüfen Sie, ob alle Verriegelungen und Scharniere in der korrekten Betriebsposition sind. 6. Überprüfen Sie das Netzkabel auf Beschädigungen. 7. Überprüfen Sie, ob Kabel abgenutzt oder beschädigt sind. 8. Überprüfen Sie die aus Ventilator und Netzteil bestehenden Baugruppen. Stellen Sie sicher, dass die Befestigungen in der Abdeckung des Netzteils (Schrauben oder Nieten) nicht entfernt wurden oder beschädigt sind. 9. Überprüfen Sie das externe Signalkabel auf Beschädigungen. 10. Überprüfen Sie die Abdeckung auf scharfe Kanten, Beschädigungen oder Veränderungen, durch die interne Teile der Einheit freigelegt werden. 11. Beheben Sie alle gefundenen Probleme. Führen Sie die folgenden Sicherheitsüberprüfungen aus, um Gefahrenquellen zu erkennen. 256 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Sicherheitsaufkleber überprüfen SCGER904 Führen Sie die folgenden Überprüfungen für Sicherheitsaufkleber durch: 1. Überprüfen Sie, ob der Aufkleber am 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 1. 2. Überprüfen Sie, ob der Sicherheitsaufkleber für Netzkabel am Netzteil für das 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 2. SCGER274 Abbildung 1. Aufkleber für 2104 Modell DS4 oder TS4 Abbildung 2. Sicherheitsaufkleber für Netzkabel SCGER325 Achtung: Diese Einheit könnte zwei Netzkabel haben. Um die Einheit ganz von der Netzspannung zu trennen, beide Netzkabel lösen. (1) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 257 SCGER905 3. Überprüfen Sie, ob der Sicherheitsaufkleber für die Netzteilabdeckung am Netzteil für das 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 3. 4. Überprüfen Sie, ob der Sicherheitsaufkleber für die Sicherung am Netzteil für das 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 4. Abbildung 3. Sicherheitsaufkleber für Netzteilabdeckung Abbildung 4: Sicherheitsaufkleber für Sicherung SCGER326 SCGER327 Achtung: Abdeckung nicht entfernen. Nicht reparieren. Keine wartungsbedürftigen Teile. (2) Achtung: Zweipolige bzw. Neutralleiter-Sicherung. (3) 258 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER905A 5. Überprüfen Sie, ob der Gewichtsaufkleber am 2104 Modell DS4 angebracht ist. Siehe Abbildung 5. Achtung: Diese Einheit wiegt über 32 kg. Siehe Handbücher. (4) Abbildung 5. Gewichtsaufkleber für 2104 Modell DS4 SCGER220 Refer to manuals 32 Kg (70.5 lbs) Achtung: Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) Achtung: Ist ein 2104 Modell DS4 mit 14 Laufwerken und 2 Netzteilen ausgestattet, überschreitet das Gesamtgewicht 32 kg. Es müssen mindestens 6 Laufwerke aus dem 2104 Modell DS4 entfernt werden, bevor es angehoben oder in ein Rack oder 2104 Modell TS4 installiert werden kann, damit das Gesamtgewicht auf unter 32 kg reduziert und das 2104 Modell DS4 sicher von zwei Personen gehandhabt werden kann. Andernfalls können Verletzungen auftreten. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 259 Diese Einheit wiegt zwischen 32 und 55 kg. (5) Achtung: Abbildung 6. Gewichtsaufkleber für 2104 Modell TS4 32 - 55 Kg (70.5 - 121.2 lbs) SCGER219 6. Überprüfen Sie, ob der Gewichtsaufkleber am 2104 Modell TS4 angebracht ist. Siehe Abbildung 6. Achtung: Wird ein vollständig ausgestattetes Produkt versetzt, installiert oder transportiert, ist das mitgelieferte Anhebewerkzeug erforderlich. Zum sicheren Versetzen, Installieren oder Transportieren des Produkts sind drei Personen erforderlich. Andernfalls können Verletzungen auftreten. Ist das Anhebewerkzeug nicht verfügbar, muss das Gewicht durch das Entfernen aller schweren Komponenten (Plattenlaufwerke und Netzteile) aus dem Produkt auf unter 32 kg reduziert werden, da in diesem Fall nur zwei Personen erforderlich sind, um das Produkt zu versetzen, zu installieren oder zu transportieren. (6) Achtung: Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) SCGER906 260 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER907 7. Überprüfen Sie, ob der Aufkleber für Bereitschaftsbedingung am 2104 Modell DS4 oder TS4 angebracht ist. Siehe Abbildung 7. Achtung: Durch das Symbol rechts von "DC", direkt über dem Schalter SW1, wird eine Bereitschaftsbedingung angegeben. Wird der Schalter SW1 nach rechts direkt unter das Bereitschaftssymbol gelegt, wird dieWechselstromversorgung der Einheit nicht ausgeschaltet. (7) Abbildung 7. Aufkleber für Bereitschaftsbedingung SCGER326 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 261 SCGER908 Dieser Abschnitt enthält Anweisungen zur Überprüfung der Erdung des 2104 Modells DS4 oder TS4. Erdung des 2104 Modells DS4 überprüfen Gehen Sie wie folgt vor, um die Erdung des 2104 Modells DS4 zu überprüfen: 3. Einige elektrische Schaltkreise können beschädigt werden, wenn die externen SCSI-Kabel an das 2104 Modell DS4 angeschlossen sind, während die Erdung überprüft wird. Achten Sie darauf, dass keine externen SCSI-Kabel an die Anschlüsse angeschlossen sind. 4. Überprüfen Sie die Erdung des 2104 Modells DS4: a. Stellen Sie an den Anschlussenden der Netzkabel, die in Schritt 2 aus der Stromversorgungseinheit oder der Notstromversorgung herausgezogen wurden, durch eine Sichtprüfung fest, ob die Schutzleiterverbindung in Ordnung ist. b. Schließen Sie eine Messleitung eines Ohm-Meters an das Gehäuse des 2104 Modells DS4 und die andere Messleitung an die Schutzleiterverbindung an der Vorderseite der Netzteilanschlüsse an, die in Schritt 2 herausgezogen wurden. Der zwischen dem Schutzleiterkontakt am Netzteilanschluss und dem Gehäuse des 2104 Modells DS4 gemessene Wert darf maximal 0,1 Ohm betragen. Achtung: Abbildung 8. Netzkabel und SCSI-Anschlüsse auf dem 2104 Modell DS4 Erdung des 2104 Modells DS4 oder TS4 überprüfen 1. Achten Sie darauf, dass die Stromzufuhr zum Rack ausgeschaltet ist.Weitere Informationen enthält das Installationsdokument für das Rack oder Hostsystem. 2. Achten Sie darauf, dass die Netzkabel in die aus Ventilator und Netzteil bestehenden Baugruppen eingesteckt sind. Siehe Abbildung 8. Achten Sie zudem darauf, dass die anderen Enden der Netzkabel aus der Stromversorgungseinheit oder der Notstromversorgung im Rack herausgezogen sind. Weitere Informationen enthält das Installationsdokument für das Rack oder Hostsystem. 2 1 SCGER210 262 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER909 Wird ein 220 Volt 2104 Modell DS4 gewartet, machen Sie mit Schritt 5 weiter. Wird ein -48 Volt 2104 Modell DS4 gewartet, machen Sie mit Schritt 7 weiter. 5. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des 2104 Modells DS4 und dem Schutzleiterkontakt des Netzteilanschlusses der einzelnen aus Ventilator und Netzteil bestehenden Baugruppen aus. Siehe Abbildung 9. 6. Machen Sie mit Schritt 9 weiter. 7. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des 2104 Modells DS4 und den Schutzleiterkontakten 1 und 7 der einzelnen Hauptstromanschlüsse aus. Siehe Abbildung 10. 8. Machen Sie mit Schritt 9 weiter. 9. Wird für eine aus Ventilator und Netzteil bestehende Baugruppe kein Durchgang gemessen, tauschen Sie die Baugruppe aus.Weitere Informationen enthält der Abschnitt über aus Ventilator und Netzteil bestehende Baugruppen im . Führen Sie die gesamte Erdungsprüfung erneut aus. Wird für jede aus Ventilator und Netzteil bestehende Baugruppe Durchgang gemessen, liegt der Fehler möglicherweise beim Netzkabel oder bei der Erdung des Hostsystems. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Abbildung 9. Schutzleiterkontakt (220 V 2104 Modell DS4) Abbildung 10. Schutzleiterkontakte (-48 V 2104 Modell DS4) 1 SCGER329 Ist die Erdung korrekt, ist die Erdungsüberprüfung abgeschlossen. Ist die Erdung nicht korrekt, ziehen Sie die Netzkabel aus den aus Ventilator und Netzteil bestehenden Baugruppen im 2104 Modell DS4. SCGER330 Kontaktstift 6 Kontaktstift 1 Kontaktstift 12 Kontaktstift 7 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 263 SCGER910 Wird ein 220 Volt 2104 Modell DS4 gewartet, machen Sie mit Schritt 10 weiter. Wird ein -48 Volt 2104 Modell DS4 gewartet, können Sie dem Installationsdokument für das Rack oder das Hostsystem Informationen zur Fehlerbestimmung entnehmen. Führen Sie die gesamte Erdungsprüfung erneut aus. 10. Überprüfen Sie das Netzkabel auf Durchgang. Wird am Netzkabel kein Durchgang gemessen, tauschen Sie das Netzkabel aus. Führen Sie dann die gesamte Erdungsprüfung erneut aus. Wird am Netzkabel Durchgang gemessen, können Sie dem Installationsdokument für das Rack oder das Hostsystem Informationen zur Fehlerbestimmung entnehmen. Gehen Sie wie folgt vor, um die Erdung des 2104 Modells TS4 zu überprüfen: 2. Achten Sie darauf, dass das andere Ende der Netzkabel nicht an eine Netzsteckdose angeschlossen ist. Ziehen Sie die Kabel gegebenenfalls heraus. Erdung des 2104 Modells TS4 überprüfen 1. Achten Sie darauf, dass an jeden Stromversorgungsanschluss ein Netzkabel abgeschlossen ist. Siehe Abbildung 11. 1 Abbildung 11. Netzkabel und SCSI-Anschlüsse (2104 Modell TS4) SCGER230 1 2 3 264 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER911 3. Einige elektrische Schaltkreise können beschädigt werden, wenn die externen SCSI-Kabel an das 2104 Modell TS4 angeschlossen sind, während die Erdung überprüft wird. Achten Sie darauf, dass keine externen SCSI-Kabel an die SCSI-Anschlüsse angeschlossen sind. Siehe Abbildung 11. 4. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des 2104 Modells TS4 und dem Schutzleiterkontakt der einzelnen Netzkabel aus. a. Überprüfen Sie die Netzkabel. Achten Sie darauf, dass die Schutzleiterverbindung in Ordnung ist. b. Messen Sie den Widerstand der Schutzleiterverbindung zwischen dem externen Schutzleiterkontakt und dem Gehäuse. Der gemessene Wert darf maximal 0,1 Ohm betragen. 5. Ist der Durchgang korrekt, ist die Erdungsüberprüfung abgeschlossen. 6. Ist die Erdung nicht korrekt, ziehen Sie die Netzkabel aus den Hauptstromanschlüssen. a. Überprüfen Sie die einzelnen Netzkabel auf Durchgang. b. Tauschen Sie die Netzkabel aus, bei denen der Durchgang nicht korrekt ist. c. Führen Sie eine Durchgangsprüfung zwischen dem Gehäuse des 2104 Modells TS4 und dem Schutzleiterkontakt des Netzteilanschlusses der einzelnen aus Ventilator und Netzteil bestehenden Baugruppen aus. Siehe Abbildung 12. Achtung: d. Wird für eine aus Ventilator und Netzteil bestehende Baugruppe kein Durchgang gemessen, tauschen Sie die Baugruppe aus. (Weitere Informationen enthält der Abschnitt über aus Ventilator und Netzteil bestehende Baugruppen im .) Machen Sie mit "Erdung des 2104 Modells TS4 überprüfen" weiter und führen Sie die Schritte 1 bis 6d aus, um die Erdungsprüfung erneut auszuführen. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Abbildung 12. Schutzleiterkontakt (2104 Modell TS4) 1 2 3 SCGER331 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 265 SCGER912 Abbildung 13. 2104 Modell DS4 von der Stromversorgung trennen Anmerkung: Anmerkungen: Liegt kein besonderer Grund vor, das Hostsystem oder das 2104 Modell Ds4 nur dann von der Stromversorgung trennen, wenn in den auszuführenden Anweisungen dazu aufgefordert wird. Gehen Sie wie folgt vor, um das 2104 Modell DS4 von der Stromversorgung zu trennen: 1. Überprüfen Sie zusammen mit dem Kunden, ob alle Operationen zwischen dem 2104 Modell DS4 und dem Hostsystem gestoppt wurden. 2. Setzen Sie den DC-Schalter Ein/Bereitschaft der einzelnen aus Ventilator und Netzteil bestehenden Baugruppen auf "Bereitschaft". Siehe Abbildung 13. a. Einige 2104 Modelle DS4 haben eine aus Ventilator und Netzteil bestehende Baugruppe und eine Ventilatorbaugruppe. Die Ventilatorbaugruppe hat keinen DC-Schalter Ein/Bereitschaft. b. Bei einer aus Ventilator und Netzteil bestehenden Baugruppe kann die Anzeige CHK an sein, obwohl der DC-Schalter Ein/Bereitschaft auf “Bereitschaft" steht. 2104 Modell DS4 oder TS4 von der Stromversorgung trennen 1 2104 Modell DS4 von der Stromversorgung trennen SCGER206 266 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER913 3. Vorsicht Im folgenden Schritt werden die Netzkabel entfernt. Ist die Stromversorgungseinheit des Gehäuses oder die unterbrechungsfreie Stromversorgung noch eingeschaltet, führen diese Kabel Strom. (1) 1 1 2 2 Anmerkung: Anmerkungen: Liegt kein besonderer Grund vor, das Hostsystem oder das 2104 Modell TS4 nur dann von der Stromversorgung trennen, wenn in den auszuführenden Anweisungen dazu aufgefordert wird. Gehen Sie wie folgt vor, um das 2104 Modell TS4 von der Stromversorgung zu trennen: 1. Überprüfen Sie zusammen mit dem Kunden, ob alle Operationen zwischen dem 2104 Modell TS4 und dem Hostsystem gestoppt wurden. 2. Setzen Sie den DC-Schalter Ein/Bereitschaft (er befindet sich an der Rückseite des 2104 Modells TS4) der einzelnen aus Ventilator und Netzteil bestehenden Baugruppen auf "Bereitschaft". Siehe Abbildung 14. a. Einige 2104 Modelle TS4 haben eine aus Ventilator und Netzteil bestehende Baugruppe und eine Ventilatorbaugruppe. Die Ventilatorbaugruppe hat keinen DC-Schalter Ein/Bereitschaft. b. Bei einer aus Ventilator und Netzteil bestehenden Baugruppe kann die Anzeige CHK an sein, obwohl der DC-Schalter Ein/Bereitschaft auf “Bereitschaft" steht. 2104 Modell TS4 von der Stromversorgung trennen Trennen Sie die Netzkabel von der Rückseite des 2104 Modells DS4. Siehe Abbildung 13. 4. Gehen Sie wie folgt vor, um das 2104 Modell DS4 wieder an die Stromversorgung anzuschließen: a. Schließen Sie die Netzkabel wieder an. b. Setzen Sie den DC-Schalter Ein/Bereitschaft der einzelnen aus Ventilator und Netzteil bestehenden Baugruppen auf "Ein". Sie können die Motorstartfolge des 2104 Modells DS4 konfigurieren. Weitere Informationen enthalten die Abschnitte über den Schalter für automatisches Starten des Laufwerks und für den Modus für verzögerten Motorstart im . Anmerkung: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 267 SCGER914 1 2 2 3. Trennen Sie die Netzkabel von der Netzsteckdose und von der Rückseite des 2104 Modells TS4. Siehe Abbildung 14. 4. Gehen Sie wie folgt vor, um das 2104 Modell TS4 wieder an die Stromversorgung anzuschließen: a. Schließen Sie die Netzkabel wieder an. b. Setzen Sie den DC-Schalter Ein/Bereitschaft der einzelnen aus Ventilator und Netzteil bestehenden Baugruppen auf "Ein". Sie können die Motorstartfolge des 2104 Modells TS4 konfigurieren.Weitere Informationen enthalten die Abschnitte über den Schalter für automatisches Starten des Laufwerks und für den Modus für verzögerten Motorstart im . Anmerkung: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Abbildung 14. 2104 Modell TS4 von der Stromversorgung trennen SCGER207 268 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER915 Sicherheitshinweise "Vorsicht" Vorsicht Vorsicht Vorsicht Vorsicht Vorsicht Im folgenden Schritt werden die Netzkabel entfernt. Ist die Stromversorgungseinheit des Gehäuses oder die unterbrechungsfreie Stromversorgung noch eingeschaltet, führen diese Kabel Strom. (1) Kein Netzkabel in die aus Ventilator und Netzteil bestehende Baugruppe einstecken, bis die Baugruppe vollständig in Ausgangsposition ist und die Rändelschrauben angezogen sind. (3) Während eines Gewitters keine Kabel von Bildschirmeinheiten, Druckern, Telefonapparaten oder Blitzschutzeinheiten lösen. (5) Bei nicht ordnungsgemäß angeschlossener Netzsteckdose können an Metallteilen des Gerätes und an angeschlossenen Einheiten gefährliche Berührungsspannungen auftreten. Für den ordnungsgemäßen Zustand der Steckdose ist der Betreiber verantwortlich.Während eines Gewitters keine Kabel von Bildschirmeinheiten, Druckern,Telefonapparaten oder Blitzschutzeinheiten lösen. (4) Die Abdeckungen der aus Ventilator und Netzteil bestehenden Baugruppe nicht öffnen. (2) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 269 SCGER916 Sicherheitshinweise "Achtung" Achtung: Diese Einheit könnte zwei Netzkabel haben. Um die Einheit ganz von der Netzspannung zu trennen, beide Netzkabel lösen. (1) Achtung: Abdeckung nicht entfernen. Nicht reparieren. Keine wartungsbedürftigen Teile. (2) Achtung: Zweipolige bzw. Neutralleiter-Sicherung. (3) Abbildung 15. Sicherheitsaufkleber für Netzkabel Abbildung 16. Sicherheitsaufkleber der Netzteilabdeckung Abbildung 17. Sicherheitsaufkleber für Sicherung SCGER327 SCGER326 SCGER325 270 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER916A Achtung: Diese Einheit wiegt über 32 kg. Siehe Handbücher. (4) Abbildung 18. Gewichtsaufkleber für 2104 Modell DS4 SCGER220 Refer to manuals 32 Kg (70.5 lbs) Achtung: Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) Achtung: Ist ein 2104 Modell DS4 mit 14 Laufwerken und 2 Netzteilen ausgestattet, überschreitet das Gesamtgewicht 32 kg. Es müssen mindestens 6 Laufwerke aus dem 2104 Modell DS4 entfernt werden, bevor es angehoben oder in ein Rack oder 2104 Modell TS4 installiert werden kann, damit das Gesamtgewicht auf unter 32 kg reduziert und das 2104 Modell DS4 sicher von zwei Personen gehandhabt werden kann. Andernfalls können Verletzungen auftreten. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 271 Achtung: Diese Einheit wiegt zwischen 32 und 55 kg. (5) Achtung: Wird ein vollständig ausgestattetes Produkt versetzt, installiert oder transportiert, ist das mitgelieferte Anhebewerkzeug erforderlich. Zum sicheren Versetzen, Installieren oder Transportieren des Produkts sind drei Personen erforderlich. Andernfalls können Verletzungen auftreten. Ist das Anhebewerkzeug nicht verfügbar, muss das Gewicht durch das Entfernen aller schweren Komponenten (Plattenlaufwerke und Netzteile) aus dem Produkt auf unter 32 kg reduziert werden, da in diesem Fall nur zwei Personen erforderlich sind, um das Produkt zu versetzen, zu installieren oder zu transportieren. (6) Abbildung 19. Gewichtsaufkleber für 2104 Modell TS4 SCGER219 Achtung: Sind Daten auf den Laufwerken gespeichert, die Laufwerke vor dem Ausbau mit einem Aufkleber versehen. Die Laufwerke müssen beim Einbau wieder in die Laufwerkpositionen installiert werden, aus denen sie beim Ausbau entfernt wurden. Andernfalls kann es zu einem Datenverlust kommen. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) 32 - 55 Kg (70.5 - 121.2 lbs) SCGER917 272 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER918 Achtung: Durch das Symbol rechts von "DC", direkt über dem Schalter SW1, wird eine Bereitschaftsbedingung angegeben. Wird der Schalter SW1 nach rechts direkt unter das Bereitschaftssymbol gelegt, wird dieWechselstromversorgung der Einheit nicht ausgeschaltet. (7) Achtung: Der Stabilisator muss ordnungsgemäß unten an der Vorderseite des Racks angebracht werden, um ein Vorkippen beim Ausbau des 2104 Modells DS4 aus dem Rack zu verhindern. Keine Einheiten installieren oder entfernen, wenn kein Stabilisator am Rack montiert ist. (8) Achtung: Der Stabilisator muss ordnungsgemäß unten an der Vorderseite des Racks angebracht werden, um ein Vorkippen beim Einbau des 2104 Modells DS4 in das Rack zu verhindern. Keine Einheiten installieren oder entfernen, wenn kein Stabilisator am Rack montiert ist. (9) Abbildung 20. Position einer Bereitschaftsbedingung SCGER328 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 273 SCGER919 Achtung: Keine Hände oderWerkzeuge in den freien Bereich einführen, in dem sich die Ventilatorbaugruppe befand. (10) Achtung: Keine Hände oderWerkzeuge in den freien Bereich einführen, in dem sich die aus Ventilator und Netzteil bestehende Baugruppe befand. (11) Achtung: Keine Hände oderWerkzeuge in den freien Bereich einführen, in dem sich die SCSI-Schnittstellenkarten-Baugruppe befand. (12) Achtung: Netzkabel und Netzstecker dieses Gerätes entsprechen den einschlägigen Sicherheitsbestimmungen. Dieses Gerät nur an eine Schutzkontaktsteckdose mit ordnungsgemäßer Schutzleiterverbindung anschließen. (13) Achtung: Die Netzsteckdose und die Abdeckplatte der Netzsteckdose nicht mit etwas anderem als den Prüfspitzen berühren, bevor diese Sicherheitsüberprüfung beendet ist. (14) Achtung: Wird nicht 'Unendlich' gemessen, nicht weiterarbeiten. Die entsprechenden Korrekturen an der Verkabelung vornehmen, bevor weitergearbeitet wird. Netzstromkreis erst einschalten, wenn alle genannten Schritte richtig ausgeführt wurden. (15) Achtung: Nicht die Griffe des Ventilators oder der aus Ventilator und Netzteil bestehenden Baugruppen benutzen, um das 2104 Modell DS4 zu tragen. Diese Griffe sind nicht dafür vorgesehen, das Gewicht der Einheit zu tragen. (16) 274 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCGER920 Achtung: Wird die Baugruppe vollständig in Ausgangsposition gebracht, bewegt sich der Hebel automatisch zur geschlossenen Position hin. Darauf achten, dass die Finger nicht zwischen dem Hebel und der Baugruppe eingeklemmt werden. (17) Achtung: Das Hauptnetzkabel muss von der fehlerhaften, aus Ventilator und Netzteil bestehenden Baugruppe entfernt werden, bevor weitergearbeitet wird. (18) Achtung: Keine Hände oderWerkzeuge in den freien Bereich über der aus Ventilator und Netzteil bestehenden Baugruppe einführen. (19) Achtung: Keine Hände oderWerkzeuge in den freien Bereich zwischen den Netzteilbaugruppen einführen. (20) Achtung: Wird das 2104 Modell DS4 in ein Rack-Modell T00 oder T42 installiert, müssen zudem die Sicherheitshinweise für diese Racks beachtet werden, bevor mit dem Ausbau des Rahmens begonnen wird. Die Sicherheitshinweise befinden sich in dem Abschnitt über die Systeminstallation im , IBM Form SA38-0577, oder auf der folgendenWeb-Site: www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html (21) Achtung: Die 2104 Modelle DS4 und TS4 sind für die Installation durch Kunden konzipiert und zertifiziert. Achten Sie darauf, dass das System oder Rack, in das das 2104 Modell DS4 oder TS4 eingebaut wird, ebenfalls für die Installation durch den Kunden konzipiert und zertifiziert ist. Ist dies nicht der Fall, muss das 2104 Modell DS4 oder TS4 vom Kundendienst installiert werden. (22) 7014 Model T00 and T42 Rack Installation and Service Guide Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 275 SCGER921 Achtung: Keine Hände oderWerkzeuge in den Bereich einführen, in dem sich die Kartenbaugruppe befand. (23) Achtung: Zum Anheben des 2104 Modells TS4 sind drei Personen erforderlich. Nicht versuchen,das 2104 Modell TS4 alleine anzuheben. Zum Anheben sind zwei weitere Personen erforderlich. (24) Achtung: Nicht die Griffe des Ventilators oder der aus Ventilator und Netzteil bestehenden Baugruppen benutzen, um das 2104 Modell TS4 zu tragen. Diese Griffe sind nicht dafür vorgesehen, das Gewicht der Einheit zu tragen. (25) Das 2104 Modell DS4 ist für die Installation durch Kunden konzipiert und zertifiziert. Achten Sie darauf, dass das System oder Rack, in das das 2104 Modell DS4 eingebaut wird, ebenfalls für die Installation durch den Kunden konzipiert und zertifiziert ist. Ist dies nicht der Fall, muss das 2104 Modell DS4 vom Kundendienst installiert werden. (26) Achtung: Achtung: Das 2104 Modell TS4 ist für die Installation durch Kunden konzipiert und zertifiziert. Achten Sie darauf, dass das System, in das das 2104 Modell TS4 eingebaut wird, ebenfalls für die Installation durch den Kunden konzipiert und zertifiziert ist. Ist dies nicht der Fall, muss das 2104 Modell TS4 vom Kundendienst installiert werden. (27) 276 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Greek Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 277 278 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 279 280 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 281 282 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 283 284 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 285 286 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 287 288 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 289 290 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 291 292 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 293 294 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 295 296 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 297 298 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 299 Hebrew 300 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 301 302 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 303 304 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 305 306 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 307 308 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 309 310 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 311 312 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 313 314 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 315 316 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 317 318 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 319 320 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 321 322 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Hungarian Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 323 324 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 325 326 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 327 328 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 329 330 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 331 332 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 333 334 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 335 336 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 337 338 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCHUN916 Óvatosságra intõ figyelmeztetések VIGYÁZAT: Ennek az egységnek lehet, hogy két hálózati csatlakozóvezetéke van. Áramtalanításkor húzza ki mindkét hálózati csatlakozóvezetéket! (1) VIGYÁZAT: Ne vegye le a burkolatot, és ne kísérelje meg a javítást! Nem tartalmaz javítható alkatrészt. (2) VIGYÁZAT: Kétpólusú nullavezetõs biztosítás. (3) 15. ábra Hálózati csatlakozóvezeték figyelmeztetõ címke 16. ábra Tápegységburkolat figyelmeztetõ címke 17. ábra Biztosító figyelmeztetõ címkéje SC000327 SC000326 SC000325 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 339 340 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 341 342 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 343 344 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 345 Japanese 346 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 347 348 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 349 350 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 351 352 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 353 354 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 355 356 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 357 358 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 359 360 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 361 362 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 363 364 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 365 366 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 367 368 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Korean Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 369 370 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 371 372 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 373 374 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 375 376 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 377 378 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 379 380 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 381 382 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 383 384 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 385 386 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 387 388 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 389 390 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 391 Lithuanian Saugos ir aplinkosaugos informacija Saugos áspëjimai ir jø vertimas SCLIT901 Ðiame skyriuje pateikiama tokia informacija: Saugos áspëjimai ir jø vertimas Ðio produkto saugos patikrinimo instrukcijos Aplinkosaugos informacija ir nuostatos. Ðiame dokumente pateikiami saugos patarimai. Áspëjimas informuoja jus apie sàlygas arba veiksmus, kurie gali bûti mirtinai pavojingi arba dël kuriø galite rimtai susiþaloti. Áspëjimas informuoja jus apie sàlygas arba veiksmus, dël kuriø galite susiþeisti, bet ne mirtinai ir ne per daug pavojingai. Praneðimas informuoja apie sàlygas ir veiksmus, dël kuriø gali sugesti maðinos áranga, neveikti programa bei gali bûti paþeisti duomenys. Praneðimai yra pateikiami eilës tvarka pagal jø ID, kurie rodomi skliausteliuose kiekvieno praneðimo pabaigoje. Perþiûrëkite toliau pateikiamus pavojaus ir perspëjimo praneðimus, susietus su tam tikru ID numeriu. Pavojus Áspëjimas Dëmesio PAVOJUS Dël neteisingai prijungto elektros tinklo lizdo metalinës sistemos dalys arba prie sistemos prijungti prietaisai gali bûti labai aukðtos átampos. Pirkëjas yra atsakingas ir turi uþtikrinti, kad tinklo lizdas yra teisingai ájungtas ir áþemintas, ir tokiu bûdu apsaugo nuo elektros ðoko. (1) ÁSPËJIMAS: Nekiðkite rankø arba árankiø á tuðèià ertmæventiliatoriø rinkinyje. (1) 392 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Elektros krûvio paðalinimas Saugos tikrinimo veiksmai Korpuso patikrinimas SCLIT902 Dëmesio Kai lieèiate pakeièiamus elementus ir kitas kompiuterio dalis, imkitës ðiø priemoniø, kad iðvengtumëte paþeidimo statiniu krûviu: Apribokite savo judesius. Dël judesiø aplink jus gali susikaupti statinis krûvis. Visuomet bûkite atidûs liesdami kompiuterio dalis. Plokðtes laikykite uþ briaunø arba metalinës dangos. Niekada nelieskite neapsaugotø grandiniø. Neleiskite liesti kompiuterio daliø kitiems þmonëms nepasirûpinus áþeminimu. Prieð ádiegdami naujà dalá modeliams 2104 Model DS4, 2104 Model TS4 arba valdanèiajai sistemai, palieskite paketo apsauganèio nuo statinës iðkrovos nedaþytà metalinædalá ir laikykite palietæmaþiausiai 2 sekundes. Tai sumaþins pakuotës ir jûsø kûno statiná krûvá. Árenginio dalá iðimkite ið pakuotës ir, jei ámanoma, nepadëdami ámontuokite jà tiesiai á modelá 2104 DS4 arba TS4. Jei árenginio dalá turite kur nors padëti, atlikite ðiuos veiksmus: 1. Nuo statiðkumo apsauganèià pakuotæ, kurioje buvo árenginio dalis, padëkite ant lygaus ir horizontalaus pavirðiaus. 2. Padëkite detalæant nuo statinio krûvio apsauganèios pakuotës. Árenginio dalies negalima dëti tiesiai ant metalinio pavirðiaus. Saugos tikrinimo procedûrose nurodoma, kaip tikrinti modelá 2104 DS4 arba Ts4. Informacijos apie korpuso saugos patikrinimà ieðkokite korpuso montavimo ir diegimo vadove. ÁSPËJIMAS: Modeliai 2104 DS4 ir TS4 yra sukurti taip, kad juos ádiegti galëtø vartotojas, ir yra sertifikuoti kaip vartotojo sàranka. Ásitikinkite, kad sistema arba korpusas, á kurá bus diegiamas modelis 2104 DS4 arba TS4, yra pritaikytas ir sertifikuotas vartotojo sàrankai. Jeigu ne, tuomet modelá 2104 DS4 arba TS4 turi ádiegti techninës prieþiûros specialistai. (22) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 393 Modelio 2104 DS4 arba TS4 patikrinimas Iðoriniai árenginio patikrinimai Vidiniai árenginio patikrinimai SCLIT903 Atlikite ðiuos vidinius árenginio patikrinimus: 1. Patikrinkite, ar nebuvo kokiø nors, ne ámonës IBM atliktø pakeitimø. Jei tokiø pakeitimø radote, ið IBM filialo bûstinës paimkite blankà "Non-IBM Alteration Attachment Survey", Nr. R009. Jà uþpildægràþinkite á filialo bûstinæ. 2. Patikrinkite, ar árenginio viduje nëra: Metalo arba kitokiø neðvarumø Vandens arba kito skysèio pëdsakø Degimo poþymiø Dûmø 3. Patikrinkite, ar nëra matomø mechaniniø nesklandumø, pvz., atsilaisvinusiø komponentø. 4. Patikrinkite, ar matomi laidai ir jungtys nesusidëvëjæ, nesuskilæ, nesuspausti. Atlikite ðiuos iðorinius árenginio patikrinimus: 1. Patikrinkite, ar korpusas nepaþeistas (sulauþytas, nesutvirtintas arba aðtriomis briaunomis). 2. Patikrinkite el. maitinimo laidus ir ásitikinkite, kad izoliacija nesusidëvëjusi ir nepaþeista. 3. Patikrinkite, ar nëra kokiø nors matomø neáprastø pakitimø. Bûkite labai atidûs vertindami ðiø pakitimø saugà. 4. Patikrinkite, ar yra visi iðoriniai dangalai, ir, ar jie nesugadinti. 5. Ásitikinkite, kad visi sklàsèiai ir vyriai yra tinkamoje eksploatavimui bûklëje. 6. Patikrinkite, ar nepaþeistas el. maitinimo laidas. 7. Patikrinkite, ar laidai nesusidëvëjæ, nepaþeisti arba neprispausti. 8. Patikrinkite ventiliatoriaus ir el. maitinimo tiekimo árenginius. Patikrinkite, ar el. maitinimo tiekimo árenginio dangèio sàvarþos (varþtai arba kniedës) neiðimti arba neatsilaisvinæ. 9. Patikrinkite, ar nepaþeistas iðorinis ryðio laidas. 10. Patikrinkite, ar dangtis neturi aðtriø briaunø, nesugadintas, ar nëra pakitimø, kurie atidengia vidines árenginio dalis. 11. Paðalinkite rastus nesklandumus. Norëdami nustatyti, kokios sàlygos yra nesaugios, atlikite ðiuos saugos patikrinimus. 394 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Saugos korteliø patikrinimai SCLIT904 1. Patikrinkite, ar kortelë pritvirtinta ant modelio 2104 DS4 arba TS4. Þr. 1 paveikslëlá. 2. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 el. maitinimo ðaltinio pritvirtinta kortelë su áspëjimu apie laidà. Þr. 2 paveikslëlá. SCLIT274 1 paveikslëlis. Kortelë, skirta modeliui 2104 DS4 arba Ts4 2 paveikslëlis. Áspëjimo apie laidà kortelë. SCLIT325 ÁSPËJIMAS: Ðiame árenginyje gali bûti du laidai. Norëdami nutraukti el. maitinimà, atjunkite abu laidus. (1) Atlikite saugos korteliø patikrinimà: Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 395 SCLIT905 3. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 el. maitinimo ðaltinio danèio yra privirtinta kortelë su áspëjimu. Þr. 3 paveikslëlá. 4. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 el. maitinimo ðaltinio yra pritvirtinta kortelë su áspëjimu apie saugiklá. Þr. 4 paveikslëlá. 3 paveikslëlis. El. maitinimo ðaltinio dangèio áspëjimo kortelë 4 paveikslëlis. Lentelë su áspëjimu apie saugiklá SCLIT326 SCLIT327 ÁSPËJIMAS: Nenuimkite dangèio, nieko nekeiskite, nëra keièiamø daliø. (2) ÁSPËJIMAS: Du poliai/ Neutralë su saugikliu. (3) 396 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT905A 5. Patikrinkite, ar ant modelio 2104 DS4 yra svorio kortelë. Þr. 5 paveikslëlá. ÁSPËJIMAS: Ðis árengimas sveria daugiau nei 32 kg (70,5 svaro). Skaitykite vartotojo instrukcijose. (4) 5 paveikslëlis. Modelio 2104 DS4 svorio kortelë SCLIT220 Refer to manuals 32 Kg (70.5 lbs) Dëmesio: Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø etiketes. Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio já iðëmëte. Antraip prarasite duomenis. > 18 kg (37 svaro) > 32 kg (70,5 svaro) PERSPËJIMAS: Jei modelyje 2104 DS4 yra visi 14 diskø árenginiø ir du el.maitinimo ðaltiniai, visas jo svoris virðija 32 kg (70.5 svaro). Ið modelio 2104 DS4 reikia paðalinti maþiausiai ðeðis diskus, jei norite já pakelti arba já ámontuoti á korpusà arba 2104 modelá TS4. Bendras svoris bus maþesnis nei 32 kg (70,5 svaro) ir modelá 2104 DS4 du þmonës galës saugiai paimti. Antraip galima susiþeisti. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 397 ÁSPËJIMAS: Ðis árenginys sveria nuo 32 iki 55 kg (70.5 - 121.2 svarai). (5) 6 paveikslëlis. Modelio 2104 TS4 svorio etiketë. 32 - 55 Kg (70.5 - 121.2 lbs) SCLIT219 6. Patikrinkite, ar ant modelio 2104 TS4 pritvirtinta svorio etiketë. Þr. 6 paveikslëlá. DËMESIO: Pridëto dëþës pakëlimo árankio reikia perstumiant, diegiant ir perkeliant árenginá, kai jis visai uþpildytas. Norint árenginá perkelti saugiai, reikia trijø þmoniø. Antraip galite susiþeisti. Jei këlimo priemonës neturite, árenginio svorá privalote sumaþinti iki 32 kg arba daugiau nuimdami visas sunkias dalis (diskus ir el. maitinimo ðaltinius). Tuomet reikës tik dviejø þmoniø árenginiui pastumti, ádiegti arba perkelti. (6) Dëmesio: Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø etiketes. Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio já iðëmëte. Antraip prarasite duomenis. > 18 kg (37 svarai) > 32 kg (70.5 svaro) SCLIT906 398 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT907 7. Patikrinkite, ar ant modelio 2104 DS4 arba TS4 pritvirtinta budëjimo reþimo etiketë. Þr. 7 paveikslëlá. ÁSPËJIMAS: Budëjimo reþimas paþymëtas simboliu, esanèiu deðinëje tiesioginës srovës pusëje, tiesiai virð jungiklio SW1. Kai SW1 perjungiate á deðinæpozicijà tiesiai po budëjimo reþimo simboliu, árenginio kintamos srovës el. Maitinimas neiðjungiamas. (7) 7 paveikslëlis. Budëjimo reþimo etiketë SCLIT326 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 399 SCLIT908 This section contains instructions for checking the grounding of the 2104 Model DS4 or Model TS4. Modelio 2104 DS4 áþeminimo patikrinimas Norëdami patikrinti modelio 2104 DS4 áþeminimà atlikite ðiuos veiksmus: 3. Kai kurios el. grandinës gali bûti paþeistos, jei iðoriniai SCSI kabeliai prijungti prie modelio DS4 tuo metu, kai atliekamas áþeminimo patikrinimas. Ásitikinkite, kad iðoriniai SCSI kabeliai neájungti á jungtis . 4. Patikrinkite modelio DS4 áþeminimà: ø akumuliatoriø árenginio antrajame þingsnyje, kiðtukai ir treèiojo laido kontaktas yra geros bûklës à ommetro kontaktà prie 2104 modelio DS4 korpuso iðvado ir kità prie treèiojo laido áþeminimo kontakto. vienà jungiamàjá laidà prie 2104 modelio DS4 korpuso ir kità jungiamàjá laidà prie treèiojo laido áþeminimo kontakto, esanèio prieð maitinimo jungtis, kurias atjungëte antrajame þingsnyje. Ommetro parodymai turi bûti 0,1 omo ar maþiau tarp áþeminimo kontakto, esanèio maitinimo laido jungtyje, ir 2104 modelio DS4 korpuso. Dëmesio: a. Apþiûrëkite ir ásitikinkite, kad maitinimo jungèiø, kurias iðjungëte ið maitimo tiekimo ar atsargini . 2104 2104 b. Pajunkite vien Prijunkite ommetro 8 paveikslëlis. El. maitinimo kabeliai ir SCSI jungtys modelyje 2104 DS4 Modelio 2104 DS4 arba TS4 áeminimo patikrinimas 1. Ásitikinkite, kad visi el. maitinimo ðaltiniai atjungti nuo korpuso. Þr. montavimo diegimo dokumente, skirtame korpusui arba valdanèiajai sistemai. 2. Ásitikinkite, kad el. maitinimo kabeliai yra ájungti á kiekvienà ventiliatoriaus ir el. maitinimo ðaltinio deriná. Þr. 8 paveikslëlá. Taip pat patikrinkite, ar el. maitinimo kabeliø galai atjungti nuo el. maitinimo paskirstymo arba atsarginio akumuliatoriø árenginio. Þr. montavimo diegimo dokumente, skirtame korpusui arba valdanèiajai sistemai. 2 1 SCLIT210 400 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT909 Jei dirbate su 220 voltø 2104 modeliu DS4, rinkitës 5 þingsná. Jei dirbate su -48 voltø 2104 modeliu DS4, atlikite 7 þingsná. 5. Patikrinkite modelio DS4 korpuso ir kiekvieno ventiliatoriaus el. maitinimo ðaltinio árenginio el. maitinimo jungties áþeminimo kontakto tolydumà. Þr. 9 paveikslëlá. 2104 6. Pereikite prie 9 þingsnio. 7. Patikrinkite modelio DS4 korpuso ir kiekvieno pagrindinio el. maitinimo ðaltinio 1 ir 7 áþeminimo kontaktø tolydumà. Þr. 10 paveikslëlá. 2104 8. Pereikite prie 9 þingsnio. 9. Jei kuris nors ventiliatoriaus el. tiekimo árenginys netolydus, pakeiskite já nauju. Þr. skyriø, kuriame raðoma apie ventiliatoriaus ir el. maitinimo tiekimo árenginius, esantá vadove . Dar kartà iðsamiai patikrinkite áþeminimà. Jei kiekvienas ventiliatoriaus el. maitinimo ðaltinio árenginys tolydus, gali bûti, kad nesklandumø iðkyla dël el. maitinimo kabelio arba valdanèiosios sistemos áþeminimo. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide 9 paveikslëlis. Áþeminimo kontaktas (220 V 2104 modelis DS4 ) 10 paveikslëlis. Áþeminimo kontaktai (48 V modelis 2104 DS4) 1 SCLIT329 Jei áþeminimas tinkamas, áþeminimo patikrinimas uþbaigtas. Jei áþeminimas netinkamas, iðjunkite 2104 DS4 maitinimo kabelius ið ventiliatoriaus - el. maitinimo ðaltinio árenginio. modelio SCLIT330 6 kontaktas 1 kontaktas 12 kontaktas 7 kontaktas Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 401 SCLIT910 Jei dirbate su 220 voltø modeliu DS4, pereikite prie 10. Jei dirbate su 48 voltø modeliu DS4, norëdami paðalinti gedimà þr. korpuso arba valdanèiosios sistemos diegimo dokumentà. Dar kartà iðsamiai patikrinkite áþeminimà. 2104 2104 10. Patikrinkite, ar tolydus el. maitinimo kabelis. Jei kabelis netolydus, pakeiskite já nauju, tuomet dar kartà iðsamiai patikrinkite áþeminimà. Jei el. maitinimo kabelis netolydus, skaitykite korpuso arba valdanèiosios sistemos diegimo ir montavimo instrukcijas ir paðalinkite gedimà. Norëdami patikrinti 2104 modelio TS4 áþeminimà, atlikite ðiuos veiksmus: 2. Patikrinkite, ar el. maitinimo kabelio galai neájungti á el. maitinimo lizdus. Jei reikia, kabelius atjunkite. Modelio 2104 TS4 áþeminimo patikrinimas 1. Patikrinkite, ar el. maitinimo kabelis ájungtas á kiekvienà el. maitinimo lizdà . Þr. 11 paveikslëlá. 1 11 paveikslëlis. El. maitinimo kabeliai ir SCSI jungtys (2104 modelis Ts4) SCLIT230 1 2 3 402 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT911 3. Kai kurios el. grandinës gali bûti paþeistos, jei iðoriniai SCSI kabeliai jungiami prie modelio TS4 tuo metu, kai atliekamas áþeminimo patikrinimas. Ásitikinkite, kad nëra jokiø iðoriniø SCSI kabeliø, ájungtø á jungtis. . Þr. 11 paveikslëlá. 4. Patikrinkite modelio TS4 korpuso ir kiekvieno pagrindinio el. Maitinimo ðaltinio áþeminimo kontaktø tolydumà. a. Patikrinkite el. maitinimo kabelius ir paþiûrëkite, ar treèiojo laido áþeminimo jungtis yra geros bûklës. b. Norëdami patikrinti, ar treèiojo laido áþeminimo tolydumas tarp iðorinio áþeminimo kontakto pagrindo ir korpuso pagrindo yra 0,1 omø arba maþiau, naudokite matavimo prietaisà. 5. Jei áþeminimas tinkamas, jo patikrinimas baigtas. 6. Jei áþeminimas netinkamas, iðjunkite modelio DS4 el. maitinimo kabelius ið ventiliatoriaus el. maitinimo ðaltinio árenginio. a. Patikrinkite, ar el. maitinimo kabelis tolydus. b. Jei kuris nors el. maitinimo kabelis yra paþeistas, pakeiskite nauju. c. Patikrinkite modelio TS4 korpuso ir kiekvieno ventiliatoriaus el. maitinimo ðaltinio árenginio el. maitinimo jungties áþeminimo kontakto tolydumà. Þr. 12 paveikslëlá. Dëmesio: 2104 2104 2104 2104 d Jei kuris nors ventiliatoriaus el. maitinimo ðaltinio árenginys netolydus, pakeiskite já nauju. (Þr. vadovo dalá apie ventiliatoriaus ir el. maitinimo ðaltiniø árengimus.) Pasirinkite " odelio TS4 áþeminimo patikrinimas" ir atlikite 1-6 veiksmus, kad dar kartà iðsamiai patikrintumëte áþeminimà. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide 2104 m 12 paveikslëlis. Áþeminimo kontaktas (2104 modelis Ts4) 1 2 3 SCLIT331 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 403 SCLIT912 13 paveikslëlis. El. maitinimo ðaltinio atjungimas nuo 2104 modelio DS4 Neatjunkite srovës ið valdanèiosios sistemos arba modelio DS4, jei tam nëra konkreèiø prieþasèiø, nebent taip nurodo intrukcijos, kuriomis vadovaujatës. Atlikite ðiuos veiksmus ir atjunkite el. srovës maitinimà nuo modelio DS4: 1. Pasiteiraukite kliento, ar visos operacijos, vykdomos tarp modelio DS4 ir valdanèiosios sistemos, yra sustabdytos. 2. Kiekvieno ventiliatoriaus el. maitinimo árenginio tiesioginës srovës jungiklius ájungta/ budëjimas perjunkite á budëjimo reþimà. Þr. 13 paveikslëlá. a. Kai kuriuose modeliuose DS4 yra ventiliatoriaus el. maitinimo árenginys ir ventiliatoriø árenginys. Ventiliatoriø árenginyje nëra tiesioginës srovës jungiklio ájungta/ budëjimas. b. Ventiliatoriaus - el. maitinimo árenginyje CHK lemputë gali ðviesti, nors jo tiesioginës srovës jungiklis ájungta/ budëjimas yra budëjimo reþime. Pastaba: Pastabos: 2104 2104 2104 2104 El. srovës atjungimas nuo 2104 modelio DS4 arba TS4 El. srovës atjungimas nuo 2104 modelio DS4 SCLIT206 404 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT913 3. PAVOJUS Ðiais kabeliais teka srovë, jei korpuso el. maitinimo paskirstymo árenginys arba nepertraukiamos srovës árenginys yra dar ájungti. (1) 1 1 2 2 2104 2104 2104 2104 2104 Pastaba: Pastabos: Neatjunkite el. maitinimo ðaltinio nuo valdanèiosios sistemos arba modelio TS4, jei tam nëra konkreèiø prieþasèiø, nebent taip nurodo instrukcijos, kuriomis vadovaujatës. Atlikite ðiuos veiksmus ir atjunkite el. srovës maitinimà nuo modelio TS4: 1. Pasiteiraukite kliento, ar visos operacijos, vykdomos tarp modelio TS4 ir valdanèiosios sistemos, yra sustabdytos. 2. Kiekvieno ventiliatoriaus el. maitinimo árenginio tiesioginës srovës jungiklius ájungta/ budëjimas, esanèius modelio TS4 galinëje dalyje, perjunkite á budëjimo reþimà. Þr. 14 paveikslëlá. a. Kai kuriuose modeliuose DS4 yra ventiliatoriaus el. maitinimo árenginys ir ventiliatoriø árenginys. Ventiliatoriø árenginyje nëra tiesioginës srovës jungiklio ájungta/ budëjimas. b. Ventiliatoriaus - el. maitinimo árenginyje CHK lemputë gali ðviesti, nors jo tiesioginës srovës mygtukas ájungta/ budëjimas yra budëjimo reþime. El. maitinimo ðaltinio atjungimas nuo modelio 2104 TS4 2104 m 2104 Maitinimo kabelius išjunkite iš 2104 modelio DS4 upakalin s dalies. r. 13 paveiksl . ë ëlá 4. odelio DS4 el. maitinimui ájungti atlikite ðiuos veiksmus: a. Vël ájunkite el. maitinimo kabelius . b. Kiekviename ventiliatoriaus el. maitinimo árenginyje ájunkite mygtukà ájungta/ budëjimas. Galite konfigûruoti modelio DS4 variklio ájungimo eigà. Jei norite suþinoti daugiau, skaitykite skyrius apie variklio automatinio ájungimo jungiklá ir uþdelsto ájungimo reþimo jungiklá, esanèius vadove . Pastaba: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 405 SCLIT914 1 2 2 3. Iðtraukite el. maitinimo kabelius ið el. srovës iðvadø ir ið galinës 2104 modelio 104 TS4 dalies. Þr. 14 paveikslëlá. 4. odelio TS4 el. maitinimui ájungti atlikite ðiuos veiksmus: a. Vël ájunkite el. maitinimo kabelius . b. Kiekviename ventiliatoriaus el. maitinimo árenginyje ájunkite mygtukà ájungta/ budëjimas. Galite konfigûruoti modelio TS4 variklio ájungimo eigà. Jei norite suþinoti daugiau, skaitykite skyrius apie variklio automatinio ájungimo jungiklá ir uþdelsto ájungimo reþimo jungiklá, esanèius vadove Pastaba: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide. 2104 m 2104 14 paveikslëlis. El. maitinimo ðaltinio atjungimas nuo 2104 modelio TS4 SCLIT207 406 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT915 Praneðimai apie pavojø PAVOJUS PAVOJUS PAVOJUS PAVOJUS PAVOJUS Kitu þingsniu atjungsite el. maitinimo kabelius. Ðiais kabeliais teka srovë, jei korpuso el. maitinimo paskirstymo árenginys arba nepertraukiamos srovës árenginys yra dar ájungti. (1) Nejunkite el. maitinimo kabelio á ventiliatoriaus el. maitinimo ðaltinio árenginá, kol árenginys nëra baigtas montuoti ir verþlës nëra visiðkai priverþtos. (3) Perkûnijos metu nereikia atjungti kabeliø nuo ekranø, spausdintuvø, telefonø arba ryðiø linijø apsaugos stoèiø. (5) Dël neteisingai prijungto elektros srovës iðvado metalinës sistemos dalys arba prie sistemos prijungti prietaisai gali bûti labai aukðtos átampos. Pirkëjas yra atsakingas ir turi uþtikrinti, kad tinklo lizdas yra teisingai ájungtas ir áþemintas, ir tokiu bûdu apsaugo nuo elektros ðoko. Perkûnijos metu nereikia atjungti kabeliø nuo ekranø, spausdintuvø, telefonø arba ryðiø linijø apsaugos stoèiø. (4) Nebandykite atidaryti ventiliatoriaus el. maitinimo ðaltinio árenginio dangèiø. (2) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 407 SCLIT916 Áspëjamieji praneðimai ÁSPËJIMAS: Ðiame árenginyje gali bûti du laidai. Norëdami nutraukti el. maitinimà, atjunkite abu laidus. (1) ÁSPËJIMAS: Nenuimkite dangèio, netaisykite, nëra keièiamø daliø. (2) ÁSPËJIMAS: Du poliai/ Neutralë su saugikliu. (3) 15 paveikslëlis. Áspëjimo apie laidà kortelë 16 paveikslëlis. El. maitinimo ðaltinio dangèio áspëjimo kortelë 17 paveikslëlis. Lentelë su áspëjimu apie saugiklá SCLIT327 SCLIT326 SCLIT325 408 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT916A ÁSPËJIMAS: Ðis árenginys sveria daugiau nei 32 kg (70.5 svaro). Skaitykite vartotojo instrukcijose. (4) 18 paveikslëlis. 2104 modelio DS4 svorio kortelë SCLIT220 Refer to manuals 32 Kg (70.5 lbs) Dëmesio: Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø etiketes. Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio já iðëmëte. Antraip prarasite duomenis. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) PERSPËJIMAS:Jei modelyje 2104 DS4 yra visi 14 diskø árenginiø ir du el. maitinimo ðaltiniai, visas jo svoris virðija 32 Kg (70.5 svaro).Ið modelio 2104 DS4 reikia paðalinti maþiausiai ðeðis diskus, jei norite já pakelti arba já ámontuoti á korpusà arba 2104 modelá TS4. Bendras svoris bus maþesnis ne 32 Kg (70.5 lbs) ir modelá 2104 DS4 du þmonës galës saugiai paimti. Antraip galima susiþeisti. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 409 ÁSPËJIMAS: Ðis árenginys sveria nuo 32 iki 55 kg (70.5 - 121.2 svarai). (5) ÁSPËJIMAS: Pridëto dëþës pakëlimo árankio reikia perstumiant, diegiant ir perkeliant árenginá, kai jis visai uþpildytas. Norint árenginá perkelti saugiai, reikia trijø þmoniø. Antraip galite susiþeisti. Jei këlimo priemonës neturite, árenginio svorá privalote sumaþinti iki 32 kg arba maþesnio nuimdami visas sunkias dalis (diskus ir el. maitinimo ðaltinius). Tuomet reikës tik dviejø þmoniø árenginiui pastumti, ádiegti arba perkelti. (6) 19 paveikslëlis. 2104 modelio TS4 svorio kortelë SCLIT219 Dëmesio: Jei diskuose saugote duomenis, prieð diskus iðimdami, uþdëkite ant jø etiketes. Kai diskus keièiate, kiekvienà ið jø ádëkite á tà patá disko dëklà, ið kurio já iðëmëte. Antraip prarasite duomenis. > 18 kg (37 svarai) > 32 kg (70.5 svaro) 32 - 55 Kg (70.5 - 121.2 lbs) SCLIT917 410 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT918 Budëjimo reþimas paþymëtas simboliu, esanèiu deðinëje tiesioginës srovës pusëje, tiesiai virð jungiklio SW1. Kai SW1 pakeièiate á deðinæpozicijà tiesiai po budëjimo reþimo simboliu, árenginio kintamosios srovës el. maitinimas neiðjungiamas. (7) ÁSPËJIMAS: Stabilizatorius turi bûti teisingai prijungtas prie apatinës priekinës korpuso dalies, kad apsaugotø korpusà nuo pasvirimo á prieká tuo metu, kai modelis DS4 yra iðimamas ið korpuso. Neiðimkite ir nemontuokite jokiø árenginiø, jei stabilizatorius nëra pritaisytas prie korpuso. (8) Stabilizatorius turi bûti teisingai prijungtas prie apatinës priekinës korpuso dalies, kad apsaugotø korpusà nuo pasvirimo á prieká tuo metu, kai modelis DS4 yra iðimamas ið korpuso. Neiðimkite ir nemontuokite jokiø árenginiø, jei stabilizatorius nëra pritaisytas prie korpuso. (9) 2104 ÁSPËJIMAS: 2104 20 paveikslëlis. Budëjimo reþimo pozicija SCLIT328 ÁSPËJIMAS: Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 411 SCLIT919 DËMESIO: Nekiðkite rankø arba árankiø á ertmæventiliatoriø rinkinyje. (10) DËMESIO: Nekiðkite rankø arba árankiø á ertmæventiliatoriø rinkinyje. (11) DËMESIO: Nekiðkite rankø arba árankiø á ertmæ, esanèià SCSI sàsajø kortelës árenginyje. (12) DËMESIO: Dël vartotojø saugumo ðiame produkte yra 3-laidþiai el. maitinimo kabeliai ir kiðtukai. Naudokite ðá el. maitinimo kabelá kartu su teisingai áþemintu elektros iðvadu, kad iðvengtumëte elektros ðoko. (13) DËMESIO: Nelieskite el. maitinimo iðvadø arba el. maitinimo iðvadø skydeliø ne su tikrinimo liestukais, kol nebaigsite ðio saugos patikrinimo. (14) DËMESIO: Jei matavimo parodymas nëra begalybë, negalima nieko daryti. Atlikite reikiamus elektros laidø sistemos pataisymus prieð tæsdami. Neájunkite atðakinës grandinës pertraukiklio, kol tinkamai neatliksite ankstesniø þingsniø. (15) DËMESIO: Nenaudokite ventiliatoriaus arba ventiliatoriaus el. maitinimo árenginio rankenø neðdami modelá DS4. Ðios rankenos gali neiðlaikyti árenginio svorio. (16) 2104 412 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCLIT920 ÁSPËJIMAS: Kai árengys paruoðtas, svirtis automatiðkai pajuda link savo uþdaros padëties. Pasisaugokite, kad jûsø pirðtai nebûtø suspausti tarp svirties ir árenginio. (17) Prieð tæsdami toliau, patikrinkite, ar pagrindinio el. maitinimo ðaltinio kabelis buvo iðjungtas ið sugedusio ventiliatoriaus el. maitinimo. (18) Nekiðkite rankø arba árankiø á tuðèià ertmævirð ventiliatoriaus el. maitinimo ðaltinio árenginio. (19) Nekiðkite rankø arba árankiø á tuðèià ertmætarp el. maitinimo árenginiø. (20) Jei modelis DS4 yra ámontuotas modelio T00 arba T42 korpuse, prieð nuimdami karkasà turite atkreipti dëmesá á pastabas apie tø korpusø saugà. Saugos nurodymus galite rasti skyriuje apie sistemos diegimà vadove , SA38-0577 arba ðiame tinklapyje: www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html (21) odelis DS4 ir modelis TS4 yra sukurti taip, kad juos galëtø ámontuoti pirkëjas, ir yra sertifikuoti kaip vartotojo sàranka. Ásitikinkite, kad sistema arba korpusas, á kurá bus diegiamas modelis DS4 arba TS4, yra pritaikytas ir sertifikuotas vartotojo sàrankai. Jeigu ne, tuomet modelá DS4 arba TS4 turi ádiegti techninës prieþiûros specialistai. (22) 7014 Model T00 and T42 Rack Installation and Service Guide ÁSPËJIMAS: ÁSPËJIMAS: ÁSPËJIMAS: ÁSPËJIMAS: 2104 ÁSPËJIMAS: 2104 m 2104 2104 2104 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 413 SCLIT921 ÁSPËJIMAS: Nekiðkite rankø arba árankiø á ertmæ, esanèià kortø árenginyje. (23) odelio TS4 pakelti reikia trijø þmoniø. Nebandykite jo kelti vienas. Nebandykite jo kelti be kitø dviejø þmoniø pagalbos. (24) Nenaudokite ventiliatoriaus arba ventiliatoriaus el. maitinimo árenginio rankenø neðdami modelá TS4. Ðios rankenos gali neiðlaikyti árenginio svorio. (25) ÁSPËJIMAS: 2104 m ÁSPËJIMAS: 2104 odelis DS4 ir modelis TS4 yra sukurti taip, kad juos galëtø ámontuoti pirkëjas, ir yra sertifikuoti kaip vartotojo sàranka. Patikrinkite ar sistema arba korpusas, á kurá bus ádiegtas modelis 2104 DS4, yra taip pat sukurti ir sertifikuoti vartotojo sàrankai. Jeigu ne, tuomet modelá DS4 turi ádiegti techninës prieþiûros specialistai. (26) 2104 m 2104 2104 ÁSPËJIMAS: ÁSPËJIMAS: 2104 m 2104 2104 odelis TS4 yra sukurtas taip, kad já galëtø ámontuoti vartotojas, ir yra sertifikuotas kaip vartotojo sàranka. Patikrinkite ar sistema á kurià bus ádiegtas modelis TS4 yra taip pat sukurta ir sertifikuota vartotojo sàrankai. Jeigu ne, tuomet modelá TS4 turi ádiegti techninës prieþiûros specialistai. (27) 414 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Macedonia Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 415 416 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 417 418 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 419 420 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 421 422 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 423 424 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 425 426 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 427 428 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 429 430 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 431 432 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 433 434 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 435 436 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 437 Norwegian 438 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 439 440 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 441 442 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 443 444 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 445 446 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 447 448 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 449 450 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 451 452 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 453 454 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 455 456 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 457 458 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 459 460 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Polish Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 461 462 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 463 464 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 465 466 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 467 468 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 469 470 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 471 472 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 473 474 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 475 476 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 477 478 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 479 480 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 481 482 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 483 Portuguese Avisos de segurança e ambientais Avisos de segurança e respectivas traduções SC000901 Esta secção contém as seguintes informações: Avisos de segurança e respectivas traduções Procedimentos de inspecção de segurança para este produto Avisos e declarações ambientais Os avisos de segurança estão impressos ao longo deste documento. Um aviso de Perigo informa-o de condições ou procedimentos que podem resultar em morte ou ferimentos pessoais graves. Um aviso de informa-o de condições ou procedimentos que podem resultar em ferimentos pessoais que não são nem letais nem extremamente perigosos. Um aviso de informa-o de condições ou procedimentos que podem provocar danos em má quinas, equipamento, programas ou dados. Os avisos estão listados por ordem numérica com base nos seus IDs, que são apresentados entre parentesis, no fim de cada aviso. Consulte os seguintes exemplos de avisos de perigo e cuidado para a localização do número de ID. Cuidado Atenção PERIGO Uma tomada eléctrica que não esteja correctamente ligada pode passar corrente perigosa para as peças metá licas do sistema ou para os produtos que ligam ao sistema. É da responsabilidade do cliente assegurar que a tomada está correctamente ligada e com ligação à terra de modo a evitar um choque eléctrico. (1) CUIDADO: Não insira as mãos ou ferramentas na abertura do espaço vazio que continha o conjunto da ventoinha. (1) 484 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Descarga electroestá tica (ESD) Procedimentos de inspecção de segurança Inspeccionar o bastidor SC000902 Atenção Ao manusear unidades substituíveis localmente (FRUs) e outras peças de computador, tome estas precauções de modo a evitar danos devidos à está tica: Limite os movimentos. O movimento pode criar electricidade está tica à sua volta. Toque sempre com cuidado nas peças do computador. Segure nas placas pelas extremidades ou pela cobertura metá lica. Nunca toque em circuitos expostos. Evite que pessoas que não estejam correctamente protegidas com terra toquem em peças do computador. Antes de instalar uma nova peça, toque com a embalagem de protecção contra está tica que contém a peça numa parte metá lica não pintada do 2104 Modelo DS4, 2104 Modelo TS4, ou sistema central durante pelo menos 2 segundos. Isto reduz a electricidade está tica da embalagem e do corpo do utilizador. Remova a peça da embalagem e, se possível, instale-a directamente no 2104 Modelo DS4 ou Modelo TS4 sem a pousar. Se tiver de pousar a peça, siga estes passos: 1. Coloque a embalagem de protecção contra está tica que continha a peça numa superfície lisa e nivelada. 2. Coloque a peça numa embalagem de protecção contra está tica. Não coloque a peça directamente em cima de qualquer superfície metá lica. Os procedimentos de inspecção de segurança descrevem o modo de inspecção do 2104 Modelo DS4 ou Modelo TS4. Consulte o manual de instalação do bastidor para obter informações sobre como efectuar a inspecção no bastidor. CUIDADO: O 2104 Modelo DS4 e Modelo TS4 estão concebidos sendo para instalação pelo cliente e estão certificados como para instalação pelo cliente. Certifique-se que o sistema ou bastidor no qual o 2104 Modelo DS4 / Modelo TS4 será instalado também está concebido e certificado para configuração pelo cliente. Se não, o 2104 Modelo DS4 / Modelo TS4 terá então que ser instalado por um CE. (22) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 485 Inspeccionar o 2104 Modelo DS4 ou Modelo TS4 Verificações externas na má quina Verificações internas na má quina SC000903 Efectue as seguintes verificações internas na má quina: 1. Verifique se existem quaisquer alterações alheias à IBM que possam ter sido feitas na má quina. Se existir alguma, peça o formulá rio “Non-IBM Alteration Attachment Survey”, número R009, ao representante da IBM. Preencha o formulá rio e devolva-o ao representante. 2. Verifique o estado do interior da má quina e veja se não existem: Contaminantes metá licos ou outros Vestigios de á gua ou outro fluido Fogo Danos provocados por fumo 3. Verifique se existem problemas mecânicos óbvios, como componentes soltos. 4. Verifique se existem cabos e conectores expostos com desgaste, rachas ou trincados. Efectue as seguintes verificações externas na má quina: 1. Verifique danos no chassis (arestas soltas, partidas ou cortantes). 2. Verifique os cabos de alimentação e certifique-se que o isolamento não está gasto ou danificado. 3. Verifique quaisquer alterações óbvias fora do comum. Faça uma boa avaliação sobre a segurança de tais alterações. 4. Verifique que todas as tampas exteriores existem e não estão danificadas. 5. Certifique-se que todos os trincos e dobradiças estão no correcto estado de funcionamento. 6. Verifique se existem danos no cabo de alimentação. 7. Verifique se existem cabos gastos, danificados ou trincados. 8. Inspeccione os conjuntos da vetoínha e fonte de alimentação. Verifique se os grampos de fixação da cobertura da unidade da fonte de alimentação (parafusos ou rebites) não foram removidos ou alterados. 9. Verifique se existem danos no cabo de sinal externo. 10. Verifique se existem arestas vivas, danos ou alterações que exponham as peças internas do dispositivo. 11. Corrija quaisquer problemas que possa encontrar. Efectue as seguintes verificações de segurança para identificar condições inseguras. 486 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Verificações na etiqueta de segurança SC000904 Efectue as seguintes verificações na etiqueta de segurança: 1. Certifique-se de que a etiqueta está instalada no 2104 Modelo DS4 ou Modelo TS4. Consulte a Figura 1. 2. Certifique-se que a etiqueta de cuidado do cabo eléctrico está instalada na fonte de alimentação 2104 Modelo DS4 ou Modelo TS4. Consulte a Figura 2. SC000274 Figura 1. Etiqueta do 2104 Modelo DS4 ou Modelo TS4 Figura 2. Etiqueta de cuidado do cabo eléctrico SC000325 CUIDADO: Esta unidade pode ter dois cabos eléctricos. Para toda a alimentação, desligue ambos os cabos eléctricos. (1) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 487 SC000905 3. Verifique se a etiqueta de cuidado para colocar em coberturas se encontra na da fonte de alimentação do 2104 Modelo DS4 ou do Modelo Ts4. Consulte a Figura 3. 4. Verifique se a etiqueta de cuidado com fusíveis está colocada na fonte de alimentação do 2104 Modelo DS4 ou do Modelo TS4. Consulte a Figura 4. Figura 3. Etiqueta de cuidado com a cobertura da fonte de alimentação Figura 4. Etiqueta de cuidado com fusíveis SC000326 SC000327 CUIDADO: Não retire a cobertura, não tente reparações, as peças não necessitam de manutenção. (2) CUIDADO: Bipolaridade/Fusíveis Neutros. (3) 488 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000905A 5. Verifique se a etiqueta do peso está instalada no 2104 Modelo DS4. Ver Figura 5. CUIDADO: Este unidade pesa mais de 32 Kg (70.5 lbs). Consulte os manuais. (4) Figura 5. Etiqueta de peso do 2104 Modelo DS4 SC000220 Refer to manuals 32 Kg (70.5 lbs) Atenção: Caso tenha dados armazenados nas unidades, identifique as mesmas antes de as remover. Ao voltar a colocar as unidades, instale-as no compartimento de unidade de onde as removeu. De contrá rio, pode ter como resultado a perda de dados. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) CUIDADO: Se o 2104 Modelo DS4 estiver completamente carregado com 14 unidades e duas fontes de alimentação, o seu peso total excede os 32 Kg (70.5 lbs). Tem de remover pelo menos seis unidades do 2104 Modelo DS4 antes de o levantar ou instalar num bastidor ou no 2104 Modelo TS4. Assim, reduz o peso total para menos de 32 Kg (70.5 lbs) e o . Se não o fizer, corre o risco de provocar ferimentos. 2104 Modelo DS4 pode ser manuseado de forma segura por duas pessoas Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 489 CUIDADO: Esta unidade pesa entre 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5) Figura 6. 2104 Etiqueta de peso do Modelo TS4 32 - 55 Kg (70.5 - 121.2 lbs) SC000219 6. Verifique se a etiqueta de peso está instalada no 2104 Modelo TS4. Consulte a Figura 6. CUIDADO: A ferramenta de elevação de embalagens fornecida é necessá ria para mover, instalar e reposicionar o produto quando totalmente carregado. São necessá rias três pessoas para mover o produto em segurança. Uma falha neste procedimento pode provocar lesões. No caso da ferramenta de elevação não estar prontamente disponível, deve reduzir o peso para 32 Kg ou menos removendo todos os componentes pesados (unidades de disco e fontes de alimentação) do produto. Sendo então necessá rias apenas duas pessoas para mover, instalar e reposicionar o produto. (6) Atenção: Se tiver dados armazenados nas unidades, identifique as unidades com etiquetas antes de as remover. Quando substituir as unidades, instale cada uma destas no compartimento do qual as removeu. Uma falha neste processo pode resultar numa perda de dados. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) SC000906 490 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000907 7. Verifique se a etiqueta de condição de espera está instalada no 2104 Modelo DS4 ou Modelo TS4. Consulte a Figura 7. CUIDADO: Uma condição de espera é indicada pelo símbolo à direita de DC logo acima do interruptor, SW1. Quando muda o SW1 para a posição direita logo abaixo do símbolo de espera, a corrente-ca para a unidade não é cortada. (7) Figura 7. Etiqueta da condição de espera SC000326 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 491 SC000908 Esta secção contém instruções para verificar a ligação à terra do 2104 Modelo DS4 ou Modelo TS4. Verificar a ligação à terra do 2104 Modelo DS4 Execute os seguintes passos para verificar a ligação à terra do 2104 Modelo DS4: 3. Parte dos circuitos eléctricos pode ser danificada se os cabos SCSI externos estiverem ligados ao 2104 Modelo DS4 enquanto a verificação de ligação à terra estiver a ser efectuadas. Certifique-se de que não existem cabos externos SCSI ligados aos conectores . 4. Verifique a ligação à terra do 2104 Modelo DS4: unidade de distribuição de alimentação ou de baterias de reserva Atenção: a. Nas extremidades do conector das fichas de alimentação que desligou da no passo 2, efectue uma inspecção visual e certifique-se de que o pino de ligação à terra do terceiro fio está em boas condições. b. Utilize um ohmímetro e ligue um fio ao chassis d 2104 Modelo DS4 e o outro ao pino de ligação à terra do terceiro fio na face dos conectores de alimentação que desligou no passo 2. Deve ler 0.1 ohm ou Modelo DS4 menos entre o pino de ligação à terra do conector de alimentação e o chassis do 2104 . Figura 8. Cabos de alimentação e conectores SCSI no 2104 Modelo DS4 Verificar a ligação à terra do 2104 Modelo DS4 ou Modelo TS4 1. Certifique-se de que toda a alimentação é removida do bastidor. Consulte o documento de instalação do bastidor ou sistema central. 2. Certifique-se de que os cabos de alimentação estão ligados a cada conjunto de ventoínha e fonte de alimentação. Consulte a Figura 8. Certifique-se também de que as outras extremidades dos cabos de alimentação estão desligados da unidade de distribuição de alimentação ou de baterias de reserva do bastidor. Consulte o documento de instalação do bastidor ou sistema central. 2 1 SC000210 492 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000909 Se estiver a efectuar a manutenção num 2104 Modelo DS4 de 220-volt, avance para o passe 5. Se estiver a efectuar a manutenção num 2104 Modelo DS4 de -48 volts, avance para o passo 7. 5. Verifique a continuidade entre o chassis do 2104 Modelo DS4 e o pino da terra do conector de alimentação de cada conjunto de ventoínha e fonte de alimentação. Consulte a Figura 9. 6. Avance para o passo 9. 7. Verifique a continuidade entre o chassis do 2104 Modelo DS4 e os pinos de terra 1 e 7 de cada conector de alimentação principal. Consulte a Figura 10. 8. Avance para o passo 9. 9. Se algum dos conjuntos de ventoínha e fonte de alimentação não tiver continuidade, troque-o por um novo. Consulte a secção sobre conjuntos de ventoínha e fonte de alimentação em . Efectue de novo toda a verificação de ligação à terra. Se todos os conjuntos de ventoínha e fonte de alimentação tiverem continuidade, pode ter um problema no cabo de alimentação ou com a ligação à terra do sistema central. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Figura 9. Pino de ligação à terra (2104 Modelo DS4 de 220 V) Figura 10. Pinos de ligação à terra (2104 Model DS4 de -48 V) 1 SC000329 Se a ligação à terra estiver correcta, a verificação da mesma está completa. Se a ligação à terra não estiver correcta, desligue os cabos de alimentação dos conjuntos de ventoínha e fonte de alimentação do 2104 Modelo DS4. SC000330 Pino 6 Pino 1 Pino 12 Pino 7 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 493 SC000910 Se estiver a efectuar manutenção no 2104 Modelo DS4 de 220-volts, avance para o passo 10. Se estiver a efectuar manutenção no 2104 Modelo DS4 de -48 volts, consulte o documento de instalação do bastidor ou sistema central para isolar a falha. Efectue novamente toda a verificação de ligação à terra. 10. Verifique a continuidade do cabo de alimentação. Se o cabo de alimentação não tiver continuidade, troque-o por um novo, efectue então novamente toda a verificação de ligação à terra. Se o cabo de alimentação não tiver continuidade, consulte o documento de instalação do bastidor ou sistema central para isolar a falha. Efectue os seguintes passos para verificar a ligação à terra do 2104 Modelo TS4: 2. Certifique-se de que as outras extremidades dos cabos de alimentação não estão ligados a tomadas eléctricas. Desligue os cabos se necessá rio. Verificar a ligação à terra do 2104 Modelo TS4 1. Certifique-se de que têm um cabo de alimentação ligado a cada uma das tomadas de alimentação 1 . Consulte a Figura 11. Figura 11. Cabos de alimentação e conectores SCSI (2104 Modelo TS4) SC000230 1 2 3 494 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000911 3. : Alguns circuitos eléctricos podem ser danificados se os cabos SCSI externos estiverem ligados ao 2104 Modelo TS4 enquanto a verificação da ligação à terra estiver a ser efectuada. Certifique-se que não existem cabos SCSI externos ligados nos conectores SCSI . Consulte a Figura 11. 4. Verifique a continuidade entre o chassis do 2104 Modelo TS4 e o pino de ligação à terra de cada cabo de alimentação. a. Verifique os cabos de alimentação e certifique-se de que o conector de ligação à terra do terceiro fio está em perfeitas condições. b. Utilize um medidor para verificar que a continuidade da ligação à terra do terceiro fio é de 0.1 ohm ou inferior entre o pino externo de ligação à terra e a terra do chassis. 5. Se a continuidade for boa, a verificação de ligação à terra está completa. 6. Se a ligação à terra não estiver correcta, desligue os cabos de alimentação dos conectores de alimentação da linha principal. a. Verifique a continuidade de cada um dos cabos de alimentação. b. Se algum dos cabos de alimentação estiver a falhar, troque-o por um novo. c. Verifique a continuidade entre o chassis do 2104 Modelo TS4 e o pino de ligação à terra do conector de alimentação em cada conjunto de ventoínha e fonte de alimentação. Consulte a Figura 12. Atenção d Se o conjunto de ventoínha ou de fonte de alimentação não tiver continuidade, troque esse conjunto por um novo. (Consulte a secção sobre conjuntos de ventoínha e fonte de alimentação em Avance para “Verificar a ligação à terra do 2104 Modelo TS4” e execute os passos desde o 1 até ao 6d para completar de novo a verificação de ligação à terra. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide.) Figura 12. Pino de ligação à terra (2104 Modelo TS4) 1 2 3 SC000331 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 495 SC000912 Figura 13. Remover a alimentação de um 2104 Modelo DS4 A não ser que tenha uma razão especial para o fazer, não remova a alimentação do sistema central ou do 2104 Modelo DS4 se as instruções que estiver a seguir não o indicarem. Efectue os seguintes passos para remover a alimentação de um 2104 Modelo DS4: 1. Verifique junto do cliente que todas as operações entre o 2104 Modelo DS4 e o sistema central foram paradas. 2. Coloque o interruptor cc on/standby de cada conjunto de ventoínha e fonte de alimentação na posição standby. Consulte a Figura 13. a. Alguns 2104 Modelo DS4 têm um conjunto de ventoínha e fonte de alimentação e outro de ventoínha.O conjunto de ventoínha não tem interruptor cc on/standby. b. É possível que um conjunto de ventoínha e fonte de alimentação possa ter a luz CHK ligada embora o interruptor cc on/standby esteja na posição standby. Nota: Notas: Remover a alimentação do 2104 Modelo DS4 ou Modelo TS4 1 Remover a alimentação de um 2104 Modelo DS4 SC000206 496 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000913 3. PERIGO No passo seguinte irá remover os cabos de alimentação. Estes cabos estão alimentados se a unidade de distribuição de alimentação ou fonte de alimentação ininterruptível (UPS) ainda estiver ligada. (1) 1 1 2 2 Nota: Notes: A não ser que tenha uma razão especial para o fazer, não remova a alimentação do sistema central ou do 2104 Modelo TS4 se as instruções que estiver a seguir não o indicarem. Efectue os seguintes passos para remover a alimentação de um 2104 Modelo TS4: 1. Verifique junto do cliente que todas as operações entre o 2104 Modelo TS4 e o sistema central foram paradas. 2. Na parte posterior do 2104 Modelo TS4, Coloque o interruptor cc on/standby de cada conjunto de ventoínha e fonte de alimentação na posição standby. Consulte a Figura 14. a. Alguns 2104 Modelo TS4 têm um conjunto de ventoínha e fonte de alimentação e outro de ventoínha.O conjunto de ventoínha não tem interruptor cc on/standby. b. É possível que um conjunto de ventoínha e fonte de alimentação possa ter a luz CHK ligada embora o interruptor cc on/standby esteja na posição standby. Remover a alimentação de um 2104 Modelo TS4 Remova os cabos de alimentação da parte posterior do 2104 Modelo DS4. Consulte a Figura 13. 4. Efectue os seguintes passos para devolver a alimentação ao 2104 Modelo DS4: a. Volte a instalar os cabos de alimentação . b. Coloque o interruptor cc on/standby de cada conjunto de ventoínha e fonte de alimentação para on. É possível configurar a sequência de arranque do motor do 2104 Modelo DS4. Para obter mais informações, consulte as secções sobre os interruptores de arranque automá tico e de modo de arranque retardado do motor em . Nota: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 497 SC000914 1 2 2 3. Remover os cabos de alimentação da tomada eléctrica e da parte posterior do 2104 Modelo TS4. Consulte a Figura 14. 4. Execute os seguintes passos para realimentar o 2104 Modelo TS4: a. Reinstale os cabos de alimentação . b. Mude o interruptor ca on/standby de cada conjunto de ventoínha e fonte de alimentação para on. É possível configurar a sequência de arranque do motor do 2104 Modelo TS4. Para obter mais informações, consulte as secções sobre o interruptor de arranque automá tico da unidade e o de modo de arranque retardado do motor em Nota: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide. Figura 14. Remover a alimentação de um 2104 Modelo TS4 SC000207 498 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000915 Avisos de perigo PERIGO PERIGO PERIGO PERIGO PERIGO No passo seguinte irá remover os cabos de alimentação. Estes cabos estão se a unidade de distribuição de alimentação ou fonte de alimentação ininterruptível (UPS) ainda está ligada. (1) Não ligue um cabo de alimentação ao conjunto de ventoínha e fonte de alimentação até que o conjunto esteja perfeitamente encaixado e as porcas de orelhas devidamente apertadas. (3) Durante uma trovoada, não desligue os cabos de estações de visualização, impressoras, telefones ou protectores de estação para linhas de comunicação. (5) Uma tomada eléctrica que não esteja correctamente ligada pode passar uma voltagem perigosa às peças metá licas do sistema ou dispositivos que estejam ligados ao sistema. É da responsabilidade do cliente certificar-se de que a tomada está correctamente ligada e com ligação à terra de modo a prevenir um choque eléctrico. Durante uma trovoada, não desligue os cabos das estações de visualização, impressoras, telefones ou protectores de estação para linhas de comunicação. (4) Não tente abrir as tampas do conjunto de ventoínha e fonte de alimentação. (2) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 499 SC000916 Avisos de segurança CUIDADO: Esta unidade poderá ter dois cabos eléctricos. Para cortar a alimentação, desligue ambos os cabos. (1) CUIDADO: Não retire a cobertura, não tente reparações, as peças não necessitam de manutenção. (2) CUIDADO: Bipolaridade/Fusíveis Neutros. (3) Figura 15. Etiqueta de cuidado com cabos eléctricos Figura 16. Etiqueta de cuidado com a cobertura da fonte de alimentação. Figura 17. Etiqueta de cuidado com fusíveis SC000327 SC000326 SC000325 500 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000916A CUIDADO: Esta unidade pesa mais de 32 Kg (70.5 lbs). Consulte os manuais. (4) Figura 18. Etiqueta de peso do 2104 Modelo DS4 SC000220 Refer to manuals 32 Kg (70.5 lbs) Atenção: Caso tenha dados armazenados nas unidades, identifique as mesmas antes de as remover. Ao voltar a colocar as unidades, instale-as no compartimento de unidade de onde as removeu. De contrá rio, pode ter como resultado a perda de dados. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) CUIDADO: Se o 2104 Modelo DS4 , 32 Kg (70.5 lbs). 2104 Model DS4 2104 Modelo TS4. 32 Kg (70.5 lbs) e o estiver completamente carregado com 14 unidades e duas fontes de alimentação o seu peso total excede os Tem de remover pelo menos seis unidades do antes de o levantar ou instalar num bastidor ou no Assim, reduz o peso total para menos de 2104 Modelo DS4 pode ser manuseado de forma segura por duas pessoas. Se não o fizer, correo risco de provocar ferimentos. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 501 CUIDADO: Esta unidade pesa entre 32 Kg - 55 Kg (70.5 lbs - 121.2 lbs). (5) CUI O: (6) DAD A ferramenta de elevação de embalagens fornecida é necessá ria para mover, instalar e reposicionar o produto quando totalmente carregado. São necessá rias três pessoas para mover o produto em segurança. Uma falha neste procedimento pode provocar lesões. No caso da ferramenta de elevação não estar prontamente disponível, deve reduzir o peso para 32 Kg ou menos removendo todos os componentes pesados (unidades de disco e fontes de alimentação) do produto. Sendo então necessá rias apenas duas pessoas para mover, instalar e reposicionar o produto. Figura 19. Etiqueta de peso 2104 Modelo TS4 SC000219 Atenção: Se tiver dados armazenados nas unidades, identifique as unidades com etiquetas antes de as remover. Quando substituir as unidades, instale cada uma destas no compartimento do qual as removeu. Uma falha neste processo pode resultar numa perda de dados. > 18 Kg (37 lbs) > 32 Kg (70.5 lbs) 32 - 55 Kg (70.5 - 121.2 lbs) SC000917 502 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000918 CUIDADO: Uma condição de espera é indicada pelo símbolo à direita de CA logo acima do interruptor, SW1. Quando muda SW1 para a posição da direita logo abaixo do símbolo standby, a alimentação ca para a unidade não é cortada. (7) CUIDADO: O estabilizador tem de estar correctamente montado na parte inferior frontal do bastidor de modo a evitar que este tombe para a frente aquando da remoção do 2104 Modelo DS4 do bastidor. Não remova nem instale qualquer unidade se não houver um estabilizador montado no bastidor. (8) CUIDADO: O estabilizador tem de estar correctamente montado na parte inferior frontal do bastidor de modo a evitar que este tombe para a frente aquando da instalação do 2104 Model DS4 no bastidor. Não remova nem instale qualquer unidade se não houver um estabilizador montado no bastidor. (9) Figura 20. Localização da condição de espera SC000328 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 503 SC000319 CUIDADO: Não inserir as mãos ou ferramentas no espaço vazio que continha o conjunto de ventoínha. (10) CUIDADO: Não inserir as mãos ou ferramentas no espaço vazio que continha o conjunto de ventoínha e fonte de alimentação. (11) CUIDADO: Não inserir as mãos ou ferramentas no espaço vazio que continha o conjunto da placa da interface SCSI. (12) CUIDADO: Este produto está equipado com um cabo de alimentação e uma ficha de 3 fios para segurança do utilizador. Utilize este cabo de alimentação juntamente com uma tomada eléctrica devidamente ligada à terra de modo a evitar um choque eléctrico. (13) CUIDADO: Não toque na tomada de alimentação ou na chapa da mesma com nada que não sondas de teste antes de completar esta verificação de segurança. (14) CUIDADO: Se a leitura não for infinito, não avance. Faça as correcções necessá rias na ligação dos fios antes de continuar. Não ligue o circuito de derivação CB até que conclua satisfatoriamente os passos anteriores. (15) CUIDADO: Não utilize as pegas dos conjuntos de ventoínha e fonte de alimentação para transportar o 2104 Modelo DS4. Estas pegas não foram concebidas para suportar o peso da unidade. (16) 504 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000920 CUI O: , . . (17) CU : . (18) CUI O: no e . (19) CUI O: no e . (20) CUI O: 2104 Model DS4 n Model T00 o T42, observ . e s e SA38-0577, o Web site: www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html (21) CUI O: 2104 Model DS4 Model TS4 nstal d c e certifi . no 2104 Model DS4 o Model TS4 i . 2104 Model DS4 o Model TS4 instal d CE. (22) DAD À medida que encaixa o conjunto a alavanca move-se automaticamente para a posição de fechada Assegure-se que não entala os dedos entre a alavanca e o conjunto IDADO Certifique-se de que os cabo de alimentação da linha principal foi removido da ventoínha e fonte de alimentação com falhas antes de continuar DAD Não inserir as mãos ou ferramentas spaço vazio por cima do conjunto de ventoínha e fonte de alimentação DAD Não inserir as mãos ou ferramentas spaço vazio entre os conjuntos de fonte de alimentação DAD Se o o estiver instalado um bastidor o u deve também ar os avisos de segurança para esses bastidores antes de começar a remover o conjunto da moldura Pode encontrar os avisos d egurança na secção sobre instalação do sistema m u no seguinte DAD O o e o foram concebidos para serem i a os pelo liente estão cados como para instalação pelo cliente Certifique-se de que o sistema ou bastidor qual o o u o vai ser nstalado também está concebido e certificado como para instalação pelo cliente Caso contrá rio 7014 Model T00 and T42 Rack Installation and Service Guide, , o o u o terá que ser a o por um Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 505 SC000921 CUIDADO: Não inserir as mãos ou ferramentas no espaço que continha o conjunto da placa. (23) CUIDADO: São necessá rias três pessoas para levantar o 2104 Modelo TS4. Não tente levantar o 2104 Modelo TS4 sózinho. Não tente levantá -lo sem a ajuda de outras duas pessoas. (24) CUIDADO: Não utilize as pegas da ventoínha ou conjuntos de ventoínha e fonte de alimentação para transportar o 2104 Modelo TS4. Estas pegas não foram concebidas para suportar o peso da unidade. (25) O 2104 Modelo DS4 está concebido como sendo para instalação pelo cliente e está certificado como para instalação pelo cliente. Certifique-se de que o sistema ou bastidor no qual o 2104 Modelo DS4 irá ser instalado também está concebido e certificado como para instalação pelo cliente. Caso contrá rio, o 2104 Modelo DS4 terá então que ser instalado por um CE. (26) CUIDADO: CUIDADO: O o está concebido como sendo para instalação pelo liente e está certificado como para instalação pelo cliente Certifique-se de que o sistema qual o o irá ser instalado também está concebido e certificado como para instalação pelo cliente Caso contrá rio o 2104 Modelo TS4 terá então que ser instalado por um 2104 Model TS4 c . no 2104 Model TS4 . , CE. (27) 506 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Romanian Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 507 508 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 509 510 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 511 512 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 513 514 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 515 516 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 517 518 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 519 520 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 521 522 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 523 524 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 525 526 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 527 528 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 529 Russian 530 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 531 532 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 533 534 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 535 536 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 537 538 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 539 540 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 541 542 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 543 544 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 545 546 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 547 548 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 549 550 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 551 552 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Slovak Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 553 554 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 555 556 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 557 558 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 559 560 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 561 562 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 563 564 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 565 566 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 567 568 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 569 570 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 571 572 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 573 574 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 575 Slovenia 576 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 577 578 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 579 580 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 581 582 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 583 584 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 585 586 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 587 588 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 589 590 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 591 592 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 593 594 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 595 596 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 597 598 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Spanish Avisos de seguridad y medioambientales Avisos de seguridad y sus traducciones SC000901 Esta sección contiene la siguiente información: Avisos de seguridad y sus traducciones Procedimientos de inspección de la seguridad para este producto Avisos y declaraciones medioambientales Los avisos de seguridad aparecen impresos a lo largo de este documento. Los avisos de advierten de condiciones o procedimientos que pueden causar la muerte o daños personales graves. Los avisos de advierten de condiciones o procedimientos que pueden causar daños personales no letales ni extremadamente peligrosos. Los avisos de advierten de condiciones o procedimientos que pueden causar daños en má quinas, equipos, programas o datos. Los avisos se listan en orden numérico según sus ID, que se muestran entre paréntesis al final de cada uno de ellos. Compruebe la ubicación del número de ID en los siguientes ejemplos de mensajes de peligro y precaución. Peligro Precaución Atención PELIGRO Una toma de corriente que no esté correctamente conectada podría dar lugar un voltaje peligroso en las piezas metá licas del sistema o en los productos conectados al mismo. Es responsabilidad del cliente asegurarse de que la toma esté correctamente cableada y conectada a tierra para evitar descargas eléctricas. (1) PRECAUCIÓN: No introduzca las manos ni herramientas en la abertura del espacio vacío que aloja el conjunto del ventilador. (1) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 599 Descarga electroestá tica (ESD) Procedimientos de inspección de la seguridad Inspección del bastidor SC000902 Atención Cuando manipule unidades sustituibles localmente (FRU) y otras piezas, tome las siguientes precauciones para evitar daños producidos por la electricidad está tica: Limite sus movimientos. El movimiento puede atraer la electricidad está tica a su alrededor. Manipule siempre con cuidado las piezas informá ticas. Sostenga las tarjetas por sus bordes o su cubierta metá lica. No toque nunca circuitos que estén al descubierto. Evite que toquen las piezas informá ticas personas que no estén debidamente en conctacto a tierra. Antes de instalar una pieza nueva, tome el paquete protector de electricidad está tica que contiene la pieza y póngalo en contacto con una pieza sin pintar del sistema 2104 Modelo DS4, 2104 Modelo TS4 o sistema principal durante 2 segundos como mínimo. De este modo se reducirá la electricidad está tica del paquete y de su cuerpo. Extraiga la pieza de su paquete y, si es posible, instá lela en el sistema 2104 Modelo DS4 o Modelo TS4 sin depositarla sobre ninguna superficie. Si debe depositar la pieza, siga los siguientes pasos: 1. Coloque el paquete protector de electricidad está tica que contenía la pieza sobre una superficie lisa y plana. 2. Deposite la pieza sobre el paquete protector de electricidad está tica. No la deposite directamente sobre ninguna superficie metá lica. Los procedimientos de inspección de la seguridad describen el modo de inspeccionar el sistema 2104 Modelo DS4 o Modelo TS4. Consulte el manual de instalación del bastidor para obtener información sobre el modo de realizar una inspección segura del mismo. PRECAUCIÓN: El sistema 2104 Modelo DS4 y Modelo TS4 se han diseñado para que los instale el cliente y está n certificados como configurables por el cliente. Asegúrese de que el sistema o el bastidor en el que se vaya a instalar el sistema 2104 Modelo DS4 o Modelo TS4 esté también diseñado y certificado como configurable por el cliente. De lo contrario, el sistema 2104 Modelo DS4 o Modelo TS4 deberá instalarlo un técnico de sistemas. (22) 600 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Inspección del sistema 2104 Modelo DS4 o Modelo TS4 Comprobaciones externas de la má quina Comprobaciones internas de la má quina SC000903 Realice las siguientes comprobaciones internas de la má quina: 1. Compruebe si se ha realizado alguna modificación en la má quina que no haya hecho IBM. En caso afirmativo, obtenga el formulario “Non-IBM Alteration Attachment Survey” (Inspección anexa de alteración no realizada por IBM), número R009, de la sucursal de IBM. Rellene el formulario y devuélvalo a la sucursal. 2. Compruebe las condiciones del interior de la má quina, por si hubiese: Metales u otros contaminantes Indicaciones de agua u otros fluidos Fuego Daños por humo 3. Compruebe cualquier problema mecá nico obvio, tal como la pérdida de componentes. 4. Compruebe el desgaste, roturas o pinzamientos de los cables y conectores expuestos. Realice las siguientes comprobaciones externas de la má quinas: 1. Compruebe los daños en el chasis (pérdidas, roturas o bordes cortantes). 2. Compruebe los cables de alimentación y asegúrese de que el material aislante no esté desgastado o dañado. 3. Compruebe cualquier cambio no está ndar que sea obvio. Realice una valoración de las implicaciones en la seguridad de cualquiera de tales cambios. 4. Verifique que todas las cubiertas externas estén presentes y no estén dañadas. 5. Compruebe que todos los pestillos y bisagras estén en condiciones de funcionamiento correctas. 6. Compruebe si existen daños en el cable de alimentación 7. Compruebe si los cables está n desgastados, dañados o pinzados. 8. Inspeccione los conjuntos de ventilador y fuente de alimentación. Compruebe que el mecanismo de cierre de la cubierta de la unidad de fuente de alimentación (tornillos o remaches) no se haya desmontado o alterado. 9. Compruebe si existen daños en el cable de señal externo. 10. Compruebe si la cubierta tiene bordes cortantes, daños o alteraciones que expongan las piezas internas del dispositivo. 11. Corrija cualquier problema que encuentre. Realice las siguientes comprobaciones de seguridad para identificar las condiciones en las que no exista seguridad. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 601 Comprobaciones de las etiquetas de seguridad SC000904 Realice las siguientes comprobaciones de las etiquetas de seguridad: 1. Verifique que la etiqueta esté instalada en el sistema 2104 Modelo DS4 o Modelo TS4. Vea la figura 1. 2. Verifique que la etiqueta de precaución de la línea conductora esté instalada en la fuente de alimentación del sistema 2104 Modelo DS4 o Modelo TS4. Vea la figura 2. SC000274 Figura 1. Etiqueta del sistema 2104 Modelo DS4 o Modelo TS4 Figura 2. Etiqueta de precaución de cable conductor SC000325 PRECAUCIÓN: Esta unidad puede tener dos cables conductores. Para eliminar toda la alimentación, desconecte ambos cables conductores. (1) 602 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000905 3. Verifique que la etiqueta de precaución de la cubierta de la fuente de alimentación esté instalada en la fuente de alimentación del sistema 2104 Modelo DS4 o Modelo TS4. Vea la figura 3. 4. Verifique que la etiqueta de precaución del fusible esté instalada en la fuente de alimentación del sistema 2104 Modelo DS4 o Modelo TS4. Vea la figura 4. Figura 3. Etiqueta de precaución de la cubierta de la fuente de alimentación Figura 4. Etiqueta de precaución del fusible SC000326 SC000327 PRECAUCIÓN: No desmonte la cubierta, no proporcione servicio, no hay piezas susceptibles de recibir servicio. (2) PRECAUCIÓN: Fusible de doble polo/neutro. (3) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 603 SC000905A 5. Verifique que la etiqueta del peso esté instalada en el sistema 2104 Modelo DS4. Vea la figura 5. PRECAUCIÓN: Esta unidad pesa sobre 32 Kg (70,5 lbs). Consulte los manuales. (4) Figura 5. Etiqueta de peso del sistema 2104 Modelo DS4 SC000220 Refer to manuals 32 Kg (70.5 lbs) Atención: Si tiene datos guardados en la unidades, etiquete las unidades antes de extraerlas. Cuando sustituya las unidades, instale cada una en la misma bahía de unidad de la que la ha extraído. El no hacerlo de esta manera, puede producir una pérdida de datos. > 18 Kg (37 lbs) > 32 Kg (70,5 lbs) PRECAUCIÓN 2104 Modelo DS4 está completamente cargado con 14 unidades y dos fuentes de alimentación, su peso total excede de 32 Kg (70,5 lbs). Debe extraer al menos seis unidades del sistema 2104 Modelo DS4 antes de levantarlo o instalarlo en un bastidor o sistema 2104 Modelo TS4. Esta acción reduce el peso total a menos de 32 Kg (70,5 lbs) con lo que dos personas pueden manejar el sistema Si no se hace de está forma se pueden producir lesiones físicas. : Si un sistema 2104 Modelo DS4. 604 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices PRECAUCIÓN: Esta unidad pesa entre 32 y 55 Kg (70,5 lbs y 121,2 lbs). (5) Figura 6. Información de peso del sistema 2104 Modelo TS4 32 - 55 Kg (70.5 - 121.2 lbs) SC000219 6. Verifique que la etiqueta del peso esté instalada en el sistema 2104 Modelo TS4. Vea la Figura 6. PRECAUCIÓN: Para trasladar, instalar y reubicar el producto cuando esté completamente lleno se requiere la herramienta para levantar el embalaje que se proporciona. Para mover el producto con seguridad son necesarias tres personas. De no hacerse así podrían producirse lesiones personales ) . 32 Kg o menos extrayendo todos los componentes pesados (unidades de disco y fuentes de alimentación) del producto. Entonces se necesitará n solo dos personas para trasladar, instalar y reubicar el producto. (6 En caso de que la herramienta para levantar el producto no esté disponible fá cilmente, puede reducir el peso a Atención: Si tiene datos almacenados en las unidades, etiquételas antes de extraerlas. Cuando sustituya las unidades, instale cada una en la misma bahía de unidad de la que la ha extraído. Si no se hiciese así, se podría producir una pérdida de datos. > 18 Kg (37 lbs) > 32 Kg (70,5 lbs) SC000906 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 605 SC000907 7. Verifique que la etiqueta de condición de en espera esté instalada en el sistema 2104 Modelo DS4 o Modelo TS4. Vea la Figura 7. PRECAUCIÓN: Una condición de en espera se indica mediante el símbolo a la derecha de DC directamente por encima del interruptor, SW1. Cuando se conmuta SW1 a la posición derecha directamente bajo el símbolo de en espera, la alimentación de ca para la unidad no se apaga. (7) Figura 7. Etiqueta de condición de En espera SC000326 606 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000908 Esta sección contiene instrucciones para comprobar la toma de tierra del sistema 2104 Modelo DS4 o Modelo TS4. Comprobación de la toma de tierra del sistema 2104 Modelo DS4 Siga los pasos siguientes para comprobar la toma de tierra del sistema 2104 Modelo DS4: 3. Algunos circuitos eléctricos se podrían dañar si los cables SCSI externos se conectan al sistema 2104 Modelo DS4 mientras se está realizando la comprobación de la toma de tierra. Asegúrese de que no haya cables SCSI externos enchufados a los conectores . 4. Compruebe la toma de tierra del sistema 2104 Modelo DS4: Atención: a. En los extremos de conexión de los cables de alimentación que ha desenchufado de la unidad de distribución de alimentación o de la unidad de respaldo en el paso 2, realice una inspección visual y asegúrese de que la clavija de toma de tierra de tres cables esté en buenas condiciones. b. Utilice un medidor de ohmios y conecte un extremo al chasis del sistema 2104 Modelo DS4 y el otro extremo a la clavija de toma de tierra de tres cables en la cara de los conectores de alimentación que ha desenchufado en el paso 2. La lectura debería ser igual o inferior a 0,1 ohmios entre la clavija de toma de tierra del conector de alimentación y el chasis del sistema 2104 Modelo DS4. Figura 8. Cables de alimentación y conectores SCSI en el sistema 2104 Modelo DS4 Comprobación de la toma de tierra del sistema 2104 Modelo DS4 o Modelo TS4 1. Asegúrese de que toda la alimentación eléctrica se haya desconectado del bastidor. Vea el bastidor o sistema principal en el documento de instalación. 2. Asegúrese de que los cables de alimentación estén enchufados en cada conjunto de ventilador-fuente de alimentación. Vea la figura 8. Asegúrese también de que los otros extremos de los cables de alimentación en el bastidor estén desconectados de la unidad de distribución de alimentación o de la unidad de respaldo por batería. Vea el bastidor o sistema principal en el documento de instalación. 2 1 SC000210 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 607 SC000909 Si está proporcionando servicio técnico a un sistema 2104 Modelo DS4 de 220 voltios, vaya al paso 5. Si está proporcionando servicio técnico a un sistema 2104 Modelo DS4 de -48 voltios, vaya al paso 7. 5. Compruebe la continuidad entre el chasis del sistema 2104 Modelo DS4 y la clavija de toma de tierra del conector de alimentación de cada conjunto de ventilador-fuente de alimentación. Vea la figura 9. 6. Vaya al paso 9. 7. Compruebe la continuidad entre el chasis del sistema 2104 Modelo DS4 y las clavijas de toma de tierra 1 y 7 de cada conector de línea principal. Vea la figura 10. 8. Vaya al paso 9. 9. Si alguno de los conjuntos de ventilador-fuente de alimentación no tiene continuidad, cambie dicho conjunto por uno nuevo. Vea la sección sobre conjuntos de ventilador y fuente de alimentación en el manual . Realice de nuevo la comprobación completa de la toma de tierra. Si cada conjunto de ventiladorfuente de alimentación tienen continuidad, es posible que exista un problema con el cable de alimentación o con la toma de tierra del sistema principal. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide Figura 9. Clavija de toma de tierra (Sistema 2104 Modelo DS4 de 220 voltios) Figura 10. Clavijas de toma de tierra (Sistema 2104 Modelo DS4 de -48 voltios) 1 SC000329 Si la toma de tierra es correcta, la comprobación de la misma habrá finalizado. Si la conexión a tierra no es correcta, desenchufe los cables de alimentación de los conjuntos de ventilador-fuente de alimentación del sistema 2104 Modelo DS4. SC000330 Clavija 6 Clavija 1 Clavija 7 Clavija12 608 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000910 Si está proporcionando servicio a un sistema 2104 Modelo DS4 de 220 voltios, vaya al paso 10. Si está proporcionando servicio a un sistema 2104 Modelo DS4 de -48 voltios, vea la documentación de instalación del del bastidor o sistema principal para determinar la anomalía. Realice la comprobación completa de la conexión de tierra de nuevo. 10. Compruebe la continuidad del cable de alimentación. Si el cable no tiene continuidad, cá mbielo por uno nuevo y, a continuación, realice la comprobación completa de la conexión de tierra de nuevo. Si el cable tiene continuidad, vea la documentación de instalación para el bastidor o sistema principal para determinar la anomalía. Para comprobar la toma de tierra del sistema 2104 Modelo Ts4, realice los pasos siguientes: 2. Asegúrese de que los otros extremos de los cables de alimentación no estén conectados a las tomas de alimentación eléctrica. Desenchufe los cables si fuese necesario. Comprobación de la toma de tierra del sistema 2104 Modelo TS4 1. Asegúrese de que haya conectado un cable de alimentación en cada toma de alimentación 1 . Vea la figura 11. Figura 11. Cables de alimentación y conectores SCSI (Sistema 2104 Modelo TS4) SC000230 1 2 3 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 609 SC000911 3. : Algunos circuitos eléctricos se podrían dañar si los cables SCSI externos se conectan al sistema 2104 Modelo DS4 mientras se está realizando la comprobación de la toma de tierra. Asegúrese de que no haya cables SCSI externos conectados a los conectores . Vea la figura 11. 4. Compruebe la continuidad entre el chasis del sistema 2104 Modelo TS4 y la clavija de toma de tierra de cada cable de alimentación. a. Compruebe los cables de alimentación y asegúrese de que el conector de tierra del tercer cable se encuentre en buenas condiciones. b. Utilice un medidor para comprobar que la continuidad de la toma de tierra del tercer cable es de 0,1 ohmios o menor entre la clavija de toma de tierra externa y la toma de tierra del chasis. 5. Si la continuidad es buena, la comprobación de la toma de tierra se ha completado. 6. Si la toma de tierra no es correcta, desenchufe los cables de alimentación de los conectores de la línea principal de alimentación. a. Compruebe la continuidad de cada cable de alimentación. b. Si el cable de alimentación falla, cá mbielo por uno nuevo. c. Compruebe la continuidad entre el chasis del sistema 2104 Modelo TS4 y la clavija de toma de tierra del conector de alimentación en cada conjunto de ventilador y fuente de alimentación. Vea la figura 12. Atención d Si el conjunto de ventilador y fuente de alimentación no tiene continuidad, cambie el conjunto por uno nuevo. (Vea la sección sobre conjuntos de ventilador y fuente de alimentación en el manual Vaya a “Comprobación de la toma de tierra del sistema 2104 Modelo TS4” y realice los pasos 1 al 6d para completar la comprobación de la toma de tierra de nuevo. Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide.) Figura 12. Clavija de toma de tierra (Sistema 2104 Modelo TS4) 1 2 3 SC000331 610 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000912 Figura 13. Eliminación de la alimentación de un sistema 2104 Modelo DS4 Salvo que tenga una razón particular para hacerlo, no elimine la alimentación del sistema principal o del sistema 2104 Modelo DS4 a menos que las instrucciones que está siguiendo así se lo indiquen. Siga los pasos siguientes para eliminar la alimentación de un sistema 2104 Modelo DS4: 1. Compruebe con el cliente que todas las operaciones entre el sistema 2104 Modelo DS4 y el sistema principal se hayan detenido. 2. Posicione el interruptor de encendido/espera de cc de cada conjunto de ventilador-fuente de alimentación en posición de espera. Vea la figura 13. a. Algunos sistemas 2104 Modelo DS4 tienen un conjunto de ventilador-fuente de alimentación y un conjunto de ventilador. El conjunto de ventilador no tiene interruptor de encendido/espera de cc. b. Un conjunto de ventilador-fuente de alimentación puede tener su indicador de comprobación (CHK) encendido aunque su interruptor de encendido/espera de cc esté en posición de espera. Nota: Notas: Eliminación de la alimentación del sistema 2104 Modelo DS4 o Modelo TS4 1 Eliminación de la alimentación de un sistema 2104 Modelo DS4 SC000206 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 611 SC000913 3. PELIGRO En el paso siguiente se van a extraer los cables de alimentación. Estos cables tienen corriente si la unidad de distribución de alimentación del bastidor a la unidad de fuente de alimentación ininterrumpida (UPS) está n aún encendidas. (1) 1 1 2 2 Note: Notas: Salvo que tenga una razón particular para hacerlo, no elimine la alimentación del sistema principal o del sistema 2104 Modelo TS4 a menos que las instrucciones que está siguiendo le indiquen que lo haga. Siga los pasos siguientes para eliminar la alimentación de un sistema 2104 Modelo TS4: 1. Compruebe con el cliente que todas las operaciones entre el sistema 2104 Modelo TS4 y el sistema principal se hayan detenido. 2. En la parte posterior del sistema 2104 Modelo TS4, posicione el interruptor de encendido/espera de cc de cada conjunto de ventilador-fuente de alimentación en posición de espera. Vea la figura 14. a. Algunos sistemas 2104 Modelo TS4 tienen un conjunto de ventiladorfuente de alimentación y un conjunto de ventilador. El conjunto de ventilador no tiene interruptor de encendido/espera de cc. b. Un conjunto de ventilador-fuente de alimentación puede tener su indicador de comprobación (CHK) encendido aunque su interruptor de encendido/ espera de cc esté en posición de espera. Eliminación de la alimentación de un sistema 2104 Modelo TS4 Extraiga los cables de alimentación de la parte posterior del sistema 2104 Modelo DS4. Vea la figura 13. 4. Siga los pasos siguientes para devolver la alimentación al sistema 2104 Modelo DS4: a. Vuelva a instalar los cables de alimentación . b. Posicione el interruptor de encendido/espera de cc de cada conjunto de ventilador-fuente de alimentación en posición de encendido. Puede configurar la secuenciación de arranque del motor del sistema 2104 Modelo DS4. Para obtener má s información, vea las secciones sobre el conmutador de arranque automá tico de la unidad y el conmutador de modalidad de arranque del motor de retardo en el manual . Nota: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide 612 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000914 1 2 2 3. Extraiga los cables de alimentación de la toma de alimentación eléctrica y de la parte posterior del sistema 2104 Modelo TS4. Vea la figura 14. 4. Realice los pasos siguientes para devolver la alimentación al sistema 2104 Modelo TS4: a. Vuelva a instalar los cables de alimentación . b. Posicione el interruptor de encendido/espera de cc de cada conjunto de ventilador y fuente de alimentación en la posición de encendido. Es posible configurar la secuenciación del motor de arranque del 2104 Modelo TS4. Para obtener má s información, vea las secciones sobre el interruptor de arranque automá tico de la unidad y el interruptor de modo de retardo de arranque del motor en el manual Nota: Expandable Storage Plus 2104 Model DS4 and Model TS4 Service Guide. Figura 14. Eliminación de la alimentación de un sistema 2104 Modelo TS4 SC000207 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 613 SC000915 Avisos de peligro PELIGRO PELIGRO PELIGRO PELIGRO PELIGRO En el paso siguiente se van a extraer los cables de alimentación. Estos cables pueden tener corriente si la unidad de distribución de alimentación del bastidor o la unidad de fuente de alimentación ininterrumpida (UPS) aún se encuentran encendidas. (1) No enchufe ningún cable de alimentación en el conjunto de ventilador y fuente de alimentación hasta que el conjunto esté completamente en su lugar y sus tornillos estén completamente apretados. (3) Durante una tormenta con aparato eléctrico, no desconecte los cables de estaciones de pantalla, impresoras, teléfonos o protectores de estación para líneas de comunicaciones. (5) Una toma de alimentación eléctrica que no esté correctamente cableada podría dar lugar a un voltaje peligroso en las piezas metá licas del sistema o en los dispositivos que estén conectados al mismo. Es responsabilidad del cliente asegurarse de que la toma de alimentación esté correctamente cableada y conectada a tierra para evitar una descarga eléctrica. Durante una tormenta con aparato eléctrico, no desconecte los cables de estaciones de pantalla, impresoras, teléfonos o protectores de estación para líneas de comunicaciones. (4) No intente abrir las cubiertas del conjunto de ventilador y fuente de alimentación. (2) 614 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000916 Avisos de precaución PRECAUCIÓN: Es posible que esta unidad tenga dos cables de línea. Para suprimir toda la alimentación, desconecte ambos cables de línea. (1) PRECAUCIÓN: No desmonte la cubierta, no precisa servicio, no contiene piezas que se puedan reparar. (2) PRECAUCIÓN: Polo doble/Fusible neutro. (3) Figura 15. Etiqueta de precaución del cable de línea Figura 16. Etiqueta de precaución de la cubierta de la fuente de alimentación Figura 17. Etiqueta de precaución del fusible SC000327 SC000326 SC000325 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 615 SC000916A PRECAUCIÓN 32 Kg (70,5 lbs). Consulte los manuales. : Esta unidad pesa má s de (4) Figura 18. Etiqueta de peso del sistema 2104 Modelo DS4 SC000220 Refer to manuals 32 Kg (70.5 lbs) Atención: Si existen datos almacenados en las unidades, etiquételas antes de extraerlas. Cuando sustituya las unidades, instale cada una de ellas en la misma bahía de unidad de la que la ha extraído. De no hacerlo de así, se puede producir una pérdida de datos. > 18 Kg (37 lbs) > 32 Kg (70,5 lbs) PRECAUCIÓN 2104 Modelo DS4 con 14 unidades y dos fuentes de alimentación, su peso total superará los 32 Kg (70,5 lbs). Deberá extraer seis unidades como mínimo del sistema 2104 Modelo DS4 antes de levantarlo o instalarlo en un bastidor o sistema 2104 Modelo TS4. Esto reducirá el peso total a menos de 32 Kg (70,5 lbs) y hará que el sistema . No hacerlo así puede producir lesiones. : Si se carga por completo un sistema 2104 Modelo DS4 pueda ser manejado con seguridad por dos personas 616 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices PRECAUCIÓN: Esta unidad pesa entre 32 Kg - 55 Kg (70,5 lbs - 121,2 lbs). (5) PRECAUCIÓN es necesaria para transportar, instalar y reubicar el producto cuando esté completamente lleno. Para transportar el producto con seguridad será n necesarias tres personas. No hacerlo así podría producir lesiones. En el caso de que la herramienta de levantamiento no esté disponible, deberá reducir el peso a 32 Kg o menos extrayendo todos los componentes pesados (unidades de disco duro y fuentes de alimentación) del producto. Sólo se necesitará n dos personas para transportar, instalar y reubicar el producto. : La herramienta de levantamiento del embalaje proporcionada (6) Figura 19. Etiqueta del peso del sistema 2104 Modelo TS4 SC000219 Atención: Si existen datos almacenados en las unidades, etiquételas antes de extraerlas. Cuando sustituya las unidades, instale cada una de ellas en la misma bahía de unidad de donde las ha extraido. No hacerlo así podría resultar en una pérdida de datos. > 18 Kg (37 lbs) > 32 Kg (70,5 lbs) 32 - 55 Kg (70.5 - 121.2 lbs) SC000917 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 617 SC000918 PRECAUCIÓN: Una condición de en espera viene indicada por el símbolo a la derecha de la CC justo encima del interruptor, SW1. Cuando conmute SW1 a la posición de la derecha justo debajo del símbolo de en espera, la alimentación de CA de la unidad no estará apagada. (7) PRECAUCIÓN: El estabilizador debe estar correctamente sujeto a la parte frontal inferior del bastidor para evitar que éste se vuelque hacia delante cuando se extraiga el sistema 2104 Modelo DS4 del bastidor. No extraiga ni instale ninguna unidad si no existe un estabilizador sujeto al bastidor. (8) PRECAUCIÓN: El estabilizador debe estar correctamente sujeto a la parte frontal inferior del bastidor para evitar que éste se vuelque hacia delante cuando se instale el sistema 2104 Modelo DS4 en el bastidor. No extraiga ni instale ninguna unidad si no existe un estabilizador sujeto al bastidor. (9) Figura 20. Ubicación de la condición de en espera SC000328 618 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000319 PRECAUCIÓN: No introduzca las manos ni herramientas en el espacio vacío que albergaba al conjunto del ventilador. (10) PRECAUCIÓN: No introduzca las manos ni herramientas en el espacio vacío que albergaba al conjunto del ventilador y de la fuente de alimentación. (11) PRECAUCIÓN: No introduzca las manos ni herramientas en el espacio vacío que albergaba al conjunto de la tarjeta de interfaz SCSI. (12) PRECAUCIÓN: Este producto está equipado con un enchufe y un cable de alimentación de 3 hilos como medida de seguridad para el usuario. Utilice este cable de alimentación junto con una toma eléctrica correctamente conectada a tierra para evitar descargas eléctricas. (13) PRECAUCIÓN: No toque la toma de alimentación ni la toma de alimentación eléctrica con nada que no sean puntas de prueba antes de completar esta comprobación de seguridad. (14) PRECAUCIÓN: Si la lectura no es infinito, no continúe. Efectúe las correcciones necesarias en los cables antes de continuar. No conmute la ramificación del circuito CB hasta que finalice satisfactoriamente los pasos anteriores. (15) PRECAUCIÓN: No utilice las asas del ventilador o de los conjuntos del ventilador y de la fuente de alimentación para transportar el sistema 2104 Modelo Ds4. Estas asas no está n diseñadas para soportar el peso de la unidad. (16) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 619 SC000920 PRECAUCIÓN: Cuando empuje la unidad para colocarla totalmente en su sitio, la palanca se desplazará automá ticamente hacia su posición de cierre. Procure no pellizcarse los dedos entre la palanca y la unidad. (17) Asegúrese de que el cable de alimentación de la línea principal se haya extraído del conjunto de ventilador y unidad de alimentación defectuoso antes de continuar. (18) No introduzca las manos ni herramientas en el espacio vacío situado sobre la unidad de alimentación y ventilación. (19) No introduzca las manos ni herramientas en el espacio vacío situado entre entre las unidades de alimentación. (20) Si el sistema 2104 Modelo DS4 se instala en el bastidor de un sistema Modelo T00 o T42, deberá tener en cuenta también los avisos de seguridad de dicho bastidor antes de proceder a extraer el armazón de la unidad. Encontrará los avisos de seguridad en la sección dedicada a la instalación del sistema de la publicación SA38-0577, o en el siguiente sitio Web: www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html (21) Los sistemas 2104 Modelo DS4 y Modelo TS4 se han diseñado para que los instale el cliente y está n certificados como configurables por el cliente. Asegúrese de que el sistema o el bastidor en el que se vaya a instalar el sistema 2104 Modelo DS4 o Modelo TS4 esté también diseñado y certificado como configurable por el cliente. De lo contrario, el sistema 2104 Modelo DS4 o Modelo TS4 deberá instalarlo un técnico de sistemas. (22) PRECAUCIÓN: PRECAUCIÓN: PRECAUCIÓN: PRECAUCIÓN: PRECAUCIÓN: 7014 Model T00 and T42 Rack Installation and Service Guide, 620 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SC000921 PRECAUCIÓN: No introduzca las manos ni herramientas en el espacio que albergaba el conjunto de la tarjeta. (23) PRECAUCIÓN: Se necesitan tres personas para levantar el sistema 2104 Modelo TS4. No intente levantar el sistema 2104 Modelo TS4 usted solo. No intente levartarlo sin ayuda de otras dos personas. (24) PRECAUCIÓN: No utilice las asas del ventilador o de los conjuntos del ventilador y de la fuente de alimentación para transportar el sistema 2104 Modelo TS4. Estas asas no está n diseñadas para soportar el peso de la unidad. (25) El sistema 2104 Modelo DS4 está diseñado para que lo instale el cliente y está certificado como configurado por el cliente. Asegúrese de que el sistema o bastidor en el que se va a instalar el sistema 2104 Modelo DS4 también esté diseñado y certificado para la configuración del cliente. Si no lo está , un CE deberá instalar el sistema 2104 Modelo DS4. (26) PRECAUCIÓN: PRECAUCIÓN: El sistema 2104 Modelo TS4 está diseñado para que lo instale el cliente y está certificado como configurado por el cliente. Asegúrese de que el sistema en el que se va a instalar el sistema 2104 Modelo TS4 también esté diseñado y certificado para la configuración del cliente. Si no lo está , un CE deberá instalar el sistema 2104 Modelo TS4. (27) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 621 Swedish Upplysningar om säkerhet och miljö Säkerhetsanvisningar med översättning SCSVE901 Det här avsnittet innehåller följande information: Säkerhetsanvisningar med översättning Procedurer för säkerhetsinspektion av produkten Miljöanvisningar och annan miljöinformation Det här dokumentet innehåller säkerhetsinformation. Meddelanden som föregås av ordet Fara varnar för tillstånd eller procedurer som kan orsaka död eller allvarlig personskada. Meddelanden som föregås av ordet Varning varnar för tillstånd eller procedurer som kan orsaka personskada som varken är dödlig eller extremt hälsovådlig. Meddelanden som föregås av ordet Viktigt varnar för tillstånd eller procedurer som kan orsaka skada på datorer, utrustning, program eller data. Meddelandena är förtecknade i nummerordning efter ID, som står inom parentes efter varje meddelande. Följande exempel visar var ID-numren är placerade i meddelanden av typen Fara och Varning. FARA Ett eluttag som inte är korrekt installerat kan göra att det går livsfarlig spänning genom metalldelar i systemet eller produkter som är anslutna till systemet. Det är kundens ansvar att se till att alla eluttag är korrekt installerade och jordade. (1) VARNING: Stoppa inte in händerna eller verktyg i öppningen till det tomma utrymmet där fläkten var placerad. (1) 622 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Statisk elektricitet Procedurer för säkerhetsinspektion Inspektera racket SCSVE902 Viktigt: När du hanterar FRU-enheter (Field Replaceable Units) och andra datordelar ska du vidta följande försiktighetsåtgärder för att undvika skador av statisk elektricitet: Rör dig så lite som möjligt. När du rör dig kan statisk elektricitet byggas upp omkring dig. Var försiktig när du rör vid datorns delar. Håll i kortens kanter eller metallhölje. Rör aldrig vid frilagda kretsar. Förhindra att personer som inte är jordade rör vid datorns delar. Innan du installerar en ny del håller du dess antistatiska förpackning mot en omålad metalldel i 2104-enheten eller värdsystemet i minst 2 sekunder. På så sätt minskar du den statiska elektriciteten i förpackningen och i din egen kropp. Ta ut delen ur förpackningen och installera den omedelbart i 2104-enheten (modell DS4 eller TS4) utan att lägga ifrån dig delen. Om du måste lägga ifrån dig delen gör du så här: 1. Placera den antistatiska förpackningen på ett plant och jämnt underlag. 2. Lägg delen ovanpå den antistatiska förpackningen. Lägg inte delen direkt på en metallyta. I anvisningarna för säkerhetsinspektionen finns en beskrivning av hur du kontrollerar 2104 modell DS4 eller TS4. I installationshandboken till ditt rack finns mer information om hur du gör en säkerhetsinspektion. VARNING: Produkten 2104, modell DS4 och TS4, är utformad för att installeras av kunden och är certifierad som kundinstallerad. Kontrollera att den enhet (dator eller rack) där 2104 modell DS4 eller TS4 ska installeras också är utformad och certifierad för att installeras av kunden. I annat fall måste 2104 modell DS4 eller TS4 installeras av en servicetekniker. (22) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 623 Inspektera 2104 - modell DS4 eller modell TS4 Externa maskinkontroller Interna maskinkontroller SCSVE903 Utför följande kontroller av maskinens inre: 1. Kontrollera att inte några andra ändringar än IBMs egna har gjorts på maskinen. Om du hittar sådana ändringar skaffar du formuläret "Non-IBM Alteration Attachment Survey", nr R009, från IBM-återförsäljaren. Fyll i formuläret och lämna det till IBM-återförsäljaren. 2. Kontrollera maskinens inre skick och se efter om det finns tecken på metallföroreningar eller annan nedsmutsning vatten eller annan vätska brand rökskada 3. Se efter om det finns uppenbara mekaniska fel, t.ex. lösa komponenter. 4. Kontrollera att det inte finns förslitningar, sprickor eller klämskador på exponerade kablar och kontakter. Utför följande kontroller av maskinens utsida: 1. Kontrollera att det inte finns skador (lösa, skadade eller vassa kanter) på chassit. 2. Kontrollera nätkablarna och se efter att isoleringen inte är sliten eller skadad. 3. Se efter att inga ändringar gjorts som avviker från standard. Använd sunt förnuft när du bedömer säkerheten hos sådana ändringar. 4. Kontrollera att alla yttre luckor finns på plats och inte är skadade. 5. Förvissa dig om att alla spärrar och gångjärn fungerar som de ska. 6. Kontrollera att nätkabeln inte är skadad. 7. Se efter om det finns några slitna, skadade eller ihopklämda kablar. 8. Inspektera fläkt-/nätaggregatenheterna. Se efter att fästanordningarna för nätaggregatets hölje (skruvar eller nitar) inte har avlägsnats eller rubbats. 9. Kontrollera att den yttre signalkabeln inte är skadad. 10. Kontrollera kåpan och se efter att det inte finns några vassa kanter, skador eller förändringar som gör att enhetens inre delar exponeras. 11. Rätta till alla fel som du hittar. Utför följande säkerhetskontroller. 624 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Kontroll av säkerhetsetikett SCSVE904 Utför följande kontroll av säkerhetsetiketten: 1. Kontrollera att etiketten är fastsatt på 2104 modell DS4 eller modell TS4. Se figur 1. 2. Kontrollera att nätkabelns varningsetikett är fastsatt på nätaggregatet i 2104 modell DS4 eller TS4. Se figur 2. SC000274 Figur 1. 2104-etikett - modell DS4 eller modell TS4 Figur 2. Varningsetikett om nätkabel SC000325 VARNING: Enheten kan ha två nätkablar. Koppla ur båda nätkablarna så att all ström bryts. (1) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 625 SCSVE905 3. Kontrollera att varningsetiketten om nätaggregatets hölje är fastsatt på nätaggregatet i 2104 modell DS4 eller TS4. Se figur 3. 4. Kontrollera att varningsetiketten om säkring är fastsatt på nätaggregatet i 2104 modell DS4 eller TS4. Se figur 4. Figur 3. Varningsetikett om nätaggregat Figur 4. Varningsetikett om säkring SC000326 SC000327 VARNING: Avlägsna inte höljet. Försök inte reparera. Ingen av delarna kan repareras. (2) VARNING: Dubbelpolig/neutral säkring. (3) 626 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE905A 5. Kontrollera att viktetiketten är fastsatt på 2104 modell DS4. Se figur 5. VARNING: Enheten väger över 32 kg. Se handböckerna. (4) Figur 5. Viktetikett för 2104 modell DS4 SC000220 32 kg (se handböckerna) Viktigt: Om du har data lagrade i enheterna bör du märka enheterna innan du tar bort dem. När du sätter tillbaka enheterna installerar du dem i samma enhetsfack där de tidigare var installerade. I annat fall finns det risk för dataförluster. > 18 kg > 32 kg VARNING: Om 2104 modell DS4 är fullbestyckad med 14 enheter och två nätaggregat väger den över 32 kg. Du måste ta ut minst sex enheter ur 2104 modell DS4 innan du kan lyfta eller installera den i ett rack eller i 2104 modell Ts4. På så sätt minskar du vikten till under 32 kg, och 2104 modell DS4 kan utan risk hanteras av två personer. I annat fall finns det risk för skador. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 627 VARNING: Enheten väger mellan 32 och 55 kg. (5) Figur 6. Viktetikett för 2104 modell TS4 32 - 55 kg SC000219 6. Kontrollera att viktetiketten är fastsatt på 2104 modell TS4. Se figur 6. VARNING: Lyftredskapet av kartong (följer med) krävs för att flytta och installera produkten när den är fullbestyckad. Det behövs minst tre personer när den ska flyttas. I annat fall finns det risk för skador. Om lyftredskapet inte är tillgängligt måste du minska vikten till 32 kg eller mindre genom att ta bort alla tunga komponenter (diskenheter och nätaggregat) från produkten. Sedan krävs det bara två personer för att flytta och installera produkten. (6) Viktigt: Om du har data lagrade i enheterna bör du märka enheterna innan du tar bort dem. När du sätter tillbaka enheterna installerar du dem i samma enhetsfack där de tidigare var installerade. I annat fall finns det risk för dataförluster. > 18 kg > 32 kg SCSVE906 628 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE907 7. Kontrollera att etiketten om vänteläge är fastsatt på 2104 modell DS4 eller modell Ts4. Se figur 7. VARNING: Vänteläge anges av symbolen till höger om DC, direkt ovanför Sw1- omkopplaren. När du ställer SW1-omkopplaren i det högra läget (direkt under väntelägessymbolen) är växelströmmen till enheten inte avstängd. (7) Figur 7. Etikett - vänteläge SC000326 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 629 SCSVE908 Avsnittet innehåller anvisningar för kontroll av jordningen av 2104 modell DS4 eller modell TS4. Kontrollera jordningen av 2104 modell DS4 Så här kontrollerar du jordningen av 2104 modell DS4: 3. Viktigt: Vissa elektriska kretsar kan skadas om de externa SCSI-kablarna ansluts till 2104 modell DS4 medan jordningstestet utförs. Se till att inga externa SCSI-kablar är anslutna till kontakterna . 4. Så här kontrollerar du jordningen av 2104 modell DS4: a. Granska stickkontakterna som du kopplade bort från strömfördelningsenheten eller reservbatteriet i steg 2 och försäkra dig om att det tredje jordstiftet är i gott skick. b. Använd en ohmmeter och anslut den ena ledaren till chassit på 2104 modell DS4 och den andra ledaren till det tredje jordstiftet på stickkontakterna som du kopplade ur i steg 2. Avläsningen bör ge högst 0,1 ohm mellan jordstiftet och chassit. Figur 8. Nätkablar och SCSI-kontakter på 2104 modell DS4 Kontrollera jordningen av 2104 modell DS4 eller modell TS4 1. Se till att all strömförsörjning är bortkopplad från racket. Läs installationsanvisningarna för racket eller värdsystemet. 2. Kontrollera att nätkablarna är anslutna till vardera fläkt-/nätaggregatenheten. Se figur 8. Se också till att nätkablarnas motsatta ände är bortkopplade från strömfördelningsenheten och rackets reservbatteri. Läs installationsanvisningarna för racket eller värdsystemet. 2 1 SC000210 630 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE909 Om du arbetar med en 220 volts 2104 modell DS4 går du till steg 5. Om du arbetar med en -48 volts 2104 modell DS4 går du till steg 7. 5. Kontrollera ledningssammanhanget mellan chassit på 2104 modell DS4 och jordstift på strömkontakten på fläkt-/nätaggregatenheterna. Se figur 9. 6. Gå till steg 9. 7. Kontrollera ledningssammanhanget mellan chassit på 2104 modell DS4 och jordstiften 1 och 7 för varje nätkabelkontakt. Se figur 10. 8. Gå till steg 9. 9. Om en fläkt-/nätaggregatenhet inte har ledningssammanhang byter du ut den mot en ny. Se vidare avsnittet om fläkt-/nätaggregatenheter i servicehandboken till Expandable Storage Plus 2104 modell DS4 och modell TS4. Utför hela jordningstestet på nytt. Om alla fläkt-/nätaggregatenheter har ledningssammanhang, kanske felet beror på nätkabeln eller jordningen av värdsystemet. Figur 9. Jordstift (220 V 2104 modell DS4) Figur 10. Jordstift (-48 V 2104 modell DS4) 1 SC000329 Om jordningen är korrekt är jordningstestet klart. Om jordningen inte är korrekt kopplar du bort nätkablarna från fläkt-/nätaggregatenheten 2104 modell DS4. SC000330 Stift 6 Stift 1 Stift 12 Stift 7 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 631 SCSVE910 Om du arbetar med en 220 volts 2104 modell DS4 går du till steg 10. Om du arbetar med en -48 volts 2104 modell DS4, försöker du lokalisera felet med hjälp av installationsanvisningarna till racket eller värdsystemet. Utför hela jordningstestet på nytt. 10. Kontrollera nätkabelns ledningssammanhang. Om nätkabeln inte har ledningssammanhang byter du ut den mot en ny. Utför sedan hela jordningstestet på nytt. Om nätkabeln har ledningssammanhang försöker du lokalisera felet med hjälp av installationsanvisningarna för racket eller värdsystemet. Så här kontrollerar du jordningen av 2104 modell TS4: 2. Försäkra dig om att nätkablarnas motsatta ände inte är anslutna till eluttag. Koppla loss kablarna om så behövs. Kontrollera jordningen av 2104 modell TS4 1. Kontrollera att en nätkabel är ansluten till varje eluttag (1). Se figur 11. 1 Figur 11. Nätkablar och SCSI-kontakter (2104 modell TS4) SC000230 1 2 3 632 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE911 3. Viktigt: Vissa elektriska kretsar kan skadas om de externa SCSI-kablarna ansluts till 2104 modell TS4 medan jordningstestet utförs. Se till att inga externa SCSI-kablar är anslutna till SCSI-kontakterna . Se figur 11. 4. Kontrollera ledningssammanhanget mellan chassit på 2104 modell TS4 och jordstiftet på varje nätkabelkontakt. a. Kontrollera nätkablarna och se efter att den tredje jordledaren är i gott skick. b. Använd en mätare och kontrollera att jordledarens ledningssammanhang är högst 0,1 ohm mellan det externa jordstiftet och chassits jordning. 5. Om ledningssammanhanget är bra är jordningstestet klart. 6. Om jordningen inte är korrekt kopplar du bort nätkablarna från eluttagen. a. Testa varje nätkabels jordningssammanhang. b. Om det är fel på någon nätkabel byter du ut den mot en ny. c. Kontrollera ledningssammanhanget mellan chassit på 2104 modell TS4 och jordstift på strömkontakten på fläkt-/nätaggregatenheterna. Se figur 12. d Omfläkt-/nätaggregatenheten inte har ledningssammanhang byter du ut den mot en ny. (Se vidare avsnittet om fläkt-/nätaggregatenheter i servicehandboken till Expandable Storage Plus 2104 modell DS4 och modell TS4.) Gå till "Kontrollera jordningen av 2104 modell TS4" och utför hela jordningstestet på nytt enligt steg 1 till och med steg 6d. Figur 12. Jordstift (2104 modell TS4) 1 2 3 SC000331 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 633 SCSVE912 Figur 13. Koppla bort strömmen från 2104 modell DS4 Anm: Koppla inte bort strömmen från värdsystemet eller från 2104 modell DS4 annat än om det står i instruktionerna att du ska göra det, eller du behöver göra det av någon annan anledning. Så här kopplar du bort strömmen från 2104 modell DS4: 1. Kontrollera med kunden att alla operationer mellan 2104 modell DS4 och värdsystemet har stoppats. 2. Ställ likströmsomkopplaren på alla fläkt-/nätaggregatenheter i vänteläge. Se figur 13. Anm: a. Vissa versioner av 2104 modell DS4 har en fläkt-/nätaggregatenhet och en fläktenhet. Fläktenheten har inte någon likströmsomkopplare. b. På en fläkt-/nätaggregatenhet kan kontrollampan lysa trots att likströmsomkopplaren är satt i vänteläge. Koppla bort strömmen från 2104 modell DS4 eller modell TS4 1 Koppla bort strömmen från 2104 modell DS4 SC000206 634 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE913 3. FARA I följande steg kopplar du bort nätkablarna. Det går ström genom kablarna så länge rackets strömfördelnings- eller UPS-enhet (Uninterruptable Power Supply) inte stängts av. (1) 1 1 2 2 Anm: Koppla inte bort strömmen från värdsystemet eller från 2104 modell TS4 annat än om det står i instruktionerna att du ska göra det, eller du behöver göra det av någon annan anledning. Så här kopplar du bort strömmen från 2104 modell TS4: 1. Kontrollera med kunden att alla operationer mellan 2104 modell TS4 och värdsystemet har stoppats. 2. På baksidan av 2104 modell TS4 ställer du likströmsomkopplaren på alla fläkt-/nätaggregatenheter i vänteläge. Se figur 14. Anm: a. Vissa versioner av 2104 modell TS4 har en fläkt-/nätaggregatenhet och en fläktenhet. Fläktenheten har inte någon likströmsomkopplare. b. På en fläkt-/nätaggregatenhet kan kontrollampan lysa trots att likströmsomkopplaren är satt i vänteläge. Koppla bort strömmen från 2104 modell TS4 Koppla loss nätkablarna från baksidan av 2104 modell DS4. Se figur 13. 4. Så här kopplar du på strömmen igen till 2104 modell DS4: a. Anslut nätkablarna igen. b. Ställ varje fläkt-/nätaggregatenhets likströmsomkopplare i påslaget läge ("On"). Anm: Du kan konfigurera motorstartordningen för 2104 modell DS4. Mer information finns i avsnittet om omkopplare för autostart och fördröjd motorstart i servicehandboken till Expandable Storage Plus 2104 modell DS4 och modell TS4. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 635 SCSVE914 1 2 2 3. Koppla loss nätkablarna från eluttaget och från baksidan av 2104 modell TS4. Se figur 14. 4. Så här kopplar du på strömmen igen till 2104 modell TS4: a. Anslut nätkablarna igen. b. Ställ varje fläkt-/nätaggregatenhets likströmsomkopplare i påslaget läge ("On"). Anm: Du kan konfigurera motorstartordningen för 2104 modell TS4. Mer information finns i avsnittet om omkopplare för autostart och fördröjd motorstart i servicehandboken till Expandable Storage Plus 2104 modell DS4 och modell Ts4. Figur 14. Koppla bort strömmen från 2104 modell TS4 SC000207 636 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE915 Fara FARA FARA FARA FARA FARA I följande steg kopplar du bort nätkablarna. Det går ström genom kablarna så länge rackets strömfördelnings- eller UPS-enhet (Uninterruptable Power Supply) inte stängts av. (1) Anslut inte en nätkabel till fläkt-/nätaggregatenheten förrän enheten är helt installerad och tumskruvarna åtdragna. (3) Undvik att koppla bort kablar till bildskärmar, skrivare, telefoner eller åskskydd för kommunikationslinjer vid åskväder. (5) Ett felaktigt installerat eluttag kan göra metalldelar i systemet eller på anslutna enheter strömförande. Det är kundens ansvar att se till att alla eluttag är korrekt installerade och jordade. Undvik att koppla bort kablar till bildskärmar, skrivare, telefoner eller åskskydd för kommunikationslinjer vid åskväder. (4) Försök inte öppna fläkt-/nätaggregatenhetens kåpa. (2) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 637 SCSVE916 Varningsmeddelanden VARNING: Enheten kan ha två nätkablar. Koppla ur båda nätkablarna så att all ström bryts. (1) VARNING: Avlägsna inte höljet. Försök inte reparera. Ingen av delarna kan repareras. (2) VARNING: Dubbelpolig/neutral säkring. (3) Figur 15. Varningsetikett om nätkabel Figur 16. Varningsetikett om nätaggregat Figur 17. Figur 4. Varningsetikett om säkring SC000327 SC000326 SC000325 638 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE916A VARNING: Enheten väger över 32 kg. Se handböckerna. (4) Figur 18. Viktetikett för 2104 modell DS4 SC000220 32 kg (se handböckerna) Viktigt: Om du har data lagrade i enheterna bör du märka enheterna innan du tar bort dem. När du sätter tillbaka enheterna installerar du dem i samma enhetsfack där de tidigare var installerade. I annat fall finns det risk för dataförluster. > 18 kg > 32 kg VARNING: Om 2104 modell DS4 är fullbestyckad med 14 enheter och två nätaggregat väger den över 32 kg. Du måste ta ut minst sex enheter ur 2104 modell DS4 innan du kan lyfta eller installera den i ett rack eller i 2104 modell Ts4. På så sätt minskar du vikten till under 32 kg och 2104 modell DS4 kan utan risk hanteras av två personer. I annat fall finns det risk för skador. Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 639 VARNING: Enheten väger mellan 32 och 55 kg. (5) VARNING: Lyftredskapet av kartong (följer med) krävs för att flytta och installera produkten när den är fullbestyckad. Det behövs minst tre personer när den ska flyttas. I annat fall finns det risk för skador. Om lyftredskapet inte är tillgängligt måste du minska vikten till 32 kg eller mindre genom att ta bort alla tunga komponenter (diskenheter och nätaggregat) från produkten. Sedan krävs det bara två personer för att flytta och installera produkten. (6) Figur 19. Viktetikett för 2104 modell TS4 SC000219 Viktigt: Om du har data lagrade i enheterna bör du märka enheterna innan du tar bort dem. När du sätter tillbaka enheterna installerar du dem i samma enhetsfack där de tidigare var installerade. I annat fall finns det risk för dataförluster. > 18 kg > 32 kg 32 - 55 kg SCSVE917 640 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE918 VARNING: Vänteläge anges av symbolen till höger om DC, direkt ovanför Sw1- omkopplaren. När du ställer SW1-omkopplaren i det högra läget (direkt under väntelägessymbolen) är växelströmmen till enheten inte avstängd. (7) VARNING: Tippskyddet måste vara fäst vid rackets nedre framkant för att förhindra att racket tippar framåt när 2104 modell DS4 tagits ut ur racket. Ta inte bort eller installera enheter förrän ett tippskydd har fästs vid racket. (8) VARNING: Tippskyddet måste vara fäst vid rackets nedre framkant för att förhindra att racket tippar framåt när 2104 modell DS4 installeras i racket. Ta inte bort eller installera enheter förrän ett tippskydd har fästs vid racket. (9) Figur 20. Placering av etikett om vänteläge SC000328 Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 641 SCSVE319 VARNING: Stoppa inte in händerna eller verktyg i det tomma utrymmet där fläkten var placerad. (10) VARNING: Stoppa inte in händerna eller verktyg i det tomma utrymmet där fläkt- /nätaggregatenheten var placerad. (11) VARNING: Stoppa inte in händerna eller verktyg i det tomma utrymmet där SCSI-kortet var placerat. (12) VARNING: Av säkerhetsskäl är produkten utrustad med en nätkabel med tre ledare och kontakt. Du undviker risken att datorns hölje blir strömförande vid en eventuell kortslutning genom att ansluta nätkabeln till ett skyddsjordat nätuttag. (13) VARNING: Rör inte vid eluttaget eller eluttagets täckplatta med något annat än testsonder innan du har genomfört hela säkerhetskontrollen. (14) VARNING: Om det avlästa värdet inte är oändligt ska du inte fortsätta. Gör de ändringar som behövs innan du fortsätter. Slå inte på kretsens strömbrytare förrän samtliga ovanstående steg har genomförts på rätt sätt. (15) VARNING: Lyft inte 2104 modell DS4 genom att ta tag i handtagen på fläkten eller fläkt- /nätaggregatenheten. Handtagen är inte tänkta att bära enhetens vikt. (16) 642 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices SCSVE920 VARNING: När du skjutit in enheten helt och hållet stängs spaken automatiskt. Var försiktig så att du inte klämmer fingrarna mellan spaken och enheten. (17) VARNING: Förvissa dig om att huvudströmkabeln har kopplats loss från den icke fungerande fläkt-/nätaggregatenhet innan du fortsätter. (18) VARNING: Stoppa inte in händerna eller verktyg i det tomma utrymmet ovanför fläkt- /nätaggregatenheten. (19) VARNING: Stoppa inte in händerna eller verktyg i det tomma utrymmet mellan nätaggregaten. (20) VARNING: Om 2104 modell DS4 installeras i ett rack av modell T00 eller T42 måste du också följa säkerhetsanvisningarna för dessa rack innan du börjar ta bort ramen. Säkerhetsanvisningarna finns i avsnittet om systeminstallation i 7014 Model T00 and T42 Rack Installation and Service Guide, SA38-0577 och på följande webbadress: www-1.ibm.com/servers/eserver/pseries/library/hardware_docs/7014_t00.html (21) VARNING: Produkten 2104, modell DS4 och TS4, är utformad för att installeras av kunden och är certifierad som kundinstallerad. Kontrollera att den enhet (dator eller rack) där 2104 modell DS4 eller TS4 ska installeras också är utformad och certifierad för att installeras av kunden. I annat fall måste 2104 modell DS4 eller TS4 installeras av en servicetekniker. (22) Expandable Storage Plus 2104 Model DS4 and Model TS4 translations 643 SCSVE921 VARNING: Stoppa inte in händerna eller verktyg i det tomma uttrymme där kortet var placerat. (23) VARNING: Det krävs tre personer för att lyfta 2104 modell TS4. Försök inte lyfta 2104 modell TS4 själv. Försök inte att lyfta den utan att ha hjälp av två andra personer. (24) VARNING: Lyft inte 2104 modell TS4 genom att ta tag i handtagen på fläkten eller fläkt- /nätaggregatenheten. Handtagen är inte tänkta att bära enhetens vikt. (25) 2104 modell DS4 är utformad för att installeras av kunden och är certifierad som kundinstallerad. Kontrollera att den enhet (dator eller rack) där 2104 modell DS4 ska installeras också är utformad och certifierad för att installeras av kunden. I annat fall måste 2104 modell DS4 installeras av en servicetekniker. (26) VARNING: VARNING: 2104 modell TS4 är utformad för att installeras av kunden och är certifierad som kundinstallerad. Kontrollera att det system där 2104 modell TS4 ska installeras också är utformad och certifierad för att installeras av kunden. I annat fall måste 2104 modell TS4 installeras av en servicetekniker. (27) 644 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Notices This information was developed for products and services offered in the U. S. A. IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe on any IBM intellectual property right may be used instead. However, it is the user’s responsibility to evaluate and verify the operation of any non-IBM product, program, or service. IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing to: IBM Director of Licensing IBM Corporation North Castle Drive Armonk, N.Y. 10504–1785 U.S.A. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product and use of those Web sites is at your own risk. IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. © Copyright IBM Corp. 2003 645 Trademarks The following terms are trademarks of the International Business Machines Corporation in the United States, other countries, or both: AIX IBM RS/6000 Eserver Intel is a registered trademark of Intel Corporation in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others. Electronic emission statements This section gives the electronic emission notices or statements for the United States and other countries. Federal Communications Commission (FCC) statement This equipment has been tested and found to comply with the limits for a class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user’s authority to operate the equipment. This device complies with Part 15 of the FCC Rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada compliance statement Avis de conformite a la reglementation d’Industrie Canada: Cet appareil numerique de la classe A est conform a la norme NMB-003 du Canada. Chinese Class A warning statement Attention: This is a Class A product. In a domestic environment, this product may cause radio interference in which case the user may be required to take adequate measures. 646 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices European Community compliance statement This product is in conformity with the protection requirements of EC Council Directive 89/336/EEC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product is in conformity with the EU council directive 73/23/EEC on the approximation of the laws of the Member States relating to electrical equipment designed for use within certain voltage limits. This conformity is based on compliance with the following harmonized standard: EN60950. This product has been tested and found to comply with the limits for class A Information Technology Equipment according to European Standard EN 55022. The limits for class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment. Attention: This is a class A product. In a domestic environment, this product may cause radio interference in which case the user may be required to take adequate measures. Where shielded or special cables (for example, cables fitted with ferrites) are used in the test to make the product comply with the limits: Properly shielded and grounded cables and connectors must be used in order to reduce the potential for causing interference to radio and TV communications and to other electrical or electronic equipment. Such cables and connectors are available from IBM authorized dealers. IBM cannot accept responsibility for any interference caused by using other than recommended cables and connectors. Germany compliance statement Zulassungsbescheinigung laut Gesetz ueber die elektromagnetische Vertraeglichkeit von Geraeten (EMVG) vom 30. August 1995. Dieses Geraet ist berechtigt, in Uebereinstimmung mit dem deutschen EMVG das EG-Konformitaetszeichen - CE - zu fuehren. Der Aussteller der Konformitaetserklaeung ist die IBM Deutschland. Informationen in Hinsicht EMVG Paragraph 3 Abs. (2) 2: Das Geraet erfuellt die Schutzanforderungen nach EN 50082-1 und EN 55022 Klasse A. Notices 647 EN 55022 Klasse A Geraete beduerfen folgender Hinweise: Nach dem EMVG:_|t¹| “Geraete duerfen an Orten, fuer die sie nicht ausreichend entstoert sind, nur mit besonderer Genehmigung des Bundesministeriums fuer Post und Telekommunikation oder des Bundesamtes fuer Post und Telekommunikation betrieben werden. Die Genehmigung wird erteilt, wenn keine elektromagnetischen Stoerungen zu erwarten sind.” (Auszug aus dem EMVG, Paragraph 3, Abs.4) Dieses Genehmigungsverfahren ist nach Paragraph 9 EMVG in Verbindung mit der entsprechenden Kostenverordnung (Amtsblatt 14/93) kostenpflichtig. Nach der EN 55022: “Dies ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funkstoerungen verursachen. in diesem Fall kann vom Betreiber verlangt werden, angemessene Massnahmen durchzufuehren und dafuer aufzukommen.” Anmerkung: Um die Einhaltung des EMVG sicherzustellen, sind die Geraete wie in den Handbuechern angegeben zu installieren und zu betreiben. Japanese Voluntary Control Council for Interference (VCCI) class 1 statement Korean Government Ministry of Communication (MOC) statement Please note that this device has been approved for business purposes with regard to electromagnetic interference. If you find that this is not suitable for your use, you may exchange it for one with a non-business use. Taiwan class A compliance statement VS07171L 648 Expandable Storage Plus: 2104 Model DS4 and Model TS4 Translated Safety Notices Vos remarques sur ce document / Technical publication remark form Titre / Title : Bull ESCALA Expandable Storage Plus 2104 Models DS4 and TS4 Translated Safety Notices Nº Reférence / Reference Nº : 86 X1 16M 00 Daté / Dated : November 2003 ERREURS DETECTEES / ERRORS IN PUBLICATION AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION Vos remarques et suggestions seront examinées attentivement. Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète. Your comments will be promptly investigated by qualified technical personnel and action will be taken as required. If you require a written reply, please furnish your complete mailing address below. NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : Remettez cet imprimé à un responsable BULL ou envoyez-le directement à : Please give this technical publication remark form to your BULL representative or mail to: BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Technical Publications Ordering Form Bon de Commande de Documents Techniques To order additional publications, please fill up a copy of this form and send it via mail to: Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à : BULL CEDOC ATTN / Mr. L. CHERUBIN 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Phone / Téléphone : +33 (0) 2 41 73 63 96 FAX / Télécopie +33 (0) 2 41 73 60 19 E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr Or visit our web sites at: / Ou visitez nos sites web à: http://www.logistics.bull.net/cedoc http://www–frec.bull.com http://www.bull.com CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] [ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : PHONE / TELEPHONE : FAX : E–MAIL : For Bull Subsidiaries / Pour les Filiales Bull : Identification: For Bull Affiliated Customers / Pour les Clients Affiliés Bull : Customer Code / Code Client : For Bull Internal Customers / Pour les Clients Internes Bull : Budgetary Section / Section Budgétaire : For Others / Pour les Autres : Please ask your Bull representative. / Merci de demander à votre contact Bull. BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 X1 16M 00 ORDER REFERENCE PLACE BAR CODE IN LOWER LEFT CORNER Utiliser les marques de découpe pour obtenir les étiquettes. Use the cut marks to get the labels. 86 A1 15EM 00 Expandable Storage Plus 2104 Models DS4 and TS4 Translated Safety Notices ESCALA 86 A1 15EM 00 Expandable Storage Plus 2104 Models DS4 and TS4 Translated Safety Notices ESCALA 86 A1 15EM 00 Expandable Storage Plus 2104 Models DS4 and TS4 Translated Safety PL160R/T, PL260R/T and PL460R/T Service Guide ESCALA REFERENCE 86 A1 02FA 01 ESCALA PL160R/T, PL260R/T and PL460R/T Service Guide Hardware July 2008 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A1 02FA 01 The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright © IBM, 2007-2008 Copyright © Bull SAS 2007-2008 Printed in France Trademarks and Acknowledgements We acknowledge the rights of the proprietors of the trademarks mentioned in this manual. All brand names and software and hardware product names are subject to trademark and/or patent protection. Quoting of brand and product names is for information purposes only and does not represent trademark misuse. The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material. Contents Safety and environmental notices . . . . . . . . . . . . . . . . . . . . . . . .xiii World Trade safety information . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii German safety information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii Laser safety information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii Laser compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE . . . . . .xiv Product recycling and disposal . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiv Battery return program . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xv About this publication . . . . . . . . . . . . . . . . . . . . . . . . . . . .xvii How to send your comments . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xvii Chapter 1. Starting a Service Call . . . . . . . . . . . . . . . . . . . . . . . . .1 Chapter 2. Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . . . .5 Bus, high-speed link (HSL/RIO/12X) isolation information . . . . . . . . . . . . . . . . . . . .7 PCI bus isolation using AIX, Linux, or the HMC . . . . . . . . . . . . . . . . . . . . . .8 Isolating a PCI bus problem while running AIX or Linux . . . . . . . . . . . . . . . . . .8 Isolating a PCI bus problem from the HMC . . . . . . . . . . . . . . . . . . . . . . .8 Verifying a high-speed link, system PCI bus, or a multi-adapter bridge repair . . . . . . . . . . . .9 Analyzing a RIO/HSL/12X or PCI bus reference code . . . . . . . . . . . . . . . . . . . .11 DSA translation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 Card positions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 Converting the loop number to RIO/HSL/12X port location labels . . . . . . . . . . . . . . . .14 Fiber optic cleaning procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 Status indications for RIO/HSL . . . . . . . . . . . . . . . . . . . . . . . . . . . .15 RIO/HSL/12X connection problems . . . . . . . . . . . . . . . . . . . . . . . . . .22 Additional RIO/HSL FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . . .24 Powering on and powering off an I/O expansion unit . . . . . . . . . . . . . . . . . . .24 HSL loop configuration and status form . . . . . . . . . . . . . . . . . . . . . . . . .25 Installed features in a PCI bridge set form . . . . . . . . . . . . . . . . . . . . . . . .25 RIO/HSL/12X link status diagnosis form . . . . . . . . . . . . . . . . . . . . . . . .26 HSL/RIO 12X Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . . . . . .27 CONSL01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .27 RIOIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .28 Main task . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .28 The ports on both ends of the failed link are in different system units on the loop . . . . . . . . . .29 The port on one end of the failed link is in a system unit and the port on the other end is in an I/O unit . .30 The ports on both ends of the failed link are in an I/O unit . . . . . . . . . . . . . . . . .31 Cannot power on unit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .33 Manually detecting the failed link . . . . . . . . . . . . . . . . . . . . . . . . . .33 Refresh the port status . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .33 RIOIP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .34 RIOIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .34 RIOIP09 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .35 RIOIP10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .36 RIOIP11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .36 RIOIP12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .38 RIOIP56 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .39 Multi-adapter bridge (MAB) isolation procedures . . . . . . . . . . . . . . . . . . . . . . .41 MABIP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41 MABIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41 MABIP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41 MABIP50 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .42 MABIP51 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .42 © Copyright IBM Corp. 2008 iii MABIP52 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .42 MABIP53 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .43 MABIP54 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .43 MABIP55 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .43 Communication isolation procedure . . . . . . . . . . . . . . . . . . . . . . . . . . .47 COMIP01, COMPIP1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .47 Intermittent isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . .50 INTIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .52 INTIP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .53 INTIP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .53 INTIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .54 INTIP09 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .55 INTIP14 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57 INTIP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57 INTIP18 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57 INTIP20 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .57 INTIP24 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .58 I/O processor (IOP) isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . .60 IOPIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .62 IOPIP13 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64 IOPIP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66 IOPIP17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .69 IOPIP18 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .71 IOPIP19 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .72 IOPIP20 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .73 IOPIP21 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75 IOPIP22 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .75 IOPIP23 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .76 IOPIP25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .76 IOPIP26 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .78 IOPIP27 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .79 IOPIP28 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .81 IOPIP29 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .81 IOPIP30 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .81 IOPIP31 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .84 IOPIP32 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .86 IOPIP33 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .87 IOPIP34 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .87 IOPIP40 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .88 IOPIP41 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .89 Licensed internal code (LIC) isolation procedures . . . . . . . . . . . . . . . . . . . . . . .90 LICIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .91 LICIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .92 LICIP04 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .93 LICIP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .93 LICIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .96 LICIP11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .96 How to find the cause code . . . . . . . . . . . . . . . . . . . . . . . . . . . .97 0001 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .97 0002 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .97 0004 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .100 0005 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101 0006 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101 0007 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101 0008 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101 0009 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101 000A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .101 000B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102 000C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102 000D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102 000E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .102 iv Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 0010 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 0011 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 0012 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 0015 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 0016 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 0017 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 0018 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 0019 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .103 001A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104 001C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104 001D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104 001E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104 001F . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .104 0020 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105 0021 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105 0022 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105 0023 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105 0024 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .105 0025 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106 0026 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106 0027 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106 0031 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106 0033 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106 0034 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .106 0035 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107 0037 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107 0099 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107 LICIP12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .107 How to find the cause code . . . . . . . . . . . . . . . . . . . . . . . . . . .107 0002 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .108 0004 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .109 0007 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .109 0008 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110 0009 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110 000A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110 000B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110 000D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110 000E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110 002C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .110 002D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111 002E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111 002F . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111 0030 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111 0032 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111 0099 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111 LICIP13 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .111 LICIP14 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .116 LICIP15 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .117 LICIP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .119 Logical partition (LPAR) isolation procedure . . . . . . . . . . . . . . . . . . . . . . . .120 LPRIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .120 Operations Console isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . .124 OPCIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .125 Power isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .128 Power problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .129 Cannot power on system unit . . . . . . . . . . . . . . . . . . . . . . . . . . .129 Cannot power off system or SPCN-controlled I/O expansion unit . . . . . . . . . . . . . . .132 Cannot power on SPCN-controlled I/O expansion unit . . . . . . . . . . . . . . . . . .136 PWR1900 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .141 PWR1905 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .142 Instructions for Models 8203-E4A, 9407-M15, or 9408-M25 . . . . . . . . . . . . . . . . .142 Contents v PWR1906 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .144 PWR1906 Instructions for 5094 and 5294 expansion units . . . . . . . . . . . . . . . . . .144 PWR1907 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .146 PWR1908 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .148 PWR1908 instructions for 5088 or 0588 expansion units . . . . . . . . . . . . . . . . . .148 PWR1909 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .150 Instructions for 5095, 0595, 5790, 5796, 7311-D11, 7311-D20 and 7314-G30 . . . . . . . . . . . .150 PWR1911 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .152 PWR1917 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .157 PWR1918 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .159 PWR1920 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .160 Router isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .162 RTRIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163 RTRIP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163 RTRIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163 RTRIP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .163 RTRIP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .164 RTRIP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .165 RTRIP08 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .165 SAS isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .166 SIP3110 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .168 SIP3111 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .169 SIP3112 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .171 SIP3113 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .172 SIP3120 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .174 SIP3121 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .175 SIP3130 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .175 SIP3131 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .177 SIP3132 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .180 SIP3134 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .181 SIP3140 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .183 SIP3141 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .184 SIP3142 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .185 SIP3143 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .187 SIP3144 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .188 SIP3145 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .191 SIP3146 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .193 SIP3147 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .194 SIP3148 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .195 SIP3149 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .197 SIP3150 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .197 SIP3152 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .199 SIP3153 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .201 Service processor isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . .202 FSPSP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .205 FSPSP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206 FSPSP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206 FSPSP04 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206 FSPSP05 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .206 FSPSP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207 FSPSP07 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207 FSPSP09 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207 FSPSP10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207 FSPSP11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .207 FSPSP12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208 FSPSP14 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208 FSPSP16 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208 FSPSP17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208 FSPSP18 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .208 FSPSP20 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209 FSPSP22 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209 vi Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) FSPSP23 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209 FSPSP24 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209 FSPSP25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .209 FSPSP27 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .210 FSPSP28 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .210 FSPSP29 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .210 FSPSP30 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .211 FSPSP31 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .211 FSPSP32 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .211 FSPSP33 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .212 FSPSP34 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .212 FSPSP35 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .213 FSPSP38 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .213 FSPSP42 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .213 FSPSP45 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214 FSPSP46 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214 FSPSP47 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214 FSPSP48 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214 FSPSP49 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .214 FSPSP50 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215 FSPSP51 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215 FSPSP52 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215 FSPSP54 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .215 FSPSP55 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .216 FSPSP63 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .216 FSPSP64 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .216 FSPSPC1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .217 FSPSPD1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .218 Tape unit isolation procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . .219 TUPIP02 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .221 TUPIP03 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .223 TUPIP04 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .224 TUPIP06 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .228 Tape unit self-test procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . .229 Running the self-test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .230 Interpreting the results . . . . . . . . . . . . . . . . . . . . . . . . . . . . .230 Tape device ready conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . .231 Twinaxial workstation I/O processor isolation procedure . . . . . . . . . . . . . . . . . . . .233 TWSIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .234 Workstation adapter isolation procedure . . . . . . . . . . . . . . . . . . . . . . . . .240 WSAIP01 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .241 Workstation adapter console isolation procedure . . . . . . . . . . . . . . . . . . . . . .243 Isolating problems on servers that run the AIX or Linux operating systems . . . . . . . . . . . . . .245 MAP 0210: General problem resolution . . . . . . . . . . . . . . . . . . . . . . . . .246 Problems with loading and starting the operating system (AIX and Linux) . . . . . . . . . . . . .246 Storage I/O card modes and jumpers . . . . . . . . . . . . . . . . . . . . . . . . .250 Setting or changing the mode of an I/O card . . . . . . . . . . . . . . . . . . . . . .250 How to set or change the mode of an I/O card . . . . . . . . . . . . . . . . . . . .250 Mode jumper has been overridden . . . . . . . . . . . . . . . . . . . . . . . . .251 Mode jumper is missing . . . . . . . . . . . . . . . . . . . . . . . . . . . . .252 SCSI service hints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .252 General SCSI Configuration Checks . . . . . . . . . . . . . . . . . . . . . . . . .252 High Availability or Multiple SCSI System Checks . . . . . . . . . . . . . . . . . . . .252 SCSI-2 Single-Ended Adapter PTC Failure Isolation Procedure . . . . . . . . . . . . . . . .252 Determining Where to Start . . . . . . . . . . . . . . . . . . . . . . . . . . .253 External SCSI-2 Single-Ended Bus PTC Isolation Procedure . . . . . . . . . . . . . . . . .253 External SCSI-2 Single-Ended Bus Probable Tripped PTC Causes . . . . . . . . . . . . . . .254 Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure . . . . . . . . . . . . . . . . .254 Internal SCSI-2 Single-Ended Bus Probable Tripped PTC Resistor Causes . . . . . . . . . . . .256 SCSI-2 Differential Adapter PTC Failure Isolation Procedure . . . . . . . . . . . . . . . . .256 External SCSI-2 Differential Adapter Bus PTC Isolation Procedure . . . . . . . . . . . . . . .256 Contents vii SCSI-2 Differential Adapter Probable Tripped PTC Causes . . . . . . . . . . . . . . . . .258 Dual-Channel Ultra SCSI Adapter PTC Failure Isolation Procedure . . . . . . . . . . . . . .258 64-bit PCI-X Dual Channel SCSI Adapter PTC Failure Isolation Procedure . . . . . . . . . . . .259 MAP 0020: Problem determination procedure . . . . . . . . . . . . . . . . . . . . . .259 MAP 0030: Additional Problem Determination . . . . . . . . . . . . . . . . . . . . . .266 MAP 0040: Intermittent Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .267 MAP 0050: SCSI bus problems . . . . . . . . . . . . . . . . . . . . . . . . . . . .269 Preparing for hot-plug SCSI device or cable deconfiguration . . . . . . . . . . . . . . . . .274 Purpose of this procedure . . . . . . . . . . . . . . . . . . . . . . . . . . .274 After hot-plug SCSI device or cable deconfiguration . . . . . . . . . . . . . . . . . . .275 MAP 0054: PCI-X SCSI Bus Problems . . . . . . . . . . . . . . . . . . . . . . . . .276 MAP 0070: 888 sequence in operator panel display . . . . . . . . . . . . . . . . . . . . .278 MAP 0220: Hot-Swap FRU problem resolution . . . . . . . . . . . . . . . . . . . . . .279 MAP 0230: Platform error problem resolution . . . . . . . . . . . . . . . . . . . . . .284 MAP 0260: System hangs during resource configuration . . . . . . . . . . . . . . . . . . .288 MAP 0270: SCSI RAID problem resolution and verification . . . . . . . . . . . . . . . . . .291 MAP 0280: Console and Keyboard Problem Resolution . . . . . . . . . . . . . . . . . . .295 MAP 0285: Multipath I/O (MPIO) Problem Resolution . . . . . . . . . . . . . . . . . . .296 MAP 0291: Missing Device or Bus Problem Resolution . . . . . . . . . . . . . . . . . . .299 PFW1540: Problem isolation procedures . . . . . . . . . . . . . . . . . . . . . . . .301 PFW1542: I/O problem isolation procedure . . . . . . . . . . . . . . . . . . . . . . .302 PFW1548: Memory and processor subsystem problem isolation procedure . . . . . . . . . . . . .316 PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached . . .317 PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached . . .324 Collect serviceable events in Integrated Virtualization Manager . . . . . . . . . . . . . . . . . .333 Chapter 3. Locating FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . .335 Using location codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .336 Physical Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .339 Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .339 Addresses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .347 Chapter 4. Removing and replacing FRUs . . . . . . . . . . . . . . . . . . . .349 External cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .352 Battery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .353 Cache battery pack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .366 Control panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .375 Disk Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .379 Replace the disk drive using AIX . . . . . . . . . . . . . . . . . . . . . . . . . . .379 Replace the disk drive using Linux . . . . . . . . . . . . . . . . . . . . . . . . . .382 Prepare to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . .382 Remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . .385 Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . .386 Replace the disk drive nonconcurrently . . . . . . . . . . . . . . . . . . . . . . . . .388 Rebuild data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . .390 Disk unit backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .391 Ethernet card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .403 Fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .409 GX adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .415 Media device . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .421 Memory DIMM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .430 PCI adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .438 Remove and replace a PCI adapter contained in a cassette in the system with the power on in AIX . . . .443 Remove and replace a PCI adapter contained in a cassette in the system with the power on in Linux . . . .457 Prerequisites for hot-plugging PCI adapters in Linux . . . . . . . . . . . . . . . . . . .469 Verify that the Linux, hot-plug PCI tools are installed . . . . . . . . . . . . . . . . . . .470 Power supply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .471 RAID enablement card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .477 Auxiliary cache card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .483 Slimline media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .489 viii Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) System backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .493 Thermal power management device (TPMD) card . . . . . . . . . . . . . . . . . . . . . .525 Voltage regulator module pair . . . . . . . . . . . . . . . . . . . . . . . . . . . . .531 Voltage regulator module - single . . . . . . . . . . . . . . . . . . . . . . . . . . . .539 Vital product data (VPD) card . . . . . . . . . . . . . . . . . . . . . . . . . . . . .547 Chapter 5. Verifying a repair . . . . . . . . . . . . . . . . . . . . . . . . . .557 Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .559 Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . .563 Verifying a repair using an i5/OS system or logical partition . . . . . . . . . . . . . . . . . .564 Chapter 6. Closing a service call . . . . . . . . . . . . . . . . . . . . . . . .565 Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . .571 Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . .577 Activating and deactivating LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . .583 Deactivate a system attention LED or partition LED using the HMC . . . . . . . . . . . . . . .583 Activate or deactivate identify LED using the HMC . . . . . . . . . . . . . . . . . . . .583 Deactivate a system attention LED or partition LED using the ASMI . . . . . . . . . . . . . . .584 Activate or deactivate identify LED using the ASMI . . . . . . . . . . . . . . . . . . . .584 Chapter 7. Common Service Procedures . . . . . . . . . . . . . . . . . . . . .587 Powering on and powering off . . . . . . . . . . . . . . . . . . . . . . . . . . . .588 Determining a primary or alternative console . . . . . . . . . . . . . . . . . . . . . . . .591 Determining if the system has logical partitions . . . . . . . . . . . . . . . . . . . . . . .593 Separating the 571F/575B card set and moving the cache directory card . . . . . . . . . . . . . . .594 Determining which Processor is the Secondary Service Processor . . . . . . . . . . . . . . . . .600 The System Reference Code (SRC) Format Description . . . . . . . . . . . . . . . . . . . .601 System Reference Code (SRC) Information . . . . . . . . . . . . . . . . . . . . . . . .601 System reference code (SRC) address formats . . . . . . . . . . . . . . . . . . . . . . .601 Hardware SRC formats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .605 Logical address format . . . . . . . . . . . . . . . . . . . . . . . . . . . . .607 Using the Service Action Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . .609 Problems with noncritical resources . . . . . . . . . . . . . . . . . . . . . . . . . .610 System Service Tools (SST) . . . . . . . . . . . . . . . . . . . . . . . . . . . . .611 Work with communications trace . . . . . . . . . . . . . . . . . . . . . . . . . .611 Using the product activity log . . . . . . . . . . . . . . . . . . . . . . . . . . . .617 Hexadecimal Product Activity Log data . . . . . . . . . . . . . . . . . . . . . . . .617 More information from hexadecimal reports . . . . . . . . . . . . . . . . . . . . . . .620 Reclaiming IOP cache storage . . . . . . . . . . . . . . . . . . . . . . . . . . . .636 Working with Storage Dumps . . . . . . . . . . . . . . . . . . . . . . . . . . . .636 Dedicated Service Tools (DST) . . . . . . . . . . . . . . . . . . . . . . . . . . . .637 Accessing Dedicated Service Tools . . . . . . . . . . . . . . . . . . . . . . . . .637 Selecting function 21 from the control panel . . . . . . . . . . . . . . . . . . . . .638 Control panel function codes on the HMC . . . . . . . . . . . . . . . . . . . . . .638 Performing an IPL to DST . . . . . . . . . . . . . . . . . . . . . . . . . . .640 Performing an alternate IPL to DST (type D IPL) . . . . . . . . . . . . . . . . . . .641 Start a service tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .642 IPL type, mode, and speed options . . . . . . . . . . . . . . . . . . . . . . . . .642 Printing the System Configuration List . . . . . . . . . . . . . . . . . . . . . . . . .643 Print the system configuration list . . . . . . . . . . . . . . . . . . . . . . . . .643 Details on the system bus, main storage, or processor . . . . . . . . . . . . . . . . . . .644 Hardware Service Manager . . . . . . . . . . . . . . . . . . . . . . . . . . . . .644 Managing the Advanced System Management Interface (ASMI) . . . . . . . . . . . . . . . .645 Accessing the ASMI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .646 Accessing the ASMI using a Web browser . . . . . . . . . . . . . . . . . . . . . .646 Accessing the ASMI using an ASCII terminal . . . . . . . . . . . . . . . . . . . . .648 Accessing the ASMI using the HMC . . . . . . . . . . . . . . . . . . . . . . . .648 Displaying error and event logs . . . . . . . . . . . . . . . . . . . . . . . . . .649 Setting the system enclosure type . . . . . . . . . . . . . . . . . . . . . . . . . .650 Setting the system identifiers . . . . . . . . . . . . . . . . . . . . . . . . . . .650 Contents ix Viewing vital product data . . . . . . . . . . . . . . . . . . . . . . . . . . . .651 Clearing all deconfiguration errors . . . . . . . . . . . . . . . . . . . . . . . . .651 Service functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .652 Managing dumps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .653 Performing dumps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .653 Performing a platform system or service processor dump . . . . . . . . . . . . . . . . .653 Using the HMC to perform a system dump . . . . . . . . . . . . . . . . . . . . .654 Using the control panel to perform a system dump . . . . . . . . . . . . . . . . . . .654 Using the ASMI to perform a system dump . . . . . . . . . . . . . . . . . . . . .655 Performing a service processor dump . . . . . . . . . . . . . . . . . . . . . . . .655 Using the HMC to perform a service processor dump . . . . . . . . . . . . . . . . . .655 Using the control panel to perform a service processor dump . . . . . . . . . . . . . . .656 Using the ASMI to perform a service processor dump . . . . . . . . . . . . . . . . . .656 Copying a dump . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .656 Using an HMC to copy a dump . . . . . . . . . . . . . . . . . . . . . . . . . .657 Using the AIX command line to copy a dump . . . . . . . . . . . . . . . . . . . . .657 Using the Linux command line to copy a dump . . . . . . . . . . . . . . . . . . . . .657 Using the IBM i5/OS service tools to copy a dump . . . . . . . . . . . . . . . . . . . .658 Copying a platform system or SP dump . . . . . . . . . . . . . . . . . . . . . .658 Copying a main storage dump . . . . . . . . . . . . . . . . . . . . . . . . .658 Reporting a dump . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .659 Using an HMC to report a dump . . . . . . . . . . . . . . . . . . . . . . . . . .659 Using the AIX command line to report a dump . . . . . . . . . . . . . . . . . . . . .660 Using the Linux command line to report a dump . . . . . . . . . . . . . . . . . . . .660 Using the i5/OS service tools to report a main storage or platform system or SP dump . . . . . . .660 Deleting a dump . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .661 Using an HMC to delete a dump . . . . . . . . . . . . . . . . . . . . . . . . . .661 Using the AIX command line to delete a dump . . . . . . . . . . . . . . . . . . . . .662 Using the Linux command line to delete a dump . . . . . . . . . . . . . . . . . . . .662 Using the i5/OS service tools to delete a dump . . . . . . . . . . . . . . . . . . . . .662 Performing an IBM i5/OS main storage dump . . . . . . . . . . . . . . . . . . . . . . .663 Using the HMC to perform an IBM i5/OS main storage dump . . . . . . . . . . . . . . . . .663 Using the control panel to perform an IBM i5/OS main storage dump . . . . . . . . . . . . . .663 Performing a slow boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .665 Using the HMC to perform a slow boot . . . . . . . . . . . . . . . . . . . . . . . .665 Using the control panel to perform a slow boot . . . . . . . . . . . . . . . . . . . . . .665 Using the ASMI menus to perform a slow boot . . . . . . . . . . . . . . . . . . . . . .666 Determining the primary or alternate console . . . . . . . . . . . . . . . . . . . . . . .667 Primary console workstation requirements . . . . . . . . . . . . . . . . . . . . . . .667 Find the primary console when the system is operational . . . . . . . . . . . . . . . . . . .668 Find the primary console when the system power is off . . . . . . . . . . . . . . . . . . .668 Resetting the service processor . . . . . . . . . . . . . . . . . . . . . . . . . . . .669 Resetting the managed system connection from the HMC . . . . . . . . . . . . . . . . . . .670 Checking for a duplicate IP address . . . . . . . . . . . . . . . . . . . . . . . . . . .671 Running the online and eServer stand-alone diagnostics . . . . . . . . . . . . . . . . . . . .672 Running the online diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . .672 Running the online diagnostics in service mode . . . . . . . . . . . . . . . . . . . . .673 Running the online diagnostics in service mode with an HMC attached . . . . . . . . . . . .673 Running the online diagnostics in service mode without an HMC attached . . . . . . . . . . .673 Running the online diagnostics in maintenance mode . . . . . . . . . . . . . . . . . . .674 Running the online diagnostics in concurrent mode . . . . . . . . . . . . . . . . . . .674 Running the eServer stand-alone diagnostics from CD-ROM . . . . . . . . . . . . . . . . . .675 Running the eServer stand-alone diagnostics from CD on a server with an HMC attached . . . . . . .676 Selecting testing options . . . . . . . . . . . . . . . . . . . . . . . . . . . .676 Running the eServer stand-alone diagnostics from CD on a server without an HMC attached . . . . .677 Selecting testing options . . . . . . . . . . . . . . . . . . . . . . . . . . . .678 Running the eServer stand-alone diagnostics from a Network Installation Management server . . . . . .679 Concurrent PCI adapter removal and replacement using AIX . . . . . . . . . . . . . . . . . .682 Concurrent PCI adapter removal and replacement using Linux . . . . . . . . . . . . . . . . . .686 Prerequisites for hot-plugging PCI adapters in Linux . . . . . . . . . . . . . . . . . . . .689 Verify that the Linux, hot-plug PCI tools are installed . . . . . . . . . . . . . . . . . . . .689 x Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 8. Part Information . . . . . . . . . . . . . . . . . . . . . . . . . .691 Appendix. Accessibility features . . . . . . . . . . . . . . . . . . . . . . . .701 Accessibility features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .701 IBM and accessibility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .701 Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .703 Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .705 Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . .707 Class A Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .707 Terms and conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . . .711 Contents xi xii Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Safety and environmental notices Safety notices may be printed throughout this guide: v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to people. v CAUTION notices call attention to a situation that is potentially hazardous to people because of some existing condition. v Attention notices call attention to the possibility of damage to a program, device, system, or data.World Trade safety information Several countries require the safety information contained in product publications to be presented in their national languages. If this requirement applies to your country, a safety information booklet is included in the publications package shipped with the product. The booklet contains the safety information in your national language with references to the U.S. English source. Before using a U.S. English publication to install, operate, or service this product, you must first become familiar with the related safety information in the booklet. You should also refer to the booklet any time you do not clearly understand any safety information in the U.S. English publications. German safety information Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der Bildschirmarbeitsverordnung geeignet. Laser safety information IBM servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs. Laser compliance All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive, DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information: v Do not remove the covers. Removing the covers of the laser product could result in exposure to hazardous laser radiation. There are no serviceable parts inside the device. v Use of the controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation exposure.(C026) CAUTION: Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. (C027) CAUTION: This product contains a Class 1M laser. Do not view directly with optical instruments. (C028) © Copyright IBM Corp. 2008 xiii CAUTION: Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following information: laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. (C030) Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: The equipment is suitable for installation in the following: v Network telecommunications facilities v Locations where the NEC (National Electrical Code) applies The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect these interfaces metallically to OSP wiring. Note: All Ethernet cables must be shielded and grounded at both ends. The ac-powered system does not require the use of an external surge protection device (SPD). The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal shall not be connected to the chassis or frame ground. Product recycling and disposal This unit must be recycled or discarded according to applicable local and national regulations. IBM encourages owners of information technology (IT) equipment to responsibly recycle their equipment when it is no longer needed. IBM offers a variety of product return programs and services in several countries to assist equipment owners in recycling their IT products. Information on IBM product recycling offerings can be found on IBM’s Internet site at http://www.ibm.com/ibm/environment/products/prp.shtml. Esta unidad debe reciclarse o desecharse de acuerdo con lo establecido en la normativa nacional o local aplicable. IBM recomienda a los propietarios de equipos de tecnología de la información (TI) que reciclen responsablemente sus equipos cuando éstos ya no les sean útiles. IBM dispone de una serie de programas y servicios de devolución de productos en varios países, a fin de ayudar a los propietarios de equipos a reciclar sus productos de TI. Se puede encontrar información sobre las ofertas de reciclado de productos de IBM en el sitio web de IBM http://www.ibm.com/ibm/environment/products/prp.shtml. Note: This mark applies only to countries within the European Union (EU) and Norway. xiv Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Appliances are labeled in accordance with European Directive 2002/96/EC concerning waste electrical and electronic equipment (WEEE). The Directive determines the framework for the return and recycling of used appliances as applicable throughout the European Union. This label is applied to various products to indicate that the product is not to be thrown away, but rather reclaimed upon end of life per this Directive. In accordance with the European WEEE Directive, electrical and electronic equipment (EEE) is to be collected separately and to be reused, recycled, or recovered at end of life. Users of EEE with the WEEE marking per Annex IV of the WEEE Directive, as shown above, must not dispose of end of life EEE as unsorted municipal waste, but use the collection framework available to customers for the return, recycling, and recovery of WEEE. Customer participation is important to minimize any potential effects of EEE on the environment and human health due to the potential presence of hazardous substances in EEE. For proper collection and treatment, contact your local IBM representative. Battery return program This product may contain sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or lithium ion battery. Consult your user manual or service manual for specific battery information. The battery must be recycled or disposed of properly. Recycling facilities may not be available in your area. For information on disposal of batteries outside the United States, go to http://www.ibm.com/ibm/environment/products/batteryrecycle.shtml or contact your local waste disposal facility. In the United States, IBM has established a return process for reuse, recycling, or proper disposal of used IBM sealed lead acid, nickel cadmium, nickel metal hydride, and other battery packs from IBM Equipment. For information on proper disposal of these batteries, contact IBM at 1-800-426-4333. Please have the IBM part number listed on the battery available prior to your call. For Taiwan: Please recycle batteries. For the European Union: Safety and environmental notices xv Note: This mark applies only to countries within the European Union (EU). Batteries or packaging for batteries are labeled in accordance with European Directive 2006/66/EC concerning batteries and accumulators and waste batteries and accumulators. The Directive determines the framework for the return and recycling of used batteries and accumulators as applicable throughout the European Union. This label is applied to various batteries to indicate that the battery is not to be thrown away, but rather reclaimed upon end of life per this Directive. In accordance with the European Directive 2006/66/EC, batteries and accumulators are labeled to indicate that they are to be collected separately and recycled at end of life. The label on the battery may also include a chemical symbol for the metal concerned in the battery (Pb for lead, Hg for mercury and Cd for cadmium). Users of batteries and accumulators must not dispose of batteries and accumulators as unsorted municipal waste, but use the collection framework available to customers for the return, recycling, and treatment of batteries and accumulators. Customer participation is important to minimize any potential effects of batteries and accumulators on the environment and human health due to the potential presence of hazardous substances. For proper collection and treatment, contact your local IBM representative. For California: Perchlorate Material - special handling may apply. See www.dtsc.ca.gov/hazardouswaste/perchlorate. The foregoing notice is provided in accordance with California Code of Regulations Title 22, Division 4.5 Chapter 33. Best Management Practices for Perchlorate Materials. This product, part, or both may include a lithium manganese dioxide battery which contains a perchlorate substance. xvi Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) About this publication This publication provides service providers with maintenance information for IBM® System i™ (9407-M15 and 9408-M25) and System p™ 520 (8203-E4A). This publication includes start-of-call procedures, isolation procedures, and removal and replacement procedures. For information about the accessibility features of this product, for users who have a physical disability, see “Accessibility features,” on page 701. How to send your comments Your feedback is important in helping to provide the most accurate and highest quality information. If you have any comments about this publication, use the Feedback button at http://www.ibm.com/systems/infocenter. Alternatively, you can send your comments to pubsinfo@us.ibm.com. Be sure to include the name of the book, the form number of the book, and the specific location of the text you are commenting on (for example, a page number or table number). © Copyright IBM Corp. 2008 xvii xviii Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 1. Starting a Service Call A service call is a starting point for service actions. You should begin all service actions with this procedure. From this point, you will be guided to the appropriate information to help you perform the necessary steps to repair the server. Note: In this topic, control panel and operator panel are synonymous. Before beginning, record information to help you return the server to the same state that the customer typically uses, such as: v The IPL type that the customer typically uses for the server v The IPL mode that is used by the customer on this server v The way in which the server is configured or partitioned 1. Has problem analysis been performed using the procedures in Problem Analysis, SA76-0111? Yes No ? Perform problem analysis using the procedures in Problem Analysis, SA76-0111. 2. Is the failing server managed by a Hardware Management Console (HMC)? No Yes ? Continue with step 7 on page 2. 3. Is the failing server managed by Integrated Virtualization Manager (IVM)? No Yes ? Go to “Collect serviceable events in Integrated Virtualization Manager” on page 333, and then return here and continue with next step. 4. Do you have a Field Replaceable Unit (FRU), location code, and an action plan to replace a failing FRU? © Copyright IBM Corp. 2008 1 No Yes ? Go to Chapter 3, “Locating FRUs,” on page 335. 5. Do you have an action plan to perform an isolation procedure? No Yes ? Go to Chapter 2, “Isolation Procedures,” on page 5. 6. Choose an action from the following symptom list: v You have a reference code but no action plan: go to Problem Analysis, SA76-0111. v There is a hardware problem that does not generate a reference code: use the Table of Contents to find an appropriate isolation procedure or contact your next level of support. v To check your server’s firmware level or to update it: see the system’s operation guide. v You have an Engineering Change (EC) or a hardware upgrade to apply: use the instructions provided with the change or upgrade. v You suspect a software problem: contact software support v To perform diagnostics or another service function: use the Table of Contents to locate the common service procedure.This ends the procedure. 7. Is the Hardware Management Console (HMC) connected and functional? Yes No ? Start the HMC and attach it to the server. If you experience a problem with the HMC, refer to the Service Guide for the Hardware Management Console Models 7042-CR4 and 7042-C06, SA76-0120. When the HMC is connected and functional, continue with the next step. 2 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 8. Perform the following steps from the HMC that is used to manage the server. During these steps, refer to the service data that was gathered earlier: 1. In the Navigation Area, open Service Management. 2. Select Manage Events. The Manage Serviceable Events - Select Serviceable Events panel will be displayed. 3. Select the status of Open. 4. Select all other selections to ALL and click OK. 5. Scroll through the list and verify that there is a problem with the status of Open to correspond with the customer’s reported problem.Do you find the reported problem, or an open problem near the time of the reported problem? Note: If you are unable to locate the reported problem, and there is more than one open problem near the time of the reported failure, use the earliest problem in the list. Yes No ? Go to step 4 on page 1. 9. Select the serviceable event you want to repair, and select Repair from the Selected menu. Follow the instructions displayed on the HMC. After you complete the repair procedure, the system automatically closes the serviceable event. This ends the procedure. Note: If the Repair procedures are not available, go to step 4 on page 1 Chapter 1. Starting a Service Call 3 4 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 2. Isolation Procedures Isolation procedures are used together with diagnostic programs that are part of server firmware. If a server is connected to an HMC, these procedures are available on the HMC. Use the HMC procedures to continue isolating the problem. If the server does not have an HMC and you are directed to perform an isolation procedure, the procedures documented here are needed to continue isolating a problem. “Bus, high-speed link (HSL/RIO/12X) isolation information” on page 7Symbolic FRUs, failing items, and bus isolation procedures use the terms “partition” and “logical partition” to indicate any single partition in a system that has multiple partitions. If the system you are working on does not have multiple partitions, then the terms refer to the primary partition. “HSL/RIO 12X Isolation Procedures” on page 27Use RIO/HSL/12X isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Multi-adapter bridge (MAB) isolation procedures” on page 41Use multi-adapter bridge (MAB) isolation procedures if there is no HMC attached to the server. If the server is connected to the HMC, use the procedures that are available on the HMC to continue FRU isolation. “Communication isolation procedure” on page 47This topic contains the procedure necessary to isolate a communications failure. Please read and observe the following warnings when using this procedure. “Intermittent isolation procedures” on page 50These procedures help you to correct an intermittent problem. “I/O processor (IOP) isolation procedures” on page 60Use these procedures to isolate a failure in the multiple function I/O card. “Licensed internal code (LIC) isolation procedures” on page 90Use this section to isolate licensed internal code (LIC) problems. “Logical partition (LPAR) isolation procedure” on page 120These procedures help you to identify logical partition (LPAR) configuration conditions and the associated corrective actions. “Operations Console isolation procedures” on page 124These procedures help you to isolate a failure with the Operations Console. “Power isolation procedures” on page 128Use power isolation procedures for isolating a problem in the power system. Use isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Router isolation procedures” on page 162These procedures serve as a guide to the correct isolation procedures from the reference code tables. “SAS isolation procedures” on page 166Use the following Serial Attached SCSI (SAS) isolation procedures if an HMC is not attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Service processor isolation procedures” on page 202Use service processor isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Tape unit isolation procedures” on page 219This topic contains the procedures necessary to isolate a failure in a tape device. © Copyright IBM Corp. 2008 5 “Isolating problems on servers that run the AIX or Linux operating systems” on page 245Use the procedures for AIX or Linux servers if there is not an HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “Collect serviceable events in Integrated Virtualization Manager” on page 333Use this procedure when using the Integrated Virtualization Manager (IVM) to check for serviceable events on your system. 6 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Bus, high-speed link (HSL/RIO/12X) isolation information Symbolic FRUs, failing items, and bus isolation procedures use the terms “partition” and “logical partition” to indicate any single partition in a system that has multiple partitions. If the system you are working on does not have multiple partitions, then the terms refer to the primary partition. Read all safety notices below before servicing the system and while performing a procedure. Note: Unless instructed otherwise, always power off the system before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “PCI bus isolation using AIX, Linux, or the HMC” on page 8Use this procedure if you are isolating a PCI bus problem from the HMC or while running AIX® or Linux®. “Verifying a high-speed link, system PCI bus, or a multi-adapter bridge repair” on page 9Use this procedure to verify a repair for the high-speed link, a system PCI bus, or for a multi-adapter bridge. Chapter 2. Isolation Procedures 7 “Analyzing a RIO/HSL/12X or PCI bus reference code” on page 11Use Word 7 of the reference code to determine the bus number, bus type, multi-adapter bridge number, multi-adapter bridge function number, and logical card number from the direct select address (DSA). “DSA translation” on page 12The Direct Select Address (DSA) may be coded in word 7 of the reference code. “Card positions” on page 12This information specifies card positions for the various server models. “Converting the loop number to RIO/HSL/12X port location labels” on page 14Use this table to convert the RIO/HSL/12X loop number to port location labels. “Fiber optic cleaning procedures” on page 15Use the fiber optic cleaning kit and the fiber optic cleaning procedures in ?SY27-2604 Fiber Optic Cleaning Procedures? for all fiber channel connections such as those used in optical high speed link (HSL) connections or fibre channel attached devices. “Status indications for RIO/HSL” on page 15A status indication for a RIO/HSL loop is identified in the reference code. “RIO/HSL/12X connection problems” on page 22The failing component is the RIO/HSL/12X connection “HSL loop configuration and status form” on page 25Use this HSL loop configuration and status form to record the status of the HSL ports in the loop. “Installed features in a PCI bridge set form” on page 25Use this form to record the PCI bridge set card positions, and multi-adapter bridge function numbers. “RIO/HSL/12X link status diagnosis form” on page 26Use this form to record the status of the RIO/HSL/12X links.PCI bus isolation using AIX, Linux, or the HMC Use this procedure if you are isolating a PCI bus problem from the HMC or while running AIX or Linux. If you have an HMC, then this procedure should be performed from the HMC as part of the HMC directed service. If you do not have an HMC, then you should perform this procedure when directed by the maintenance package. Isolating a PCI bus problem while running AIX or Linux Can an IPL be run on the operating system? v No: Perform “MABIP52” on page 42. This ends the procedure. v Yes: Choose from the following: – If you are running AIX, go to “Running the online and eServer stand-alone diagnostics” on page 672 to isolate the PCI bus failure with online diagnostics in concurrent mode. – If you are running Linux, go to “Running the online and eServer stand-alone diagnostics” on page 672 to isolate the PCI bus failure with stand-alone diagnostics.This ends the procedure.Isolating a PCI bus problem from the HMC To isolate a PCI bus problem from the HMC, check the serviceable event view for the server for FRU part locations associated with the serviceable event, then continue with this procedure: 1. Did the serviceable event view provide the locations for the failing FRUs? Yes: Use those locations to exchange the given FRUs one at a time until the problem is resolved. This ends the procedure. No: Continue with the next step. 8 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 2. Go to “DSA translation” on page 12 to determine the Direct Select Address (DSA). 3. Perform the following: a. Record the bus number value (BBBB) from the DSA and convert it to decimal format. b. Search for the decimal system bus number in the partition resources screens on the HMC. c. Record the frame or unit type and continue with the next step.4. Record the Cc value from the DSA. Is the Cc value greater than 00? Yes: Continue with the next step. No: The multi-adapter bridge number and the multi-adapter function number have not been identified, and so the card slot cannot be identified using the DSA. Look in the HMC partition resources for non-reporting or non-operational hardware. That will indicate which cards in which positions need to be replaced. Refer to Finding part locations for the model you are working on for information about the multi-adapter bridge that controls those card slots. That multi-adapter bridge is also a FRU. This ends the procedure.5. Is the right-most character (c) F? No: Continue with the next step. Yes: Only the multi-adapter bridge number has been identified. Record the multi-adapter bridge number (left-most character of Cc) for later use. Because the card slot cannot be identified with the DSA, refer to Finding part locations for the model you are working on for information about the multi-adapter bridge that controls the card slots. Consider all card slots controlled by the multi-adapter bridge to be FRUs. This ends the procedure.6. Refer to “Card positions” on page 12 and use the BBBB and Cc values that you recorded to identify the card position. Then return to the procedure that sent you here. This ends the procedure.Verifying a high-speed link, system PCI bus, or a multi-adapter bridge repair Use this procedure to verify a repair for the high-speed link, a system PCI bus, or for a multi-adapter bridge. Within this procedure, the terms ?system? and ?logical partition? are interchangeable when used individually. 1. Perform this procedure from the logical partition you were in when you were sent to this procedure, or from the HMC if this error was worked from the HMC. 2. If you previously powered off a system or logical partition, or an expansion unit during this service action, then you need to power it off again. 3. Install all cards, cables, and hardware, ensuring that all connections are tight. You can use the system configuration list to verify that the cards are installed correctly. 4. Power on any expansion unit, logical partition or system unit that was powered off during the service action. Is one of the following true? v If the system or a logical partition was powered off during the service action, does the IPL complete successfully to the IPL or does Install the System display? v If an expansion tower was powered off during the service action, does the expansion tower power on complete successfully? v If any IOP or IOA card locations were powered off using concurrent maintenance during the service action, do the slots power on successfully? v If you exchanged a FRU that should appear as a resource or resources to the system, such as an IOA, or I/O bridge, does the new FRU’s resource appear in HSM as operational? Yes: Continue with the next step. Chapter 2. Isolation Procedures 9 No: Verify that you have followed the power off, remove and replace, and power-on procedures correctly. When you are sure that you have followed the procedures correctly, then exchange the next FRU in the list. If there are no more FRUs to exchange, then contact your next level of support. This ends the procedure. 5. Does the system or logical partition have mirrored protection? Select Yes if you are not sure. No: Continue with the next step. Yes: From the Dedicated Service Tools (DST) display, select Work with disk units, and resume mirrored protection for all units that have a suspended status. 6. Choose from the following options: v If you are working from a partition, from the Start a Service Tool display, select Hardware service manager and look for the I/O processors that have a failed or missing status. v If you are working from a HMC, look at the CEC properties.a. Choose the I/O tab. b. Look for IOAs or IOPs that have a failed or missing status.Are all I/O processor cards operational?Note: Ignore any IOPs that are listed with a status of not connected. Yes: Go to step 10. No: Display the logical hardware resource information for the non-operational I/O processors. For all I/O processors and I/O adapters that are failing; record the bus number (BBBB), board (bb) and card information (Cc). Continue with the next step. 7. Perform the following: a. Return to the Dedicated Service Tools (DST) display. b. Display the Product Activity Log. c. Select All logs and search for an entry with the same bus, board, and card address information as the non-operational I/O processor. Do not include informational or statistical entries in your search. Use only entries that occurred during the last IPL. Did you find an entry for the SRC that sent you to this procedure? No: Continue with the next step. Yes: Ask your next level of support for assistance. This ends the procedure. 8. Did you find a B600 6944 SRC that occurred during the last IPL? Yes: Continue with the next step. No: A different SRC is associated with the non-operational I/O processor. Go to the Chapter 1, “Starting a Service Call,” on page 1 procedure and look up the new SRC to correct the problem. This ends the procedure. 9. Is there a B600 xxxx SRC that occurred during the last IPL other than the B600 6944 and informational SRCs? Yes: Use the other B600 xxxx SRC to determine the problem. Go to the Chapter 1, “Starting a Service Call,” on page 1 and look up the new SRC to correct the problem. This ends the procedure. No: You connected an I/O processor in the wrong card position. Use the system configuration list to compare the cards. When you have corrected the configuration, go to the start of this procedure to verify the bus repair. This ends the procedure. 10. If in a partition, use the hardware service manager function to print the system configuration list. Are there any configuration mismatches? No: Continue with the next step. Yes: Ask your next level of support for assistance. This ends the procedure.11. You have verified the repair of the system bus. 10 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. If for this service action only an expansion unit was powered off or only the concurrent maintenance function was used for an IOP or IOA, then continue with the next step. b. Otherwise, perform the following to return the system to the customer: 1) Power off the system or logical partition. See “Powering on and powering off” on page 588 for details. 2) Select the operating mode with which the customer was originally running. 3) Power on the system or logical partition.12. If the system has logical partitions and the entry point SRC was B600 xxxx, then check for related problems in other logical partitions that could have been caused by the failing part. This ends the procedure. Analyzing a RIO/HSL/12X or PCI bus reference code Use Word 7 of the reference code to determine the bus number, bus type, multi-adapter bridge number, multi-adapter bridge function number, and logical card number from the direct select address (DSA). Physical card slot labels and card positions for PCI buses are determined by using the DSA and the appropriate system unit or I/O unit card positions. See “Card positions” on page 12 for details. Table 1. RIO/HSL/12X and PCI reference code analysis Word of the reference code Control panel function Panel function characters Format Description 1 11 1–8 B600 uuuu or B700 uuuu uuuu = unit reference code (69xx) 1 – extended reference code information 11 9–16 iiii Frame ID of the failing resource 1 – extended reference code information 11 17–24 ffff Frame location 1 – extended reference code information 11 25–32 bbbb Board position 2 12 1–8 MIGVEP62 or MIGVEP63 See Hardware SRC formats. 3 12 9–16 cccc cccc Component reference code 4 12 17–24 pppp pppp Programming reference code 5 12 25–32 qqqq qqqq Program reference code high order qualifier 6 13 1–8 qqqq qqqq Program reference code low order qualifier 7 13 9–16 BBBB Ccbb See “DSA translation” on page 12 8 13 17–24 TTTT MMMM Type (TTTT) and model (MMMM) of the failing item (if not zero) 9 13 25–32 uuuu uuuu Unit address (if not zero) Chapter 2. Isolation Procedures 11 DSA translation The Direct Select Address (DSA) may be coded in word 7 of the reference code. This DSA is either a PCI system bus number or a RIO/HSL/12X loop number, depending on the type of error. With the following information, and the information in either the card position table (for PCI bus numbers) or the information in the loop-number-to-NIC-port table (for RIO/HSL/12X loop numbers), you can isolate a failing PCI bus or RIO/HSL/12X loop. Use the following instructions to translate the DSA: 1. Separate the DSA into the bus number, multi-adapter bridge number, and multi-adapter bridge function number. The DSA is of the form BBBB Ccxx, and separates into the following parts: v BBBB = bus number v C = multi-adapter bridge number v c = multi-adapter bridge function number v xx = not used2. Is the bus number less than 0684? Yes: The bus number is a PCI bus number in hexadecimal. Convert the number to decimal, and then continue with the next step. No: The bus number is a RIO/HSL/12X loop number in hexadecimal. Convert the number to decimal, and then go to step 4.3. Use one of the following guides to determine the type of system unit or expansion unit in which the bus is located: v If you are using a Hardware Management Console (HMC) interface, view the managed system’s properties on the HMC. v If you are using AIX or Linux, use the command line interface to determine the enclosure type. On the command line, type the following: lshwres -r io --rsubtype bus The result will be in the form: unit_phys_loc=Uxxxx.yyy.zzzzzzz,bus_id=a, ...... Find the bus ID ?a? entry that matches the decimal bus number you determined in step 2. Using the corresponding Uxxxx value, look up the unit model or enclosure type using the Unit Type and Locations table in Chapter 3, “Locating FRUs,” on page 335.4. Perform one of the following: v If you are working with a PCI bus number, refer to “Card positions” to search for the bus number, the multi-adapter bridge number, and the multi-adapter bridge function number that matches the system unit or I/O tower type where the bus is located. This ends the procedure. v If you are working with a RIO/HSL/12X loop number, refer to “Converting the loop number to RIO/HSL/12X port location labels” on page 14 to determine the starting ports for the RIO/HSL/12X loop with the failed link. This ends the procedure. Card positions This information specifies card positions for the various server models. Use the table for the model you are working on: Table 2 on page 13: Card positions for models 8203-E4A, 9407-M15, and 9408-M25 Table 3 on page 13: Card positions for 5095, 0595, and 7311-D20 expansion units Table 4 on page 13: Card positions for 7311-D11 and 5790 expansion units Table 5 on page 14: Card positions for 7314-G30 expansion unit 12 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 2. Card positions for models 8203-E4A, 9407-M15, and 9408-M25 Bus number in DSA (hexadecimal/decimal) Item designated by the DSA Position label and PCI data width 200/512 Embedded SAS controller -P1 201/513 Embedded USB controller -P1 202/514 PCI-X IOA card -P1-C4 203/515 PCI-X IOA card -P1-C5 204/516 PCI-E IOA card -P1-C1 205/517 PCI-E IOA card -P1-C2 206/518 Auxiliary cache adapter -P1-C9 207/519 PCI-E IOA card -P1-C3 Table 3. Card positions for 5095, 0595, and 7311-D20 expansion units Bus number Multi-adapter bridge number in DSA Multi-adapter bridge function number in DSA Item designated by the DSA Position label and PCI data width xxxx (assigned by LIC) 2 0 and 1 PCI IOP C01 xxxx (assigned by LIC) 2 2 and 3 PCI IOP or IOA card C02 xxxx (assigned by LIC) 2 4 and 5 PCI IOP or IOA card C03 xxxx (assigned by LIC) 2 6 and 7 IOA card C04 xxxx (assigned by LIC) 2 F Multi-adapter bridge CB1 xxxx (assigned by LIC) 2 Not used Not used C05 xxxx (assigned by LIC) 2 0 and 1 PCI IOP C06 xxxx (assigned by LIC) 2 2 and 3 PCI IOP or IOA card C07 xxxx (assigned by LIC) 2 6 and 7 IOA card C08 xxxx (assigned by LIC) 2 F Multi-adapter bridge CB1 Table 4. Card positions for 7311-D11 and 5790 expansion units Bus number in DSA (hexadecimal) Multi-adapter bridge number in DSA Multi-adapter bridge function number in DSA Item designated by the DSA Position label and PCI data width Lower PCI bus number 2 0 PCI IOA card -P1-C1 (64 bit) Lower PCI bus number 2 1 Lower PCI bus number 2 2 PCI IOA card -P1-C2 (64 bit) Lower PCI bus number 2 3 Lower PCI bus number 2 4 Unused Lower PCI bus number 2 5 Chapter 2. Isolation Procedures 13 Table 4. Card positions for 7311-D11 and 5790 expansion units (continued) Bus number in DSA (hexadecimal) Multi-adapter bridge number in DSA Multi-adapter bridge function number in DSA Item designated by the DSA Position label and PCI data width Lower PCI bus number 2 6 PCI IOA card -P1-C3 (64 bit) Lower PCI bus number 2 7 Lower PCI bus number 2 F Multi-adapter bridge -P1 Higher PCI bus number 2 0 PCI IOA card -P1-C4 (64 bit) Higher PCI bus number 2 1 Higher PCI bus number 2 2 PCI IOA card -P1-C5 (64 bit) Higher PCI bus number 2 3 Higher PCI bus number 2 4 Unused Higher PCI bus number 2 5 Higher PCI bus number 2 6 PCI IOA card -P1-C6 (64 bit) Higher PCI bus number 2 7 Higher PCI bus number 2 F Multi-adapter bridge -P1 Table 5. Card positions for 7314-G30 expansion unit Bus number Multi-adapter bridge number in DSA Multi-adapter bridge function number in DSA Item designated by the DSA Position label and PCI data width xxxx (assigned by LIC) 0 0 PCI-X IOA card -P1-C1 (64 bit) xxxx (assigned by LIC) 0 0 PCI-X IOA card -P1-C2(64 bit) xxxx (assigned by LIC) 0 0 PCI-X IOA card -P1-C3(64 bit) xxxx (assigned by LIC) 0 0 PCI-X IOA card -P1-C4(64 bit) xxxx (assigned by LIC) 0 0 PCI-X IOA card -P1-C5(64 bit) xxxx (assigned by LIC) 0 0 PCI-X IOA card -P1-C6(64 bit) Converting the loop number to RIO/HSL/12X port location labels Use this table to convert the RIO/HSL/12X loop number to port location labels. 14 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 6. Converting the loop number to port location labels for Model 8203-E4A, 9407-M15, and 9408-M25. Loop number (hex / dec) Model FRU position RIO/HSL/12X port labels on system unit or processor tower 0686 / 1670 8203-E4A, 9407-M15, and 9408-M25 with RIO -P1-C8 -P1-C8-T1 -P1-C8-T2 0688 / 1672 8203-E4A, 9407-M15, and 9408-M25 with RIO -P1-C6 -P1-C6-T1 -P1-C6-T2 0781 / 1921 8203-E4A, 9407-M15, and 9408-M25 with 12X -P1-C8 -P1-C8-T1 -P1-C8-T2 0782 / 1922 8203-E4A, 9407-M15, and 9408-M25 with 12X -P1-C6 -P1-C6-T1 -P1-C6-T2 Fiber optic cleaning procedures Use the fiber optic cleaning kit and the fiber optic cleaning procedures in ?SY27-2604 Fiber Optic Cleaning Procedures? for all fiber channel connections such as those used in optical high speed link (HSL) connections or fibre channel attached devices. See Chapter 8, “Part Information,” on page 691. This ends the procedure. Status indications for RIO/HSL A status indication for a RIO/HSL loop is identified in the reference code. Use the table below to determine if the status indication requires a service action. Record the rightmost four characters of word 4 of the reference code. These characters are the program return code (PRC), which indicates the RIO/HSL status. The leftmost four characters of word 7 indicate the RIO/HSL loop number (in hexadecimal format). Table 7. Status indicated by the PRC PRC Indicated status 3204 A RIO error was detected, indicating that a RIO link failed. 1. To diagnose the error read through the transport manager flight recorder. 2. Check the failing link. If the link shows not to have failed, check the devices connected to either end of the link. Chapter 2. Isolation Procedures 15 Table 7. Status indicated by the PRC (continued) PRC Indicated status 3205 During IPL, LIC determined that the loop was not complete. v This is expected if there are no I/O units on the loop. v This error can also occur when an I/O unit, shared I/O unit, or another system on the loop did not complete powering on by the time this system’s LIC checked the loop for completeness. As a result, you may see this error in the serviceable event view you are working with. v When you find the same reference code logged from the same IPL against the same resource with a PRC of 3206 or 3208, the problem no longer exists. This can happen because the error was recovered when RIO/HSL hardware came on line, was properly configured, or the diagnostic code determined that there was not a problem based on the combination of machine types, features, configuration, and topology. In this case, you may close the problem entry. v In i5/OS®, this error may also appear in the serviceable event view if any I/O units were removed from the loop without deleting the RIO/HSL I/O bridge resources of those units from Hardware Service Manager (HSM). The service procedure identified with the reference code that sent you here will help you determine if the loop is functioning correctly or if service is required. v This error may also be caused by a problem in a rack, frame, or unit connected to the RIO/HSL loop if the problem prevents the unit from powering on or being detected by LIC. Follow the service procedures for this reference code. When necessary, you may be directed to work on other reference codes before returning to this procedure. 3206 During normal operation an RIO/HSL loop recovered its redundant path. The loop is now complete. 3208 During normal operation an RIO/HSL I/O bridge recovered a failed link on the loop. 3209 See “Indicated status for PRC 3209” below. 3210 An RIO/HSL link switched to a slower speed. The link is designed to run at a faster speed based on the link’s hardware and LIC levels at both ends. If there is a FRU list in the serviceable event view, use it to complete the repair action. If not, perform the following sections of this procedure below: 1. “Determining the RIO/HSL port label” on page 17 2. “Replacing the link’s failing end point FRUs” on page 21 (replace only the ?From Frame ID? end point FRU) Indicated status for PRC 3209 Recoverable CRC (cyclical redundancy check) errors have occurred on the loop. This error requires service action. An RIO/HSL cable or connection must be exchanged. Exchange only the cable that appears in the FRU list of this reference code in this document. If you are working from the serviceable event view, then only the FRU(s) required for this error will be displayed. Choose from the following scenarios: v If there is a serviceable event view entry with a cable FRU listed with a loop number, enclosure/unit ID, and port label identified with the cable FRU, then perform one of the following: – If the cable is optical, it is possible the optical connections need cleaning. You can choose either to clean the cable connections at each end without exchanging the cable, or to exchange the cable. – If the cable is copper, examine the screws that hold the connector at the end of the cable identified in the reference code or the first cable location listed in the serviceable event view entry. It is possible to get CRC errors when the connector screws are not tight. You can choose to tighten the cable connector screws without exchanging the cable only if they are loose. Otherwise, you must exchange the cable. – If neither of these actions resolves the problem, replace the FRUs in the serviceable event view one at a time. This ends the procedure. 16 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v If the serviceable event view entry does not list any cable FRUs, then exchange the failing items listed in the serviceable event view entry by following the normal service procedures for those FRUs. This will be the case when the RIO/HSL connection is embedded. This ends the procedure. v If the serviceable event view entry has cable FRU(s) listed, but the loop number, enclosure/unit ID, and port label are not all listed with the cable FRU, go to “Determining the RIO/HSL port label” below.Determining the RIO/HSL port label Retrieve and record the following information: v Loop number. The loop number is displayed in hexadecimal format as the four leftmost digits of word 7 in the reference code. Convert the loop number to decimal format using “DSA translation” on page 12. Record both the hexadecimal and decimal formats of the loop number. If the loop is an internal loop on a 515, 520, 525, or 570 (loops 0680, 0683, 0686, 0689 hexadecimal or 1664, 1667, 1670, 1673 decimal), then record ?Internal?. v Frame ID. The frame ID is displayed in hexadecimal format as the four leftmost digits of word 5 of the reference code. You must convert the frame ID to decimal format to match what is displayed in user interfaces and problem views. Record both the hexadecimal and decimal formats of the frame ID. If the frame ID is not zero, then translate the frame ID into the correct machine type, model, and serial number by performing the following: 1. Log on to SST/DST. Attention: Do not IPL to DST. 2. Select Hardware Service Manager. 3. Select Packaging resources. 4. Selecting Display details for each unit listed until the frame ID matches the ID you are working with. Once you find the matching frame ID, record the unit’s machine type, model, and serial number.Note: The frame has an ID of 0000 at this point in the procedure. A frame ID of zero is indicating the NIC/RIO controller in a system unit. v Port number indicator. The port number indicator is the four rightmost digits of word 5 of the reference code.Use the frame ID and port number indicator in the following table to determine the RIO/HSL port label. If you are referred to “Converting the loop number to RIO/HSL/12X port location labels” on page 14, the failing item is in a system unit. You will need the RIO/HSL loop number to determine the FRU. The following tables provide hub, port, and location code information for each of the currently supported models. This data is used to determine which RIO loop to use for the B7006981. The asterisk in the ?Port indicator number? column indicates the leading port of the port pair. Table 8. Determining which RIO/HSL port label to use on a 515, 520, 525, and 550 Model number Loop number Hub Port indicator number Location code 515, 520, and 525 1668 0 0* ?Internal? 1 ?Internal? 1669 2* P1-T4 3 P1-T3 Chapter 2. Isolation Procedures 17 Table 8. Determining which RIO/HSL port label to use on a 515, 520, 525, and 550 (continued) Model number Loop number Hub Port indicator number Location code 550 1668 0 0* ?Internal? 1 ?Internal? 1669 2* P1-T12 3 P1-T11 1676 4 0 P1-C6-T2 (In an expansion card not-onboard) 1 P1-C6-T1 (In an expansion card not-onboard) Table 9. Determining which RIO/HSL port label to use on a 570 Model number Loop number CEC Hub Port indicator number Location code 570 1668 0 0 0* ?Internal? (Top CEC) 1 ?Internal? 1669 0 0 2* P1-T8 3 P1-T9 1676 0 4 0* P1-C7-T2 1 P1-C7-T1 570 1684 1 8 0* ?Internal? (Second down) 1 ?Internal? 1685 1 8 2* P1-T8 3 P1-T9 1692 1 12 0 P1-C7-T2 1 P1-C7-T1 570 1700 2 16 0* ?Internal? (Third down) 1 ?Internal? 1701 2 16 2* P1-T8 3 P1-T9 1708 2 20 0 P1-C7-T2 1 P1-C7-T1 570 1716 3 24 0* ?Internal? (Fourth down) 1 ?Internal? 1717 3 24 2* P1-T8 3 P1-T9 1724 3 28 0 P1-C7-T2 1 P1-C7-T1 18 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 10. Determining which RIO/HSL port label to use on a 575 Model number Loop number Hub Port indicator number Location code 575 1668 0 0* ?Internal? 1 ?Internal? 1669 2* P1-T7 3 P1-T8 Table 11. Determining which RIO/HSL port label to use on a 590 and 595 Model number Loop number Hub Port indicator number Location code 590 / 0595 1668 0 0* ?Internal? FSPA 1 ?Internal? 1669 0 2* P1-C1-T1 3 P1-C1-T2 1670 1 0* ?Internal? FSPB 1 ?Internal? 1671 1 2* P1-C3-T1 3 P1-C3-T2 1672 2 2* P1-C5-T1 3 P1-C5-T2 1674 3 2* P1-C6-T1 3 P1-C6-T2 1676 4 2* P1-C8-T1 3 P1-C8-T2 1678 5 2* P1-C9-T1 3 P1-C9-T2 1680 6 2* P1-C11-T1 3 P1-C11-T2 1682 7 2* P1-C13-T1 3 P1-C13-T2 Chapter 2. Isolation Procedures 19 Table 11. Determining which RIO/HSL port label to use on a 590 and 595 (continued) Model number Loop number Hub Port indicator number Location code 590 / 0595 1684 8 0* ?Internal? FSB (Second node position) 1 ?Internal? 1685 8 2* P1-C1-T1 3 P1-C1-T2 1686 9 0* ?Internal? FSPA 1 ?Internal? 1687 9 2* P2-C3-T1 3 P2-C3-T1 1688 10 2* P2-C5-T1 (Optional card, no HSL) 3 P2-C5-T2 1690 11 2* P2-C6-T1 (Required empty airflow) 3 P2-C6-T2 1692 12 2* P2-C8-T1 3 P2-C8-T2 1694 13 2* P2-C9-T1 3 P2-C9-T2 1696 14 2* P2-C11-T1 3 P2-C11-T2 1698 15 2* P2-C13-T1 3 P2-C13-T2 590 / 0595 1700 16 2* P3-C1-T1 (Third position) 3 P3-C1-T2 1702 17 2* P3-C3-T1 3 P3-C3-T2 1704 18 2* P3-C5-T1 3 P3-C5-T2 1706 19 2* P3-C6-T1 3 P3-C6-T2 1708 20 2* P3-C8-T1 3 P3-C8-T2 1710 21 2* P3-C9-T1 3 P3-C9-T2 1712 22 2* P3-C11-T1 3 P3-C11-T2 1714 23 2* P3-C13-T2 3 P3-C13-T2 20 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 11. Determining which RIO/HSL port label to use on a 590 and 595 (continued) Model number Loop number Hub Port indicator number Location code 590 / 0595 1716 24 2* P4-C1-T1 3 P4-C1-T2 1718 25 2* P4-C3-T1 3 P4-C3-T2 1720 26 2* P4-C5-T1 3 P4-C5-T2 1722 27 2* P4-C6-T1 3 P4-C6-T2 1724 28 2* P4-C8-T1 3 P4-C8-T2 1726 29 2* P4-C9-T1 3 P4-C9-T2 1728 30 2* P4-C131-T1 3 P4-C11-T2 1730 31 2* P4-C13-T1 3 P4-C13-T2 Note: For 570, exchange the FRU on the correct unit (primary or secondary) by matching the serial number (if available in the FRU list on the serviceable event view) or by matching the loop number to the correct unit using “Converting the loop number to RIO/HSL/12X port location labels” on page 14. Determining if the cable is the cause of the problem If there is a cable attached to the failing port: v If the cable is optical, it is possible the optical connections need cleaning. You can choose either to clean the cable connections at each end without exchanging the cable, or to exchange the cable. v If the cable is copper, examine the screws that hold the connector at the end of the cable identified in the reference code or the first cable location listed in the serviceable event view entry. It is possible to get CRC errors when the connector screws are not tight. You can choose to tighten the cable connector screws without exchanging the cable only if they are loose. Otherwise, you must exchange the cable. v If neither of these actions resolves the problem, replace the cable. Perform “RIOIP08” on page 34 to determine the frame ID and RIO/HSL port label of the other end of the cable you will be exchanging (if you do not already know). Does this correct the problem? Yes: This ends the procedure. No: Continue with replacing the FRUs that the cable is connected to, starting with the FRU listed first.Replacing the link’s failing end point FRUs Use the following table to determine the end point FRUs on the ?From Frame ID? port. Replace the FRUs one at a time. Chapter 2. Isolation Procedures 21 If replacing the end point FRUs for the ?From Frame ID? port does not resolve the error, use the table below to replace the end point FRUs at the other end of the cable. Perform “RIOIP08” on page 34 to determine the system or I/O unit frame ID and RIO/HSL port label of the other end of the cable (if you do not already know). Table 12. End point FRUs Model Loop number (hexadecimal / decimal) End point FRUs 515, 520, 525, 570 0680 / 1664 The loop is embedded in the system unit planar. Exchange the system unit planar. 515, 520, 525, 570 0681 / 1665 For all port number values, the loop’s NIC/RIO controller is embedded in the system unit planar. Exchange the system unit planar. 570 0682 / 1666 For all port number values, the loop’s NIC/RIO controller is in the NIC/RIO controller card. Exchange the NIC/RIO controller. 570 0683 / 16670686 / 16700689 / 1673 The loop is embedded in the system unit planar. Exchange the system unit planar. Note: Exchange the FRU on the correct unit (primary or secondary) by matching the serial number (if available in the FRU list on the serviceable event view) or by matching the loop number to the correct unit using “Converting the loop number to RIO/HSL/12X port location labels” on page 14. This ends the procedure. RIO/HSL/12X connection problems The failing component is the RIO/HSL/12X connection When a RIO/HSL cable is disconnected, it may result in a lost connection between the units even after the cable is reconnected. Attention: To fix this problem, you need to cycle power on the unit with the locked RIO/HSL connection (see “Powering on and powering off an I/O expansion unit” on page 24 below). This problem does not occur with 12X cables.Note: If question marks (???) appear at the end of the location code, then the port could not be determined. Use the location code associated with the other end of the cable. If question marks appear for both port locations, use the isolation procedures suggested in the reference code table for this SRC. 1. Choose from the following: v If you were sent to this procedure from another symbolic FRU, locate that FRU in Table 13 on page 23 to see the description of the RIO/HSL/12X FRU. Then continue with the next step for more information about the FRU. v If you are working with this symbolic FRU in the FRU list, the failing component is an RIO/HSL/12X connection. Diagnostic code could not determine what kind of hardware was involved. The RIO/HSL/12X hardware can be any of the following: – Cable – Embedded RIO/HSL/12X link in a FRU (a planar, for example) – RIO/HSL/12X interposer cardThe RIO/HSL/12X link is on or between the other FRU or FRUs listed for the reference code. Continue with the next step. 22 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 13. RIO/HSL symbolic FRUs RIO/HSL FRU Description HSL_OPT This is an optical RIO/HSL cable. When exchanging optical RIO/HSL cables, use the optical cleaning kit and procedures. See “Fiber optic cleaning procedures” on page 15 for details. If an interposer card is called for, be sure the interposer type matches the cable type (optical or copper). HSL1 There is a standard copper RIO/HSL cable at both ends (yellow connectors). HSL1_UN There is a standard copper RIO/HSL cable (yellow connector) at the detecting end, and an unknown connector type at the other end. HSL2 There is a copper HSL2/RIO-G cable at both ends (black connectors), but the length of the cable could not be sensed. HSL2_xx There is a copper HSL2/RIO-G cable at both ends (black connectors). Use the xx value to determine the cable length from this list: v HSL2_01 = 1 meter HSL2 cable v HSL2_03 = 3 meter HSL2 cable v HSL2_08 = 8 meter HSL2 cable v HSL2_10 = 10 meter HSL2 cable v HSL2_15 = 15 meter HSL2 cable v HSL2_17 = 1.75 meter HSL2 cable v HSL2_25 = 2.5 meter HSL2 cable HSLH There is a hybrid RIO/HSL to HSL2/RIO-G cable (yellow connector at one end and black connector at the other end), but the length of the cable could not be sensed. HSL_IB This is an 12X cable (green connectors). HSLH_xx There is a hybrid RIO/HSL to HSL2/RIO-G cable (yellow connector at one end and black connector at the other end). Use the xx value to determine the cable length from this list: v HSLH_06 = 6 meter HSL/RIO to HSL2/RIO-G cable v HSLH_15 = 15 meter HSL/RIO to HSL2/RIO-G cable HSL_I There is a RIO/HSL interposer card for HSL/RIO cables (yellow connectors), HSL2/RIO-G cables (black connectors), or optical cables. The interposer card type could not be sensed. HSL_I2 HSL_I3 HSL_I4 There is a RIO/HSL interposer card for HSL/RIO cables (yellow connectors), HSL2/RIO-G cables (black connectors), or optical cables. Use the x value to determine the interposer card type from this list: v HSL_I2 = Copper HSL2/RIO-G interposer card for HSL2/RIO-G cable connectors (black) in the system unit backplane position C08 or C09 of machine type 9406 model 825. v HSL_I3 = Optical HSL/RIO interposer card for optical RIO/HSL cables in system unit backplane position C08 or C09 of machine type 9406 model 825. v HSL_I4 = HSL2/RIO-G interposer/riser card on a 7040-61D I/O unit in location -P1-Riser or -P2-(Riser). 2. Choose from the following options: v If you are working from the serviceable event view, the location code or FRU description in the view will help determine the actual RIO/HSL/12X hardware to exchange. Continue with the next step. v If you are not working from the serviceable event view, or the view does not have a location code or better FRU description, then determine the location code of other FRUs in the FRU list for the error. Then continue with the next step.3. Use the location code and the information from the preceding table to determine the machine type, model, or unit feature involved in the error. If necessary, use the location code for other FRUs listed in Chapter 2. Isolation Procedures 23 the FRU list for this error to determine the failing RIO/HSL/12X connection and any related FRUs that are part of that connection. Use Chapter 3, “Locating FRUs,” on page 335 to find links to the locations information. Refer to the Parts Information chapter of the Service Guide for the failing machine type/model for part number information. Note: If you exchange all of the FRUs in the FRU list, but the problem still exists, contact your next level of support. You may be directed to exchange additional RIO/HSL/12X FRUs. “Additional RIO/HSL FRUs” has more information about RIO/HSL/12X FRUs on specific models and I/O units.Additional RIO/HSL FRUs The following are RIO/HSL/12X FRUs by model and/or unit type. For the model or unit type you are working on, there may be additional RIO/HSL/12X FRUs which were not listed in the FRU list of the error. Under the direction of your next level of support, you can try exchanging the additional FRUs. 1. In the following table, locate the unit type(s) on which you are working. Exchange the indicated RIO/HSL/12X loop connections (external or embedded) or RIO/HSL/12X interposer card. 2. Did the exchange correct the error? v Yes: The FRU you just replaced was the failing item. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. v No: Call your next level of support. This ends the procedure. Table 14. RIO/HSL/12X cable or connections. To find locations information, refer to Chapter 3, “Locating FRUs,” on page 335 or the service guide for the I/O enclosure. System model or unit type Description Location code 5094, 5096, 5294, 5296 HSL (optical) I/O bridge adapter Un-CB1-C10 5094, 5096, 5294, 5296 HSL I/O bridge card Un-CB1-C10 5094, 5096, 5294, 5296 HSL-2 I/O bridge adapter Un-CB1-C10 7311-D11, 5790 RIO/HSL adapter Un-P1-C7 0595, 5095 HSL (optical) I/O bridge adapter Un-P1-C5 7311-D20, 0595, 5095 RIO/HSL I/O bridge adapter Un-P1-C5 7311-D20, 0595, 5095 RIO/HSL-2 I/O bridge adapter Un-P1-C5 7314-G30, 5796 Dual-port 12X adapter Un-P1-C7 1519 Integrated xSeries Adapter Powering on and powering off an I/O expansion unit Follow these steps if you disconnected an RIO/HSL cable from an I/O expansion unit, and the RIO/HSL link on the unit did not recover when you reconnected the cable: 1. From the Hardware Service Manager screen, select Packaging hardware resources. 2. Select the frame that has just accidentally lost its RIO/HSL connection and select Concurrent maintenance. Then press Enter. 3. Select Power off domain. 4. After reconnecting the unit into the RIO/HSL loop, select Power on domain. 24 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) HSL loop configuration and status form Use this HSL loop configuration and status form to record the status of the HSL ports in the loop. Note: You may copy this form as necessary. HSL loop configuration and status worksheet for system _______________, Loop number ___________ Table 15. HSL loop configuration and status form HSL resource information Leading port information Trailing port information Resource type Resource name Frame ID Port number (or internal) Link status (operational or failed) Port number (or internal) Link status (operational or failed) Installed features in a PCI bridge set form Use this form to record the PCI bridge set card positions, and multi-adapter bridge function numbers. Note: You might find it helpful to copy this form as necessary. Table 16. Installed features in a PCI bridge set PCI bridge set card positions Multi-adapter bridge function number Record if IOP or IOA is installed. 0 1 2 3 4 5 6 7 Chapter 2. Isolation Procedures 25 RIO/HSL/12X link status diagnosis form Use this form to record the status of the RIO/HSL/12X links. Column A (starting status) Column B Column C (column A is failed and column B is failed) Column D Column E (column B is failed and column D is failed) Resource with failing link Port info Port status Port status Port status First Frame ID ____ Card Position ____ Port # ____ Port _0 (or internal) ____ Port _0 (or internal) ____ Port _0 (or internal) ____ Port _1 (or internal) ____ Port _1 (or internal) ____ Port _1 (or internal) ____ Second Frame ID ____ Card Position ____ Port # ____ Port _0 (or internal) ____ Port _0 (or internal) ____ Port _0 (or internal) ____ Port _1 (or internal) ____ Port _1 (or internal) ____ Port _1 (or internal) ____ 26 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) HSL/RIO 12X Isolation Procedures Use RIO/HSL/12X isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “CONSL01”Use this procedure to exchange the I/O processor (IOP) for the system or partition console of i5/OS. “RIOIP01” on page 28Use this procedure to isolate a failure in a RIO/HSL/12X loop using i5/OS service tools. “RIOIP06” on page 34Use HSM to examine the RIO/HSL/12X Loop to determine if other systems are connected to the loop. “RIOIP08” on page 34Starting with the unit ID and RIO/HSL/12X port for one end of an RIO/HSL/12X cable, determine the unit ID and port location for the other end. “RIOIP09” on page 35This procedure offers a description and service action for RIO/HSL/12X reference code B600 6982. “RIOIP10” on page 36Use this procedure to determine if the RIO/HSL/12X loop is complete (with both primary and redundant paths functioning for each unit on the loop). “RIOIP11” on page 36Use this procedure to recover from a B7xx 6982 RIO/HSL/12X failure. “RIOIP12” on page 38Use this procedure to recover from a B7xx 6985 RIO/HSL/12X failure. “RIOIP56” on page 39Use this procedure to restore the RIO 12X link to optimal bandwidth.CONSL01 Use this procedure to exchange the I/O processor (IOP) for the system or partition console of i5/OS. 1. Is the system managed by an HMC? No: Go to step 6 on page 28. Yes: The HMC will be required for this procedure. Move to the HMC and continue with the next step only if the HMC is functional.2. Can the customer power off the partition at this time? Yes: Power off the partition from the operating system console or the HMC. Then, continue with the next step. No: The IOP controlling the partition’s console may be controlling other critical resources. The partition must be powered off to exchange this IOP. Perform this procedure when the customer is able to power off the partition. Then, continue with the next step.3. Perform the following to determine the unit machine type, model, and serial number where the console IOP is located and the location of the console IOP: a. From the Navigation Area of the HMC, select the Management Environment. b. Select and expand the HMC environment for this HMC. c. Select and expand Server and Partition. d. Select Server Management. e. Double-click the i5/OS partition you are working on. f. Select the Settings tab. g. Record the location of the load source IOP. The unit type, model, and serial number are the first three parts of the location code and are separated by periods. h. Continue with the next step. Chapter 2. Isolation Procedures 27 4. Record the frame type or feature by using the frame ID and system configuration listing or by locating the frame with that ID and recording the frame type or feature. 5. Perform the following to exchange the IOP in that card position: a. Go to Chapter 3, “Locating FRUs,” on page 335 and select the unit type and model that you recorded. b. Locate the card position in the FRU locations table and use the exchange procedure that is identified. c. Power on the partition. This ends the procedure.6. The problem is in the i5/OS partition of a system with one or more partitions that is not managed by an HMC. Use the table below to determine the location of the load source IOP and the appropriate exchange procedure. Model Load source IOP location Load source IOA location Link to locations information MMA -P1-C1 embedded SCSI in -P1 Chapter 3, “Locating FRUs,” on page 335 This ends the procedure.RIOIP01 Use this procedure to isolate a failure in a RIO/HSL/12X loop using i5/OS service tools. Follow the steps in the “Main task” and you will be directed to the proper subtasks. Note: During this procedure, you will be disconnecting and reconnecting cables. If errors concerning missing resources (such as disk units and RIO/HSL/12X failures) occur, ignore them. Missing resources will report in again when the loop reinitializes. Main task 1. Were you sent here from a B600 xxxx reference code? No: Continue with the next step. Yes: Use the serviceable event view and the system service documentation to search for a B700 xxxx reference code with the same last four characters reported at approximately the same time. If you find one, perform service on that reference code first, and when you close that problem, close this one as well. If you do not find one, continue with the next step.2. Before powering down any system unit or expansion unit, work with the customer to end all subsystems in all of the partitions using each partition’s console. 3. From the partition control panel, IPL the system or partition to Dedicated Service Tools (DST). Attention: Do not use function 21! 4. Are all system and expansion units on the loop powered on? Yes: Go to step 6. No: Continue with the next step.5. Perform the following: a. Power on all system and expansion units on the loop. If a frame cannot be powered on, perform the “Cannot power on unit” on page 33 subtask below, and then continue with step 6. b. Was the RIO/HSL/12X link error cleared up when the frames were powered back on? v No: Continue with the next step. v Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.6. Perform the following: 28 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. Access the Service Action Log (SAL) entry for this error; the field replaceable units (FRUs) should be listed there. Look for part numbers and descriptions for the FRUs containing the RIO/HSL/12X port for two frames. There should also be a FRU for the cable between them. The locations information for the FRUs is the location of the failed ports on the failed link. b. Record the loop number from the SAL (if it is displayed there in one of the FRU descriptions) or from the first four characters of word 7 of the reference code. Go to “Converting the loop number to RIO/HSL/12X port location labels” on page 14 to determine which RIO/HSL/12X cables on the system you are working with. Is this information in the SAL? Yes: Continue with the next step. No: Perform “Manually detecting the failed link” on page 33 below, and then continue with the next step of the main task.7. Is the cable connecting the failed ports an optical cable? No: Go to step 9. Yes: Continue with the next step.8. Perform the following: a. Clean the RIO/HSL/12X cable connectors and ports using the fiber optic cleaning kit and the fiber optic cleaning procedures in ?SY27-2604 Fiber Optic Cleaning Procedures?. b. To determine if cleaning the connectors and ports solved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.9. There are now three cases to consider. Continue with the appropriate subtask of this procedure: v “The ports on both ends of the failed link are in different system units on the loop.” v “The port on one end of the failed link is in a system unit and the port on the other end is in an I/O unit” on page 30. v “The ports on both ends of the failed link are in an I/O unit” on page 31.The ports on both ends of the failed link are in different system units on the loop 1. There may be failed hardware that will report a different error on the other system units. Perform the following: a. Resolve any other RIO/HSL/12X problems in the serviceable event view on the other system units. b. Perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.2. Is the cable an optical RIO/HSL/12X cable? Yes: Go to step 4 on page 30. No: Continue with the next step.3. Perform the following: a. Verify that the cables are connected securely. For any cable that was loose, disconnect the cable at that end, wait 30 seconds, and reconnect the cable securely. If there are thumbscrews, you must tighten both thumb screws within 30 seconds of when the cable makes contact with the port. Chapter 2. Isolation Procedures 29 b. If you disconnected and reconnected the cable at either end, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.4. Replace the cable between the two system unit ports on the failed link. To determine if replacing the cable resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.5. Exchange the FRU with the RIO/HSL/12X port in one of the system units. If you are working with a serviceable event view and the RIO/HSL/12X FRUs are listed, exchange the FRU corresponding to the first RIO/HSL/12X cable port listed. Otherwise, exchange the FRU that is quickest and easiest to replace). To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.6. Exchange the remaining FRU with the RIO/HSL/12X port on the other system unit. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.7. Use the procedure “RIO/HSL/12X connection problems” on page 22 to determine if there are any additional RIO/HSL/12X cable-related FRUs, such as interposer cards and internal ribbon cables, that may be on either unit. Did you exchange any additional RIO/HSL/12X FRUs? No: Call your next level of support for further instruction. This ends the procedure. Yes: Continue with the next step.8. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support for further instruction. This ends the procedure.The port on one end of the failed link is in a system unit and the port on the other end is in an I/O unit 1. Switch the two RIO/HSL/12X cables on the I/O unit with the failed port, so that each cable is connected to the port where the other cable was previously connected. Disconnect both cables at the same time, wait 30 seconds, and then reconnect the cables one at a time. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 2. Refresh the port status for the first failing resource by performing “Refresh the port status” on page 33 below. Then continue with the next step. 3. Is the port on the system unit that was failed now working? 30 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Continue with the next step. Yes: Exchange the RIO/HSL/12X bridge FRU in the I/O unit. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.4. Switch the cables back to their original positions by disconnecting both cables at the same time, waiting 30 seconds, and then reconnecting the cables one at a time. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 5. Exchange the cable between the two ports on the failed link. To determine if replacing the cable resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.6. Use the procedure “RIO/HSL/12X connection problems” on page 22 to determine if there are any additional RIO/HSL/12X cable-related FRUs, such as interposer cards and internal ribbon cables, that may be on either unit. Did you exchange any additional RIO/HSL/12X FRUs? No: Call your next level of support for further instruction. This ends the procedure. Yes: Continue with the next step.7. Exchange the RIO/HSL/12X FRU that contains the failing port in the system unit. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.8. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support for further instruction. This ends the procedure.The ports on both ends of the failed link are in an I/O unit 1. Switch the two RIO/HSL/12X cables on the first (or ?From?) cable’s I/O unit with the failed port so that each cable is connected to the port where the other cable was previously connected. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 2. Refresh the port status for the first failing resource by performing “Refresh the port status” on page 33 below. Then continue with the next step. 3. Is the port on the I/O unit on which you did not switch the cables now working? No: Go to step 5 on page 32 Yes: Exchange the RIO/HSL/12X I/O bridge card in the I/O unit where you just switched the cables. The continue with the next step. 4. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 and return to this point. Did the ports you were working on have a status of ?failed?? Chapter 2. Isolation Procedures 31 No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step. 5. Switch the cables back to their original positions. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 6. Switch the two RIO/HSL/12X cables on the second (or ?To?) I/O unit with the failed port so that each cable is connected to the port where the other cable was previously connected. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 7. Refresh the port status for the first failing resource by performing “Refresh the port status” on page 33. Then continue with the next step. 8. Is the port on the I/O unit on which you did not switch cables now working? No: Go to step 10. Yes: Exchange the RIO/HSL/12X I/O bridge card in the I/O unit where you just switched the cables. Then continue with the next step. 9. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.10. Switch the cables back to their original positions. Attention: For copper cables with thumbscrews, you must fully connect the cable and tighten the connector’s screws within 30 seconds of when the cable makes contact with the port. Otherwise, the link will fail and you must disconnect and reconnect again. Also, if the connector screws are not tightened, errors will occur on the link and it will fail. 11. Exchange the RIO/HSL/12X cable between the two ports on the failed link. To determine if replacing the cable resolved the problem, perform “Manually detecting the failed link” on page 33, then return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Continue with the next step.12. Use the procedure “RIO/HSL/12X connection problems” on page 22 to determine if there are any additional RIO/HSL/12X cable-related FRUs, such as interposer cards and internal ribbon cables, that may be on either unit. Did you exchange any additional RIO/HSL/12X FRUs? No: Call your next level of support for further instruction. This ends the procedure. Yes: Continue with the next step.13. To determine if replacing the FRU resolved the problem, perform “Manually detecting the failed link” on page 33 below and return to this point. Did the ports you were working on have a status of ?failed?? No: Then the problem is fixed, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support for further instruction. This ends the procedure. 32 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Cannot power on unit 1. Work the errors related to powering on the unit(s), and then continue with the next step. If a unit still cannot be powered on, re-cable the RIO/HSL/12X loop without the I/O units and system units that cannot be powered on, allowing the loop to be complete (no disconnected cables). 2. To determine if re-cabling the loop resolved the problem, perform “Manually detecting the failed link” below and return to this point.Manually detecting the failed link 1. Get the loop number from the reference code if you do not already have it. The loop number is a hexadecimal number in word 7 of the reference code. v If you are working from the Product Activity Log (PAL®), then the loop number is the 4 leftmost characters of the DSA in word 7 (BBBB). The loop number is in hexadecimal. Convert the hexadecimal loop number to decimal format before continuing with this procedure. v If you are working from the Service Action Log (SAL), the loop number should be displayed in the FRU description area in decimal format. 2. Sign on to SST or DST (if you have not already done so). Select Start a service tool ? Hardware service manager ? Logical hardware resources ? High-speed link (HSL) resources. 3. Select Resources associated with loop for the RIO/HSL/12X loop with the failed link. The RIO/HSL/12X bridges will be displayed under the loop. 4. Select Display detail for the loop with the failed link. 5. Record the name of the NIC/RIO controller resource you are starting from on the display. You will need to know this name to determine if you have followed the loop around and back to this resource. 6. If the leading port does not have a status of ?failed?, select Follow leading port until a leading port with a ?failed? status is found, or the display is showing information for the starting NIC/RIO resource you recorded. Did you find a leading port with a status of ?failed?? No: The loop is functioning properly. Return to the subtask that sent you here. Yes: Record the resource name at the leading port with a ?failed? status, and the type, model, and serial number for the resource with the failed status. Continue with the next step. 7. Select Follow leading port one more time and note all the information for the resource name with a failed trailing port. 8. Select Display system information and note the power controlling system’s type, model, and serial number (and name, if available). This info may be needed for FRU replacement at a later time. 9. Select Cancel twice to return to the previous screen. 10. Go to each resource name (found above) and select Associated packaging resource(s). This gives the description of the failing item and the unit ID. 11. Select Display detail to find the part number and location associated with the possible failing item. Then return to the step that sent you here.Refresh the port status 1. Wait one minute, and then sign on to SST or DST (if you have not already done so). 2. Select Start a service tool ? Hardware service manager ? Logical hardware resources ? High-speed link (HSL) resources. 3. Move the cursor to the RIO/HSL/12X loop that you want to examine and select Display detail ? Include non-reporting resources. 4. If the display is not already showing the ports for one of the units you are working on, then select Follow leading port. Continue to select Follow leading port until the display is showing the ports for one of the units you are working on. Note the status of the port you were working on. Select Follow leading port until the display is showing the ports for the other unit you are working on, and note the status of the port you were working on. Chapter 2. Isolation Procedures 33 5. Select Cancel ? Refresh ? Display detail for the failing resource you are checking. Note any change in the status for the resource. Then return to the step that sent you here.RIOIP06 Use HSM to examine the RIO/HSL/12X Loop to determine if other systems are connected to the loop. 1. Sign on to SST or DST (if you have not already done so). 2. Select Start a service tool ? Hardware service manager ? Logical hardware resources ? High-speed link (HSL) resources. 3. Move the cursor to the RIO/HSL/12X loop that you want to examine, and select Resources associated with loop. 4. Search for Remote RIO/HSL/12X NICs on the loop. Are there any Remote RIO/HSL/12X NICs on the loop? Yes: You have determined that there are other systems connected to this loop. This ends the procedure. No: You have determined that there are not any other systems connected to this loop. This ends the procedure.RIOIP08 Starting with the unit ID and RIO/HSL/12X port for one end of an RIO/HSL/12X cable, determine the unit ID and port location for the other end. 1. Sign on to SST or to DST if you have not already done so. 2. Select Start a Service Tool ? Hardware Service Manager ? Logical Hardware Resources ? High Speed Link (HSL) Resources. 3. Move the cursor to the RIO/HSL/12X loop that you want to examine, and select Resources associated with loop ? Include non-reporting resources. The display that appears shows the loop resource and all the ?HSL I/O Bridge? and all the ?Remote HSL NIC? resources connected to the loop. 4. Perform the following for each of the HSL I/O Bridge resources listed until you are directed to do otherwise. a. Move the cursor to the HSL I/O Bridge resource and select Associated packaging resource(s). b. Compare the unit ID on the display with the unit ID (in hexadecimal format) that you started with. Are the unit IDs the same? Yes: Continue with the next step. No: Select Cancel to return to the Logical Hardware Associated with HSL Loops display. Repeat this for each HSL I/O Bridge under the loop, until you are directed to do otherwise.5. Perform the following: a. Select Associated logical resource(s). b. Move the cursor to the HSL I/O Bridge resource and select Display detail. c. Examine the Leading port and Trailing port information. Search the display for the RIO/HSL/12X port location label that you recorded prior to starting this procedure. If the label is part of the information for the Leading port, then select Follow leading port. If the label is part of the information for the Trailing port, then select Follow trailing port. d. Perform the step below that matches the function you selected in the previous step: v If you selected Follow leading port, then examine the display for the Trailing port information. Record, on the worksheet that you are using, the RIO/HSL/12X port location label shown on the ?Trailing port from previous resource? line. Record this information as the ?To HSL Port Label?. v If you selected Follow trailing port, then examine the display for the Leading port information. Record, on the worksheet that you are using, the RIO/HSL/12X port location label on the ?Leading port to next resource? line. Record this information as the?To HSL Port Label?. 34 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) e. Record the ?Link type? (Copper or Optical) on the worksheet that you are using in the field describing the cable type. f. Select Cancel ? Cancel ? Cancel to return to the Logical Hardware Associated With HSL Loops? display. g. Record the resource name on the display. h. Move the cursor to the resource with the resource name you recorded in step 5g. i. Select Associated packaging resource(s). j. Record the unit ID. k. Return to the procedure that sent you here. This ends the procedure.RIOIP09 This procedure offers a description and service action for RIO/HSL/12X reference code B600 6982. Note: A fiber optic cleaning kit may be required for optical RIO/HSL/12X connections.Note: This reference code can occur on an RIO/HSL/12X loop when an I/O expansion unit on the loop is powered off for a concurrent maintenance action. 1. Is the reference code in the Service Action Log (SAL) or serviceable event view you are using? Yes: There is a connection failure on an RIO/HSL/12X link. A B600 6984 reference code may also appear in the Product Activity Log (PAL) or error log view you are using. Both reference codes are reporting the same problem. Continue with the next step. No: The reference code is only informational, and requires no service action. This ends the procedure.2. Multiple B600 6982 errors may occur due to retry and recovery activity. Is there a B600 6985 with ?xxxx 3206? in word 4 logged after all B600 6982 errors for the same RIO/HSL/12X loop in the PAL? Yes: The recovery efforts were successful. Close all of the B600 6982 entries for the same loop in the SAL. No service is required. This ends the procedure. No: Continue with the next step.3. Is there a B600 6987 reference code in the SAL, or serviceable event view you are using, logged at about the same time? Yes: Close this problem and work the B600 6987. This ends the procedure. No: Continue with the next step.4. Is there a B600 6981 reference code in the SAL, or serviceable event view you are using, logged at approximately the same time? Yes: Go to step 9 on page 36. No: Continue with the next step.5. Perform “RIOIP06” on page 34 to determine if this loop connects to any other systems and then return here. Note: The loop number can be found in the SAL in the description for the HSL_LNK FRU. Is this loop connected to other systems? Yes: Continue with the next step. No: Go to step 9 on page 36.6. Check for RIO/HSL/12X failures in the serviceable event views on the other systems. RIO/HSL/12X failures are indicated by entries with RIO/HSL/12X I/O bridge and Network Interface Controller (NIC) resources. Ignore B600 6982 and B600 6984 entries. Are there RIO/HSL/12X failures on other systems? Yes: Continue with the next step. No: Go to step 9 on page 36. Chapter 2. Isolation Procedures 35 7. Repair the problems on the other systems and return to this step. After making repairs on the other systems check the PAL of this system. Is there a B600 6985 reference code, with this loop’s resource name, that was logged after the repairs you made on the other systems? Yes: Continue with the next step. No: Go to step 9.8. For the B600 6985 reference code you found, use “Status indications for RIO/HSL” on page 15 to determine if the loop is now complete. Is the loop complete? Yes: The problem has been resolved. Use “RIOIP01” on page 28 to verify that the loop is now working properly. This ends the procedure. No: Continue with the next step.9. The FRU list displayed in the SAL, or serviceable event view you are using, may be different from the failing item list given here. Use the FRU list in the serviceable event view if it is available. Does the reference code appear in the serviceable event view with HSL_LNK or HSLxxxx listed as a symbolic FRU? Yes: Perform “RIOIP01” on page 28. This ends the procedure. No: Exchange the FRUs in the serviceable event view according to their part action codes. This ends the procedure.RIOIP10 Use this procedure to determine if the RIO/HSL/12X loop is complete (with both primary and redundant paths functioning for each unit on the loop). 1. Is the system managed by a Hardware Management Console? Yes: Continue with the next step No: Go to step 3.2. The RIO/HSL/12X loop number found in the first 4 characters of word 7 of the SRC that sent you here is in hexadecimal. Convert this number to decimal. Locate the decimal loop number’s information in the RIO/HSL/12X Topology screen on the HMC. Are all links in this loop operational? Yes: The RIO/HSL/12X loop recovered. Return to the procedure that sent you here. This ends the procedure. No: The RIO/HSL/12X loop did not recover. Return to the procedure that sent you here. This ends the procedure.3. Search in Advanced System Management Interface (ASMI) for a B700 6985 informational SRC logged after the RIO/HSL/12X SRC you are working on. Did you find a B7000 6985 SRC? Yes: Continue with the next step. No: The status of the RIO/HSL/12X loop is unknown. Return to the procedure that sent you here. This ends the procedure.4. Compare the first half of word 7 in the B700 6985 informational log to the value that caused you to be sent to this procedure. Are the two values the same? Yes: The RIO/HSL/12X loop recovered. Return to the procedure that sent you here. This ends the procedure. No: The status of the RIO/HSL/12X loop is unknown. Return to the procedure that sent you here. This ends the procedure.RIOIP11 Use this procedure to recover from a B7xx 6982 RIO/HSL/12X failure. 1. Record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10.” 2. Did the RIO/HSL/12X loop recover? 36 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Continue with the next step Yes: Close the problem. This ends the procedure. 3. Work with the customer to determine if a processor enclosure or I/O enclosure on the RIO/HSL/12X loop has powered down normally. 4. Was a processor enclosure or I/O enclosure on the loop powered down normally? No: Go to 6. Yes: The loop remains in a failed state until all processor enclosures and I/O enclosures on the loop are powered on and functioning. Work with the customer to determine if all the powered down enclosures on the loop can be powered on. After all enclosures on the loop are powered on, continue with the next step. 5. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure. 6. Search for a serviceable event with a 1xxx xxxx SRC logged at approximately the same time and with one or more FRUs in the same unit as those in the FRU list for the SRC you are currently working. 7. Did you find a serviceable event with a 1xxx xxxx SRC? No: Go to 9 Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 8. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure. 9. In the serviceable event view, search for a B700 6981 error logged at approximately the same time and on the same RIO/HSL/12X loop (the first four characters of word 7 are the same). 10. Did you find a serviceable event with a B700 6981 SRC at approximately the same time and on the same RIO/HSL/12X loop? No: Go to 12. Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36.11. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure.12. Using the FRU list that you are working with for this SRC, exchange one FRU at a time. After you exchange each FRU, determine if the loop has recovered.To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. After the loop recovers or after you have exchanged all the FRUs, continue with the next step. To replace a FRU, refer to Chapter 3, “Locating FRUs,” on page 335. 13. Did the RIO/HSL/12X loop recover? No: Contact your next level of support. This ends the procedure. Yes: Close the problem. This ends the procedure. Chapter 2. Isolation Procedures 37 RIOIP12 Use this procedure to recover from a B7xx 6985 RIO/HSL/12X failure. 1. Work with the customer to determine if a processor enclosure or I/O enclosure on the RIO loop has powered down normally. 2. Was a processor enclosure or I/O enclosure on the loop powered down normally? No: Go to 4. Yes: The loop remains in a failed state until all processor enclosures and I/O enclosures on the loop are powered on and functioning. Work with the customer to determine if all the powered down enclosures on the loop can be powered on. After all processor enclosures and I/O enclosures on the loop are powered on, check to determine if the RIO/HSL/12X loop is complete. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 3. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure. 4. In the serviceable event view, search for a serviceable event with a 1xxx xxxx SRC logged at approximately the same time and with one or more FRUs in the same enclosure as those in the FRU list for the SRC you were currently working with. 5. Did you find a serviceable event with a 1xxx xxxx SRC? No: Go to 7 Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 6. Did the RIO/HSL/12X loop recover? No:: Continue with the next step. Yes:: Close the problem. This ends the procedure. 7. In the serviceable event view, search for a B700 6981 or B700 6986 error logged at approximately the same time and on the same RIO/HSL/12X loop (the first four characters of word 7 are the same). 8. Did you find a B700 6981 or a B700 6986 error logged at approximately the same time and on the same RIO/HSL/12X loop?. No: Go to step 10. Yes: Work to resolve the problem. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working on the problem, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. 9. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure.10. Search for a processor enclosure or I/O enclosure on the RIO/HSL/12X loop that has not powered up as expected. 11. Did you find a processor enclosure or I/O enclosure on the RIO/HSL/12X loop that has not powered up as expected? No: Go to 13 on page 39. Yes: Go to “Cannot power on SPCN-controlled I/O expansion unit” on page 136 and work that power symptom. Use the first half of word 7 to determine the loop number for later use. After you have repaired that error, the RIO/HSL/12X loop may be recovered. After you finish working 38 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) that power symptom, return to this procedure and check to determine if correcting that problem also corrected the RIO/HSL/12X error. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36.12. Did the RIO/HSL/12X loop recover? No: Continue with the next step. Yes: Close the problem. This ends the procedure.13. Using the FRU list that you are working with for this SRC, exchange one FRU at a time. After you exchange each FRU, determine if the loop has recovered. To determine if the RIO/HSL/12X loop has recovered, record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC and perform “RIOIP10” on page 36. After the loop recovers or after you have exchanged all the FRUs, continue with the next step. To replace a FRU, refer to Chapter 3, “Locating FRUs,” on page 335. 14. Did the RIO/HSL/12X loop recover? No: Contact your next level of support. This ends the procedure. Yes: Close the problem. This ends the procedure.RIOIP56 Use this procedure to restore the RIO 12X link to optimal bandwidth. 1. Record the RIO/HSL/12X loop number in the first four characters of word 7 of this SRC. The loop number is in hexadecimal format and must be converted to decimal. 2. Is the system managed by a Hardware Management Console? Yes: Continue with the next step. No: Go to step 6 on page 40.3. Perform the following from the Hardware Management Console: a. Select the system you are working on. b. Navigate to Hardware (information) and expand it. c. Select View RIO -12X Topology. d. In the current topology area, scroll until you find the data for the RIO/HSL/12X decimal loop number you identified in step 1.Is the link width 12X? Yes: The RIO 12X cable connection is now operating at the optimal bandwidth. No further action is required. This ends the procedure. No: Continue with the next step.4. Unplug both ends of the cable indicated in the FRU list for at least 30 seconds and then reconnect it. Refresh the View RIO-12X Topology view on the Hardware Management Console and verify that the width is now 12X for the decimal loop number you identified in step 1. Is the link width 12X? Yes: The RIO 12X cable connection is now operating at the optimal bandwidth. No further action is required. This ends the procedure. No: Continue with the next step.5. Replace the cable. Refresh the View RIO-12X Topology view on the Hardware Management Console and verify that the width is now 12X for the decimal loop number you identified in step 1. Is the link width 12X? Yes: The RIO 12X cable connection is now operating at the optimal bandwidth. No further action is required. This ends the procedure. No: Continue replacing the items in the FRU list until the problem is resolved. This ends the procedure. Chapter 2. Isolation Procedures 39 6. Unplug both ends of the cable indicated in the FRU list for at least 30 seconds and then reconnect the cable. It is not possible to concurrently verify that the RIO 12X link has been restored to optimal bandwidth. If the same SRC occurs for this RIO 12X link after the next IPL, the problem has not been resolved. Replace the cable and check for the error condition after the next IPL. Continue replacing the items in the FRU list, and perform an IPL the system each time until the problem has been resolved. This ends the procedure. 40 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Multi-adapter bridge (MAB) isolation procedures Use multi-adapter bridge (MAB) isolation procedures if there is no HMC attached to the server. If the server is connected to the HMC, use the procedures that are available on the HMC to continue FRU isolation. “MABIP02”Use this procedure to resolve a problem with a multi-adapter bridge. “MABIP03”Use this procedure to isolate a failing PCI adapter under a multi-adapter bridge. “MABIP05”Use this procedure to reset an IOP. “MABIP50” on page 42Use this procedure to isolate a failing PCI I/O adapter under a multi-adapter bridge. “MABIP51” on page 42Use this procedure to resolve a problem with a multi-adapter bridge. “MABIP52” on page 42This procedure will isolate a failing PCI adapter from a reference code when an IPL is not successful on the system or logical partition. “MABIP53” on page 43Use this procedure to determine a card position when no location is given for a PCI adapter FRU. “MABIP54” on page 43Use this procedure to isolate the failing PCI I/O adapter from a reference code with a Direct Select Address when the serviceable event view does not indicate a location for the PCI adapter. “MABIP55” on page 43Use this procedure to isolate a failing I/O adapter.MABIP02 Use this procedure to resolve a problem with a multi-adapter bridge. Perform “MABIP51” on page 42. MABIP03 Use this procedure to isolate a failing PCI adapter under a multi-adapter bridge. Perform “MABIP50” on page 42. MABIP05 Use this procedure to reset an IOP. Attention: When the IOP reset is performed, all resources controlled by the IOP will be reset. Perform this procedure only if the customer has verified that the IOP reset can be performed at this time. 1. Go to the SST/DST display in the partition which reported the problem. Use STRSST if i5/OS is running; use function 21 if STRSST does not work; or IPL the partition to DST. 2. On the Start Service Tools Sign On display, type in a user ID with service authority and password. 3. Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System bus resources. 4. Page forward until you find the IOP that you want to reset. For help in identifying the IOP from the Direct Select Address (DSA) in the reference code, see “DSA translation” on page 12. 5. Verify that the IOP are correct by matching the resource name(s) on the display with the resource name(s) in the Service Action Log (SAL) for the problem you are working on. Chapter 2. Isolation Procedures 41 6. Move the cursor to the IOP that you want to reset, and select I/O Debug ? Reset IOP ? IPL IOP.This ends the procedure.MABIP50 Use this procedure to isolate a failing PCI I/O adapter under a multi-adapter bridge. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC MABIP51 Use this procedure to resolve a problem with a multi-adapter bridge. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC MABIP52 This procedure will isolate a failing PCI adapter from a reference code when an IPL is not successful on the system or logical partition. Attention: The removal and replacement procedure of all FRUs in this procedure are performed using dedicated maintenance (partition powered off). If not, the problem cannot be fixed. 1. Determine the PCI bridge set (multi-adapter bridge domain) by performing the following: a. Record the bus number (BBBB), the multi-adapter bridge number (C) and the multi-adapter bridge function number (c) from the Direct Select Address (DSA) in word 7 of the reference code. See “DSA translation” on page 12 for help in determining these values. b. Use the bus number that you recorded and the System Configuration Listing (or ask the customer) to determine what frame the bus is in. c. Record the frame type where the bus is located. d. Use the System Configuration Listing, the card position table for the frame type that you recorded, the bus number, and the multi-adapter bridge number to determine the PCI bridge set where the failure occurred. The PCI bridge set is the group of card positions controlled by the same multi-adapter bridge on the bus that you recorded. e. Use the card position table to record the PCI bridge set card positions. f. Examine the PCI bridge set in the frame, and record all the positions with IOA cards installed in them.2. Perform the following: a. Power off the partition. b. Remove all the IOA cards in the PCI bridge set identified in step 1. Be sure to record the card position of each IOA so that you can reinstall it in the same position later. To determine the removal and replacement procedures for the IOAs, go to Chapter 3, “Locating FRUs,” on page 335. c. Power on the partition. Does the reference code or failure that sent you to this procedure occur? No: Continue with the next step. Yes: The problem is the multi-adapter bridge. Continue with step 4 on page 43.3. Reinstall one of the IOAs and power on the partition. Does the reference code or failure that sent you to this procedure occur? Yes: The IOA that you just installed is the failing FRU. Replace the IOA.This ends the procedure. No: 42 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v If a different SRC occurs, return to Chapter 1, “Starting a Service Call,” on page 1 and follow the service procedures for the new reference code. This ends the procedure. v If no SRC occurs and there are more IOAs to install, power off the partition and repeat this step. v If no SRC occurs and there are no more IOAs to install, the problem is intermittent; contact your next level of support. This ends the procedure. 4. Power off the partition. Determine which FRU contains the multi-adapter bridge. Locate the card position table for the frame type that you recorded. Perform the following: a. Using the multi-adapter bridge number that you recorded, search for the multi-adapter bridge function number ?F? in the card position table to determine the card position of the multi-adapter bridge’s FRU. b. Exchange the multi-adapter bridge’s FRU at the card position that you determined for it. See Chapter 3, “Locating FRUs,” on page 335 to determine the correct removal and replacement procedure. c. Install all IOAs in their original positions. d. Power on the partition.Does the reference code or failure that sent you to this procedure occur? No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. Yes: Call your next level of support. This ends the procedure.MABIP53 Use this procedure to determine a card position when no location is given for a PCI adapter FRU. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC. MABIP54 Use this procedure to isolate the failing PCI I/O adapter from a reference code with a Direct Select Address when the serviceable event view does not indicate a location for the PCI adapter. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC. MABIP55 Use this procedure to isolate a failing I/O adapter. Attention: This procedure is for use with i5/OS only. Go to “PCI bus isolation using AIX, Linux, or the HMC” on page 8 to isolate a PCI bus problem from AIX, Linux, or the HMC. 1. If the system is not IPLed, will it IPL to DST? Option Description Yes No From the SAL display for the reference code, record the count. Continue with the next step. Perform “MABIP54.” This ends the procedure. 2. Go to the SST/DST display in the partition which reported the problem. Use STRSST if i5/OS is running; use function 21 if STRSST does not work; or IPL the partition to DST. 3. On the Start Service Tools Sign On display, type in a user ID with QSRV authority and password. 4. Select Start a service tool > Hardware service manager > Logical hardware resources > System bus resources. Chapter 2. Isolation Procedures 43 5. Is there a resource name logged in the SAL entry? No Yes Continue with the next step. Go to step 12 6. Do you have a location for the I/O processor? Option Description No Yes Record the Direct Select Address (DSA), word 7 of the reference code, from the SAL display. Then continue with the next step. Go to step 10. 7. Return to the HSM System bus resources display. 8. Locate the I/O processor by performing the following: a. Select Display detail. b. Compare the DSA with the bus, card, and board information for the IOP. Note: The card information on the HSM display is in decimal format. You must convert the decimal card information to hexadecimal format to match the DSA format. c. Repeat this step until you find the IOP with the same DSA. 9. Select Cancel, and then go to step 13. 10. Locate the I/O processor in HSM by performing the following for each IOP: a. Select Associated packaging resource(s) > Display detail. b. Repeat until you find the IOP with the same location.11. Select Cancel > Cancel and go to step 13. 12. Page forward until you find the multi-adapter bridge and IOP where the problem exists. Verify that the multi-adapter bridge and IOP are correct by matching the resource name(s) on the display with the resource name(s) in the SAL for the problem you are working on. 13. For the IOP you are working on, select Resources associated with IOP (if the I/O adapters are not already displayed). 14. If there is an IOA that is listed in any state other than ?operational?, then perform steps 15 through 18 on page 45, starting with the disabled IOA by moving the cursor to the disabled IOA. Otherwise, move the cursor to the first IOA that is assigned to the IOP. 15. Select Associated packaging resource(s) > Concurrent maintenance > Power off domain. Record the unit ID of the slot you are powering off. Did the domain power off successfully? 44 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Yes No Perform “MABIP05” on page 41 and then return here and continue with the next step. Choose from the following options: v If only one IOA was listed as failing, power down the system and replace the IOA. Re-IPL the system. If a different reference code occurred, perform problem analysis and work that reference code. If there was no reference code, go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. v If there were multiple failed IOAs and concurrent maintenance did not work on one, then move to the next failed IOA and repeat steps 15 on page 44 through 18. v If concurrent maintenance does not work for multiple failed IOAs, this procedure will not be able to identify a failing I/O adapter. Return to the procedure that sent you here. This ends the procedure. v 16. Did the IOP reset and IPL successfully? Yes No Check for the same failure that sent you to this procedure. Check the system control panel, the SAL for the partition that reported the problem, or the Work with partition status display for the partition that reported the problem. In the SAL, the count will increase if the reference code occurred again. Continue with the next step. This procedure will not be able to identify a failing I/O adapter. Return to the procedure that sent you here. This ends the procedure. 17. Did the same reference code occur after the IOP was reset and IPL’d? Yes No Perform the following: 1. Go to the Hardware Service Manager display. 2. Go to Packaging Hardware Resources. 3. Power on the IOA by selecting Power on domain. 4. Reassign the IOA to the IOP 5. Return to the HSL resource display, showing the IOP and associated resources. 6. Continue with the next step. Go to step 19. 18. Is there any other IOA, assigned to the IOP, that you have not already powered off and on? Header Header Move the cursor to another IOA assigned to the IOP, choosing IOAs with a status of ?unknown? or ?disabled? before moving on to IOAs with a status of ?operational?. Go to step 15 on page 44. Go to step 21 on page 46. 19. The failing IOA is located. Exchange the I/O adapter that you just powered off. Use the location you recorded in step 15 on page 44 to locate the IOA. 20. Power on the IOA that you just exchanged. Does the same reference code that sent you to this procedure still occur? Chapter 2. Isolation Procedures 45 Yes No The IOA is not the failing item. Remove the IOA and reinstall the original IOA. Continue with the next step. You have exchanged the failing IOA. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. 21. No failing IOAs were identified. Return to the procedure that sent you here. This ends the procedure. 46 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Communication isolation procedure This topic contains the procedure necessary to isolate a communications failure. Please read and observe the following warnings when using this procedure. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “COMIP01, COMPIP1”This procedure helps you to isolate problems with the communications input/output adapter (IOA) or input/output processor (IOP).COMIP01, COMPIP1 This procedure helps you to isolate problems with the communications input/output adapter (IOA) or input/output processor (IOP). Please read and observe the danger notices in “Communication isolation procedure” before proceeding with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. Chapter 2. Isolation Procedures 47 2. To determine which communications hardware to test, use the SRC from the problem summary form, or problem log, For details on line description information, see the Starting a Trace section of “Work with communications trace” on page 611. 3. Perform the following: a. Vary off the resources. b. On the Start a Service Tool display, select Hardware service manager ? Logical hardware resources ? System bus resources ? Resources associated with IOP for the attached IOPs in the list until you display the suspected failing hardware. c. Select Verify on the hardware you want to test. The Verify option may be valid on the IOP, IOA, or port resource. When it is valid on the IOP resource, any replaceable memory will be tested. Communications IOAs are tested by using the Verify option on the port resource.4. Run the IOA/IOP test(s). This may include any of the following: v Adapter internal test v Adapter wrap test (requires adapter wrap plug - available from your hardware service provider). v Processor internal test v Memory test v System port test Does the IOA/IOP test(s) complete successfully? No: The problem is in the IOA or IOP. If a verify test identified a failing memory module, replace the memory module. On multiple card combinations, exchange the IOA card before exchanging the IOP card. Exchange the failing hardware. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. Yes: The IOA/IOP is good. Do NOT replace the IOA/IOP. Continue with the next step.5. Before running tests on modems or network equipment, the remaining local hardware should be verified Since the IOA/IOP test(s) have completed successfully, the remaining local hardware to be tested is the external cable. Is the IOA adapter type 2838, with a UTP (unshielded twisted pair) external cable? Yes: Continue with the next step. No: Go to step 8.6. Is the RJ-45 connector on the external cable correctly wired according to the EIA/TIA-568A standard? That is, -Pins 1 and 2 using the same twisted pair, -Pins 3 and 6 using the same twisted pair, -Pins 4 and 5 using the same twisted pair, -Pins 7 and 8 using the same twisted pair. Yes: Continue with the next step. No: Replace the external cable with correctly wired cable. This ends the procedure.7. Do the Line Speed and Duplex values of the line description (DSPLINETH) match the corresponding values for the network device (router, hub or switch) port? No: Change the Line Speed and/or Duplex value for either the line description or the network device (router, hub or switch) port. This ends the procedure. Yes: Go to step 9 on page 49.8. Is the cable wrap test option available as a Verify test option for the hardware you are testing? v Yes: Verify the external cable by running the cable wrap test. A wrap plug is required to perform the test. This plug is available from your hardware service provider. Does the cable wrap test complete successfully? Yes: Continue with the next step. No: The problem is in the cable. Exchange the cable. This ends the procedure. 48 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v No: The communications IOA/IOP is not the failing item. One of the following could be causing the problem. – External cable. – The network. – Any system or device on the network – The configuration of any system or device on the network. – Intermittent problems on the network. – A new SRC - perform problem analysis or ask your next level of support for assistance. Work with the customer or your next level of support to correct the problem. This ends the procedure.9. All the local hardware is good. This completes the local hardware verification. The communications IOA/IOP and/or external cable is not the failing item. One of the following could be causing the problem: v The network v Any system or device on the network v The configuration of any system or device on the network v Intermittent problems on the network v A new SRC - perform problem analysis or ask your next level of support for assistance Work with the customer or your next level of support to correct the problem. This ends the procedure. Chapter 2. Isolation Procedures 49 Intermittent isolation procedures These procedures help you to correct an intermittent problem. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Use these procedures to correct an intermittent problem, if other problem analysis steps or tables sent you here. Only perform the procedures that apply to your system. Read all safety procedures before servicing the system. Observe all safety procedures when performing a procedure. Unless instructed otherwise, always power off the system or expansion unit where the FRU is located. See “Powering on and powering off” on page 588 before removing, exchanging, or installing a field-replaceable unit (FRU). Use the procedure below to identify intermittent problems and the associated corrective actions. “INTIP03” on page 52Use this procedure to isolate problems with external noise on AC voltage lines. “INTIP05” on page 53Use this procedure to isolate problems with external noise on twinaxial cables. 50 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “INTIP07” on page 53Use this procedure to lessen the effects of electrical noise (electromagnetic interference, or EMI) on the system. “INTIP08” on page 54Use this procedure to ensure that the system is electrically grounded correctly. “INTIP09” on page 55Use this procedure to check the AC electrical power for the system. “INTIP14” on page 57Use this procedure to isolate problems with station protectors. “INTIP16” on page 57Use this procedure when you need to copy a main storage dump to give to your next level of support. “INTIP18” on page 57Use this procedure to determine if one or more PTFs are available to correct this specific problem. “INTIP20” on page 57Use this procedure to analyze system performance problems. “INTIP24” on page 58Use this procedure to collect data when the service processor reports a suspected intermittent problem. Chapter 2. Isolation Procedures 51 INTIP03 Use this procedure to isolate problems with external noise on AC voltage lines. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Electrical noise on incoming ac voltage lines can cause various system failures. The most common source of electrical noise is lightning. 1. Ask the customer if an electrical storm was occurring at the time of the failure to determine if lightning could have caused the failure. Could lightning have caused the failure? No: Go to step 3 on page 53. Yes: Continue with the next step.2. Determine if lightning protection devices are installed on the incoming ac voltage lines where they enter the building. There must be a dedicated ground wire from the lightning protection devices to earth ground. Are lightning protection devices installed? Yes: Continue with the next step. 52 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Lightning may have caused the intermittent problem. Recommend that the customer install lightning protection devices to prevent this problem from recurring. This ends the procedure.3. Have an installation planning representative perform the following: a. Connect a recording ac voltage monitor to the incoming ac voltage lines of the units that contain the failing devices with reference to ground. b. Set the voltage monitor to start recording at a voltage slightly higher than the normal incoming ac voltage.Does the system fail again with the same symptoms? No: This ends the procedure. Yes: Continue with the next step.4. Look at the recording and see if the voltage monitor recorded any noise when the failure occurred. Did the monitor record any noise when the failure occurred? Yes: Review with the customer what was happening external to the system when the failure occurred. This may help you to determine the source of the noise. Discuss with the customer what to do to remove the noise or to prevent it from affecting the server. This ends the procedure. No: Perform the next intermittent isolation procedure listed in the Isolation procedure column. This ends the procedure.INTIP05 Use this procedure to isolate problems with external noise on twinaxial cables. Electrical noise on twinaxial cables that are not installed correctly may affect the twinaxial workstation I/O processor card. Examples of this include open shields on twinaxial cables, and station protectors that are not being installed where necessary. Check for the following on the system: v There must be no more than 11 connector breaks in a twinaxial cable run. v Station protectors must be installed (in pairs) where a cable enters or leaves a building. v There can only be two station protectors for each twinaxial run. v There is a maximum of seven devices (with addresses 0-6) for each cable run. v There is a maximum cable length of 1524 meters (5000 feet) for each port. v All cable runs must be ended (terminated). v Disconnect all twinaxial cables that are not used. v Remove any cause of electrical noise in the twinaxial cables. v All workstations must be grounded.This ends the procedure. INTIP07 Use this procedure to lessen the effects of electrical noise (electromagnetic interference, or EMI) on the system. 1. Ensure that air flow cards are installed in all adapter card slots that are not used. 2. Keep all cables away from sources of electrical interference, such as ac voltage lines, fluorescent lights, arc welding equipment, and radio frequency (RF) induction heaters. These sources of electrical noise can cause the system to become powered off. 3. If you have an expansion unit, ensure that the cables that attach the system unit to the expansion unit are seated correctly. Chapter 2. Isolation Procedures 53 Note: If the failures occur when people are close to the system or machines that are attached to the system, the problem may be electrostatic discharge (ESD). 4. Have an installation planning representative use a radio frequency (RF) field intensity meter to determine if there is an unusual amount of RF noise near the server. You also can use it to help determine the source of the noise. This ends the procedure.INTIP08 Use this procedure to ensure that the system is electrically grounded correctly. 1. Have an installation planning representative or an electrician (when necessary), perform the following steps. 2. Power off the server and the power network branch circuits before performing this procedure. 3. Ensure the safety of personnel by making sure that all electrical wiring in the United States meets National Electrical Code requirements. 4. Check all system receptacles to ensure that each one is wired correctly. This includes receptacles for the server and all equipment that attaches to the server, including workstations. Do this to determine if a wire with primary voltage on it is swapped with the ground wire, causing an electrical shock hazard. 5. For each unit, check continuity from a conductive area on the frame to the ground pin on the plug. Do this at the end of the mainline ac power cable. The resistance must be 0.1 ohm or less. 6. Ground continuity must be present from each unit receptacle to an effective ground. Therefore, check the following: v The ac voltage receptacle for each unit must have a ground wire connected from the ground terminal on the receptacle to the ground bar in the power panel. v The ground bars in all branch circuit panels must be connected with an insulated ground wire to a ground point, which is defined as follows: – The nearest available metal cold water pipe, only if the pipe is effectively grounded to the earth (see National Electric Code Section 250-81, in the United States). – The nearest available steel beams in the building structure, only if the beam is effectively grounded to the earth. – Steel bars in the base of the building or a metal ground ring that is around the building under the surface of the earth. – A ground rod in the earth (see National Electric Code Section 250-83, in the United States).Note: For installations in the United States only, by National Electrical Code standard, if more than one of the preceding grounding methods are used, they must be connected together electrically. See National Electric Code Section 250, for more information on grounding. v The grounds of all separately derived sources (uninterruptable power supply, service entrance transformer, system power module, motor generator) must be connected to a ground point as defined above. v The service entrance ground bar must connect to a ground point as defined above. v All ground connections must be tight. v Check continuity of the ground path for each unit that is using an ECOS tester, Model 1023-100. Check continuity at each unit receptacle, and measure to the ground point as defined above. The total resistance of each ground path must be 1.0 ohm or less. If you cannot meet this requirement, check for faults in the ground path. v Conduit is sometimes used to meet wiring code requirements. If conduit is used, the branch circuits must still have a green (or green and yellow) wire for grounding as stated above. Note: The ground bar and the neutral bar must never be connected together in branch circuit power panels. 54 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) The ground bar and the neutral bar in the power panels that make up the electrical power network for the server must be connected together. This applies to the first electrically isolating unit that is found in the path of electrical wiring from the server to the service entrance power panel. This isolating unit is sometimes referred to as a separately derived source. It can be an uninterruptable power supply, the system power module for the system, or the service entrance transformer. If the building has none of the above isolating units, the ground bar and the neutral bar must be connected together in the service entrance power panel. 7. Look inside all power panels to ensure the following: v There is a separate ground wire for each unit. v The green (or green and yellow) ground wires are connected only to the ground bar. v The ground bar inside each power panel is connected to the frame of the panel. v The neutral wires are connected only to the neutral bar. v The ground bar and the neutral bar are not connected together, except as stated in step 6 on page 54. 8. For systems with more than one unit, ensure that the ground wire for each unit is not connected from one receptacle to the next in a string. Each unit must have its own ground wire, which goes to the power source. 9. Ensure that the grounding wires are insulated with green (or green and yellow) wire at least equal in size to the phase wires. The grounding wires also should be as short as possible. 10. If extension-mainline power cables or multiple-outlet power strips are used, make sure that they must have a three-wire cable. One of the wires must be a ground conductor. The ground connector on the plug must not be removed. This applies to any extension mainline power cables or multiple-outlet power strips that are used on the server. It also applies for attaching devices such as personal computers, workstations, and modems. Note: Check all extension-mainline power cables and multiple-outlet power strips with an ECOS tester and with power that is applied. Ensure that no wires are crossed (for example, a ground wire crossed with a wire that has voltage on it).This ends the procedure.INTIP09 Use this procedure to check the AC electrical power for the system. 1. Have an installation planning representative or an electrician (when necessary), perform the following steps. 2. Power off the server and the power network branch circuits before performing this procedure. 3. To ensure the safety of personnel, all electrical wiring in the United States must meet National Electrical Code requirements. 4. Check ALL system receptacles to ensure that each is wired correctly. This includes receptacles for the server and all equipment that attaches to the server, including workstations. Do this to determine if a wire with primary voltage on it has been swapped with the ground wire, causing an electrical shock hazard. 5. When three-phase voltage is used to provide power to the server, correct balancing of the load on each phase is important. The units should be connected so that all three phases are used equally. 6. The power distribution neutral must return to the ?separately derived source? (uninterruptable power supply, service entrance transformer, system power module, motor generator) through an insulated wire that is the same size as the phase wire or larger. 7. The server and its attached equipment should be the only units that are connected to the power distribution network that the server gets its power. 8. The equipment that is attached to the server, such as workstations and printers, must be attached to the power distribution network for the server when possible. 9. Check all circuit breakers in the network that supply ac power to the server as follows: Chapter 2. Isolation Procedures 55 v Ensure that the circuit breakers are installed tightly in the power panel and are not loose. v Feel the front surface of each circuit breaker to detect if it is warm. A warm circuit breaker may be caused by: – The circuit breaker that is not installed tightly in the power panel. – The contacts on the circuit breaker that is not making a good electrical connection with the contacts in the power panel. – A defective circuit breaker. – A circuit breaker of a smaller current rating than the current load which is going through it. – Devices on the branch circuit which are using more current than their rating.10. Equipment that uses a large amount of current, such as: Air conditioners, copiers, and FAX machines, should not receive power from the same branch circuits as the system or its workstations. Also, the wiring that provides ac voltage for this equipment should not be placed in the same conduit as the ac voltage wiring for the server. The reason for this is that this equipment generates ac noise pulses. These pulses can get into the ac voltage for the server and cause intermittent problems. 11. Measure the ac voltage to each unit to ensure that it is in the normal range. Is the voltage outside the normal range? No: Continue with the next step. Yes: Contact the customer to have the voltage source returned to within the normal voltage range.12. The remainder of this procedure is only for a server that is attached to a separately derived source. Some examples of separately derived sources are an uninterruptable power supply, a motor generator, a service entrance transformer, and a system power module. The ac voltage system must meet all the requirements that are stated in this procedure and also all of the following: Notes: a. The following applies to an uninterruptable power supply, but it can be used for any separately derived source. b. System upgrades must not exceed the power requirements of your derived source.The uninterruptable power supply must be able to supply the peak repetitive current that is used by the system and the devices that attach to it. The uninterruptable power supply can be used over its maximum capacity if it has a low peak repetitive current specification, and the uninterruptable power supply is already fully loaded. Therefore, a de-rating factor for the uninterruptable power supply must be calculated to allow for the peak-repetitive current of the complete system. To help you determine the de-rating factor for an uninterruptable power supply, use the following: Note: The peak-repetitive current is different from the ?surge? current that occurs when the server is powered on. The de-rating factor equals the crest factor multiplied by the RMS load current divided by the peak load current where the: v Crest factor is the peak-repetitive current rating of the uninterruptable power supply that is divided by the RMS current rating of the uninterruptable power supply. If you do not know the crest factor of the uninterruptable power supply, assume that it is 1.414. v RMS load current is the steady state RMS current of the server as determined by the power profile. v Peak load current is the steady state peak current of the server as determined by the power profile. For example, if the de-rating factor of the uninterruptable power supply is calculated to be 0.707, then the uninterruptable power supply must not be used more than 70.7% of its kVA-rated 56 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) capacity. If the kVA rating of the uninterruptable power supply is 50 kVA, then the maximum allowable load on it is 35.35 kVA (50 kVA multiplied by 0.707). When a three-phase separately derived source is used, correct balancing of the load as specified in step 5 on page 55 is critical. If the load on any one phase of an uninterruptable power supply is more than the load on the other phases, the voltage on all phases may be reduced. 13. If the system is attached to an uninterruptable power supply or motor generator, then check for the following: v The system and the attached equipment should be the only items that are attached to the uninterruptable power supply or motor generator. Equipment such as air conditioners, copiers, and FAX machines should not be attached to the same uninterruptable power supply, or motor generator that the system is attached. v The system unit console and the Electronic Customer Support modem must get ac voltage from the same uninterruptable power supply or motor generator to which the system is attached. This ends the procedure.INTIP14 Use this procedure to isolate problems with station protectors. Station protectors must be installed on all twinaxial cables that leave the building in which the server is located. This applies even if the cables go underground, through a tunnel, through a covered outside hallway, or through a skyway. Station protectors help prevent electrical noise on these cables from affecting the server. 1. Look at the Product Activity Log to determine what workstations are associated with the failure. 2. Determine if station protectors are installed on the twinaxial cables to the failing workstations. Are station protectors installed on the twinaxial cables to the failing workstations? Yes: Perform the next intermittent isolation procedure listed in the Isolation procedure column. This ends the procedure. No: You may need to install station protectors on the twinaxial cables to the failing workstations. This ends the procedure.INTIP16 Use this procedure when you need to copy a main storage dump to give to your next level of support. For some problems, performing a dump of main storage helps to analyze the problem. The data on the dump is analyzed by support personnel to determine the cause of the problem and how to correct it. 1. Copy the main storage dump to tape. See “Copying a dump” on page 656. 2. Ask your next level of support to determine for assistance. This ends the procedure.INTIP18 Use this procedure to determine if one or more PTFs are available to correct this specific problem. 1. Ensure that all PTFs that relate to the problem have been installed. Note: Ensure that the latest platform LIC fix has been installed before you exchange a service processor. 2. Contact your next level of support for more information. This ends the procedure.INTIP20 Use this procedure to analyze system performance problems. Chapter 2. Isolation Procedures 57 1. Look in the Product Activity Log, ASM log, or HMC to determine if any hardware errors occurred at the same time that the performance problem occurred. Did any hardware problems occur at the same time that the performance problem occurred? Yes: Perform problem analysis and correct the hardware errors. This ends the procedure. No: The performance problems are not related to hardware. Continue with the next step.2. Perform the following steps: a. Ask the customer if they have asked software support for any software PTFs that relate to this problem. b. Recommend that the customer install a cumulative PTF package if they have not done so in the past three months. c. Inform the customer that performance could possibly be improved by having Software Support analyze the conditions. d. Inform the customer that your service provider has performance tools. Contact Software Support for more information. This ends the procedure.INTIP24 Use this procedure to collect data when the service processor reports a suspected intermittent problem. It is important that you collect data for this problem so that the problem can be corrected. Use this procedure to collect the data. There are several ways the system can display the SRC. Follow the instructions for the correct display method, defined as follows: v If this SRC is displayed in the Product Activity Log or ASM log, then record all of the SRC data words, save all of the error log data, and contact your next level of support to submit an APAR. v If the control panel is displaying SRC data words scrolling automatically through control panel functions 11, 12 and 13, and the control panel user interface buttons are not responding, then perform “FSPSP02” on page 206 instead of using this procedure. v If the SRC is displayed at the control panel, and the control panel user interface buttons respond normally, then record all of the SRC words.Do not perform an IPL until you perform a storage dump of the service processor. To get a storage dump of the service processor, perform the following: 1. Record the complete system reference code (SRC) (functions 11 through 20). 2. Perform a service processor dump. See “Performing dumps” on page 653. 3. Is a display shown on the console? v Yes: Continue with the next step. v No: The problem is not intermittent. Choose from the following options: – If you were sent here from another procedure, return there and follow the procedure for a problem that is not intermittent. – If the problem continues, replace the service processor hardware. This ends the procedure.4. The problem is intermittent. Copy the IOP dump to tape. See “Performing dumps” on page 653. 5. Complete the IPL. 6. Determine if there are available program temporary fixes (PTFs) for this problem. 7. If a PTF is found, apply the PTF. Then, return here and answer the following question. Did you find and apply a PTF for this problem? v Yes: This ends the procedure. v No: Record the following information, and contact your next level of support. – The complete SRC you recorded in this procedure 58 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) – The service processor dump to tape you obtained in step 4 on page 58. – All known system symptoms: - How often the intermittent problem occurs - System environment (IPL, certain applications) - If necessary, other SRCs that you suspect relate to the problem– Information needed to write an LICTR. This ends the procedure. Chapter 2. Isolation Procedures 59 I/O processor (IOP) isolation procedures Use these procedures to isolate a failure in the multiple function I/O card. Please read all safety procedures before servicing the system. Attention: Unless instructed otherwise, always power off the system or expansion tower where the field replaceable unit (FRU) is located before removing, exchanging, or installing a FRU. Attention: Disconnecting the J15 and J16 cables will not prevent the system unit from powering on. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “IOPIP01” on page 62Use this procedure to perform an IPL to dedicated service tool (DST) to determine if the same reference code occurs. “IOPIP13” on page 64Use this procedure to isolate problems on the interface between the I/O card and the storage devices. 60 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “IOPIP16” on page 66Use this procedure to isolate failing devices that are identified by FI codes FI01105, FI01106, and FI01107. “IOPIP17” on page 69Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. “IOPIP18” on page 71Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. “IOPIP19” on page 72You were sent to this procedure from unit reference code (URC) 9010, 9011, or 9013. “IOPIP20” on page 73Use this procedure to isolate the problem when two or more devices are missing from a disk array. “IOPIP21” on page 75Use this procedure to determine the failing disk unit when, a disk unit is not compatible with other disk units in the disk array, or when a disk unit has failed. If the URC is 9025 or 9030, the disk array is running, but it might not be protected. “IOPIP22” on page 75Use this procedure to gather error information and contact your next level of support. “IOPIP23” on page 76You were sent to this procedure from a unit reference code (URC) 9050. “IOPIP25” on page 76Use this procedure to isolate the problem when a device attached to the I/O card has functions that are not given support on the I/O card. “IOPIP26” on page 78Use this procedure to correct the problem when the I/O card recognizes that the attached disk unit must be initialized and formatted. “IOPIP27” on page 79I/O card cache data exists for a missing or failed device. “IOPIP28” on page 81You were sent to this procedure from unit reference code (URC) 9052. “IOPIP29” on page 81The failing item is in a migrated tower. “IOPIP30” on page 81Use this procedure to correct the problem when the system cannot find the required cache data for the attached disk units. “IOPIP31” on page 84Cache data associated with the attached devices cannot be found. “IOPIP32” on page 86You were sent to this procedure from unit reference code (URC) 9011. “IOPIP33” on page 87 The I/O processor card detected a device configuration error. The configuration sectors on the device may be incompatible with the current I/O processor card. “IOPIP34” on page 87You were sent to this procedure from unit reference code (URC) 9027. “IOPIP40” on page 88Use this procedure to isolate the problem when a storage I/O adapter is connected to an incompatible or non-operational auxiliary cache I/O adapter. Chapter 2. Isolation Procedures 61 “IOPIP41” on page 89Use this procedure to correct the problem when an auxiliary cache I/O adapter is not connected to a storage I/O adapter or when an auxiliary cache I/O adapter is connected to an incompatible or non-operational storage I/O adapter.IOPIP01 Use this procedure to perform an IPL to dedicated service tool (DST) to determine if the same reference code occurs. If a new reference code occurs, more analysis may be possible with the new reference code. If the same reference code occurs, you are instructed to exchange the failing items. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. 2. Was the IPL performed from disk (Type A or Type B)? No: Continue with the next step. Yes: Go to step 5. 3. Perform the following: a. Ensure that the IPL media is the correct version and level that are needed for the system model. b. Ensure that the media is not physically damaged. c. Choose from the following options to clean the IPL media: v If it is cartridge type optical media (for example, DVD), do not attempt to clean the media. v If it is non-cartridge type media (for example, CD-ROM), wipe the disc in a straight line from the inner hub to the outer rim. Use a soft, lint-free cloth or lens tissue. Always handle the disc by the edges to avoid finger prints. v If it is tape, clean the recording head in the tape unit. Use the correct Cleaning Cartridge Kit provided by your service provider. 4. Perform a Type D IPL in Manual mode. Does a system reference code (SRC) appear on the control panel? v No: Go to step 8 on page 63. v Yes: Is the SRC the same one that sent you to this procedure? Yes: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure. Refer to the reference code list. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 to help determine part numbers and location in the system. This ends the procedure. No: A different SRC occurred. Use the new SRC to correct the problem. See Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. 5. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 10 on page 63. 6. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v No: Continue with the next step. v Yes: Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: – If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. – If any of the reference codes are not 0000, go to step 10 on page 63, and use the reference code that is not 0000. 62 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Note: Use the characters in the Type column to find the correct reference code table. 7. Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: The problem cannot be isolated any more. Use the original SRC and exchange the failing items. Start with the highest probable cause of failure in the failing item list for this reference code. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 to help determine part numbers and location in the system. This ends the procedure. 8. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v Yes: Continue with the next step. v No: Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: The problem is corrected. This ends the procedure. 9. Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: v If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. v If any of the reference codes are not 0000, continue with the next step and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.10. Record the SRC. Are the SRC and unit reference code (URC) the same ones that sent you to this procedure? Yes: Continue with the next step. No: Use the new SRC or reference code to correct the problem. This ends the procedure.11. Perform the following steps: a. Power off the system or expansion tower. See “Powering on and powering off” on page 588. b. Exchange the FRUs in the failing item list for the SRC you have now. Start with the highest probable cause of failure in the failing item column in the reference code list. Perform the remaining steps of this procedure after you exchange each FRU until you determine the failing FRU.Note: If you exchange a disk unit, do not attempt to save customer data until instructed to do so in this procedure.12. Power on the system or the expansion unit. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 15 on page 64.13. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v Yes: Continue with the next step. v No: Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? – Yes: Continue with the next step. – No: The last FRU you exchanged was failing. Note: Before exchanging a disk unit, you should attempt to save customer data. Chapter 2. Isolation Procedures 63 This ends the procedure.14. Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: v If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. v If any of the reference codes are not 0000, go to step 10 on page 63, and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.15. Record the SRC on the Problem summary form. See “Using the product activity log” on page 617 for details. Is the SRC the same one that sent you to this procedure? v Yes: The last FRU you exchanged is not the failing FRU. Go to step 11 on page 63 to continue FRU isolation. v No: Is the SRC B100 4504 or B100 4505 and have you exchanged disk unit 1 in the system unit, or are all the reference codes on the console 0000? – Yes: The last FRU you exchanged was failing. This ends the procedure. Note: Before exchanging a disk unit, you should attempt to save customer data. – No: Use the new SRC or reference code to correct the problem. This ends the procedure.IOPIP13 Use this procedure to isolate problems on the interface between the I/O card and the storage devices. The unit reference code (part of the SRC that sent you to this procedure) indicates the SCSI bus that has the problem: Unit reference code (URC) SCSI bus 3100 0 3101 1 3102 2 3103 3 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. 2. Were you performing an IPL from removable media (IPL type D) when the error occurred? No: Continue with the next step. Yes: Exchange the FRUs in the failing item list for the reference code that sent you to this procedure. This ends the procedure. 3. Perform the following steps: a. Look in the service action log (see “Using the Service Action Log” on page 609) for other errors logged at or around the same time as the 310x SRC. b. If no entries appear in the service action log, use the product activity log (see “Using the product activity log” on page 617). c. Use the other SRCs to correct the problem before performing an IPL. d. Contact your next level of support as necessary for assistance with SCSI bus problem isolation. e. If the problem is not corrected, continue with the next step. 4. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? 64 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Continue with the next step. Yes: Go to step 7. 5. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failure – Yes: Continue with the next step. – No: Look at the product activity log. See “Using the product activity log” on page 617for details. Is an SRC logged as a result of this IPL? – Yes: Continue with the next step. – No: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure see the failing item list for this reference code. If the failing item list contains FI codes, see the Chapter 3, “Locating FRUs,” on page 335 topic to help determine part numbers and location in the system. This ends the procedure. 6. Are all of the reference codes 0000? On some of the displays, you must press F11 to display reference codes. v No: Continue with the next step. Use the reference code that is not 0000. v Yes: Go to “LICIP11” on page 96 and use cause code 0002. This ends the procedure. 7. Is the SRC the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC. Then use the SRC description to correct the problem. This ends the procedure. 8. Perform the following: a. Power off the system or the expansion tower. See “Powering on and powering off” on page 588 for details. b. Find the I/O card identified in the failing item list. c. Remove the I/O card and install a new I/O card. This item has the highest probability of being the failing item. d. Power on the system or the expansion tower.Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 12 on page 66. 9. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failurev Yes: Does the Display Unknown Mirrored Load-Source Status display appear on the console? Note: On some of these displays, you must press F11 to display reference codes. – Yes: Continue with the next step. – No: Are all of the reference codes 0000? Chapter 2. Isolation Procedures 65 - No: Go to step 12 using the reference code that is not 0000. - Yes: Go to “LICIP11” on page 96 and use cause code 0002. This ends the procedure.v No: Go to step 11.10. Is the reference code the same one that sent you to this procedure? v No: Either a new reference code occurred, or the reference code is 0000. There may be more than one problem. The original I/O card may be failing, but it must be installed in the system to continue problem isolation. Install the original I/O card by doing the following: a. Power off the system or the expansion tower. See “Powering on and powering off” on page 588 for details. b. Remove the I/O card you installed in step 8 on page 65 and install the original I/O card. Note: Do not power on the system or the expansion unit now. A device connected to the I/O card could be the failing item. Go to “IOPIP16,” step (9) to continue isolating the problem. This ends the procedure.v Yes: Go to step 13.11. Look at the product activity log. See “Using the product activity log” on page 617 for details. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: The I/O card, which you removed in step 8 on page 65, is the failing item. This ends the procedure.12. Is the SRC or reference code the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC. Then use the SRC description to correct the problem. This ends the procedure.13. The original I/O card is not the failing item. Install the original I/O card by doing the following: a. Power off the system or the expansion tower. See “Powering on and powering off” on page 588 for details. b. Remove the I/O card you installed in step 8 on page 65 of this procedure and install the original I/O card. Note: Do not power on the system or the expansion unit now.A device connected to the I/O card could be the failing item. Go to “IOPIP16,” step (9) to continue isolating the problem. This ends the procedure.IOPIP16 Use this procedure to isolate failing devices that are identified by FI codes FI01105, FI01106, and FI01107. During this procedure, you will remove devices that are identified by the FI code, and then you will perform an IPL to determine if the symptoms of the failure have disappeared, or changed. You should not remove the load-source disk until you have shown that the other devices are not failing. Removing the load-source disk can change the symptom of failure, although it is not the failing unit. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Use the Hardware Service Manager (HSM) verify function (use DST or SST), and verify that all tape and optical units attached to the SCSI bus (identified by FI01105, FI01106, or FI01107) are operating correctly. See Chapter 5, “Verifying a repair,” on page 557 for details. Note: Do not IPL the system to get to DST. 3. Choose from the following options: 66 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v If verification was successful for all tape and optical units, then go to step 5. v If any tape or optical device could not be verified, or if it failed verification, then exchange the failing item. See the Chapter 3, “Locating FRUs,” on page 335 and continue with the next step. 4. Use the Hardware Service Manager (HSM) verify function (use SST or DST) and verify that the exchanged item is operating correctly. See Chapter 5, “Verifying a repair,” on page 557 for details. Was the verification successful? No: Replace the exchanged device with the original. See Chapter 3, “Locating FRUs,” on page 335 and continue with the next step. Yes: The newly exchanged tape or optical device was the failing item. This ends the procedure. 5. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 8. 6. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failure Note: On some of these displays, you must press F11 to display reference codes. The characters under Type are the same as the 4 leftmost characters of word 1. The characters under Reference Code are the same as the 4 rightmost characters of word 1. – No: Continue with the next step. – Yes: Are all of the reference codes 0000? No: Go to step 8, and use the reference code that is not 0000. Yes: Go to “LICIP11” on page 96 and use cause code 0002. This ends the procedure. 7. Look at the Product Activity Log. See “Using the product activity log” on page 617 for details. Is a reference code logged as a result of this IPL? Yes: Continue with the next step. No: You cannot continue isolating the problem. Use the original reference code and exchange the failing items, starting with the highest probable cause of failure. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 for additional details. This ends the procedure. 8. Is the SRC or reference code the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC or reference code. Then, use the SRC or reference code to correct the problem. This ends the procedure. 9. Isolate the failing device by doing the following: a. Power off the system or the expansion unit if it is powered on. See “Powering on and powering off” on page 588. b. Go to Chapter 3, “Locating FRUs,” on page 335 to find the devices identified by FI code FI01105, FI01106, or FI01107 in the failing item list. c. Disconnect one of the devices that are identified by the FI code, other than the load-source disk unit. Note: The tape, or optical units should be the first devices to be disconnected, if they are attached to the SCSI bus identified by FI01105, FI01106, or FI01107. Chapter 2. Isolation Procedures 67 d. Go to step 11.10. Continue to isolate the possible failing items by doing the following: a. Power off the system or the expansion unit. See Powering on and powering off. b. Disconnect the next device that is identified by FI codes FI01105, FI01106, or FI01107 in the FRU list. See the note in step 9 on page 67. Do not disconnect disk unit 1 (load-source disk) until you have disconnected all other devices and the load-source disk is the last device that is identified by these FI codes.11. Power on the system or the expansion tower. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 14.12. Does one of the following displays appear on the console? v Disk Configuration Error Report v Disk Configuration Attention Report v Disk Configuration Warning Report v Display Unknown Mirrored Load-Source Status v Display Load-Source Failure Note: On some of these displays, you must press F11 to display reference codes. The characters under Type are the same as the 4 leftmost characters of word 1. The characters under Reference Code are the same as the 4 rightmost characters of word 1. – Yes: Go to step 14. – No: Look at the Product Activity Log. See “Using the product activity log” on page 617 for details. Is a reference code logged as a result of this IPL? No: Continue with the next step. Yes: Go to step 14.13. You are here because the IPL completed successfully. The last device you disconnected is the failing item. Is the failing item a disk unit? No: Exchange the failing item and reconnect the devices you disconnected previously. See the Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. Yes: Exchange the failing FRU. Before exchanging a disk drive, you should attempt to save customer data. This ends the procedure.14. Is the SRC or reference code the same one that sent you to this procedure? Yes: Continue with the next step. No: Record the SRC or reference code on the Problem summary form. Then go to step 16.15. The last device you disconnected is not failing. Have you disconnected all the devices that are identified by FI codes FI01105, FI01106, or FI01107 in the FRU list? No: Leave the device disconnected and return to step 10 to continue isolating the possible failing items. Yes: Replace the device backplane or backplanes associated with the devices you removed in the earlier steps. If the device backplane does not fix the problem, then you cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 for additional information. This ends the procedure.16. Is the SRC B1xx 4504, and have you disconnected the load-source disk unit? (The load-source disk unit is disconnected by disconnecting disk unit 1.) 68 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Yes: Continue with the next step. v No: Does one of the following displays appear on the console, and are all reference codes 0000? – Disk Configuration Error Report – Disk Configuration Attention Report – Disk Configuration Warning Report – Display Unknown Mirrored Load-Source Status – Display Load-Source Failure Note: On some of these displays, you must press F11 to display reference codes. The characters under Type are the same as the 4 leftmost characters of word 1. The characters under Reference Code are the same as the 4 rightmost characters of word 1. Yes: Continue with the next step. No: A new SRC or reference code occurred. Perform problem analysis and correct the problem. This ends the procedure.17. The last device you disconnected may be the failing item. Exchange the last device you disconnected. See the Chapter 3, “Locating FRUs,” on page 335. Note: Before exchanging a disk drive, you should attempt to save customer data. Was the problem corrected by exchanging the last device you disconnected? No: Continue with the next step. Yes: This ends the procedure.18. Reconnect the devices you disconnected previously in this procedure. 19. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure. Do not exchange the FRU that you exchanged in this procedure. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335 to help determine part numbers and location in the system. This ends the procedure.IOPIP17 Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Were you performing an IPL from removable media (IPL type D) when the error occurred? v Yes: Exchange the FRUs in the failing item list for the reference code that sent you to this procedure. v No: Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? No: Continue with the next step. Yes: Go to step 5 on page 70.3. Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? v No: Continue with the next step. v Yes: Does one of the following messages appear in the list? – Missing disk units in the configuration – Missing mirror protection disk units in the configuration – Device parity protected units in exposed mode. - No: Continue with the next step. Chapter 2. Isolation Procedures 69 - Yes: Select option 5, press F11, then press Enter to display the details. Then, choose from the following options: v If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. v If any of the reference codes are not 0000, go to step 5, and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.4. Look at the product activity log. See “Using the Service Action Log” on page 609. Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure (see the failing item list for this reference code in the (System Reference Codes)) topic. If the failing item list contains FI codes, see (Failing items) to help determine part numbers and location in the system. This ends the procedure.5. Record the SRC. Is the SRC the same one that sent you to this procedure? v No: A different SRC or reference code occurred. Use the new SRC or reference code to perform problem analysis and correct the problem. This ends the procedure. v Yes: Determine the device unit reference code (URC) from the SRC. If the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appears on the console, the device URC is displayed under Reference Code. This is on the same line as the missing device. Is the device unit reference code 3020, 3021, 3022, or 3023? Yes: Continue with the next step. No: Go to step 7.6. A unit reference code of 3020, 3021, 3022, or 3023 indicates that there is a problem on an I/O card SCSI bus. The problem can be caused by a device that is attached to the I/O card that: v Is not supported. v Does not match system configuration rules. For example, there are too many devices that are attached to the bus. v Is failing. Perform the following: a. Look at the characters on the control panel Data display or the Problem Summary Form for characters 9 - 16 of the top 16 character line of function 12 (word 3). Use the format BBBB-Cc-bb (BBBB = bus, Cc = card, bb = board) to determine the card slot location for the I/O card (see (Locations and addresses)). b. The unit reference code indicates the SCSI bus that has the problem: URC SCSI Bus 3020 0 3021 1 3022 2 3023 3 c. To find the bus and device locations, see Chapter 3, “Locating FRUs,” on page 335. d. Find the printout that shows the system configuration from the last IPL and compare it to the present system configuration. Note: If configuration is not the problem, a device on the SCSI bus may be failing. e. If you need to perform isolation on the SCSI bus, go to “IOPIP16” on page 66. This ends the procedure.7. The possible failing items are FI codes FI01105 (90%) and FI01112 (10%). Find the device unit address from the SRC (see “The System Reference Code (SRC) Format Description” on page 601). Use this 70 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) information to find the physical location of the device. Record the type and model numbers to determine if the addressed I/O card supports this device. Is the device given support on your system? v No: Continue with the next step. v Yes: Perform the following: a. Exchange the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.8. Perform the following steps: a. Remove the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640.Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.IOPIP18 Use this procedure to isolate problems that are associated with SCSI bus configuration errors and device task initialization failures. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does an SRC appear on the control panel? v Yes: Go to step 5 on page 72. v No: Does either the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear on the console? Yes: Continue with the next step. No: Go to step 4.3. Does one of the following messages appear in the list? v Missing disk units in the configuration v Missing mirror protection disk units in the configuration v Device parity protected units in exposed mode. – No: Continue with the next step. – Yes: Select option 5, press F11, and then press Enter to display the details. Choose from the following options: - If all of the reference codes are 0000, go to “LICIP11” on page 96 and use cause code 0002. - If any of the reference codes are not 0000, go to step 5 on page 72, and use the reference code that is not 0000. Note: Use the characters in the Type column to find the correct reference code table.4. Look at the product activity log (see (Log) for details). Is an SRC logged as a result of this IPL? Yes: Continue with the next step. No: You cannot continue isolating the problem. Use the original SRC and exchange the failing items, starting with the highest probable cause of failure in the failing item column in the reference Chapter 2. Isolation Procedures 71 code list. If the failing item list contains FI codes, see Chapter 3, “Locating FRUs,” on page 335to help determine part numbers and location in the system. This ends the procedure.5. Record the SRC. Is the SRC the same one that sent you to this procedure? Yes: Continue with the next step. No: A different SRC or reference code occurred. Use the new SRC or reference code to correct the problem. This ends the procedure.6. Determine the device unit reference code (URC) from the SRC. If the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appears on the console, the device URC is displayed under Reference Code. This is on the same line as the missing device. Is the device unit reference code 3020? v No: Continue with the next step. v Yes: A device reference code of 3020 indicates that a device is attached to the addressed I/O card. Either it is not supported, or it does not match system configuration rules. For example, there are too many devices that are attached to the bus. Perform the following steps: a. Find the printout that shows the system configuration from the last IPL and compare it to the present system configuration. b. Use the unit address and the physical address in the SRC to help you with this comparison. c. If configuration is not the problem, a device on the SCSI bus may be failing. Use FI code FI00884 in the Chapter 3, “Locating FRUs,” on page 335 table to help find the failing device. d. If you need to perform isolation on the SCSI bus, go to “IOPIP16” on page 66. This ends the procedure.7. The possible failing items are FI codes FI01105 (90%) and FI01112 (10%). Find the device unit address from the SRC. Use this information to find the physical location of the device. Record the type and model numbers to determine if the addressed I/O card supports this device. Is the device given support on your system? v No: Continue with the next step. v Yes: Perform the following steps: a. Exchange the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.8. Perform the following steps: a. Remove the device. b. Perform an IPL to DST. See “Performing an IPL to DST” on page 640.Does this correct the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.IOPIP19 You were sent to this procedure from unit reference code (URC) 9010, 9011, or 9013. Contact your next level of support for assistance. 72 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) IOPIP20 Use this procedure to isolate the problem when two or more devices are missing from a disk array. You were sent to this procedure from unit reference code (URC) 9020 or 9021. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. 2. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media. 3. Have any other I/O card or device SRCs (other than a 902F SRC) occurred at about the same time as this error? v Yes: Use the other I/O card or device SRCs to correct the problem. This ends the procedure. v No: Has the I/O card, or have the devices been repaired or reconfigured recently? – Yes: Continue with the next step. – No: Contact your next level of support for assistance. This ends the procedure. 4. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here and view the ?Additional Information? to record the formatted log information. Record all devices that are missing from the disk array. These are the array members that have both a present address of 0 and an expected address that is not 0. Note: There might be more than one Product Activity Log entry with the same Log ID. Access any additional entries by pressing the enter key from the ?Display Detail Report for Resource? screen. View the ?Additional Information? for each entry to record the formatted log information. For example: There might be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. b. Continue with step 6. 5. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record all devices that are missing from the disk array. These are the array members that have both a present address of 0 and an expected address that is not 0. Note: There might be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. In order to interpret the hexadecimal information for these additional disk units, see “More information from hexadecimal reports” on page 620. 6. There are three possible ways to correct the problem: a. Find the missing devices and install them in the correct physical locations in the system. If you can find the missing devices and want to continue with this repair option, then continue with the next step. b. Stop the disk array that contains the missing devices. Attention: Customer data might be lost. If you want to continue with this repair option, go to step 8 on page 74. c. Initialize and format the remaining members of the disk array. Chapter 2. Isolation Procedures 73 Attention: Customer data will be lost. If you want to continue with this repair option, go to step 9. 7. Perform the following: a. Install the missing devices in the correct locations in the system. See Chapter 3, “Locating FRUs,” on page 335. b. Power on the system. See Powering on and powering off.Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem. This ends the procedure. Yes: This ends the procedure. 8. You have chosen to stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk configuration ? Work with device parity protection. Then, continue with the next step.c. Select Stop device parity protection. d. Follow the on-line instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem.This ends the procedure. Yes: This ends the procedure. 9. You have chosen to initialize and format the remaining members of the disk array. Perform the following steps: Attention: Customer data will be lost. a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery ? Disk unit problem recovery procedures, and continue with the next step.10. Select Initialize and format disk unit. 11. Follow the online instructions to format and initialize the disk units. 12. Perform an IPL from disk. Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem. This ends the procedure. Yes: This ends the procedure. 74 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) IOPIP21 Use this procedure to determine the failing disk unit when, a disk unit is not compatible with other disk units in the disk array, or when a disk unit has failed. If the URC is 9025 or 9030, the disk array is running, but it might not be protected. You were sent to this procedure from a unit reference code (URC) of 9025, 9030, or 9032. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Is the device location information for this SRC available in the service action log (see “Using the Service Action Log” on page 609 for details)? Yes: Exchange the disk unit. No: Continue with the next step.3. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640.4. Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. b. Press the F9 key for address information. This is the I/O card address.Note: There may be more than one entry with the same Log ID. Entries with the same Log ID may be accessed by pressing the Enter key from the ?Display Detail Report for Resource? screen. Example: There may be a device specific SRC and/or an xxxx902F SRC entry in the Product Activity Log. The xxxx902F SRC will occur if there are more than 10 disk units in the array. c. Continue with the next step.5. Perform the following steps: a. Return to the SST or DST main menu. b. Select Work with disk units ? Display disk configuration ? Display disk configuration status. c. On the Display disk configuration status display, look for the devices attached to the I/O card that is identified in step 4. d. Find the device that has a status of ?DPY/Unknown? or ?DPY/Failed?. This is the device that is causing the problem. Show the device address by selecting Display Disk Unit Details ? Display Detailed Address. Record the device address. e. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit and find the following: v The card slot that is identified by the I/O card direct select address v The disk unit location that is identified by the device addressHave you determined the location of the I/O card and disk unit that is causing the problem? Yes: Exchange the disk unit that is causing the problem. This ends the procedure. No: Ask your next level of support for assistance. This ends the procedure.IOPIP22 Use this procedure to gather error information and contact your next level of support. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. Chapter 2. Isolation Procedures 75 2. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.3. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here and view the ?Additional Information? to record the formatted log information. Record all the information. Note: There may be more than one Product Activity Log entry with the same Log ID. Access any additional entries by pressing the Enter key from the ?Display Detail Report for Resource? screen. View the ?Additional Information? for each entry to record the formatted log information. Example: There may be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. b. Continue with step 5.4. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record all the information. Then continue with the next step. Note: There may be an xxxx902F SRC entry in the Product Activity Log if there are more than 10 disk units in the array. In order to interpret the hexadecimal information for these additional disk units, see “More information from hexadecimal reports” on page 620. 5. Ask your next level of support for assistance. Note: Your next level of support may require the error information you recorded in the previous step.This ends the procedure.IOPIP23 You were sent to this procedure from a unit reference code (URC) 9050. If the failing item is in a migrated tower, use the (Migrated Expansion Tower Problem Analysis, Repair and Parts) manual on the V5R1 Supplemental Manuals web site to fix the problem. Otherwise, contact your next level of support for assistance. IOPIP25 Use this procedure to isolate the problem when a device attached to the I/O card has functions that are not given support on the I/O card. You were sent to this procedure from URC 9008. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593, before continuing with this procedure. 2. Have any other I/O card or device SRCs occurred at about the same time as this error? v Yes: Use the other I/O card or device SRCs to correct the problem. See the system reference codes. This ends the procedure. v No: Has the I/O card, or have the devices been repaired or reconfigured recently? – Yes: Continue with the next step. 76 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) – No: Contact your next level of support for assistance. This ends the procedure.3. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.4. Did you perform a D IPL to get to DST? v No: Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the I/O card address. Then, view the ?Additional Information? to record the formatted log information. Record the addresses that are not 0000 0000 for all devices listed. Continue with the next step. v Yes: Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the I/O Card is in the format BBBB-Cc-bb: – BBBB = hexadecimal offsets 4C and 4D – Cc = hexadecimal offset 51 – bb = hexadecimal offset 4F The unit address of the I/O card is hexadecimal offset 18C through 18F. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record the addresses that are not 0000 0000 for all devices listed. Continue with the next step.5. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following: v The card slot that is identified by the I/O card direct select address (DSA) and unit address. If there is no IOA with a matching DSA and unit address, the IOP and IOA are one card. Use the IOP with the same DSA. v The disk unit locations that are identified by the unit addresses. Have you determined the location of the I/O card and the devices that are causing the problem? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: Have one or more devices been moved to this I/O card from another I/O card? Yes: Continue with the next step. No: Ask your next level of support for assistance. This ends the procedure.6. Is the I/O card capable of supporting the devices attached, or is it in the correct mode to support the devices attached? Note: For information on I/O card modes, see “Storage I/O card modes and jumpers” on page 250. v No: Remove the devices from the I/O card. Note: You can remove disk units without installing another disk unit, and the system continues to operate.This ends the procedure. v Yes: Do you want to continue using these devices with this I/O card? Yes: Continue with the next step.v No: Either change the I/O card mode or remove the devices from the I/O card. This ends the procedure.7. Initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost. Chapter 2. Isolation Procedures 77 a. Access SST or DST. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery ? Disk unit problem recovery procedures. Then continue with the next step.c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This may take 30 minutes or longer. The disk unit is now ready to be added to the system configuration. This ends the procedure.IOPIP26 Use this procedure to correct the problem when the I/O card recognizes that the attached disk unit must be initialized and formatted. You were sent to this procedure from URC 9092. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Have any other I/O card or device SRCs occurred at about the same time as this error? v Yes: Use the other I/O card or device SRCs to correct the problem. This ends the procedure. v No: Has the I/O card, or have the devices been repaired or reconfigured recently? Yes: Continue with the next step. No: Contact your next level of support for assistance. This ends the procedure.3. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640 for details. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.4. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. b. Press the F9 key for address information. This is the I/O card address. c. Then view the ?Additional Information? to record the formatted log information. d. Record the addresses that are not 0000 0000 for all devices listed. e. Continue with step 6 on page 79.5. Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the I/O card is in the format BBBB-Cc-bb: v BBBB = hexadecimal offsets 4C and 4D v Cc = hexadecimal offset 51 v bb = hexadecimal offset 4F The unit address of the I/O card is hexadecimal offset 18C through 18F.b. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. Record the addresses that are not 0000 0000 for all devices listed. c. Continue with the next step. 78 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following: v The card slot that is identified by the I/O card direct select address (DSA) and unit address. If there is no IOA with a matching DSA and unit address, the IOP and IOA are one card. Use the IOP with the same DSA. v The disk unit locations that are identified by the unit addresses. Have you determined the location of the I/O card and the devices that are causing the problem? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: Have one or more devices been moved to this I/O card from another I/O card? Yes: Continue with the next step. No: Ask your next level of support for assistance. This ends the procedure.7. Do you want to continue using these devices with this I/O card? v Yes: Continue with the next step. v No: Remove the devices from the I/O card. Note: You can remove disk units without installing another disk unit, and the system continues to operate.This ends the procedure.8. Initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost. a. Access SST or DST. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery ? Disk unit problem recovery procedures. Then continue with the next step.c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This may take 30 minutes or longer. The disk unit is now ready to be added to the system configuration. This ends the procedure.IOPIP27 I/O card cache data exists for a missing or failed device. You were sent to this procedure from a unit reference code (URC) of 9051. Note: For some storage I/O adapters, the cache card is integrated and not removable. Having I/O card cache data for a missing or failed device might be caused by the following conditions: v One or more disk units have failed on the I/O card. v The cache card of the I/O card was not cleared before it was shipped as a MES to the customer. In addition, the service representative moved devices from the I/O card to a different I/O card before performing a system IPL. v The cache card of the I/O card was not cleared before it was shipped to the customer. In addition, residual data was left in the cache card for disk units that manufacturing used to test the I/O card. v The I/O card and cache card were moved from a different system or a different location on this system after an abnormal power off. v One or more disk units were moved either concurrently, or they were removed after an abnormal power off. Chapter 2. Isolation Procedures 79 CAUTION: Any Function 08 power down (including from a D-IPL) is an abnormal power off. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media. 3. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. b. Press the F9 key for address information. This is the I/O card address. c. Then view the ?Additional Information? to record the formatted log information. Record the device types and serial numbers for those devices that show a unit address of 0000 0000. d. Continue with step 5. 4. Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the I/O card is in the format BBBB-Cc-bb: v BBBB = hexadecimal offsets 4C and 4D. v Cc = hexadecimal offset 51 v bb = hexadecimal offset 4F The unit address of the I/O card is hexadecimal offset 18C through 18F. A formatted display of hexadecimal information for Product Activity Log entries is not available. In order to interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. b. Record the device types and serial numbers for those devices that show a unit address of 0000 0000. c. Continue with the next step. 5. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Find the card slot that is identified by the I/O card direct select address (DSA) and unit address. If there is no IOA with a matching DSA and unit address, the IOP and IOA are one card. Use the IOP with the same DSA. 6. Choose from the following options: v If the devices from step 3 of this procedure have never been installed on this system, continue with the next step. v If the devices are not in the current system disk configuration, go to step 9 on page 81. v Otherwise, the devices are part of the system disk configuration; go to step 11 on page 81. 7. Choose from the following options: v If this I/O card and cache card were moved from a different system, continue with the next step. v Otherwise, the cache card was shipped to the customer without first being cleared. Perform the following: a. Make a note of the serial number, the customer number, and the device types and their serial numbers. These were found in step 3. b. Inform your next level of support. 80 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) c. Then go to step 10 to clear the cache card and correct the URC 9051 problem. 8. Install both the I/O card and the cache cards back into their original locations. Then re-IPL the system. There could be data in the cache card for devices in the disk configuration of the original system. After an IPL to DST and a normal power off on the original system, the cache card will be cleared. It is then safe to move the I/O card and the cache card to another location. 9. One or more devices that are not currently part of the system disk configuration were installed on this I/O card. Either they were removed concurrently, they were removed after an abnormal power off, or they have failed. Continue with the next step. 10. Use the Reclaim IOP cache storage procedure to clear data from the cache for the missing or failed devices as follows: a. Perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Reclaim the cache adapter card storage. See “Reclaiming IOP cache storage” on page 636.11. Choose from the following options: v If this I/O card and cache card were moved from a different location on this system, go to step 8. v If the devices from step 3 on page 80 of this procedure are now installed on another I/O card, and they were moved there before the devices were added to the system disk configuration, go to step 7 on page 80. (On an MES, the disk units are sometimes moved from one I/O card to another I/O card. This problem will result if manufacturing did not clear the cache card before shipping the MES.) v Otherwise, continue with the next step.12. One or more devices that are currently part of the system disk configuration are either missing or failed, and have data in the cache card. Consider the following: v The problem may be because devices were moved from the I/O card concurrently, or they were removed after an abnormal power off. If this is the case, locate the devices, power off the system and install the devices on the correct I/O card. v If no devices were moved, look for other errors logged against the device, or against the I/O card that occurred at approximately the same time as this error. Continue the service action by using these system reference codes.IOPIP28 You were sent to this procedure from unit reference code (URC) 9052. If the failing item is in a migrated tower, use the (Migrated Expansion Tower Problem Analysis, Repair and Parts) manual on the V5R1 Supplemental Manuals web site to fix the problem. Otherwise, contact your next level of support for assistance. IOPIP29 The failing item is in a migrated tower. You were sent to this procedure from URC 9012. Perform SDIOP-PIP29 in the (Migrated Expansion Tower Problem Analysis, Repair and Parts) manual on the V5R1 Supplemental Manuals web site . IOPIP30 Use this procedure to correct the problem when the system cannot find the required cache data for the attached disk units. You were sent to this procedure from URC 9050. Chapter 2. Isolation Procedures 81 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Did you just exchange the storage input/output (I/O) adapter as a result of a failure? No: Continue with step 13 on page 83. Yes: Continue with the next step. 3. Are you working with a 571F/575B card set? No: Go to step 5. Yes: Continue with the next step. 4. Remove the 571F/575B card set. Create a new card set with the following: Note: Label all parts (both original and new) before moving them. v The new replacement 571F storage IOA. v The cache directory card from the original 571F storage IOA. v The original 575B auxiliary cache adapter.See “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system and go to step 6. 5. Remove the I/O adapter. Install the new replacement storage I/O adapter with the following parts installed on it: Note: Label all parts (both old and new) before moving them. v The cache directory card from the original storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the original storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335 for information on removing and replacing parts.. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see Chapter 8, “Part Information,” on page 691. 6. Did the 9050 SRC that sent you to this procedure occur on a type-D IPL? Yes: Perform a type-D IPL and continue with the next step. No: Continue with the next step. 7. Has a new 9010 or 9050 SRC occurred in the Service Action Log or Product Activity Log? No: Go to step 10. Yes: Continue with the next step. 8. Was the new SRC 9050? No: Continue with the next step. Yes: Contact your next level of support. This ends the procedure. 9. The new SRC was 9010. Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Note: When an auxiliary cache I/O adapter that is connected to the storage I/O adapter logs a 9055 SRC in the Product Activity Log, the reclaim does not result in lost sectors. Otherwise, the reclaim does result in lost sectors, and the system operator might want to restore data from the most recent saved tape after you complete the repair. 10. Are you working with a 571F/575B card set? No: Go to step 12 on page 83. 82 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Yes: Continue with the next step.11. Remove the 571F/575B card set. Create a new card set with the following: v The new 571F storage IOA v The cache directory card from the new 571F storage IOA v The new 575B auxiliary cache adapterSee “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system. This ends the procedure.12. Remove the I/O adapter. Install the new replacement storage I/O adapter that has the following parts installed on it: v The cache directory card from the new storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the new storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see Chapter 8, “Part Information,” on page 691.This ends the procedure 13. Identify the affected disk units using information in the Product Activity Log. Access SST/DST by doing one of the following: v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.14. Did you perform a D IPL to get to DST? v Yes: Continue with the next step. v No: Perform the following steps: a. Access the Product Activity Log and display the SRC that sent you here, then view Additional Information to record the formatted log information. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, and serial number are provided for up to three disk units. Additionally, the controller type and serial number for each of these disk units indicates the adapter to which the disk was last attached when it was operational.Note: You might find more than one Product Activity Log entry with the same Log ID. Access any additional entries by pressing Enter from the Display Detail Report for Resource screen. View Additional Information for each entry, and record the formatted log information. For example: You might find an entry for an xxxx902F SRC in the Product Activity Log when the array includes more than 10 disk units. b. Continue with step 16 on page 84.15. A formatted display of hexadecimal information for Product Activity Log entries is not available. To interpret the hexadecimal information, see “More information from hexadecimal reports” on page 620. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, and serial number are provided for up Chapter 2. Isolation Procedures 83 to three disk units. Additionally, the controller type and serial number for each of these disk units indicates the adapter to which the disk was last attached when it was operational.Note: You might find an entry for an xxxx902F SRC entry in the Product Activity Log when the array includes more than 10 disk units. To interpret the hexadecimal information for these additional disk units, see “More information from hexadecimal reports” on page 620. 16. Has the I/O card or have the devices been repaired or reconfigured recently? Yes: Continue with the next step. No: Contact your next level of support for assistance. This ends the procedure.17. You can use one of the following repair options to correct the problem: v Reunite the adapter and disk units identified in previous steps so that the cache data can be written to the disk units. If you can find the devices and adapters and want to continue with this repair option, then continue with the next step. v If the data for the disk units identified in previous steps is not needed on this or any other system, initialize and format these disk units. Attention: This repair option causes a loss of customer data. If you want to continue with this repair option, go to step 19.18. Perform the following: a. Restore the adapter and disk units back to their original configuration. For more information, see Chapter 3, “Locating FRUs,” on page 335. After the system writes cache data to the disk units and you power off the system normally, you can move the adapter and disk units to another location. b. Power on the system. For more information, see Powering on and powering off. Does the IP complete successfully? No: Perform problem analysis to correct the new problem. This ends the procedure. Yes: This ends the procedure.19. You have chosen to initialize and format the identified disk units. Perform the following steps: Attention: Performing the following steps causes a loss of customer data. a. If you are not already using dedicated service tools, perform an IPL to DST. For more information, see “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a Type D IPL? Yes: Continue with the next step. No: Select Work with disk unit recovery > Disk unit problem recovery procedures, then continue with the next step.20. Select Initialize and format disk unit. 21. Follow the online instructions to format and initialize the disk units. 22. Perform an IPL from disk. Does the IPL complete successfully? No: Perform problem analysis and correct the new problem. This ends the procedure. Yes: This ends the procedure.IOPIP31 Cache data associated with the attached devices cannot be found. You were sent to this procedure from URC 9010. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Has the system been powered off for several days? 84 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v No: Go to step 4. v Yes: The cache battery pack may be depleted. Do NOT replace the I/O adapter or the cache battery pack. Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Then continue with the next step. Note: When an auxiliary cache I/O adapter connected to the storage I/O adapter logs a 9055 SRC in the Product Activity Log, the Reclaim does not result in lost sectors. Otherwise, the Reclaim does result in lost sectors, and the system operator may want to restore data from the most recent saved tape after you complete the repair. 3. Does the IPL complete successfully? No: Contact your next level of support. This ends the procedure. Yes: This ends the procedure. 4. Are you working with a 571F/575B card set? No: Go to step 6. Yes: Continue with the next step. 5. Remove the 571F/575B card set. Create a new card set with the following: Note: Label all parts (both original and new) before moving them. v The new replacement 571F storage IOA v The cache directory card from the original 571F storage IOA v The original 575B auxiliary cache adapterSee “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system and go to step 7. 6. Remove the I/O adapter. Install the new replacement storage I/O adapter with the following parts installed on it: Note: Label all parts (both old and new) before moving them. v The cache directory card from the original storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the original storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see Chapter 8, “Part Information,” on page 691. 7. Did the 9010 SRC that sent you to this procedure occur on a type-D IPL? No: Continue with the next step. Yes: Perform a type-D IPL and continue with the next step. 8. Has a new 9010 or 9050 SRC occurred in the Service Action Log? No: Go to step 11 on page 86. Yes: Continue with the next step. 9. Was the new SRC 9050? No: Continue with the next step. Yes: Contact your next level of support. This ends the procedure.10. The new SRC was 9010. Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Chapter 2. Isolation Procedures 85 Note: When an auxiliary cache I/O adapter that is connected to the storage I/O adapter logs a 9055 SRC in the Product Activity Log, the reclaim does not result in lost sectors. Otherwise, the reclaim does result in lost sectors, and the system operator might want to restore data from the most recent saved tape after you complete the repair. 11. Are you working with a 571F/575B card set? No: Go to step 13. Yes: Continue with the next step.12. Remove the 571F/575B card set. Create a new card set with the following: v The new 571F storage IOA v The cache directory card from the new 571F storage IOA v The new 575B auxiliary cache adapterSee “Separating the 571F/575B card set and moving the cache directory card” on page 594. v Ensure that the SCSI cable and the battery power cable on the top edge of the storage side of the card are connected to the top edge of the auxiliary cache side of the card. v Reinstall this card set into the system.This ends the procedure. 13. Remove the I/O adapter. Install the new replacement storage I/O adapter that has the following parts installed on it: v The cache directory card from the new storage I/O adapter. On adapters with removable cache cards, the cache directory card will move with the removable cache card. v The removable cache card from the new storage I/O adapter (this applies to only the 571E and some 2780 I/O adapters).See Chapter 3, “Locating FRUs,” on page 335. v If the I/O adapter is attached to an auxiliary cache I/O adapter, ensure that the SCSI cable on the last port of the new replacement storage I/O adapter is connected to the auxiliary cache I/O adapter. For a list of auxiliary cache I/O adapters, see the Chapter 8, “Part Information,” on page 691.This ends the procedureIOPIP32 You were sent to this procedure from unit reference code (URC) 9011. Attention: There is data in the cache of this I/O card, that belongs to devices other than those that are attached. Customer data may be lost. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Did you just exchange the storage I/O adapter as a result of a failure? v No: Continue with the next step. v Yes: Reclaim the cache storage. See “Reclaiming IOP cache storage” on page 636. Does the IPL complete successfully? No: You have a new problem. Perform problem analysis and correct the problem. This ends the procedure. Yes: This ends the procedure.3. Have the I/O cards been moved or reconfigured recently? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: Perform the following steps: 86 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. Power off the system. See Powering on and powering off for details. b. Restore all I/O cards to their original position. c. Select the IPL type and mode that are used by the customer. d. Power on the system. Does the IPL complete successfully? No: Ask your next level of support for assistance. This ends the procedure. Yes: This ends the procedure.IOPIP33 The I/O processor card detected a device configuration error. The configuration sectors on the device may be incompatible with the current I/O processor card. You were sent to this procedure from unit reference code (URC) 9001. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to (Determine if the system has logical partitions) before continuing with this procedure. 2. Has the I/O adapter been replaced with a different type of I/O adapter, or have the devices been moved from a different type of I/O adapter to this one? v No: Contact your next level of support. This ends the procedure. v Yes: Continue with the next step.3. Does the disk unit contain data that needs to be saved? v Yes: Continue with the next step. v No: Initialize and format the disk units. Attention: Any data on the disk unit will be lost. Perform the following:a. Access SST or DST. b. Select Work with disk units. c. Did you get to DST with a type D IPL? v No: Select Work with disk unit recovery ? Disk unit problem recovery procedures. Then, continue with the next step. v Yes: Continue with the next step.d. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display will show that the status is complete. This may take 30 minutes or longer. e. The disk unit is now ready to be added to the system configuration. This ends the procedure.4. The disk unit contains data that needs to be saved. v If the I/O adapter has been replaced with a different type of I/O adapter, reinstall the original I/O adapter. Then continue with the next step. v If the disk units have been moved from a different type of I/O adapter to this one, return the disk units to their original I/O adapter. Then continue with the next step.5. Stop parity protection on the disk units, and power down the system normally with the I/O adapter in an operational state. The I/O adapter or disk units can now be returned to the configuration at the beginning of this procedure. This ends the procedure.IOPIP34 You were sent to this procedure from unit reference code (URC) 9027. The I/O processor card detected that an array is not functional due to the present hardware configuration. Chapter 2. Isolation Procedures 87 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to (Determine if the system has logical partitions) before continuing with this procedure. 2. Has the I/O adapter been replaced with a different I/O adapter, or have the devices been moved from a different I/O adapter to this one? v No: Perform (IOPIP22). This ends the procedure. v Yes: Perform the following:a. Power off the system. See (Power on/off the system and logical partitions). b. Restore all I/O cards or devices to their original position. c. Power on the system.3. Does the IPL complete successfully? v No: Ask your next level of support for assistance. This ends the procedure. v Yes: This ends the procedure.IOPIP40 Use this procedure to isolate the problem when a storage I/O adapter is connected to an incompatible or non-operational auxiliary cache I/O adapter. Perform the following: 1. Are you working on a 571F/575B combination storage and auxiliary cache IOA card set (uses two card slot locations)? v Yes: Continue with step 2. v No: Continue with step 3.2. Find the location of the card. v Use the location displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the I/O adapter. See The System Reference Code (SRC) Format Description. v The location identified is for the 571F side of the card set.You must configure both the 571F and the 575B in the same partition. Are both the 571F side of the card set and the 575B side of the card set configured in the same partition? v Yes: Replace the entire card set. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. v No: Change the configuration so that the same partition controls both cards in the card set. This ends the procedure.3. Ensure that the SCSI cable on the last port of the storage I/O adapter is connected to the auxiliary cache I/O adapter. Do the following: a. Use the location of the storage I/O adapter displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the storage I/O adapter. See The System Reference Code (SRC) Format Description. b. Determine the location of the storage I/O adapter. See Chapter 3, “Locating FRUs,” on page 335. c. Ensure that the SCSI cable on the last port of the storage I/O adapter is properly connected to an auxiliary cache I/O adapter. d. Ensure that both the auxiliary cache I/O adapter and the storage I/O adapter are in the same partition. e. Ensure that the slot power indicator is lit for the auxiliary cache I/O adapter. If it is not, use concurrent maintenance to power on the slot.4. Ensure that the auxiliary cache I/O adapter is supported for the storage I/O adapter to which it is connected. See the PCI adapter installation instructions for information about which adapters are compatible. 88 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Replace the SCSI cable on the last port of the storage I/O adapter that connects to the auxiliary cache I/O adapter. See Chapter 8, “Part Information,” on page 691 for cable part number information. If this does not fix the problem, replace the auxiliary cache I/O adapter. This ends the procedure.IOPIP41 Use this procedure to correct the problem when an auxiliary cache I/O adapter is not connected to a storage I/O adapter or when an auxiliary cache I/O adapter is connected to an incompatible or non-operational storage I/O adapter. Perform the following: 1. Are you working on a 571F/575B combination storage and auxiliary cache IOA card set (uses two card slot locations)? v Yes: Continue with step 2. v No: Continue with step 3.2. Find the location of the card. v Use the location displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the I/O adapter. See The System Reference Code (SRC) Format Description. v The location identified is for the 575B side of the card set.You must configure both the 571F and the 575B in the same partition. Are both the 571F side of the card set and the 575B side of the card set configured in the same partition? v Yes: Replace the entire card set. See Chapter 3, “Locating FRUs,” on page 335 for the model you are working on. This ends the procedure. v No: Change the configuration so that the same partition controls both cards in the card set. This ends the procedure.3. Ensure that the SCSI cable of the auxiliary cache I/O adapter is connected to the last port of the storage I/O adapter. Do the following: a. Use the location of the auxiliary cache I/O adapter displayed in the Service Action Log. If the Service Action Log does not have a location, determine the address of the auxiliary cache I/O adapter. See The System Reference Code (SRC) Format Description. b. Determine the location of the auxiliary cache I/O adapter. See Chapter 3, “Locating FRUs,” on page 335 for the model you are working on. c. Ensure that the SCSI cable of the auxiliary cache I/O adapter is properly connected to the last port of the storage I/O adapter. d. Ensure that both the auxiliary cache I/O adapter and the storage I/O adapter are in the same partition. e. Ensure that the slot power indicator is lit for the storage I/O adapter. If it is not, use concurrent maintenance to power on the slot.4. Did you just replace the auxiliary cache I/O adapter because of a failure and did the new replacement auxiliary cache I/O adapter log a 9073 URC? v Yes: The SCSI bus on the storage I/O adapter may be disabled as a result of the initial failure. Use Hardware Service Manager to re-IPL the storage I/O adapter that is connected to the new replacement auxiliary cache I/O adapter. This ends the procedure. v No: Continue with the next step.5. Ensure that the auxiliary cache I/O adapter is supported for the storage I/O adapter to which it is connected. See the PCI adapter installation instructions for information about which adapters are compatible. 6. Replace the SCSI cable that connects the auxiliary cache I/O adapter to the storage I/O adapter. See Chapter 8, “Part Information,” on page 691 for cable part number information. If this does not fix the problem, replace the storage I/O adapter. This ends the procedure. Chapter 2. Isolation Procedures 89 Licensed internal code (LIC) isolation procedures Use this section to isolate licensed internal code (LIC) problems. Please read and observe all safety procedures before servicing the system and while preforming a procedure. Attention: Unless instructed otherwise, always power off the system or expansion tower where the FRU is located before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “LICIP01” on page 91Licensed Internal Code detected an IOP programming problem. “LICIP03” on page 92Dedicated service tools (DST) found a permanent program error, or a hardware failure occurred. “LICIP04” on page 93The initial program load (IPL) service function ended. “LICIP07” on page 93The system detected a problem while communicating with a specific I/O processor. 90 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “LICIP08” on page 96Licensed Internal Code detected an operating system program problem. “LICIP11” on page 96Use this procedure to isolate a system STARTUP failure in the initial program load (IPL) mode. “LICIP12” on page 107Use this procedure to isolate an Independent Auxiliary Storage Pool (IASP) vary on failure. “LICIP13” on page 111A disk unit seems to have stopped communicating with the system. “LICIP14” on page 116Licensed Internal Code detected a card slot test failure. “LICIP15” on page 117Use this procedure to help you recover from an initial program load (IPL) failure. “LICIP16” on page 119Use this procedure to identify an adapter that is operational but is not located in the same partition as its associated adapter.LICIP01 Licensed Internal Code detected an IOP programming problem. You will need to gather data to determine the cause of the problem. If using OptiConnect, and the IOP is connected to another system, then collect this information from both systems. Read the “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Is the system operational: Did the SRC come from the Service Action Log, Product Activity Log, problem log, or system operator message? v No: Go to step 9 on page 92. v Yes: Is this a x6xx5121 SRC? No: Continue with the next step. Yes: Go to step 4. 3. If the IOP has DASD attached to it, then the IOP dump is in SID87 (or SID187 if the DASD is mirrored). Copy the IOP dump. See “Working with Storage Dumps” on page 636. 4. Print the Product Activity Log, including any IOP dumps, to removable media for the day which the problem occurred. Select the option to obtain HEX data. 5. Use the ?Licensed Internal Code log? service function under DST/SST to copy the LIC log entries to removable media for the day that the problem occurred. 6. Copy the system configuration list. See “Printing the System Configuration List” on page 643. 7. Provide the dumps to Service Support. 8. Check the Logical Hardware Resource STATUS field using Hardware Service Manager. If the status is not Operational then IPL the IOP using the I/O Debug option. Ignore resources with a status of not connected. To IPL a failed IOP, the following command can be used: VRYCFG CFGOBJ(XXXX) CFGTYPE(*CTL) STATUS(*RESET) or use DST/SST Hardware Service Manager. If the IPL does not work: v Check the Service Action Log for new SRC entries. See “Using the Service Action Log” on page 609. Use the new SRC and perform problem analysis to correct the problem. v If there are no new SRCs in the Service Action Log, contact your next level of support. This ends the procedure. Chapter 2. Isolation Procedures 91 9. Has the system stopped but the DST console is still active: Did the SRC come from the Main Storage Dump manager screen on the DST console? Yes: Continue with the next step. No: Go to step 15.10. Complete a Problem Summary Form using the information in words 1-9 from the control panel, or from the DST Main Storage Dump screen. 11. The system has already taken a partial main storage dump for this SRC and automatically re-IPLed to DST. 12. Copy the main storage dump to tape. See “Working with Storage Dumps” on page 636. 13. When the dump is completed, the system will re-IPL automatically. Sign on to DST or SST. Obtain the data in steps 3 on page 91, 4 on page 91, 5 on page 91, and 6 on page 91. 14. Provide the dumps to Service Support. This ends the procedure. 15. Has the system stopped with an SRC at the control panel? Yes: Continue with the next step. No: Go to step 2 on page 91.16. Complete a Problem Summary Form using the information in words 1-9 from the control panel, or from the DST Main storage dump screen. 17. Do NOT power off the system. Perform a manual IPL to DST, and start the Main storage dump manager service function. 18. Copy the main storage dump to tape. 19. Obtain the data in steps 3 on page 91, 4 on page 91, 5 on page 91, and 6 on page 91. 20. Re-IPL the system. 21. Has the system stopped with an SRC at the control panel? Yes: Using the new SRC, perform problem analysis to correct the problem. This ends the procedure. No: Provide the dumps to Service Support. This ends the procedure.LICIP03 Dedicated service tools (DST) found a permanent program error, or a hardware failure occurred. Read the danger notices in the “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Does URC 50FF occur? No: Continue with the next step. Yes: Go to step 4.3. Perform a main storage dump, then perform an IPL by doing the following: a. Perform a main storage dump. See “Working with Storage Dumps” on page 636. b. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. c. When the IPL has completed, the system console should be at the DST display. Go to step 7 on page 93. If the DST display does not appear on the system console, contact your next level of support.4. Perform a main storage dump, then perform an IPL by doing the following: a. Perform a main storage dump. See “Working with Storage Dumps” on page 636. b. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640.Does a display appear? 92 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Continue with the next step. Yes: Go to step 7.5. If the problem is in the logical partition, exchange the first workstation I/O processor card on bus 1. See Removing and Replacing FRUs. 6. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. Is the IPL or Install the System display shown? Yes: Continue with the next step. No: Ask your next level of support for assistance and report the problem. This ends the procedure.7. Copy the main storage dump to removable media. See “Working with Storage Dumps” on page 636. 8. Report a Licensed Internal Code problem to your next level of support. This ends the procedure.LICIP04 The initial program load (IPL) service function ended. Dedicated service tools (DST) was in the disconnected status or lost communications with the IPL console because of a console failure and could not communicate with the user. Read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Select function 21 (Make DST Available) on the control panel and press Enter to start DST again. Does the DST Sign On display appear? v No: Continue with the next step. v Yes: Perform the following steps (see “Dedicated Service Tools (DST)” on page 637 for details): a. Select Start a Service Tool ? Licensed Internal Code log. b. Perform a dump of the Licensed Internal Code log to tape. See “Start a service tool” on page 642 for details. c. Return here and continue with the next step.3. Perform a main storage dump. See “Working with Storage Dumps” on page 636 for details. 4. Copy the main storage dump to removable media. See “Working with Storage Dumps” on page 636 for details. 5. Report a Licensed Internal Code problem to your next level of support. This ends the procedure.LICIP07 The system detected a problem while communicating with a specific I/O processor. The problem could be caused by Licensed Internal Code, the I/O processor card, or by bus hardware. Read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Did a previous procedure have you power off the system, perform an IPL in Manual mode, and is the system in Manual mode now? v Yes: Continue with the next step. v No: Perform the following steps: a. Power off the system. See “Powering on and powering off” on page 588 for details. Chapter 2. Isolation Procedures 93 b. Select Manual mode on the control panel. See “IPL type, mode, and speed options” on page 642 for details. c. Power on the system. d. Continue with the next step. 3. Does the SRC that sent you to this procedure appear on the control panel? v No: Continue with the next step. v Yes: Use the information in the SRC to determine the card direct select address. If the SRC is B6006910, you can use the last 8 characters of the top 16 character line of function 13 (word 7) to find the card direct select address in BBBBCcbb format. BBBB Bus number Cc Card direct select address bb board address Go to step 11 on page 95. 4. Does the console display indicate a problem with missing disks? Yes: Continue with the next step. No: Go to step 6. 5. Perform the following steps: a. Go to the DST main menu. b. On the DST sign-on display, enter the DST full authority user ID and password. See “Dedicated Service Tools (DST)” on page 637 for details. c. Select Start a service tool ? Hardware service manager. d. Check for the SRC in the service action log. See “Using the Service Action Log” on page 609. Did you find the same SRC that sent you to this procedure? v Yes: Note the date and time for that SRC. Go to the Product Activity Log and search all logs to find the same SRC. When you have found the SRC, go to step 9 on page 95. v No: Perform the following steps: 1) Return to the DST main menu. 2) Perform an IPL and return to the Display Missing Disk Units display. 3) Go to “LICIP11” on page 96. This ends the procedure. 6. Does the SRC that sent you to this procedure appear on the console or on the alternative console? v Yes: Continue with the next step. v No: Does the IPL complete successfully to the IPL or Install the System display? Yes: Continue with the next step. No: A different SRC occurred. Use the new SRC to correct the problem. This ends the procedure. 7. Perform the following: a. Use the full-authority password to sign on to DST. b. Search All logs in the product activity log looking for references of SRC B600 5209 and the SRC that sent you to this procedure. Note: Search only for SRCs that occurred during the last IPL.Did you find B600 5209 or the same SRC that sent you to this procedure? v Yes: Go to step 10 on page 95. v No: Did you find a different SRC than the one that sent you to this procedure? Yes: Continue with the next step. No: The problem appears to be intermittent. Ask your next level of support for assistance. This ends the procedure. 8. Use the new SRC to correct the problem. This ends the procedure. 94 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Use F11 to move through alternative views of the log analysis displays until you find the card position and frame ID of the failing IOP associated with the SRC. Was the card position and frame ID available, and did this information help you find the IOP? No: Continue with the next step. Yes: Go to step 12.10. Perform the following steps: a. Display the report for the log entry of the SRC that sent you to this procedure. b. Display the additional information for the entry. c. If the SRC is B6006910, use characters 9-16 of the top 16 character line of function 13 (word 7) to find the card direct select address in BBBBCcbb format. BBBB Bus number Cc Card direct select address bb board address 11. Use the BBBBCcbb information and refer to Chapter 3, “Locating FRUs,” on page 335 to determine the failing IOP and its location. 12. Go to “MABIP55” on page 43 to isolate an I/O adapter problem on the IOP you just identified. If this fails to isolate the problem, return here and continue with the next step. 13. Is the I/O processor card you identified in step 9 or step 11 the CFIOP? v No: Continue with the next step. v Yes: Perform the following steps: a. Exchange the failing CFIOP card. See Chapter 3, “Locating FRUs,” on page 335. Note: You will be prompted for the system serial number. Ignore any error messages regarding system configuration that appear during the IPL. b. Go to step 16.14. Perform the following steps: a. Power off the system. b. Remove the IOP card. c. Power on the system.Does the SRC that sent you to this procedure appear on the control panel or appear as a new entry in the service action log or product activity log? v No: Continue with the next step. v Yes: Perform the following steps: a. Power off the system. b. Install the IOP card you just removed. c. Replace the I/O backplane (Un-P1). This ends the procedure.15. Perform the following steps: a. Power off the system. b. Exchange the failing IOP card.16. Power on the system. Does the SRC that sent you to this procedure appear on the control panel, on the console, or on the alternative console? No: Continue with the next step. Yes: Go to step 18 on page 96.17. Does a different SRC appear on the control panel, on the console, or on the alternative console? v Yes: Use the new SRC to correct the problem. This ends the procedure. v No: On the IPL or Install the System display, check for the SRC in the service action log. See “Using the Service Action Log” on page 609 for details. Chapter 2. Isolation Procedures 95 Did you find the same SRC that sent you to this procedure? Yes: Continue with the next step. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.18. Perform the following steps: a. Power off the system. b. Remove the IOP card you just exchanged and install the original card. c. Go to (Bus-PIP1). This ends the procedure.19. Ask your next level of support for assistance and report a Licensed Internal Code problem. You may be asked to verify that all PTFs have been applied. If you are asked to perform the following, see the following: v Copy the main storage dump from disk to tape or diskette, see “Working with Storage Dumps” on page 636. v Print the product activity log, see “Using the product activity log” on page 617. v Copy the IOP storage dump to removable media, see “Working with Storage Dumps” on page 636. This ends the procedure.LICIP08 Licensed Internal Code detected an operating system program problem. Read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. Does the same SRC occur? v Yes: Go to step 5. v No: Does the same URC appear on the console? No: Continue with the next step. Yes: Go to step 4.3. Does a different SRC occur, or does a different URC appear on the console? v Yes: Use the new SRC or reference code to correct the problem. If the procedure for the new SRC sends you back to this procedure, then continue with the next step. This ends the procedure. v No: Select Perform an IPL on the IPL or Install the System display to complete the IPL. Is the problem intermittent? Yes: Continue with the next step. No: This ends the procedure.4. Copy the main storage dump to removable media. See “Working with Storage Dumps” on page 636. 5. Report a Licensed Internal Code problem to your next level of support. This ends the procedure.LICIP11 Use this procedure to isolate a system STARTUP failure in the initial program load (IPL) mode. Ensure you have read the danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. 96 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) How to find the cause code 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. 2. Were you given a cause code by another procedure? No: Continue with the next step. Yes: Use the cause code given by the other procedure. Then, go to step 4.3. Look at the Data display characters in word 3. You can obtain these characters by either: v Looking at word 3 on the Problem summary form that was filled out earlier. v Selecting characters 9-16 of the top 16 character line of function 12 (word 3).4. The 4 leftmost characters of word 3 represent the cause code. Select the cause code to go to the correct isolation instructions: “0001” “0010” on page 103 “0020” on page 105 “0031” on page 106 “0002” “0011” on page 103 “0021” on page 105 “0033” on page 106 “0004” on page 100 “0012” on page 103 “0022” on page 105 “0034” on page 106 “0005” on page 101 “0015” on page 103 “0023” on page 105 “0035” on page 107 “0006” on page 101 “0016” on page 103 “0024” on page 105 “0037” on page 107 “0007” on page 101 “0017” on page 103 “0025” on page 106 “0099” on page 107 “0008” on page 101 “0018” on page 103 “0026” on page 106 “0009” on page 101 “0019” on page 103 “0027” on page 106 “000A” on page 101 “001A” on page 104 “000B” on page 102 “001C” on page 104 “000C” on page 102 “001D” on page 104 “000D” on page 102 “001E” on page 104 “000E” on page 102 “001F” on page 104 0001 Disk configuration is missing. 1. Select Manual mode and perform an IPL to DST for the failing partition (see “Performing an IPL to DST” on page 640). Does the Disk Configuration Error Report display appear? Yes: Continue with the next step. No: The IPL completed successfully. This ends the procedure.2. Is Missing Disk Configuration information displayed? Yes: Continue with the next step. No: Go to step 1 on page 98 for cause code 0002.3. On the Missing Disk Configuration display, perform the following: a. Select option 5 ? Display Detailed Report ? Work with disk units ? Work with disk unit recovery ? Recover Configuration. b. Follow the instructions on the display. After the disk configuration is recovered, the system automatically performs an IPL. This ends the procedure.0002 Disk units are missing from the disk configuration. Data from the control panel can be used to find information about the missing disk unit. Chapter 2. Isolation Procedures 97 1. Did you enter this procedure because all the devices listed on the Display Missing Units display (reached from the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display) have a reference code of 0000? No: Continue with the next step. Yes: Go to step 20 on page 100. 2. Have you installed a new disk enclosure in a disk unit and not restored the data to the disk unit? No: Continue with the next step. Yes: Ignore SRC A600 5090. Continue with the disk unit exchange recovery procedure. This ends the procedure. 3. Use words 1-9 from the information recorded on the Problem summary form to determine the disk unit that is missing from the configuration: v Characters 1-8 of the bottom 16 character line of function 12 (word 4) contain the IOP direct select address. v Characters 1-8 of the top 16 character line of function 13 (word 6) contains the disk unit type, level and model number. v Characters 9-16 of the top 16 character line of function 13 (word 7) contains the disk unit serial number. Note: For 2105 and 2107 disk units, the 5 rightmost characters of word 7 contain the disk unit serial number. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the number of missing disk units. Are the problem disk units 432x, 433x, 660x, 671x, or 673x disk units? No: Continue with the next step. Yes: Go to step 5. 4. Attempt to get all devices attached to the MSIOP to Ready status by performing the following: a. The MSIOP address (MSIOP Direct Select Address) to use is characters 1-8 of the bottom 16 character line of function 12 (word 4). b. Verify the following and correct if necessary before continuing with step 10 on page 99. v All cable connections are made correctly and are tight. v All storage devices have the correct signal bus address, as indicated in the system configuration list. v All storage devices are powered on and ready. 5. Did you enter this procedure because there was an entry in the Service Action Log which has the reference code B6005090? Yes: Continue with the next step. No: Go to step 10 on page 99. 6. Are customer jobs running on the system now? Yes: Continue with the next step. No: Ensure that the customer is not running any jobs before continuing with this procedure. Then go to step 10 on page 99. 7. Select System Service Tools (SST) ? Work with disk units ? Display disk configuration ? Display disk configuration status. Are any disk units missing from the configuration (indicated by an asterisk *)? Yes: Continue with the next step. No: This ends the procedure. 8. Do all of the disk units that are missing from the configuration have a status of ?Suspended?? Yes: Continue with the next step. 98 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Ensure that the customer is not running any jobs before continuing with this procedure. Then go to step 10. 9. Use the Service Action Log to determine if there are any entries for the missing disk units (see “Using the Service Action Log” on page 609). Are there any entries in the Service Action Log for the missing disk units that were logged since the last IPL? Yes: Use the information in the Service Action Log, and look up the reference code to read its description.). Perform the action indicated for the unit reference code. This ends the procedure. No: Go to step 21 on page 100.10. Select Manual mode and perform an IPL to DST for the failing partition (see “Performing an IPL to DST” on page 640). Does the Disk Configuration Error Report, the Disk Configuration Attention Report, or the Disk Configuration Warning Report display appear? Yes: Continue with the next step. No: The IPL completed successfully. This ends the procedure.11. Does one of the following messages appear in the list? v Missing disk units in the configuration v Missing mirror protected disk units in the configuration Yes: Continue with the next step. No: Go to step 16.12. Select option 5. Do the missing units have device parity protected status? (Device parity protection status is indicated by ?DPY/? as the first four characters of the status.) Yes: Continue with the next step. No: Go to step 14.13. Is the status DPY/Active? Yes: Continue with the next step. No: Use the Service Action Log to determine if there are any entries for the missing disk units or the IOA/IOP controlling them. See “Using the Service Action Log” on page 609 for details. This ends the procedure.14. Press F11, and press Enter to display the details. Do all of the disk units listed on the display have a reference code of 0000? Yes: Continue with the next step. No: Use the disk unit reference code shown on the display and look up the reference code.. Perform the action indicated for the unit reference code. This ends the procedure.15. Do all of the IOPs or devices listed on the display have a reference code of 0000? No: Use the IOP reference code shown on the display and look up the reference code.. Perform the action indicated for the reference code. This ends the procedure. Yes: Go to step 20 on page 100.16. Does the following message appear in the list: Unknown load-source status? Yes: Continue with the next step. No: Go to step 18.17. Select option 5, press F11, and then press Enter to display the details. Does the Assign Missing Load Source Disk display appear? No: Continue with the next step. Yes: Press Enter to assign the missing load-source disk unit. This ends the procedure.18. Does the following message appear in the list? Load source failure Yes: Continue with the next step. No: The IPL completed successfully. This ends the procedure. Chapter 2. Isolation Procedures 99 19. Select option 5, press F11, and then press Enter to display the details. 20. The number of failing disk unit facilities (actuators) is the number of disk units displayed. A disk unit has a Unit number greater than zero. Find the failing disk unit by type, model, serial number, or address displayed on the console. 21. Is there more than one failing disk device attached to the IOA or MSIOP? Yes: Continue with the next step. No: Go to step 24.22. Use the SAL to determine if there are any entries that occurred around the time of the A6xx/B6xx 5090 SRC (see “Using the Service Action Log” on page 609). Are there any such entries? No: Continue with the next step. Yes: Use the information in the SAL and look up the reference code.). Perform the action indicated for the unit reference code. This ends the procedure.23. Are all the disk devices that are attached to the IOA or MSIOP failing? (If the disk units are using mirrored protection, select Display Disk Status to find out.) No: Continue with the next step. Yes: Go to step 25.24. Look up the SRC listed below for the specific disk unit and exchange the FRUs shown one at a time. Then return here and answer the question below the listed disk units. Disk Unit SRC to look up 2105 21053002 2107 21073002 432x 432x3002 433x 433x3002 660x 660x3002 671x 671x3002 673x 673x3002 Did the disk unit service information correct the problem? No: Continue with the next step. Yes: This ends the procedure.25. Exchange the IOA or MSIOP (see Chapter 3, “Locating FRUs,” on page 335). If exchanging the IOA or MSIOP did not correct the problem, use the original SRC and exchange the failing items, starting with the highest probable cause of failure. If the failing item list contains FI codes, Look up the FI code to help determine parts and locations. This ends the procedure.0004 Some disk units are unprotected but configured into a mirrored ASP. These units were originally DPY protected but protection was disabled. Perform the following: 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638 ). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. Select Work with disk units and take the actions to protect the system. 100 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) If you do not know what actions to take, select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. When the Disk configuration error report appears, the recovery actions are listed in the Help text for the error message ?Unprotected disk units in a mirrored ASP?. This ends the procedure.0005 A disk unit using parity protection is operating in exposed mode. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Choose from the following options: v If the same reference code appears, ask your next level of support for assistance. v If no reference code appears and the IPL completes successfully, the problem is corrected. v If a different reference code appears, use it to perform problem analysis and correct the new problem. This ends the procedure.0006 There are new devices attached to the system that do not have Licensed Internal Code installed. Ask your next level of support for assistance. 0007 Some of the configured disk units have device parity protection disabled when the system expected device parity protection to be enabled. 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Correct device parity protection. b. Follow the online instructions. This ends the procedure.0008 A disk unit has no more alternate sectors to assign. 1. Determine the failing unit by type, model, serial number or address given in words 4-7. See The System Reference Code (SRC) Format Description. 2. See the service information for the specific storage device. Use the disk unit reference code listed below for service information entry. 432x 102E, 433x 102E, 660x 102E, 671x 102E, 673x 102E ( Look up a reference code.).This ends the procedure.0009 The procedure to restore a disk unit from the tape unit did not complete. Continue with the disk unit exchange recovery procedure. 000A There is a problem with a disk unit subsystem. As a result, there are missing disk units in the system. Perform the following: 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. Chapter 2. Isolation Procedures 101 No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. On the Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. 3. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. 4. Use the defaults on the Select Analysis Report Options display by pressing Enter. 5. Search the entries on the Log Analysis Report display for system reference codes associated with the missing disk units. 6. Go to Reference codes to correct the problem. This ends the procedure.000B Some system IOPs require cache storage be reclaimed. 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. Reclaim the cache adapter card storage. See “Reclaiming IOP cache storage” on page 636. Note: The system operator may want to restore data from the most recent saved tape after you complete the repair. This ends the procedure.000C One of the mirror protected disk units has no more alternate sectors to assign. 1. Determine the failing unit by type, model, serial number or address given in words 4-7. See “System Reference Code (SRC) Information” on page 601. 2. See the service information for the specific storage device. Use the disk unit reference code listed below for service information entry. 432x 102E, 433x 102E, 660x 102E, 671x 102E, 673x 102E ( Look up a reference code.). This ends the procedure.000D The system disk capacity has been exceeded. For more information about disk capacity, see iSeries Handbook, GA19-5486-20. 000E Start compression failure. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Recover from start compression failure. b. Follow the on-line instructions. This ends the procedure. 102 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 0010 The disk configuration has changed. The operating system must be installed again, and all customer data must be restored. 1. Select Manual mode on the control panel. 2. Perform an IPL to reinstall the operating system. 3. The customer must restore all data from the latest system backup. This ends the procedure.0011 The serial number of the control panel does not match the system serial number. 1. Select Manual mode on the control panel. 2. Perform an IPL. You will be prompted for the system serial number. This ends the procedure.0012 The operation to write the vital product data (VPD) to the control panel failed. Exchange the multiple function I/O processor card. See Chapter 3, “Locating FRUs,” on page 335 for the model you are working on for the location of the card and a link to the remove and replace procedure. 0015 The mirrored load-source disk unit is missing from the disk configuration. Go to step 1 on page 98 for cause code 0002. 0016 A mirrored protected disk unit is missing. Wait six minutes. If the same reference code appears, go to step 1 on page 98 for cause code 0002. 0017 One or more disk units have a lower level of mirrored protection than originally configured. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Review the detailed display, which shows the new and the previous levels of mirrored protection. This ends the procedure.0018 Load-source configuration problem. The load-source disk unit is using mirrored protection and is configured at an incorrect address. Ensure that the load-source disk unit is in device location 1. 0019 One or more disk units were formatted incorrectly. The system will continue to operate normally. However, it will not operate at optimum performance. To repair the problem, perform the following: 1. Record the unit number and serial number of the disk unit that is formatted incorrectly. 2. Sign on to DST. See “Accessing Dedicated Service Tools” on page 637. 3. Select Work with disk units ? Work with disk unit configuration ? Remove unit from configuration. 4. Select the disk unit you recorded earlier in this procedure. 5. Confirm the option to remove data from the disk unit. This step may take a long time because the data must be moved to other disk units in the auxiliary storage pool (ASP). 6. When the remove function is complete, select Add unit to configuration. Chapter 2. Isolation Procedures 103 7. Select the disk unit you recorded earlier in this procedure. 8. Confirm the add. The disk unit is formatted during functional operation. This ends the procedure.001A The load-source disk unit data is down-level. The load-source disk unit is mirror protected. The system is using the load-source disk unit that does not have the current level of data. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. Does the Disk Configuration Error Report display appear? No: The system is now using the correct load-source. This ends the procedure. Yes: Continue with the next step.2. Does a ?Load source failure? message appear in the list? Yes: Continue with the next step. No: The system is now using the correct load-source. This ends the procedure.3. Select option 5, press F11, and then press Enter to display details. The load-source type, model, and serial number information that the system needs is displayed on the console. Is the load-source disk unit (displayed on the console) attached to an MSIOP that cannot be used for a load-source? Yes: Contact your next level of support. This ends the procedure. No: The load-source disk unit is missing. Go to step 1 on page 98 for cause code 0002.001C The disk units that are needed to update the system configuration are missing. Perform an IPL by doing the following: 1. Select Manual mode on the control panel. 2. Perform an IPL. Use the IPL information to determine the cause of the problem. This ends the procedure.001D 1. Is the Disk Configuration Attention Report, or the Disk Configuration Warning Report displayed? Yes: Continue with the next step. No: Ask your next level of support for assistance. This ends the procedure.2. On the Bad Load Source Configuration message line, select 5, and press Enter to rebuild the load-source configuration information. If there are other types of warnings, select option 5 on the warnings, and correct the problem. This ends the procedure.001E The load-source data must be restored. 001F Licensed Internal Code was installed on the wrong disk unit of the load-source mirrored pair. The system performed an IPL on a load source that may not contain the same level of Licensed Internal Code that was installed on the other load source. The type, model, and address of the active device are displayed in words 4-7 of the SRC. Choose from the following options: 104 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 1. If the load-source disk unit in position 1 contains the correct level of Licensed Internal Code, perform the following: a. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. Is the Disk Configuration Attention Report or Disk Configuration Warning Report displayed? Yes: Select option 5 on the Incorrect Licensed Internal Code Install message line. When the Display Incorrect Licensed Internal Code Install display appears on the console, press Enter. No: The system is now using the correct load source. This ends the procedure.2. If the load-source disk unit in position 1 of the system unit does not contain the correct level of Licensed Internal Code, restore the Licensed Internal Code to the disk unit in position 1 of the system unit. This ends the procedure.0020 The system appears to be a one disk unit system. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 0021 The system password verification failed. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. When prompted, enter the correct system password. If the correct system password is not available perform the following: a. Select Bypass the system password. b. Have the customer contact the marketing representative immediately to order a new system password from your service provider. This ends the procedure.0022 A different compression status was expected on a reporting disk unit. Accept the warning. The reported compression status will be used as the current compression status. 0023 There is a problem with a disk unit subsystem. As a result, there are missing disk units in the system. The system is capable of IPLing in this state. 1. Is the system managed by an HMC? Yes: Select DST by performing the HMC action for Function 21 for the failing partition (see “Control panel function codes on the HMC” on page 638). Then continue with the next step. No: Select DST using Function 21 for the failing partition (see “Selecting function 21 from the control panel” on page 638 in “Service functions” on page 652). Then continue with the next step.2. On the Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. 3. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. 4. Use the defaults on the Select Analysis Report Options display by pressing Enter. 5. Search the entries on the Log Analysis Report display for system reference codes associated with the missing disk units. 6. Go to the Reference codes topic and use the SRC information to correct the problem. This ends the procedure.0024 The system type or system unique ID needs to be entered. Chapter 2. Isolation Procedures 105 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. When prompted, enter the correct system type or system unique ID. This ends the procedure.0025 Hardware Resource Information Persistence disabled. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. Contact your next level of support for instructions on how to enable the Hardware Resource Information Persistence function. This ends the procedure.0026 A disk unit is incorrectly configured for an LPAR system. 1. Select Manual mode and perform an IPL to DST for the failing partition. See “Performing an IPL to DST” on page 640. 2. On the Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. 3. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. 4. Use the defaults on the Select Analysis Report Options display by pressing Enter. 5. Search the entries on the Log Analysis Report display for system reference codes (B6xx 53xx) that are associated with the error. 6. Using the SRC information, look up the reference code. and use the information to correct the problem. This ends the procedure.0027 The user ASP has overflowed. Contact your next level of support. 0031 A problem was detected with the installation of Licensed Internal Code service displays. The cause may be defective media, the installation media being removed too early, a device problem or a Licensed Internal Code problem. v Ask your next level of support for assistance. Characters 13-16 of the top 16 character line of function 12 (4 rightmost characters of word 3) contain information regarding the install error. v If the customer does not require the service displays to be in the national language, you may be able to continue by performing another system IPL. This ends the procedure.0033 System model not supported. This model of hardware does not support the System Licensed Internal Code version and release that is being used. Use a supported version and release of the System Licensed Internal Code. 0034 Insufficient main storage capacity. There is not enough main storage capacity. For details about how much more capacity is required, see the ?Insufficient Main Storage Capacity? screen, which is displayed when the system is IPLed in manual mode. Typically, this error occurs when you have moved memory between logical partitions, and one partition no longer has a sufficient amount of main storage. 106 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 0035 Data from a User ASP has overflowed into the System ASP (ASP 1). There is not enough free space in the User ASP to move the overflowed data from the System ASP back into the User ASP. The system will continue to run in this condition, but if a disk failure in the System ASP causes the System ASP to be cleared, the data in the User ASP will also be cleared out. You should delete some files or objects from the User ASP so that enough free space exists in the User ASP to allow the data that is overflowed into the System ASP to be moved back. 0037 One or more functional connections to a disk unit in a multi-path environment have not been detected. The connections to the disk unit were established by running ESS Specialist. If you use the server in this state, you may cause a loss of data. You must ensure that all of the functional connections are still established between the disk and the Input/Output Adapters (IOAs) attached to this server and this logical partition. If there is an IOA which has a connection to the disk unit that has been moved to a different logical partition or different server, you should not continue with the IPL. Notify your next level of support. 0099 A Licensed Internal Code program error occurred. Ask your next level of support for assistance. LICIP12 Use this procedure to isolate an Independent Auxiliary Storage Pool (IASP) vary on failure. Message CPDB8E0 occurred if the user attempted to vary on the IASP. Read the Danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. How to find the cause code 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Were you given a cause code by another procedure? No: Continue with the next step. Yes: Use the cause code given by the other procedure. Then go to step 4.3. Look at the characters in word 3. You can obtain these characters by doing the following: a. On the command line, enter the Start System Service Tools (STRSST) command. If you cannot get to SST, use function 21 to get to DST. Do not IPL the system to get to DST. b. On the Start Service Tools Sign On display, type in a User ID with service authority and password. c. Select Start a Service Tool ? Hardware Service Manager ? Work with service action log. d. On the Select Timeframe display, change the From: Date and Time to a date and time prior to when the user attempted to vary on the IASP. e. Search for a B6005094 system reference code that occurred at the time the user attempted to vary on the IASP. Display the failing item information for this entry. f. Select the function key for Additional details. g. The 4 leftmost characters of word 3 is the cause code to be used in this procedure.4. Find the cause code below: “0002” on page 108 “000A” on page 110 “002C” on page 110 “0030” on page 111 “0004” on page 109 “000B” on page 110 “002D” on page 111 “0032” on page 111 Chapter 2. Isolation Procedures 107 “0007” on page 109 “000D” on page 110 “002E” on page 111 “0099” on page 111 “0009” on page 110 “000E” on page 110 “002F” on page 111 0002 Disk units are missing from the IASP disk configuration. 1. Have you installed a new disk enclosure in a disk unit and not restored the data to the disk unit? v No: Continue with the next step. v Yes: Ignore SRC A600 5094. Continue with the disk unit exchange recovery procedure. This ends the procedure. 2. Use words 1-9 from the information in the Service Action Log to determine the disk unit that is missing from the configuration: v Word 4 contains the IOP direct select address. v Word 5 contains the unit address. v Word 6 contains the disk unit type, level and model number. v Word 7 contains the disk unit serial number. v Word 8 contains the number of missing disk units. Are the problem disk units 432x, 660x, or 671x Disk Units? v Yes: Continue with the next step. v No: Attempt to get all devices attached to the IOP to Ready status by performing the following: a. The IOP address (IOP Direct Select Address) to use is Word 4. b. Verify the following, and correct if necessary: – Ensure all cable connections are made correctly and are tight. – Ensure the configuration within the device is correct. – Ensure all storage devices are powered on and ready.c. Continue with the next step. 3. Perform the following: Select System Service Tools (SST) ? Work with disk units ? Display disk configuration ? Display disk configuration status. Are any disk units missing-indicated with an asterisk (*)- from the IASP configuration? Yes: Continue with the next step. No: This ends the procedure. 4. Use the Service Action Log to determine if there are any entries other than B6xx 5094 for the missing disk units or the IOA or IOP that is controlling them. See “Using the Service Action Log” on page 609. Are there any entries in the Service Action Log other than B6xx 5094 for the missing disk units or the IOA or IOP that is controlling them? No: Continue with the next step. Yes: Use the information in the Service Action Log to solve the problem. See “Using the Service Action Log” on page 609. This ends the procedure. 5. Did you enter this procedure because there was a B6xx 5094 cause code of 0030? v No: Continue with the next step. v Yes: Work with the customer to recover the unknown configuration source disk unit. Use a workstation with System i Navigator installed to select the disk pool with the problem, and then select Recover unknown configuration source for this disk pool. This ends the procedure. 108 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Use Hardware Service Manager to display logical resources connected to the IOP. See “Hardware Service Manager” on page 644. 7. Is every device attached to the IOP failing? v Yes: Continue with the next step. v No: Are all of the disk units that are attached to one IOA missing? – No: Continue with the next step. – Yes: Exchange the IOA. Use the IOP direct select address and the first character of the unit address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. 8. Is there more than one storage IOA attached to the IOP? v Yes: Exchange the IOP. Use the IOP direct select address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. This ends the procedure. v No: Go to step 10. 9. Go to the service information for the specific disk unit that is listed below and perform the action indicated. Then return here and answer the following question. v 2105 Disk Units: Use SRC 3002 exchange the FRUs shown one at a time. v 432x, 660x, 671x Disk Units: Use SRC 3002 and exchange the FRUs shown one at a time. Did the disk unit service information correct the problem? No: Continue with the next step. Yes: This ends the procedure.10. Perform the following: a. Exchange the IOA. Use the IOP direct select address and the first character of the unit address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. b. If exchanging the IOA does not correct the problem, exchange the IOP. Use the IOP direct select address from step 2 on page 108 to find the location. See Chapter 3, “Locating FRUs,” on page 335. c. If exchanging the IOP does not correct the problem, exchange the failing items in the following FRU list starting with the first item in the list. 1) FI01140 2) System backplane 3) FI00580 4) AJDG301This ends the procedure.0004 Some disk units are unprotected but configured into a mirrored IASP. These units were originally DPY protected but protection was disabled. Direct the customer to take the actions necessary to start protection on these disk units. This ends the procedure. 0007 Some of the configured disk units have device parity protection disabled when the system expected device parity protection to be enabled. 1. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. 2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Correct device parity protection mismatch. Chapter 2. Isolation Procedures 109 b. Follow the on-line instructions. This ends the procedure.0008 A disk unit has no more alternate sectors to assign. 1. Determine the failing unit by type, model, serial number or address given in words 4-7. See The System Reference Code (SRC) Format Description. 2. See the service information for the specific storage device. Use the disk unit reference code listed below for service information entry. 432x 102E, 660x 102E, 671x 102E This ends the procedure.0009 The procedure to restore a disk unit from the tape unit did not complete. Continue with the disk unit exchange recovery procedure. This ends the procedure. 000A There is a problem with a disk unit subsystem. As a result, there are missing disk units in the system. Use the Service Action Log to find system reference codes associated with the missing disk units by changing the From: Date and Time on the Select Timeframe display to a date and time prior to when the user attempted to vary on the IASP. For information on how to use the Service Action Log, see “Using the Service Action Log” on page 609. This ends the procedure. 000B Some system IOPs require cache storage be reclaimed. 1. Start SST. 2. Reclaim the cache adapter card storage by performing the following: a. Select Work with disk units ? Work with disk unit recovery ? Reclaim IOP Cache Storage. b. Follow the on-line instructions to reclaim cache storage. c. After you complete the repair, the system operator may want to restore data from the most recently saved tape. This ends the procedure.000D The system disk capacity has been exceeded. For more information about disk capacity, see the iSeries Handbook. This ends the procedure. 000E Start compression failure. 1. Select Manual mode and perform an IPL to DST. See “Performing an IPL to DST” on page 640. 2. Correct the problem by doing the following: a. Select Work with disk units ? Work with disk unit recovery ? Recover from start compression failure. b. Follow the on-line instructions. This ends the procedure.002C A Licensed Internal Code program error occurred. Ask your next level of support for assistance. This ends the procedure. 110 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 002D The IASP configuration source disk unit data is down-level. The system is using the IASP configuration source disk unit that does not have the current level of data. Work with the customer to recover the configuration. On a workstation with System i Navigator installed, select the disk pool with the problem, and then select Recover configuration. This ends the procedure. 002E The Independent ASP is assigned to another system or a Licensed Internal Code program error occurred. Work with the customer to check other systems to determine if the Independent ASP has been assigned to it. If the Independent ASP has not been assigned to another system, ask your next level of support for assistance. This ends the procedure. 002F The system version and release are at a different level than the IASP version and release. The system version and release must be upgraded to be the same as the system version and release in which the IASP was created. This ends the procedure. 0030 The mirrored IASP configuration source disk unit has a disk configuration status of unknown and is missing from the disk configuration. Go to step 1 on page 108 for cause code 0002. 0032 A Licensed Internal Code program error occurred. Ask your next level of support for assistance. This ends the procedure. 0099 A Licensed Internal Code program error occurred. Ask your next level of support for assistance. This ends the procedure. LICIP13 A disk unit seems to have stopped communicating with the system. The system has stopped normal operation until the cause of the disk unit failure is found and corrected. Ensure you have read the Danger notices in “Licensed internal code (LIC) isolation procedures” on page 90 before continuing with this procedure. If the disk unit that stopped communicating with the system has mirrored protection active, normal operation of the system stops for one to two minutes. Then the system suspends mirrored protection for that disk unit and continues normal operation. Note: Do not power off the system or partition using the white button, function 08, ASMI, or HMC immediate power-off when performing this procedure. If this procedure or other isolation procedures referenced by this procedure direct you to IPL or power off the system, Chapter 2. Isolation Procedures 111 v perform a partition main storage dump (see “Performing dumps” on page 653), or v if additional dump information is not needed, perform a function 03 IPL or restart the system or partition using the HMC. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Was a problem summary form completed for this problem? No: Continue with the next step. Yes: Use the problem summary form information and go to step 4. 3. Fill out a Problem Reporting Form ... completely with the instructions provided. 4. Recovery from a device command time-out may have caused the communications loss condition (indicated by an SRC on the control panel or in the HMC). This communications loss condition has the following symptoms: v The A6xx SRC does not increment within two minutes. v The system continues to run normally after it recovers from the communications loss condition and the reference code is cleared from the control panel. Does the communication loss condition have the above symptoms? Yes: Continue with the next step. No: Go to step 7. 5. Verify that all Licensed Internal Code PTFs have been applied to the system. Apply any Licensed Internal Code PTFs that have not been applied to the system. Does the intermittent condition continue? Yes: Print all product activity logs. Print the LIC logs with a major code of 1000. Provide this information to your next level of support. This ends the procedure. No: This ends the procedure. 6. Is the storage hosted by another partition? Yes: Contact your next level of support. No: Continue with the next step. 7. A manual reset of the IOP may clear the attention reference code. Perform the following: If you are working from the control panel: a. Select Manual mode on the control panel. b. Select Function 25 and press Enter. c. Select Function 26 and press Enter. d. Select Function 67 and press Enter to reset the IOP. e. Wait 10 minutes. f. Select Function 25 and press Enter to disable the service functions on the control panel.If you are working from the HMC: a. These need to be updated for the new HMC UI.... b. In the Navigation Area, open the Service Applications folder. c. Select Service Focal Point. d. In the contents area, select Service Utilities. e. In the Service Utilities window, select the system you are working on. f. Select Selected ? Operator Panel Service Functions. g. Select the logical partition, and then select Partition Functions. h. Select Disk Unit IOP Reset/Reload (67). i. Wait 10 minutes. 112 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Did the reset successfully clear the control panel SRC or HMC panel value and can commands be entered on the partition console? No: Continue with the next step. Yes: Look for a Service Action Log (SAL) entry since the last IPL, and use it to fix the problem (see “Using the Service Action Log” on page 609). If a B6xx 5090 SRC occurred since the last IPL, look for other SRC entries and take action on them first. This ends the procedure. 8. Is the SRC the same reference code that sent you here? Yes: The same reference code occurred. Continue with the next step. No: Collect all words of the reference code and perform, problem analysis to resolve the new problem. This ends the procedure. 9. Powering off and powering on the affected IOP domain may clear the attention reference code. Perform the following: If you are working from the control panel: a. Select Manual mode on the control panel. b. Select Function 25 and press Enter. c. Select Function 26 and press Enter. d. Select Function 68 and press Enter to power off the domain. e. After the domain has been powered off or 10 minutes have passed, select Function 69 and press Enter to power on the domain. f. Wait 10 minutes. g. Select Function 25 and press Enter to disable the service functions on the control panel.If you are working from the HMC: a. In the Navigation Area, open the Service Applications folder. b. Select Service Focal Point. c. In the contents area, select Service Utilities. d. In the Service Utilities window, select the system you are working on. e. Select Selected ? Operator Panel Service Functions. f. Select the logical partition, and then select Partition Functions. g. Select Power off domain (68). h. After the domain has been powered off or 10 minutes have passed, select Power on domain (69). i. Wait 10 minutes.Did this successfully clear the control panel SRC or HMC panel value, and can commands be entered on the partition console? No: Continue with the next step. Yes: Look for a SAL entry since the last IPL, and use it to fix the problem (see “Using the Service Action Log” on page 609). If a B6xx 5090 SRC occurred since the last IPL, look for other SRC entries and take action on them first. This ends the procedure.10. Is the SRC the same reference code that sent you here? Yes: The same reference code occurred. Continue with the next step. No: Collect all words of the reference code and perform problem analysis to resolve the new problem. This ends the procedure.11. Perform a main storage dump, then perform an IPL by performing the following: If you are working from the control panel: a. Select Manual mode on the control panel. b. Select Function 22 and press Enter to dump the main storage to the load-source disk unit. c. Wait for SRC A100 300x to occur, indicating that the dump is complete. d. Then perform an IPL to DST (see “Performing an IPL to DST” on page 640). Chapter 2. Isolation Procedures 113 If you are working from the HMC: a. In the Navigation Area, open Server and Partition. b. Select Server Management. c. In the contents area, open the server on which the logical partition is located. d. Select Partitions. e. Right-click the logical partition profile and select Restart Partition. f. In the Restart Partition window, select the Dump restart option. Does a different SRC occur, or does a display appear on the console showing reference codes? No: Continue with the next step. Yes: Perform problem analysis to correct the new problem. This ends the procedure.12. Does the same reference code occur? v Yes: Continue with the next step. v No: The problem is intermittent. Perform the following: a. Print the system product activity log for the magnetic storage subsystem and print the LIC logs with a major code of 1000. b. Copy the main storage dump to removable media (see “Managing dumps” on page 653). c. Contact your next level of support and provide them with this information. This ends the procedure.13. Are characters 7-8 of the top 16 character line of function 12 (2 rightmost characters of word 2) equal to 13 or 17? Yes: Continue with the next step. No: Go to step 16.14. Use the word 1 through 9 information recorded on the Problem summary form to determine the disk unit that stopped communicating with the system: v Characters 9-16 of the top 16 character line of function 12 (word 3) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 12 (word 4) contains the unit address. v Characters 1-8 of the top 16 character line of function 13 (word 6) may contain the disk unit type, level and model number. v Characters 13-16 of the top 16 character line of function 13 (4 rightmost characters of word 7) may contain the disk unit reference code. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) may contain the disk unit serial number. Note: For 2105 and 2107 disk units, characters 4-8 of the bottom 16 character line of function 13 (5 rightmost characters of word 8) contain the disk unit serial number.15. Is the disk unit reference code 0000? v No: Using the information from step 14, find the table for the indicated disk unit type. Perform problem analysis for the disk unit reference code. This ends the procedure. v Yes: Perform the following steps: a. Determine the IOP type by using characters 9-12 of the bottom 16 character line of function 13 (4 leftmost characters of word 9). b. Find the unit reference code table for the IOP type. Determine the unit reference code by using characters 13-16 of the bottom 16 character line of function 13 (4 rightmost characters of word 9). c. Perform problem analysis for the unit reference code. This ends the procedure.16. Are characters 7-8 of the top 16 character line of function 12 (the two rightmost characters of word 2) equal to 27? 114 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Yes: Continue with the next step. No: Go to step 20.17. Use the word 1 through 9 information recorded on the Problem summary form to determine the disk unit that stopped communicating with the system: v Characters 9-16 of the top 16 character line of function 12 (word 3) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 12 (word 4) contains the disk unit address v Characters 9-16 of the bottom 16 character line of function 12 (word 5) contains the disk unit type, level and model number. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the disk unit serial number. Note: For 2105 and 2107 Disk Units, characters 4-8 of the bottom 16 character line of function 13 (5 rightmost characters of word 8) contain the disk unit serial number. v Characters 13-16 of the bottom 16 character line of function 13 (4 rightmost characters of word 9) contain the disk unit reference code.18. Is the disk unit reference code 0000? v No: Continue with the next step. v Yes: Find the table for the indicated disk unit type. Then find unit reference code (URC) 3002 in the table, and exchange the FRUs for that URC, one at a time. Note: Do not perform any other isolation procedures that are associated with URC 3002.This ends the procedure.19. Are characters 9-16 of the bottom 16 character line of function 13 (word 9) B6xx 51xx? Yes: Using the B6xx table, perform problem analysis for the 51xx unit reference code. This ends the procedure. No: Using the information from step 17, find the table for the indicated disk unit type. Perform problem analysis for the disk unit reference code. This ends the procedure.20. Are the 2 rightmost characters of word 2 on the Problem summary form equal to 62? No: Use the information in characters 9-16 of the bottom 16 character line of function 13 (word 9) and use this information instead of the information in word 1 for the reference code. This ends the procedure. Yes: Continue with the next step.21. Are characters 9-16 of the top 16 character line of function 12 (word 3) equal to 00010004? Yes: Continue with the next step. No: Go to step 24 on page 116.22. Are characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) equal to 0000? No: Continue with the next step. Yes: Go to step 25 on page 116.23. Note the following: v Characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) contain the disk unit reference code. v Characters 1-8 of the top 16 character line of function 13 (word 6) contains the disk unit address. v Characters 9-16 of the top 16 character line of function 13 (word 7) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the disk unit type, level and model number. Chapter 2. Isolation Procedures 115 Find the table for the disk unit type (characters 1-4 of the bottom 16 character line of function 13 - 4 leftmost characters of word 8), and use characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) as the unit reference code. This ends the procedure. 24. Are characters 9-16 of the top 16 character line of function 12 (word 3) equal to 0002000D? v Yes: Continue with the next step. v No: Use the information in characters 9-16 of the bottom 16 character line of function 13 (word 9), instead of the information in word 1 for the reference code, and perform problem analysis. – Characters 1-8 of the top 16 character line of function 13 (word 6) may contain the disk unit address. – Characters 9-16 of the top 16 character line of function 13 (word 7) may contain the IOP direct select address. – Characters 1-8 of the bottom 16 character line of function 13 (word 8) may contain the disk unit type, level and model number. This ends the procedure.25. Note the following: v Characters 1-8 of the top 16 character line of function 13 (word 6) contains the disk unit address. v Characters 9-16 of the top 16 character line of function 13 (word 7) contain the IOP direct select address. v Characters 1-8 of the bottom 16 character line of function 13 (word 8) contains the disk unit type, level and model number. Find the table for the disk unit type (characters 1-4 of the bottom 16 character line of function 13 (4 leftmost characters of word 8) and use 3002 as the unit reference code. Exchange the FRUs for URC 3002 one at a time. This ends the procedure.LICIP14 Licensed Internal Code detected a card slot test failure. 1. Has the I/O adapter moved to a new card location? Yes: Continue with the next step. No: Go to step 4.2. Perform one of the following, and then continue with the next step: v Use the concurrent maintenance option in Hardware Service Manager in SST/DST to power off, remove, reinsert, and power on the I/O adapter. OR v Power off the system, remove and reinsert the I/O adapter. Then IPL the system.3. Does the reference code occur again for this same I/O adapter? v Yes: Continue with the next step. v No: No further service action is needed. This ends the procedure.4. Move the I/O adapter to a different card location, that has no I/O processors in the PCI bridge set, by performing one of the following, and then continue with the next step: v Use the concurrent maintenance option in Hardware Service Manager in SST/DST to power off, remove the I/O adapter, install the I/O adapter in a different card location, and power on the I/O adapter.OR v Power off the system, remove the I/O adapter, install the I/O adapter in a different card location, and then IPL the system.5. Does the same reference code occur again for this I/O adapter? v Yes: Replace the I/O adapter. This ends the procedure. v No: Replace the backplane. 116 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) This ends the procedure.LICIP15 Use this procedure to help you recover from an initial program load (IPL) failure. 1. Is the system HMC-managed or managed by Integrated Virtualization Manager (IVM)? Yes: Continue with the next step. No: Go to step 4. 2. Check the LPAR configuration to ensure that the load source and alternate load source devices are valid. Is the LPAR configuration correct? Yes: Continue with the next step. No: Correct the LPAR configuration problem. This ends the procedure. 3. Is the load source hosted by another partition? Yes: Contact your next level of support. No: Continue with the next step. 4. Did the failure occur when you were performing a type-D IPL? v No: Go to step 10 on page 118. v Yes: Perform the following: a. Ensure that the device is ready and has valid install media. b. Ensure that the device has the correct SCSI address and that any cables are properly connected and terminated. If a correction is made during the above checks, retry the IPL. If none of the above items resolve the problem, continue with the next step. 5. Are the load source and alternate load source devices controlled by the same I/O adapter, and does the load source disk unit have SLIC loaded on it? Yes: Continue with the next step. No: Go to step 7. 6. Perform a type-B IPL in manual mode. Does the same SRC occur? v No: Continue with the next step. v Yes: Replace the following items, one at a time, and retry the IPL until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. The I/O adapter controlling load source and alternate load source devices. Note: The I/O adapter may be embedded on the system unit backplane. b. The common cable, if present, attached between both the load source and alternate load source and the controlling I/O adapter. c. If none of the items above resolve the problem, contact your next level of support. This ends the procedure. 7. Replace the following items, one at a time, and retry the type-D IPL until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. Media in the alternate load source device b. Device cables (if present) c. Media device d. Media backplane e. I/O adapter controlling the alternate load source device Note: The I/O adapter may be embedded on the system unit backplane f. If the problem persists after replacing each of these parts, contact your next level of support. This ends the procedure. Chapter 2. Isolation Procedures 117 8. You performed a type A or type B IPL. Is the load source I/O adapter a Fibre Channel adapter? Yes: Continue with the next step. No: Continue with step 10. 9. Perform a type-D IPL in manual mode to DST. Look for other SRCs and use them to resolve the problem. If there are no SRCs, or if the SRCs do not resolve the problem, perform the actions for the 2847 3100 SRC. This ends the procedure. 10. Is the device in a valid location (see Chapter 3, “Locating FRUs,” on page 335)? Yes: Continue with the next step. No: Correct the device location problem and retry the IPL. If the problem persists, continue with the next step.11. Perform a type-D IPL in manual mode to DST. Is the type-D IPL successful? v No: Continue with the next step. v Yes: Look for other SRCs and use them to resolve the problem. If there are no SRCs, or the SRCs do not resolve the problem, replace the following items, one at a time, until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. Load source disk drive b. Cables (if present) c. Disk drive backplane d. I/O adapter controlling the load source device Note: The I/O adapter may be embedded on the system unit backplane e. Backplane that the I/O adapter is plugged into f. If the problem persists after replacing each of these parts, contact your next level of support. This ends the procedure.12. The type-D IPL in manual mode to DST was not successful. Is the I/O adapter embedded on the system unit backplane? No: Continue with the next step. Yes: Replace the system unit backplane and retry the IPL. If the IPL still fails, contact your next level of support. This ends the procedure.13. Are the load source and alternate load source controlled by the same I/O adapter? No: Go to step 16 on page 119. Yes: Continue with the next step.14. Replace the I/O adapter and perform a type-A or type-B IPL. Does the IPL complete successfully? Yes: This ends the procedure. No: Continue with the next step.15. Perform a type-D IPL in manual mode to DST. Is the type-D IPL successful? v No: Continue with the next step. v Yes: Look for other SRCs and use them to resolve the problem. If there are no SRCs, or the SRCs do not resolve the problem, replace the following items, one at a time, until the problem is resolved (see Chapter 3, “Locating FRUs,” on page 335): a. Load source disk drive b. Cables (if present) c. Disk drive backplane d. I/O adapter controlling the load source device Note: The I/O adapter may be embedded on the system unit backplane e. Backplane that the I/O adapter and I/O processor are plugged into 118 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) f. If the problem persists after replacing each of these parts, contact your next level of support. This ends the procedure.16. Replace the backplane that the I/O adapter is plugged into and retry the IPL. If the IPL still fails, contact your next level of support. This ends the procedure.LICIP16 Use this procedure to identify an adapter that is operational but is not located in the same partition as its associated adapter. An adapter identified that its associated adapter is operational but is not located in the same partition. Use this procedure to identify the serial number and then find the location of the associated adapter and reassign it so that both adapters are in the same partition. Note: If the associated adapter is located in a different i5/OS partition, there might also be a B600690A logged against the associated adapter in that partition. 1. The adapter against which the B600690A is logged has identified that its associated adapter can not be found in this partition. Find the resource name that this error was logged against. This can be obtained from the Service Action Log (see ?Using the Service Action Log?). Then, using the resource name, perform the following: a. Access SST or DST. b. Select Start a Service Tool. c. Select Hardware Service Manager. d. Select Locate resource by resource name. e. Enter the resource name that this error was logged against. f. Take the option to Display detail for the adapter.2. The bottom of the resource detail screen displays any combination of the following information: Attached storage IOA resource name. : Attached storage IOA serial number. : Attached storage IOA link status. . : Or Attached auxiliary IOA resource name: Attached auxiliary IOA serial number: Attached auxiliary IOA link status. : Or Remote storage IOA resource name. . : Remote storage IOA serial number. . : Remote storage IOA link status. . . : 3. Using the serial number information displayed for the Attached or Remote IOA, have the customer determine which partition currently owns the adapter with that serial number by using logical resource or VPD utilities in each of the partitions on the system. Note: The CCIN of the associated adapter is the first four characters of word 6 of the SRC. 4. Then, have the customer ensure that both adapters are owned by the same partition. For further assistance, the customer should contact their software service provider. This ends the procedure. Chapter 2. Isolation Procedures 119 Logical partition (LPAR) isolation procedure These procedures help you to identify logical partition (LPAR) configuration conditions and the associated corrective actions. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “LPRIP01”Use this procedure to isolate the problem when LPAR configuration data does not match the current system configuration.LPRIP01 Use this procedure to isolate the problem when LPAR configuration data does not match the current system configuration. 1. Is there only one B6005311 error logged, and is it against the load source device for the partition, in either the Primary or a secondary partition? v Yes: Is the reporting partition the Primary partition? – Yes: Continue with the next step. – No: Go to step 3 on page 121. 120 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v No: Go to step 4. 2. Was the load source disk unit migrated from another partition within the same system? v Yes: Is this load source device intended to be the load source for the Primary partition? – Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unit. This ends the procedure. – No: Power off the system. Return the original load source disk to the Primary partition and perform a system IPL. This ends the procedure.v No: The load source disk unit has not changed. Contact your next level of support. This ends the procedure. 3. The reporting partition is a secondary partition. Since the last IPL of the reporting partition, have one of the following occurred? v Has the Primary partition time/date been moved backward to a time/date earlier than the previous setting? v Has the system serial number been changed? v Was the load source disk unit in this secondary partition, replaced intentionally with a load source from another system or another partition from the same system?v Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unitThis ends the procedure. v No: Contact your next level of support. This ends the procedure. 4. Are there multiple B600 5311 SRCs logged in the same partition? v Yes: Continue with the next step. v No: None of the conditions in this procedure have been met, call your next level of support. This ends the procedure. 5. Is the resource for one of the B600 5311 SRCs the load source device and are all of the other B600 5311 entries for resources which are non-configured disk units? Note: To determine if a disk unit is a non-configured disk unit, refer to the ?Work with disk unit options? section in the ?DST options? section of the ?DST chapter? in the iSeries Service Functions information. v Yes: Is the partition that is reporting the error the Primary partition? – Yes: Continue with the next step. – No: Go to step 7.v No: Go to step 8 on page 122. 6. Was the load source disk unit migrated from another partition within the same system? v Yes: Is this load source device intended to be the load source for the Primary partition? – Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unitThis ends the procedure. – No: Power off the system. Return the original load source disk to the Primary partition and perform a system IPL. This ends the procedure.v No: The load source disk unit has not changed. Contact your next level of support. This ends the procedure. 7. The reporting partition is a secondary partition. Since the last IPL of the reporting partition, have one of the following occurred: v Has the Primary partition time/date been moved backward to a time/date earlier than the previous setting? v Has the system serial number been changed? Chapter 2. Isolation Procedures 121 v Was the load source disk unit in this secondary partition, replaced intentionally with a load source from another system or another partition from the same system?v Yes: To accept the load source disk unit: Go to SST/DST in the current partition and select Work with system partitions ? Recover configuration data ? Accept load source disk unitThis ends the procedure. v No: Contact your next level of support. This ends the procedure. 8. One or more B600 5311 SRCs have been logged in the same partition. Do all of the B600 5311 errors have a resource which is a non-configured disk unit in the partition?Note: To determine if a disk unit is a non-configured disk unit, refer to the ?Work with disk unit options? section in the ?DST options? section of the ?DST chapter? in the iSeries Service Functions information. v Yes: Continue with the next step. v No: None of the conditions in this procedure have been met, call your next level of support. This ends the procedure. 9. Were any disk unit resources associated with the B600 5311 SRCs added to the partition, since the last IPL of this partition? v No: Continue with the next step. v Yes: Perform the following to clear non-configured disk unit configuration data: a. Go to SST/DST in the partition and select Work with system partitions ? Recover configuration data ? Clear non-configured disk unit configuration data. b. Select each unit in the list which is new to the system and press Enter. c. Continue the system IPL. This ends the procedure.10. None of the resources that are associated with the B600 5311 SRCs are disk units that were added to the partition since the last IPL of the partition. Has a scratch install recently been performed on the partition that is reporting the error(s)? v No: Continue with the next step. v Yes: Go to step 13.11. If a scratch install was not performed, was the clear configuration data option recently used to discontinue LPAR use? v Yes: Continue with the next step. v No: The Clear configuration data option was not used. Contact your next level of support. This ends the procedure.12. Perform the following to clear non-configured disk unit configuration data: a. Go to SST/DST in the partition and select Work with system partitions ? Recover configuration data ? Clear non-configured disk unit configuration data. b. Select each unit in the list which is new to the system and press Enter. c. Continue the system IPL. This ends the procedure.13. Was the load source device previously mirrored before the scratch install? v Yes: Continue with the next step. v No: Go to step 15.14. Perform the following to clear the old configuration data from the disk unit that was mirroring the old load source disk a. Go to SST/DST in the partition and select Work with system partitions ? Recover configuration data ? Clear non-configured disk unit configuration data. b. Select the former load source mirror in the list and press Enter.15. Is the Primary partition reporting the B600 5311 error(s)? v No: This ends the procedure. 122 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Yes: Does the customer want multiple partitions on the system? – No: This ends the procedure. – Yes: Use the Recover primary partition configuration data option to retrieve the LPAR configuration data from other devices in the system. a. Go to SST/DST in the primary partition and select Work with system partitions ? Recover configuration data ? Recover primary partition configuration data. The system will perform an automatic IPL. b. Verify the information that appears. - The device should be a former load source device from a secondary partition. - The time and date should reflect a time when that partition was active. It should be more recent than the last change to the logical partition configuration. This ends the procedure. Chapter 2. Isolation Procedures 123 Operations Console isolation procedures These procedures help you to isolate a failure with the Operations Console. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)For direct cable only. The following symptoms can be caused by Operations Console failing to connect: v SRC A600 5008 v The status in the Operations Console window remains ?Connecting console?.The following symptoms can be caused by a defective remote control panel cable: v Remote control panel (hung) System control panel functions. v SRC 0000 DDDD with attention light on the system panel. v Remote control panel accepts mode selections, however the system does not respond. v The remote control panel does not function. v The status in the Operations Console window remains ?Connecting console?. 124 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) The following safety notices apply throughout this section. Read all safety procedures before servicing the system. Observe all safety procedures when performing a procedure. Unless instructed otherwise, always power off the system or expansion tower where the field-replaceable unit (FRU) is located before removing, exchanging, or installing a FRU. “OPCIP03”Use this procedure to isolate a bringup failure with Operations Console.OPCIP03 Use this procedure to isolate a bringup failure with Operations Console. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Use this procedure to isolate an Operations Console bringup failure when the SRC on the panel is A6xx5008 or B6xx5008. If you are not using the Operations Console, see A6005004. This procedure only works with cable-connected and LAN configurations. It is not valid for dial connected configurations. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to Determine if the system has logical partitions before continuing with this procedure. Chapter 2. Isolation Procedures 125 2. Is the SRC on the panel A6xx5008 or B6xx5008? v No: This ends the procedure. v Yes: Are you connecting Operations Console using the ASYNC adapter? Yes: Continue with the next step. No: You are connecting using a LAN adapter. Go to step 6.3. Are words 17, 18, and 19 all equal to 00000000? v Yes: Report the problem to your next level of support. This ends the procedure. v No: Is word 17 equal to 00000001? No: Continue with the next step. Yes: The ASYNC adapter was not detected. Ensure that the ASYNC adapter card is installed, or replace the IOA and try again. This ends the procedure. 4. Is word 17 equal to 00000002? v Yes: On the ASYNC adapter card that was found, no cable was detected. Word 18 contains the card position. Locate the ASYNC adapter card in this card position, and ensure that the external cable is attached. Install or replace the external cable. This ends the procedure. v No: Is word 17 equal to 00000003? No: Continue with the next step. Yes: The cable that was detected does not have the correct cable ID. Word 18 contains the card position. Word 19 contains the cable ID. Locate the ASYNC adapter card in this card position, and verify that the correct cable is attached, or replace the cable. This ends the procedure. 5. Is word 17 equal to 00000004? No: Report the problem to you next level of support. This ends the procedure. Yes: Operations Console failed to make a connection because the port is already being used. Word 18 contains the card position. Disconnect the active communications session and try using the resource again. This ends the procedure.6. Are words 13, 14 and 15 all equal to 00000000? v Yes: Report the problem to you next level of support. This ends the procedure. v No: Is word 13 equal to 00000002? No: Continue with the next step. Yes: The LAN hardware failed to activate. Replace the LAN IOA being used. This ends the procedure.7. Is word 13 equal to 00000003? v No: Continue with the next step. v Yes: A hardware error occurred. Word 14 contains the error code, (example: 53001A80). Word 15 contains the card position. Is the error code equal to 53001A80? Yes: The network cable is not attached to the LAN adapter, the cable is defective, or the network is not operational. This ends the procedure. No: The LAN adapter hardware is not operational. Replace the hardware and try again. This ends the procedure. 8. Is word 13 equal to 00000004? v Yes: The console did not respond. Word 14 contains the number of attempts made. Word 15 contains the card position. The system is inserted into the network but there is no connection to the client (PC). Verify the configuration for the network at the system and client; verify the configuration of Operations Console. This ends the procedure. v No: Is word 13 equal to 00000005? No: Report the problem to your next level support. This ends the procedure. 126 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Yes: IP information was received from the console. Word 14 contains the received IP address. Verify the configuration data for the client (PC) or verify the configuration for the network. This ends the procedure. Chapter 2. Isolation Procedures 127 Power isolation procedures Use power isolation procedures for isolating a problem in the power system. Use isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. Some field replaceable units (FRUs) can be replaced with the unit powered on. Follow the instructions in Chapter 3, “Locating FRUs,” on page 335 when directed to remove, exchange, or install a FRU. The following safety notices apply throughout the power isolation procedures. Read all safety procedures before servicing the system and observe all safety procedures when performing a procedure. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “Power problems” on page 129Use the following table to learn how to begin analyzing a power problem. “PWR1900” on page 141Determine which procedure to use based on the model number. “PWR1905” on page 142A system unit power supply load fault is occurring. 128 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “PWR1906” on page 144The server detected an error in the power system. “PWR1907” on page 146A unit was dropped from the SPCN configuration. “PWR1908” on page 148A power supply fault or load fault has occurred in a 5088 or 0588 expansion unit. “PWR1909” on page 150A power supply load fault is occurring in a system expansion unit or I/O tower. “PWR1911” on page 152You are here because of a power problem on a dual line cord system. If the failing unit does not have a dual line cord, return to the procedure that sent you here or go to the next item in the FRU list. “PWR1917” on page 157This procedure is used to display or change the configuration ID. “PWR1918” on page 159A voltage regulator card might be failing. “PWR1920” on page 160Use this procedure to verify that the lights on the server control panel and the display panel on all attached I/O expansion units are operating correctly.Power problems Use the following table to learn how to begin analyzing a power problem. Table 17. Analyzing power problems Symptom What you should do System unit does not power on. See “Cannot power on system unit.” The processor or I/O expansion unit does not power off. See “Cannot power off system or SPCN-controlled I/O expansion unit” on page 132. The system does not remain powered on during a loss of incoming ac voltage and has an uninterruptible power supply (UPS) installed. Refer to the UPS user’s guide that was provided with your unit. An I/O expansion unit does not power on. See “Cannot power on SPCN-controlled I/O expansion unit” on page 136. “Cannot power on system unit”Perform this procedure until you correct the problem and you can power on the system. “Cannot power off system or SPCN-controlled I/O expansion unit” on page 132Use this procedure to analyze a failure of the normal command and control panel procedures to power off the system unit or an SPCN-controlled I/O expansion unit. “Cannot power on SPCN-controlled I/O expansion unit” on page 136You are here because an SPCN-controlled I/O expansion unit cannot be powered on, and might be displaying a 1xxx-C62E reference code.Cannot power on system unit Perform this procedure until you correct the problem and you can power on the system. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. 1. Attempt to power on the system (see “Powering on and powering off” on page 588). Does the system power on, and is the system power status indicator light on continuously? Note: The system power status indicator blinks at the slower rate (one blink per two seconds) while powered off, and at the faster rate (one blink per second) during a normal power-on sequence. Chapter 2. Isolation Procedures 129 No: Continue with the next step. Yes: Go to step 13 on page 131. 2. Are there any characters displayed on the control panel (a scrolling dot may be visible as a character)? No: Continue with the next step. Yes: Go to step 5. 3. Are the mainline ac power cables from the power supply, power distribution unit, or external uninterruptible power supply (UPS) to the customer’s ac power outlet connected and seated correctly at both ends? Yes: Continue with the next step. No: Connect the mainline ac power cables correctly at both ends and go to step 1 on page 129. 4. Perform the following: a. Verify that the UPS is powered on (if it is installed). If the UPS will not power on, follow the service procedures for the UPS to ensure proper line voltage and UPS operation. b. Disconnect the mainline ac power cable or ac power jumper cable from the system’s ac power connector at the system. c. Use a multimeter to measure the ac voltage at the system end of the mainline ac power cable or ac power jumper cable. Note: Some system models have more than one mainline ac power cable or ac power jumper cable. For these models, disconnect all the mainline ac power cables or ac power jumper cables and measure the ac voltage at each cable before continuing with the next step. Is the ac voltage from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? No: Go to step 8 on page 131. Yes: Continue with the next step. 5. Perform the following: a. Disconnect the mainline ac power cable(s) from the power outlet. b. Exchange the system unit control panel (Un-D1) (see “Physical Locations” on page 339). c. Reconnect the mainline ac power cables to the power outlet. d. Attempt to power on the system.Does the system power on? No: Continue with the next step. Yes: The system unit control panel was the failing item. This ends the procedure. 6. Perform the following: a. Disconnect the mainline ac power cable(s) from the power outlet. b. Exchange the power supply or supplies (Un-E1, Un-E2) (see “Physical Locations” on page 339). c. Reconnect the mainline ac power cables to the power outlet. d. Attempt to power on the system.Does the system power on? No: Continue with the next step. Yes: The power supply was the failing item. This ends the procedure. 7. Perform the following: a. Disconnect the mainline ac power cable(s). b. Replace the system backplane (Un-P1) (see “Physical Locations” on page 339). c. Reconnect the mainline ac power cables to the power outlet. d. Attempt to power on the system.Does the system power on? 130 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Continue with the next step. Yes: The system backplane was the failing item. This ends the procedure. 8. Are you working on a system unit with a power distribution unit with tripped breakers? v No: Continue with the next step. v Yes: Perform the following: a. Reset the tripped power distribution breaker. b. Verify that the removable ac power cable is not the problem. Replace the cord, as installed, if it is defective. c. If the breaker continues to trip, install a new power supply (as installed) in each location until the defective one is found. This ends the procedure. 9. Does the system have an external UPS installed? Yes: Continue with the next step. No: Go to step 11.10. Use a multimeter to measure the ac voltage at the external UPS outlets. Is the ac voltage from 200 V ac to 240 V or from 100 V ac to 127 V ac? No: The UPS needs service. For 9910 type UPS, call IBM Service Support. For all other UPS types, have the customer call the UPS provider. In the meantime, go to step 12 to bypass the UPS. Yes: Exchange the ac power cable, as installed (see Chapter 8, “Part Information,” on page 691). This ends the procedure.11. Perform the following: a. Disconnect the mainline ac power cable from the customer’s ac power outlet. b. Use a multimeter to measure the ac voltage at the customer’s ac power outlet. Note: Some system models have more than one mainline ac power cable. For these models, disconnect all the mainline ac power cables and measure the ac voltage at all ac power outlets before continuing with this step. Is the ac voltage from 200 V ac to 240 V ac or from 100 V ac to 127 V ac? Yes: Exchange the mainline ac power cable. See the Chapter 8, “Part Information,” on page 691. Then go to step 1 on page 129. No: Inform the customer that the ac voltage at the power outlet is not correct. When the ac voltage at the power outlet is correct, reconnect the mainline ac power cables to the power outlet. This ends the procedure.12. Perform the following to bypass the UPS unit: a. Power off your system and the UPS unit. b. Remove the signal cable used between the UPS and the system. c. Remove any power jumper cords used between the UPS and the attached devices. d. Remove the country or region-specific power cord used from the UPS to the wall outlet. e. Use the correct power cord (the original country or region-specific power cord that was provided with your system) and connect it to the power inlet on the system. Plug the other end of this cord into a compatible wall outlet. f. Attempt to power on the system.Does the power-on standby sequence complete successfully? Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. No: Go to step 5 on page 130.13. Display the selected IPL mode on the system unit control panel. See ’IPL information’ in Progress Codes, SA76-0093. Is the selected mode the same mode that the customer was using when the power-on failure occurred? No: Go to step 15 on page 132. Chapter 2. Isolation Procedures 131 Yes: Continue with the next step.14. Is a function 11 reference code displayed on the system unit control panel? No: Go to step 16. Yes: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.15. Perform the following: a. Power off the system. b. Select the mode on the system unit control panel that the customer was using when the power-on failure occurred. c. Attempt to power on the system.Does the system power on? Yes: Continue with the next step. No: Exchange the system unit control panel (Un-D1) (see “Physical Locations” on page 339). This ends the procedure.16. Continue the IPL. Does the IPL complete successfully? Yes: This ends the procedure. No: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.Cannot power off system or SPCN-controlled I/O expansion unit Use this procedure to analyze a failure of the normal command and control panel procedures to power off the system unit or an SPCN-controlled I/O expansion unit. Attention: To prevent loss of data, ask the customer to verify that no interactive jobs are running before you perform this procedure. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. 1. Is the power off problem on the system unit? No: Continue with the next step. Yes: Go to step 3. 2. Ensure that the SPCN cables that connect the units are connected and seated correctly at both ends. Does the I/O unit power off, and is the power indicator light blinking slowly? Yes: This ends the procedure. No: Go to step 7 on page 133. 3. Attempt to power off the system. Does the system unit power off, and is the power indicator light blinking slowly? No: Continue with the next step. Yes: The system is not responding to normal power off procedures which could indicate a Licensed Internal Code problem. Contact your next level of support. This ends the procedure. 4. Attempt to power off the system using ASMI. Does the system power off? Yes: The system is not responding to normal power off procedures which could indicate a Licensed Internal Code problem. Contact your next level of support. This ends the procedure. No: Continue with the next step. 5. Attempt to power off the system using the control panel power button. Does the system power off? Yes: Continue with the next step. No: Go to step 10 on page 133. 6. Is there a reference code logged in either ASMI, the control panel, or the HMC that indicates a power problem? Yes: Perform problem analysis for the reference code in the log. This ends the procedure. 132 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Contact your next level of support. This ends the procedure. 7. Is the I/O expansion unit that will not power off part of a shared I/O tower loop? Yes: Go to step 9. No: Continue with the next step. 8. Attempt to power off the I/O expansion unit. Were you able to power off the expansion unit? Yes: This ends the procedure. No: Go to step 10. 9. The unit will only power off under certain conditions: v If the unit is in private mode, it should power off with the system unit that is connected by the SPCN frame-to-frame cable. v If the unit is in switchable mode, it should power off if the ?owning? system is powered off or is powering off, and the system unit that is connected by the SPCN frame-to-frame cable is powered off or is powering off. Does the I/O expansion unit power off? No: Continue with the next step. Yes: This ends the procedure.10. Ensure there are no jobs running on the system or partition, and verify that an uninterruptible power supply (UPS) is not powering the system or I/O expansion unit. Then continue with the next step. 11. Perform the following: a. Remove the system or I/O expansion unit ac power cord from the external UPS or, if an external UPS is not installed, from the customer’s ac power outlet. If the system or I/O expansion unit has more than one ac line cord, disconnect all the ac line cords. b. Exchange the following FRUs one at a time (see “Physical Locations” on page 339 and Chapter 8, “Part Information,” on page 691). If the system unit is failing: 1) Power supply (Un-E1 or Un-E2). Go to step 12 2) System backplane (Un-P1) 3) System control panel (Un-D1) If an I/O expansion unit is failing: 1) Each power supply. Go to step 12 2) I/O backplane 3) I/O backplane in the unit preceding the unit that will not power off 4) SPCN frame-to-frame cable This ends the procedure.12. A power supply might be the failing item. Attention: When replacing a redundant power supply, a 1xxx 1504, 1514, 1524, or 1534 reference code may be logged in the error log. If you just removed and replaced the power supply in the location associated with this reference code, and the power supply came ready after the install, disregard this reference code. If you had not previously removed and replaced a power supply, the power supply did not come ready after installation, or there are repeated fan fault errors after the power supply replacement, continue to follow these steps. Is the reference code 1xxx-15xx? No: Continue with the next step. Yes: Perform the following:a. Find the unit reference code in one of the following tables to determine the failing power supply. b. Ensure that the power cables are properly connected and seated. Chapter 2. Isolation Procedures 133 c. Is the reference code 1xxx-1500, 1510, 1520, or 1530 and is the failing unit configured with a redundant power supply option (or dual line cord feature)? v Yes: Perform “PWR1911” on page 152 before replacing parts. v No: Continue with step 12dd. Refer to “Physical Locations” on page 339 to determine the location and part number of the failing item. e. Replace the failing power supply (see the following tables to determine which power supply to replace). f. If the new power supply does not fix the problem, perform the following : 1) Reinstall the original power supply. 2) Try the new power supply in each of the other positions listed in the table. 3) If the problem still is not fixed, reinstall the original power supply and go to the next FRU in the list. 4) For reference codes 1xxx-1500, 1510, 1520, and 1530, exchange the power distribution backplane if a problem persists after replacing the power supply. Table 18. Model 8203-E4A Unit Reference Code Power Supply 1510, 1511, 1512, 1513, 1514, 7110 E1 1520, 1521, 1522, 1523, 1524, 7120 E2 Table 19. Model 8203-E4A (Quiet office) Unit Reference Code Power Supply 1510, 1511, 1512, 1513, 1514, 7110 E1 Attention: For reference codes 1500, 1510, 1520, and 1530, perform “PWR1911” on page 152 before replacing parts. Table 20. 5088, 0588 expansion units Unit Reference Code Power Supply 1510, 1511, 1512, 1513, 1514, 1516 P02 1520, 1521, 1522, 1523, 1524, 1526 P01 Attention: On a dual line cord system, for reference codes 1500, 1510, 1520, and 1530, perform “PWR1911” on page 152 before replacing parts. On a single line cord system, check the ac jumper to the power supply before replacing parts. Table 21. 5094, 5294 I/O expansion units (single line cord) Unit Reference Code Power Supply 1510, 1511, 1512, 1513, 1514, 1516 P01 1520, 1521, 1522, 1523, 1524, 1526 P02 1530, 1531, 1532, 1533, 1534, 1536 P03 Attention: On a dual line cord system, for reference codes 1500, 1510, 1520, and 1530, perform “PWR1911” on page 152 before replacing parts. On a single line cord system, check the ac jumper to the power supply before replacing parts. 134 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Attention: For 5094, 5294 expansion units, do not install power supplies P00 and P01 ac jumper cables on the same ac input module. Table 22. 5094, 5294 I/O expansion units (dual line cord) Unit Reference Code Power Supply 1500, 1501, 1502, 1503 P00 1510, 1511, 1512, 1513, 1514, 1516 P01 1520, 1521, 1522, 1523, 1524, 1526 P02 1530, 1531, 1532, 1533 P03 Table 23. 5095, 0595, 5790, 7311-D11, 7311-D20, 7314-G30 expansion units Unit Reference Code Power Supply 1510, 1511, 1512, 1513, 1514, 1516, 1517 P01/E1 1520, 1521, 1522, 1523, 1524, 1526, 1527 P02/E2 This ends the procedure. 13. Is the reference code 1xxx-2600, 2603, 2605, or 2606? v No: Continue with the next step. v Yes: Perform the following:a. Refer to “Physical Locations” on page 339 to determine the location and part number of the failing item. b. Replace the failing power supply. c. Perform the following if the new power supply does not fix the problem: 1) Reinstall the original power supply. 2) Try the new power supply in each of the other positions listed in the table. 3) If the problem still is not fixed, reinstall the original power supply and go to the next FRU in the list. Attention: Do not install power supplies P00 and P01 ac jumper cables on the same ac input module. Table 24. Failing power supplies System or Feature Code Failing Power Supply 8203-E4A, 9407-M15, or 9408-M25 Un-E1, Un-E2 5094, 5294 (single line cord) P01, P02, P03 5094, 5294 (dual line cord) P00, P01, P02, P03 5088, 0588 P02, P01 5095, 0595 P01, P02 5790, 7311-D11, 7311-D20, 7314-G30 E1, E2 This ends the procedure.14. Is the reference code 1xxx 8455 or 8456? v No: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. v Yes: One of the power supplies is missing, and must be installed. Use the following table to determine which power supply is missing, and install the power supply ( see “Physical Locations” on page 339 to determine the part number and exchange procedure). Chapter 2. Isolation Procedures 135 Table 25. Missing power supplies Reference Code Missing Power Supply 1xxx 8455 Un-E1 1xxx 8456 Un-E2 This ends the procedure.Cannot power on SPCN-controlled I/O expansion unit You are here because an SPCN-controlled I/O expansion unit cannot be powered on, and might be displaying a 1xxx-C62E reference code. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. 1. Power on the system. 2. Starting from SPCN 0 or SPCN 1 on the system unit (see “Physical Locations” on page 339), go to the first unit in the SPCN frame-to-frame cable sequence that does not power on. Is the Data display background light on, or is the power-on LED blinking, or are there any characters displayed on the I/O expansion unit display panel?Note: The background light is a dim yellow light in the Data area of the display panel. Yes: Go to step 12 on page 138. No: Continue with the next step. 3. Use a multimeter to measure the ac voltage at the customer’s ac power outlet. Is the ac voltage from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? v Yes: Continue with the next step. v No: Inform the customer that the ac voltage at the power outlet is not correct. This ends the procedure. 4. Is the mainline ac power cable from the ac module, power supply, or power distribution unit to the customer’s ac power outlet connected and seated correctly at both ends? v Yes: Continue with the next step. v No: Connect the mainline ac power cable correctly at both ends. This ends the procedure. 5. Perform the following: a. Disconnect the mainline ac power cable from the ac module, power supply, or power distribution unit. b. Use a multimeter to measure the ac voltage at the ac module, power supply, or power distribution unit end of the mainline ac power cable.Is the ac voltage from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? No: Continue with the next step. Yes: Go to step 7 on page 137. 6. Are you working on a power distribution unit with tripped breakers? v No: Exchange the mainline ac power cable or power distribution unit (as installed). This ends the procedure. v Yes: Perform the following: a. Reset the tripped power distribution breaker. b. Verify that the removable ac line cord is not the problem. Replace the cord, as installed, if it is defective. c. Install a new power supply (as installed) in all power locations until the defective one is found. 136 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) This ends the procedure. 7. Does the unit you are working on have ac power jumper cables installed? Note: The ac power jumper cables connect from the ac module, or the power distribution unit to the power supplies. Yes: Continue with the next step. No: Go to step 11. 8. Are the ac power jumper cables connected and seated correctly at both ends? v Yes: Continue with the next step. v No: Connect the ac power jumper cables correctly at both ends. This ends the procedure. 9. Perform the following: a. Disconnect the ac power jumper cables from the ac module, or power distribution unit. b. Use a multimeter to measure the ac voltage at the ac module or power distribution unit (that goes to the power supplies).Is the ac voltage at the ac module or power distribution unit from 200 V ac to 240 V ac, or from 100 V ac to 127 V ac? v Yes: Continue with the next step. v No: Exchange the following as they are installed (see Chapter 8, “Part Information,” on page 691): – ac module – Power distribution unit This ends the procedure.10. Perform the following: a. Connect the ac power jumper cables to the ac module, or power distribution unit. b. Disconnect the ac power jumper cable at the power supplies. c. Use a multimeter to measure the voltage of the power jumper cables input to the power supplies.Is the voltage 200 V ac to 240 V ac or 100 V ac to 127 V ac for each power jumper cable? v No: Exchange the power jumper cable. This ends the procedure. v Yes: Exchange the following parts, as installed, one at a time: a. One of the FRUs listed in the following table, according to the enclosure: Table 26. Enclosures FRU to be exchanged Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane b. Display unit c. Power supply 1 d. Power supply 2 e. Power supply 3 This ends the procedure.11. Perform the following: Chapter 2. Isolation Procedures 137 a. Disconnect the mainline ac power cable (to the expansion unit) from the customer’s ac power outlet. b. Exchange the following FRUs, one at a time: v Power supply. v One of the FRUs listed in the following table, according to the enclosure: Table 27. Enclosures FRU to be exchanged Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane c. Reconnect the mainline ac power cables (from the expansion unit) into the power outlet. d. Attempt to power on the system.Does the expansion unit power on? v Yes: The unit you exchanged was the failing item. This ends the procedure. v No: Repeat this step and exchange the next FRU in the list. If you have exchanged all of the FRUs in the list, ask your next level of support for assistance. This ends the procedure.12. Is there a reference code displayed on the display panel for the I/O unit that does not power on? v Yes: Continue with the next step. v No: Replace the FRU listed in the following table, according to the enclosure: Table 28. Enclosures FRU to be exchanged Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane This ends the procedure.13. Is the reference code 1xxx xx2E? v Yes: Continue with the next step. v No: Use the new reference code and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.14. Do the SPCN optical cables (A) connect the failing unit (B) to the preceding unit in the chain or loop? .---------. (A) SPCN | System | Optical Cables -. .----- SPCN | Unit | | V Optical Adapter | SPCN 0 | .-. V .-. ’----.----’ | +------------+ | | | +------------+ | .----’----. .-------’-+ .-------’-+ .---------. | J15 | |Sec J16| |Sec J15| | Sec | 138 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) |Sec Unit +-+UNIT J15| |Unit J16+-+J15 Unit | | 1 | | 2 | | 3 | | 4 | ’---------’ ’---------’ ’---------’ ’---------’ ^ | ’---- (B) Failing Unit Yes: Continue with the next step. No: Go to step 18 on page 140.15. Remove the SPCN optical adapter (A) from the preceding frame in the string that cannot become powered on. .---------. .--- (A) SPCN Optical Adapter | System | | | Unit | V | SPCN 0 | .-. .-. ’----.----’ | +------------+ | 322 | +------------+ | .----’----. .-------’-+ .-------’-+ .---------. | J15 | |Sec J16| |Sec J15| | Sec | |Sec Unit +-+Unit J15| |Unit J16+-+J15 Unit | | 1 | | 2 | | 3 | | 4 | ’---------’ ’---------’ ’---------’ ’---------’ ^ | ’-- Failing Unit 16. Perform the following: Notes: a. The cable may be connected to either J15 or J16. b. Use an insulated probe or jumper when performing the voltage readings.a. Connect the negative lead of a multimeter to the system frame ground. b. Connect the positive lead of a multimeter to pin 2 of the connector from which you removed the SPCN optical adapter in the previous step of this procedure. c. Note the voltage reading on pin 2. d. Move the positive lead of the multimeter to pin 3 of the connector or SPCN card. e. Note the voltage reading on pin 3.Is the voltage on both pin 2 and pin 3 from 1.5 V dc to 5.5 V dc? v Yes: Continue with the next step. v No: Exchange the FRU listed in the following table, according to the enclosure: Table 29. Enclosures FRU to be exchanged Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane This ends the procedure.17. Exchange the following FRUs, one at a time: Chapter 2. Isolation Procedures 139 a. In the failing unit (first frame with a failure indication), replace the following: Table 30. Enclosures FRU to be exchanged Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane b. In the preceding unit in the string, replace the FRU in Table 30. c. SPCN optical adapter (A) in the preceding unit in the string. d. SPCN optical adapter (B) in the failing unit. e. SPCN optical cables (C) between the preceding unit in the string and the failing unit.This ends the procedure. (A) SPCN Optical .----- (C) SPCN Adapter ----. | Optical Cables | | | | .-- (B) SPCN .---------. | | | Optical | System | | | | Adapter | Unit | V | V | SPCN 0 | .-. V .-. ’----.----’ | +------------+ | | | +------------+ | .----’----. .-------’-+ .-------’-+ .---------. |Sec J15 | | Sec J16| |Sec J15| | Sec | |Unit J16+-+J15 Unit | |Unit J16+-+J15 Unit | | 1 | | 2 | | 3 | | 4 | ’---------’ ’---------’ ’---------’ ’---------’ ^ | | ’--- Failing Unit 18. Perform the following: a. Power off the system. b. Disconnect the SPCN frame-to-frame cable from the connector of the first unit that cannot be powered on. c. Connect the negative lead of a multimeter to the system frame ground. d. Connect the positive lead of the multimeter to pin 2 of the SPCN cable. Note: Use an insulated probe or jumper when performing the voltage readings. e. Note the voltage reading on pin 2. f. Move the positive lead of the multimeter to pin 3 of the SPCN cable. g. Note the voltage reading on pin 3.Is the voltage on both pin 2 and pin 3 from 1.5 V dc to 5.5 V dc? v No: Continue with the next step. v Yes: Exchange the following FRUs one at a time: 140 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. In the failing unit, replace the FRU listed in the following table: Table 31. Enclosures FRU to be exchanged Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane b. In the preceding unit in the string, replace the FRU in Table 31. c. SPCN frame-to-frame cable. This ends the procedure.19. Perform the following: a. Follow the SPCN frame-to-frame cable back to the preceding unit in the string. b. Disconnect the SPCN cable from the connector. c. Connect the negative lead of a multimeter to the system frame ground. d. Connect the positive lead of a multimeter to pin 2 of the connector. Note: Use an insulated probe or jumper when performing the voltage readings. e. Note the voltage reading on pin 2. f. Move the positive lead of the multimeter to pin 3 of the connector. g. Note the voltage reading on pin 3.Is the voltage on both pin 2 and pin 3 from 1.5 V dc to 5.5 V dc? v Yes: Exchange the following FRUs one at a time: a. SPCN frame-to-frame cable. b. In the failing unit, replace the following: Table 32. Enclosures FRU to be exchanged Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane c. In the preceding unit in the string, replace the FRU in Table 32. This ends the procedure. v No: Exchange the FRU in Table 32 from the unit from which you disconnected the SPCN cable in the previous step of this procedure. This ends the procedure.PWR1900 Determine which procedure to use based on the model number. Follow the instructions for the model or expansion unit you are servicing. For models 8203-E4A, 9407-M15, and 9408-M25 perform “PWR1905” on page 142. Chapter 2. Isolation Procedures 141 For 5094 and 5294 units, perform “PWR1906” on page 144. For 5088 and 0588 units, perform “PWR1908” on page 148. For 5095, 0595, 5790, 5796, 7314-G30. 7311-D11, and 7311-D20 units, perform “PWR1909” on page 150. This ends the procedure. PWR1905 A system unit power supply load fault is occurring. Refer to “Power isolation procedures” on page 128 for important safety information before servicing the system. Instructions for Models 8203-E4A, 9407-M15, or 9408-M25 1. Is the reference code 1xxx 1B01? Yes: Continue with the next step. No: This procedure only isolates problems that cause 1xxx 1B01 to be logged. Return to the procedure that sent you here. This ends the procedure. 2. Perform the following: a. Power off the system and disconnect the ac power cable from the unit you are working on. b. Disconnect all the I/O devices (tape, diskette, optical, and disk units) by sliding them partially out of the system unit (see “Physical Locations” on page 339). c. Remove and label all cards (for example, PCI adapters, GX adapter(s), RIO/HSL, and RAID cards if installed). d. Reconnect the ac power cable or cables to the unit you are working on. e. Power on the system (see “Powering on and powering off” on page 588).Does a power reference code occur? Yes: Continue with the next step. No: Go to step 5. 3. Perform the following: a. Power off the system. b. Replace one of the system fans (see “Physical Locations” on page 339). c. Power on the system (see “Powering on and powering off” on page 588).Does a power reference code occur? Yes: Continue with the next step. No: The fan you just replaced was the failing item. This ends the procedure. 4. Have you tried replacing all the fans? v Yes: Reinstall the fan you just replaced in step 3 and continue with the next step. v No: Perform the following: a. Power off the system. b. Reinstall the fan that you just removed in step 3 to its original location. c. Repeat step 3 5. Perform the following: a. Power off the system. b. Reinstall all the cards (PCI adapters, memory DIMMs, GX adapter(s), RIO/HSL and RAID cards) you removed in step 2 into their original locations. c. Power on the system.Does a power reference code occur? 142 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Yes: Continue with the next step. No: Go to step 8. 6. Perform the following: a. Power off the system. b. Disconnect one of the cards you reinstalled in step 5 on page 142. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Exchange the last card you disconnected in this step (see “Physical Locations” on page 339). This ends the procedure. 7. Have you disconnected all the cards? No: Repeat step 6. Yes: Reinstall all of the parts removed or exchanged in this procedure and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. 8. Perform the following: a. Power off the system. b. Reconnect all of the I/O devices (tape, diskette, optical, and disk units) that you disconnected in step 2 on page 142. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: The problem has been resolved. This ends the procedure. 9. Perform the following: a. Power off the system. b. Disconnect one of the I/O devices (tape, diskette, optical, and disk units) that you reconnected in step 8. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Exchange the last I/O device you disconnected in this step (see “Physical Locations” on page 339). This ends the procedure.10. Have you tried disconnecting all of the I/O devices? No: Repeat step 9. Yes: Continue with the next step.11. Replace the system backplane (Un-P1). Reinstall all of the parts you have removed or exchanged in this procedure. Does a power reference code occur? Yes: Continue with the next step. No: The system is fixed. This ends the procedure.12. Does the system contain only one power supply? Yes: Continue with the next step. No: Go to step 14 on page 144.13. Replace the power supply. Does a power reference code occur? Yes: Contact your service support.This ends the procedure. No: The system is fixed. This ends the procedure. Chapter 2. Isolation Procedures 143 14. Power off the system. Remove one of the power supplies, then power on the system. Does a power reference code occur? Yes: Continue with the next step. No: The power supply you just removed is defective. Replace it. This ends the procedure.15. Power off the system. Reinstall the power supply that was removed in step 14, and remove the other power supply. Power on the system. Does a power reference code occur? Yes: Continue with the next step. No: The power supply you just removed is defective. Replace it. This ends the procedure.16. Power off the system. Replace both power supplies. Power on the system. Does a power reference code occur? Yes: Contact your service support.This ends the procedure. No: The system is fixed. This ends the procedure.PWR1906 The server detected an error in the power system. Please see “Power isolation procedures” on page 128 for important safety information before servicing the system. PWR1906 Instructions for 5094 and 5294 expansion units 1. Perform the following: a. Power off the unit you are working on. b. Disconnect all the I/O devices (tape, diskette, optical, and disk units) from the unit that you are working on by sliding them partially out of the unit. See the service guide for the unit you are servicing. c. Remove and label all of the cards that are installed in the PCI adapter slots. d. Power on the unit you are working on.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 9 on page 145. 2. Perform the following: a. Power off the unit you are working on. b. Remove, in order, all power supplies except the first one (either P00 or P01 depending on the configuration). c. Power on the unit you are working on.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 4 on page 145. 3. Perform the following: a. Power off the unit you are working on. b. Remove the power supply that was left installed in step 2. c. Reconnect the next power supply in order (P01 or P02). d. Power on the unit you are working on.Does a power reference code occur? v Yes: Go to step 5 on page 145. v No: Exchange the power supply you removed in step 3b. 144 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) This ends the procedure. 4. Perform the following: a. Power off the unit you are working on. b. Reconnect the next power supply in order. c. Power on the unit you are working on.Does a power reference code occur? v No: Repeat this step until all power supplies have been reconnected. This ends the procedure. v Yes: Exchange the power supply that you reconnected in this step. This ends the procedure. 5. Perform the following: a. Remove one of the fans from the unit you are working on that you did not previously remove during this procedure. Note: Disregard a fan reference code if it occurs during this step. b. Power on the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan that you removed in this step is the failing item. This ends the procedure. 6. Have you removed all of the fans, one at a time? v Yes: Install all of the fans and continue with the next step. v No: Perform the following: a. Power off the unit you are working on. b. Reinstall the fan that was removed in step 5 to its original location. c. Repeat step 5. 7. Perform the following: a. Remove the power cable (that was not previously removed) from one of the lower DASD device boards (CB1 or CB2). b. Power on the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: Replace the DASD device board that the power cable was removed from in this step. This ends the procedure. 8. Is a second lower DASD backplane installed and its power cable connected? v Yes: Repeat step 7. v No: Replace the following one at a time: a. DASD upper device board (DB3) b. I/O enclosure backplane (CB1). This ends the procedure. 9. Perform the following: a. Power off the unit you are working on. b. Reinstall all of the cards that you removed in step 1 on page 144. c. Reconnect the ac power cable to the unit that you are working on. d. Power on the unit you are working on.Does a power reference code occur? Chapter 2. Isolation Procedures 145 Yes: Continue with the next step. No: Go to step 12.10. Perform the following: a. Power off the unit you are working on. b. Disconnect one of the cards that you connected in step 9 on page 145. c. Power on the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: Exchange the last card that you disconnected in this step. This ends the procedure.11. Have you disconnected all the cards? Yes: Continue with the next step. No: Repeat step 10.12. Perform the following: a. Power off the unit you are working on. b. Reconnect all of the I/O devices (tape, diskette, optical, or disk units) that you disconnected in step 1 on page 144. c. Power on the unit you are working on.Does a power reference code occur? Yes: Continue with the next step. No: This ends the procedure.13. Perform the following: a. Power off the unit you are working on. b. If you have not already done so, disconnect all of the I/O devices (tape, diskette, optical, or disk units) that you reconnected in step 12. c. Reconnect one of the I/O devices (tape, diskette, optical, or disk units) that you just disconnected. d. Power on the unit you are working on.Does a power reference code occur? v No: Continue with the next step. v Yes: Exchange the last I/O device that you reconnected in this step. This ends the procedure.14. Have you reconnected all the I/O devices? Yes: This ends the procedure. No: Repeat step 13 (you can skip part b).PWR1907 A unit was dropped from the SPCN configuration. 1. Is the reference code you are working with 1xxx 913B? v No: Continue with the next step. v Yes: A system power control network (SPCN) firmware update is needed, but not started due to the SPCN firmware update policy setting. A manual update needs to be started.Notes: – Do not perform maintenance on an expansion unit or modify the SPCN network while the SPCN firmware update is being performed. 146 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) – Performing firmware updates or powering off the system will interrupt SPCN firmware updates and the firmware update will need to be started again after these actions.a. Access the Advanced System Management Interface (ASMI) and select System Configuration and then Configure I/O Enclosures. b. Record the current SPCN firmware update policy setting so it can be restored later. c. Change the SPCN firmware update policy setting to expanded and click Save Policy Setting to allow for SPCN firmware updates to be done over the RIO/HSL and serial SPCN interfaces. d. Select Start SPCN Firmware Update. The SPCN firmware will now be downloaded to the expansion units that require an update. e. Change the SPCN firmware update policy setting back to what it was originally set to in step 1b and click on Save Policy Setting. Note: The SPCN firmware update can be stopped using the Stop SPCN Firmware Update button. However, to allow the expansion units to be updated to the latest SPCN firmware level, it is recommended that the firmware update be allowed to complete. The progress of the SPCN firmware update can be monitored by clicking on Configure I/O Enclosures to update the screen. Do not use the browser Back or Refresh button to monitor the update progress. The Power Control Network Firmware Update Status column shows the percentage complete and In Progress is displayed while the download is progresses. Not Required is displayed when the download process completes. This ends the procedure. 2. Is the reference code you are working with 1xxx 90F0? v No: Contact your next level of support. v Yes: A unit was dropped from the SPCN configuration. This can be caused by any of the following: – The rack or unit has lost all ac or dc power. – The SPCN function in the unit has an error. – The SPCN frame-to-frame cables, RIO cable or the remote I/O (RIO) adapter card has failed.3. Using the HMC or ASMI, find the 1xxx 90F0 SRC in the error log (see “Displaying error and event logs” on page 649). Use the option Show details to display the location information for the failing unit. 4. After locating the failing unit, ensure that the SPCN frame-to-frame cable and RIO cables are seated correctly; reseat the cables if necessary. Are the cables connected correctly? v No: Correctly reconnect the cables, or replace them if necessary. This ends the procedure. v Yes: Continue with the next step.5. Are the ac line cords on the failing unit connected properly at both ends? v No: Reconnect the ac line cords, or replace them if necessary. This ends the procedure. v Yes: Continue with the next step.6. Check the voltage at the customer’s ac outlet. Is the voltage correct? v No: Inform the customer that the voltage at the ac power outlet is incorrect. This ends the procedure. v Yes: Continue with the next step.7. Are the power supplies functional? v No: Perform the following: a. Refer to Chapter 3, “Locating FRUs,” on page 335 to determine the location and part number for each power supply, and to find the appropriate procedure for exchanging the power supplies. b. Replace each power supply one at a time, until the problem has been resolved. Chapter 2. Isolation Procedures 147 c. If the problem persists after replacing all of the power supplies, continue with the next step.v Yes: Continue with the next step.8. Use the following table to determine the FRUs to replace. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing FRUs. This ends the procedure. Table 33. Expansion enclosures FRUs Enclosure FRU 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane PWR1908 A power supply fault or load fault has occurred in a 5088 or 0588 expansion unit. For important safety information before servicing the system, see “Power isolation procedures” on page 128. For location information, see the service guide for the enclosure you are working on. PWR1908 instructions for 5088 or 0588 expansion units Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Perform the following: a. Power off the frame that you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove and label all cards installed in the PCI backplane area. c. Power on the frame by reconnecting the ac jumper cable to the unit.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 8 on page 149.2. Is the reference code 1xxx 2603? v No: Continue with the next step. v Yes: Replace the PCI backplane. This ends the procedure.3. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove power supply P01. c. Remove fan assembly B01 from power supply P01 and install it on a new power supply P01. d. Install the new power supply P01. e. Power on the frame by reconnecting the ac jumper cable to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The power supply that you replaced in this step was the failing item. This ends the procedure.4. Perform the following: 148 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove the new power supply P01 and replace it with the original power supply P01. c. Remove power supply P02. d. Remove fan assembly B02 from power supply P02 and install it on a new power supply P02. e. Install the new power supply P02. f. Power on the frame by reconnecting the ac jumper cable to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The power supply that you replaced in this step was the failing item. This ends the procedure.5. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove both the new power supply P02 and the fan assembly B02. c. Reinstall the original power supply P02 and a new fan assembly B02. d. Power on the frame by reconnecting the ac power cables to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan assembly B02 that you replaced in this step was the failing item. This ends the procedure.6. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove power supply P02. c. Replace the new fan assembly B02 with the original fan assembly B02. d. Reinstall power supply P02. e. Remove power supply P01. f. Remove fan assembly B01 and replace it with a new fan assembly B01. g. Reinstall power supply P01. h. Power on the frame by reconnecting the ac jumper cable to the unit you are working on.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan assembly B01 that you removed in this step is the failing item. This ends the procedure.7. Perform the following: a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Remove power supply P01. c. Replace fan assembly B01 with the original fan assembly B01. d. Reinstall power supply P01. e. Replace the following FRUs one at a time: v Display panel. v PCI backplane assembly CB1. This ends the procedure.8. Perform the following: Chapter 2. Isolation Procedures 149 a. Power off the frame you are working on by removing the ac jumper cable from the power supplies on the 5088 or 0588 expansion unit. b. Reinstall one of the cards that you removed in step 1 on page 148. c. Power on the frame by reconnecting the ac power cables to the unit you are working on.Does a power reference code occur? v No: Continue with the next step. v Yes: Exchange the last card that you reinstalled in this step. This ends the procedure.9. Have you reinstalled all of the cards? v Yes: Replace the PCI backplane assembly CB1. This ends the procedure. v No: Repeat step 8 on page 149, reinstalling the next card.PWR1909 A power supply load fault is occurring in a system expansion unit or I/O tower. For important safety information before servicing the system, refer to “Power isolation procedures” on page 128. Instructions for 5095, 0595, 5790, 5796, 7311-D11, 7311-D20 and 7314-G30 Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Perform the following: a. Power off the system. b. Disconnect all the I/O devices (tape, diskette, optical, and disk units) from the expansion unit or I/O tower you are working on by sliding them partially out of the unit (see “Physical Locations” on page 339). c. Remove and label all cards installed in the PCI adapters area. d. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 8 on page 151. 2. Perform the following: a. Power off the system. b. Remove one of the fans from the expansion unit or I/O tower that you have not previously removed during this procedure. Note: If a fan reference code occurs during this step, ignore it. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. v No: The fan you removed in this step is the failing item. This ends the procedure. 3. Have you removed all of the fans one at a time? v No: Perform the following: a. Power off the system. b. Reinstall the fan that you removed in step 2 into its original location. c. Repeat step 2. 150 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Yes: Reinstall all of the fans and continue with the next step. 4. Perform the following: Note: If there are no DASD installed in this enclosure, go to step 6. a. Power off the system. b. Remove the I/O tower power supply cable, at the DASD backplane, that you have not previously removed. c. Power on the system.Does a power reference code occur? v No: The DASD backplane that was disconnected in this step is the failing item. This ends the procedure. v Yes: Continue with the next step. 5. Have you disconnected the power cables from each of the DASD backplanes one at a time? Yes: Continue with the next step. No: Repeat step 4. 6. Perform the following: a. Power off the system. b. Remove a power supply that you have not previously removed, and replace it with a new one. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. v No: The power supply that was removed in this step is the failing item. This ends the procedure. 7. Have you removed all of the power supplies one at a time? v Yes: Perform the following: a. Remove the new power supply that you installed in step 6 and reinstall the original power supply. b. Replace the I/O backplane if you are working on a 7311-D11, 7314-G30, 5790 or 5796. Replace the tower backplane if you are working on a 5095 or 0595. This ends the procedure.v No: Remove the new power supply that you installed in step 6 and reinstall the original power supply. Then, repeat step 6. 8. Perform the following: a. Power off the system. b. Reinstall all of the cards you removed in step 1 on page 150. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: Go to step 11 on page 152. 9. Perform the following: a. Power off the system. b. Disconnect one of the cards you reconnected in step 8. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. Chapter 2. Isolation Procedures 151 v No: Exchange the last card you disconnected in this step (see Chapter 8, “Part Information,” on page 691) Reinstall all the parts. This ends the procedure.10. Have you disconnected all the cards? v No: Repeat step 9 on page 151. v Yes: Reinstall all the parts and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.11. Perform the following: a. Power off the system. b. Reconnect all of the I/O devices (tape, diskette, optical, or disk units) that you disconnected in step 1 on page 150. c. Power on the system.Does a power reference code occur? Yes: Continue with the next step. No: This ends the procedure.12. Perform the following: a. Power off the system. b. Disconnect one of the I/O devices you reconnected in step 11. c. Power on the system.Does a power reference code occur? v Yes: Continue with the next step. v No: Exchange the last I/O device you disconnected in this step. This ends the procedure.13. Have you disconnected all of the I/O devices? v No: Repeat step 12. v Yes: Reinstall all the parts and return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.PWR1911 You are here because of a power problem on a dual line cord system. If the failing unit does not have a dual line cord, return to the procedure that sent you here or go to the next item in the FRU list. The following steps are for the system unit, unless other instructions are given. For important safety information before servicing the system, refer to “Power isolation procedures” on page 128. 1. If an uninterruptible power supply is installed, verify that it is powered on before proceeding. 2. Are all the units powered on? v Yes: Go to step 7 on page 154. v No: On the unit that does not power on, perform the following: a. Disconnect the ac line cords from the unit that does not power on. b. Use a multimeter to measure the ac voltage at the system end of both ac line cords. Table 34. Correct ac voltage Model or expansion unit Correct ac voltage Model 8203-E4A and 5095, 0595, and 7311-D20 expansion units 100 V to 127 V or 200 V to 240 V 5796 and 7314-G30 expansion units 200 V to 240 V 152 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 34. Correct ac voltage (continued) Model or expansion unit Correct ac voltage 5088, 0588, 5094, 5294, 5790, and 7311-D11 expansion units 200 V to 240 V ac or dc c. Is the voltage correct (refer to Table 34 on page 152)? Yes: Continue with the next step. No: Go to step 6 on page 154. 3. Are you working on a model 8203-E4A or a 5095, 0595, 5790, 5796, 7314-G30, 7311-D11, or 7311-D20 expansion unit? v No: Continue with the next step. v Yes: Perform the following: a. Reconnect the ac line cords. b. Verify that the failing unit fails to power on. c. Replace the failing power supply. Use the table below to determine which power supply needs replacing, and then see Chapter 3, “Locating FRUs,” on page 335 for its location, part number, and exchange procedure. Table 35. Failing power supply for models 8203-E4A and 5095, 0595, 5790, 5796, 7314-G30, 7311-D11 or 7311-D20 expansion units Reference code Models or expansion units Failing item name 1510 8203-E4A Power supply 1 5095, 0595, 5790, 5796, 7314-G30, 7311-D11, 7311-D20 Power supply 1 1520 8203-E4A Power supply 2 5095, 0595, 5790, 5796, 7314-G30, 7311-D11, 7311-D20 Power supply 2 This ends the procedure. 4. Perform the following: a. Reconnect the ac line cord to the ac modules. b. Remove the ac jumper cables at the power supplies. c. Use a multimeter to measure the ac voltage at the power-supply end of the jumper cable.Is the ac voltage from 200 V to 240 V? v No: Continue with the next step. v Yes: Replace the failing power supply. Use the following table to determine which power supply must be replaced, and then see Chapter 3, “Locating FRUs,” on page 335 for its location, part number, and exchange procedure. Attention: Do not install power supplies P00 and P01 ac jumper cables on the same ac module. Table 36. Failing power supply for 5094, 5294, 5088 or 0588 expansion units Reference code Failing item name 1500 Power supply 0 1510 Power supply 1 for 5094 or 5294 Power supply 2 for 5088 or 0588 1520 Power supply 2 for 5094 or 5294 Power supply 1 for 5088 or 0588 1530 Power supply 3 Chapter 2. Isolation Procedures 153 This ends the procedure. 5. Perform the following: a. Disconnect the ac jumper cable at the ac module output. b. Use a multimeter to measure the ac voltage at the ac module output.Is the ac voltage from 200 V to 240 V? v Yes: Exchange the ac jumper cable. This ends the procedure. v No: Exchange the ac module (see Chapter 3, “Locating FRUs,” on page 335). This ends the procedure. 6. Perform the following: a. Disconnect the ac line cords from the customer’s ac power outlet. b. Use a multimeter to measure the ac voltage at the customer’s ac power outlet.Is the ac voltage correct (refer to Table 34 on page 152)? v Yes: Exchange the failing ac line cord. This ends the procedure. v No: Perform the following: a. Inform the customer that the ac voltage at the power outlet is not correct. b. Reconnect the ac line cords to the power outlet after the ac voltage at the power outlet is correct. This ends the procedure. 7. Is the reference code 1xxx00AC? v No: Continue with the next step. v Yes: This reference code may have been caused by an ac outage. If the system will power on without an error, no parts need to be replaced. This ends the procedure. 8. Is the reference code 1xxx1510 or 1520? v No: Continue with the next step. v Yes: Perform the following: a. Use the following table, figures and Chapter 3, “Locating FRUs,” on page 335 to locate the failing parts. Table 37. Power Reference Code Table Model or expansion unit Reference code Locate these parts 8203-E4A 1xxx 1510 Power supply E1 and ac line cord 1 1xxx 1520 Power supply E2 and ac line cord 2 5088 and 0588 (see Figure 2 on page 156) 1xxx 1510 ac jumper cable connected to power supply 2 and the ac module 1xxx 1520 ac jumper cable connected to power supply 1 and the ac module 154 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 37. Power Reference Code Table (continued) Model or expansion unit Reference code Locate these parts 5294 and 5094 (see Figure 1) 1xxx 1500 ac jumper cable connected to power supply 0 and the ac module 1xxx 1510 ac jumper cable connected to power supply 1 and the ac module 1xxx 1520 ac jumper cable connected to power supply 2 and the ac module 1xxx 1530 ac jumper cable connected to power supply 3 and the ac module 5095 and 0595 1xxx 1510 Power supply 1 and ac line cord 1 1xxx 1520 Power supply 2 and ac line cord 2 5790, 5796, 7314-G30, 7311-D11, 7311-D20 1xxx 1510 Power supply 1 and ac line cord 1 1xxx 1520 Power supply 2 and ac line cord 2 Figure 1. Dual line cord drawing for 5094 Chapter 2. Isolation Procedures 155 b. Locate the ac line cord or the ac jumper cable for the reference code you are working on. c. Go to step 10. 9. Is the reference code 1xxx 1500 or 1xxx 1530? v No: Perform Problem Analysis using the reference code. This ends the procedure. v Yes: Locate the ac jumper cables for the reference code you are working on (see Table 37 on page 154), and then continue with the next step: – If the reference code is 1xxx 1500, determine the locations of ac jumper cables that connect to power supply P00 (see the preceding figures). – If the reference code is 1xxx 1530, determine the locations of ac jumper cables that connect to power supply P03 (see the preceding figures).10. Perform the following: Attention: Do not disconnect the other system line cord when powered on. Attention: Do not disconnect the other ac jumper cable when powered on. a. For the reference code you are working on, disconnect either the ac jumper cable or the ac line cord from the power supply. b. Use a multimeter to measure the ac voltage at the power supply end of the ac jumper cable or the ac line cord. Figure 2. Dual line cord drawing for 5088, or 0588 installed on 5094 156 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Is the ac voltage correct (see Table 34 on page 152)? No: Continue with the next step. Yes: Exchange the failing power supply. Refer to Table 35 on page 153 and Table 36 on page 153 for its position, and then see Chapter 3, “Locating FRUs,” on page 335 for part numbers and directions to the correct exchange procedures. This ends the procedure.11. Perform the following: a. Disconnect the ac line cords from the power outlet. b. Use a multimeter to measure the ac voltage at the customer’s ac power outlet.Is the ac voltage correct (see Table 34 on page 152)? v Yes: Exchange the following, one at a time: – Failing ac line cord – Failing ac jumper cable (if installed) – Failing ac module (if installed) (see Chapter 3, “Locating FRUs,” on page 335 for part numbers and directions to the correct exchange procedures) This ends the procedure. v No: Perform the following: a. Inform the customer that the ac voltage at the power outlet is not correct. b. Reconnect the ac line cords to the power outlet after the ac voltage at the power outlet is correct. This ends the procedure.PWR1917 This procedure is used to display or change the configuration ID. 1. Use either the Advanced System Management Interface (ASMI) or the control panel to display and change the configuration ID. v If you are using the ASMI, refer to the operations guide for your system and use the instructions for Changing system configuration. Use Table 38 on page 158 to find the correct configuration ID. v If you are using the control panel, continue with the next step.2. Perform the following to display the configuration ID: Attention: The system or unit that will display the ID must be powered off with ac power applied.Notes: v If you have just restored power to the system, the service processor must return to standby before control panel functions will work correctly. Returning the service processor to standby takes a few minutes after the panel appears to be operational. v You must have the panel in manual mode to access function 7 options.a. Select function 07 on the system control panel. Press Enter (07** will be displayed). b. Use the arrow keys to increment or decrement to subfunction A8. 07A8 will be displayed. Press Enter (07A8 00 will be displayed). c. Use the arrow keys to increment or decrement to the first byte of the unit address (usually 3C) for the box you want to check. 07nn will be displayed, where nn is the first byte of the unit address. d. Press Enter (073C 00, for example, will be displayed). e. Use the arrow keys to increment or decrement to the second byte of the unit address (usually 01, 02, etc for I/O expansion units) for the box you want to check. 07nn will be displayed, where nn is the second byte of the unit address (0701, for example, for a unit). Press Enter (0701 00, for example, will be displayed).Note: The display on the addressed I/O expansion unit being addressedshould be blinking on and off while displaying the configuration ID as the last two characters of the bottom line. Chapter 2. Isolation Procedures 157 f. Use the following table to check the unit configuration ID. Table 38. Unit Configuration IDs Model or expansion unit Configuration ID 5088 and 0588 89 5094 and 5294 8A 5095 and 0595 8B 7311-D11, 5790 88 7311-D20 8C 7314-G30, 5796 8D g. Is the correct configuration ID displayed for the tower selected? v No: Continue with the next step. v Yes: Go to step 6.3. You need to set the unit configuration ID. Are you starting this step from the function 01 view on the control panel? v No: To ensure that the control panel operates properly, return to function 01. Do the following: a. The operator panel should still show the incorrect configuration ID (for example, 07C0). b. Press Enter. The control panel will now show 07xx 00 (for example, 07C0 00). c. Use the arrow keys to display 07**, then press Enter. The control panel will now show 07. d. Use the arrow keys to get the display to function 01, then press Enter. You should now be at the regular function 01 control panel view. e. Continue with the next step.v Yes: Continue with step 4.4. Set the unit configuration ID. Do the following: a. Select function 07 on the system control panel. Press Enter (07** will be displayed). b. Use the arrow keys to increment/decrement to subfunction A9 (07A9 will be displayed). Press Enter (07A9 00 will be displayed). c. Use the arrow keys to increment/decrement to the first byte of the unit address (usually 3C) for the box that you want to change. 07nn (073C, for example) will be displayed, where nn is the first byte of the unit address. Press Enter (073C 00, for example, will be displayed). d. Use the arrow keys to increment/decrement to the second byte of the unit address (usually 01, 02, etc for I/O Expansion units) for the box you want to check. 07nn will be displayed, where nn is the second byte of the unit address (01, for example, for a unit). Press Enter (0701 00, for example, will be displayed).Note: The display on the addressed I/O expansion unit will be blinking on and off. e. Use the arrow keys to increment/decrement to the correct configuration ID (refer to Table 38). 07xx will be displayed where xx is the configuration ID. f. Press Enter (07xx 00 will be displayed). After 20 to 30 seconds, the display on the addressed I/O expansion unit will stop blinking and return to the normal display format. Note: To return the panel to normal display, scroll to 07** and press Enter. g. Continue with the next step.5. Power on the system. Do you still get SRC 1xxxx84D0 or 1xxxx840E? v No: This ends the procedure. v Yes: Continue with the next step.6. Perform the following: 158 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. Power off the system. b. Exchange the SPCN card in the failing frame. This ends the procedure.PWR1918 A voltage regulator card might be failing. For instructions for removing and replacing any FRUs called out by the following procedure, refer to Chapter 3, “Locating FRUs,” on page 335. 1. Is the reference code 1xxx 8450? No: Continue with the next step. Yes: At least one processor VRM is missing. Inspect all of the processor cards and install the VRM that are missing (see “Physical Locations” on page 339). This ends the procedure.2. Is the reference code 1xxx 2630? No: Continue with the next step. Yes: Replace VRM1A and VRM1B (which are one CRU/FRU) on processor 1 (Un-P1-C13) (see “Physical Locations” on page 339). This ends the procedure.3. Is the reference code 1xxx 2640? No: Continue with the next step. Yes: Replace VRM1A and VRM1B (which are one CRU/FRU) on processor 2 (Un-P1-C14). This ends the procedure.4. Is the reference code 1xxx 2623? v No Continue with the next step. v Yes: Replace the following, if present, one at a time, and in the order listed: a. Base RAID card (Un-P1-C10) b. Auxiliary RAID card (Un-P1-C9)This ends the procedure. 5. Is the reference code 1xxx 2624? v No Continue with the next step. v Yes: Replace the following, if present, one at a time, and in the order listed: a. Ethernet card (Un-P1-C7) b. Base RAID card (Un-P1-C10) c. Auxiliary RAID card (Un-P1-C9) d. System backplane (Un-P1) e. DASD drives f. DASD and media backplane (Un-P2)This ends the procedure. 6. Is the reference code 1xxx 2625? v No Continue with the next step. v Yes: Replace the following, if present, one at a time, and in the order listed: a. Base RAID card (Un-P1-C10) b. Auxiliary RAID card (Un-P1-C9) c. DASD drives d. DASD and media backplane (Un-P2) e. Ethernet card (Un-P1-C7) f. System backplane (Un-P1)This ends the procedure. Chapter 2. Isolation Procedures 159 7. Is the reference code 1xxx 2626? v No Continue with the next step. v Yes: Replace the following, if present: a. Ethernet card (Un-P1-C7)This ends the procedure. 8. Is the reference code 1xxx 2632? v No Continue with the next step. v Yes: Replace the following, one at a time, and in the order listed: a. VRM2 for processor 1 (Un-P1-C18) b. System backplane (Un-P1)This ends the procedure. 9. Is the reference code 1xxx 2642? v No Continue with the next step. v Yes: Replace the following, one at a time, and in the order listed: a. VRM2 for processor 2 (Un-P1-C20) b. System backplane (Un-P1)This ends the procedure.PWR1920 Use this procedure to verify that the lights on the server control panel and the display panel on all attached I/O expansion units are operating correctly. For important safety information before continuing with this procedure, see “Power isolation procedures” on page 128. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Activate the lamp test by performing one of the following: v Select function 04 lamp test on the control panel and press Enter. v Sign on to ASMI and click System Configuration -> Service Indicators -> Lamp Test.2. Look at the server control panel and the display panels on all attached I/O towers. The lamp test is active only for 25 seconds after you press Enter. Check the following lights on the server control panel and all I/O enclosures and towers: v Power-on light. v Attention light. v All dots for the 32 character display.Are all the lights on the control panel and the I/O display panels on? v No: Go to step 4. v Yes: These control panel lights are working correctly. Continue with the next step.3. Are any abnormal characters or character patterns (not reference codes or normal display mode) displayed? v No: The lights are operating correctly. This ends the procedure. v Yes: Continue with the next step.4. Verify that all cables are seated correctly. If the problem persists, replace the control panel. If the problem still persists, use the following table to determine the possible causes for the lamp test failure: 160 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 39. Failing unit Failing unit FRU 8203-E4A, 9407-M15, or 9408-M25 System backplane 5088, 0588 Tower backplane assembly 5094, 5294, 8094-002 Tower backplane assembly 5095, 0595 Tower backplane assembly 7311-D11, 7314-G30, 5790 or 5796 I/O backplane 7311-D20 I/O backplane Chapter 2. Isolation Procedures 161 Router isolation procedures These procedures serve as a guide to the correct isolation procedures from the reference code tables. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Perform these procedures only when directed to do so from another procedure. “RTRIP01” on page 163Gives a link to a topic that might assist you when exchanging the I/O processor (IOP) for the system or partition console of i5/OS™. “RTRIP02” on page 163Gives a link to a topic that might assist you when diagnosing workstation IOP detected errors. “RTRIP03” on page 163Gives links to topics to assist you when diagnosing workstation IOP detected errors. “RTRIP05” on page 163Use the attached procedure when this reference code occurs for a RIO/HSL/12X loop resource, when an I/O expansion unit on the loop is powered off for a concurrent maintenance action. “RTRIP06” on page 164Use the attached procedure when this reference code occurs in a service action code (SAL). 162 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “RTRIP07” on page 165Gives a link to assist you when diagnosing a keyboard error. “RTRIP08” on page 165Gives a link to assist when the Licensed Internal Code detected an IOP programming problem.RTRIP01 Gives a link to a topic that might assist you when exchanging the I/O processor (IOP) for the system or partition console of i5/OS™. Perform “CONSL01” on page 27. RTRIP02 Gives a link to a topic that might assist you when diagnosing workstation IOP detected errors. Perform “TWSIP01” on page 234. RTRIP03 Gives links to topics to assist you when diagnosing workstation IOP detected errors. If you have a twinaxial terminal for the console, perform “TWSIP01” on page 234. Otherwise, perform “WSAIP01” on page 241. RTRIP05 Use the attached procedure when this reference code occurs for a RIO/HSL/12X loop resource, when an I/O expansion unit on the loop is powered off for a concurrent maintenance action. Note: This reference code can occur for the RIO/HSL/12X loop resource when an I/O expansion unit on the loop is powered off for a concurrent maintenance action.Note: A fiber optic cleaning kit may be required for optical RIO/HSL/12X connections. 1. Multiple B600 6982 errors may occur due to efforts to retry and recover. If the recovery efforts were successful, there will be a B600 6985 reference code with xxxx 3206 in word 4 logged after all B600 6982 reference codes in the product activity log (PAL). If this is the case, close out all the B600 6982 entries. Then continue with the next step. 2. Is there a B600 6987 reference code in the service action log (SAL) logged at about the same time? Yes: Close this problem and work the B600 6987. This ends the procedure. No: Continue with the next step.3. Is there a B600 6981 reference code in the SAL logged at approximately the same time? Yes: Go to step 8 on page 164. No: Continue with the next step.4. Perform “RIOIP06” on page 34 to determine if any other systems are connected to this loop and then return here. Note: The loop number can be found in the SAL in the description for the HSL_LNK FRU.Are there other systems connect to this loop? Yes: Continue with the next step. No: Go to step 8 on page 164.5. Check for HSL failures in the SALs on the other systems before replacing parts. HSL failures are indicated by SAL entries with HSL I/O bridge and Network Interface Controller (NIC) resources. Ignore B600 6982 and B600 6984 entries. Chapter 2. Isolation Procedures 163 Are there HSL failures on other systems? Yes: Continue with the next step. No: Go to step 8.6. Repair the problems on the other systems and return to this step. After making repairs on the other systems check the PAL of this system. Is there a B600 6985, along with this loop’s resource name, that was logged after the repairs you made on the other systems? Yes: Continue with the next step. No: Go to step 8.7. For the B600 6985 reference code you found, use “Status indications for RIO/HSL” on page 15 to determine if the loop is now complete. Is the loop complete? Yes: The problem has been resolved. This ends the procedure. No: Go to step 8.8. The FRU list displayed in the SAL may be different from the failing item list given here. Use the SAL’s FRU list when it is available. Does this reference code appear in the SAL with the symbolic FRU HSL_LNK listed as a FRU? Yes: Perform “RIOIP01” on page 28. This ends the procedure. No: Exchange the FRUs listed in the SAL according to their part action codes. This ends the procedure.RTRIP06 Use the attached procedure when this reference code occurs in a service action code (SAL). Note: A fiber optic cleaning kit may be required for optical HSL connections. 1. Is the reference code in the service action log (SAL)? v Yes: Continue with the next step. v No: The reference code is informational. Use “Status indications for RIO/HSL” on page 15 to determine what the reference code means. This ends the procedure.2. This error can appear in the SAL if a tower or another system in the loop did not complete powering on before Licensed Internal Code (LIC) checked this loop for errors. Search the product activity log (PAL) for all B600 6985 reference codes logged for this loop and use “Status indications for RIO/HSL” on page 15 to determine if this error requires service. Is further service required? Yes: Continue with the next step. No: This ends the procedure.3. There may be multiple B600 6985 reference codes, with xxxx 3205 in word 4, for the same loop resource in the SAL. This is caused by attempts to retry and recover. If there is a B600 6985 reference code with xxxx 3206 or xxxx 3208 in word 4 after the above B600 6985 entries in the PAL, then the recovery efforts were successful. If this is the case, close all the B600 6985 entries for that loop resource in the SAL. Then continue with the next step. 4. Is there a B600 6981 reference code in the SAL? Yes: Close that problem and go to step 9 on page 165. No: Continue with the next step.5. Perform “RIOIP06” on page 34 to determine if any other systems are connected to this loop and then return here. 164 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Note: The loop number can be found in the SAL in the description for the HSL_LNK FRU.Are there other systems connected to this loop? Yes: Continue with the next step. No: Go to step 9.6. Check for HSL failures in the SALs on the other systems before replacing parts. HSL failures are indicated by SAL entries with HSL I/O bridge and Network Interface Controller (NIC) resources. Ignore B600 6982 and B600 6984 entries. Are there HSL failures on other systems? Yes: Continue with the next step. No: Go to step 9.7. Repair the problems on the other systems and return to this step. After making repairs on the other systems check the PAL of this system. Is there a B600 6985 reference code that was logged after the repairs you made on the other systems? Yes: Continue with the next step. No: Go to step 9.8. For the B600 6985 log you found, use “Status indications for RIO/HSL” on page 15 to determine if the loop is now complete. Is the loop complete? v Yes: The problem has been resolved. This ends the procedure. v No: Go to step 9.9. The FRU list displayed in the SAL may be different from the failing item list given here. Use the SAL’s FRU list when it is available. Does this reference code appear in the SAL with the symbolic FRU HSL_LNK listed as a FRU? v Yes: Perform “RIOIP01” on page 28. This ends the procedure. v No: Exchange the FRUs listed in the SAL according to their part action codes. This ends the procedure.RTRIP07 Gives a link to assist you when diagnosing a keyboard error. Perform “WSAIP01” on page 241. RTRIP08 Gives a link to assist when the Licensed Internal Code detected an IOP programming problem. Perform a system IPL. Is the IPL successful? Yes: Perform “LICIP01” on page 91 to determine the cause of the problem.This ends the procedure. No: Perform the action described in the new reference code. This ends the procedure. Chapter 2. Isolation Procedures 165 SAS isolation procedures Use the following Serial Attached SCSI (SAS) isolation procedures if an HMC is not attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “SIP3110” on page 168This procedure resolves problems when disk units are incompatible or a disk unit is missing or failed. “SIP3111” on page 169This procedure resolves the problem when two or more disk units are missing from a RAID 5 or RAID 6 disk array. “SIP3112” on page 171This procedure resolves the problem when one or more disk array members are not at the required physical locations. “SIP3113” on page 172This procedure resolves problems when a disk array is or would become exposed and parity data is out of synchronization. 166 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “SIP3120” on page 174This procedure resolves the problem when Cache data associated with attached disk units cannot be found. “SIP3121” on page 175Use this procedure to resolve the following problem: RAID adapter resources not available due to previous problems (SRC xxxx9054). “SIP3130” on page 175Use this procedure to resolve the following problem: Adapter does not support function expected by one or more disk units (SRC xxxx9008). “SIP3131” on page 177Use this procedure to resolve the following problem: Required cache data cannot be located for one or more disk units (SRC xxxx9050). “SIP3132” on page 180Use this procedure to resolve the following problem: Cache data exists for one or more missing or failed disk units (SRC xxxx9051). “SIP3134” on page 181Use this procedure to resolve the following problem: Disk unit requires format before use (SRC xxxx9092). “SIP3140” on page 183Use this procedure to resolve the following problem: Multiple adapters connected in an invalid configuration (SRC xxxx9073) “SIP3141” on page 184Use this procedure to resolve the following problem: Multiple adapters not capable of similar functions or controlling same set of devices (SRC xxxx9074) “SIP3142” on page 185Use this procedure to resolve the following configuration error: incorrect connection between cascaded enclosures (SRC xxxx4010). “SIP3143” on page 187Use this procedure to resolve the following configuration error: connections exceed adapter design limits (SRC xxxx4020). “SIP3144” on page 188Use this procedure to resolve problems with multipath connections. “SIP3145” on page 191Use this procedure to resolve the following problem: Unsupported enclosure function detected (SRC xxxx4110). “SIP3146” on page 193Use this procedure to resolve the configuration error: Incomplete multipath connection between enclosures and device detected (SRC xxxx4041). “SIP3147” on page 194Use this procedure to resolve the following problem: Missing remote adapter (SRC xxxx9076) “SIP3148” on page 195Use this procedure to resolve the following problem: Attached enclosure does not support required multipath function (SRC xxxx4050). “SIP3149” on page 197Use this procedure to resolve the following problem: Incomplete multipath connection between adapter and remote adapter (SRC xxxx9075) “SIP3150” on page 197Use this procedure to perform serial attached SCSI (SAS) fabric problem isolation. “SIP3152” on page 199Use this procedure to resolve possible failed connection problems Chapter 2. Isolation Procedures 167 “SIP3153” on page 201Go to SIP3152.SIP3110 This procedure resolves problems when disk units are incompatible or a disk unit is missing or failed. The following SRCs are possible: v SRC xxxx9025. Indicates that an incompatible disk unit is installed at the disk unit location that caused the array to be exposed. v SRC xxxx9030. Indicates that a disk array is exposed due to a missing or failed disk unit. v SRC xxxx9032. Indicates that a disk unit in a disk array is missing or failed, but the array is still protected.If you received SRC xxxx9030 or xxxx9032, one of the following occurred: v A disk unit has failed and the RAID array protection is exposed or will become exposed if another disk unit fails because no hot spare disk unit was available to replace it. If the array is exposed, then the array will continue to be exposed until the disk unit has been replaced and a manual rebuilding of the array has been started. v A disk unit has failed in a RAID array, but a hot spare was used to automatically start rebuilding the array. Replace, format, and configure the failed disk unit as a hot spare. Note: If the previous hot spare disk unit was a larger capacity than the failed disk unit, ensure that the customer understands that the replacement disk unit might not provide adequate hot spare coverage for all of the arrays under this adapter.1. Is the device location information for this SRC available in the service action log (see “Using the Service Action Log” on page 609 for details)? No: Continue with the next step. Yes: Exchange the disk unit. See “Disk Drive” on page 379. This ends the procedure.2. Identify the affected adapter and disk units by examining the product activity log. Perform one of the following to access system service tools (SST) or dedicated service tools (DST): v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640.3. Perform the following steps: a. Access the product activity log and display the SRC that sent you here. b. Press the F9 key for address information. This is the adapter address. c. Continue with the next step.4. Perform the following steps: a. Return to the SST or DST main menu. b. Select Work with disk units ? Display disk configuration ? Display disk configuration status. c. On the Display disk configuration status display, look for the devices attached to the adapter that is identified in step 3.Is there a device that has a status of RAID 5/Unknown, RAID 6/Unknown, RAID 5/Failed, or RAID 6/Failed? No: Continue with step 7 on page 169. Yes: Continue with the next step. 168 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Find the device that has a status of RAID 5/Unknown, RAID 6/Unknown, RAID 5/Failed, or RAID 6/Failed. This is the device that is causing the problem. Show the device address by selecting Display Disk Unit Details ? Display Detailed Address. Record the device address. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or of the expansion unit and find the following items: v The slot that is identified by the direct select address of the adapter v The disk unit location that is identified by the device address6. Have you determined the location of the adapter and disk unit that is causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Exchange the disk unit that is causing the problem. See “Disk Drive” on page 379. This ends the procedure.7. Press the function key to cancel and to return to the Display Disk Configuration menu, then do the following: a. Select Display disk hardware status. b. Find a device that is either Not operational or Read/write protected. c. Display details for the device and get the location of the failed disk unit. d. Exchange the disk unit and configure it as a hot spare. Go to “Disk Drive” on page 379. This ends the procedure.SIP3111 This procedure resolves the problem when two or more disk units are missing from a RAID 5 or RAID 6 disk array. The following SRCs are possible: v xxxx9020 v xxxx9021 v xxxx90221. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0. If a type D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Chapter 2. Isolation Procedures 169 Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addresses2. Perform one of the following options (listed in order of preference): Option 1 Power off the system or partition and install the identified disk units in the correct physical locations (that is the expected addresses) in the system. This ends the procedure. Option 2 Stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a type D IPL? No: Select Work with disk configuration ? Work with device parity protection. Then, continue with substep 2c. Yes: Continue with substep 2c.c. Select Stop device parity protection. d. Follow the online instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: Go to Starting a Service Call. This ends the procedure. Yes: This ends the procedure.Option 3 If the data on the disk units is not needed, initialize and format the remaining members of the disk array by performing the following steps:Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure. 170 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3112 This procedure resolves the problem when one or more disk array members are not at the required physical locations. The possible SRC is SRC xxxx9023. 1. Identify the affected adapter and disk units that are not at their required locations by examining the Product Activity Log. Perform the following steps: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. Record all devices that are not at their required locations. These are the array members that have a current address and an expected address that do not match. A current address of 0 is acceptable, and no action is needed to correct it for a known failed drive in the array. If a type D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. Record all devices that are not at their required locations. These are the array members that have a current address and an expected address that do not match. A current address of 0 is acceptable, and no action is needed to correct it for a known failed drive in the array.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addresses2. Perform only one of the following options (listed in order of preference): Chapter 2. Isolation Procedures 171 Option 1 Power off the system or partition and install the identified disk units in the correct physical locations (that is the expected addresses) in the system. This ends the procedure. Option 2 Stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a type D IPL? No: Select Work with disk configuration ? Work with device parity protection. Then, continue with substep 2c. Yes: Continue with the next substep 2c.c. Select Stop device parity protection. d. Follow the online instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: Go to Starting a Service Call. This ends the procedure. Yes: This ends the procedure.Option 3 If the data on the disk units is not needed, initialize and format the remaining members of the disk array by performing the following steps:Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3113 This procedure resolves problems when a disk array is or would become exposed and parity data is out of synchronization. The possible SRC is xxxx9027. 172 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 1. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following steps: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. Record all devices that are missing from the disk array. These are the array members that have both a current address of 0 and an expected address that is not 0.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addresses2. Have the adapter or disk units been physically moved recently? No: Contact your hardware service provider. This ends the procedure. Yes: Continue with the next step.3. Perform one of the following two options (listed in order of preference): Option 1 Power off the system or partition and restore the I/O adapter and disk units back to their original configuration. This ends the procedure. Option 2 Stop the disk array that contains the missing devices. Attention: Customer data might be lost. Perform the following: a. If you are not already using dedicated service tools, perform an IPL to DST. See “Performing an IPL to DST” on page 640. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. b. Select Work with disk units. Did you get to DST with a type D IPL? Chapter 2. Isolation Procedures 173 No: Select Work with disk configuration ? Work with device parity protection. Then, continue with substep 3c. Yes: Continue with substep 3c.c. Select Stop device parity protection. d. Follow the online instructions to stop device parity protection. e. Perform an IPL from disk.Does the IPL complete successfully? No: Go to Starting a Service Call. This ends the procedure. Yes: This ends the procedure.Option 3 If the data on the disk units is not needed, initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3120 This procedure resolves the problem when Cache data associated with attached disk units cannot be found. The possible SRC is xxxx9010 1. Has the server been powered off for several days? No Go to step 2. Yes Go to step 5 on page 175.2. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: Note: Label all parts (original and new) before moving them. v The cache directory card from the original storage adapter. Refer to Replacing the cache directory card. v The removable cache card from the original storage adapter. This applies only to certain adapters that have a removable cache card.3. Has a new SRC of xxxx9010 or xxxx9050 occurred? 174 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No Go to step 6. Yes Go to step 44. Was the new SRC xxxx9050? No The new SRC was xxxx9010. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636.Attention: Data might be lost. When an auxiliary cache adapter connected to the RAID adapter logs a xxxx9055 SRC in the hardware error log, the reclaim process does not result in lost sectors. Otherwise, the reclaim process does result in lost sectors.Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost and the system operator might want to restore data after this procedure is completed.Go to step 6. Yes: Contact your hardware service provider. This ends the procedure.5. If the server has been powered off for several days after an abnormal power-down, the cache battery pack might be depleted. Do not replace the adapter or the cache battery pack. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636. Attention: Data might be lost. When an auxiliary cache adapter connected to the RAID adapter logs a xxxx9055 SRC in the hardware error log, the reclaim process does not result in lost sectors. Otherwise, the reclaim process does result in lost sectors.Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost and the system operator might want to restore data after this procedure is completed.This ends the procedure. 6. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: v The cache directory card from the new storage adapter. Refer to Replacing the cache directory card. v The removable cache card from the new storage adapter. This only applies to certain adapters which have a removable cache card.This ends the procedure.SIP3121 Use this procedure to resolve the following problem: RAID adapter resources not available due to previous problems (SRC xxxx9054). Power off the system and remove all new or replacement disk units. IPL the system to DST. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. Look for Product Activity Log entries for other reference codes and take action on them. This ends the procedure. SIP3130 Use this procedure to resolve the following problem: Adapter does not support function expected by one or more disk units (SRC xxxx9008). 1. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following: a. Access SST or DST. Chapter 2. Isolation Procedures 175 v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Have the adapter or disk units been physically moved recently, or were the disk units previously used by the AIX or Linux operating system? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.3. Perform one of the following options (listed in order of preference): Option 1 Power off the system or partition and restore the adapter and disk units back to their original configuration. This ends the procedure. 176 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Option 2 If the data on the disk units is not needed, initialize and format the disk units by performing the following steps: Attention: Data on the disk unit will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3131 Use this procedure to resolve the following problem: Required cache data cannot be located for one or more disk units (SRC xxxx9050). 1. Did you just exchange the adapter as the result of a failure? No: Go to step 3. Yes: Attention: a. The failed adapter that you have just exchanged contains cache data that is required by the disk units that were attached to that adapter. If the adapter that you just exchanged is failing intermittently, reinstalling it and performing an IPL of the system might allow the data to be successfully written to the disk units. After the cache data is written to the disk units and the system is powered off normally, the adapter can be replaced without data being lost. Otherwise, continue with this procedure. b. Label all parts (old and new) before moving them.Go to step 2. 2. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: v The cache directory card from the original storage adapter. Refer to “Replacing the cache directory card.” v The removable cache card from the original storage adapter. This applies only to certain adapters that have a removable cache card.Go to step 8 on page 179. 3. Identify the affected adapter and disk units by examining the Product Activity Log. Perform the following: a. Access SST/DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. Chapter 2. Isolation Procedures 177 v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST/DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc hHexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step. 4. Have the adapter or disk units been physically moved recently? No: Contact your hardware service provider. Yes: Go to step 5. 5. Is the data on the disk units needed for this or any other system? No: Go to step 7 on page 179. Yes: Go to step 6 6. Restore the adapter and disk units back to their original configuration. The adapter and disk units must be rejoined so that the cache data can be written to the disk units. After the cache data is written to the disk units and the system is powered off normally, the adapter or disk units can be moved to another location. This ends the procedure 178 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 7. Perform only one of the following options, listed in the order of preference: Option 1 Reclaim adapter cache storage. See Reclaiming IOP cache storage. Attention: Data on the disk array will be lost.This ends the procedure. Option 2 If the data on the disk units is not needed, initialize and format the disk units by performing the following steps: Attention: Data on the disk units will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure. 8. Has a new SRC xxxx9010 or xxxx9050 occurred? No: Go to step 10. Yes: Go to step 9. 9. Was the new SRC xxxx9050? No: The new SRC was xxxx9010. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636.Attention: Data might be lost. When an auxiliary cache adapter connected to the RAID adapter logs an xxxx9055 SRC in the hardware error log, the reclaim process does not result in lost sectors. Otherwise, the reclaim process does result in lost sectors.Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost and the system operator might want to restore data after this procedure is completed.Go to step 10. Yes: Contact your hardware service provider.10. Using the appropriate service procedures, remove the adapter. Install the new replacement storage adapter with the following parts installed on it: Chapter 2. Isolation Procedures 179 v The cache directory card from the new storage adapter. Refer to Replacing the cache directory card. v The removable cache card from the new storage adapter. This only applies to certain adapters that have a removable cache card.This ends the procedure.SIP3132 Use this procedure to resolve the following problem: Cache data exists for one or more missing or failed disk units (SRC xxxx9051). The possible causes are: v One or more disk units have failed on the adapter. v One or more disk units were either moved concurrently or were removed after an abnormal power off. v The adapter was moved from a different system or a different location on this system after an abnormal power off. v The cache of the adapter was not cleared before it was shipped to the customer.1. Identify the affected adapter and disk units by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for 180 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Are there other disk unit or adapter errors that have occurred at approximately the same time as this error? No: Go to step 3. Yes: Go to step 6.3. Is the data on the disk units (and thus the cache data for the disk units) needed for this or any other system? No: Go to step 7. Yes: Go to step 44. Have the adapter card or disk units been physically moved recently? No: Contact your hardware service provider. Yes: Go to step 5.5. Restore the adapter and disk units back to their original configuration. The adapter and disk units must be rejoined so that the cache data can be written to the disk units. After the cache data is written to the disk units and the system is powered off normally, the adapter or disk units can be moved to another location. This ends the procedure. 6. Take action on the other errors that have occurred at the same time as this error. This ends the procedure. 7. Reclaim adapter cache storage. See “Reclaiming IOP cache storage” on page 636. Attention: Data will be lost. This ends the procedure.SIP3134 Use this procedure to resolve the following problem: Disk unit requires format before use (SRC xxxx9092). The possible causes are: v Disk unit is a previously failed disk unit from a disk array and was automatically replaced by a hot spare disk unit. v Disk unit is a previously failed disk unit from a disk array and was removed and later reinstalled on a different adapter or different location on this adapter. v Appropriate service procedures were not followed when replacing disk units or reconfiguring the adapter, such as not performing a normal power off of the system prior to reconfiguring disk units and adapters. v Disk unit is a member of a disk array, but was detected subsequent to the adapter being configured. v Disk unit has multiple or complex configuration problems.1. Identify the affected adapter and disk units by examining the product activity log. Perform the following: a. Access SST or DST. Chapter 2. Isolation Procedures 181 v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Device Errors Detected field indicates the total number of disk units that are affected. The Device Errors Logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Device Errors detected field indicates the total number of disk units that are affected. The Device Errors logged field indicates the number of disk units for which detailed information is provided. Under the Device heading, the unit address, type, serial number, and Worldwide ID are provided for up to three disk units. Additionally, the adapter type, serial number, and Worldwide ID for each of these disk units indicates the adapter to which the disk was last attached when it was operational.c. Determine the location of the adapter and the devices that are causing the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit, or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The disk unit locations that are identified by the unit addressesHave you determined the location of the adapter and the devices that are causing the problem? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Are there other disk unit or adapter errors that have occurred at about the same time as this error? No: Go to step 3. Yes: Go to step 5 on page 183.3. Have the adapter card or disk units been physically moved recently? No: Go to step 4. Yes: Go to step 6 on page 183.4. Is the data on the disk units needed for this or any other system? 182 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) No: Go to step 7. Yes: Go to step 6.5. Take action on the other errors that have occurred at the same time as this error. This ends the procedure. 6. Perform one of the following options that is most applicable to your situation: Option 1 Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Take action for any other errors that are now occurring. This ends the procedure. Option 2 Power off the system or partition and restore the adapter and disk units to their original configuration. This ends the procedure. Option 3 Remove the disk units from this adapter. This ends the procedure.7. Do the following to format the disk units: Attention: All data on the disk units will be lost. If a type D IPL was not performed to get to SST or DST: a. Access SST or DST. b. Select Work with disk units ? Work with disk unit recovery ? Disk unit problem recovery procedures. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.If a type D IPL was performed to get to DST: a. Access DST. b. Select Work with disk units. c. Select Initialize and format disk unit for each disk unit. When the new disk unit is initialized and formatted, the display shows that the status is complete. This might take 30 minutes or much longer depending on the capacity of the disk unit. The disk unit is now ready to be added to the system configuration. This ends the procedure.SIP3140 Use this procedure to resolve the following problem: Multiple adapters connected in an invalid configuration (SRC xxxx9073) The possible causes are: v Incompatible adapters are connected to each other. This includes invalid adapter combinations such as the following: – Adapters with different write cache sizes – One adapter is not supported by i5/OS – An adapter that does not support auxiliary cache is connected to an auxiliary cache adapter – An adapter that supports Dual Storage IOAs is connected to another adapter which does not have the same support – Greater than 2 adapters are connected for Dual Storage IOAs Chapter 2. Isolation Procedures 183 – Adapter code levels are not up to date or are not at the same level of functionalityv One adapter, of a connected pair of adapters, is not operating under the i5/OS operating system. Connected adapters must both be controlled by i5/OS. Additionally, both adapters must be in the same system partition. v Adapters connected for Dual Storage IOAs are not cabled correctly. Each type of Dual Storage IOA configuration requires specific cables be used in a supported manner.Determine which of the possible causes applies to the current configuration and take the appropriate actions to correct it. If this does not correct the error, contact your hardware service provider. This ends the procedure. SIP3141 Use this procedure to resolve the following problem: Multiple adapters not capable of similar functions or controlling same set of devices (SRC xxxx9074) 1. This error relates to adapters connected in a Dual Storage IOA configuration. To obtain the reason or description for this failure, you must find the formatted error information in the Product Activity Log. This should also contain information about the connected adapter. Perform the following: a. Access SST/DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a D IPL was NOT performed to get to SST/DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for ?Address Information?. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. The Problem description field indicates the type of problem. The type, serial number, and Worldwide ID of the connected adapter is also available. If a D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB hexadecimal offsets 4C and 4D Cc hexadecimal offset 51 bb hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Problem description field indicates the type of problem. The type, serial number, and Worldwide ID of the connected adapter is also available.2. Find the problem description and information for the connected adapter (remote adapter) shown in the error log, and perform the action listed for the reason in the following table: 184 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 40. RAID array reason for failure Problem description Full description Action Adapter on which to perform the action Secondary does not support RAID level being used by primary. Secondary adapter detected that the primary has a RAID array with a RAID level that the secondary does not support. Customer needs to upgrade the type of secondary adapter or change the RAID level of the array on the primary to a level that is supported by the secondary. Physically change the type of adapter that logged the error. Change RAID level on primary adapter (remote adapter indicated in the error log). Secondary does not support disk unit function being used by primary. Secondary adapter detected a device function that it does not support. Customer might need to upgrade the adapter code or upgrade the type of secondary adapter. Adapter that logged the error. Secondary is unable to find devices found by the primary. Secondary adapter cannot discover all the devices that the primary has. Verify the connections to the devices from the adapter logging the error. View the disk units under each adapter using HSM to determine the SAS port with the problem. Adapter that logged the error. Secondary found devices not found by the primary. Secondary adapter has discovered more devices than the primary. After this error is logged, an automatic failover will occur. Verify the connections to the devices from the remote adapter as indicated in the error log. View the disk units under each adapter using HSM to determine the SAS port with the problem. Remote adapter indicated in the error log. Secondary port not connected to the same numbered port on primary. SAS connections from the adapter to the devices are incorrect. Common disk expansion drawers must be connected to the same numbered SAS port on both adapters. Verify connections and re-cable SAS connections as necessary. Either adapter. Primary lost contact with disk units accessible by secondary. Link failure from primary adapter to devices. An automatic failover will occur. Verify cable connections from the adapter which logged the error. Possible disk expansion drawer failure. Adapter that logged the error. Other Not currently defined. Contact your hardware service provider. This ends the procedure.SIP3142 Use this procedure to resolve the following configuration error: incorrect connection between cascaded enclosures (SRC xxxx4010). The possible causes are: v Incorrect cabling of cascaded device enclosures v Use of an unsupported device enclosure Chapter 2. Isolation Procedures 185 Considerations: v To prevent hardware damage, power off the system, partition, or card slot as appropriate, before connecting and disconnecting cables or devices.1. Identify the affected adapter and its port by examining the product activity log. Perform the following steps: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional Information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstockHave you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, SA76-0091. If unsupported device enclosures are attached, then either remove or replace them with supported device enclosures. 3. Perform the following to cause the adapter to rediscover the devices and connections: 186 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.SIP3143 Use this procedure to resolve the following configuration error: connections exceed adapter design limits (SRC xxxx4020). The possible causes are: v Unsupported number of cascaded device enclosures v Improper cabling of cascaded device enclosuresConsiderations: v To prevent hardware damage, power off the system, partition, or card slot as appropriate, before connecting and disconnecting cables or devices.1. Identify the affected adapter and its port by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through Chapter 2. Isolation Procedures 187 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstockHave you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Reduce the number of cascaded device enclosures. Device enclosures can only be cascaded one level deep, and only in certain configurations. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, SA76-0091. 3. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.SIP3144 Use this procedure to resolve problems with multipath connections. This procedure is used to resolve the following configuration errors: v Configuration error, incorrect multipath connection (SRC xxxx4030) v Configuration error, incomplete multipath connection between adapter and enclosure detected (SRC xxxx4040)The possible causes are: v Incorrect cabling to device enclosure. Note: Pay special attention to the requirement that a Y0-cable, YI-cable, or X-cable must be routed along the right side of the rack frame (as viewed from the rear) when connecting to a disk expansion drawer. Review the device enclosure cabling and correct the cabling as required. To see example device configurations with serial attached SCSI (SAS) cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, SA76-0091. v A failed connection caused by a failing component in the SAS fabric between, and including, the adapter and device enclosure.Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have SAS, PCI-X, and PCIe bus interface logic integrated onto the system boards and use a pluggable RAID Enablement Card (a non-PCI form factor card) for these SAS, PCI-X, and PCIe buses. For these configurations, replacement of the RAID Enablement Card is unlikely to solve a SAS related problem because the SAS interface logic is on the system board. 188 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure. v Some configurations involve a SAS adapter connecting to internal SAS disk enclosures within a system using a cable card. Keep in mind that when the procedure refers to a device enclosure, it could be referring to the internal SAS disk slots or media slots. Also, when the procedure refers to a cable, it could include a cable card. v When using SAS adapters in a Dual Storage IOA configuration, ensure that the actions taken in this procedure are against the primary adapter (that is, not the secondary adapter).Attention: When SAS fabric problems exist, replacing RAID adapters is not recommended without assistance from your service provider. Because the adapter might contain non-volatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter. 1. Was the SRC xxxx4030? No: Go to step 5 on page 190. Yes: Go to step 2.2. Identify the affected adapter and its port by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock. If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb hHexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock. Chapter 2. Isolation Procedures 189 c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstockHave you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.3. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see Serial attached SCSI cable planning, in the Site and Hardware Planning Guide, SA76-0091. 4. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure.5. The SRC is xxxx4040. Determine if a problem still exists for the adapter that logged this error by examining the SAS connections as follows: a. On the System Service Tools (SST) screen, select Start a Service Tool then press Enter. b. Select Display/Alter/Dump. c. Select Display/Alter storage. d. Select Licensed Internal Code (LIC) data. e. Select Advanced Analysis. f. Type FABQUERY on the entry line and then select it with option 1. g. On the Specify Advanced Analysis Options screen, type -SUB 01 -IOA DCxx -DSP 0 in the Options field, where DCxx is the adapter resource name. Press Enter. Note: More information is available by returning to the Specify Advanced Analysis Options screen and typing -SUB 01 -IOA DCxx -DSP 2 in the Options field, where DCxx is the adapter resource name. Press Enter.Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.6. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to re-IPL the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.Note: At this point, ignore any problems found and continue with the next step. 7. Determine if the problem still exists for the adapter that logged this error by examining the SAS connections by performing the actions in step 5 again. Do all expected devices appear in the list and are all paths marked as Operational? NO Go to step 8 on page 191. YES This ends the procedure. 190 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 8. Because the problem persists, some corrective action is needed to resolve the problem. Proceed by doing the following: Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, proceed to the next one in the list. v Reseat cables if present on adapter and device enclosure. Perform the following: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from the adapter to the device enclosure. Perform the following: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following steps: a. Power off the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.9. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 5 on page 190 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Go to step 8. Yes: This ends the procedure.SIP3145 Use this procedure to resolve the following problem: Unsupported enclosure function detected (SRC xxxx4110). The possible causes are: v Device enclosure or adapter code levels are not up to date. v Unsupported type of device enclosure or device.Considerations: To prevent hardware damage or erroneous diagnostic results, remove power from the system as appropriate before connecting and disconnecting cables or devices. 1. Identify the affected adapter and its port by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. Chapter 2. Isolation Procedures 191 v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter that is reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There might be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstock Have you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. Check for the latest PTFs for the device enclosure or adapter and apply them. If unsupported device enclosures or devices are attached, then either remove or replace them with supported device enclosures or devices. Review the device enclosure cabling and correct the cabling as required for the device or device enclosure attached to the identified adapter port. To see example device configurations with SAS cabling, see Serial attached SCSI cable planning, in the Site and Hardware Planning Guide, SA76-0091. 3. Perform the following to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor. Did the error recur? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure. 192 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) SIP3146 Use this procedure to resolve the configuration error: Incomplete multipath connection between enclosures and device detected (SRC xxxx4041). The possible cause is a failed connection caused by a failing component within the device enclosure, including the device itself. Note: The adapter is not a likely cause of this problem. Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure. v Some configurations involve a serial attached SCSI (SAS) adapter connecting to internal SAS disk enclosures within a system using a cable card. Keep in mind that when the procedure refers to a device enclosure, it could be referring to the internal SAS disk slots or media slots. Also, when the procedure refers to a cable, it could include a cable card. v When using SAS adapters in a Dual Storage IOA configuration, ensure that the actions taken in this procedure are against the primary adapter (that is, not the secondary adapter).Attention: Removing functioning disk units in a disk array is not recommended without assistance from your service provider. If functioning disk units are removed, a disk array might become unprotected or failed and additional problems could be created. 1. Determine the resource name of the adapter that reported the problem by performing the following: a. Access SST or DST. b. Access the product activity log and record the resource name that this error is logged against. If the resource name is an adapter resource name, use it and continue with the next step. If the resource name is a disk unit resource name, use Hardware Service Manager to determine the resource name of the adapter that is controlling this disk unit.2. Determine if a problem still exists for the adapter that logged this error by examining the SAS connections as follows: a. On the System Service Tools (SST) display, select Start a Service Tool, and then press Enter. b. Select Display/Alter/Dump. c. Select Display/Alter storage. d. Select Licensed Internal Code (LIC) data. e. Select Advanced Analysis. f. Type FABQUERY on the entry line and then select it with option 1. g. On the Specify Advanced Analysis Options screen, type -SUB 01 -IOA DCxx -DSP 0 in the Options field, where DCxx is the adapter resource name. Press Enter. Note: More information is available by returning to the Specify Advanced Analysis Options screen and typing -SUB 01 -IOA DCxx -DSP 2 in the Options field, where DCxx is the adapter resource name. Press Enter.Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.3. Perform the following steps to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. Chapter 2. Isolation Procedures 193 b. Vary on any other resources attached to the virtual I/O processor.4. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 193 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.5. Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, proceed to the next one in the list. v Review the device enclosure cabling and correct the cabling as required. To see example device configurations with SAS cabling, see Serial attached SCSI cable planning, in the Site and Hardware Planning Guide, SA76-0091. v Reseat the cables, if present, on the adapter and the device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from the adapter to the device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following steps: a. Power off the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.6. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 193 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Go to step 5. Yes: The error condition no longer exists. This ends the procedure..SIP3147 Use this procedure to resolve the following problem: Missing remote adapter (SRC xxxx9076) 1. An adapter attached in either an Auxiliary Cache or Dual Storage IOA configuration was not discovered in the allotted time. To obtain additional information about the configuration involved, locate the formatted log in the Product Activity Log. a. Access SST/DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. 194 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the Product Activity Log and record address information. If a D IPL was NOT performed to get to SST/DST: The log information is formatted. Access the Product Activity Log and display the SRC that sent you here. Press the F9 key for ?Address Information?. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the ?Additional Information? to record the formatted log information. The ?Type of adapter connection? field indicates the type of configuration involved.If a D IPL was performed to get to DST: The log information is not formatted. Access the Product Activity Log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB hexadecimal offsets 4C and 4D Cc hexadecimal offset 51 bb hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The ?Type of adapter connection? field indicates the type of configuration involved.2. Determine which of the following is the cause of your specific error and take the appropriate actions listed. If this does not correct the error, contact your hardware service provider. The possible causes are: v An attached adapter for the configuration is not installed or is not powered on. Some adapters are required to be part of a Dual Storage IOA Configuration. Ensure that both adapters are properly installed and powered on. v If this is an Auxiliary Cache or Dual Storage IOA configuration, then both adapters may not be in the same partition. Ensure that both adapters are assigned to the same partition. v An attached adapter does not support the desired configuration. v An attached adapter for the configuration is failed. Take action on the other errors that have occurred at the same time as this error. v Adapter code levels are not up to date or are not at the same level of functionality. Ensure that the code for both adapters is at the latest level.Note: The adapter that is logging this error will run in a performance degraded mode, without caching, until the problem is resolved. This ends the procedure.SIP3148 Use this procedure to resolve the following problem: Attached enclosure does not support required multipath function (SRC xxxx4050). The possible cause is the use of an unsupported device enclosure. Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage.1. Identify the adapter and adapter port associated with the problem by examining the product activity log. Perform the following steps: a. Access SST or DST. Chapter 2. Isolation Procedures 195 v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and record address information. If a type D IPL was not performed to get to SST or DST: The log information is formatted. Access the product activity log and display the SRC that sent you here. Press the F9 key for address information. This is the adapter address. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information to record the formatted log information. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.If a type D IPL was performed to get to DST: The log information is not formatted. Access the product activity log and display the SRC that sent you here. The direct select address (DSA) of the adapter is in the format BBBB-Cc-bb: BBBB Hexadecimal offsets 4C and 4D Cc Hexadecimal offset 51 bb Hexadecimal offset 4FIn order to interpret the hexadecimal information to get device addresses, see “More information from hexadecimal reports” on page 620. The Adapter Port field indicates the port on the adapter reporting the problem. There may be more than one port listed because multiple ports map to the same physical connector. For example, ports 0 through 3 map to the first physical connector, 4 through 7 map to the second physical connector, and so on. The port numbers are labeled on the adapter tailstock.c. Determine the location of the adapter that reported the problem. See Chapter 3, “Locating FRUs,” on page 335 and find the diagram of the system unit or the expansion unit. Then find the following items: v The card slot that is identified by the direct select address (DSA) v The physical connector identified by the port number found on the adapter tailstock Have you determined the location of the adapter and its port? No: Ask your next level of support for assistance. This ends the procedure. Yes: Continue with the next step.2. If unsupported device enclosures are attached to the identified adapter port, then either remove or replace them with supported device enclosures. 3. Perform the following steps to cause the adapter to rediscover the devices and connections: a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor. Did the error recurr? No: This ends the procedure. Yes: Contact your hardware service provider. This ends the procedure. 196 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) SIP3149 Use this procedure to resolve the following problem: Incomplete multipath connection between adapter and remote adapter (SRC xxxx9075) The possible cause is incorrect cabling between SAS RAID adapters. Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. Review the device enclosure cabling and correct the cabling as required. To see example device configurations with SAS cabling, see ?Serial attached SCSI cable planning,? in the Site and Hardware Planning Guide, order number SA76-0091. This ends the procedure. SIP3150 Use this procedure to perform serial attached SCSI (SAS) fabric problem isolation. Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have SAS, PCI-X, and PCIe bus interface logic integrated onto the system boards and use a pluggable RAID Enablement Card (a non-PCI form factor card) for these SAS, PCI-X, and PCIe buses. For these configurations, replacement of the RAID Enablement Card is unlikely to solve a SAS related problem because the SAS interface logic is on the system board. v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure.Attention: v Before you replace a RAID adapter. Because the adapter might contain nonvolatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter. v Before you remove functioning disk units in a disk array. A disk array might become unprotected or might fail and additional problems might be created if functioning disk units are removed from a disk array. 1. Was the SRC xxxx3020? No Go to step 3. Yes Go to step 2. 2. The possible causes are: v More devices are connected to the adapter than the adapter supports. Change the configuration to reduce the number of devices below what is supported by the adapter. v A SAS device has been incorrectly moved from one location to another. Either return the device to its original location or move the device while the adapter is powered off. v A SAS device has been incorrectly replaced by a SATA device. A SAS device must be used to replace a SAS device.This ends the procedure. 3. Determine the status of the disk units in the array by doing the following steps: a. Access the product activity log and display the SRC that sent you here. b. Press the F9 key for address information. This is the adapter address. Chapter 2. Isolation Procedures 197 c. Return to the SST or DST main menu. d. Select Work with disk units ? Display disk configuration ? Display disk configuration status. e. On the Display disk configuration status screen, look for the devices attached to the adapter that was identified.Is there a device that has a status of ?RAID 5/Unknown?, ?RAID 6/Unknown?, ?RAID 5/Failed?, or ?RAID 6/Failed?? No Go to step 5. Yes Go to step 4 4. Other errors should have occurred related to the disk array having degraded protection. Take action on these errors to replace the failed disk unit and restore the disk array to a fully protected state. This ends the procedure. 5. Have other errors occurred at the same time as this error? No Go to step 7. Yes Go to step 6 6. Take action on the other errors that have occurred at the same time as this error. This ends the procedure. 7. Was the SRC xxxxFFFE? No Go to step 10. Yes Go to step 8. 8. Check for the latest PTFs for the device, device enclosure, and adapter and apply them. Did you find and apply a PTF? No Go to step 10. Yes Go to step 9. 9. This ends the procedure. 10. Identify the adapter and adapter port associated with the problem by examining the product activity log. Perform the following: a. Access SST or DST. v If you can enter a command at the console, access system service tools (SST). See “System Service Tools (SST)” on page 611. v If you cannot enter a command at the console, perform an IPL to DST. See “Performing an IPL to DST” on page 640. v If you cannot perform a type A or B IPL, perform a type D IPL from removable media.b. Access the product activity log and display the SRC that sent you here. Record the adapter address and the adapter port by doing one of the following: v If the SRC is xxxxFFFE, press the F9 key for address information. The adapter address is the bus, board, card information. The port is shown in the I/O bus field. Convert the port value from decimal to hexadecimal. v Press the F9 key for address information. The adapter address is the bus, board, card information. Then, press F12 to cancel and return to the previous screen. Then press the F4 key to view the additional information, if available. The adapter port is characters 1 and 2 of the unit address. For example, if the unit address is 123456FF, the port would be 12. v Go to “Hexadecimal Product Activity Log data” on page 617 to obtain the address information. The adapter address is the bus, board, card information. The adapter port is characters 1 and 2 of the unit address. For example, if the unit address is 123456FF, the port would be 12.11. Use the adapter address to find the location of the adapter (see Chapter 3, “Locating FRUs,” on page 335). On the tailstock of the adapter, find the port identified in the previous step. This is the port that is used to attach the device, or device enclosure, that is experiencing the problem. 198 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 12. Because the problem persists, some corrective action is needed to resolve the problem. Proceed by doing the following: Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, proceed to the next one in the list. v Reseat cables, if present, on adapter and device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from the adapter to the device enclosure. Perform the following: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the device. See “Disk Drive” on page 379.Note: If there are multiple devices with a path that is not Operational, then the problem is not likely to be with a device. v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following: a. Power off the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, use adapter concurrent maintenance instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.13. Does the problem still occur after performing the corrective action? No: This ends the procedure. Yes: Go to step 12.SIP3152 Use this procedure to resolve possible failed connection problems This procedure is used to resolve the following problems: v Multipath redundancy level is worse (SRC xxxx4060) v Device bus fabric error (SRC xxxx4100) v Temporary device bus fabric error (SRC xxxx4101)The possible causes are: v A failed connection caused by a failing component in the serial attached SCSI (SAS) fabric between, and including, the adapter and device enclosure. v A failed connection caused by a failing component within the device enclosure, including the device itself.Note: For SRC xxxx4060, the failed connection was previously working, and may have already recovered. Chapter 2. Isolation Procedures 199 Considerations: v Power off the system, partition, or card slot before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage. v Some systems have SAS, PCI-X, and PCIe bus interface logic integrated onto the system boards and use a pluggable RAID Enablement Card (a non-PCI form factor card) for these SAS, PCI-X, and PCIe buses. For these configurations, replacement of the RAID Enablement Card is unlikely to solve a SAS-related problem because the SAS interface logic is on the system board. v Some systems have the disk enclosure or removable media enclosure integrated in the system with no cables. For these configurations the SAS connections are integrated onto the system boards and a failed connection can be the result of a failed system board or integrated device enclosure. v Some configurations involve a SAS adapter connecting to internal SAS disk enclosures within a system using a cable card. Keep in mind that when the procedure refers to a device enclosure, it could be referring to the internal SAS disk slots or media slots. Also, when the procedure refers to a cable, it could include a cable card. v When using SAS adapters in a Dual Storage IOA configuration, ensure that the actions taken in this procedure are against the primary adapter (not the secondary adapter).Attention: v When SAS fabric problems exist, replacing RAID adapters is not recommended without assistance from your service provider. Because the adapter might contain nonvolatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter. v Removing functioning disk units in a disk array is not recommended without assistance from your service provider. A disk array might become unprotected or might fail if functioning disk units are removed. The removal of functioning disk units might also result in additional problems in the disk array.1. Determine the resource name of the adapter that reported the problem by performing the following: a. Access SST or DST. b. Access the product activity log and record the resource name that this error is logged against. If the resource name is an adapter resource name, use it and continue with the next step. If the resource name is a disk unit resource name, use the Hardware Service Manager to determine the resource name of the adapter that is controlling this disk unit.2. Determine if a problem still exists for the adapter that logged this error by examining the SAS connections as follows: a. On the System Service Tools (SST) screen, select Start a Service Tool and then press Enter. b. Select Display/Alter/Dump. c. Select Display/Alter storage. d. Select Licensed Internal Code (LIC) data. e. Select Advanced Analysis. f. Type FABQUERY on the entry line and then select it with option 1. g. On the Specify Advanced Analysis Options screen, type -SUB 01 -IOA DCxx -DSP 0 in the Options field, where DCxx is the adapter resource name. Press Enter. Note: More information is available by returning to the Specify Advanced Analysis Options screen and typing -SUB 01 -IOA DCxx -DSP 2 in the Options field, where DCxx is the adapter resource name. Press Enter.Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.3. Perform the following to cause the adapter to rediscover the devices and connections: 200 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) a. Use Hardware Service Manager to perform another IPL of the virtual I/O processor that is associated with this adapter. b. Vary on any other resources attached to the virtual I/O processor.4. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 200 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Continue with the next step. Yes: The error condition no longer exists. This ends the procedure.5. Perform only one of the following corrective actions (listed in the order of preference). If one of the corrective actions has previously been attempted, then proceed to the next one in the list. v Reseat cables, if present, on adapter and device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Reseat the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the cable, if present, from adapter to device enclosure. Perform the following steps: a. Use adapter concurrent maintenance to power off the adapter slot, or power off the system or partition. b. Replace the cables. c. Use adapter concurrent maintenance to power on the adapter slot, or power on the system or partition.v Replace the device. See “Disk Drive” on page 379.Note: If there are multiple devices with a path that is not Operational, then the problem is not likely to be with a device. v Replace the internal device enclosure or refer to the service documentation for an external expansion drawer. Perform the following steps: a. Power off the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power off the adapter slot. b. Replace the device enclosure. c. Power on the system or partition. If the enclosure is external, adapter concurrent maintenance can be used instead to power on the adapter slot.v Replace the adapter. The procedure to replace the adapter can be found in “PCI adapter” on page 438. v Contact your service provider.6. To determine if the problem still exists for the adapter that logged this error, examine the SAS connections by performing the actions in step 2 on page 200 again. Do all expected devices appear in the list and are all paths marked as Operational? No: Go to step 5. Yes: The error condition no longer exists. This ends the procedure..SIP3153 Go to SIP3152. Chapter 2. Isolation Procedures 201 Service processor isolation procedures Use service processor isolation procedures if there is no HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “FSPSP01” on page 205A part vital to system function has been deconfigured. Review the system error logs for errors that call out parts relevant to each reason code. If replacing those parts does not resolve the error, use this procedure. “FSPSP02” on page 206This procedure is for boot failures that terminate very early in the boot process. “FSPSP03” on page 206A system operator or user error has occurred. “FSPSP04” on page 206A problem has been detected in the service processor firmware. “FSPSP05” on page 206The service processor has detected a problem in the platform firmware. 202 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “FSPSP06” on page 207The service processor reported a suspected intermittent problem. “FSPSP07” on page 207The time of day has been reset to the default value. “FSPSP09” on page 207A problem has been detected with a memory DIMM, but it cannot be isolated to a specific memory DIMM. “FSPSP10” on page 207The part indicated in the FRU list that follows this procedure is not valid or missing for this system’s configuration. “FSPSP11” on page 207The service processor has detected an error on the RIO/HSL port in the system unit. “FSPSP12” on page 208The DIMM FRU that was previously replaced did not correct the memory error. “FSPSP14” on page 208The service processor cannot communicate with the system firmware. The server firmware will continue to run the system and partitions while it attempts to recover communications. Server firmware recovery actions will continue for approximately 30 to 40 minutes. “FSPSP16” on page 208Save any error log and dump data and contact your next level of support for assistance. “FSPSP17” on page 208A system uncorrectable error has occurred. “FSPSP18” on page 208A problem has been detected in the platform LIC. “FSPSP20” on page 209A failing item has been detected by a hardware procedure. “FSPSP22” on page 209The system has detected that a processor chip is missing from the system configuration because JTAG lines are not working. “FSPSP23” on page 209The system needs to perform a service processor dump. “FSPSP24” on page 209The system is running degraded. Array bit steering may be able to correct this problem without replacing hardware. “FSPSP25” on page 209The server has detected an over-temperature thermal fault. “FSPSP27” on page 210An attention line has been detected as having a problem. If the FRU replaced before this procedure did not correct the problem, perform the following: “FSPSP28” on page 210The resource ID (RID) of the FRU could not be found in the vital product data (VPD) table. “FSPSP29” on page 210The system has detected that all I/O bridges are missing from the system configuration. “FSPSP30” on page 211A problem has been encountered accessing the VPD card or the data found on the VPD card has been corrupted. “FSPSP31” on page 211The service processor has detected that one or more of the required fields in the system VPD has not been initialized. “FSPSP32” on page 211A problem with the enclosure has been found. Chapter 2. Isolation Procedures 203 “FSPSP33” on page 212A problem has been detected in the connection with the HMC. “FSPSP34” on page 212The memory cards are plugged in an invalid configuration and cannot be used by the system. “FSPSP35” on page 213The system has detected a problem with a memory controller. “FSPSP38” on page 213The system has detected an error within the JTAG path. If the FRU replaced before this procedure did not correct the problem, perform the following: “FSPSP42” on page 213An error communicating between two system processors was detected. “FSPSP45” on page 214The system has detected an error with FSI path. “FSPSP46” on page 214Some corrupt areas of flash or RAM have been detected on the service processor. “FSPSP47” on page 214The system has detected an error within the PSI link. To resolve the problem, perform the following: “FSPSP48” on page 214A diagnostic function detects an external processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP49” on page 214A diagnostic function detects an internal processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP50” on page 215A diagnostic function detects a connection problem between a processor chip and a GX chip. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP51” on page 215Runtime diagnostics detected a memory bus correctable error that is exceeding the threshold. The memory bus correctable error will not cause an immediate loss of system operations. However, the system is operating in a degraded mode. “FSPSP52” on page 215A problem has been detected on a memory bus. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: “FSPSP54” on page 215A processor over-temperature has been detected. Check for any environmental issues before replacing any parts. “FSPSP55” on page 216An error has been detected on a bus between two FRUs. The endpoint FRUs have been called out. However, the source of the error could be the bus path between the FRUs. “FSPSP63” on page 216The system has experienced a power error. Please review previous error logs for power-related issues. “FSPSP64” on page 216All the processor support interface (PSI) links of the system are either nonfunctional or deconfigured, so the system cannot perform an IPL appropriately. Look for previous error logs that deconfigure hardware. “FSPSPC1” on page 217If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor. “FSPSPD1” on page 218If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor. 204 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) FSPSP01 A part vital to system function has been deconfigured. Review the system error logs for errors that call out parts relevant to each reason code. If replacing those parts does not resolve the error, use this procedure. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. Perform the following steps: 1. Is SRC B1xxB10C or B1xxB10D (xx indicates that the Subsystem ID is irrelevant)? v No: Go to step 5 v Yes: The system has detected that one of the following has occurred: – A memory controller that is required for the system to function is deconfigured. – There is not enough memory. – The memory DIMM is plugged incorrectly.Continue with the next step.2. Perform the following steps: a. Reseat all of the memory DIMMs on the system backplane but do not replace any memory DIMM at this time. b. Perform a slow boot, refer to “Performing a slow boot” on page 665.Does the problem persist? Yes: Continue with the next step. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.3. Use the following procedure to replace memory DIMMs, and perform a slow boot after each replacement until the problem is resolved. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs. a. Replace the first memory DIMM quad. b. Perform a slow boot, refer to “Performing a slow boot” on page 665.Does the problem persist? Yes: Repeat this step and replace the second memory DIMM quad. If you have replaced both of the memory DIMM quads, then continue with the next step. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.4. Use the following procedure to replace the system backplane. a. Replace the system backplane (Un-P1). b. Perform a slow boot, refer to “Performing a slow boot” on page 665.Does the problem persist? Yes: Contact your next level of support. This ends the procedure. No: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.5. Is SRC B1xxB107 or B1xxB108? v No: Continue with the next step. v Yes: The system has detected a problem with the clock module. Replace the system backplane (Un-P1). This ends the procedure.6. Is SCR B1xxB106? v No: Continue with the next step. v Yes: The system has detected that the system backplane is deconfigured. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot, refer to “Performing a slow boot” on page 665. Chapter 2. Isolation Procedures 205 c. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.7. Is SRC B1xxB110 or B1xxB111? v No: Contact your next level of support. This ends the procedure. v Yes: The system has detected that the I/O bridge is deconfigured. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot, refer to “Performing a slow boot” on page 665. c. Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.FSPSP02 This procedure is for boot failures that terminate very early in the boot process. This error path is indicated when the SRC data words are scrolling automatically through control panel functions 11, 12, and 13, and the control panel interface buttons are not responsive. Perform the following: 1. Push the white power button to reset the system and start it on the other side of platform LIC. Note: The white power button will only reset the system and attempt to reach standby. 2. Did an SRC occur after starting the system on the other side? No: Verify that the system’s firmware is at the latest level. Update the system’s firmware on the failing side if necessary.This ends the procedure. Yes: Continue with the next step.3. Is the SRC the same SRC that brought you to this procedure? v No: Return to Chapter 1, “Starting a Service Call,” on page 1 to service this new SRC. This ends the procedure. v Yes: Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335. If the problem is not resolved, contact your next level of support. This ends the procedure.FSPSP03 A system operator or user error has occurred. Refer to the documentation for the task that you were attempting to perform. FSPSP04 A problem has been detected in the service processor firmware. Refer to Operations Guide for ASMI and for Nonpartitioned Systems (SA76-0094) or Operations Guide for the Hardware Management Console and Managed Systems (SA76-0085) and verify that the system’s firmware is at the latest level. Update the system’s firmware if necessary. FSPSP05 The service processor has detected a problem in the platform firmware. Refer to the Operations Guide for ASMI and for Nonpartitioned Systems, SA76-0094 OR the Operations Guide for the Hardware Management Console and its Managed Systems, SA76-0085 and verify that the system’s firmware is at the latest level. Update the system’s firmware if necessary. 206 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) FSPSP06 The service processor reported a suspected intermittent problem. Contact your next level of support. FSPSP07 The time of day has been reset to the default value. 1. To set the time of day, refer to the systems operations guide. 2. If the problem persists, replace the TOD battery (Un-P1-E1): a. Power off the system (see “Powering on and powering off” on page 588). b. Replace the TOD battery. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions. c. Perform a slow boot. This ends the procedure.FSPSP09 A problem has been detected with a memory DIMM, but it cannot be isolated to a specific memory DIMM. 1. Replace all the memory DIMMs on the system backplane. See Chapter 3, “Locating FRUs,” on page 335 for instructions. 2. Perform a slow boot after replacing all the memory DIMMs. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP10 The part indicated in the FRU list that follows this procedure is not valid or missing for this system’s configuration. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. Perform the following to correct the problem: 1. Does word 8 (the 8 leftmost characters in the 2nd line of function 13) of the reference code end with 02 or 04? No: Go to step 3. Yes: Continue with the next step.2. The FRU that is called out after this procedure is either missing or not valid. Is that FRU installed and connected or plugged in correctly? Yes: The installed FRU is not valid. Remove that FRU. Then contact your next level of support to determine the correct FRU. This ends the procedure. No: The FRU is missing. If the FRU is present but not connected, reconnect it and perform a slow boot (see “Performing a slow boot” on page 665). Otherwise, contact your next level of support to determine the missing FRU. This ends the procedure.3. Does word 8 end with 01 or 05? No: Return to the Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure. Yes: The FRU that is listed after this procedure has the same serial number as another FRU in the system. Remove all but one of the FRUs that are listed after this procedure and then perform a slow boot (see “Performing a slow boot” on page 665). This ends the procedure.FSPSP11 The service processor has detected an error on the RIO/HSL port in the system unit. 1. Verify that the system’s firmware is at the latest level. Update the system’s firmware if necessary. Chapter 2. Isolation Procedures 207 2. If the problem persists, replace the system backplane (Un-P1). 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP12 The DIMM FRU that was previously replaced did not correct the memory error. Perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). See Chapter 3, “Locating FRUs,” on page 335 for instructions. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP14 The service processor cannot communicate with the system firmware. The server firmware will continue to run the system and partitions while it attempts to recover communications. Server firmware recovery actions will continue for approximately 30 to 40 minutes. Perform the following: 1. Record the time the log was created or when the SRC was first noticed. Continue with the next step. 2. Are progress codes being displayed on the panel? v Yes: Server firmware was able to reset the service processor. This ends the procedure. v No: Continue with the next step.3. Has an A7006995 SRC been displayed on the panel? v Yes: Partitions are being powered off and a server dump will be attempted. Follow the A7006995 SRC description if the partitions do not terminate as requested. This ends the procedure. v No: Continue with the next step.4. Has the A1xx SRC remained on the panel for more than 40 minutes? v Yes: Server firmware could not begin termination of the partitions. Contact your next level of support to assist in attempting to terminate any remaining partitions and forcing a server dump. Collect the dump for support, power off and power on the system. This ends the procedure. v No: Contact your next level of support. This ends the procedure.FSPSP16 Save any error log and dump data and contact your next level of support for assistance. FSPSP17 A system uncorrectable error has occurred. 1. Look for other serviceable events and use the call outs listed with them to correct the problem. 2. If you need to run the system in a degraded mode until you can perform the service actions, do the following: a. Power off the system (see “Powering on and powering off” on page 588). b. Power on the system (see “Powering on and powering off” on page 588) to allow the memory diagnostics to clean up the memory. Guard out any defective parts.This ends the procedure.FSPSP18 A problem has been detected in the platform LIC. 208 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Refer to the Operations Guide for the Hardware Management Console and Managed Systems, SA76-0085, or the Operations Guide for ASMI and for Nonpartitioned Systems, SA76-0094, and verify that the server firmware is at the latest level. Update the server firmware if necessary. FSPSP20 A failing item has been detected by a hardware procedure. Perform a slow boot to run full hardware diagnostics. If a new SRC occurs, repair the system using that reference code. If an incomplete occurs, go to “Accessing the ASMI” on page 646 menus to power off, check for deconfigured components and perform a slow boot. This ends the procedure. FSPSP22 The system has detected that a processor chip is missing from the system configuration because JTAG lines are not working. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP23 The system needs to perform a service processor dump. 1. Perform a service processor dump (see “Performing a platform system or service processor dump” on page 653). 2. Attempt to perform an IPL on the system. 3. Save the service processor dump to storage (see “Managing dumps” on page 653). 4. Contact your next level of support. This ends the procedure.FSPSP24 The system is running degraded. Array bit steering may be able to correct this problem without replacing hardware. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Go to “Performing a slow boot” on page 665. This ends the procedure. 3. If the problem persists, replace the FRU that is called out after this procedure. See Chapter 3, “Locating FRUs,” on page 335 for instructions. This ends the procedure.FSPSP25 The server has detected an over-temperature thermal fault. 1. Before replacing any server hardware FRU callouts, look in the error log for thermal problems related to fans, power supplies, etc. Perform all service actions for the thermal problem SRCs first before continuing with any other callouts in the current SRC. Thermal problems are associated with 1100 xxxx SRCs, where xxxx may be any of the following: v 1514 v 1524 Chapter 2. Isolation Procedures 209 v 7201 v 7203 v 7205 v 7610 v 7611 v 7620 v 7621 v 7630 v 7631 v 7640 v 76412. If no thermal-related SRCs or problems can be found, replace the server hardware FRU called out in the current SRC. See Chapter 3, “Locating FRUs,” on page 335 for instructions. This ends the procedure.FSPSP27 An attention line has been detected as having a problem. If the FRU replaced before this procedure did not correct the problem, perform the following: Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing any FRUs called out by this procedure. 1. Was the FRU called out before this procedure a memory DIMM (Un-P1-C14 through Un-P1-C17 or Un-P1-C21 through Un-P1-C24) or a GX adapter card (Un-P1-C8 or Un-P1-C6) No: This ends the procedure. Yes: Continue with the next step.2. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot. This ends the procedure.FSPSP28 The resource ID (RID) of the FRU could not be found in the vital product data (VPD) table. 1. Find another callout that reads ?FSPxxxx? where xxxx is a 4 digit hex number that represents the RID. Record the RID and the model of the system. 2. Call next level of support to find out which FRU the RID represents. 3. Replace the FRU that the RID represents. 4. Perform a slow boot to ensure full hardware diagnostics. This ends the procedure.FSPSP29 The system has detected that all I/O bridges are missing from the system configuration. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. 210 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) FSPSP30 A problem has been encountered accessing the VPD card or the data found on the VPD card has been corrupted. This error occurred before VPD collection was completed. No location codes have been created. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Power off the system (see “Powering on and powering off” on page 588) and remove ac power. 2. Clear any deconfiguration errors for the VPD card. 3. Replace the VPD card (Un-P1-C11). See Chapter 3, “Locating FRUs,” on page 335. 4. Perform a slow boot. Refer to “Performing a slow boot” on page 665). Does the problem persist? Yes: Continue with the next step. No: This ends the procedure.5. Replace the system backplane (Un-P1). This ends the procedure.FSPSP31 The service processor has detected that one or more of the required fields in the system VPD has not been initialized. 1. Log into ASMI with authorized service provider authority (see “Accessing the ASMI” on page 646). 2. Set the system VPD values (see “Setting the system enclosure type” on page 650 and “Setting the system identifiers” on page 650). Note: The service processor will automatically reset when leaving the ASMI after updating the system VPD. 3. Power on the system (“Powering on and powering off” on page 588). This ends the procedure.FSPSP32 A problem with the enclosure has been found. The problem is resulting from one of the following: v The enclosure VPD cannot be found. v The enclosure serial number is not programmed. v The enclosure feature code is not programmed.Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. Perform the following: 1. Record the reason code (the last 4 characters of word 11) from the SRC by looking at the operator panel or accessing the error log with the ASMI. 2. Is the reason code A46F? No: Go to step 6 on page 212. Yes: Continue with the next step.3. Check for and apply any server firmware updates. Does the problem persist? No: This ends the procedure. Yes: Continue with the next step.4. Power off the system (see “Powering on and powering off” on page 588). Chapter 2. Isolation Procedures 211 5. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot (see “Performing a slow boot” on page 665).Does the problem persist? No: This ends the procedure. Yes: Contact your next level of support. This ends the procedure.6. Is the reason code A460? No: Go to step 8. Yes: Continue with the next step.7. Perform the following: a. Set the enclosure serial number using the ASMI (see “Setting the system identifiers” on page 650). b. The service processor will automatically reset when leaving the ASMI after updating the serial number. c. Perform a slow boot (see “Performing a slow boot” on page 665).Does the problem persist? No: This ends the procedure. Yes: Contact your next level of support. This ends the procedure.8. Is the reason code A45F? No: Contact your next level of support. This ends the procedure. Yes: Continue with the next step.9. Perform the following: a. Set the enclosure feature code using the ASMI (see “Setting the system enclosure type” on page 650). The service processor will automatically reset when leaving the ASMI after updating the serial number. b. Perform a slow boot (see “Performing a slow boot” on page 665).).Does the problem persist? No: This ends the procedure. Yes: Contact your next level of support. This ends the procedure.FSPSP33 A problem has been detected in the connection with the HMC. 1. Ensure that the cable connectors to the network from the HMC, managed system, managed system partitions, and other HMCs are securely connected. If the connections are not secure, plug the cables back into the proper locations and make sure that the connections are good. 2. Check to see if the HMC is working correctly or if the HMC was disconnected incorrectly from the managed system, managed system partitions, and other HMCs. If either has happened, reboot the HMC. 3. Verify that the network connection between the HMC, managed system, managed system partitions, and other HMCs is working properly. 4. If applicable, service the next FRU. See Chapter 3, “Locating FRUs,” on page 335 for instructions. 5. If the problem continues to persist, contact your next level of support. This ends the procedure.FSPSP34 The memory cards are plugged in an invalid configuration and cannot be used by the system. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 212 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 1. Is the SRC B1xx C02A? Yes: A memory module is missing from the system. The additional parts in the FRU callout list will include all memory modules in the group with the missing card. To correct the error, visually check the system to determine which of these missing is missing, and add the module. This ends the procedure. No: Continue with the next step.2. Is the SRC B1xx C029? Yes: A memory module is a different type than the other memory modules in the same group. The additional parts in the FRU callout list will include all memory modules in the group that contain the error. To correct the error, exchange the memory modules of the incorrect type with those of the desired type. This ends the procedure. No: Continue with the next step.3. Is the SRC B1xx C02B? Yes: A group of memory modules are missing and are required so that other memory modules on the board can be configured. The additional parts in the FRU callout list will include all missing memory modules in the group. To correct the error, add or remove these modules to the desired locations. This ends the procedure. No: Continue with the next step.4. Is the SRC B1xx C036? Yes: A memory module is not supported by this system. The additional parts in the FRU callout list will include all memory modules in the group that contains the unsupported modules. To correct the error, remove these modules from the system or replace them with the correct type. This ends the procedure. No: Return to Chapter 1, “Starting a Service Call,” on page 1. This ends the procedure.FSPSP35 The system has detected a problem with a memory controller. Perform the following to enable redundant utilization: 1. Power off the system (see “Powering on and powering off” on page 588). 2. “Performing a slow boot” on page 665. This ends the procedure.FSPSP38 The system has detected an error within the JTAG path. If the FRU replaced before this procedure did not correct the problem, perform the following: Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing any FRUs called out by this procedure. 1. Was the FRU called out before this procedure a memory DIMM (Un-P1-C14 through Un-P1-C17 or Un-P1-C21 through Un-P1-C24) or a GX adapter card (Un-P1-C8 or Un-P1-C6)? No: No: This ends the procedure Yes: Continue with the next step.2. Perform the following: a. Replace the system backplane (Un-P1). b. Perform a slow boot. This ends the procedure.FSPSP42 An error communicating between two system processors was detected. Chapter 2. Isolation Procedures 213 Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. There is a communication error between the processors and the FRUs called out before this procedure. If you were unable to correct the problem by replacing FRUs that were previously specified before coming to this procedure, consider the possibility of failing system backplanes. 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). 3. Perform a slow boot (see “Performing a slow boot” on page 665). This ends the procedure. FSPSP45 The system has detected an error with FSI path. 1. Replace the system backplane (Un-P1) 2. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP46 Some corrupt areas of flash or RAM have been detected on the service processor. 1. Replace the system backplane (Un-P1). See Chapter 3, “Locating FRUs,” on page 335 for instructions. 2. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP47 The system has detected an error within the PSI link. To resolve the problem, perform the following: Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Replace the system backplane (Un-P1) 2. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure.FSPSP48 A diagnostic function detects an external processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP49 A diagnostic function detects an internal processor interface problem. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. 214 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) This ends the procedure. FSPSP50 A diagnostic function detects a connection problem between a processor chip and a GX chip. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions on replacing the system backplane. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP51 Runtime diagnostics detected a memory bus correctable error that is exceeding the threshold. The memory bus correctable error will not cause an immediate loss of system operations. However, the system is operating in a degraded mode. To resolve the problem, replace the FRU called out after this procedure. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions. FSPSP52 A problem has been detected on a memory bus. If replacing the FRUs previously listed in the FRU list does not fix the problem, perform the following: 1. Power off the system (see “Powering on and powering off” on page 588). 2. Replace the system backplane (Un-P1). Refer to Chapter 3, “Locating FRUs,” on page 335. 3. Perform a slow boot. Refer to “Performing a slow boot” on page 665. This ends the procedure. FSPSP54 A processor over-temperature has been detected. Check for any environmental issues before replacing any parts. 1. Is the ambient room temperature in the normal operating range (less than 35 degrees C/95 degrees F)? No: Notify the customer. The customer must lower the room temperature so that it is within the normal range. Do not replace any parts. This ends the procedure. Yes: Continue to the next step.2. Are the front and rear of the CEC drawer, and the front and rear rack doors, free of obstructions that would impede the airflow through the drawer? No: Notify the customer. The system must be free of obstructions for proper airflow. Clean the air inlets and exits in the drawer as required. Do not replace any parts. This ends the procedure. Yes: Continue to the next step.3. Are all of the fans, especially those at the back of the power supply, functioning normally? No: Replace any fans that are not turning or are turning slowly. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions. This ends the procedure. Yes: There are no environmental issues with the cooling of the processors. This ends the procedure. Chapter 2. Isolation Procedures 215 FSPSP55 An error has been detected on a bus between two FRUs. The endpoint FRUs have been called out. However, the source of the error could be the bus path between the FRUs. Refer to Chapter 3, “Locating FRUs,” on page 335 for instructions for removing and replacing any FRUs called out by the following procedure. 1. Is word 6 (the 8 leftmost characters of panel function 13) of the reference code 00000001? No: Go to the next step. Yes: Perform the following:a. The XYZ bus has an error. Replace the FRUs called out after this procedure. b. Did replacing the FRUs correct the problem? Yes: This ends the procedure. No: Perform the following:c. Replace the system backplane (Un-P1). d. Perform a slow boot. This ends the procedure.2. Is word 6 of the reference code 00000002? No: Go to the next step. Yes: Perform the following:a. The AB bus has an error. Replace the FRUs called out after this procedure. b. Did replacing the FRUs correct the problem? Yes: This ends the procedure. No: Perform the following:c. Replace the system backplane (Un-P1). d. Perform a slow boot. This ends the procedure.3. Is word 6 of the reference code 00000004 or 00000005? No: Go to the next step. Yes: Perform the following:a. The processor - GX card bus or PSI bus has an error. Replace the FRUs called out after this procedure. b. Did replacing the FRUs correct the problem? Yes: This ends the procedure. No: Perform the following:c. Replace the system backplane (Un-P1). d. Perform a slow boot. This ends the procedure.4. For any other value in word 6 of the reference code, the bus error is corrected by the FRUs called out after this procedure. Replace the FRUs called out after this procedure. This ends the procedure.FSPSP63 The system has experienced a power error. Please review previous error logs for power-related issues. FSPSP64 All the processor support interface (PSI) links of the system are either nonfunctional or deconfigured, so the system cannot perform an IPL appropriately. Look for previous error logs that deconfigure hardware. 216 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) FSPSPC1 If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor. Attention: You should periodically check the system firmware level on all your servers and update the firmware to the latest level, if appropriate. If you were directed to this procedure because the server displayed B1817201, C1001014, or C1001020, or a combination of these codes, the latest firmware can help avoid a recurrence of this problem. Even if the customer cannot update the firmware on this system at this time, all of their systems should be updated to the latest firmware level as soon as possible to help prevent this problem from occurring on other systems. Resetting the service processor on systems with a physical control panel 1. If the Advanced System Management Interface (ASMI) is available, reset the service processor using the ASMI menus. Were you able to use the ASMI menus to reset the service processor? Yes: This ends the procedure. No: Continue with the next step.2. Activate the service processor pinhole reset switch on the system’s control (operator) panel by carefully performing these steps: a. Using an insulated paper clip, unbend the paper clip so that it has a straight section about 2 inches long. b. Insert the clip straight into the hole, keeping the clip perpendicular to the plastic bezel. c. When you engage the reset switch, you should feel the detent of the switch. Pressing the reset switch resets the service processor and causes the system to shut down.3. Reboot the system in slow mode from the permanent side, using control panel function 02 or the ASMI menus, if available. 4. If the hang repeats, verify whether there is a firmware update that is available that fixes the problem; apply if available. For more information, see: v If the system is managed by an HMC, see the Operations Guide for the HMC and Managed Systems, SA76-0085. v If the system is not managed by an HMC, see the Operations Guide for the ASMI and Nonpartitioned Systems, SA76-0094.5. Choose from the following options: v If no firmware update is available, continue with the next step. v If a firmware update is available, apply it using the Service Focal Point in the Hardware Management Console (HMC), if attached.Did the update resolve the problem so that the system now boots? Yes: This ends the procedure. No: You are here because there is no HMC attached to the system, the flash update failed, or the updated firmware did not fix the hang. Continue with the next step.6. Choose from the following options: v If you are a customer, contact your authorized hardware service provider. This ends the procedure. v If you are the authorized hardware service provider, continue with the next step.7. Replace the service processor (see Chapter 3, “Locating FRUs,” on page 335). 8. If replacing the service processor does not fix the problem, contact your next level of support. This ends the procedure. Chapter 2. Isolation Procedures 217 FSPSPD1 If the system hangs after the code that sent you to this procedure appears in the control panel, perform these steps to reset the service processor. Attention: You should periodically check the system firmware level on all your servers and update the firmware to the latest level, if appropriate. If you were directed to this procedure because the server displayed B1817201, C1001014, or C1001020, or a combination of these codes, the latest firmware can help avoid a recurrence of this problem. Even if the customer cannot update the firmware on this system at this time, all of their systems should be updated to the latest firmware level as soon as possible to help prevent this problem from occurring on other systems. Resetting the service processor on systems with a physical control panel 1. If the Advanced System Management Interface (ASMI) is available, reset the service processor using the ASMI menus. Were you able to use the ASMI menus to reset the service processor? Yes: This ends the procedure. No: Continue with the next step.2. Activate the service processor pinhole reset switch on the system’s control (operator) panel by carefully performing these steps: a. Using an insulated paper clip, unbend the paper clip so that it has a straight section about 2 inches long. b. Insert the clip straight into the hole, keeping the clip perpendicular to the plastic bezel. c. When you engage the reset switch, you should feel the detent of the switch. Pressing the reset switch resets the service processor and causes the system to shut down.3. Reboot the system in slow mode from the permanent side, using control panel function 02 or the ASMI menus, if available. 4. If the hang repeats, verify whether there is a firmware update that is available that fixes the problem; apply if available. For more information, see: v If the system is managed by an HMC, see the Operations Guide for the HMC and Managed Systems, SA76-0085. v If the system is not managed by an HMC, see the Operations Guide for the ASMI and Nonpartitioned Systems, SA76-0094.5. Choose from the following options: v If no firmware update is available, continue with the next step. v If a firmware update is available, apply it using the Service Focal Point in the Hardware Management Console (HMC), if attached.Did the update resolve the problem so that the system now boots? Yes: This ends the procedure. No: You are here because there is no HMC attached to the system, the flash update failed, or the updated firmware did not fix the hang. Continue with the next step.6. Choose from the following options: v If you are a customer, contact your authorized hardware service provider. This ends the procedure. v If you are the authorized hardware service provider, continue with the next step.7. Replace the service processor (see Chapter 3, “Locating FRUs,” on page 335). 8. If replacing the service processor does not fix the problem, contact your next level of support. This ends the procedure. 218 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Tape unit isolation procedures This topic contains the procedures necessary to isolate a failure in a tape device. In these procedures, the term tape unit may be any one of the following: v An internal tape drive, including its electronic parts and status indicators v An internal tape drive, including its tray, power regulator, and AMDs v An external tape drive, including its power supply, power switch, power regulator, and AMDs You should interpret the term tape unit to mean the tape drive you are working on. However, these procedures use the terms tape drive and enclosure to indicate a more specific meaning. Read and observe all safety procedures before servicing the system and while performing the procedures in this topic. Unless instructed otherwise, always power off the system or expansion unit where the FRU is located (see Powering on and powering off) before removing, exchanging, or installing a field-replaceable unit (FRU). Chapter 2. Isolation Procedures 219 DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “TUPIP02” on page 221Use this procedure to perform the 8mm tape drive read self-test and hardware self-test. “TUPIP03” on page 223You were directed here because you may need to exchange a failing part. “TUPIP04” on page 224Use this procedure to reset an IOP and its attached tape units. Read the (overview) before continuing with this procedure. “TUPIP06” on page 228Use this procedure to isolate a Device Not Found message during installation from an alternate device. “Tape unit self-test procedure” on page 229This procedure is designed to allow you to quickly perform a complete set of diagnostic tests on a 6384 or 6387 tape unit. “Tape device ready conditions” on page 231All the conditions that are listed for the device, must be correct for the device to be ready. 220 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) TUPIP02 Use this procedure to perform the 8mm tape drive read self-test and hardware self-test. The write test is performed in “TUPIP03” on page 223. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593). 2. Press the Unload switch on the front of the 8mm tape drive. Is a data cartridge present? v No: Continue with the next step. v Yes: Attempt to remove the data cartridge. Can you remove the data cartridge? Yes: Continue with the next step. No: The tape drive is the failing part. Go to (Tape cartridge - Manual removal). After removing the data cartridge, exchange the tape drive (see Chapter 3, “Locating FRUs,” on page 335). This ends the procedure.3. Clean the tape drive by using the cleaning cartridge (part 16G8467). If the tape drive ejects the cleaning cartridge with the Disturbance light on, a new cleaning cartridge is needed. The tape drive unloads automatically when cleaning is complete. Cleaning takes up to 5 minutes. Is the Disturbance light on continuously? No: Continue with the next step. Yes: Repeat this step while using a new cleaning cartridge. If you are using a new cleaning cartridge and the Disturbance light does not go off, the possible failing part is the 8mm tape drive. Go to “TUPIP03” on page 223. This ends the procedure.4. Is the Disturbance light blinking? v No: Continue with the next step. v Yes: Choose from the following: – If the 8mm tape drive is a type 6390, the possible failing part is the 6390 tape drive. – If the 8mm tape drive is a type 7208, the possible failing parts are: a. 7208 tape drive b. Power supply c. AMD Go to “TUPIP03” on page 223. This ends the procedure.5. Perform the read self-test: Notes: a. The 8mm tape drive read self-test and hardware self-test can take up to 5 minutes to run. b. The tape drive runs the read self-test first. During the read self-test, the Read-Write light blinks, and the Disturbance and Ready lights are off.To start the read self-test and hardware self-test, insert the diagnostic cartridge (part 46G2660) into the tape drive. The following conditions indicate that the read self-test ended successfully: v The Read-Write light stops blinking. v The diagnostic cartridge ejects automatically. v The three status lights go on to indicate the start of the hardware test. Does the read self-test end successfully? v No: Does the tape drive eject the diagnostic cartridge? – Yes: Continue with the next step. – No: The tape drive is the failing part. Go to (Tape cartridge - Manual removal). After removing the diagnostic cartridge, exchange the tape drive (see Chapter 3, “Locating FRUs,” on page 335). This ends the procedure. Chapter 2. Isolation Procedures 221 v Yes: Go to step 8.6. Is the Disturbance light blinking approximately four times per second? v No: Continue with the next step. v Yes: The possible failing part is the diagnostic cartridge (part 46G2660). – If this is your first time through this step, get a new diagnostic cartridge. Go to step 3 on page 221 of this procedure to clean the tape drive again. Afterwards, run the read self-test and hardware self-test while using the new diagnostic cartridge. – If this is your second time through this step, the possible failing part is the 8mm tape drive. Go to “TUPIP03” on page 223. This ends the procedure.7. The Disturbance light is blinking approximately once per second. v If the 8mm tape drive is a type 6390, the possible failing part is the 6390 tape drive. v If the 8mm tape drive is a type 7208, the possible failing parts are: a. 7208 tape drive b. Power supply c. AMD Go to “TUPIP03” on page 223. This ends the procedure. 8. The tape drive runs the hardware self-test. During the hardware self-test, the three tape drive status lights are on for 15 to 30 seconds. The three status lights go off when the hardware self-test ends successfully. Does the hardware self-test end successfully? v Yes: Continue with the next step. v No: When the hardware self-test does not end successfully, the following conditions occur: – The three status lights do not go off. – The Ready and Read-Write lights go off. – The Disturbance light blinks approximately once per second. Possible failing parts are: – If the 8mm tape drive is a type 6390, the possible failing part is the 6390 tape drive. – If the 8mm tape drive is a type 7208, the possible failing parts are: a. 7208 tape drive b. Power supply c. AMD Go to “TUPIP03” on page 223. This ends the procedure.9. The read self-test and hardware self-test ended successfully. Was the user’s original tape identified as the probable cause of failure? v Yes: Perform the following: a. Mark and date the data cartridge to indicate that it failed with a permanent error. b. Discard this data cartridge when: – Volume statistics (if available) indicate a problem with the data cartridge. – A total of three permanent errors have occurred with the same data cartridge.c. If possible, continue operations with a new data cartridge. Go to “TUPIP03” on page 223. This ends the procedure. v No: Go to “TUPIP03” on page 223. This ends the procedure. 222 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) TUPIP03 You were directed here because you may need to exchange a failing part. The failing part was determined from one of the following: v Other problem isolation procedures v The Failing item column of the tape unit reference code table v Tape unit service guideNote: Occasionally, the system is available but not performing an alternate IPL (type D IPL). In this instance, any hardware failure of the tape unit I/O processor, or any device attached to it is not critical. With the exception of the loss of the affected devices, the system remains available. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593). 2. Do you need to exchange a possible failing device? v No: Do you need to exchange the tape unit I/O processor? No: Continue with the next step. Yes: Exchange the tape unit I/O processor (see Chapter 3, “Locating FRUs,” on page 335). When you have completed the remove and replace procedure, continue with the next step.v Yes: Perform the following: – For an internal tape unit, go to Chapter 3, “Locating FRUs,” on page 335. – For an external tape unit, go to the remove and replace procedures in the device service information. 3. Are you working with a tape unit in the system unit or in an expansion unit? v Yes: Is the system available, and can you enter commands on the command line? No: Continue with the next step. Yes: Go to step 9 on page 224.v No: Continue with the next step. 4. Display the selected IPL type. Is the displayed IPL type D? v No: Do you want to perform an alternate IPL (type D)? No: Continue with the next step. Yes: Go to step 6.v Yes: Go to step 6. 5. Perform an IPL from disk by doing the following: a. Power off the system (see Powering on and powering off). b. Select IPL type A in manual mode. c. Power on the system. d. Go to step 8. 6. Place the first tape of the latest set of SAVSYS tapes or SAVSTG tapes, or the first Software Distribution tape in the alternate IPL tape drive. The tape drive automatically becomes ready for the IPL operation (this may take several minutes). 7. Perform an alternate IPL by doing the following: a. Power off the system. b. Select IPL type D in Manual mode. c. Power on the system. 8. The IPL may take one or more hours to complete. Chapter 2. Isolation Procedures 223 Does an unexpected reference code appear on the control panel, and is the System Attention light on? v No: Does the IPL complete successfully? Yes: Continue with the next step. No: Perform problem analysis to continue analyzing the problem. This ends the procedure.v Yes: Go to step 10. 9. Perform the following to test the tape unit: a. Enter VFYTAP (the Verify Tape command) on the command line. b. Follow the prompts on the Verify Tape displays, then return here and answer the following question.Does the VFYTAP command end successfully? No: Continue with the next step. Yes: This ends the procedure.10. Record the SRC. Is the SRC the same one that sent you to this procedure? Yes: You cannot continue to analyze the problem. Use the original SRC and exchange the FRUs. Begin with the FRU which has the highest percent of probable failure (see the failing item list for this reference code). This ends the procedure. No: A different SRC occurred. Use the new SRC to correct the problem. This ends the procedure.TUPIP04 Use this procedure to reset an IOP and its attached tape units. Read the (overview) before continuing with this procedure. If disk units are attached to an IOP, you must power off the system, then power it on to reset the IOP. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593). 2. Is the tape unit powered on? v No: Continue with the next step. v Yes: Perform the following: a. Press the Unload switch on the front of the tape unit you are working on. b. If a data cartridge or a tape reel is present, do not load it until you need it. c. Continue with the next step of this procedure. 3. Verify the following: v If the external device has a power switch, ensure that it is set to the On position. v Ensure that the power and external signal cables are connected correctly. Note: For every 8mm and 1/4 inch tape unit, the I/O bus terminating plug for the SCSI external signal cable is connected internally. These devices do not need and should not have an external terminating plug. 4. Did you press the Unload switch in step 2? v Yes: Can you enter commands on the command line? Yes: Continue with the next step. No: Go to step 11 on page 226.v No: Press the Unload switch on the front of the tape unit you are working on. If a data cartridge or a tape reel is present, do not load it until you need it. Continue with the next step of this procedure. 224 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Has the tape unit operated correctly since it was installed? If you do not know, continue with the next step of this procedure. Yes: Continue with the next step. No: Go to step 11 on page 226. 6. If a system message displayed an I/O processor name, a tape unit resource name, or a device name, record the name for use in the next step. You may continue without a name. Does the I/O processor give support to only one tape unit? If you do not know, continue with the next step of this procedure. v No: Continue with the next step. v Yes: Perform the following. You must complete all parts of this step before you press Enter. a. Enter WRKCFGSTS *DEV *TAP ASTLVL(*INTERMED) (the Work with Configuration Status command) on the command line. b. If the device is not varied off, select Vary off before continuing. c. Select Vary on for the failing tape unit. d. Enter RESET(*YES) (the Reset command) on the command line. e. Press Enter. This ends the procedure. 7. This step determines if the I/O processor for the tape unit gives support to other tape units or to a disk unit. Notes: a. If you cannot determine the tape unit you are attempting to use, go to step 11 (See 11 on page 226). b. System messages refer to other tape units that the I/O processor gives support to as associated devices.Enter WRKHDWRSC *STG (the Work with Hardware Resources command) on the command line. Did you record an I/O processor (IOP) resource name in step 6? v No: Perform the following: a. Select Work with resources for each storage resource IOP (CMB01, SIO1, and SIO2 are examples of storage resource IOPs). b. Find the Configuration Description name of the tape unit you are attempting to use, and then record the Configuration Description names of all tape units that the I/O processor gives support to. c. Record whether the I/O processor for the tape unit also gives support to any disk unit resources. d. Continue with the next step.v Yes: Perform the following: a. Select Work with resources for that resource. b. Record the Configuration description name of all tape units for which the I/O processor provides support. c. Record whether the I/O processor for the tape unit also gives support to any disk unit resources. d. Continue with the next step. 8. Does the I/O processor give support to any disk unit resources? No: Continue with the next step. Chapter 2. Isolation Procedures 225 Yes: The Reset option is not available. Go to step 11. 9. Does the I/O processor give support to only one tape unit? v No: Continue with the next step. v Yes: Perform the following: a. Select Work with configuration description and press Enter. b. Select Work with status and press Enter. Note: You must complete the remaining parts of this step before you press Enter again. c. If the device is not varied off, select Vary off before continuing. d. Select Vary on for the failing tape unit. e. Enter RESET(*YES) (the Reset command) on the command line. f. Press Enter. This ends the procedure.10. Perform the following: a. Enter WRKCFGSTS *DEV *TAP ASTLVL(*INTERMED) (the Work with Configuration Status command) on the command line. b. Select Vary off for the failing tape unit and associated devices (the devices you identified in step 7 on page 225), and then press Enter. Note: You must complete the remaining parts of this step before you press Enter again. c. Select Vary on for the failing tape unit. d. Enter RESET(*YES) (the Reset command) on the command line. e. Press Enter. f. Select Vary on for the associated devices (tape units) you identified in step 7 on page 225. It is not necessary to use the Reset option again.Does a system message indicate that the vary on operation failed? Yes: Continue with the next step. No: This ends the procedure.11. The Reset is not available, or you were not able to find the Configuration Description name when using WRKHDWRSC *STG (the Work with Hardware Resources command). You can perform an I/O processor (IOP) reset by performing an IPL of the I/O processor. All devices that are attached to the IOP will reset. The following steps describe how to load an IOP, how to configure a tape drive, how to vary on tape devices, and how to make tape devices available. 12. Is the tape device you are working on an 8mm tape drive? v No: Continue with the next step. v Yes: Verify the following on the 8mm tape drive: – The power and external signal cables are connected correctly. – The Power switch is set to the On position (pushed in). 226 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Note: The SCSI I/O bus terminating plug for the system-external signal cables is connected internally in the 8mm tape drive. The 8mm tape drive does not need, and must not have an external terminating plug. Is the 8mm tape drive Power light on, and is the Disturbance light off? Yes: Continue with the next step. No: Go to “TUPIP02” on page 221 to correct the problem. This ends the procedure.13. Is a data cartridge or a tape reel installed in the tape device? No: Continue with the next step. Yes: Remove the data cartridge or tape reel. Continue with the next step.14. Can you enter commands on the command line? v Yes: Continue with the next step. v No: Perform the following: a. Power off the system (see Powering on and powering off). b. Power on the system. The system performs an IPL and resets all devices. If the tape device responds to SCSI address 7, the system configures the tape device. This ends the procedure.15. Verify that automatic configuration is on by entering DSPSYSVAL QAUTOCFG (the Display System Value command) on the command line. Is the Autoconfigure device option set to 1? v Yes: Continue with the next step. v No: Perform the following: a. Press Enter to return to the command line. b. Set automatic configuration to On by entering CHGSYSVAL QAUTOCFG ’1’ (the Change System Value command) on the command line.Note: QAUTOCFG resets to its initial value in step 21 on page 228. c. Continue with the next step.16. Perform the following: a. Enter STRSST (the Start SST command) on the command line. b. On the Start Service Tools Sign On display, type in a User ID with QSRV authority and Password. c. Select Start a Service Tool ? Hardware Service Manager ? Logical Hardware Resources ? System Bus Resources. The Logical Hardware Resources on System Bus display shows all of the IOPs. d. Find the IOP you want to reset. You must ensure that no one is using any of the tape units, communication channels, or display stations that are attached to the IOP you want to reset.Does a ?*? indicator appear to the right of the IOP description? v No: Continue with the next step. v Yes: Disk units are attached to the IOP. Perform the following: a. Press F3 until the Exit System Service Tools display appears. b. Press Enter. Chapter 2. Isolation Procedures 227 c. Power off the system (see Powering on and powering off). d. Power on the system. The system performs an IPL and resets all devices. This ends the procedure.17. Perform the following: a. Select I/O debug ? IPL the I/O processor. b. When the IOP reset is complete, continue with the next step of this procedure.18. Perform the following: a. Press F12 to return to the Logical Hardware Resources on System Bus display. b. Select Resources associated with IOP for the IOP you reset.Did the IOP detect the tape unit? v Yes: Continue with the next step. v No: The IOP did not detect the tape unit. Consider the following: – Ensure that the tape unit is powered on and that the signal cables are connected correctly. If you find and correct a power or a signal cable problem, return to step 16 on page 227. – The tape unit may be failing. Go to the tape unit service information and perform the procedures for analyzing device problems. If you find and correct a tape unit problem, return to step 16 on page 227. – If none of the above are true, ask your next level of support for assistance. This ends the procedure.19. Press F3 until the Exit System Service Tools display appears. Then press Enter. 20. Was automatic configuration Off before you performed step 15 on page 227? Yes: Continue with the next step. No: This ends the procedure.21. Enter CHGSYSVAL QAUTOCFG ’0’ (the Change System Value command) on the command line to reset QAUTOCFG to its initial value. This ends the procedure.TUPIP06 Use this procedure to isolate a Device Not Found message during installation from an alternate device. There are several possible causes: v The alternate installation device was not correctly defined. v The alternate installation device was not made ready. v The alternate installation device does not contain installation media. v The alternate installation device is not powered on. v The alternate installation device is not connected properly. v There is a hardware error on the alternate installation device or the attached I/O processor.Read the danger notices in “Tape unit isolation procedures” on page 219 before continuing with this procedure. 1. Is the device that you are using for alternate installation defined as the alternate installation device? v Yes: Is the alternate installation device ready? Yes: Continue with the next step. No: Make the alternate installation device ready and retry the alternate installation. This ends the procedure. 228 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v No: Correct the alternate installation device information and retry the alternate installation. This ends the procedure.2. Is there installation media in the alternate installation device? v Yes: Is the alternate installation device an external device? Yes: Continue with the next step. No: Go to step 5.v No: Load the correct media and retry the alternate installation. This ends the procedure.3. Is the alternate installation device powered on? v Yes: Make sure that the alternate installation device is properly connected to the I/O processor or I/O adapter card. Is the alternate installation device properly connected? Yes: Go to step 5. No: Correct the problem and retry the alternate installation. This ends the procedure.v No: Continue with the next step.4. Ensure that the power cable is connected tightly to the power cable connector at the back of the alternate device. Ensure that the power cable is connected to a power outlet that has the correct voltage. Set the alternate device Power switch to the Power On position. The Power light should go on and remain on. If a power problem is present, one of the following power failure conditions may occur: v The Power light flashes, then remains off. v The Power light does not go on. v Another indication of a power problem occurs. Does one of the above power failure conditions occur? v No: The alternate device is powered on and runs its power-on self-test. Wait for the power-on self-test to complete. Does the power-on self-test complete successfully? No: Go to the service information for the specific alternate installation device to correct the problem. Then retry the alternate installation. This ends the procedure. Yes: Retry the alternate installation. This ends the procedure.v Yes: Perform the following: a. Go to the service information for the specific alternate device to correct the power problem. b. When you have corrected the power problem, retry the alternate installation. This ends the procedure.5. Was a device error recorded in the Product Activity Log? No: Contact your next level of support. This ends the procedure. Yes: See Reference codes to correct the problem. This ends the procedure.Tape unit self-test procedure This procedure is designed to allow you to quickly perform a complete set of diagnostic tests on a 6384 or 6387 tape unit. The following procedure is designed to allow you to quickly perform a complete set of diagnostic tests on a 6384 or 6387 tape unit, without impacting your system operation. This test can also be used to verify good performance of individual tape cartridges. Enter diagnostic mode: 1. Verify that a cartridge is not loaded in the tape unit. To unload a cartridge, press the eject button on the front of the tape unit. If the cartridge does not eject, refer to (Tape unit - manual removal). Chapter 2. Isolation Procedures 229 2. Press and hold the eject button for about 6 seconds until the amber LED starts flashing slowly, then release the button. The amber (left) LED will flash, indicating that the tape unit is waiting for a cartridge to be inserted.Running the self-test 1. Self-testing begins when a scratch data cartridge is inserted into the tape unit. The Ready (left) LED will flash, indicating that self-testing is in progress.Note: A cartridge must be loaded within 15 seconds, otherwise, the tape unit will automatically revert back to normal operation. If necessary, return to step 1 on page 229 to reenter diagnostic mode. 2. For fastest results, we recommend using an SLR100 Test Tape (P/N 35L0967) which was originally provided with your iSeries™ server. Attention: Use a blank cartridge that does not contain customer data. During this self-test, the cartridge will be rewritten with a test pattern and any customer data will be destroyed.Note: Use a cartridge that is not write-protected. If a write-protected cartridge is inserted while the tape unit is in diagnostic mode, the cartridge will be ejected, see Incorrect cartridge below. Self-testing will only be performed using a write-compatible cartridge type, and with a cartridge that is not damaged, see Incorrect cartridge below. If a cleaning cartridge is inserted while the tape unit is in diagnostic mode, drive cleaning will occur and the tape unit will then return to normal operating mode. Return to step 1 on page 229 to reenter diagnostic mode. 3. At any time, self-testing can be stopped by pressing the eject button. After the current operation is completed, the cartridge will be ejected and tape unit will return to normal operating mode. 4. The Ready (left) LED will continue to flash during the following: v The cartridge load sequence has a approximate duration of 30 seconds. The center LED indicates tape movement. v The hardware test has an approximate duration of 2 and 1/2 minutes. During that time, a static test is performed on tape unit electrical components. No tape motion occurs during this step. v The cartridge load/unload test has an approximate duration of 1 and 1/2 minutes. During that time, the Ready LED will continue to flash while a dynamic test is performed on tape unit mechanical components. Two cartridge load cycles are included. v Duration of the write/read test will vary, depending on what type of cartridge is loaded into the tape unit. When an SLR100 Test Tape is used, typical duration will be 5 minutes. Use of other cartridge types can increase the write/read test duration to 30-40 minutes. During this test, the Ready LED will continue to flash. The center LED indicates tape movement.Interpreting the results Test Passed: When self-testing has completed successfully, and no problems are detected, the cartridge is unloaded from the tape unit and all LEDs are off. Proper function of both the tape unit and tape cartridge have now been verified. Note: A solid amber light indicates that self-testing has completed successfully, but the tape unit requires cleaning. Clean the tape unit by inserting an Dry Process Cleaning Cartridge (P/N 35L0844). Test Failed: The cartridge will remain loaded inside the tape unit, and the amber LED will flash when a problem is detected with either the tape unit or cartridge. Note: To isolate failure to either tape unit or cartridge, return to step 1 on page 229 and repeat this self-test using a different scratch cartridge. 230 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Incorrect cartridge: When the center (green) and right (amber) LEDs flash and a cartridge is unloaded, the tape unit has determined that an incorrect tape cartridge has been inserted, and self-testing cannot be performed. Verify that your tape cartridge is not one of the following: v Write-protected v Damaged v Unsupported media type v Media which is not write-compatible with tape unit.Press the eject button, to end self-test and return the tape unit to normal operating mode. Then return to step 1 on page 229 and run the self-test using another cartridge, or one which is not write-protected. This ends the procedure. Tape device ready conditions All the conditions that are listed for the device, must be correct for the device to be ready. If the device is not ready, use the Action column or other instructions, and go to the service information for the specific tape device. If the system has logical partitions, perform this procedure from the logical partition that reported the problem (see “Determining if the system has logical partitions” on page 593). Table 41. Tape device ready conditions Storage device Ready description Action 3480 or 3490 v Power switch is set to the On position. v Power light is on. v DC Power light is on. v Control unit On-line switch is set to the On-line position. v Control unit Normal/Test switch is set to the Normal position. v Control unit channel Enable/Disable switch is set to the Enable position. v Tape unit On-line/Off-line switch is set to the On-line position. v Tape is loaded. v Tape unit displays Ready U or Ready F. See the 3480 Magnetic Tape Subsystem Operator’s Guide, SA32-0066, or 3490 Magnetic Tape Subsystem Operator’s Guide, SA32-0124, for instructions on making the tape unit ready. Chapter 2. Isolation Procedures 231 Table 41. Tape device ready conditions (continued) Storage device Ready description Action 7208 v Power switch is on (pressed). v Power light is on. v Data cartridge is inserted. v Ready light is on. v System external signal cable is connected to the type 2621 I/O processor and to the 7208 tape drive.Note: The SCSI I/O bus terminating plug for the system external signal cable is connected internally in the 7208 tape drive. The 7208 tape drive does not need, and must not have, an external terminating plug. See the 7208 8 mm Tape Drive Operator’s Manual for instructions on making the tape drive ready. If you cannot make the 7208 Model 012 tape drive ready, go to “TUPIP02” on page 221. 9348 v Power switch is set to the On position. v Power light is on. v Tape is loaded. v Status display shows 00 A002. v On-line light is on. See the 9348 Customer Information manual, SA21-9567, for instructions on making the tape unit ready. If you cannot make the tape unit ready, go to the ?Analyzing Problems? section of 9348 Tape Unit Service Information, SY31-0697. 232 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Twinaxial workstation I/O processor isolation procedure Use the procedure below to isolate a failure which has been detected by the twinaxial workstation I/O processor. If you are using a personal computer, an emulation program must be installed and working. Please read and observe all safety procedures before servicing the system and while performing the procedure below. Attention: Unless instructed otherwise, always power off the system or expansion tower where the FRU is located (see Powering on and powering off) before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: When instructed, remove and connect cables carefully. You may damage the connectors if you use too much force. “TWSIP01” on page 234The workstation IOP detected an error. Chapter 2. Isolation Procedures 233 TWSIP01 The workstation IOP detected an error. Please read the danger notices in “Twinaxial workstation I/O processor isolation procedure” on page 233 before performing this procedure. One of the following occurred: v All of the workstations on one port are not working. v All of the workstations on the system are not working. v One of the workstations on the system is not working. v The reference code table instructed you to perform this procedure. v The Remote Operations Console is not working. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593 before continuing with this procedure. 2. Are you using a workstation adapter console? Note: A personal computer (used as a console) that is attached to the system by using a console cable feature is known as a workstation adapter console. The cable (part number 46G0450, 46G0479, or 44H7504) connects the system port on the personal computer to a communications I/O adapter on the system. No: Continue with the next step. Yes: Go to “WSAIP01” on page 241. This ends the procedure. 3. Is the device you are attempting to repair a personal computer (PC)? No: Continue with the next step. Yes: PC emulation programs operate and report system-to-PC communications problems differently. See the PC emulation information for details on error identification. Then, continue with the next step. 4. Perform the following steps: a. Verify that all the devices you are attempting to repair, the primary console, and any alternative consoles are powered on. b. Verify that the all the devices you are attempting to repair, the primary console, and any alternative consoles have an available status. For more information on displaying the device status, see “Hardware Service Manager” on page 644. c. Verify that the workstation addresses of all workstations on the failing port are correct. Each workstation on the port must have a separate address, from 0 through 6. See the workstation service information for details on how to check addresses. d. Verify that the last workstation on the failing port is terminated. All other workstations on that port must not be terminated. e. Ensure that the cables that are attached to the device or devices are tight and are not visibly damaged. f. If there were any cable changes, check them carefully. g. If all of the workstations on the system are not working, disconnect them by terminating at the console. h. Verify the device operation (see the device information for instructions). i. The cursor position can assist in problem analysis. v If the cursor is in the upper right corner, it indicates a communication problem between the workstation IOP and the device. Continue with the next step. 234 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v If the cursor is in the upper left corner, it indicates a communication problem between the workstation IOP and the operating system. Perform the following steps: 1) Verify that all current PTFs are loaded. 2) Ask your next level of support for assistance. This ends the procedure. 5. Is the system powered off? Yes: Continue with the next step. No: Go to step 8. 6. Perform the following: a. Power on the system in Manual mode. See “IPL type, mode, and speed options” on page 642 for details. b. Wait for a display to appear on the console or a reference code to appear on the control panel.Does a display appear on the console? v No: Continue with the next step. v Yes: If you disconnected any devices after the console in step 4 on page 234, perform the following: a. Power off the system. b. Reconnect one device. Note: Ensure that you terminate the device you just reconnected and remove the termination from the device previously terminated. c. Power on the system. d. If a reference code appears on the control panel, go to step 9. e. If no reference code appears, repeat steps a through d of this step until you have checked all devices disconnected previously. f. Continue to perform the initial program load (IPL). This ends the procedure. 7. Does the same reference code that sent you to this procedure appear on the control panel? Yes: Continue with the next step. No: Perform problem analysis for this new problem. This ends the procedure. 8. Perform the following to make DST available: a. Ensure that Manual mode on the control panel is selected. b. Select function 21 Make DST Available. c. Check the console and any alternative consoles for a display.Does a display appear on any of the console displays? v No: Continue with the next step. v Yes: If you disconnected any devices after the console in step 4 on page 234, perform the following: a. Power off the system. b. Reconnect one device. Note: Ensure that you terminate the device you just reconnected and remove the termination from the previously terminated device. c. Power on the system. d. If a reference code appears on the control panel or on the HMC, go to step 9. e. If no reference code, repeat steps a through d of this step until you have checked all devices disconnected previously. f. Continue to perform the initial program load (IPL). This ends the procedure. 9. Ensure that the following conditions are met: Chapter 2. Isolation Procedures 235 v The workstation addresses of all workstations on the failing port must be correct. Each workstation on the port must have a separate address, from 0 through 6. See the workstation service information if you need help with checking addresses. Did you find a problem with any of the above conditions? Yes: Continue with the next step. No: Go to step 11.10. Perform the following: a. Correct the problem. b. Select function 21 Make DST Available. c. Check the console and any alternative consoles for a display.Does a display appear on any of the consoles? v Yes: Continue to perform the IPL. This ends the procedure. v No: Does the same reference code appear on the control panel? Yes: Continue with the next step. No: Perform problem analysis for this new problem. This ends the procedure.11. Is the reference code one of the following: 0001, 0003, 0004, 0005, 0006, 0101, 0103, 0104, 0105, 0106, 5004, 5082, B000, D010, or D023? No: Continue with the next step. Yes: Go to step 15 on page 237.12. Does the system have an alternative console on a second workstation IOP? Yes: Continue with the next step. No: Go to step 14.13. There is either a Licensed Internal Code problem, or there are two device failures on the workstation IOPs, consoles, or cables. The console and any alternative consoles are the most probable causes for this failure. v See the service information for the failing display to attempt to correct the problem. If a display is connected to the system by a link protocol converter, use the link protocol converter information to attempt to correct the problem. The link protocol converter may be the failing item. v If you have another working display, you can exchange the console and alternative consoles and perform an IPL to attempt to correct the problem. v Exchange the following parts one at a time until you determine the failing item: a. Console b. Alternative console c. Cables d. Workstation IOA for the console e. The multi-adapter bridge. This ends the procedure.14. The console, cables, or the workstation IOP card is the most probable causes for this failure. If the console is connected to the system by a link protocol converter, the link protocol converter is possibly the failing item. Use one or more of the following options to correct the problem: a. See the service information for the failing displays for more information. If a display is connected to the system by a link protocol converter, see the link protocol converter information to attempt to correct the problem. b. If you have another working display, you can exchange the console and perform an IPL to attempt to correct the problem. c. Exchange the following parts one at a time until you determine the failing item: 1) Console 2) Workstation IOA 236 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3) The multi-adapter bridge. 4) Twinaxial attachment (cable) This ends the procedure.15. To continue problem analysis, use a port tester, part 93X2040 or 59X4262, which you may have with your tools or the customer may have one. The port tester has either two or three lights. Is a port tester available? v Yes: Continue with the next step. v No: Check or exchange the cables from the system to the failing display. Did this correct the problem? Yes: You corrected the problem. This ends the procedure. No: Go to step 12 on page 236.16. To use the port tester to isolate the problem, perform the following: v Verify that the port tester is operating correctly by doing a self-test. A self-test can be made at any time, even when the port tester is attached to a port or cable. Perform the following steps to do a self-test: a. Move the selector switch to the center (0) position. b. Push and hold the test button until all lights go on. The yellow lights should go on immediately, and the green light should go on approximately 5 seconds later. The port tester is ready for use if all lights go on.v Leave the system power on.17. Find the input cable to the failing console or port. Is the failing console or the failing port attached to a protocol converter? v No: Perform the following: a. Disconnect the input cable from the failing console. b. Connect the port tester to the input cable. c. Continue with the next step.v Yes: Perform the following: a. Disconnect the cable that comes from the system at the protocol converter. b. Connect the port tester to the cable. c. Continue with the next step.18. Perform the following: a. Set the selector switch on the port tester to the left (1) position for a twinaxial connection. Set the switch to the right (2) position for a twisted pair connection. b. Press and hold the test switch on the port tester for 15 seconds and observe the lights. c. Choose from the following options: v If the port tester has three lights, do the following: – If only the top (green) light is on, go to step 27 on page 239. – If both the top (green) and center (yellow) lights are on, go to step 20 on page 238. Note: The center (yellow) light is always on for twisted pair cable and may be on for fiber optical cable. – If only the bottom (yellow) light is on, go to step 21 on page 238. – If all lights are off, go to step 22 on page 238. – If all lights are on, go to step 19 on page 238.v If the port tester has two lights, do the following: – If only the top (green) light is on, go to step 27 on page 239. – If only the bottom (yellow) light is on, go to step 21 on page 238. – If both lights are off, go to step 22 on page 238. Chapter 2. Isolation Procedures 237 – If both lights are on, continue with the next step.19. The tester is in the self-test mode. Check the position of the selector switch. v If the selector switch is not in the correct position, go to step 18 on page 237. v If the selector switch is already in the correct position, the port tester is not working correctly. Exchange the port tester, and go to step 16 on page 237.20. The cable you are testing has an open shield. Note: The open shield can be checked only on the cable from the twinaxial workstation attachment to the device or from device to device. Only one section of cable can be checked at a time. See the SA41-3136, Port Tester Use information.This ends the procedure. 21. The cable network is bad. The wires in the cable between the console and the twinaxial workstation attachment are reversed. Go to step 26. 22. Perform the following: a. Find the twinaxial workstation attachment to which the failing console is attached. b. Disconnect the cable from port 0 on that twinaxial workstation attachment. c. Connect the port tester to port 0 on the attachment. d. Set the selector switch on the port tester to the left (1) position.23. Perform the following: a. Press and hold the test switch on the port tester for 15 seconds and observe the lights. b. If the port tester has three lights, do the following: v If both the top (green) and center (yellow) lights are on, continue with step 24. Note: The center (yellow) light is always on for twisted pair cable and may be on for fiber optical cable. v If only the bottom (yellow) light is on, continue with step 24. v If all lights are off, continue with step 24. v If only the top (green) light is on, go to step 26. v If all lights are on, go to step 25.c. If the port tester has two lights, do the following:v If only the top (green) light is on, go to step 26. v If only the bottom (yellow) light is on, continue with step 24. v If both lights are off, continue with step 24. v If both lights are on, go to step 25.24. The test indicated that there was no signal from the system. Reconnect the cable you disconnected and perform the following: a. Exchange the following parts: 1) Twinaxial workstation IOA card 2) The multi-adapter bridge.b. Power on the system to perform an IPL. This ends the procedure.25. The tester is in the self-test mode. Check the position of the selector switch: v If the selector switch is not in the left (1) position, set the switch to the left (1) position. Then go to step 23. v If the selector switch is already in the left (1) position, the port tester is not working correctly. Exchange the port tester and go to step 22.26. The cable to the workstation is the failing item. Cable maintenance is a customer responsibility. The cable must be repaired or exchanged. Then, power on the system to perform an IPL. This ends the procedure. 238 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 27. The port tester detects most problems, but it does not always detect an intermittent problem or some cable impedance problems. The tester may indicate a good condition, although there is a problem with the workstation IOP card or cables. a. If the failing display is connected to a link protocol converter, the link protocol converter is the failing item. See the link protocol converter service information to correct the problem. b. Exchange the following parts: 1) Console 2) Twinaxial workstation IOA 3) The multi-adapter bridge. 4) Cablesc. If you have another working display, you can exchange the console and perform an IPL to attempt to correct the problem. See the service information for the failing display for more information. d. If exchanging the failing items did not correct the problem and the reference code was a 5002, 5082, or 50FF, there may be a Licensed Internal Code problem. Go to “LICIP03” on page 92. e. The problem may be caused by devices that are attached after the console on port 0. This ends the procedure. Chapter 2. Isolation Procedures 239 Workstation adapter isolation procedure Contains the procedure necessary to isolate a failure that is detected by the workstation adapter, and is used when no display is available with which to perform on-line problem analysis. The workstation adapter detected a problem while communicating with the workstation that is used as the primary console. Note: If you are using a PC, you must install an emulation program. Please read and observe all safety procedures before servicing the system and while performing the procedure below. Attention: Unless instructed otherwise, always power off the system or expansion tower where the FRU is located (see Powering on and powering off) before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “WSAIP01” on page 241Use this procedure if your console has a keyboard error with a ?K? on the display. 240 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) WSAIP01 Use this procedure if your console has a keyboard error with a ?K? on the display. Note: If the console has a keyboard error, there may be a ?K? on the display. See the workstation service information for more information. Perform the following procedure from the logical partition that reported the problem: 1. Select the icon on the workstation to make it the console (you may have already done this). You must save the console selection. 2. Access Dedicated Service Tools (DST) by performing the following: a. Select Manual mode on the control panel. b. Use the selection switch on the control panel to display function 21, Make DST Available, and press Enter on the control panel. c. Wait for a display to appear on the console or for a reference code to appear on the control panel.Does a display appear on the console? No: Continue with the next step. Yes: The problem is corrected. This ends the procedure.3. Isolate the problem to one server and one workstation (console) by performing the following: a. Disconnect the power cable from the workstation. b. Eliminate all workstations, cables, and connector boxes from the network except for one server, one console, two connector boxes, and one cable. c. Ensure that the cables that are connected to the console, the keyboard, and the server are connected correctly and are not damaged.4. Perform the following: a. Ensure that the server console is terminated correctly. b. Set the Power switch on the console to the On position. c. Select the SNA*PS icon on the console. d. See the workstation information for more information.5. Access DST by performing the following: a. Select Manual mode on the control panel. b. Use the selection switch on the control panel to display function 21, Make DST Available, and press Enter on the control panel. c. Wait for a display to appear on the console or for a reference code to appear on the control panel.Does a display appear on the console? No: Continue with the next step. Yes: The problem is in a cable, connector box, or device you disconnected in step 3. This ends the procedure.6. Does the reference code A600 5005 appear on the control panel? Yes: Continue with the next step. No: Perform problem analysis using this reference code. This ends the procedure.7. Do you have another workstation, cable, and two connector boxes you can exchange with the workstation connected to the server? v Yes: Continue with the next step. v No: One of the following is causing the problem: Note: The items at the top of the list have a higher probability of fixing the problem than the items at the bottom of the list. – Workstation adapter Licensed Internal Code Chapter 2. Isolation Procedures 241 – Workstation adapter configuration – Workstation – Cable – Connector box – Workstation IOA – Workstation IOP If you still have not corrected the problem, ask your next level of support for assistance. This ends the procedure.8. Repeat steps 3 on page 241 through 7 on page 241 of this procedure, using a different workstation, cable, and connector boxes. Do you still have a problem? Yes: Continue with the next step. No: The problem is in the cable, connector boxes, or workstation you disconnected. This ends the procedure.9. One of the following is causing the problem: Note: The items at the top of the list have a higher probability of fixing the problem than the items at the bottom of the list. v Workstation adapter Licensed Internal Code v Workstation adapter configuration v Workstation IOA v Communications IOP To bring up a workstation other than the console, perform the following: a. Connect another workstation into this network. b. Select Normal mode on the control panel. c. Perform an IPL. If the sign-on display appears, the following parts are good: v Communications IOP v Workstation IOANote: If a printer connected to this assembly is not working correctly, it may look like the display is bad. Perform a self-test on the printer to ensure that it prints correctly (see the printer service information). If you still have not corrected the problem, ask your next level of support for assistance. This ends the procedure. 242 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Workstation adapter console isolation procedure Contains the procedure necessary to isolate a failure that is detected by the workstation adapter console. Use this procedure when no display is available with which to perform online problem analysis. Note: If you are using a PC, you must install an emulation program. Read all safety procedures before servicing the system. Observe all safety procedures when performing a procedure. Unless instructed otherwise, always power off the system or expansion tower where the FRU is located, see Powering on and powering off before removing, exchanging, or installing a field-replaceable unit (FRU). DANGERWhen working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005)Read and understand the following service procedures before using this section: v Powering on and powering off v “Determining a primary or alternative console” on page 591 v Chapter 3, “Locating FRUs,” on page 335 Chapter 2. Isolation Procedures 243 Note: If the console has a keyboard error, there may be a K on the display. See the workstation service information for more information. 1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to “Determining if the system has logical partitions” on page 593. 2. Ensure that your workstation meets the following conditions: v The workstation that you are using for the console is powered on. v The emulation program is installed and is working. v The input/output adapter (IOA) is installed and the workstation console cable is attached. Notes: a. Card information: Hardware that is associated with 6A59 feature is the type 2745 card. b. Cable information: The cable attaches directly to the IOA.Did you find a problem with any of the conditions listed above? No: Continue with the next step. Yes: Correct the problem. Then, perform an IPL of the system. This ends the procedure.3. Perform the following to make dedicated service tool (DST) available: a. If there is an alternative console, ensure that it is powered on. b. Ensure that Manual mode on the control panel is selected. c. Select function 21, Make DST Available on the control panel, and press Enter.Does a display appear on either the console or any alternative console? No: Continue with the next step. Yes: Complete the IPL. When the operating system display appears, use the Work with Problem command (WRKPRB) or Analyze Problem command (ANZPRB) to analyze and correct or report any console problems. This ends the procedure.4. Do you have SRC A600 5001, A600 5004, A600 5007, or B075 xxxx (where xxxx is any value)? v No: Continue with the next step. v Yes: Perform the following: a. Disconnect any cables that are attached to the IOA. b. Install the wrap plug on the IOA. The 2745 wrap plug label is QQ. c. Perform an IPL in Manual mode.5. Does SRC 6A59 5007 occur? v No: Continue with the next step. v Yes: One of the following is causing the problem: – Workstation emulation program – Workstation – Workstation console cableThis ends the procedure.6. Did SRC A600 5001, A600 5004, or 6A59 5008 occur? No: This is a new problem. Use the new reference code to perform problem analysis for the problem, or ask your next level of support for assistance. This ends the procedure. Yes: The Type 2745 workstation adapter is the failing item. This ends the procedure. 244 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Isolating problems on servers that run the AIX or Linux operating systems Use the procedures for AIX or Linux servers if there is not an HMC attached to the server. If the server is connected to an HMC, use the procedures that are available on the HMC to continue FRU isolation. “MAP 0210: General problem resolution” on page 246Use this procedure to exchange the FRUs in the order of their failure probability. “Problems with loading and starting the operating system (AIX and Linux)” on page 246If the system is running partitions from partition standby (LPAR), the following procedure addresses the problem in which one partition will not boot AIX or Linux while other partitions boot successfully and run the operating system successfully. “Storage I/O card modes and jumpers” on page 250 “SCSI service hints” on page 252Use one or more of the following procedures when servicing SCSI adapter or devices. “MAP 0020: Problem determination procedure” on page 259Use this MAP to get a service request number (SRN) if the customer or a previous MAP provided none. “MAP 0030: Additional Problem Determination” on page 266This MAP is used for problems that still occur after all FRUs indicated by the SRN or error code have been exchanged. “MAP 0040: Intermittent Problem Isolation” on page 267This MAP provides a structured way of analyzing intermittent problems. “MAP 0050: SCSI bus problems” on page 269Use this MAP to analyze problems with a SCSI bus. “MAP 0054: PCI-X SCSI Bus Problems” on page 276Use this MAP to determine which FRUs may need to be replaced in order to solve a SCSI bus-related problem on a PCI-X SCSI or PCI-X SCSI RAID adapter. “MAP 0070: 888 sequence in operator panel display” on page 278Use this MAP when you receive an 888 sequence on the operator panel display or monitor. “MAP 0220: Hot-Swap FRU problem resolution” on page 279Use this procedure to exchange hot-swappable field replaceable units (FRUs). “MAP 0230: Platform error problem resolution” on page 284Use this MAP to resolve problems reported by SRNs A00-xxx to A25-xxxx. “MAP 0260: System hangs during resource configuration” on page 288Use this MAP when the system unit hangs while configuring a resource. “MAP 0270: SCSI RAID problem resolution and verification” on page 291Use this MAP to resolve SCSI RAID adapter, cache, or drive problems. “MAP 0280: Console and Keyboard Problem Resolution” on page 295Use this MAP to resolve console and keyboard problems when the system is booting. “MAP 0285: Multipath I/O (MPIO) Problem Resolution” on page 296Use this MAP to handle SRN A23-001 and ssss-640 (where ssss is the 3 or 4 digit Failing Function Code (FFC) of an SCSD drive) to check the path from adapter to device. “MAP 0291: Missing Device or Bus Problem Resolution” on page 299Use this MAP when a bus or device (such as a disk drive) is reported as a missing resource by the diagnostics. “PFW1540: Problem isolation procedures” on page 301The PFW1540 procedures are used to locate problems in the processor subsystem or I/O subsystem(s). “PFW1542: I/O problem isolation procedure” on page 302This I/O problem-determination procedure isolates I/O card and I/O subsystem failures. When I/O problem isolation is complete, all cables and cards that are failing will have been replaced or reseated. Chapter 2. Isolation Procedures 245 “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316Use this problem isolation procedure to aid in solving memory and processor problems that are not found by normal diagnostics.MAP 0210: General problem resolution Use this procedure to exchange the FRUs in the order of their failure probability. v Step 0210-1 Did the problem report provide a part number for the FRU you are about to replace? NO Go to Step 0210-2. YES Go to Step 0210-3.v Step 0210-2 Find the failing function codes in the Chapter 8, “Part Information,” on page 691, and record the FRU part number and description of each FRU. v Step 0210-3 Obtain the replacement part. v Step 0210-4 Go to Chapter 3, “Locating FRUs,” on page 335, and follow the procedures for the FRU you are servicing.Problems with loading and starting the operating system (AIX and Linux) If the system is running partitions from partition standby (LPAR), the following procedure addresses the problem in which one partition will not boot AIX or Linux while other partitions boot successfully and run the operating system successfully. It is the customer’s responsibility to move devices between partitions. If a device must be moved to another partition to run standalone diagnostics, contact the customer or system administrator. If the optical drive must be moved to another partition, all SCSI devices connected to that SCSI adapter must be moved because moves are done at the slot level, not at the device level. Depending on the boot device, a checkpoint may be displayed on the operator panel for an extended period of time while the boot image is retrieved from the device. This is particularly true for tape and network boot attempts. If booting from an optical drive or tape drive, watch for activity on the drive’s LED indicator. A blinking LED indicates that the loading of either the boot image or additional information required by the operating system being booted is still in progress. If the checkpoint is displayed for an extended period of time and the drive LED is not indicating any activity, there might be a problem loading the boot image from the device. Notes: 1. For network boot attempts, if the system is not connected to an active network or if the target server is inaccessible (which can also result from incorrect IP parameters being supplied), the system will still attempt to boot. Because time-out durations are necessarily long to accommodate retries, the system may appear to be hung. Refer to checkpoint CA00 E174. 2. If the partition hangs with a 4-character checkpoint in the display, the partition must be deactivated, then reactivated before attempting to reboot. 3. If a BA06 000x error code is reported, the partition is already deactivated and in the error state. Reboot by activating the partition. If the reboot is still not successful, go to step 3 on page 247. 246 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) This procedure assumes that a diagnostic CD-ROM and an optical drive from which it can be booted are available, or that diagnostics can be run from a NIM (Network Installation Management) server. Booting the diagnostic image from an optical drive or a NIM server is referred to as running standalone diagnostics. 1. Is a Hardware Management Console attached to the managed system? Yes: Continue with the next step. No: Go to step 3. 2. Look at the service action event error log in Manage Serviceable Events on the HMC. Perform the actions necessary to resolve any open entries that affect devices in the boot path of the partition or that indicate problems with I/O cabling. Then try to reboot the partition. Does the partition reboot successfully? Yes: This ends the procedure. No: Continue with the next step. 3. Boot to the SMS main menu: v If you are rebooting a partition from partition standby (LPAR), go to the properties of the partition and select Boot to SMS, then activate the partition. v If you are rebooting from platform standby, access the ASMI (see “Accessing the ASMI using a Web browser” on page 646. Select Power/Restart Control, then Power On/Off System. In the AIX/Linux partition mode boot box, select Boot to SMS menu ? Save Settings and Power On. At the SMS main menu, select Select Boot Options and verify whether the intended boot device is correctly specified in the boot list. Is the intended load device correctly specified in the boot list? v Yes: Perform the following: a. Remove all removable media from devices in the boot list from which you do not want to load the operating system. b. If you are attempting to load the operating system from a network, go to step 4. c. If you are attempting to load the operating system from a disk drive or an optical drive, go to step 7 on page 248. d. No: Go to step 5. 4. If you are attempting to load the operating system from the network, perform the following: v Verify that the IP parameters are correct. v Use the SMS ping utility to attempt to ping the target server. If the ping is not successful, have the network administrator verify the server configuration for this client. v Check with the network administrator to ensure that the network is up. Also ask the network administrator to verify the settings on the server from which you are trying to load the operating system. v Check the network cabling to the adapter. Restart the partition and try loading the operating system. Does the operating system load successfully? Yes: This ends the procedure. No: Go to step 7 on page 248. 5. Use the SMS menus to add the intended boot device to the boot sequence. Can you add the device to the boot sequence? Yes: Restart the partition. This ends the procedure. No: Continue with the next step. 6. Ask the customer or system administrator to verify that the device you are trying to load from is assigned to the correct partition. Then select List All Devices and record the list of bootable devices that displays. Is the device from which you want to load the operating system in the list? Yes: Go to step 7 on page 248. No: Go to step 10 on page 248. Chapter 2. Isolation Procedures 247 7. Try to load and run standalone diagnostics against the devices in the partition, particularly against the boot device from which you want to load the operating system. You can run standalone diagnostics from an optical drive or a NIM server. To boot standalone diagnostics, follow the detailed procedures in “Running the online and eServer stand-alone diagnostics” on page 672. Note: When attempting to load diagnostics on a partition from partition standby, the device from which you are loading standalone diagnostics must be made available to the partition that is not able to load the operating system, if it is not already in that partition. Contact the customer or system administrator if a device must be moved between partitions in order to load standalone diagnostics. Did standalone diagnostics load and start successfully? Yes: Go to step 8. No: Go to step 14 on page 249. 8. Was the intended boot device present in the output of the option Display Configuration and Resource List, which is run from the Task Selection menu? v Yes: Continue with the next step. v No: Go to step 10. 9. Did running diagnostics against the intended boot device result in a No Trouble Found message? Yes: Go to step 12 on page 249. No: Go to the list of service request numbers and perform the repair actions for the SRN reported by the diagnostics. When you have completed the repair actions, go to step 13 on page 249.10. Perform the following actions: a. Perform the first item in the action list below. In the list of actions below, choose SCSI or IDE based on the type of device from which you are trying to boot the operating system. b. Restart the system or partition. c. Stop at the SMS menus and select Select Boot Options. d. Is the device that was not appearing previously in the boot list now present? Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. No: Perform the next item in the action list and then return to step 10b. If there are no more items in the action list, go to step 11 on page 249.Action list: Note: See Chapter 3, “Locating FRUs,” on page 335 for part numbers and links to exchange procedures. a. Verify that the SCSI or IDE cables are properly connected. Also verify that the device configuration and address jumpers are set correctly. b. Do one of the following: v SCSI boot device: If you are attempting to boot from a SCSI device, remove all hot-swap disk drives (except the intended boot device, if the boot device is a hot-swap drive).If the boot device is present in the boot list after you boot the system to the SMS menus, add the hot-swap disk drives back in one at a time, until you isolate the failing device. v IDE boot device: If you are attempting to boot from an IDE device, disconnect all other internal SCSI or IDE devices. If the boot device is present in the boot list after you boot the system to the SMS menus, reconnect the internal SCSI or IDE devices one at a time, until you isolate the failing device or cable.c. Replace the SCSI or IDE cables. d. Replace the SCSI backplane (or IDE backplane, if present) to which the boot device is connected. e. Replace the intended boot device. f. Replace the system backplane. 248 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 11. Choose from the following: v If the intended boot device is not listed, go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. This ends the procedure. v If an SRN is reported by the diagnostics, go to the list of service request numbers and follow the action listed. This ends the procedure.12. Have you disconnected any other devices? Yes: Reinstall each device that you disconnected, one at a time. After you reinstall each device, reboot the system. Continue this procedure until you isolate the failing device. Replace the failing device, then go to step 13. No: Perform an operating system-specific recovery process or reinstall the operating system. This ends the procedure.13. Is the problem corrected? Yes: Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure. No: If replacing the indicated FRUs did not correct the problem, or if the previous steps did not address your situation, go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. This ends the procedure.14. Is a SCSI boot failure (where you cannot boot from a SCSI-attached device) also occurring? v Yes: Go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. This ends the procedure. v No: Continue to the next step.15. Perform the following actions to determine if another adapter is causing the problem: a. Remove all adapters except the one to which the optical drive is attached and the one used for the console. b. Reload the standalone diagnostics. Can you successfully reload the standalone diagnostics? v Yes: Perform the following: 1) Reinstall the adapters that you removed (and attach devices as applicable) one at a time. After you reinstall each adapter, retry the boot operation until the problem reoccurs. 2) Replace the adapter or device that caused the problem. 3) Go to Chapter 5, “Verifying a repair,” on page 557. This ends the procedure.v No: Continue with the next step.16. The graphics adapter (if installed), optical drive, IDE or SCSI cable, or system board is most likely defective. Does your system have a PCI graphics adapter installed? Yes: Continue with the next step. No: Go to step 1817. Perform the following to determine if the graphics adapter is causing the problem: a. Remove the graphics adapter. b. Attach a TTY terminal to the system port. c. Try to reload standalone diagnostics. Do the standalone diagnostics load successfully? Yes: Replace the graphics adapter. This ends the procedure. No: Continue with the next step.18. Replace the following (if not already replaced), one at a time, until the problem is resolved: a. Optical drive b. IDE or SCSI cable that goes to the optical drive c. System board that contains the integrated SCSI or IDE adapters.If this resolves the problem, go to Chapter 5, “Verifying a repair,” on page 557. If the problem still persists or if the previous descriptions did not address your particular situation, go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. Chapter 2. Isolation Procedures 249 This ends the procedure.Storage I/O card modes and jumpers The 2748 and 2778 I/O cards can run in either compression mode or enhanced mode. v Compression Mode: Compression mode should only be used when you want to run Integrated Hardware Disk Compression. In compression mode, the write cache size is 4 MB. v Enhanced Mode: Enhanced mode has an enhanced write cache size. In enhanced mode, the write cache size for the 2748 I/O card is 26MB, and the maximum compressed write cache size for the 2778 I/O card is 104MB. Enhanced mode also provides support for Extended Adaptive Cache (by attachment of a read cache device) and Extended Adaptive Cache Simulator.The following subtopics are included here: v “Setting or changing the mode of an I/O card.” v When an error regarding the mode of the storage I/O card appears, the error message indicates whether the jumper is missing or overridden. – If the error message is CPPEA20, the I/O card has overridden the mode set by the jumper. See “Mode jumper has been overridden” on page 251. – If the error message is CPPEA21, the I/O card has detected that the jumper is missing. See “Mode jumper is missing” on page 252. “Setting or changing the mode of an I/O card”Provides detailed steps for setting or changing the mode of an I/O card. “Mode jumper has been overridden” on page 251Covers how to correct the error message that appears when the mode jumper has been overridden. “Mode jumper is missing” on page 252The error message CPPEA21 appears when the mode jumper is missing from the storage I/O card.Setting or changing the mode of an I/O card Provides detailed steps for setting or changing the mode of an I/O card. How to set or change the mode of an I/O card: The 2748 and 2778 I/O cards support two modes: compression and enhanced. The rear of the I/O card is marked with a C, indicating compression mode, and an E, indicating enhanced mode. Three pins on the rear of the I/O card determine the mode of the card, depending on which of the pins have a jumper placed on them. The jumper may be moved at any time, but changes only take effect during a system IPL. Refer to the illustration below while changing the mode jumper. 250 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 1. Power off the system, ensuring that no jobs are currently processing (see Powering on and powering off). 2. Choose from the following options: v To set the mode of the card to compressed (C), place the jumper on the top and middle pins. v To set the mode of the card to enhanced (E), place the jumper on the middle and bottom pins.3. Select the IPL type and mode. 4. Power on the system.Mode jumper has been overridden Covers how to correct the error message that appears when the mode jumper has been overridden. The error message CPPEA20 appears when the mode jumper has been overridden. This is due to cache data that is in a conflicting mode. 1. To begin correcting this error, examine the mode jumpers in the rear section of the storage I/O card. Note: The mode jumper is accessible without requiring the removal of the I/O card. 2. Choose from the following options: v If the mode jumper is not in the desired position, follow the steps in “Setting or changing the mode of an I/O card” on page 250. v If the mode jumper is in the desired position and any other I/O card or device reference codes occurred at about the same time as this error, contact your service representative. v If the mode jumper is in the desired position and no other I/O card or device reference codes occurred at about the same time as this error, then power the system down normally and IPL. See Powering on and powering off. This ends the procedure. Figure 3. Changing the mode jumper Chapter 2. Isolation Procedures 251 Mode jumper is missing The error message CPPEA21 appears when the mode jumper is missing from the storage I/O card. 1. Locate the jumper. 2. Power off the system. See Powering on and powering off. 3. Install the jumper in the correct position. See “Setting or changing the mode of an I/O card” on page 250. 4. Power on the system. This ends the procedure.SCSI service hints Use one or more of the following procedures when servicing SCSI adapter or devices. General SCSI Configuration Checks With any type of SCSI problem, begin with the following steps: 1. Verify that all SCSI devices on the SCSI bus have a unique address. 2. Verify that all cables are connected securely and that there is proper termination at both ends of the SCSI bus. 3. Verify that the cabling configuration does not exceed the maximum cable length for the adapter in use. 4. Verify that the adapters and devices that you are working with are at the appropriate microcode levels for the customer situation. If you need assistance with microcode issues, contact your service support structure. 5. If there are multiple SCSI adapters on the SCSI bus, verify that the customer is using the appropriate software (such as HACMP™ or HANFS) to support such an arrangement. If the correct software is not in use, some SCSI errors should be expected when multiple adapters attempt to access the same SCSI device. Also, each adapter should have a unique address.High Availability or Multiple SCSI System Checks If you have a high-availability configuration, or if more than one system is attached to the same SCSI bus, do the following: 1. Verify that the adapters and devices have unique SCSI addresses. The default SCSI adapter address is always 7. If you have more than one adapter on the bus, change the address of at least one by using SMIT (SMIT Devices ? SCSI Adapter ? Change/Show characteristics of an adapter). You must make the changes to the database only, then reboot the system in order for the change to take effect.Note: Diagnostics defaults to using ID 7 (it is recommended that this ID not be used in high availability configurations). 2. If RAID devices such as the 7135 or 7137 are attached, run the appropriate diagnostics for the device. If problems occur, contact your service support structure for assistance. If the diagnostics are run incorrectly on these devices, misleading SRNs can result. 3. Diagnostics cannot be run against OEM devices; doing so results in misleading SRNs. 4. Verify that all cables are connected securely and that both ends of the SCSI bus is terminated correctly. 5. Verify that the cabling configuration does not exceed the maximum cable length for the adapter in use. Refer to the SCSI Cabling section in the RS/6000® eServer™ pSeries® Adapters, Devices, and Cable Information for Multiple Bus Systems for more details on SCSI cabling issues. 6. Verify that the adapter and devices are at the appropriate microcode levels for the customer situation. If you need assistance with microcode issues, contact your service support structure.SCSI-2 Single-Ended Adapter PTC Failure Isolation Procedure Before replacing a SCSI-2 single-ended adapter, use these procedures to determine if a short-circuit condition exists on the SCSI bus. The same positive temperature coefficient (PTC) resistor is used for both the internal and external buses. The PTC protects the SCSI bus from high currents due to short-circuiting 252 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) on the cable, terminator, or device. It is unlikely that the PTC can be tripped by a defective adapter. Unless instructed to do so by these procedures, do not replace the adapter because of a tripped PTC resistor. A fault (short-circuit) causes an increase in PTC resistance and temperature. The increase in resistance causes the PTC to halt current flow. The PTC returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 5 minutes for the PTC resistor to fully cool, then retest. These procedures determine if the PTC resistor is still tripped and then determine if there is a short-circuit somewhere on the SCSI bus. Determining Where to Start Use the following to determine the adapter configuration and select the appropriate procedure: v If there are external cables attached to the adapter, start with the External Bus PTC Isolation Procedure for your type of adapter. v If there are no external cables attached, start with the “Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure” on page 254. v If there is a combination of external and internal cables start with the External Bus PTC Isolation Procedure for your type of adapter. If this procedure does not resolve the problem, continue with the Internal Bus PTC Isolation Procedure for your type of adapter.External SCSI-2 Single-Ended Bus PTC Isolation Procedure Isolate the external SCSI bus PTC fault with the following procedure: Note: The external bus is of single-ended design. 1. Ensure the system power and all externally attached device power is turned off. All testing is accomplished with the power off. 2. Disconnect any internal and external cables from the adapter and remove the adapter from the system. 3. Verify with a digital ohmmeter that the internal PTC resistor, labeled Z1, (refer to the illustration after Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure, step 3 on page 254) is cool and in a low resistance state, typically less than 1/2 Ohm. Measuring across, be sure to probe both sides of the PTC where the solder joints and board come together. The polarity of the test leads is not important. If necessary, allow the PTC resistor to cool and measure again. 4. Locate Capacitor C1 and measure the resistance across it by using the following procedure: a. Connect the positive lead to the side of the capacitor where the + is indicated on the board near C1. Be sure to probe at the solder joint where the capacitor and board come together. b. Connect the negative lead to the opposite side of the capacitor marked ?GND.? Be sure to probe at the solder joint where the capacitor and board come together. c. If there is no short-circuit present, then the resistance reading is high, typically hundreds of Ohms.Note: Because this is a measurement across unpowered silicon devices, the reading is a function of the ohmmeter used. v If there is a fault, the resistance reading is low, typically below 10 Ohms. Because there are no cables attached, the fault is on the adapter. Replace the adapter. Note: Some multi-function meters label the leads specifically for voltage measurements. When using this type of meter to measure resistance, the plus lead and negative lead my not be labeled correctly. If you are not sure that your meter leads accurately reflect the polarity for measuring resistance, repeat this step with the leads reversed. If the short-circuit is not indicated with the leads reversed, the SCSI bus is not faulted (short-circuited). Chapter 2. Isolation Procedures 253 v If the resistance measured was high, proceed to the next step.5. Reattach the external cable to the adapter, then do the following: a. Measure across C1 as previously described. b. If the resistance is still high, in this case above 10 Ohms, then there is no apparent cause for a PTC failure from this bus. If there are internal cables attached, continue to the “Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure.” c. If the resistance is less than 10 Ohms, there is a possibility of a fault on the external SCSI bus. Troubleshoot the external SCSI bus by disconnecting devices and terminators. Measure across C1 to determine whether the fault has been removed. Replace the failing component. Go to Chapter 5, “Verifying a repair,” on page 557.External SCSI-2 Single-Ended Bus Probable Tripped PTC Causes The following list provides some suggestions of things to check when the PTC is tripped: v A short-circuited terminator or cable. Check for bent pins on each connector and removable terminator. v Intermittent PTC failures can be caused by improperly seated cable connectors. Reseat the connector and flex the cable in an attempt to duplicate the fault condition across C1. v Plugging or unplugging a cable or terminator while the system is turned on (hot plugging). v A short-circuited device. v Differential devices or terminators are attached to the single-ended SCSI bus. Note: The SCSI-2 Fast/Wide and Ultra PCI Adapters use an onboard electronic terminator on the external SCSI bus. When power is removed from the adapter, as in the case of this procedure, the terminator goes to a high impedance state and the resistance measured cannot be verified, other than it is high. Some external terminators use an electronic terminator, which also goes to a high impedance state when power is removed. Therefore, this procedure is designed to find a short-circuited or low resistance fault as opposed to the presence of a terminator or a missing terminator.Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure Isolate the internal SCSI bus PTC resistor fault with the following procedure: Note: The internal bus is single-ended. 1. Ensure that system power and all externally attached device power is turned off. 2. Disconnect any internal and external cables from the adapter, then remove the adapter from the system. 3. Verify with a digital ohmmeter, that the internal PTC resistor, labeled Z1, is cool and in a low resistance state, typically less than 1/2 Ohm. Measuring across, be sure to probe both sides of the PTC where the solder joints and board come together. The polarity of the test leads is not important. If necessary, allow the PTC to cool and measure again. Refer to the following illustration. 254 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Note: Ensure that only the probe tips are touching the solder joints. Do not allow the probes to touch any other part of the component. 4. Locate capacitor C1 and measure the resistance across it using the following procedure: a. Connect the positive lead to the side of the capacitor where the + is indicated. Be sure to probe at the solder joint where the capacitor and board come together. b. Connect the negative lead to the opposite side of the capacitor. Be sure to probe at the solder joint where the capacitor and board come together. c. If there is no short-circuit present, the resistance reading is high, typically hundreds of Ohms.Note: Because this is a measurement across unpowered silicon devices, the reading is a function of the ohmmeter used. v If there is a fault, the resistance reading is low, typically below 10 Ohms. Because there are no cables attached, the fault is on the adapter. Replace the adapter. Note: Some multi-function meters label the leads specifically for voltage measurements. When using this type of meter to measure resistance, the plus lead and negative lead my not be labeled correctly. If you are not sure that your meter leads accurately reflect the polarity for measuring resistance, repeat this step with the leads reversed. Polarity is important in this measurement to prevent forward-biasing diodes, which lead to a false low resistance reading. If the short circuit is not indicated with the leads reversed, the SCSI bus is not faulted (short-circuited). v If the resistance is high and there is no internal cable to reattach, there is no apparent cause for the PTC resistor diagnostic failure. v If the resistance is high and there is an internal cable to reattach, proceed to the next step.5. Reattach the internal cable to the adapter, then do the following: a. Measure across C1 as described previously. Chapter 2. Isolation Procedures 255 b. If the resistance is still high, above 25 Ohms, there is no apparent cause for a PTC failure. c. If the resistance is less than 10 Ohms, a fault on the internal SCSI bus is possible. Troubleshoot the internal SCSI bus by disconnecting devices and terminators. Measure across C1 to determine if the fault has been removed.Note: Some internal cables have nonremovable terminators.Internal SCSI-2 Single-Ended Bus Probable Tripped PTC Resistor Causes The following list provides some suggestions of things to check when the PTC is tripped: v A short-circuited terminator or cable. Check for bent pins on each connector and removable terminator. v Intermittent PTC failures can be caused by incorrectly seated cable connectors. Reseat the connector and flex the cable in an attempt to duplicate the fault condition across C1. v A short-circuited device. v On some systems, the terminator is fixed to the internal cable and cannot be removed. If all devices are removed from the cable and the resistance is still low, then the cable should be replaced. Note: The SCSI-2 Fast/Wide and Ultra PCI adapters use an onboard electronic terminator on the internal SCSI bus. When power is removed from the adapter, as in the case of this procedure, the terminator goes to a high impedance state and the resistance measured cannot be verified, other than it is high. Some internal terminators use an electronic terminator, which also goes to a high impedance state when power is removed. Therefore, this procedure is designed to find a short-circuit or low resistance fault as opposed to the presence of a terminator or a missing terminator.SCSI-2 Differential Adapter PTC Failure Isolation Procedure Use this procedure when SRN xxx-240 or xxx-800 has been indicated. The differential adapter can be identified by the 4-B or 4-L on the external bracket plate. Before replacing a SCSI-2 differential adapter, use these procedures to determine if a short-circuit condition exists on the SCSI Bus. The PTC protects the SCSI bus from high currents due to short-circuits on the cable, terminator, or device. It is unlikely that the PTC can be tripped by a defective adapter. Unless instructed to do so by these procedures, do not replace the adapter because of a tripped PTC resistor. A fault (short-circuit) causes an increase in PTC resistance and temperature. The increase in resistance causes the PTC to halt current flow. The PTC returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 5 minutes for the PTC resistor to fully cool, then retest. These procedures determine if the PTC resistor is still tripped and then determine if there is a short-circuit somewhere on the SCSI bus. External SCSI-2 Differential Adapter Bus PTC Isolation Procedure Isolate the external SCSI bus PTC fault with the following procedure: Notes: 1. Ensure that only the probe tips are touching the solder joints. Do not allow the probes to touch any other part of the component. 2. The external bus is differential.1. Ensure that system power and all externally attached device power is turned off. 256 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 2. Check to ensure all devices are marked SCSI Differential and that the terminator on the end of the SCSI bus is also marked differential. If not, you may have a single-ended SCSI device or terminator on the differential SCSI bus. Single-ended devices do not work on a differential SCSI bus and may cause a PTC type error to be reported. The entire SCSI bus may appear to be intermittent. After ensuring the system is completely differential, continue. 3. Disconnect the external cables from the adapter and remove the adapter from the system. 4. Verify with a digital ohmmeter that the internal PTC resistor, labeled Z1, (refer to the illustration on page “External SCSI-2 Differential Adapter Bus PTC Isolation Procedure” on page 256) is cool and in a low resistance state, typically less than 1/2 Ohm. Measuring across, be sure to probe both sides of the PTC resistor where the solder joints and board come together. The polarity of the test leads is not important. If necessary, allow the PTC resistor to cool and measure again. 5. Locate capacitor C1 and measure the resistance across it using the following procedure: a. Connect the negative lead to the side of the capacitor marked ?GND?. Be sure to probe at the solder joint where the capacitor and board come together. b. Connect the positive lead to the side of the capacitor marked ?Cathode D1? on the board near C1. Be sure to probe at the solder joint where the capacitor and board come together. v If there is no fault present, then the resistance reading is 25 to 35 Ohms. The adapter is not faulty. Continue to the next step. v If the resistance measured is higher than 35 Ohms, check to see if RN1, RN2, and RN3 are plugged into their sockets. If these sockets are empty, you are working with a Multi-Initiators or High-Availability system. With these sockets empty, a resistive reading across C1 cannot be verified other than it measures a high resistance (not a short-circuit). If the resistance measurement is not low enough to be suspected as a fault (lower than 10 Ohms), continue to the next step. Chapter 2. Isolation Procedures 257 v If the resistance is high and there is no external cable to reattach, there is no apparent cause for the PTC diagnostic failure. v If the resistance reading is low, typically below 10 Ohms, there is a fault. Because there are no cables attached, the fault is on the adapter. Replace the adapter. v If the resistance measured was high and there is an external cable to reattach, proceed to the next step.6. Reattach the external cable to the adapter. a. Measure across C1 as previously described. b. If the resistance is between 10 to 20 Ohms, there is no apparent cause for a PTC resistor failure. c. If the resistance is less than 10 Ohms, there is a possibility of a fault on the external SCSI bus. Troubleshoot the external SCSI bus by disconnecting devices and terminators. Measure across C1 to determine if the fault has been removed.SCSI-2 Differential Adapter Probable Tripped PTC Causes The following list provides some suggestions of things to check when the PTC is tripped: v A short-circuited terminator or cable. Check for bent pins on each connector and removable terminator. v Intermittent PTC failures can be caused by incorrectly seated cable connectors. Reseat the connector and flex the cable in an attempt to duplicate the fault condition across C1. v Plugging or unplugging a cable or terminator while the system is turned on (hot-plugging). v A short-circuited device. v Single-ended devices are attached to the differential SCSI bus.Dual-Channel Ultra SCSI Adapter PTC Failure Isolation Procedure Use the following procedures if diagnostics testing indicates a potential positive temperature coefficient (PTC) resistor fault or the TERMPWR short-circuited LED is lit. This procedure is used for SRNs 637-240 and 637-800 on the Dual-Channel Ultra SCSI Adapter. If the TERMPWR short-circuited LED is lit, use this procedure to help isolate the source of the problem on the failing channel. 1. Identify the adapter by its label of 4-R on the external bracket. Then, determine if the failure is on channel A or channel B. 2. The same PTC is used for both the internal and external buses. The PTC protects the SCSI bus from high currents due to short-circuits on the cable, terminator, or device. It is unlikely that the PTC can be tripped by a defective adapter. A fault (short-circuit) causes an increase in PTC resistance and temperature. The increase in resistance causes the PTC to halt current flow. The PTC returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 5 minutes for the PTC resistor to fully cool, then retest. 3. If this same error persists, or the TERMPWR short-circuited LED is lit, replace the components of the failing channel in the following order (wait five minutes between steps): a. If the failure is on the external cable, replace the following: 1) Cable 2) Device 3) Attached subsystem 4) Adapterb. If the failure is on the internal cable, replace the following: 1) Cable 2) Device 258 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3) Backplane 4) Adapterc. If the failure persists, verify that the parts exchanged are in the correct channel (internal or external, A or B). If the errors are still occurring, continue isolating the problem by going to “MAP 0050: SCSI bus problems” on page 269.64-bit PCI-X Dual Channel SCSI Adapter PTC Failure Isolation Procedure Use the following procedures if diagnostics testing indicates a potential self-resetting thermal fuse problem. This procedure is used for SRN 2524-702 on the integrated dual-channel SCSI adapter in a 7039/651 system. 1. Identify the adapter as the one embedded in the system board. Then, determine if the failure is on channel 0 or channel 1. 2. The thermal fuse protects the SCSI bus from high currents due to short-circuits on the terminator, cable, or device. It is unlikely that the thermal fuse can be tripped by a defective adapter. A fault (short-circuit) causes an increase in resistance and temperature of the thermal fuse. The increase in temperature causes the thermal fuse to halt current flow. The thermal fuse returns to a low resistive and low temperature state when the fault is removed from the SCSI bus or when the system is turned off. Wait 10 seconds for the thermal fuse to reset itself and recover, then retest. 3. If the same error persists, replace the components of the failing channel in the following order. Wait 10 seconds for the thermal fuse to reset itself between steps. a. Cable b. Device c. DASD backplane (if present) d. System board (adapter)4. If the failure persists, verify that the parts exchanged are in the correct channel (0 or 1). If the errors are still occurring, continue isolating the problem by going to “MAP 0050: SCSI bus problems” on page 269.MAP 0020: Problem determination procedure Use this MAP to get a service request number (SRN) if the customer or a previous MAP provided none. Purpose of this MAP Use this MAP to get a service request number (SRN) if the customer or a previous MAP provided none. If you are unable to power the system on, refer to the “Power isolation procedures” on page 128. v Step 0020-1 Visually check the server for obvious problems such as unplugged power cables or external devices that are powered off. Did you find an obvious problem? NO Go to Step 0020-2. YES Fix the problem, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0020-2 Are the AIX online diagnostics installed?Note: If AIX is not installed on the server or partition, answer no to the above question. NO If the operating system is running, perform its shutdown procedure. Get help if needed. Go to Step 0020-4. Chapter 2. Isolation Procedures 259 YES Go to Step 0020-3.v Step 0020-3 Note: When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. Run online diagnostics in concurrent mode when the customer does not let you power-off the system unit. To run online diagnostics in service mode, go to substep 5. If the system unit is already running in the service mode and you want to run online diagnostics, proceed to the question at the bottom of this MAP step. Otherwise, continue with 1 through 4 in the following procedure. 1. Log in with root authority or use CE login. If necessary, ask the customer for the password. 2. Enter the diag -a command to check for missing resources. a. If you see a command line prompt, proceed to substep 3 below. b. If the DIAGNOSTIC SELECTION menu is displayed, with the letter M shown next to any resource, select that resource, then press Commit (F7 key). Follow any instructions displayed. If you are prompted with a message Do you want to review the previously displayed error select Yes and press Enter. If an SRN displays, record it, and go to Step 0020-15. If there is no SRN, go to substep 3 below. c. If the MISSING RESOURCE menu is displayed, follow any instructions displayed. If you are prompted with a message Do you want to review the previously displayed error select Yes and press Enter. If an SRN displays, record it, and go to Step 0020-15. If there is no SRN, go to substep 3 below.3. Enter the diag command. 4. Go to Step 0020-5. 5. If the operating system is running, perform its shutdown procedure (get help if needed). 6. Turn off the system unit power and wait 45 seconds before proceeding. 7. Turn on the system unit power. 8. Load the online diagnostics in service mode. 9. Wait until the Diagnostic Operating Instructions display or the system appears to have stopped. Are the Diagnostic Operating Instructions displayed? NO Go to Step 0020-16. YES Go to Step 0020-5.v Step 0020-4 Note: If you are working on a partition, do not remove the power as directed in the following procedure. Remove the power only if you are working on a server that does not have multiple partitions. 1. If the server does not have multiple partitions, disconnect the power from the server, wait 45 seconds, then reconnect the power. 2. If the server supports slow boot (See “Performing a slow boot” on page 665) set the server to perform a slow boot for the next boot that is performed. If the system does not support slow boot, do a normal boot in the next step. 3. Refer to “Running the online and eServer stand-alone diagnostics” on page 672 to load the standalone diagnostics. Before continuing to the next step, ensure that the server power is turned on, or if you are working on a partition, the partition is started. The server or partition should be booting the standalone diagnostics from a CD-ROM or a network server. 4. Wait until the Diagnostic Operating Instructions display or the server boot appears to have stopped. Are the Diagnostic Operating Instructions displayed? NO Go to Step 0020-16. YES Go to Step 0020-5. 260 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Step 0020-5 Are the Diagnostic Operating Instructions displayed (screen number 801001) with no obvious problem (for example, blurred or distorted)? NO For display problems, go to Step 0020-12. YES To continue with diagnostics, go to Step 0020-6.v Step 0020-6 Press the Enter key. Is the FUNCTION SELECTION menu displayed (screen number 801002)? NO Go to Step 0020-13. YES Go to Step 0020-7.v Step 0020-7 1. Select the option ADVANCED DIAGNOSTICS ROUTINES. Notes: a. If the terminal type is not defined, do so now. You cannot proceed until this is complete. b. If you have SRNs from a Previous Diagnostics Results screen, process these Previous Diagnostics Results SRNs prior to processing any SRNs you may have received from an SRN reporting screen.2. If the DIAGNOSTIC MODE SELECTION menu (screen number 801003) displays, select the option PROBLEM DETERMINATION. 3. Find your system response in the following table. Follow the instructions in the Action column. System Response Action Previous Diagnostic Results. Do you want to review the previously displayed error? You have a pending item in the error log for which there is no corresponding Log Repair Action. To see this error, select YES at the prompt. Information from the error log is displayed in order of last event first. Record the error code, the FRU names and the location code of the FRUs. Go to Step 0020-15. The RESOURCE SELECTION menu or the ADVANCED DIAGNOSTIC SELECTION menu is displayed (screen number 801006). Go to Step 0020-8. The system halted while testing a resource. Record SRN 110-xxxx, where xxxx is the first four digits of the menu number displayed in the upper-right corner of the diagnostic menu. Go to Step 0020-15. Chapter 2. Isolation Procedures 261 System Response Action The MISSING RESOURCE menu is displayed or the letter M is displayed alongside a resource in the resource list. If the MISSING RESOURCE menu is displayed, follow the displayed instructions until either the ADVANCED DIAGNOSTIC SELECTION menu or an SRN is displayed. If an M is displayed in front of a resource (indicating that it is missing) select that resource then choose the Commit (F7 key). Notes: 1. Run any supplemental media that may have been supplied with the adapter or device, and then return to substep 1 of Step 0020-7. 2. If the SCSI enclosure services device appears on the missing resource list along with the other resources, select it first. 3. ISA adapters cannot be detected by the system. The ISA adapter configuration service aid in standalone diagnostics allows the identification and configuration of ISA adapters.If the ADVANCED DIAGNOSTIC SELECTION menu is displayed, go to Step 0020-11. If an 8-digit error code is displayed, record it and go to Chapter 1, “Starting a Service Call,” on page 1. If an SRN is displayed, record it, and go to Step 0020-15. The message The system will now continue the boot process is displayed continuously on the system unit’s console. Go to Step 0020-4. The message Processing supplemental diagnostic diskette media is displayed continuously on the system unit’s console. Call your service support. The diagnostics begin testing a resource. Note: If the option Problem Determination was selected from the DIAGNOSTIC MODE SELECTION menu, and if a recent error has been logged in the error log, the diagnostics automatically begin testing the resource. Follow the displayed instructions. If the No Trouble Found screen is displayed, press Enter. If another resource is tested, repeat this step. If the ADVANCED DIAGNOSTIC SELECTION menu is displayed, go to Step 0020-11. If an SRN is displayed, record it, and go to Step 0020-15. If an 8-digit error code is displayed, go to Chapter 1, “Starting a Service Call,” on page 1. The system did not respond to selecting the advanced diagnostics option. Go to Step 0020-13. A system unit with a beeper did not beep while booting. Record SRN 111-947 and then go to Step 0020-15 The system unit emits a continuous sound from the beeper. Record SRN 111-947 and then go to Step 0020-15. An SRN or an eight-digit error code is displayed. Record the error code, the FRU names, and the location code for the FRUs. If a SRN is displayed, go to Step 0020-15. If an 8-digit error code is displayed, go to Chapter 1, “Starting a Service Call,” on page 1. The system stopped with a 3-digit or 4-digit code displayed in the operator panel display. Record SRN 101-xxx (where xxx is the rightmost three digits of the displayed code). Go to Step 0020-15. 262 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) System Response Action An 888 message is displayed in the operator panel display. Note: The 888 may or may not be flashing. Go to “MAP 0070: 888 sequence in operator panel display” on page 278. v Step 0020-8 On the DIAGNOSTIC SELECTION or ADVANCED DIAGNOSTIC SELECTION menu, look through the list of resources to make sure that all adapters and SCSI devices are listed including any new resources. Notes: 1. Resources attached to serial and parallel ports may not appear in the resource list. 2. If running diagnostics in a partition within a partitioned system, resources assigned to other partitions will not be displayed on the resource list.Did you find all the adapters or devices on the list? NO Go to Step 0020-9. YES Go to Step 0020-11.v Step 0020-9 Is the new device or adapter an exact replacement for a previous one installed at same location? NO Go to Step 0020-10. YES The replacement device or adapter may be defective. If possible, try installing it in an alternate location if one is available; if it works in that location, then suspect that the location where it failed to appear has a defective slot. Schedule time to replace the hardware that supports that slot. If it does not work in alternate location, suspect a bad replacement adapter or device. If you are still unable to detect the device or adapter, contact your service support structure.v Step 0020-10 Is the operating system software to support this new adapter or device installed? NO Load the operating system software. YES The replacement device or adapter may be defective. If possible, try installing it in an alternate location if one is available; if it works in that location, then suspect that the location where it failed to appear has a defective slot. Schedule time to replace the hardware that supports that slot. If it does not work in alternate location, suspect a bad replacement adapter or device. If you are still unable to detect the device or adapter, contact your service support structure.v Step 0020-11 Select and run the diagnostic test problem determination or system verification on one of the following: – The resources with which the customer is having problems. If the resource is not shown on the DIAGNOSTIC SELECTION menu, then run diagnostics on its parent (the adapter or controller to which the resource is attached). – The resources you suspect are causing a problem. – All resources. Note: When choosing All Resources, interactive tests are not done. If no problem is found running All Resources, select each of the individual resources on the selection menu to run diagnostics tests on to do the interactive tests Chapter 2. Isolation Procedures 263 Find the response in the following table, or follow the directions on the test results screen. Diagnostic Response Action An SRN or an eight-digit error code is displayed on the screen. Record the error code, the FRU names, and the location code for the FRUs. If an SRN is displayed, go to Step 0020-15. If an 8-digit error code is displayed, go to the information center, and perform a search on the error code to obtain the name and location of the failing FRU. Perform the listed action. The TESTING COMPLETE menu and the No trouble was found message are displayed, and you have not tested all of the resources. Press Enter and continue testing other resources. The TESTING COMPLETE menu and the No trouble was found message are displayed, and you have tested all of the resources. Go to Step 0020-14. Note: If you have not run the sysplanar test, do so before going to Step 0020-14. The system halted while testing a resource. Record SRN 110-xxxx, where xxxx is the first three or four digits of the menu number displayed in the upper-right corner of the diagnostic menu screen. Go to Step 0020-15. When running the Online Diagnostics, an installed device does not appear in the test list. Ensure that the diagnostic support for the device was installed. The display configuration service aid can be used to determine whether diagnostic support is installed for the device. Record SRN 110-101. Go to Step 0020-15. Note: Supplemental diskettes may be required if service aids are run from standalone diagnostics. The IBM ARTIC960 Quad T1/E1 adapter diagnostics display a message indicating that the interface board (PMC) is either not installed or is malfunctioning. Install a PMC board if not already installed. When running online diagnostics on any of the IBM ARTIC960 family of adapters and the message indicates that the PMC is not installed, but it is installed, do the following: v Reseat the PMC board, then run diagnostics. v If the response is the same, replace the PMC and then go to Chapter 5, “Verifying a repair,” on page 557. The symptom was not found in the table. Return to the Chapter 1, “Starting a Service Call,” on page 1. v Step 0020-12 The following step analyzes a console display problem. Find your type of console display in the following table. Follow the instructions given in the Action column. Type of Console Display Action TTY-type terminal Be sure the TTY terminal attributes are set correctly. See ?Running the Diagnostic Programs from a TTY Terminal? in “Running the online and eServer stand-alone diagnostics” on page 672. If you did not find a problem with the attributes, go to the documentation for this type of TTY terminal, and continue problem determination. If you do not find the problem, record SRN 111-259, then go the Step 0020-15. Graphics display Go to the documentation for this type of graphics display, and continue problem determination. If you do not find the problem, record SRN 111-82c, then go to Step 0020-15. 264 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Type of Console Display Action HMC (Hardware Management Console) Go to the HMC service procedures in Service Guide for the Hardware Management Console Models 7042-CR4 and 7042-C06, SA76-0120. If HMC tests find no problem, there may be a problem with the communication between the HMC and the managed system. If the HMC communicates with the managed system through a network interface, verify whether the network interface is functional. If the HMC communicates with the managed system through the HMC interface, check the cable between the HMC and the managed system. If it is not causing the problem, suspect a configuration problem of the HMC communications setup. v Step 0020-13 There is a problem with the keyboard. Find the type of keyboard you are using in the following table. Follow the instructions given in the Action column. Keyboard Type Action Type 101 keyboard (U.S.). Identify by the size of the Enter key. The Enter key is in only one horizontal row of keys. Record SRN 111-736, then go to Step 0020-15. Type 102 keyboard (W.T.). Identify by the size of the Enter key. The Enter key extends into two horizontal rows. Record SRN 111-922; then go to Step 0020-15. Kanji-type keyboard. (Identify by the Japanese characters.) Record SRN 111-923; then go to Step 0020-15. TTY terminal keyboard Go to the documentation for this type of TTY terminal and continue problem determination. HMC (Hardware Management Console) Go to the HMC service procedures in Service Guide for the Hardware Management Console Models 7042-CR4 and 7042-C06, SA76-0120. If HMC tests find no problem, there may be a problem with the communication between the HMC and the managed system. If the HMC communicates with the managed system through a network interface, verify whether the network interface is functional. If the HMC communicates with the managed system through the HMC interface, check the cable between the HMC and the managed system. If it is not causing the problem, suspect a configuration problem of the HMC communications setup. v Step 0020-14 The diagnostics did not detect a problem. If the problem is related to either the system unit or the I/O expansion box, refer to the service documentation for that unit. If the problem is related to an external resource, use the problem determination procedures, if available, for that resource. If a problem occurs when running online diagnostics but not when running the stand-alone diagnostics, suspect a software problem. Check for the presence of supplemental diagnostic material, such as diskettes or documentation. This is possibly a problem with software or intermittent hardware. If you think that you have an intermittent hardware problem, go to “MAP 0040: Intermittent Problem Isolation” on page 267. v Step 0020-15 Take the following actions: 1. Handle multiple SRNs and error codes in the following order: a. 8-Digit Error Codes. Chapter 2. Isolation Procedures 265 b. SRNs with a source code other than F or G. c. SRNs with a source code of F. Run online diagnostics in advanced and problem determination mode to obtain maximum isolation. d. SRNs with a source code of G. Note: The priority for multiple SRNs with a source of G is determined by the time stamp of the failure. Follow the action for the SRN with the earliest time stamp first. e. Device SRNs and error codes (5-digit SRNs). If a group has multiple SRNs, it does not matter which SRN is handled first. 2. Find the SRN. If the SRN is not listed, look for it in the following: – Any supplemental service manual for the device – The diagnostic problem report screen for additional information – The ?Service Hints? service aid in “Running the online and eServer stand-alone diagnostics” on page 6723. Perform the action listed. 4. If you replace a part, go to Chapter 5, “Verifying a repair,” on page 557.v Step 0020-16 Look up the AIX IPL progress codes for definitions of configuration program indicators. They are normally 0xxx or 2xxx. Is a configuration program indicator displayed? NO Go to the “Problems with loading and starting the operating system (AIX and Linux)” on page 246. YES Record SRN 101-xxxx (where xxxx is the rightmost three or four digits or characters of the configuration program indicator). Go to Step 0020-17.v Step 0020-17 Is a physical location code or an AIX location code displayed on the operator panel display? NO Go to Step 0020-15. YES Record the location code, then go to Step 0020-15.MAP 0030: Additional Problem Determination This MAP is used for problems that still occur after all FRUs indicated by the SRN or error code have been exchanged. Purpose of This MAP Use this MAP for problems that still occur after all FRUs indicated by the SRN or error code have been exchanged. Note: Check the action text of the SRN before proceeding with this MAP. If there is an action listed, perform that action before proceeding with this MAP. v Step 0030-1 Some external devices (including rack drawers that contain devices) have their own problem determination procedures. If the problem is related to an external device that has its own problem determination procedure, run those procedures if not already run. If they do not correct the problem, continue with this MAP. v Step 0030-2 266 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) The problem may have been caused by a resource that has not been tested. System Checkout tests all resources. If the online Diagnostics are installed and you are able to load them, then All Resources under the Diagnostic Selection menu should be run. If you get a different SRN, look up the SRN in the SRN chapters and perform the listed action. If you are unable to run All Resources under the Diagnostic Selection menu or you do not get another SRN when running it, continue with this MAP. v Step 0030-3 If the problem is related to a SCSI device, SCSI bus, or SCSI controller, go to “MAP 0050: SCSI bus problems” on page 269. If you are unable to isolate the problem with MAP 0050, continue with Step 0030-4. v Step 0030-4 1. Find the resource(s) that are identified by the SRN or error code in the following table. 2. Perform the first action listed for the resource. 3. If you exchange a FRU or change a switch setting, test the resource again. 4. If the action does not correct the problem, perform the next action until all actions have been tried. If an action says to exchange a FRU that you have already exchanged, go to the next action. If an action corrects the problem, go to Chapter 5, “Verifying a repair,” on page 557. 5. If you perform all of the actions and do not correct the problem, check the Service Hints service aid for information. If the service aid does not help, call your service support structure. Failing Resource Repair Action SCSI Device Exchange the SCSI controller. Replace the power supply. Pluggable SCSI or IDE controller Exchange the backplane into which the adapter is plugged. Keyboard, tablet, mouse, dials, LPF keys, diskette drive Check the cable attaching the device to its adapter. If you do not find a problem, exchange the device’s adapter. Pluggable adapters, CPU cards, and controllers Determine whether the adapter contains any attached FRUs such as fuses, DRAMs, and crossover cables. 1. Check or exchange any attached FRU on the resource. 2. If the adapter is plugged into a riser card, check or exchange the riser card. 3. Exchange the backplane into which the adapter is plugged. System and I/O backplanes Contact your service support structure. Built-In system ports Replace the service processor if present. A device attached to the system by a cable and an adapter. 1. Replace the adapter for the device. 2. Replace the cable to the device. IDE Device Replace the cable between the IDE controller and the device. If the IDE controller is packaged on a backplane, replace that backplane, otherwise replace the adapter containing the IDE controller. MAP 0040: Intermittent Problem Isolation This MAP provides a structured way of analyzing intermittent problems. Purpose of This MAP This MAP provides a structured way of analyzing intermittent problems. It consists of two tables: Hardware Symptoms and Software Symptoms. Because software or hardware can cause intermittent problems, consider all symptoms relevant to your problem. Chapter 2. Isolation Procedures 267 How to Use This MAP This MAP contains information about causes of intermittent symptoms. In the following tables, find your symptoms, and read the list of things to check. When you exchange a FRU, go to Chapter 5, “Verifying a repair,” on page 557 to check out the system. Hardware SymptomsNote: This table spans several pages. Symptom of Hardware Problem Things to Check For Any hardware log entry in the error log. Use the Hardware Error Report service aid to view the error log and check for: v Multiple errors on devices attached to the same SCSI bus. v Multiple errors on devices attached to the same async adapter. v Multiple errors on internally installed devices only.Contact your service support structure for assistance with error report interpretation. Hardware-caused system crashes v The connections on the CPU planar or CPU card v Memory modules for correct connections v Connections to the system planar. v Cooling fans operational v The environment for a too-high or too-low operating temperature. v Vibration: proximity to heavy equipment. System unit powers off a few seconds after powering On. v Fan speed. Some fans contain a speed-sensing circuit. If one of these fans is slow, the power supply powers the system unit off. v Correct voltage at the outlet into which the system unit is plugged. v Loose power cables and fan connectors, both internal and external. System unit powers off after running for more than a few seconds. v Excessive temperature in the power supply area. v Loose cable connectors on the power distribution cables. v Fans turning at full speed after the system power has been on for more than a few seconds. Only internally installed devices are failing. Check the following items that are common to more than one device: v Ground connections on all of the disk drives and other types of drives installed. v Loose connections on the power cables to the planars, drives, fans, and battery. v System unit cooling. Is the input air temperature within limits? Are all the fans running at full speed? Are any of the vent areas blocked? v Signal cables to the diskette drives, and the power supply. v SCSI device signal cables for loose connectors and terminators. v Loose SCSI device address jumpers. v Possible contamination of any device that has a cleaning procedure. See the operator guide for cleaning instructions. v Excessive static electricity. v Correct voltage at the system unit power outlet 268 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Symptom of Hardware Problem Things to Check For Only externally attached devices are failing. Check the following items that are common to more than one device. v Check the SCSI signal cables to the devices for loose connectors and terminators. v Check devices that use jumpers to set the SCSI address for loose jumpers. v Check any device that has a cleaning procedure for contamination. See the operator guide for cleaning instructions. v Check for excessive static electricity. v Check the outlet that the device is plugged into for correct voltage. v Check the error log for entries for the adapter driving the failing devices. v Check the temperature of the devices. Are the cooling vents blocked? Are the fans running? v Check for other devices near the failing device that may be radiating noise (displays, printers, and so on). Software Symptoms Symptom of Software Problem Things to Check For Any symptom you suspect is related to software. Use the software documentation to analyze software problems. Check the RETAIN® database for known problems with your type of system unit or software. Software-caused system crashes Check the following software items: v Is the problem only with one application program? v Is the problem only with one device? v Does the problem occur on a recently installed program? v Was the program recently patched or modified in any way? v Is the problem associated with any communication lines? v Check for static discharge occurring at the time of the failure. MAP 0050: SCSI bus problems Use this MAP to analyze problems with a SCSI bus. Purpose of this MAP Use this MAP to analyze problems with a SCSI bus. Considerations v To prevent hardware damage or erroneous diagnostic results from a system with its power turned on, use ?PCI hot-plug manager? subtask ?replace/remove PCI hot plug adapter? before connecting or disconnecting cables or devices. v Also, use this MAP for SCSI adapters that are built into system backplanes or I/O backplanes. If this procedure calls for replacing a SCSI adapter and the SCSI adapter is built into the system backplane or I/O backplane, replace the system backplane or I/O backplane as appropriate. v If the failure is a terminator power failure (SRNs xxx-226, xxx-240, xxx-800), always allow five minutes for the PTC to cool. v The differential version of the adapter has socket-type terminators to support high-availability. If this is the adapter’s configuration, the terminators would have been removed from the adapter. MAP steps requiring the removal of the cable from the adapter do not apply, because an adapter that is not terminated always fails diagnostics. Proper SCSI diagnostics require proper termination. If the Chapter 2. Isolation Procedures 269 configuration involves a Y-cable, leave it, with the appropriate terminator, attached to the adapter. Or, place an external differential terminator on the external port. v If the system uses shared disk-drive hardware or a high-availability configuration, ensure that the other system that is sharing the devices is not using the devices. For additional information concerning high-availability configurations, see “SCSI service hints” on page 252. v For intermittent problems that cannot be resolved with this MAP, refer to “SCSI service hints” on page 252. v If the SCSI bus is attached to a RAID subsystem, refer to the RAID subsystem documentation for any problem determination. If the RAID adapter is a PCI-X RAID adapter, refer to the PCI-X SCSI RAID Controller Reference Guide for AIX.Follow the steps in this MAP to isolate a SCSI bus problem.Note: This procedure removes devices and components from a SCSI bus until a problem or a symptom or problem is eliminated. If you follow the entire procedure, you will remove all components of a SCSI bus in the following order: 1. Hot-swap devices 2. Devices that are not hot-swap 3. SCSI Enclosure Services (SES) device or enclosures 4. SCSI cables 5. SCSI adapterDo the following: v Step 0050-1 Have changes been made recently to the SCSI configuration? NO Go to Step 0050-2. YES Go to Step 0050-5.v Step 0050-2 Are there any hot-swap devices (SCSI disk drives or media devices) controlled by the adapter? NO Go to Step 0050-3. YES Go to Step 0050-11.v Step 0050-3 Are there any devices other than hot-swappable devices controlled by the adapter? NO Go to Step 0050-4. YES Go to Step 0050-13.v Step 0050-4 Is an enclosure or drawer that supports hot-swap devices controlled by the adapter? NO Go to Step 0050-22. YES Go to Step 0050-15.v Step 0050-5 This step handles cases where recent changes have been made to the SCSI configuration. Using the first three digits of the SRN, refer to the FFC listing and determine if the adapter is single-ended or differential. Is the adapter a single-ended adapter? NO Go to Step 0050-6. YES Go to Step 0050-7. 270 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Step 0050-6 The adapter’s termination jumper settings may be incorrect. Power off the system, and inspect jumper J7. Are the jumpers correct? NO Go to Step 0050-8. YES Go to Step 0050-9.v Step 0050-7 If the adapter is not being used in a high-availability configuration, be sure sockets RN1, RN2, and RN3 are populated. If the adapter is being used in a high-availability configuration, be sure sockets RN1, RN2, and RN3 are not populated. Go to Step 0050-9. v Step 0050-8 1. Correct the jumper settings and reinstall the adapter and all cables. 2. Power on the system, and run diagnostics in system verification mode on the adapter. Did the diagnostic pass? NO Go to Step 0050-9. YES Go to Step 0050-10.v Step 0050-9 Check for the following problems: – Address conflicts between devices. – Cabling problems such as configurations that exceed the maximum cable lengths, missing termination, or excessive termination. Did you find a problem? NO Go to Step 0050-2. YES Go to Step 0050-10.v Step 0050-10 1. Correct the problem. 2. Power on the system, and run diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0050-2.v Step 0050-11 This step determines if a hot-swap device is causing the failure. 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Disconnect all hot-swap devices attached to the adapter. 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-12. YES Go to Step 0050-3.v Step 0050-12 Chapter 2. Isolation Procedures 271 Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. Reconnect the hot-plug devices one at a time. After reconnecting each device, do the follow: 1. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 2. Rerun the diagnostics on the adapter. 3. If the adapter fails, the problem may be with the last device reconnected. Perform these substeps: a. Follow repair procedures for that last device. b. Rerun diagnostics on the adapter. c. If diagnostics fail, replace the SES backplane corresponding to the slot for the device. d. Rerun diagnostics. e. If diagnostics fail, replace the last device. f. Rerun diagnostics on the adapter. g. If diagnostics pass, go to Chapter 5, “Verifying a repair,” on page 557. Otherwise, contact your support center. Note: A device problem can cause other devices attached to the same SCSI adapter to go into the defined state. Ask the system administrator to make sure that all devices attached to the same SCSI adapter as the device that you replaced are in the available state.4. If no errors occur, the problem could be intermittent. Make a record of the problem. Running the diagnostics for each device on the bus may provide additional information.v Step 0050-13 This step determines if a device other than a hot-swappable device is causing the failure. Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Disconnect all devices attached to the adapter (except for the device from which you boot to run diagnostics; you may want to temporarily move this device to another SCSI port while you are trying to find the problem). 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-14. YES Go to Step 0050-4.v Step 0050-14 Reconnect the devices one at time. After reconnecting each device, follow this procedure: 1. Rerun the diagnostics in system verification mode on the adapter. 2. If there is a failure, the problem should be with the last device reconnected. Follow the repair procedures for that device, then go to Chapter 5, “Verifying a repair,” on page 557. 3. If no errors occur, the problem could be intermittent. Make a record of the problem. Running the diagnostics for each device on the bus may provide additional information.v Step 0050-15 This step determines whether the SCSI Enclosure Services (SES) controller, which provides hot-plug capability for SCSI drives in the server, is causing the problem. Note: In most cases the SES controller is integrated on the backplane that is used to connect SCSI devices, for example a disk drive backplane. If your system has hot-plug capability and the SES controller is separate from the SCSI drive backplane, there will be an intermediate card on the 272 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) SCSI bus between the SCSI adapter and the device or SCSI backplane. You will have to make a visual check to see if there are any intermediate cards on the SCSI bus that is displaying a problem. Does a separate SES controller plug into the SCSI device backplane? NO Go to Step 0050-18. YES Go to Step 0050-16.v Step 0050-16 Follow these steps: 1. Power off the system. 2. Remove the intermediate SES controller card. Locate the SES controller part number under Chapter 8, “Part Information,” on page 691. 3. Power on the system. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-17. YES Go to Step 0050-18.v Step 0050-17 Follow these steps: 1. Power off the system. 2. Replace the intermediate SES controller card. 3. Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-18 Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Disconnect all cables attached to the SCSI adapter. For SCSI differential adapters in a high-availability configuration, see Considerations. 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Step 0050-19. YES Replace the adapter, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-19 Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration” on page 274. 2. Reconnect the cables to the adapter. Does the SES controller (an intermediated SES controller) plug into the backplane? NO Go to Step 0050-20. YES Go to Step 0050-21.v Step 0050-20 Follow these steps: Chapter 2. Isolation Procedures 273 1. Replace the SES controller. Locate the intermediate SES controller part number under Chapter 8, “Part Information,” on page 691. 2. Power on the system. 3. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 4. Run the diagnostics in system verification mode on the adapter. Did a failure occur? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0050-21.v Step 0050-21 One of the cables remaining in the system is defective. Refer to Chapter 8, “Part Information,” on page 691 for the cable part numbers. Replace the parts one at a time in the order listed. Follow these steps for each FRU replaced: 1. Rerun the diagnostics for the adapter. 2. If there is any failure, continue with the next FRU. 3. If there is no failure, go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-22 Follow these steps: 1. Go to “Preparing for hot-plug SCSI device or cable deconfiguration.” 2. Disconnect all cables attached to the adapter (except for the cable to the device from which you boot to run diagnostics; you may want to temporarily move this device to another SCSI port while you are trying to find the problem). 3. Go to “After hot-plug SCSI device or cable deconfiguration” on page 275. 4. If the Missing Options menu displays, select the option The resource has been turned off, but should remain in the system configuration for all the devices that were disconnected. 5. Run the diagnostics on the adapter. Did a failure occur? NO Go to Step 0050-23. YES Replace the adapter, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0050-23 One of the cables remaining in the system is defective. Refer to Chapter 8, “Part Information,” on page 691 for the cable part numbers. Replace the parts one at time in the order listed. Follow these steps for each FRU replaced: 1. Rerun the diagnostics for the adapter. 2. If there is any failure, continue with the next FRU. 3. If there is no failure, go to Chapter 5, “Verifying a repair,” on page 557. “Preparing for hot-plug SCSI device or cable deconfiguration”Use this procedure when you are preparing to deconfigure a hot-plug SCSI device or cable. This procedure will help determine if a SCSI device or SCSI device cable is causing your system problem. “After hot-plug SCSI device or cable deconfiguration” on page 275Use this procedure after you deconfigure or hot-plug a SCSI device to ensure that the replaced component was successfully installed.Preparing for hot-plug SCSI device or cable deconfiguration Use this procedure when you are preparing to deconfigure a hot-plug SCSI device or cable. This procedure will help determine if a SCSI device or SCSI device cable is causing your system problem. Purpose of this procedure: 274 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Use this procedure to determine if a SCSI device or SCSI device cable is causing the symptom. Disconnect all cables attached to the adapter, (except for the cable to the device from which you boot to run diagnostics; temporarily move these device cables to another SCSI port while you are trying to find the problem). 1. Go to “Running the online and eServer stand-alone diagnostics” on page 672 and perform the prerequisite tasks described in the Before you begin topic. 2. Determine which SCSI adapter you plan to remove the cables or devices from. 3. Adapter slots are numbered on the rear of the system unit, record the slot number and location of each adapter being removed. 4. Ensure that any processes or applications that might use the adapter are stopped. 5. Enter the system diagnostics by logging in as ?root? user or as the ?celogin? user. Type the diag command on the AIX command line. 6. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter. The FUNCTION SELECTION menu appears. 7. From the FUNCTION SELECTION menu, select Task Selection, then press Enter. 8. From the Task Selection list, select PCI Hot Plug Manager. 9. From the PCI Hot Plug Manager menu, select Unconfigure a Device, then press Enter. 10. Press F4 or ESC 4 to display the Device Names menu. 11. Select the adapter from which you are removing the cables or devices in the Device Names menu. 12. In the Keep Definition field, use the tab key to answer Yes. 13. In the Unconfigure Child Devices field, use the Tab key to answer Yes, then press Enter. The ARE YOU SURE screen displays. 14. Press Enter to verify the information. A successful deconfiguration is indicated by the ?OK? message displayed next to the Command field at the top of the screen. 15. Press F4 or ESC 4 twice to return to the Hot Plug Manager menu. 16. Select replace/remove PCI Hot Plug adapter. 17. Select the slot that has the adapter you want to remove the cables or devices from in the system. 18. Select Remove. Note: A fast blinking amber LED located at the back of the machine indicates the slot that you selected. 19. Press Enter. This places the adapter in the ?action? state, meaning it is ready to be removed from the system. (You do not need to remove the adapter, unless it makes removing the cables attached to it easier).After hot-plug SCSI device or cable deconfiguration Use this procedure after you deconfigure or hot-plug a SCSI device to ensure that the replaced component was successfully installed. 1. Press Enter, then continue to follow the screen instructions until you receive a message that the replacement is successful. A successful replacement is indicated by the OK message displayed next to the command field at the top of the screen. 2. Press the F3 or ESC 3 key to return to the PCI Hot-Plug Manager menu. 3. Press the F3 or ESC 3 key to return to the TASK selection list. 4. Select Log Repair Action. 5. Select the adapter you just removed the cables or devices from, then press Enter. 6. Press Commit (F7 or ESC 7), then press Enter. 7. Press the F10 or the ESC 0 key to exit diagnostics. 8. Type the diag -a command on the command line. Chapter 2. Isolation Procedures 275 MAP 0054: PCI-X SCSI Bus Problems Use this MAP to determine which FRUs may need to be replaced in order to solve a SCSI bus-related problem on a PCI-X SCSI or PCI-X SCSI RAID adapter. Purpose of This MAP Use the following to determine which FRUs may need to be replaced in order to solve a SCSI bus-related problem on a PCI-X SCSI or PCI-X SCSI RAID adapter. Considerations v Remove power from the system before connecting and disconnecting cables or devices, as appropriate, to prevent hardware damage or erroneous diagnostic results. v Note that some systems have SCSI and PCI-X bus interface logic integrated onto the system boards and use a pluggable RAID enablement card (a non-PCI form factor card) for these SCSI and PCI-X buses. An example of such a RAID enablement card is FC 5709. For these configurations, replacement of the RAID enablement card is unlikely to solve a SCSI bus-related problem, because the SCSI bus interface logic is on the system board. v Some adapters provide two connectors, one internal and one external, for each SCSI bus. For this type of adapter, it is not acceptable to use both connectors for the same SCSI bus at the same time. SCSI bus problems are likely to occur if this is done. However, it is acceptable to use an internal connector for one SCSI bus and an external connector for another SCSI bus. The internal and external connectors are labeled to indicate which SCSI bus they correspond to.Attention: RAID adapters should not be replaced when SCSI bus problems exist, except with assistance from your service support structure. Because the adapter may contain non-volatile write cache data and configuration data for the attached disk arrays, additional problems can be created by replacing an adapter when SCSI bus problems exist. Attention: Do not remove functioning disks in a disk array without assistance from your service support structure. A disk array may become degraded or failed if functioning disks are removed, and additional problems may be created. Follow the steps in this MAP to isolate a PCI-X SCSI bus problem. v Step 0054-1 Identify the SCSI bus on which the problem is occurring on by examining the hardware error log. To view the hardware error log, do the following: 1. Invoke diagnostics and select Task Selection on the Function Selection screen. 2. Select Display Hardware Error Report. 3. Choose one of the following options: – If the type of adapter is not known, select Display Hardware Errors for Any Resource. – If the adapter is a PCI-X SCSI adapter, select Display Hardware Errors for PCI-X SCSI Adapters. – If the adapter is a PCI-X SCSI RAID adapter, select Display Hardware Errors for PCI-X SCSI RAID Adapters.4. Select the resource, or select All Resources if the resource is not known. If you had previously selected Display Hardware Errors for Any Resource, then select All Resources. 5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent you here, and select it. 276 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Note: If multiple entries exist for the SRN, some entries might be old or the problem has occurred on multiple entities (adapters, disk arrays, or devices). Older entries can be ignored; however, this MAP may need to be used multiple times if the same problem has occurred on multiple entities. 6. Select the hardware error log to view. While viewing the hardware error log, under the Detail Data and SENSE DATA headings, identify the first four bytes of the hexadecimal data (for example, nnnn nnnn nnnn nnnn ...). The four bytes identified in the error log can be interpreted as: 00bb ssLL where: – bb, when not FF, identifies the adapter’s SCSI bus – ss, when not FF, identifies the SCSI ID of a device – LL, when not FF, identifies the logical unit number (LUN) of a device Go to Step 0540-2. v Step 0054-2 Are the last two bytes of the four bytes identified in Step 0540-1, equal to FFFF (for example, 00bb FFFF, where bb identifies the adapter’s SCSI bus)? NO Go to Step 0540-4. YES Go to Step 0540-3.v Step 0054-3 While the error persists, replace the components of the failing SCSI bus in the following order: 1. Cable on bus bb (if present) 2. Adapter (if SCSI bus interface logic is on the adapter) or system board (if SCSI bus interface logic is on the system board) To replace a component, and see if the problem was corrected, do the following: 1. Follow the removal and replacement procedure for the component as previously described in this step. 2. Run diagnostics in system verification mode on the adapter. When the problem is resolved, go to Chapter 5, “Verifying a repair,” on page 557. v Step 0054-4 Are the last two bytes of the four bytes identified in Step 0540-1, equal to FF00 (for example, 00bb FF00, where bb identifies the adapter’s SCSI bus)? NO Go to Step 0540-6. YES Go to Step 0540-5.v Step 0054-5 While the error persists, replace the components of the failing SCSI bus in the following order: 1. Cable on bus bb (if present) 2. Adapter (if SCSI bus interface logic is on the adapter) or system board (if the SCSI bus interface logic is on the system board) 3. DASD backplane attached to bus bb (if present) To replace a component, and see if the problem was corrected, do the following: 1. Follow the removal and replacement procedure for the component as previously described in this step. 2. Run diagnostics in system verification mode on the adapter. When the problem is resolved, go to Chapter 5, “Verifying a repair,” on page 557. v Step 0054-6 Chapter 2. Isolation Procedures 277 While the error persists, replace the components of the failing SCSI bus in the following order: 1. Device on bus bb with SCSI ID ss 2. Cable on bus bb (if present) 3. Adapter (if SCSI bus interface logic is on the adapter) or system board (if SCSI bus interface logic is on the system board) To replace a component and see if the problem was corrected, do the following: 1. Follow the removal and replacement procedure for the component as previously described in this step. 2. Run diagnostics in system verification mode on the adapter. When the problem is resolved, go to Chapter 5, “Verifying a repair,” on page 557.MAP 0070: 888 sequence in operator panel display Use this MAP when you receive an 888 sequence on the operator panel display or monitor. Purpose of this MAP An 888 sequence in operator panel display suggests that either a hardware or software problem has been detected and a diagnostic message is ready to be read. Note: The 888 will not necessarily flash on the operator panel display. v Step 0070-1 Perform the following steps to record the information contained in the 888 sequence message. 1. Wait until the 888 sequence displays. 2. Record, in sequence, every code displayed after the 888. On systems with a 3-digit or a 4-digit operator panel, you may need to press the system’s ?reset? button to view the additional digits after the 888. Stop recording when the 888 digits reappear. 3. Go to Step 0070-2.v Step 0070-2 Using the first code that you recorded, use the following list to determine the next step to use. Type 102 Go to Step 0070-3. Type 103 Go to Step 0070-4.v Step 0070-3 A type 102 message generates when a software or hardware error occurs during system execution of an application. Use the following information to determine the content of the type 102 message. The message readout sequence is: 102 = Message type RRR = Crash code (the three-digit code that immediately follows the 102) SSS = Dump status code (the three-digit code that immediately follows the Crash code). Record the crash code and the dump status code from the message you recorded in Step 0070-1. Are there additional codes following the dump status? No Go to Step 0070-5. Yes The message also has a type 103 message included in it. Go to Step 0070-4 to decipher the SRN and field replaceable unit (FRU) information in the type 103 message.Note: Type 102 messages have no associated SRNs. v Step 0070-4 278 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) A type 103 message is generated by the hardware when certain types of hardware errors are detected. Use the following steps and information you recorded in Step 0070-1 to determine the content of the type 103 message. The message readout sequence is: 103 = Message type (x)xxx (y)yyy = SRN (where (x)xxx = the three- or four-digit code following the 103 and (y)yyy is the three- or four-digit code following the (x)xxx code). 1. Record the SRN and FRU location codes from the recorded message. 2. Find the SRN in the Service Request Number List and perform the indicated action.Note: The only way to recover from an 888 type of halt is to turn off the system unit. v Step 0070-5 Perform the following steps: 1. Turn off the system unit power. 2. Turn on the system unit power, and load the online diagnostics in service mode. 3. Wait until one of the following conditions occurs: – You are able to load the diagnostics to the point where the Diagnostic Mode Selection menu displays. – The system stops with an 888 sequence. – The system appears to hang. Is the Diagnostic Mode Selection menu displayed? No Go to Chapter 1, “Starting a Service Call,” on page 1. Yes Go to Step 0070-6.v Step 0070-6 Run the All Resources options under Advanced Diagnostics in Problem Determination Mode. Was an SRN reported by the diagnostics? No This is possibly a software-related 888 sequence. Follow the procedure for reporting a software problem. Yes Record the SRN and its location code information. Find the SRN in the SRN Listing and do the indicated action.MAP 0220: Hot-Swap FRU problem resolution Use this procedure to exchange hot-swappable field replaceable units (FRUs). Note: The FRU you want to hot plug might have a defect on it that can cause the hot-plug operation to fail. If, after following the hot-plug procedure, you continue to get an error message that indicates that the hot-plug operation has failed, schedule a time for deferred maintenance when the system containing the FRU can be powered off. Then go to MAP 0210: General problem resolution, Step 0210-2 and answer NO to the question Do you want to exchange this FRU as a hot-plug FRU? Attention: If the FRU is a disk drive or an adapter, ask the system administrator to perform the steps necessary to prepare the device for removal. v Step 0220-1 1. If the system displayed a FRU part number on the screen, use that part number to exchange the FRU. If there is no FRU part number displayed on the screen, refer to the SRN listing. Record the SRN source code and the failing function codes in the order listed. 2. Find the failing function codes in the FFC listing, and record the FRU part number and description of each FRU. Chapter 2. Isolation Procedures 279 3. To determine if the part is hot-swappable, refer to the Chapter 3, “Locating FRUs,” on page 335 procedure for the part. Does this system unit support hot-swapping of the first FRU listed? NO Go to “MAP 0210: General problem resolution” on page 246. YES Go to Step 0220-2.v Step 0220-2 Is the FRU a hot-swap power supply or fan? NO Go to Step 0220-4. YES Go to Step 0220-3.v Step 0220-3 Note: Refer to Chapter 3, “Locating FRUs,” on page 335 for the part. 1. Remove the old FRU. 2. Install the new FRU. 3. Enter the diag command. Go to Step 0220-14. v Step 0220-4 Is the FRU a hot-plug PCI adapter? NO Go to Step 0220-5. YES Go to Step 0220-12.v Step 0220-5 Is the FRU a SCSI hot-plug device? NO Go to Step 0220-11. YES Go to Step 0220-6.v Step 0220-6 Is the hot-plug drive located within a system unit? NO Go to Step 0220-8. YES Go to Step 0220-7.v Step 0220-7 Refer to the removal and replacement procedures for your system in Chapter 3, “Locating FRUs,” on page 335. Go to Step 0220-13. v Step 0220-8 Does the hot-plug drive’s enclosure have procedures for removing and replacing SCSI disk drives? NO Go to Step 0220-9. YES If a hot-plug procedure exists, use that procedure to remove the old hot-plug SCSI disk drive and replace it with a new hot-plug SCSI disk drive. Otherwise, if no hot-plug procedure exists, use the power off procedure to remove the old SCSI disk drive and replace it with a new SCSI disk drive. Go to Step 0220-13.v Step 0220-9 1. Ask the customer to back up the data on the drive that you intend to replace onto another drive. 2. Verify that the disk drive is in the defined state. The amber LED on the hot-plug disk drive should be off. Is the hot-plug disk drive’s amber LED unlit? 280 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) NO Ask the customer to remove the hot-plug disk drive from the operating system configuration (refer the customer to the system management guide for more information). YES Go to Step 0220-10.v Step 0220-10 Using the hot-plug task service aid, replace the hot-plug drive using the following procedure: 1. Use the option List the SES Devices to show the configuration of the hot-plug slots. Identify the slot number of the adapter for the FRU you want to replace. 2. Select the option REPLACE/REMOVE a Device Attached to an SES Device. 3. Select the slot which contains the SCSI hot-plug drive you want to replace. Press Enter. You will see a fast blinking green light on the front on the hot-plug drive indicating that it is ready for removal. Note: Refer to the ?Installing hardware? section of the information; locate the server information that you are servicing and follow the tables to locate the correct removal or replacement procedure. 4. Remove the old hot-plug drive. 5. Install the new hot-plug drive. After the hot-plug drive is in place, press Enter. 6. Press Exit. Wait while configuration is done on the drive, until you see the ?hot-plug task? on the service aid menu. Go to Step 0220-15. v Step 0220-11 Attention: Do not remove functioning disks in a disk array attached to a PCI-X SCSI RAID controller without assistance from your service support structure. A disk array may become degraded or failed if functioning disks are removed and additional problems may be created. If you still need to remove a RAID array disk attached to a PCI-X SCSI RAID controller, use the SCSI and SCSI RAID hot-plug manager. Using the hot-plug task service aid, replace the hot-plug drive using the hot plug RAID service aid: Note: The drive you want to replace must be either a SPARE or FAILED drive. Otherwise, the drive would not be listed as an ?Identify and remove resources selection? within the RAID HOT-PLUG DEVICES screen. In that case you must ask the customer to put the drive into FAILED state. Refer the customer to the Operating System and Device management in the AIX library for more information. Ask the customer to back up the data on the drive that you intend to replace. 1. Select the option RAID HOT-PLUG DEVICES within the HOT-PLUG TASK under DIAGNOSTIC SERVICE AIDS. 2. Select the RAID adapter that is connected to the RAID array containing the RAID drive you want to remove, then select COMMIT. 3. Choose the option IDENTIFY in the IDENTIFY AND REMOVE RESOURCES menu. 4. Select the physical disk which you want to remove from the RAID array and press Enter. 5. The disk will go into the IDENTIFY state, indicated by a flashing light on the drive. Verify that it is the physical drive you want to remove, then press Enter. 6. At the IDENTIFY AND REMOVE RESOURCES menu, choose the option REMOVE and press Enter. 7. A list of the physical disks in the system that may be removed will be displayed. If the physical disk you want to remove is listed, select it and press Enter. The physical disk will go into the REMOVE state, as indicated by the LED on the drive. If the physical disk you want to remove is not listed, it is not a SPARE or FAILED drive. Ask the customer to put the drive in the FAILED state before you can proceed to remove it. Refer the customer to the Operating System and Device management in the AIX library for more information. 8. Refer to the service guide for the system unit or enclosure that contains the physical drive for removal and replacement procedures for the following substeps: a. Remove the old hot-plug RAID drive. Chapter 2. Isolation Procedures 281 b. Install the new hot-plug RAID drive. After the hot-plug drive is in place, press Enter. The drive will exit the REMOVE state, and will go to the NORMAL state after you exit diagnostics. Note: There are no elective tests to run on a RAID drive itself under diagnostics (the drives are tested by the RAID adapter).9. This completes the repair. Return the system to the customer. Ask the customer to add the physical disk drive to the original configuration within the RAID. Refer them to system management guide for more information.v Step 0220-12 1. Remove the old adapter FRU and replace it with the new adapter FRU. Refer to the Chapter 3, “Locating FRUs,” on page 335 procedure for the part. 2. Enter the diag command. 3. Go to the FUNCTION SELECTION menu, and select the option Advanced Diagnostics Routines. 4. When the DIAGNOSTIC MODE SELECTION menu displays, select the option System Verification. 5. Go to Step 0220-14.v Step 0220-13 1. If not already running diagnostics, enter the diag command. Note: If you are already running service mode diagnostics and have just performed the task Configure Added/Replaced Devices (under the SCSI Hot Swap manager of the Hot Plug Task service aid), you must use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS menu before proceeding with the next step, or else the drive might not appear on the resource list. 2. Go to the FUNCTION SELECTION menu, and select the option Advanced Diagnostics Routines. 3. When the DIAGNOSTIC MODE SELECTION menu displays, select the option System Verification. Does the hot-plug SCSI disk drive you just replaced appear on the resource list? NO Verify that you have correctly followed the procedures for replacing hot-plug SCSI disk drives in the system service guide. If the disk drive still does not appear in the resource list, go to “MAP 0210: General problem resolution” on page 246 to replace the resource that the hot-plug SCSI disk drive is plugged into. YES Go to Step 0220-14.v Step 0220-14 Run the diagnostic test on the FRU you just replaced. Did the diagnostics run with no trouble found? NO Go to Step 0220-15. YES Go to Chapter 5, “Verifying a repair,” on page 557. Before returning the system to the customer, if a hot-plug disk has been removed, ask the customer to add the hot-plug disk drive to the operating system configuration. Refer to Operating System and Device Management in the AIX library for more information.v Step 0220-15 1. Use the option Log Repair Action in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with your repair action. If it is not displayed on the resource list, select sysplanar0. Note: On systems with a fault indicator LED, this changes the fault indicator LED from the fault state to the normal state. 2. While in diagnostics, go to the FUNCTION SELECTION menu. Select the option Advanced Diagnostics Routines. 282 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. When the DIAGNOSTIC MODE SELECTION menu displays, select the optionSystem Verification. Run the diagnostic test on the FRU you just replaced, or sysplanar0. Did the diagnostics run with no trouble found? NO Go to Step 0220-16. YES If you changed the service processor or network settings, restore the settings to the value they had prior to servicing the system. If you performed service on a PCI RAID subsystem involving changing of the RAID adapter cache card or changing the configuration on RAID disks, ask the customer to run ?PCI SCSI disk array manager? using smitty to resolve the PCI SCSI RAID adapter configuration. The following is an example of how the customer would resolve the configuration: 1. At the AIX command line, type smitty pdam. 2. On the PCI SCSI Disk Array Manager screen, select RECOVERY OPTIONS. 3. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI RAID Adapter Configuration. Press F3. 4. On the Recovery Options screen, select RESOLVE PCI SCSI RAID ADAPTER CONFIGURATION. 5. On the Resolve PCI SCSI RAID Adapter Configuration screen, select ACCEPT CONFIGURATION on DRIVES. 6. On the PCI SCSI RAID Adapter selection menu, select the adapter that you changed. 7. On the next screen, press Enter. 8. On the ?Are You Sure?? selection menu, press Enter to continue. 9. You receive an OK status message when the recovery is complete. If you get a Failed status message, verify that you are preforming recovery on the correct adapter, then do this complete procedure. When you complete the recovery, exit smitty to return to the AIX command line.Go to Chapter 5, “Verifying a repair,” on page 557. v Step 0220-16 Does the original problem persist? NO If a FRU was replaced, run the log repair action service aid under the online diagnostics for the resource that was replaced. If the resource associated with your action is not displayed on the resource list, select sysplanar0. If steps were taken to make the device ready for removal, inform the system administrator of the steps required to return the system to the original state. Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0220-17.v Step 0220-17 Have you exchanged all the FRUs that correspond to the failing function codes? NO Go to Step 0220-18. YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in service mode. If the same SRN is reported in service mode, go to “MAP 0030: Additional Problem Determination” on page 266.v Step 0220-18 Note: Before proceeding, remove the FRU you just replaced and install the original FRU in its place. Does the system unit support hot-swapping of the next FRU listed? NO Go to “MAP 0210: General problem resolution” on page 246. YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in service mode. If the same SRN is reported in service mode, go to Step 0220-14. Chapter 2. Isolation Procedures 283 MAP 0230: Platform error problem resolution Use this MAP to resolve problems reported by SRNs A00-xxx to A25-xxxx. Step 0230-1 1. The last character of the SRN is bit-encoded as follows:8 4 2 1 | | | | | | | Replace all FRUs listed | | Hot-swap is supported | Software or Firmware could be the cause Reserved 2. Refer to the last character in the SRN. A 4, 5, 6, or 7 indicates a possible software or firmware problem. Does the last character indicate a possible software or firmware problem? NO Go to Step 0230-4YES Go to Step 0230-2. Step 0230-2 Ask the customer if any software or firmware has been installed recently. Has any software or firmware been installed recently? NO Go to Step 0230-4. YES Go to Step 0230-3. Step 0230-3 Suspect the new software or firmware. Check with your support center for any known problems with the new software or firmware. Are there any known problems with the software or firmware? NO Go to Step 0230-4. YES Obtain and follow the procedure to correct the software problem. This completes the repair.Step 0230-4 Were any FRUs or location code reported with the SRN? NO Go to Step 0230-5. YES Go to Step 0230-9 Step 0230-5 Run the diagnostics in problem determination mode on sysplanar0. Were there any FRUs reported with the SRN? NO Go to Step 0230-6. YES Go to Step 0230-9 284 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Step 0230-6 Did the system display: ?Previous Diagnostic Results - Do you want to review the previously displayed error?? NO Go to Step 0230-7. YES You have a pending item in the error log for which there is no corresponding log repair action. To see this error, select YES at the prompt. Information from the error log displays in order of last event first. Record the error code, the FRU names and the location code of the FRUs. Go to Step 0230-7 Step 0230-7 Were there any other SRNs that begin with an A00 to A1F reported? NO Go to Step 0230-8. YES Go to Step 0230-1 and use the new SRN.Step 0230-8 Attempt to boot the system on slow boot mode. If the system boots, run the diagnostics in problem determination mode on sysplanar0 Were any new error codes or SRNs reported? NO Call your support center. YES Follow the procedure for the new error code or SRN. Step 0230-9 1. Obtain the list of physical location codes and FRU numbers that were listed on the Problem Report Screen. The list can be obtained by running the sysplanar0 diagnostics or using the task Display Previous Diagnostic Results. 2. Record the physical location codes and FRU numbers. 3. Refer to the last character in the SRN. A 2, 3, 6, or 7 indicates that hot-swap is possible. Does the last character indicate that hot-swap is possible? NO Go to Step 0230-10. YES Go to Step 0230-14 Step 0230-10 Note: If necessary, refer to “Powering on and powering off” on page 588 for information on system shutdown and powering the system on and off. 1. If the operating system is running, perform the operating system’s shutdown procedure. 2. Turn off power to the system. 3. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen need to be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed.Note: Use the appropriate procedure under Chapter 4, “Removing and replacing FRUs,” on page 349 to replace any FRUs indicated. 4. Turn on power to the system. Chapter 2. Isolation Procedures 285 5. Load “Running the online diagnostics in service mode” on page 673.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the option Task Selection , check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding 11. Select the option Task Selection. 12. Select the option Log Repair Action. 13. Log a repair action for each replaced resource. 14. If the resource associated with your repair action is not displayed on the resource list, select sysplanar0. 15. Return to the Task Selection Menu. 16. If the FRU that was replaced was memory and the system is running as a full system partition, select Run Exercisers and run the short exerciser on all the resources, otherwise proceed to Step 0230-15. 17. If you ran the exercisers in Step 0230-10, substep 16, return to the Task Selection menu. 18. Select Run Error Log Analysis and run analysis on all the resources. Was a problem reported? NO The repair is complete. Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0230-11. Step 0230-11 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. YES Go to Step 0230-12. Step 0230-12 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to Step 0230-13. YES The SRN did not identify the failing FRU. Call your support person for assistance. Step 0230-13 1. After performing a shutdown of the operating system, turn off power to the system. 2. Remove the new FRU and install the original FRU. 3. Exchange the next FRU in list. 4. Turn on power to the system. 286 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Load “Running the online diagnostics in service mode” on page 673.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the option Task Selection, check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding. 11. Select the option Task Selection. 12. Select the option Log Repair Action. 13. Log a repair action for each replaced resource. 14. If the resource associated with your action does not appear on the Resource List, select sysplanar0. 15. Return to the Task Selection Menu. 16. If the FRU that was replaced was memory and the system is running as a full system partition, select Run Exercisers and run the short exerciser on all the resources, otherwise proceed to Step 0230-15. 17. If you ran the exercisers in Step 0230-13, substep 16, return to the Task Selection menu. 18. Select Run Error Log Analysis and run analysis on all the resources. Was a problem reported? NO The repair is complete. Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0230-11. Step 0230-14 The FRUs can be hot-swapped. If you do not want to use the hot-swap, go to Step 0230-10.Note: See the hot-swap procedures in the Chapter 4, “Removing and replacing FRUs,” on page 349 section. 1. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen must be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed. 2. If available, use the CE Login and enter the diag command.Note: If CE Login is not available, have the system administrator enter superuser mode and then enter the diag command. 3. After the Diagnostics Operating Instructions display, press Enter. 4. Select the option Task Selection. 5. Select the option Log Repair Action. 6. If the resource associated with your action is not displayed on the Resource List, select sysplanar0. 7. Log a repair action for each replaced resource. 8. Return to the Task Selection menu. 9. For systems running as a full system partition, select Run Exercisers and run the short exerciser on all resources. Chapter 2. Isolation Procedures 287 10. Use the option Log Repair Action in the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with your repair action. If it is not displayed on the resource list, select sysplanar0.Note: On systems with a Fault Indicator LED, this changes the Fault Indicator LED from the fault state to the normal state.Was a problem reported? NO The repair is complete. Return the system to the customer. YES Go to Step 0230-15. Step 0230-15 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. YES Go to Step 0230-16. Step 0230-16 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to Step 0230-17. YES The SRN did not identify the failing FRU. Call your support person for assistance. Step 0230-17 1. Remove the new FRU and install the original FRU. 2. Exchange the next FRU in the list. 3. Return to the Task Selection Menu. 4. Select the option Log Repair Action. 5. Log a repair action for each replaced resource. 6. If the resource associated with your action is not displayed on the Resource List, select sysplanar0. 7. Return to the Task Selection Menu. 8. For systems running as a full system partition, select Run Exercisers and run the short exerciser on all resources. 9. If you ran the exercisers in substep Step 0230-17, substep 8. 10. Select Run Error Log Analysis and run analysis on all exchanged resources. Was a problem reported? NO The repair is complete. Return the system to the customer. YES Go to Step 0230-15.MAP 0260: System hangs during resource configuration Use this MAP when the system unit hangs while configuring a resource. 288 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Purpose of this MAP This MAP handles problems when the system unit hangs while configuring a resource. v Step 0260-1 The last three or four digits of the SRN following the dash (-) match a failing function code number. Look at the Chapter 8, “Part Information,” on page 691 and find the failing function code that matches the last three or four digits of your SRN, following the dash. Record the FRU part number and description (use the first FRU part listed when multiple FRUs are listed). The physical location code, AIX location code, or device name is displayed on the operator panel. Do you have either a physical location code or AIX location code displayed? No Go to Step 0260-4. Yes Go to Step 0260-2.v Step 0260-2 Are there any FRUs attached to the device described by the physical location code or AIX location code? No Go to Step 0260-6. Yes Go to Step 0260-3.v Step 0260-3 Remove this kind of FRU attached to the device described in the location code one at a time. Note whether the system still hangs after each device is removed. Repeat this step until you no longer get a hang, or all attached FRUS have been removed from the adapter or device. Has the symptom changed? No Go to Step 0260-4. Yes Use the location code of the attached device that you removed when the symptom changed, and go to Step 0260-6.v Step 0260-4 Does your system unit contain only one of this kind of FRU? No Go to Step 0260-5. Yes Go to Step 0260-6.v Step 0260-5 One of the FRUs of this kind is defective. Remove this kind of FRU one at a time. Test the system unit after each FRU is removed. Stop when the test completes successfully or when you have removed all of the FRUs of this kind. Were you able to identify a failing FRU? No Go to “PFW1540: Problem isolation procedures” on page 301. Yes Go to Step 0260-6.v Step 0260-6 1. Turn off the system unit. 2. Exchange the FRU identified by the location code or Step 0260-5. Is this system capable of running online diagnostics in service mode? No Go to Step 0260-7. Yes Go to Step 0260-8.v Step 0260-7 1. Turn on the system unit. Chapter 2. Isolation Procedures 289 2. Load the standalone diagnostics and refer to “Running the online and eServer stand-alone diagnostics” on page 672. 3. Wait until the Diagnostic Operating Instructions display or the system appears to have stopped. Are the DIAGNOSTIC OPERATING INSTRUCTIONS displayed? NO Go to Step 0260-9. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0260-8 1. Turn on the system unit. 2. Load the standalone diagnostics and refer to “Running the online and eServer stand-alone diagnostics” on page 672. 3. Wait until the Diagnostic Operating Instructions display or the system appears to have stopped. Are the DIAGNOSTIC OPERATING INSTRUCTIONS displayed? NO Go to Step 0260-9. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0260-9 Look at the operator panel display. Is the number displayed the same as the last three or four digits after the dash (-) of your SRN? NO The symptom changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 0020: Problem determination procedure” on page 259 and get a new SRN. YES Go to Step 0260-10.v Step 0260-10 Was the FRU you exchanged an adapter or a planar? NO Go to Step 0260-11. YES Go to “PFW1540: Problem isolation procedures” on page 301.v Step 0260-11 Was the FRU you exchanged a device? NO Go to “PFW1540: Problem isolation procedures” on page 301. YES Go to Step 0260-12.v Step 0260-12 The adapter for the device may be causing the problem. 1. Turn off the system unit. 2. Exchange the adapter for the device. Note: If the AIX operating system is not used on the system, start diagnostics from an alternate source. 3. Turn on the system unit. If c31 is displayed, follow the instructions to select a console display. 4. Load the standalone diagnostics and refer to “Running the online and eServer stand-alone diagnostics” on page 672. 5. Wait until the DIAGNOSTIC OPERATING INSTRUCTIONS display or the system appears to have stopped. Are the DIAGNOSTIC OPERATING INSTRUCTIONS displayed? NO Go to “PFW1540: Problem isolation procedures” on page 301. YES Go to Chapter 5, “Verifying a repair,” on page 557. 290 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) MAP 0270: SCSI RAID problem resolution and verification Use this MAP to resolve SCSI RAID adapter, cache, or drive problems. Purpose of This MAP Use this MAP to resolve SCSI RAID adapter, cache, or drive problems. Notes: 1. This MAP assumes that the RAID adapter and drive microcode is at the correct level. 2. This MAP applies only to PCI, not PCI-X, RAID adapters.Attention: If the FRU is a disk drive or an adapter, ask the system administrator to perform the steps necessary to prepare the device for removal. v Step 0270-1 1. If the system displayed a FRU part number on the screen, use that part number. If there is no FRU part number displayed on the screen, refer to the SRN listing. Record the SRN source code and the failing function codes in the order listed. 2. Find the failing function codes in the FFC listing, and record the FRU part number and description of each FRU. Go to Step 0270-2. v Step 0270-2 Is the FRU a RAID drive? NO Go to Step 0270-6. YES Go to Step 0270-3.v Step 0270-3 If the RAID drive you want to replace is not already in the failed state, then ask the customer to run the PCI SCSI Disk Array Manager using smit to fail the drive that you want to replace. An example of this procedure is: 1. Log in as root user. 2. Type smit pdam. 3. Select Fail a Drive in a PCI SCSI Disk Array. 4. Select the appropriate disk array by placing the cursor over that array and press Enter. 5. Select the appropriate drive to fail based on the Channel and ID called out in diagnostics. The Fail a Drive screen will appear. 6. Verify that you are failing the correct drive by looking at the Channel ID row. Press Enter when verified correct. Press Enter again. 7. Press F10 and type smit pdam 8. Select Change/Show PCI SCSI RAID Drive Status ? Remove a Failed Drive. 9. Select the drive that just failed. Go to Step 0270-4. v Step 0270-4 Replace the RAID drive using the RAID HOT PLUG DEVICES service aid: Note: The drive you want to replace must be either a SPARE or FAILED drive. Otherwise, the drive would not be listed as an IDENTIFY AND REMOVE RESOURCES selection within the RAID HOT PLUG DEVICES screen. In that case you must ask the customer to put the drive into FAILED state. For information on putting the drive in a FAILED state, refer the customer to the PCI Dual Channel Ultra320 SCSI RAID Adapter Installation and Using Guide, order number SC23-1324. Chapter 2. Isolation Procedures 291 1. Select the option RAID HOT PLUG DEVICES within the HOT PLUG TASK under DIAGNOSTIC SERVICE AIDS. 2. Select the RAID adapter that is connected to the RAID array containing the RAID drive you want to remove, then select COMMIT. 3. Choose the option IDENTIFY in the IDENTIFY AND REMOVE RESOURCES menu. 4. Select the physical disk which you want to remove from the RAID array and press Enter. The disk will go into the IDENTIFY state, indicated by a flashing light on the drive. 5. Verify that it is the physical drive you want to remove, then press Enter. 6. At the IDENTIFY AND REMOVE RESOURCES menu, choose the option REMOVE and press Enter. A list of the physical disks in the system that may be removed will be displayed. 7. If the physical disk you want to remove is listed, select it and press Enter. The physical disk will go into the REMOVE state, as indicted by the LED on the drive. If the physical disk you want to remove is not listed, it is not a SPARE or FAILED drive. Ask the customer to put the drive in the FAILED state before you can proceed to remove it. For information on putting the drive in a FAILED state, refer the customer to the PCI Dual Channel Ultra320 SCSI RAID Adapter Installation and Using Guide, order number SC23-1324. 8. Refer to the service guide for the system unit or enclosure that contains the physical drive for removal and replacement procedures for the following substeps: a. Remove the old hot-swap RAID drive. b. Install the new hot-swap RAID drive. After the hot-swap drive is in place, press Enter. The drive will exit the REMOVE state, and will go to the NORMAL state after you exit diagnostics. Note: There are no elective tests to run on a RAID drive itself under diagnostics (the drives are tested by the RAID adapter).Go to Step 0270-5. v Step 0270-5 If the RAID did not begin reconstructing automatically, perform the following steps. Adding a Disk to the RAID array and Reconstructing: Ask the customer to run the PCI SCSI Disk Array Manager using smit. An example of this procedure is: 1. Log in as root user. 2. Type smit pdam. 3. Select Change/Show PCI SCSI RAID Drive Status. 4. Select Add a Spare Drive. 5. Select the appropriate adapter. 6. Select the channel and ID of the drive that was replaced. 7. Press Enter when verified. 8. Press F3 until you return to the Change/Show PCI SCSI RAID Drive Status screen. 9. Select Add a Hot Spare. 10. Select the drive you just added as a spare. If there was no hot spare previously installed in the array, the array will begin reconstructing immediately. Reconstruction time will vary based on the size of the RAID array. Allow 1-2 hours for completion. To check the progress of the reconstruction: 1. Log in as root user. 2. Type smit pdam. 3. Select List PCI SCSI RAID Arrays. 4. Choose the array containing the drive you replaced. 292 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) If the state of the RAID array is reconstructing, then it is in process of reconstructing. If it is optimal, then reconstruction has completed. 5. Press F10 to exit. Go to Step 027017. v Step 0270-6 Is the FRU a RAID adapter base card, RAID adapter cache card, or RAID adapter battery? NO Go to Step 0270-15. YES Go to Step 0270-7.v Step 0270-7 Do you want to change the FRU using a hot-swap operation? NO Power off the system, and remove the RAID adapter. Go to Step 0270-8. YES Remove the RAID adapter. Go to Step 0270-8.v Step 0270-8 Is the FRU you want to replace a RAID adapter cache card or RAID adapter battery? NO Go to Step 0270-10. YES Go to Step 0270-9.v Step 0270-9 Replace the FRU onto the existing base card. Go to Step 0270-11. v Step 0270-10 After physically removing the base card from the system, remove any other good FRUs (RAID cache card or cache battery) from the RAID base card adapter. Plug these FRUs on to the replacement RAID base card adapter FRU. Go to Step 0270-11. v Step 0270-11 Did you change the FRU using a hot-swap operation? NO Install the RAID adapter assembly into the system. Power on the system and log in to AIX. Go to Step 0270-12. YES Install the RAID adapter assembly into the system. Go to Step 0270-12.v Step 0270-12 Was the replacement FRU a RAID base card? NO Go to Step 0270-14. YES Go to Step 0270-13.v Step 0270-13 Attention: Prior to cabling the SCSI RAID adapter to the subsystem, check for preexisting configurations on the replacement SCSI RAID base card. The replacement base card can overwrite your system’s configuration data if it already has a configuration written to it. Check it before cabling the SCSI RAID subsystem array. Ask to customer to check for preexisting configuration on the SCSI RAID base card. Below is an example of this procedure: 1. Log in as root (if not already root). 2. Type smit pdam. 3. Select List PCI SCSI RAID Arrays. 4. If no RAID arrays are listed, then there are no preexisting configurations on the base card. Chapter 2. Isolation Procedures 293 5. Press F10 key to exit. If a preexisting configuration exists on the base card, ask the customer to run the PCI SCSI Disk Array Manager using smitty. 1. Log in as root (if not already root). 2. Type smit pdam from the AIX command prompt (if not already in the RAID manager). 3. Select Recovery Options. 4. Select Clear PCI SCSI RAID Adapter Configuration. Select the adapter that you just installed. Press Enter to confirm. 5. Return to the Recovery Options menu (if not already there). Select Resolve PCI SCSI RAID Adapter Configuration. Select Accept Configuration on Drives. Select the adapter that you just installed. Press Enter to confirm. The configuration on the new adapter should now match the configuration existent on the drives. 6. Press F10 to exit. You may now proceed to cable the RAID system array. Go to Step 0270-16. v Step 0270-14 Ask the customer to resynchronize the RAID array configuration: 1. Log in as root (if not already root). 2. Type smit pdam. 3. Select Recovery Options. 4. Select Resolve PCI SCSI RAID Adapter Configuration. 5. Select Retry Current Configuration. 6. Select the appropriate scraid (SCSI RAID) adapter. A message will be displayed as to the success of the operation. 7. Press F10 to exit. Go to Step 0270-16. v Step 0270-15 Other RAID FRUs require that the system be shut down prior to replacement. 1. If the operating system is running, perform the operating system shutdown procedure (get help if needed). 2. Turn off the system power. 3. Replace the FRU indicated by the FFC. Go to Step 0270-16. v Step 0270-16 Run the diagnostics in system verification mode on the RAID subsystem. Did the diagnostics run with no trouble found? NO Go to Step 0270-18. YES Go to Step 0270-17.v Step 0270-17 1. Use the option Log Repair Action in the TASK SELECTION menu to update the AIX error log. Select scraidX (where X is the RAID adapter number of the RAID subsystem you have been working on). Note: On systems with fault indicator LED, this changes the fault indicator LED from the Fault state to the Normal state. 2. While in diagnostics, go to the FUNCTION SELECTION menu. Select the option Advanced Diagnostics Routines. 294 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. When the DIAGNOSTIC MODE SELECTION menu displays, select the option System Verification. Run the diagnostic test on scraidX (where X is the RAID adapter number). Did the diagnostics run with no trouble found? NO Go to the Step 0270-18. YES If you changed the service processor or network settings, restore the settings to the value they had prior to servicing the system.This completes the repair; return the system to the customer. Go to Chapter 6, “Closing a service call,” on page 565. v Step 0270-18 Have you exchanged all the FRUs that correspond to the failing function codes? NO Go to Step 0270-19. YES The SRN did not identify the failing FRU. Schedule a time to run diagnostics in service mode. If the same SRN is reported in service mode, go to “MAP 0030: Additional Problem Determination” on page 266.v Step 0270-19 Note: Note: Before proceeding, remove the FRU you just replaced and install the original FRU in its place. Use the next FRU on the list and go to Step 0270-2.MAP 0280: Console and Keyboard Problem Resolution Use this MAP to resolve console and keyboard problems when the system is booting. Purpose of This MAP Use this MAP to resolve console and keyboard problems when the system is booting. For other boot problems and concerns, go to “Problems with loading and starting the operating system (AIX and Linux)” on page 246. Entry Table Keyboard problem Go to Step 0280-1. Graphic adapter problem Go to Step 0280-2. Terminal problem Go to Step 0280-3. v Step 0280-1 The system fails to respond to keyboard entries. This problem is most likely caused by a faulty keyboard, keyboard adapter, or keyboard cable. Try the FRUs in the following order. Test each FRU by retrying the failing operation. 1. Keyboard. 2. Keyboard adapter, typically located on the system board. 3. Keyboard cable, if not included with the keyboard. Were you able to resolve the problem? NO Go to “MAP 0030: Additional Problem Determination” on page 266. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0280-2 1. Some systems have a graphic adapter POST. If a graphic adapter POST is supported and it indicates a failure, follow the procedures in the system guide to resolve the problem. Chapter 2. Isolation Procedures 295 2. If a graphic adapter POST is supported and it does not indicate a failure, suspect the display or display cable. 3. If the system does not have a graphic adapter POST, go to the display’s problem determination procedures. If you do not find a problem, replace the graphics adapter. Were you able to resolve the problem? NO Go to “MAP 0030: Additional Problem Determination” on page 266. YES Go to Chapter 5, “Verifying a repair,” on page 557.v Step 0280-3 Go to the problem determination procedure for the terminal. If you do not find a problem, suspect the serial port adapter or terminal cable. Were you able to resolve the problem? NO Call your support person. YES Go to Chapter 5, “Verifying a repair,” on page 557.MAP 0285: Multipath I/O (MPIO) Problem Resolution Use this MAP to handle SRN A23-001 and ssss-640 (where ssss is the 3 or 4 digit Failing Function Code (FFC) of an SCSD drive) to check the path from adapter to device. Purpose of this MAP Use this MAP to handle SRN A23-001 and ssss-640 (where ssss is the 3 or 4 digit FFC of an SCSD drive) to check the path from adapter to device. Note: Not all devices support MPIO. Before proceeding with this MAP, make sure that the devices on both ends of the missing path support MPIO. v Step 0285-1 Look at the problem report screen for the missing path. After the resource name and FRU, the next column identifies the missing path between resources (for example, scsi0 -> hdisk1). This indicates the missing path between the two resources, scsi0 (the parent resource) and hdisk1 (the child resource). Is the cabling present between the two resources? NO Go to Step 0285-2. YES Go to Step 0285-4.Note: In the following MAP steps, if no path previously existed between a parent and child device, the child device will have to be changed from the defined to the available state, otherwise you will be unable to select the child device to which you want to establish a path. v Step 0285-2 1. Power off the system. 2. Connect the correct cable between the two resources. 3. Power on the system, rebooting AIX. 4. At the AIX command line, type smitty mpio. 5. Choose MPIO Path Management. 6. Select Enable Paths. 7. Select Enable Paths for a Device. 8. In the Device Name selection, choose the name of the child device (the device at the end of the path). 9. In the Paths to Enable selection, select the missing path (between the parent and child device). 296 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-4. YES Go to Step 0285-3.v Step 0285-3 To verify that the device path is present, rerun diag -a. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0285-4.v Step 0285-4 1. Power off the system. 2. Reseat the cables between the device and the adapter that have the missing path. 3. Power on the system, and reboot AIX. 4. At the AIX command line, type smitty mpio. 5. Choose MPIO Path Management. 6. Select Enable Paths. 7. Select Enable Paths for a Device. 8. In the Device Name selection, choose the name of the child device (the device at the end of the path). 9. In the Paths to Enable selection, select the missing path (between the parent and child device). 10. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-6. YES Go to Step 0285-5.v Step 0285-5 To verify that the device path is present, rerun diag -a When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0285-6.v Step 0285-6 You may have a problem with the driver interface or connector on either the parent or child device. 1. Power off the system (refer to the system service guide if necessary). 2. Remove the cables to the parent device. 3. Replace the cable(s) that go between the parent and child device (if present). 4. Reattach the cables to the parent device. 5. Power on the system, rebooting AIX (refer to the system service guide if necessary). 6. At the AIX command line, type smitty mpio. Chapter 2. Isolation Procedures 297 7. Choose MPIO Path Management. 8. Select Enable Paths. 9. Select Enable Paths for a Device. 10. In the Device Name selection, choose the name of the child device (the device at the end of the path). 11. In the Paths to Enable selection, select the missing path (between the parent and child device). 12. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-7. YES Go to Step 0285-8.v Step 0285-7 To verify that the device path is present, rerun diag -a. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If the letter P is not shown in front of the resource, or if the system returns to the command prompt, or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Go to Step 0285-8.v Step 0285-8 You may have a problem with the driver interface or connector on either the parent or child device. 1. Power off the system. 2. Remove the cables to the parent device. 3. Replace the parent device. 4. Reattach the cables to the parent device. 5. Power on the system, rebooting AIX. 6. At the AIX command line, type smitty mpio. 7. Choose MPIO Path Management. 8. Select Enable Paths. 9. Select Enable Paths for a Device. 10. In the Device Name selection, choose the name of the child device (the device at the end of the path). 11. In the Paths to Enable selection, select the missing path (between the parent and child device). 12. Press Enter. Did the smitty menu complete with no errors? NO Go to Step 0285-9. YES Go to Step 0285-10.v Step 0285-9 To verify that the device path is present, rerun diag -a. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. 298 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) YES Go to Step 0285-10.v Step 0285-10 You may have a problem with the driver interface or connector on the child device. 1. Power off the system. 2. Remove the cables to the child device. 3. Replace the child device (in the case of a SCSI or SAF-TE backplane, replace the backplane first, followed by the child device). 4. Reattach the cables to the child device. 5. Power on the system, rebooting AIX. 6. At the AIX command line, type smitty mpio. 7. Choose MPIO Path Management. 8. Select Enable Paths. 9. Select Enable Paths for a Device. 10. In the Device Name selection, choose the name of the child device (the device at the end of the path). 11. In the Paths to Enable selection, select the missing path (between the parent and child device). 12. Press Enter. Did the smitty menu complete with no errors? NO Neither the cabling, nor the parent, nor the child seem to be causing the problem. Contact your service support structure. YES Go to Step 0285-11.v Step 0285-11 Rerun diag -a to verify that the device path is present. When the resource list displays, select the child resource that had the missing path. If the letter P is shown in front of the resource, you still have a path problem. If no letter P is shown in front of the resource or if the system returns to the command prompt or you get a message without the resource list being displayed, then the problem has been resolved. Do you still have a path problem? NO Go to Chapter 5, “Verifying a repair,” on page 557. YES Neither the cabling, nor the parent, nor the child seem to be causing the problem. Contact your service support structure.MAP 0291: Missing Device or Bus Problem Resolution Use this MAP when a bus or device (such as a disk drive) is reported as a missing resource by the diagnostics. Purpose of this MAP Use this MAP when a bus or device (such as a disk drive) is reported as a missing resource by the diagnostics. v Step 0291-1 The device may be missing because of a power problem. If the missing device is located in a drawer or enclosure, do the following: 1. Check for any environmental problem indicators such as power or cooling that may indicate a problem. 2. If a problem is indicated, follow the service documentation to resolve the problem. Go to Step 0291-2. Chapter 2. Isolation Procedures 299 v Step 0291-2 Inspect the cables (signal and power) of the missing device. Be sure all connections are in place and power is present. Refer to the system or enclosure documentation containing the power wiring diagram or system cable diagram to locate specific cables, determine the cable numbering, and check for a problem-determination procedure. Look for obvious power cabling problems, such as missing or loose cable connectors. Power problems can sometimes be identified by checking other devices that use the same power source (such as a diskette drive and a SCSI tape drive, even though they have different controllers). If other devices that share a power source are reported as missing devices, suspect the power source as the problem. If there is a power problem, refer to “Power problems” on page 129. Did you find a problem? NO Go to Step 0291-3. YES Correct the problem, then go to Chapter 5, “Verifying a repair,” on page 557.v Step 0291-3 Is the missing device a SCSI device installed in a SCSI Enclosure Services (AIX resource SESx) or SAF-TE (AIX resource SAFTEx), or a SCSI device installed in either type of enclosure? NO Go to “MAP 0210: General problem resolution” on page 246. YES Go to Step 0291-4.v Step 0291-4 Run the advanced diagnostics in problem determination mode on the SCSI Enclosure Services or SAF-TE device.Note: A no trouble found result, or if you get another SRN with the same digits before the dash as you previously had from the diagnostics, indicates that you did not get a different SRN. Did you get a different SRN than when you ran the diagnostics previously? NO Go to Step 0291-5. YES Take the following action: 1. Look up the SRN.Note: If the SRN is not listed in the Service Request Number Lists, look for additional information in the following: – Any supplemental service manual for the device. – The diagnostic Problem Report screen. – The Service Hints service aid in “Running the online and eServer stand-alone diagnostics” on page 672.2. Perform the action listed.v Step 0291-5 Power off the system. Disconnect all hot-swap devices attached to the adapter. Reconnect the hot-swap devices one at time. After reconnecting each device, do the following: 1. Power on the system and boot the system in the same mode that you were in when you received the symptom that led you to this MAP “Powering on and powering off” on page 588. 2. At an AIX command prompt, run the diag -a command to check for missing options. 3. Verify that the device you just added to the system is present in the system configuration. If other devices are now missing, the problem may be with the last device that was reconnected or reinstalled. Perform these substeps: a. If the device that was just reinstalled in the system is missing, or if additional devices are now missing, replace the last device that was reinstalled. 300 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) b. Rerun diag -a to check for missing options. c. If devices are no longer missing, go to Chapter 5, “Verifying a repair,” on page 557. Otherwise, contact your support center. Note: A device problem can cause other devices attached to the same SCSI adapter to go into the defined state. Ask the system administrator to make sure that all devices attached to the same SCSI adapter as the device that you replaced are in the available state.4. If no devices were missing, the problem could be intermittent. Make a record of the problem. Running the diagnostics for each device on the bus may provide additional information. If you have not replaced FFCs B88, 190, and 152 go to “MAP 0210: General problem resolution” on page 246, using FFCs (in order): B88, 190, and 152.PFW1540: Problem isolation procedures The PFW1540 procedures are used to locate problems in the processor subsystem or I/O subsystem(s). If a problem is detected, these procedures help you isolate the problem to a failing unit. Find the symptom in the following table; then follow the instructions given in the Action column. Problem Isolation Procedures Symptom/Reference Code/Checkpoint Action You have or suspect an I/O card or I/O subsystem failure.You received one of the following SRNs or reference codes: 101-000, 101-517, 101-521, 101-538, 101-551 to 101-557, 101-559 to 101-599, 101-662, 101-727, 101-c32, 101-c33, 101-c70 Go to “PFW1542: I/O problem isolation procedure” on page 302. You have or suspect a memory or processor subsystem problem. You received the following SRN or reference code: 101-185 Go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. If you were directed to the PFW1540 procedure by an SRN and that SRN is not listed in this table. Go to “PFW1542: I/O problem isolation procedure” on page 302. FRU identify LEDs Your system is configured with an arrangement of LEDs that help identify various components of the system. These include but are not limited to: v Rack identify beacon LED (optional rack status beacon) v Processor subsystem drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED v Power subsystem FRUs v Processor subsystem FRUs v I/O subsystem FRUs v I/O adapter identify LED v DASD identify LEDThe identify LEDs are arranged hierarchically with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding processor subsystem or I/O drawer identify LED, and the corresponding rack identify LED to locate the failing FRU more easily. Any identify LED in the system may be flashed; refer to “Managing the Advanced System Management Interface (ASMI)” on page 645. Chapter 2. Isolation Procedures 301 Any identify LED in the system may also be flashed by using the AIX diagnostic programs task “Identify and Attention Indicators”. The procedure to use the AIX diagnostics task “Identify and Attention Indicators” is outlined in “Tasks and Service Aids” in “Running the online and eServer stand-alone diagnostics” on page 672. PFW1542: I/O problem isolation procedure This I/O problem-determination procedure isolates I/O card and I/O subsystem failures. When I/O problem isolation is complete, all cables and cards that are failing will have been replaced or reseated. Notes: 1. To avoid damage to the system or subsystem components, unplug the power cords before removing or installing a part. 2. This procedure assumes that either: v An optical drive is installed and connected to the integrated EIDE adapter, and an AIX diagnostic CD-ROM is available. OR v AIX standalone diagnostics can be booted from a NIM server.3. If a power-on password or privileged-access password is set, you are prompted to enter the password before the AIX diagnostic CD-ROM can load. 4. The term POST indicators refers to the device mnemonics that appear during the power-on self-test (POST). 5. The service processor might have been set by the user to monitor system operations and to attempt recoveries. You might want to disable these options while you diagnose and service the system. If these settings are disabled, make notes of their current settings so that they can be restored before the system is turned back over to the customer. The following settings may be of interest. Monitoring (also called surveillance) From the ASMI menu, expand the System Configuration menu, then click Monitoring. Disable both types of surveillance. Auto power restart (also called unattended start mode) From the ASMI menu, expand Power/Restart Control, then click Auto Power Restart, and set it to disabled. Wake on LAN® From the ASMI menu, expand Wake on LAN, and set it to disabled. Call Out From the ASMI menu, expand the Service Aids menu, then click Call-Home/Call-In Setup. Set the call-home system port and the call-in system port to disabled.6. Verify that the system has not been set to boot to the SMS menus or to the open firmware prompt. From the ASMI menu, expand Power/Restart Control to view the menu, then click Power On/Off System. The AIX/Linux partition mode boot indicates Continue to Operating System.Use this procedure to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. Perform the following procedure: v PFW1542-1 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 302 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Select slow system boot speed on the power on/off system menu under the power/restart control menu on the ASMI. 4. Turn on the power. 5. Insert the AIX diagnostic CD-ROM into the optical drive. Does the optical drive appear to operate correctly? NO Go to “Problems with loading and starting the operating system (AIX and Linux)” on page 246. YES Continue to PFW1542-2.v PFW1542-2 1. When the keyboard indicator is displayed (the word ?keyboard?), if the system or partition gets that far in the IPL process, press the 5 key on the firmware console. 2. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Continue toPFW1542-3. YES Go to PFW1542-4.v PFW1542-3 The system is unable to boot standalone AIX diagnostics. Check the service processor error log (using the ASMI) and the operator panel for additional error codes resulting from the slow boot that was performed in PFW1542-1. Did the slow boot generate a different error code or partition firmware hang from the one that originally sent you to PFW1542? NO If you were sent here by an error code, and the error code did not change as the result of a slow boot, you have a processor subsystem problem. Go to “PFW1548: Memory and processor subsystem problem isolation procedure” on page 316. If you were sent here because the system is hanging on a partition firmware checkpoint, and the hang condition did not change as a result of the slow boot, go to PFW1542-5. YES Restore fast boot on the power on/off system menu from the ASMI. Look up the new error code in the reference code index and perform the listed actions.v PFW1542-4 The system stopped with the Please define the System Console prompt on the system console. Standalone diagnostics can be booted. Perform the following: 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. If the terminal type has not been defined, you must use the option Initialize Terminal on the FUNCTION SELECTION menu to initialize the AIX operating system environment before you can continue with the diagnostics. This is a separate operation from selecting the firmware console. 4. Select Advanced Diagnostic Routines. 5. When the DIAGNOSTIC MODE SELECTION menu displays, select System Verification to run diagnostics on all resources. Did running diagnostics produce a different symptom? NO Continue with the following sub-step. YES Return to the Problem Analysis procedures with the new symptom.6. Record any devices missing from the list of all adapters and devices. Continue with this procedure. When you have fixed the problem, use this record to verify that all devices appear when you run system verification. Are there any devices missing from the list of all adapters and devices? Chapter 2. Isolation Procedures 303 NO Reinstall all remaining adapters, if any, and reconnect all devices. Return the system to its original configuration. Be sure to select fast boot on the power on/off system menu on the ASMI. Go to Chapter 5, “Verifying a repair,” on page 557. YES The boot attempts that follow will attempt to isolate any remaining I/O subsystem problems with missing devices. Ignore any codes that may display on the operator panel unless stated otherwise. Continue to PFW1542-5.v PFW1542-5 Examine RIO port 0 of the first RIO bus card (Un-P1-C8-T2) in the base system unit drawer. Are there any I/O subsystems attached to this RIO card? NO Go to PFW1542-29. YES Continue to PFW1542-6.v PFW1542-6 There may be devices missing from one of more of the I/O subsystems, or one or more devices in the I/O subsystems may be causing the system or a partition to hang during IPL.Note: Various types of I/O subsystems might be connected to this system. The RIO and InfiniBand® ports on these subsystems are shown in the following table. Use this table to determine the physical location codes of the RIO or InfiniBand™ connectors that are mentioned in the remainder of this MAP. Table 42. RIO ports location table 8203-E4A, 9407-M15, or 9408-M25 (base system) adapters 7314-G30 or 5796 7311-D20 or 5095 RIO port 0 Un-P1-C8-T2 Un-P1-C6-T2 Un-P1-C7-T2 Un-P1-C05-00 or Un-P1-C05-T2 (bottom connector) RIO port 1 Un-P1-C8-T1 Un-P1-C6-T1 Un-P1-C7-T1 Un-P1-C05-01 or Un-P1-C05-T1 (top connector) Note: Before continuing, check the cabling from the base system to the I/O subsystem(s) to ensure that the system is cabled correctly. Refer to the cabling information for your I/O enclosure for valid configurations. Record the current cabling configuration and then continue with the following steps. In the following steps, the term RIO cable means either RIO cable or InfiniBand cable. 1. Turn off the power. Record the location and machine type and model number, or feature number, of each I/O drawer or tower. In the following steps, use this information to determine the physical location codes of the RIO connectors that are referred to by their logical names. For example, if I/O subsystem #1 is a 7311-D20 drawer, RIO port 0 is Un-P1-C05-T2. 2. At the base system drawer, disconnect the cable connection at RIO port 0 (Un-P1-C8-T2). 3. At the other end of the RIO cable referred to in step 2 of PFW1542-6, disconnect the I/O subsystem port connector 0. The RIO cable that was connected to RIO port 0 in the base system should now be loose; remove it. Record the location of this I/O subsystem and call it ?subsystem #1?. 4. Examine the connection at the I/O port connector 1 of the I/O subsystem recorded in step 3 of PFW1542-6. If the RIO cable attached to I/O port connector 1 connects to the I/O port connector 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #1, then go to step 8 of PFW1542-6. 304 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 6. At the I/O subsystem recorded in step 3 of PFW1542-6, disconnect the I/O port connector 1 and reconnect to I/O port 0. 7. Verify that a single RIO cable connects base system RIO port 0 (Un-P1-C8-T2) to the I/O subsystem recorded in step 4 port 0. Go to step 28 of PFW1542-6. 8. Record the location of the next I/O subsystem and call it ?subsystem #2?. This is the I/O subsystem that is connected to I/O port 1 of subsystem #1. 9. Examine the connection at the I/O port 1 of subsystem #2 recorded in step 8 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #2 and call it ?subsystem #3?. Go to step 13 of PFW1542-6. 10. The RIO cable attached to the I/O port 1 of subsystem #2 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 11. On subsystem #2, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 12. Verify that a single RIO cable connects base system RIO port 0 to one or two I/O subsystems. Go to step 28 of PFW1542-6. 13. Examine the connection at the I/O port 1 of subsystem #3 recorded in step 9 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of the subsystem #3 and call it ?subsystem #4?. Go to step 17 of PFW1542-6. 14. The RIO cable attached to the I/O port 1 of subsystem #3 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 15. On subsystem #3, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 16. Verify that a single RIO cable connects base system RIO port 0 to three I/O subsystems. Go to step 28 of PFW1542-6. 17. Examine the connection at the I/O port 1 of subsystem #4 recorded in step step 13 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #4 and call it ?subsystem #5?. Go to step step 21 of PFW1542-6. 18. The RIO cable attached to the I/O port 1 of subsystem #4 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 19. On subsystem #4, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 20. Verify that a single RIO cable connects base system RIO port 0 to four I/O subsystems. Continue to step 28 of PFW1542-6. 21. Examine the connection at the I/O port 1 of subsystem #5 recorded in step step 17 of PFW1542-6. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #5 and call it ?subsystem #6?. Go to step step 25 of PFW1542-6 22. The RIO cable attached to the I/O port 1 of subsystem #5 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 23. On subsystem #5, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. Chapter 2. Isolation Procedures 305 24. Verify that a single RIO cable connects base system RIO port 0 to five I/O subsystems. Go to step 28 of PFW1542-6 25. The RIO cable attached to the I/O port 1 of subsystem #6 is attached to port 1 (Un-P1-C8-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C8-T1) and reconnect it to RIO port 0 (Un-P1-C8-T2). 26. On subsystem #6, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 27. Verify that a single RIO cable connects base system RIO port 0 to six I/O subsystems. Continue with step 28 of PFW1542-6 28. Turn on the power to boot the standalone AIX diagnostics from CD-ROM. 29. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 30. Use the option Display Configuration and Resource List to list all of the attached devices and adapters. 31. Verify that all adapters and the attached devices are listed. Did the ?Please define the System Console? screen display and are all adapters and attached devices listed? NO Go to PFW1542-7. YES The RIO cable that was removed in step 3 above is defective. Replace this RIO cable.– If six I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #6 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If five I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #5 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If four I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #4 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If three I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #3 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If two I/O subsystems are chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #2 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If one I/O subsystem is chained to RIO port 0 (Un-P1-C8-T2) of the base system, connect the new RIO cable from subsystem #1 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). Restore the system to its original configuration. Go to Chapter 5, “Verifying a repair,” on page 557. v PFW1542-7 The I/O attached to the RIO ports on the other GX adapter will now be isolated. Power off the system. Disconnect the cable connection at RIO port 0 (Un-P1-C8-T2) of the base system. v PFW1542-8 1. Turn on the power to boot the standalone diagnostic CD-ROM. 2. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 3. Use the option Display Configuration and Resource List to list all of the attached devices and adapters. 4. Check that all adapters and attached devices in the base system are listed. If the ?Please define the system console? screen was not displayed or all adapters and attached devices are not listed, the problem is in the base system. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-29. YES Go to PFW1542-21. 306 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v PFW1542-9 For subsystem #1: Are there any adapters in the I/O subsystem? NO Go to PFW1542-10. YES Go to PFW1542-15.v PFW1542-10 For subsystem #2: Are there any adapters in the I/O subsystem? NO Go to PFW1542-11. YES Go to PFW1542-16.v PFW1542-11 For subsystem #3: Are there any adapters in the I/O subsystem? NO Go to PFW1542-12. YES Go to PFW1542-17.v PFW1542-12 For subsystem #4: Are there any adapters in the I/O subsystem? NO Go to PFW1542-13. YES Go to PFW1542-18.v PFW1542-13 For subsystem #5: Are there any adapters in the I/O subsystem? NO Go to PFW1542-14. YES Go to PFW1542-19.v PFW1542-14 For subsystem #6: Are there any adapters in the I/O subsystem? NO Go to PFW1542-23. YES Go to PFW1542-20.v PFW1542-15 (Subsystem #1) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please select the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-10. YES Go to PFW1542-21.v PFW1542-16 (Subsystem #2) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. Chapter 2. Isolation Procedures 307 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please select the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-11. YES Go to PFW1542-21.v PFW1542-17 (Subsystem #3) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please select the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-12. YES Go to PFW1542-21.v PFW1542-18 (Subsystem #4) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-13. YES Go to PFW1542-21. 308 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v PFW1542-19 (Subsystem #5) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-14. YES Go to PFW1542-21.v PFW1542-20 (Subsystem #6) 1. If it is not already off, turn off the power. 2. Label and record the locations of any cables attached to the adapters, then disconnect the cables. 3. Record the slot numbers of the adapters. 4. Remove all adapters from the I/O subsystem. 5. Turn on the power to boot the standalone diagnostic CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 (zero) key on the ASCII terminal’s keyboard. 7. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO Go to PFW1542-23. YES Go to PFW1542-21.v PFW1542-21 If the ?Please define the system console? screen was displayed and all adapters and attached devices were not listed, the problem is with one of the adapters or attached devices that was removed or disconnected from the I/O subsystem. 1. Turn off the power. 2. Reinstall one adapter or device that was removed. Use the original adapters in their original slots when reinstalling adapters. 3. Turn on the power to boot the standalone diagnostic CD-ROM. 4. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 5. Use the option Display Configuration and Resource List to list all adapters and attached devices. 6. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? Chapter 2. Isolation Procedures 309 NO Go to PFW1542-22. YES Reinstall the next adapter and device and return to the beginning of this step. Repeat this process until an adapter or device causes the ?Please define the System Console? screen to not display, or all attached devices and adapters to not be listed. After installing all of the adapters and the ?Please define the System Console? screen does display and all attached devices and adapters are listed, return the system to its original configuration. Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1542-22 Replace the adapter you just installed with a new adapter and retry booting AIX standalone diagnostics from CD-ROM. 1. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 2. Use the option Display Configuration and Resource List to list all adapters and attached devices. 3. Check that all adapters and attached devices are listed. Was the ?Please define the system console? screen displayed and were all adapters and attached devices listed? NO The I/O subsystem backplane is defective. Replace the I/O subsystem backplane. In all 4 subsystem types, the I/O subsystem backplane is Un-CB1. Then go to PFW1542-24. YES The adapter was defective. Go to PFW1542-24.v PFW1542-23 1. Turn off the power. 2. Disconnect the I/O subsystem power cables. 3. Replace the following parts, one at a time, if present, in the sequence listed: a. I/O subsystem #1 backplane b. I/O subsystem #2 backplane c. I/O subsystem #3 backplane d. I/O subsystem #4 backplane e. I/O subsystem #5 backplane f. I/O subsystem #6 backplane g. The RIO interface in the base system to which the RIO cables are presently attached.4. Reconnect the I/O subsystem power cables. 5. Turn on the power. 6. Boot standalone AIX diagnostics from CD. 7. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all attached devices and adapters are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Replace the next part in the list and return to the beginning of this step. Repeat this process until a part causes the ?Please define the System Console? screen to be displayed and all adapters and attached devices to be listed. If you have replaced all the items listed above and the ?Please define the System Console? screen does not display or all attached devices and adapters are not listed, check all external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to PFW1542-22.v PFW1542-24 310 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) The item you just replaced fixed the problem. 1. Turn off the power. 2. If a display adapter with keyboard and mouse were installed, reinstall the display adapter, keyboard, and mouse. 3. Reconnect the tape drive (if previously installed) to the internal SCSI bus cable. 4. Plug in all adapters that were previously removed but not reinstalled. 5. Reconnect the I/O subsystem power cables that were previously disconnected. Return the system to its original condition. Go to Go to Chapter 5, “Verifying a repair,” on page 557. v PFW1542-25 1. Turn off the power. 2. At the base system, reconnect the cable connection at RIO port 0 (Un-P1-C8-T2) recorded in PFW1542-7. 3. At the base system, reconnect the cable connection at RIO port 1 (Un-P1-C8-T1) recorded in PFW1542-7. 4. Reconnect the power cables to the I/O subsystems that were found attached to the base system RIO ports mentioned in step 2 and step 3 of PFW1542-25. All I/O subsystems that were attached to the base system RIO port 0 and RIO port 1 should now be reconnected to the base system. 5. Make sure the I/O subsystem(s) are cabled correctly. See the cabling information for your I/O subsystem. 6. Turn on the power to boot standalone AIX diagnostics from CD-ROM. 7. If the ?Please define the System Console? screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-9 to isolate a problem in an I/O subsystem attached to the base system RIO bus on the system backplane. YES Go to PFW1542-26.v PFW1542-26 Is there a second GX adapter plugged into Un-P1-C6 in the base system, and if so, is there at least one I/O subsystem attached to it? NO Go to PFW1542-29. YES Continue to PFW1542-27.v PFW1542-27 1. Turn off the power. 2. At the base system, reconnect the cable connection at RIO port 0 on the second GX adapter (Un-P1-C6-T2) recorded in PFW1542-7. 3. At the base system, reconnect the cable connection at RIO port 1 on the second GX adapter (Un-P1-C6-T1) recorded in PFW1542-7. 4. Reconnect the power cables to the I/O subsystems that were attached to the second GX adapter’s ports mentioned in substeps 2 and 3 in this step. All I/O subsystems that were attached to RIO port 0 on the second GX adapter (Un-P1-C6-T2) and RIO port 1 on the second GX adapter (Un-P1-C6-T1) in the base system should now be reconnected to the system. 5. Make sure that the I/O subsystem(s) are cabled correctly as shown in the cabling information for your I/O expansion unit. 6. Turn on the power to boot the standalone AIX diagnostic CD-ROM. Chapter 2. Isolation Procedures 311 7. If the Please define the system console screen is displayed, follow the directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Verify that all adapters and attached devices are listed. Did the ?Please define the system console? screen display and are all adapters and attached devices listed? NO Go to PFW1542-28 to isolate the problems in the I/O subsystems that are attached to the second GX adapter in the base system. YES Go to PFW1542-29.v PFW1542-28 At the base system, reconnect the second I/O subsystem to the RIO ports on the base system’s GX adapter at Un-P1-C6-T1 and Un-P1-C6-T2. The RIO ports on these subsystems are shown in the following table. Use this table to determine the physical location codes of the RIO connectors that are mentioned in the remainder of this MAP. Table 43. RIO ports location table 8203-E4A, 9407-M15, or 9408-M25 (base system) adapters 7314-G30 or 5796 7311-D20 or 5095 RIO port 0 Un-P1-C8-T2 Un-P1-C6-T2 Un-P1-C7-T2 Un-P1-C05-00 or Un-P1-C05-T2 (bottom connector) RIO port 1 Un-P1-C8-T1 Un-P1-C6-T1 Un-P1-C7-T1 Un-P1-C05-01 or Un-P1-C05-T1 (top connector) Note: Before continuing, check the cabling from the base system to the I/O subsystem(s) to ensure that the system is cabled correctly. Record the current cabling configuration and then continue with the following steps. 1. Turn off the power. Record the location and machine type and model number, or feature number, of each I/O drawer or tower. In the following steps, use this information to determine the physical location codes of the RIO connectors that are referred to by their logical names. For example, if I/O subsystem #1 is a 7311/D20 drawer, RIO port 0 is Un-P1-C05-T2. 2. At the base system, disconnect the cable connection at RIO port 0 (Un-P1-C6-T2). 3. At the other end of the RIO cable referred to in step 2 of PFW1542-28, disconnect the I/O subsystem port connector 0. The RIO cable that was connected to RIO port 0 on the expansion card should now be loose; remove it. Record the location of this I/O subsystem and call it ?subsystem #1?. 4. Examine the connection at the I/O port connector 1 of the I/O subsystem recorded in step 3 of PFW1542-28. If the RIO cable attached to I/O port connector 1 connects to the I/O port connector 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #1, then go to step 8 of PFW1542-28. 5. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 6. At the I/O subsystem recorded in step 3 of PFW1542-28, disconnect the I/O port connector 1 and reconnect to I/O port 0. 7. Verify that a single RIO cable connects base system RIO port 0 (Un-P1-C6-T2) to the I/O subsystem recorded in step 4 of PFW1542-28 port 0. Go to step 28 of PFW1542-28. 8. Record the location of the next I/O subsystem and call it ?subsystem #2?. This is the I/O subsystem that is connected to I/O port 1 of subsystem #1. 312 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Examine the connection at the I/O port 1 of subsystem #2 recorded in step 8 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #2 and call it ?subsystem #3?. Go to step 13 of PFW1542-28. 10. The RIO cable attached to the I/O port 1 of subsystem #2 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 11. On subsystem #2, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 12. Verify that a single RIO cable connects base system RIO port 0 to two I/O subsystems. Go to step 28 of PFW1542-28. 13. Examine the connection at the I/O port 1 of subsystem #3 recorded in step 9 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of the subsystem #3 and call it ?subsystem #4?. Go to step 17 of PFW1542-28. 14. The RIO cable attached to the I/O port 1 of subsystem #3 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 15. On subsystem #3, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 16. Verify that a single RIO cable connects base system RIO port 0 to three I/O subsystems. Go to step 28 of PFW1542-28. 17. Examine the connection at the I/O port 1 of subsystem #4 recorded in step 13 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of subsystem #4 and call it ?subsystem #5?. Go to step 21 of PFW1542-28 18. The RIO cable attached to the I/O port 1 of subsystem #4 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 19. On subsystem #4, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 20. Verify that a single RIO cable connects base system RIO port 0 to four I/O subsystems. Go to step 27 of PFW1542-28. 21. Examine the connection at the I/O port 1 of subsystem #5 recorded in step 17 of PFW1542-28. If the RIO cable attached to I/O port 1 connects to the I/O port 0 of another I/O subsystem, record the location of the next I/O subsystem that is connected to I/O port 1 of the subsystem #5 and call it ?subsystem #6?. Go to step 25 of PFW1542-28. 22. The RIO cable attached to the I/O port 1 of subsystem #5 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 23. On subsystem #5, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 24. Verify that a single RIO cable connects base system RIO port 0 to five I/O subsystems. Go to step 28 of PFW1542-28. 25. The RIO cable attached to the I/O port 1 of subsystem #6 is attached to port 1 (Un-P1-C6-T1) of the base system. At the base system, disconnect the cable connection at RIO port 1 (Un-P1-C6-T1) and reconnect it to RIO port 0 (Un-P1-C6-T2). 26. On subsystem #6, disconnect the cable from I/O port 1 and reconnect it to I/O port 0 of subsystem #1. 27. Verify that a single RIO cable connects base system RIO port 0 to six I/O subsystems. Go to step 28 of PFW1542-28. Chapter 2. Isolation Procedures 313 28. Turn on the power to boot the standalone AIX diagnostics from CD-ROM. 29. If the ?Please define the system console? screen is displayed, follow the directions to select the system console. 30. Use the option Display Configuration and Resource List to list all of the attached devices and adapters. 31. Verify that all adapters and the attached devices are listed. Did the ?Please define the system console? screen display and are all adapters and attached devices listed? NO Go to PFW1542-9. YES The RIO cable that was removed in step 3 above is defective. Replace this RIO cable.– If six I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #6 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If five I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #5 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If four I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #4 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If three I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #3 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If two I/O subsystems are chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #2 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). – If one I/O subsystem is chained to RIO port 0 (Un-P1-C6-T2) of the base system, connect the new RIO cable from subsystem #1 I/O port 1 to base system RIO port 1 (Un-P1-C8-T1). Restore the system back to its original configuration. Go to Chapter 5, “Verifying a repair,” on page 557. v PFW1542-29 Are there any adapters in the PCI slots in the base system? NO Go to PFW1542-30. YES Go to PFW1542-32.v PFW1542-30 Replace the system backplane, Un-P1. Continue to PFW1542-31. v PFW1542-31 1. Boot standalone AIX diagnostics from CD. 2. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 3. Use the Display Configuration and Resource List to list all adapters and attached devices. 4. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-35. YES Go to PFW1542-24.v PFW1542-32 1. If it is not already off, turn off the power. 2. Label and record the location of any cables attached to the adapters. 3. Record the slot number of the adapters. 4. Remove all adapters from slots 1, 2, 3, 4, 5, and 6 in the base system that are not attached to the boot device. 314 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Turn on the power to boot standalone AIX diagnostics from CD-ROM. 6. If the ASCII terminal displays Enter 0 to select this console, press the 0 key on the ASCII terminal’s keyboard. 7. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-35. YES Continue to PFW1542-33.v PFW1542-33 If the ?Please define the System Console? screen does display and all adapters and attached devices are listed, the problem is with one of the adapters or devices that was removed or disconnected from the base system. 1. Turn off the power. 2. Reinstall one adapter and device that was removed. Use the original adapters in their original slots when reinstalling adapters. 3. Turn on the power to boot standalone AIX diagnostics from the optical drive. 4. If the ?Please define the System Console? screen is displayed, follow the directions to select the system console. 5. Use the Display Configuration and Resource List to list all adapters and attached devices. 6. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Continue to PFW1542-34. YES Return to the beginning of this step to continue reinstalling adapters and devices.v PFW1542-34 Replace the adapter you just installed with a new adapter and retry the boot to standalone AIX diagnostics from CD-ROM. 1. If the ?Please Define the System Console? screen is displayed, follow directions to select the system console. 2. Use the option Display Configuration and Resource List to list all adapters and attached devices. 3. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all attached devices and adapters listed? NO Go to PFW1542-30. YES The adapter you just replaced was defective. Go to PFW1542-24.v PFW1542-35 1. Turn off the power. 2. Disconnect the base system power cables. 3. Replace the following parts, one at a time, in the sequence listed: a. Optical drive b. Removable media backplane and cage assembly c. Disk drive backplane and cage assembly d. I/O backplane, location Un-P1 Chapter 2. Isolation Procedures 315 e. Service processor4. Reconnect the base system power cables. 5. Turn on the power. 6. Boot standalone AIX diagnostics from CD. 7. If the ?Please define the System Console? screen is displayed, follow directions to select the system console. 8. Use the option Display Configuration and Resource List to list all adapters and attached devices. 9. Check that all adapters and attached devices are listed. Did the ?Please define the System Console? screen display and are all adapters and attached devices listed? NO Replace the next part in the list and return to the beginning of this step. Repeat this process until a part causes the Please define the System Console screen to be displayed and all adapters and attached devices to be listed. If you have replaced all the items listed above and the Please define the System Console screen does not display or all adapters and attached devices are not listed, check all external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to PFW1542-24.PFW1548: Memory and processor subsystem problem isolation procedure Use this problem isolation procedure to aid in solving memory and processor problems that are not found by normal diagnostics. Notes: 1. To avoid damage to the system or subsystem components, unplug the power cords before removing or installing any part. 2. This procedure assumes that either: v An optical drive is installed and connected to the integrated EIDE adapter, and an AIX diagnostic CD-ROM is available. OR v AIX standalone diagnostics can be booted from a NIM server.3. If a power-on password or privileged-access password is set, you are prompted to enter the password before the AIX diagnostic CD-ROM can load. 4. The term POST indicators refers to the device mnemonics that appear during the power-on self-test (POST). 5. The service processor might have been set by the user to monitor system operations and to attempt recoveries. You might want to disable these options while you diagnose and service the system. If these settings are disabled, make notes of their current settings so that they can be restored before the system is turned back over to the customer. The following settings may be of interest. Monitoring (also called surveillance) From the ASMI menu, expand the System Configuration menu, then click Monitoring. Disable both types of surveillance. Auto power restart (also called unattended start mode) From the ASMI menu, expand Power/Restart Control, then click Auto Power Restart, and set it to disabled. Wake on LAN From the ASMI menu, expand Wake on LAN, and set it to disabled. 316 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Call Out From the ASMI menu, expand the Service Aids menu, then click Call-Home/Call-In Setup. Set the call-home system port and the call-in system port to disabled.6. Verify that the system has not been set to boot to the System Management Services (SMS) menus or to the open firmware prompt. From the ASMI menu, expand Power/Restart Control to view the menu, then click Power On/Off System. The AIX/Linux partition mode boot should say ?Continue to Operating System?. 7. The service processor might have recorded one or more symptoms in its error/event log. Use the Advanced System Management Interface (ASMI) menus to view the error/event log. v If you arrived here after performing a slow boot, look for a possible new error that occurred during the slow boot. If there is a new error, and its actions call for a FRU replacement, perform those actions. If this does not resolve the problem, go to PFW1548-1. v If an additional slow boot has not been performed, or if the slow boot did not yield a new error code, look at the error that occurred just before the original error. Perform the actions associated with that error. If this does not resolve the problem, go to PFW1548-1. v If a slow boot results in the same error code, and there are no error codes before the original error code, go to PFW1548-1.8. To continue with this procedure, select one of the following files according to the system you are servicing: v “PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached” v “PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached” on page 324 “PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached” “PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached” on page 324PFW1548: Memory and processor subsystem problem isolation procedure without an HMC attached This procedure is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. Note: the system backplane has two memory DIMM quads: Un-P1-C14 to Un-P1-C17, and Un-P1-C21 to Un-P1-C24. See “Locations” on page 339. Perform the following procedure: v PFW1548-1 1. Ensure that the diagnostics and the operating system are shut down. Is the system at ?service processor standby?, indicated by 01 in the control panel? NO Replace the system backplane, location: Un-P1. Return to step PFW1548-1. YES Continue with substep 2.2. Turn on the power using either the white button or the ASMI menus. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? NO Go to PFW1548-3. YES Go to PFW1548-2.3. Insert the AIX diagnostic CD-ROM into the optical drive. Chapter 2. Isolation Procedures 317 Note: If you cannot insert the diagnostic CD-ROM, go to PFW1548-2. 4. When the word keyboard is displayed on an ASCII terminal or a directly attached keyboard, press the number 5 key. 5. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-2. YES Go to PFW1548-14.v PFW1548-2 1. Insert the AIX diagnostic CD-ROM into the optical drive.Note: If you cannot insert the diagnostic CD-ROM, go to step PFW1548-3. 2. Turn on the power using either the white button or the ASMI menus. If the diagnostic CD-ROM is not in the optical drive, insert it now. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-3. YES Go to PFW1548-14.v PFW1548-3 1. Turn off the power. 2. If you have not already done so, configure the service processor (using the ASMI menus) with the instructions in note 6 on page 317 at the beginning of this procedure, then return here and continue. 3. Exit the service processor (ASMI) menus and remove the power cords. 4. Disconnect all external cables (parallel, system port 1, system port 2, keyboard, mouse, USB devices, SPCN, Ethernet on the system planar, and so on). Also disconnect all of the external cables attached to the service processor.Go to the next step. v PFW1548-4 1. If this is a deskside system, remove the service access cover. If this is a rack-mounted system, place the drawer into the service position and remove the service access cover. Also remove the front cover. 2. Record the slot numbers of the PCI adapters and I/O expansion cards if present. Label and record the locations of all cables attached to the adapters. Disconnect all cables attached to the adapters and remove all of the adapters. 3. Remove the removable media/disk drive enclosure assembly by pulling out the blue tabs at the bottom of the enclosure, then sliding the enclosure out approximately three centimeters. 4. Remove and label the disk drives from the media/disk drive enclosure assembly. 5. Remove a memory DIMM quad. Notes: a. Place the memory DIMM locking tabs in the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.6. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 7. Turn on the power using the white button. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? 318 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) NO Go to PFW1548-7. YES Go to the next step.v PFW1548-5 Were any memory DIMMs removed from system backplane? NO Go to PFW1548-8. YES Go to the next step.v PFW1548-6 1. Turn off the power, and remove the power cords. 2. Replug the memory DIMMs that were removed from the system backplane in PFW1548-2 in their original locations. Notes: a. Place the memory DIMM locking tabs into the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.3. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 4. Turn on the power using the white button. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? NO A memory DIMM in the quad you just replaced in the system is defective. Turn off the power, remove the power cords, and exchange the memory DIMMs quad with new or previously removed memory DIMM quad. Repeat this step until the defective memory DIMM quad is identified, or both memory DIMM quads have been exchanged. Note: the system backplane has two memory DIMM quads: Un-P1-C14 to Un-P1-C17 and Un-P1-C21 to Un-P1-C24. See “Locations” on page 339. If your symptom did not change and both the memory DIMM quads have been exchanged, call your service support person for assistance. If the symptom changed, check for loose cards and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-7 One of the FRUs remaining in the system unit is defective.Note: If a memory DIMM is exchanged, ensure that the new memory DIMM is the same size and speed as the original memory DIMM. 1. Turn off the power, remove the power cords, and exchange the following FRUs, one at a time, in the order listed: a. Memory DIMMs. Exchange one quad at a time with new or previously removed DIMM quads b. System backplane, location: Un-P1 c. Power supplies, locations: Un-E1 and Un-E2.2. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 3. Turn on the power using the white button. Does the system reach an operating system login prompt, or if booting the AIX diagnostic CD-ROM, is the ?Please define the System Console? screen displayed? Chapter 2. Isolation Procedures 319 NO Reinstall the original FRU. Repeat the FRU replacement steps until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-8 1. Turn off the power. 2. Reconnect the system console. Notes: a. If an ASCII terminal has been defined as the firmware console, attach the ASCII terminal cable to the S1 connector on the rear of the system unit. b. If a display attached to a display adapter has been defined as the firmware console, install the display adapter and connect the display to the adapter. Plug the keyboard and mouse into the keyboard connector on the rear of the system unit.3. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 4. If the ASCII terminal or graphics display (including display adapter) is connected differently from the way it was previously, the console selection screen appears. Select a firmware console. 5. Immediately after the word keyboard is displayed, press the number 1 key on the directly attached keyboard, or an ASCII terminal. This action activates the system management services (SMS). 6. Enter the appropriate password if you are prompted to do so. Is the SMS screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. If you are using an ASCII terminal, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the system backplane, location: Un-P1.2. If you are using a graphics display, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the display adapter. b. Replace the backplane in which the graphics adapter is plugged. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom.YES Go to the next step.v PFW1548-9 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 320 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Use the cam levers to reconnect the disk drive enclosure assembly to the I/O backplane. 4. Reconnect the removable media/disk drive enclosure assembly by sliding the media enclosure toward the rear of the system, then pressing the blue tabs. 5. Plug in the power cords and wait for 01 in the upper-left corner of the operator panel display. 6. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. Optical drive 2. Removable media enclosure. 3. System backplane, Un-P1.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-10 The system is working correctly with this configuration. One of the disk drives that you removed from the disk drive backplanes may be defective. 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Install a disk drive in the media/disk drive enclosure assembly. 4. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 5. Turn on the power. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Exchange the FRUs that have not been exchanged, in the following order: 1. Last disk drive installed 2. Disk drive backplane.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step with all disk drives that were installed in the disk drive backplane. After all of the disk drives have been reinstalled, go to the next step. Chapter 2. Isolation Procedures 321 v PFW1548-11 The system is working correctly with this configuration. One of the devices that was disconnected from the system backplane may be defective. 1. Turn off the power and remove the power cords. 2. Attach a system backplane device (for example: system port 1, system port 2, USB, keyboard, mouse, Ethernet) that had been removed. After all of the I/O backplane device cables have been reattached, reattached the cables to the service processor one at a time. 3. Plug in the power cords and wait for 01 in the upper-left corner on the operator panel display. 4. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO The last device or cable that you attached is defective. To test each FRU, exchange the FRUs in the following order: 1. Device and cable (last one attached) 2. System backplane, location: Un-P1.If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step until all of the devices are attached. Go to the next step.v PFW1548-12 The system is working correctly with this configuration. One of the FRUs (adapters) that you removed may be defective. 1. Turn off the power and remove the power cords. 2. Install a FRU (adapter) and connect any cables and devices that were attached to the FRU. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power using the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Go to the next step. YES Repeat this step until all of the FRUs (adapters) are installed. Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-13 The last FRU installed or one of its attached devices is probably defective. 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 322 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Starting with the last installed adapter, disconnect one attached device and cable. 4. Plug in the power cords and wait for the 01 in the upper-left corner on the operator panel display. 5. Turn on the power using either the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) 6. If the Console Selection screen is displayed, choose the system console. 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Repeat this step until the defective device or cable is identified or all devices and cables have been disconnected. If all the devices and cables have been removed, then one of the FRUs remaining in the system unit is defective. To test each FRU, exchange the FRUs in the following order: 1. Adapter (last one installed) 2. System backplane, location: Un-P1.If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES The last device or cable that you disconnected is defective. Exchange the defective device or cable and then go to the next step.v PFW1548-14 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. Select Advanced Diagnostics Routines. 4. If the terminal type has not been defined, you must use the option Initialize Terminal on the FUNCTION SELECTION menu to initialize the AIX diagnostic environment before you can continue with the diagnostics. This is a separate operation from selecting the console display. 5. If the NEW RESOURCE screen is displayed, select an option from the bottom of the screen. Note: Adapters and devices that require supplemental media are not shown in the new resource list. If the system has adapters or devices that require supplemental media, select option 1. 6. When the DIAGNOSTIC MODE SELECTION screen is displayed, press Enter. 7. Select All Resources. If you were sent here from step PFW1548-18, select the adapter or device that was loaded from the supplemental media. Did you get an SRN? NO Go to step PFW1548-16. YES Go to the next step.v PFW1548-15 Look at the FRU part numbers associated with the SRN. Have you exchanged all the FRUs that correspond to the failing function codes (FFCs)? NO Exchange the FRU with the highest failure percentage that has not been changed. Chapter 2. Isolation Procedures 323 Repeat this step until all the FRUs associated with the SRN have been exchanged or diagnostics run with no trouble found. Run diagnostics after each FRU is exchanged. Go to Chapter 5, “Verifying a repair,” on page 557. YES If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.v PFW1548-16 Does the system have adapters or devices that require supplemental media? NO Go to step the next step. YES Go to step PFW1548-18.v PFW1548-17 Consult the PCI adapter configuration documentation for your operating system to verify that all adapters are configured correctly. Go to Chapter 5, “Verifying a repair,” on page 557. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. v PFW1548-18 1. Select Task Selection. 2. Select Process Supplemental Media and follow the on-screen instructions to process the media. Supplemental media must be loaded and processed one at a time. Did the system return to the TASKS SELECTION SCREEN after the supplemental media was processed? NO Go to the next step. YES Press F3 to return to the FUNCTION SELECTION screen. Go to step PFW1548-14, substep 4 on page 323.v PFW1548-19 The adapter or device is probably defective. If the supplemental media is for an adapter, replace the FRUs in the following order: 1. Adapter 2. System backplane, location: Un-P1. If the supplemental media is for a device, replace the FRUs in the following order: 1. Device and any associated cables 2. The adapter to which the device is attached Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. Go to Chapter 5, “Verifying a repair,” on page 557. End of procedure.PFW1548: Memory and processor subsystem problem isolation procedure when an HMC is attached This procedure is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. 324 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Note: the system backplane has two memory DIMM quads: Un-P1-C14 to Un-P1-C17, and Un-P1-C21 to Un-P1-C24. See “Locations” on page 339. Perform the following procedure: v PFW1548-1 1. Ensure that the diagnostics and the operating system are shut down. Is the system at ?service processor standby?, indicated by 01 in the control panel? NO Replace the system backplane, location: Un-P1. Return to step PFW1548-1. YES Continue with substep 2.2. Turn on the power using either the white button or the ASMI menus. Does the system reach hypervisor standby as indicated on the HMC? NO Go to PFW1548-3. YES Go to PFW1548-2.3. Insert the AIX diagnostic CD-ROM into the optical drive. Note: If you cannot insert the diagnostic CD-ROM, go to PFW1548-2. 4. When the word keyboard is displayed on an ASCII terminal, a directly attached keyboard, or hardware management console (HMC), press the number 5 key. 5. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-2. YES Go to PFW1548-14.v PFW1548-2 Insert the AIX diagnostic CD-ROM into the optical drive.Note: If you cannot insert the diagnostic CD-ROM, go to step PFW1548-3. Turn on the power using either the white button or the ASMI menus. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. If you are prompted to do so, enter the appropriate password. Is the ?Please define the System Console? screen displayed? NO Go to PFW1548-3. YES Go to PFW1548-14.v PFW1548-3 1. Turn off the power. 2. If you have not already done so, configure the service processor (using the ASMI menus) with the instructions in note 6 on page 317 at the beginning of this procedure, then return here and continue. 3. Exit the service processor (ASMI) menus and remove the power cords. 4. Disconnect all external cables (parallel, system port 1, system port 2, keyboard, mouse, USB devices, SPCN, Ethernet on the system planar, and so on). Also disconnect all of the external cables attached to the service processor except the Ethernet cable going to the HMC.Go to the next step. v PFW1548-4 Chapter 2. Isolation Procedures 325 1. If this is a deskside system, remove the service access cover. If this is a rack-mounted system, place the drawer into the service position and remove the service access cover. Also remove the front cover. 2. Record the slot numbers of the PCI adapters and I/O expansion cards if present. Label and record the locations of all cables attached to the adapters. Disconnect all cables attached to the adapters and remove all of the adapters. 3. Remove the removable media/disk drive enclosure assembly by pulling out the blue tabs at the bottom of the enclosure, then sliding the enclosure out approximately three centimeters. 4. Remove and label the disk drives from the media/disk drive enclosure assembly. 5. Remove one of the two memory DIMMs quads. Notes: a. Place the memory DIMM locking tabs in the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.6. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 7. Turn on the power using either the HMC or the white button. Does the managed system reach power on at hypervisor standby as indicated on the HMC? NO Go to PFW1548-7. YES Go to the next step.v PFW1548-5 Were any memory DIMMs removed from system backplane? NO Go to PFW1548-8. YES Go to the next step.v PFW1548-6 1. Turn off the power, and remove the power cords. 2. Replug the memory DIMMs that were removed from system backplane in PFW1548-2 in their original locations. Notes: a. Place the memory DIMM locking tabs into the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads in the correct connectors. Refer to “Physical Locations” on page 339 for information on memory DIMMs.3. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 4. Turn on the power using either the HMC or the white button. Does the managed system reach power on at hypervisor standby as indicated on the HMC? NO A memory DIMM in the quad you just replaced in the system is defective. Turn off the power, remove the power cords, and exchange the memory DIMM quad with new or previously removed memory DIMM quad. Repeat this step until the defective memory DIMM quad is identified, or both memory DIMM quads have been exchanged. Note: the system backplane has two memory DIMM quads: Un-P1-C1x-C1 to Un-P1-C1x-C4, and Un-P1-C1x-C6 to Un-P1-C1x-C9. See “Locations” on page 339. If your symptom did not change and both the memory DIMM quads have been exchanged, call your service support person for assistance. 326 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) If the symptom changed, check for loose cards and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-7 One of the FRUs remaining in the system unit is defective.Note: If a memory DIMM is exchanged, ensure that the new memory DIMM is the same size and speed as the original memory DIMM. 1. Turn off the power, remove the power cords, and exchange the following FRUs, one at a time, in the order listed: a. Memory DIMMs. Exchange one quad at a time with new or previously removed DIMM quads b. System backplane, location: Un-P1 c. Power supplies, locations: Un-E1 and Un-E2.2. Plug in the power cords and wait for 01 in the upper-left corner of the control panel display. 3. Turn on the power using either the HMC or the white button. Does the managed system reach power on at hypervisor standby as indicated on the HMC? NO Reinstall the original FRU. Repeat the FRU replacement steps until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-8 1. Turn off the power. 2. Reconnect the system console. Notes: a. If an ASCII terminal has been defined as the firmware console, attach the ASCII terminal cable to the S1 connector on the rear of the system unit. b. If a display attached to a display adapter has been defined as the firmware console, install the display adapter and connect the display to the adapter. Plug the keyboard and mouse into the keyboard connector on the rear of the system unit.3. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 4. If the ASCII terminal or graphics display (including display adapter) is connected differently from the way it was previously, the console selection screen appears. Select a firmware console. 5. Immediately after the word keyboard is displayed, press the number 1 key on the directly attached keyboard, an ASCII terminal or HMC. This activates the system management services (SMS). 6. Enter the appropriate password if you are prompted to do so. Is the SMS screen displayed? NO One of the FRUs remaining in the system unit is defective. Chapter 2. Isolation Procedures 327 Exchange the FRUs that have not been exchanged, in the following order: 1. If you are using an ASCII terminal, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the system backplane, location: Un-P1.2. If you are using a graphics display, go to the problem determination procedures for the display. If you do not find a problem, do the following: a. Replace the display adapter. b. Replace the backplane in which the graphics adapter is plugged. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom.YES Go to the next step.v PFW1548-9 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Use the cam levers to reconnect the disk drive enclosure assembly to the I/O backplane. 4. Reconnect the removable media/disk drive enclosure assembly by sliding the media enclosure toward the rear of the system, then pressing the blue tabs. 5. Plug in the power cords and wait for 01 in the upper-left corner of the operator panel display. 6. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. Optical drive 2. Removable media enclosure. 3. System backplane, Un-P1.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Go to the next step.v PFW1548-10 The system is working correctly with this configuration. One of the disk drives that you removed from the disk drive backplanes may be defective. 328 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Install a disk drive in the media/disk drive enclosure assembly. 4. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 5. Turn on the power. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Exchange the FRUs that have not been exchanged, in the following order: 1. Last disk drive installed 2. Disk drive backplane.Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step with all disk drives that were installed in the disk drive backplane. After all of the disk drives have been reinstalled, go to the next step.v PFW1548-11 The system is working correctly with this configuration. One of the devices that was disconnected from the system backplane may be defective. 1. Turn off the power and remove the power cords. 2. Attach a system backplane device (for example: system port 1, system port 2, USB, keyboard, mouse, Ethernet) that had been removed. After all of the I/O backplane device cables have been reattached, reattached the cables to the service processor one at a time. 3. Plug in the power cords and wait for 01 in the upper-left corner on the operator panel display. 4. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO The last device or cable that you attached is defective. To test each FRU, exchange the FRUs in the following order: 1. Device and cable (last one attached) 2. System backplane, location: Un-P1.If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. Chapter 2. Isolation Procedures 329 If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to Problem Analysis procedures and follow the instructions for the new symptom. YES Repeat this step until all of the devices are attached. Go to the next step.v PFW1548-12 The system is working correctly with this configuration. One of the FRUs (adapters) that you removed may be defective. 1. Turn off the power and remove the power cords. 2. Install a FRU (adapter) and connect any cables and devices that were attached to the FRU. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 5. If the Console Selection screen is displayed, choose the system console. 6. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Go to the next step. YES Repeat this step until all of the FRUs (adapters) are installed. Go to Chapter 5, “Verifying a repair,” on page 557.v PFW1548-13 The last FRU installed or one of its attached devices is probably defective. 1. Make sure the diagnostic CD-ROM is inserted into the optical drive. 2. Turn off the power and remove the power cords. 3. Starting with the last installed adapter, disconnect one attached device and cable. 4. Plug in the power cords and wait for the 01 in the upper-left corner on the operator panel display. 5. Turn on the power using either the HMC or the white button. (If the diagnostic CD-ROM is not in the optical drive, insert it now.) After the system has reached hypervisor standby, activate a Linux or AIX partition by clicking the Advanced button on the Advanced activation screen. On the Advanced activation screen, select Boot in service mode using the default boot list to boot the diagnostic CD-ROM. 6. If the Console Selection screen is displayed, choose the system console. 7. Immediately after the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the ?Please define the System Console? screen displayed? NO Repeat this step until the defective device or cable is identified or all devices and cables have been disconnected. If all the devices and cables have been removed, then one of the FRUs remaining in the system unit is defective. To test each FRU, exchange the FRUs in the following order: 1. Adapter (last one installed) 2. System backplane, location: Un-P1. 330 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. YES The last device or cable that you disconnected is defective. Exchange the defective device or cable then go to the next step.v PFW1548-14 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. Select Advanced Diagnostics Routines. 4. If the terminal type has not been defined, you must use the option Initialize Terminal on the FUNCTION SELECTION menu to initialize the AIX diagnostic environment before you can continue with the diagnostics. This is a separate operation from selecting the console display. 5. If the NEW RESOURCE screen is displayed, select an option from the bottom of the screen. Note: Adapters and devices that require supplemental media are not shown in the new resource list. If the system has adapters or devices that require supplemental media, select option 1. 6. When the DIAGNOSTIC MODE SELECTION screen is displayed, press Enter. 7. Select All Resources. (If you were sent here from step PFW1548-18, select the adapter or device that was loaded from the supplemental media). Did you get an SRN? NO Go to step PFW1548-16. YES Go to the next step.v PFW1548-15 Look at the FRU part numbers associated with the SRN. Have you exchanged all the FRUs that correspond to the failing function codes (FFCs)? NO Exchange the FRU with the highest failure percentage that has not been changed. Repeat this step until all the FRUs associated with the SRN have been exchanged or diagnostics run with no trouble found. Run diagnostics after each FRU is exchanged. Go to Chapter 5, “Verifying a repair,” on page 557. YES If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.v PFW1548-16 Does the system have adapters or devices that require supplemental media? NO Go to step the next step. YES Go to step PFW1548-18.v PFW1548-17 Consult the PCI adapter configuration documentation for your operating system to verify that all adapters are configured correctly. Go to Chapter 5, “Verifying a repair,” on page 557. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. v PFW1548-18 1. Select Task Selection. Chapter 2. Isolation Procedures 331 2. Select Process Supplemental Media and follow the on-screen instructions to process the media. Supplemental media must be loaded and processed one at a time. Did the system return to the TASKS SELECTION SCREEN after the supplemental media was processed? NO Go to the next step. YES Press F3 to return to the FUNCTION SELECTION screen. Go to step PFW1548-14, substep 4 on page 331.v PFW1548-19 The adapter or device is probably defective. If the supplemental media is for an adapter, replace the FRUs in the following order: 1. Adapter 2. System backplane, location: Un-P1. If the supplemental media is for a device, replace the FRUs in the following order: 1. Device and any associated cables 2. The adapter to which the device is attached Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the Problem Analysis procedures and follow the instructions for the new symptom. Go to Chapter 5, “Verifying a repair,” on page 557. End of procedure. 332 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Collect serviceable events in Integrated Virtualization Manager Use this procedure when using the Integrated Virtualization Manager (IVM) to check for serviceable events on your system. 1. Did you receive a reference code on the IVM console? Yes: Record the reference code, and return to the procedure that sent you here to further isolate the problem. No: Continue with the next step to check for serviceable events.2. In the IVM navigation bar, under Service Management, select Manage Serviceable Events. 3. In the Serviceable Event Selection Criteria field, select the criteria for the serviceable events that you want to view and click Apply. The serviceable events are displayed in a table. The table contains a list that shows all serviceable events matching your selection criteria. 4. Select one of the following options to perform an action on your serviceable events. v To view properties, continue with step 5. v To view associated FRUs, continue with step 6. v To close a serviceable event, continue with step 7. v To view additional details for a serviceable event, continue with step 8.5. To view properties for a serviceable event, select the serviceable event that you want to view properties for and select Properties. The Serviceable Event Comments window opens. 6. To view associated field replaceable units (FRUs), select the serviceable event that you want to view FRUs for, and select View associated FRUs. The serviceable event window opens. 7. To close a serviceable event, select one or more of the serviceable events that you want to close, and select Close Events. The Serviceable Event Comments window opens. a. Enter comments if necessary, and click Close Events. b. On the Close Serviceable Event dialog box, click Yes to close the serviceable event.8. To view additional details for a serviceable event, select the serviceable event that you want to view additional details for and select View additional details. The Serviceable Event Comments window opens. This ends this procedure. Chapter 2. Isolation Procedures 333 334 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 3. Locating FRUs You can identify the enclosure in which a FRU is plugged by its location code. The first character of the location code is always U followed by a four character feature code or enclosure type as shown in the following example: U789C.001.10ABCDE-P3-C31 In this example, the enclosure type is 789C. The next three characters of the location code indicate the model of the enclosure (001 in the example). The next string of characters provides the enclosure serial number (10ABCDE in the example). Using this information, locate the enclosure with the FRU you want to replace. Find the enclosure type in the following table and go to the service guide for that enclosure: Feature Code (Utttt) Model Enclosure Service Guide 789C 8203-E4A, 9407-M15, or 9408-M25 Go to “Physical Locations” on page 339. 1300 7311-D11 Refer to the Service Guide for the 7311 Model D11, SA76-0115. 5796 7314-G30 Refer to the Service Guide for the 7314 Model G30, SA76-0083. 0595 7311-D20 Refer to the Service Guide for the 7311 Model D20, SA76-0116. 5786 5787 7031-T24 Refer to the Service Guide for the 7031 Models D24 and T24, SA76-0118. 5788 5789 7031-D24 Refer to the Service Guide for the 7031 Models D24 and T24, SA76-0118. 5886 5886 Refer to the Service Guide for the 5886 Expansion Unit, SA76-0121. Locate the FRU The string of characters following the enclosure serial number identifies the FRU location within the enclosure: U7879.001.10ABCDE-P3-C31 In this example, P3-C31 is the location of the FRU to be replaced. Use the graphics and tables to locate the FRU and link to its Removal and Replacement procedure. “Using location codes” on page 336The information provided in this topic defines specific sections of a location code string. Use this topic when you need help in understanding the meaning of a location code. “Physical Locations” on page 339Use the following illustrations to help you map a location code to a position on the server. © Copyright IBM Corp. 2008 335 Using location codes The information provided in this topic defines specific sections of a location code string. Use this topic when you need help in understanding the meaning of a location code. Use the following table to link to a specific topic that you need additional information on when reading your location code. Location code topics that might be of interest Description “Location code overview” Contains background information on the use of location codes. “Physical location codes” on page 337 Provides a definition for physical location code. “Logical location codes” on page 337 Provides a definition of what a logical location code is. “Location code format” on page 337 Provides descriptive information of the Un value in the location code string. For example U789C.001. “Location code labels” on page 338 Provides a table that identifies and defines the location code labels. The location code labels begin with an alpha character and follow the system serial number. For example, U789C.001.10ABCDE-P3-C31-T2-L23. (The system serial number is the ?10ABCDE? in the previous example.) The P3, C31, T2, and L23 all contain an alpha character that is identified in the Location code labels table. “Worldwide unique identifier” on page 338 Provides a definition for the world unique identifier. This group of digits follows the resource code labels and always begins with the letter W. Location code overview Servers (system unit and expansion units) use physical location codes to provide mapping of replaceable units. Location codes are produced by the server’s firmware, which structures them so that they can be used to identify specific parts in a system. The location code format is the same for all servers. If you are working with a specific location code, the unit type and model immediately follow the first character (Utttt.mmm). Match the unit type and model to a link, as shown in the Unit type and locations table. If the location code ends with -Txx-Lxx, the server’s firmware could not identify the physical location. When a physical location cannot be identified, a logical location code is provided. Where logical location codes occur in enclosures, the locations article for the enclosure has the known conversions listed. For logical location codes with no conversion, contact your next level of support. If the location code begins with UTMPx, the expansion I/O unit’s machine type, model and serial number have not been set yet and this is a temporary unit identifier. To identify the unit, examine the display panels on all of the expansion I/O units connected to the server until you find one with the same characters in the first 5 digits of the top line in the unit’s display. Record the unit’s real machine type and model from the unit label. Match the unit’s machine type and model in the Unit type and locations table and follow the link to determine the service information. Note: Locations for units that are not in the preceding list are either not supported or there is a problem in the firmware. Contact your next level of support. 336 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Physical location codes Physical location codes provide a mapping of logical functions and components, such as backplanes, removable modules, connectors, ports, cables, and devices, to their specific locations within the physical structure of the server. Logical location codes If the physical location cannot be mapped to a physical location code, the server’s firmware will generate a logical location code. A logical location code is a sequence of location labels that identify the path that the system uses to communicate with a given resource. Note: A resource has as many logical location codes as it has logical connections to the system. For example, an external tape device connected to two I/O adapters will have two logical location codes. An example of a logical location code is: U789C.001.10ABCDE-P3-C31-T2-L23 The first part of the location code (through the T2 label) represents the physical location code for the resource that communicates with the target resource. The remainder of the logical location code (L23) represents exactly which resource is indicated. Location code format The location code is an alphanumeric string of variable length, consisting of a series of location identifiers, separated by a dash. An example of a physical location for a fan is Un-A1. The first position, represented by Un (where n is equal to any string contained between the U and the hyphen) in the preceding example, is displayed in one of the following forms: Note: In location codes the U is a constant digit, however, the numbered positions following the U are variables and are dependent on your server. Each column defines the numbers following the U in the beginning of the location code. Machine type and model number in its location codes Feature codes and sequence numbers in its location code Utttt.mmm.sssssss-A1 Uffff.ccc.sssssss-A1 The leftmost code is always U The leftmost code is always U tttt represents the unit type of the enclosure (drawer or node) ffff represents the feature code of the enclosure (drawer or node) mmm represents the model of the enclosure ccc represents the sequence number of the enclosure sssssss represents the serial number for the enclosure sssssss represents the serial number of the enclosure Note: The mmm or ccc number might not be displayed on all location codes for all servers. If the mmm value is not displayed, the location code is displayed in one of the following forms: v Utttt.sssssss-A1 v Uffff.sssssss-A1 The location code is hierarchical; that is, each location identifier in the string represents a physical part. The order (from left to right), in which each identifier is shown, allows you to determine which parts contain other parts in the string. Chapter 3. Locating FRUs 337 The - (dash) separator character represents a relationship between two components in the unit. In the example of the fan, whose location code is Un-A1, the - (dash) shows that the fan (A1) is contained in the base unit (or Un). Modules, adapters, cables, and devices are all parts that are plugged into another part. Their location codes will always show that they are plugged into another part as components of the server. Another example, Un-P1-C9 is a memory DIMM, with (C9) plugged into a backplane (P1), which is inside the unit (Un). For more information about the various location code label prefixes, refer to Location code labels. Location code labels The following table describes the location code label prefixes. Note: These labels apply to system units only. Table 44. Location code label prefixes for system units Prefix Description Example A Air-moving device Fan, blower C Card connector IOP, IOA, DIMM, processor card D Device Diskette, control panel E Electrical Battery, power supply, ac charger L Logical path SAS target IDE address, Fibre Channel LUN N Horizontal placement for an empty rack location P Planar System backplane T Port, external cable U Unit V Virtual planar W Worldwide unique ID X EIA value for an empty rack location Y Firmware FRU Worldwide unique identifier A worldwide unique identifier location label consists of the prefix ?W? followed by a maximum of 16 uppercase hexadecimal digits with no leading zeros. A location code may or may not consist of a worldwide unique identifier. When present, the worldwide unique identifier location label follows the location label of the resource that interfaces with the resource having the worldwide unique identifier, usually a port. 338 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Physical Locations Use the following illustrations to help you map a location code to a position on the server. “Locations” “Addresses” on page 347You can find part locations using location codes.Locations Use this information to help you map a location code to a position on the unit. Notes: v To activate the identify LED, see “Activating and deactivating LEDs” on page 583. v After replacing any part on a server or expansion unit, go to Chapter 5, “Verifying a repair,” on page 557. v Information regarding parts can be found in Chapter 8, “Part Information,” on page 691.The following diagrams show field replaceable unit (FRU) layouts in the system. Use these diagrams with the following tables. Rack views Figure 4. Rack front view with 3.5 inch DASD Chapter 3. Locating FRUs 339 Figure 5. Rack top view, without fans Figure 6. Rack top view, with fans 340 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Deskside views Figure 7. Rack rear view Figure 8. Deskside front view with 3.5 inch DASD Chapter 3. Locating FRUs 341 Figure 9. Deskside side view without fans Figure 10. Deskside side view with fans 342 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) The following table provides location codes for parts that comprise the server. Table 45. FRU Location Table. Failing item name Physical location code Identify LED Failing item removal and replacement procedures System unit Un Fans Fan 1 Un-A1 Yes “Fan” on page 409 Fan 2 Un-A2 Yes “Fan” on page 409 Fan 3 Un-A3 Yes “Fan” on page 409 Fan 4 Un-A4 Yes “Fan” on page 409 Power supplies Power supply 1 Un-E1 Yes “Power supply” on page 471 Power supply 2 Un-E2 Yes “Power supply” on page 471 Backplanes Figure 11. Deskside rear view Chapter 3. Locating FRUs 343 Table 45. FRU Location Table (continued) . Failing item name Physical location code Identify LED Failing item removal and replacement procedures Input/output backplane with embedded: v RIO Hub/HSL NIC v RIO/HSL link v Ethernet controller v USB controller (AIX or Linux only) v SAS controllers v Logic oscillator v Service processor v Processor 1 v Processor 2 v SPCN v Time-of-day Un-P1 Yes “System backplane” on page 493 Time-of-day battery Un-P1-E1 “Battery” on page 353 DASD and media backplane Un-P2 Yes “Disk unit backplane” on page 391 DVD Un-P2-D2 “Slimline media” on page 489 System backplane cables System serial cable 1 (top connector) Un-P1-T3 “External cables” on page 352 System serial cable 2 (bottom connector) Un-P1-T4 “External cables” on page 352 Embedded USB front cable Un-D1-T1 “External cables” on page 352 Embedded USB rear cable 1 Un-P1-T7 “External cables” on page 352 Embedded USB rear cable 2 Un-P1-T8 “External cables” on page 352 SPCN cable 1 (top connector) Un-P1-T1 “External cables” on page 352 SPCN cable 2 (bottom connector) Un-P1-T2 “External cables” on page 352 HMC 1 (left connector) Un-P1-T5 “External cables” on page 352 HMC 2 (right connector) Un-P1-T6 “External cables” on page 352 Dual 1 GB Ethernet cable 1 Un-P1-C7-T1 “External cables” on page 352 Dual 1 GB Ethernet cable 2 Un-P1-C7-T2 “External cables” on page 352 Quad 1 GB Ethernet port 1 Un-P1-C7-T1 “External cables” on page 352 Quad 1 GB Ethernet cable 2 Un-P1-C7-T2 “External cables” on page 352 Quad 1 GB Ethernet cable 3 Un-P1-C7-T3 “External cables” on page 352 Quad 1 GB Ethernet cable 4 Un-P1-C7-T4 “External cables” on page 352 External SAS cable Un-P2-T1 “External cables” on page 352 344 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 45. FRU Location Table (continued) . Failing item name Physical location code Identify LED Failing item removal and replacement procedures Internal SAS cable Un-P1-T9 Processor and processor regulator Voltage regulator 1A for processor 1 Un-P1-C13 Yes Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531 Voltage regulator 1B for processor 1 Un-P1-C13 Yes Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531 Voltage regulator 2 for processor 1 Un-P1-C18 Yes “Voltage regulator module - single” on page 539 Voltage regulator 1A for processor 2 Un-P1-C19 Yes Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531 Voltage regulator 1B for processor 2 Un-P1-C19 Yes Two VRMs are handled as a pair; they have one location code. If a VRM fails, both LEDs are lit at the same time. See “Voltage regulator module pair” on page 531 Voltage regulator 2 for processor 2 Un-P1-C20 Yes “Voltage regulator module - single” on page 539 VPD card Un-P1-C11 Yes “Vital product data (VPD) card” on page 547 Adapters PCI-E adapter in slot 1 Un-P1-C1 Yes “PCI adapter” on page 438 PCI-E adapter in slot 2 Un-P1-C2 Yes “PCI adapter” on page 438 PCI-E adapter in slot 3 Un-P1-C3 Yes “PCI adapter” on page 438 PCI-X adapter in slot 4 Un-P1-C4 Yes “PCI adapter” on page 438 PCI-X adapter in slot 5 Un-P1-C5 Yes “PCI adapter” on page 438 Ethernet card Un-P1-C7 Yes “PCI adapter” on page 438 RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 1 Un-P1-C8 Yes “GX adapter” on page 415 RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 2 Un-P1-C6 Yes “GX adapter” on page 415 RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 1 port 1 Un-P1-C8-T1 Yes, for RIO adapter controller No, for 12X Channel Attach adapter “External cables” on page 352 Chapter 3. Locating FRUs 345 Table 45. FRU Location Table (continued) . Failing item name Physical location code Identify LED Failing item removal and replacement procedures RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 1 port 2 Un-P1-C8-T2 Yes, for RIO adapter controller No, for 12X Channel Attach adapter “External cables” on page 352 RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 2 port 1 Un-P1-C6-T1 Yes, for RIO adapter controller No, for 12X Channel Attach adapter “External cables” on page 352 RIO/HSL/GX Dual-Port 12X Channel Attach adapter controller slot 2 port 2 Un-P1-C6-T2 Yes, for RIO adapter controller No, for 12X Channel Attach adapter “External cables” on page 352 Auxiliary cache card for system backplane Un-P1-C9 Yes “Auxiliary cache card” on page 483 Auxiliary cache card battery Un-P1-C9-E1 No “Cache battery pack” on page 366 RAID enablement card for system backplane Un-P1-C10 Yes “RAID enablement card” on page 477 Thermal management card Un-P1-C12 Yes “Thermal power management device (TPMD) card” on page 525 Memory modules Memory module 1 Un-P1-C14 Yes “Memory DIMM” on page 430 Memory module 2 Un-P1-C15 Yes “Memory DIMM” on page 430 Memory module 3 Un-P1-C16 Yes “Memory DIMM” on page 430 Memory module 4 Un-P1-C17 Yes “Memory DIMM” on page 430 Memory module 5 Un-P1-C21 Yes “Memory DIMM” on page 430 Memory module 6 Un-P1-C22 Yes “Memory DIMM” on page 430 Memory module 7 Un-P1-C23 Yes “Memory DIMM” on page 430 Memory module 8 Un-P1-C24 Yes “Memory DIMM” on page 430 Device physical locations Removable media Un-P2-D1 No “Media device” on page 421 Removable media Un-P2-D2 No “Media device” on page 421 Disk drive 1 Un-P2-D3 Yes “Disk Drive” on page 379 346 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 45. FRU Location Table (continued) . Failing item name Physical location code Identify LED Failing item removal and replacement procedures Disk drive 2 Un-P2-D4 Yes “Disk Drive” on page 379 Disk drive 3 Un-P2-D5 Yes “Disk Drive” on page 379 Disk drive 4 Un-P2-D6 Yes “Disk Drive” on page 379 Disk drive 5 Un-P2-D7 Yes “Disk Drive” on page 379 Disk drive 6 Un-P2-D8 Yes “Disk Drive” on page 379 Control panel Control panel Un -D1 “Control panel” on page 375 Temperature sensor Un -D1 “Control panel” on page 375 Server firmware Server firmware Un-Y1 Addresses You can find part locations using location codes. Use the address to find the location for the 9407-M15 and the 9408-M25, then go to “Physical Locations” on page 339 to find additional information. Table 46. IOA, and device address information for the 9407-M15 and the 9408-M25 running i5/OS Position Possible failing item DSA (BBBBCcbb) Unit address Un -P1 Embedded SAS controller 02001000 (primary) 7FFFFFFF Un -P1-C1 PCI-E adapter 02040000 Not applicable Un -P1-C2 PCI-E adapter 02050000 Not applicable Un -P1-C3 PCI-E adapter 02070000 Not applicable Un -P1-C4 PCI-X adapter 02021000 Not applicable Un -P1-C5 PCI-X adapter 02031000 Not applicable Un -P1-C9 PCI auxiliary cache card 02060000 7FFFFFFF Un -P1-C10 RAID card 02001000 7FFFFFFF Un -P2-D1 Removable media 02001000 020000FF Un -P2-D2 Removable media 02001000 030000FF Un -P2-D3 Disk drive 1 02001000 040200FF Un -P2-D4 Disk drive 2 02001000 040300FF Un -P2-D5 Disk drive 3 02001000 040400FF Un -P2-D6 Disk drive 4 02001000 040500FF Un -P2-D7 Disk drive 5 02001000 040600FF Un -P2-D8 Disk drive 6 02001000 040700FF Chapter 3. Locating FRUs 347 348 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 4. Removing and replacing FRUs Use these procedures to remove and replace failing parts. Before you begin a replacement, perform these tasks: 1. If you are performing a replacement procedure that might put your data at risk, ensure, if possible, that you have a current backup of your system or logical partition (including operating systems, licensed programs, and data). 2. Review the installation or replacement procedure for the feature or part. 3. Note the significance of color on your system. Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware to remove it from or install it in the system, open or close a latch, and so on. terra-cotta might also indicate that the part can be removed and replaced with the system or logical partition power on. 4. Ensure that you have access to a medium, flat-blade screwdriver. 5. If parts are incorrect, missing, or visibly damaged, contact the provider of your parts, or your next level of support. Attention: Failure to follow the step-by-step sequence for FRU removal and installation may result in FRU or system damage. Use the following precautions whenever you handle electronic components or cables. v Attach a wrist strap to an unpainted metal surface of your hardware to prevent Electrostatic Discharge (ESD) from damaging your hardware. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. v Keep all electronic components in the shipping container or envelope until you are ready to install them. v If you remove, then reinstall an electronic component, temporarily place the component on an ESD pad or blanket. DANGER © Copyright IBM Corp. 2008 349 When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) “External cables” on page 352Use this procedure to service the external cables. “Battery” on page 353Use this procedure to service the battery. “Cache battery pack” on page 366Use this procedure to service the cache battery pack. “Control panel” on page 375Use this procedure to service the control panel. “Disk Drive” on page 379Use this procedure to replace the disk drive. “Disk unit backplane” on page 391Use this procedure to service the disk unit backplane. “Ethernet card” on page 403Use this procedure to service the Ethernet card. “Fan” on page 409Use this procedure to service the fan. “GX adapter” on page 415Use this procedure to service the GX adapter. 350 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) “Media device” on page 421Use this procedure to service the media device. “Memory DIMM” on page 430Use this procedure to service a memory DIMM. “PCI adapter” on page 438Use this procedure to service the PCI adapter. “Power supply” on page 471Use this procedure to service the power supply. “RAID enablement card” on page 477Use this procedure to service the RAID enablement card. “Auxiliary cache card” on page 483Use this procedure to service the auxiliary cache card. “Slimline media” on page 489Use this procedure to service the Slimline media. “System backplane” on page 493Use this procedure to replace the system backplane. “Thermal power management device (TPMD) card” on page 525Use this procedure to service the thermal power management device (TPMD) card. “Voltage regulator module pair” on page 531Use this procedure to service the voltage regulator module pair. “Voltage regulator module - single” on page 539Use this procedure to service a single voltage regulator module. “Vital product data (VPD) card” on page 547Use this procedure to service the vital product data (VPD) card. Chapter 4. Removing and replacing FRUs 351 External cables Use this procedure to service the external cables. 1. Remove the external cable Trace the cable and record the system location at the other end, and then remove the cable. 2. Install the external cable Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.Install the cable, then route and install the other end of the cable to the system location that was previously recorded. 3. Go to Chapter 5, “Verifying a repair,” on page 557. 352 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Battery Use this procedure to service the battery. 1. Access the ASMI The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI may also be referred to as the service processor menus. To access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 4. Log on to the ASMI using the administrator userid and password.When you have accessed the ASMI, continue with the next step. 2. Check and record the server settings Prior to replacing this FRU, check and record all server settings as you may need to update the system after you update this FRU. This can be done through the Advanced System Management Interface (ASMI) . 1. Check and record the server firmware level indicated on the right edge of the status frame, the area where the Log out button is located. You may need to update the system to the latest server firmware code level after you update this FRU. 2. Check and record the service processor settings you previously set using ASMI. You will need to reset these service processor settings after you update this FRU or the default settings will be used. Settings to record include the following: v Power/Restart Control settings. v System Service Aids settings. v System Configuration settings, System Name setting. v Network Services settings. v Performance Setup settings. v Login Profile settings.3. Record any service processor settings you may have set using operating system commands. 3. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. Chapter 4. Removing and replacing FRUs 353 4. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 5. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 354 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 7. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. Chapter 4. Removing and replacing FRUs 355 8. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. 9. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. 356 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. Remove the time-of-day battery Attention: When removing the battery, do not use metallic tool to disengage it from its slot. Remove the time-of-day battery (A) from the system backplane, being careful to observe the polarity (+/-) of the battery. v Do not: v Burn or charge the battery (this to avoid possible explosion, battery contains lithium). v Throw or immerse into water. v Heat to more than 100 degrees C (212 degrees F). v Repair or disassemble.Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local regulations. In the United States, IBM has a process for the collection of this battery. For information, call 1-800-426-4333. Have the IBM part number for the battery unit available when you call. 11. Insert the time-of-day battery Insert the time-of-day battery (A) in the slot on the system backplane, being sure to insert the battery with the polarity (+/-) the same as the battery that was removed. Chapter 4. Removing and replacing FRUs 357 12. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. 13. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 358 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 14. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 15. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. Chapter 4. Removing and replacing FRUs 359 16. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. 17. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 18. Restore network connectivity The service processor must connect to the network to be accessible from the HMC. 360 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) How will the IP Address of the Service Processor be managed? Obtain an IP Address automatically from a DHCP server Configure a static IP Address ? Go to step20. 19. Restore service processor settings Enable network access to the service processor by proceeding as follows: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address, and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to enable network access to the service processor. Perform the following steps: 1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is connected to the HMC port on the system backplane. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Do you have network access to the service processor? Yes No Go to step 22 on page 362. Go to step 24 on page 362 20. Restore service processor settings through the ASMI The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, controlling the system power, and setting service processor network ports. The ASMI may also be referred to as the service processor menus. The ASMI can be accessed through https. As the network connection uses static IP address assignments, perform the following: 1. To know your service processor network ports, if necessary, you can use the control panel in manual mode. Refer to service functions to get control panel documentation in that case. 2. Connect a client with a Web browser directly to the service processor network ports you previously noted. It could be something like the following URLs: v https://169.254.2.147 v https://169.254.3.1473. If not already connected, connect all system power cables by plugging them into power outlets.Note: Do not start the system at this time. 4. Log on to the ASMI with the ?admin? User ID and ?admin? default Password. 5. Change the ?admin? User ID’s Password and the ?general? User ID’s Password when prompted. 6. To configure network access perform the following: a. Click on Network Configuration under the Network Services node. b. On the right pane, configure eth0 and eth1 network interfaces, choose for Type of IP Address ’Static’, set a Host name, an IP address, a Subnet mask. Chapter 4. Removing and replacing FRUs 361 21. Restore service processor settings With the network connection now configured to use static IP address assignments, try to access the service processor network ports. Do you have network access to the service processor? Yes No ? Go to step 25 on page 363 22. Access the ASMI If you are already connected to the ASMI, click Next to continue. Otherwise to access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 23. Go to step 27 on page 363. 24. Restore service processor settings If the network connection uses DHCP to establish an IP address, but does not use an HMC as the DHCP server, perform the following: 1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address to the service processor. 2. Ensure that the service processor is connected to the existing service network by verifying that the network cable is connected to the network port. 3. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Do you have network access to the service processor? No Yes ? Go to step 22 362 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 25. Restore service processor settings You have indicated that you are still not able to access the service network. You will need to reset the service processor by pressing the pin-hole switch (E) on the operator panel. 26. Go to step 18 on page 360. 27. Restore service processor settings Change the ?admin? User ID’s password when prompted by performing the following steps: 1. In the navigation area of the ASMI, expand Login Profile. 2. Select Change Password under Login Profile. 3. Specify the required information, and click Continue.As a security measure, you are required to enter your current user password in the Current password for current user ID field. Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the ?admin? User ID is ?admin?. After your initial login to the ASMI, the ?admin? password must be changed. Chapter 4. Removing and replacing FRUs 363 28. Restore service processor settings Set the system name. The system name can be changed to any valid ASCII string. It does NOT have to follow the initialized MachineType-Model#-Serial# format. To change the system name, perform the following: 1. In the navigation area of the ASMI, expand System Configuration. 2. Select System Name under System Configuration. 3. Enter the desired system name. 4. Click Save settings to update the system name to the new value.The new system name is displayed in the status frame, the area where the Log out button is located. If another method, such as the HMC, is used to change the system name, the status frame does not reflect the change. 29. Restore service processor settings Set the time-of-day. 1. In the navigation area of the ASMI, expand System Configuration. 2. Select Time of Day under System Configuration. 3. If the system is powered off, the right pane displays a form that shows the current date (month, day, and year) and time (hours, minutes, seconds). 4. Change either the date or the time or both, and click Save settings. 30. Restore service processor settings Reenter any of the following settings that were previously changed through the ASMI, unless you want to use the default settings. Settings to be set include the following: 1. Power/Restart Control settings. 2. System Service Aids settings. 3. System Configuration settings, if not already done. 4. Network Services settings. 5. Performance Setup settings. 6. Login Profile settings. 364 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 31. Restore service processor settings 1. Reset any service processor settings that you may have set using operating system commands. You recorded these settings at the beginning of this procedure. 2. If you choose to reset the HMC Access password, perform the following: 3. a. From the HMC GUI (preferred method): b. 1) Expand the Systems Management folder in the navigation tree pane. 2) Double click the Servers folder. 3) Use the checkbox to select a server, in the central panel. 4) Under the Operations folder, select Change Password. 5) Provide the desired information.c. d. Or, from the HMC command line, type: e. /usr/hmcrbin/chsyspwd -m managedsystem -t access --passwd --newpasswd newpassword where: v The value for managedsystem is the new service processor’s managed system name. v No value for --passwd is entered thereby allowing authentication. v The value for newpasswd is the newpassword value. 32. Restore partition data on the service processor, if you had a partition in your system Profile data stored in the managed server has been cleared or corrupted. To recover profile data using the HMC, proceed as follows: 1. Expand the Systems Management folder in the navigation tree pane. 2. Double click the Servers folder. 3. Use the checkbox to select a server, in the central panel. 4. In the Configuration folder if you have the Manage Partition Data folder, select Restore. 33. Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 365 Cache battery pack Use this procedure to service the cache battery pack. 1. Cache battery pack replacement procedure information Attention: To prevent possible data loss, ensure that the cache battery pack is in an error state before replacing it. This will ensure all cache data is written to disk and that it is safe to replace the cache battery pack. By following this procedure, you are going to: 1. Force the cache battery pack into an error state. 2. Perform the cache battery pack replacement.You will be allowed to resume this procedure later if you cannot ensure that the cache battery pack is in an error state. Forcing the cache battery pack into an error state will result in the following: v The system logs an error. v Data caching becomes disabled on the selected controller. v System performance could become significantly degraded until the cache battery pack is replaced on the selected controller. v The battery pack can be safely replaced field on the controller rechargeable battery information screen will show Yes. v The cache data present LED will stop flashing. v The cache battery pack requires replacement.Ensure that you have the correct type of cache battery pack to perform the replacement. 2. Table 47. Is the system powered off? No Yes ? Go to step9 on page 368 3. Select the operating system: AIX Linux i5/OS ? Go to step 8 on page 368. Go to step6 on page 367. 366 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 4. Force the cache battery pack into an error state on AIX To force the cache battery pack into an error state in order to prevent possible data loss, proceed as follows on the system or partition containing the adapter: 1. Navigate to the IBM SAS Disk Array Manager as described below: a. At the command prompt, type smit, and press Enter. b. Select Devices. c. Select Disk Array. d. Select IBM SAS Disk Array. e. Select IBM SAS Disk Array Manager from the menu with options for configuring and managing the IBM SAS RAID Controller.2. Select Diagnostics and Recovery Options. 3. Select Controller Rechargeable Battery Maintenance. 4. Select Force Controller Rechargeable Battery Error. 5. Select the IBM SAS RAID Controller whose battery you want to replace, using this option places the battery into the error state, which requires it to be replaced. 6. Press Enter. 7. Determine that it is safe to replace the cache battery pack. Refer to How to display rechargeable battery information below. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced. You may need to reselect the option to Display Controller Rechargeable Battery Information multiple times as it may take several minutes before it is safe to replace the cache battery pack.How to display rechargeable battery information 1. Navigate to the IBM SAS Disk Array Manager as described above. 2. Select Diagnostics and Recovery Options. 3. Select Controller Rechargeable Battery Maintenance. 4. Select Display Controller Rechargeable Battery Information. 5. Select the IBM SAS RAID Controller. 5. Go to step 9 on page 368. 6. To prevent data loss, ensure that the cache battery pack is in an error state To force the cache battery pack into an error state in order to prevent possible data loss, proceed as follows on the system or partition containing the adapter: 1. Be sure that you are signed on to the system with at least service level authority. 2. Type strsst on the command line and press Enter. 3. Type your service tools userid and service tools password on the System Service Tools (SST) Sign On display. Press Enter. 4. Select Start a Service Tool from the System Service Tools (SST) display. Press Enter. 5. Select Hardware Service Manager from the Start a Service Tool display. Press Enter. 6. Select Work with resources containing cache battery packs from the Hardware Service Manager display. Press Enter. 7. Select Force battery pack into error state for the I/O card you are working with from the Work with Resources containing cache battery packs display. Press Enter. 8. On the Force Battery Packs Into Error State display, verify that the correct I/O adapter has been selected and press the function key that confirms your choice. 9. Return to the Work with Resources containing cache battery packs display and select Display battery information. Ensure that the field Safe to replace cache battery = YES. Note: This may take several minutes and you may need to press the refresh key to see the field update. Chapter 4. Removing and replacing FRUs 367 7. Go to step 9. 8. Force the cache battery pack into an error state on Linux To force the cache battery pack into an error state in order to prevent possible data loss, proceed as follows on the system or partition containing the adapter: 1. Run the iprconfig utility by typing iprconfig. 2. Select Work with disk unit recovery. 3. Select Work with resources containing cache battery packs. 4. Select your adapter and type 2. Then press Enter to force the battery error. Note: Using this option places the battery into the error state, which requires it to be replaced. 5. If you are sure you want to force a battery error, type c to confirm. If you do not want to force a battery error, type q to cancel. 6. Determine that it is safe to replace the cache battery pack. Refer to How to display rechargeable battery information below. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced. You may need to reselect the option to Display Controller Rechargeable Battery Information multiple times as it may take several minutes before it is safe to replace the cache battery pack.How to display rechargeable battery information 1. Run the iprconfig utility by typing iprconfig. 2. Select Work with disk unit recovery. 3. Select Work with resources containing cache battery packs. 4. Select your adapter and type 1. 5. Then press Enter to display battery information. 9. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 368 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 11. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. Chapter 4. Removing and replacing FRUs 369 12. Check cache data present LED Examine the LED (A) near the cache battery pack on the RAID enablement card. This LED is called the cache data present LED. If the LED is flashing, there is cache data that has not been written to disk. If the LED is not flashing, all cache data has been written to disk. Is the Cache Battery LED flashing? Yes, the LED is flashing No, the LED is not flashing ? Go to step 14 on page 371 13. Stop removing the cache battery pack Attention: v The flashing LED indicates that there is cache data that has not been written to disk. v This data will be lost if the battery is removed at this time. v Do not replace the cache battery pack at this time.Complete this service procedure without replacing the cache battery pack. If the system is powered on: 1. Retry the procedure from the beginning to ensure that all cache data is written to disk before the battery is replaced.If the system is powered off: 1. Power on the system and activate the partitions. 2. Retry the procedure from the beginning but this time with power on to ensure that all cache data is written to disk before the battery is replaced. Go to step 21 on page 372. 370 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 14. Remove the cache battery pack Attention: Do not remove battery if LED (C) is flashing; cache data exists. 1. Move the cache battery lever (A) away from the connector to disengage the battery from the connector. 2. Slide the cache battery pack (B) out of the mounting guides and remove it from the controller. 15. Install the cache battery pack Attention: When installing the cache battery pack, ensure that the cache battery pack is disconnected for at least 60 seconds before connecting the new battery. This is the minimum amount of time needed for the card to recognize that the battery has been replaced. 1. Slide the cache battery pack (B) into the mounting guides on the controller until it is seated in the battery connector (C). 2. Move the lever (A) to the latched position to fully seat the battery into the connector. 16. Table 48. Is the system powered off? No Yes ? Go to step 21 on page 372. Chapter 4. Removing and replacing FRUs 371 17. Re-enable the cache Select the operating system: AIX Linux i5/OS ? The write cache is automatically enabled. Go to step 21. Go to step 20. 18. Re-enabling caching on AIX Restart the adapter’s write cache by doing the following: 1. Navigate to the IBM SAS Disk Array Manager by using the steps in Using the IBM SAS Disk Array Manager. 2. Select Diagnostics and Recovery Options. 3. Select Controller Rechargeable Battery Maintenance. 4. Select Start Adapter Cache. 5. Select your IBM SAS RAID Controller whose battery you just replaced. 6. Press Enter. 19. Go to step 21. 20. Re-enabling caching on i5/OS Restart the adapter’s write cache by doing the following: 1. Return to the Work with Resources containing Cache Battery Packs display and select Start IOA cache. Press Enter. 2. Verify that a message stating that the Cache was started has been received. 21. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 372 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 22. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 23. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 373 24. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 25. Go to Chapter 5, “Verifying a repair,” on page 557. 374 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Control panel Use this procedure to service the control panel. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 375 3. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. 4. Remove the operator panel 1. Release the operator panel by pushing the release tab (A) to the left. 2. Grasp the edges of the operator panel and pull it partially out of the bay. 3. Press the locking pins (C) inward and gently pull the operator panel completely out of the bay, taking care not to pull on the cables (B). 4. Rotate the operator panel downward if needed, and disconnect the operator panel cables (B) from the operator panel. 5. Record the operator panel cables orientation, for reconnecting them the same way. 376 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Install the operator panel 1. Carefully guide both operator panel cables (A) through the cables conduit in the extension of the operator panel while gently sliding the operator panel into the operator panel bay. 2. Connect the operator panel cables (A) to the operator panel. 3. Rotate the front of the operator panel upward and continue sliding it into the bay until the operator panel engages and locks into place. 6. Install the front cover 1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door. Chapter 4. Removing and replacing FRUs 377 7. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 8. Go to Chapter 5, “Verifying a repair,” on page 557. 378 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Disk Drive Use this procedure to replace the disk drive. Your system can be powered off or powered on when you replace the disk drive. Do one of the following: v If the operating system is not running, or if the disk drive to be replaced is in the AIX or Linux root volume group (rootvg) and it is not protected with either a Redundant Array of Independent Disks (RAID) or mirroring, use the procedure for replacing the disk drive with the power off. The power-off procedures are also known as nonconcurrent in the following information. v If the operating system is running, and if the disk drive to be replaced is in not the AIX or Linux root volume group (rootvg) and it is protected with either a Redundant Array of Independent Disks (RAID) or mirroring, you can replace a disk drive with the power on. The power-on procedures are also known as concurrent in the following information. 1. Do you want to replace the disk drive concurrently with system operations and the unit powered on? Yes No ? Go to “Replace the disk drive nonconcurrently” on page 388. 2. Choose the procedure that is appropriate for the operating system running in the partition that the disk drive resource is assigned to. For AIX, go to “Replace the disk drive using AIX.” For Linux, go to “Replace the disk drive using Linux” on page 382. “Replace the disk drive using AIX”Use this procedure to remove and replace a disk drive concurrently using AIX. Replace the disk drive using LinuxUse this procedure to replace a disk drive in a location that is controlled by a system or logical partition that is running Linux. “Replace the disk drive nonconcurrently” on page 388 “Rebuild data on a replacement disk drive using Linux” on page 390To rebuild data on the replacement disk drive, complete the steps listed here.Replace the disk drive using AIX Use this procedure to remove and replace a disk drive concurrently using AIX. Chapter 4. Removing and replacing FRUs 379 1. Remove the disk drive as a resource: 1. Log in as root user or use CE Login. 2. At the command line, type diag and press Enter. 3. Press Enter to continue. 4. On the Function Selection display, select Task Selection. 5. Select Hot Plug Task. 6. Select RAID Hot Plug Devices. 7. Select the adapter that is connected to the array that contains the disk drive you want to replace and press Enter. 8. Select Commit. 9. On the Identify and Remove Resources display, select Identify. 10. Select the disk drive you want to remove and press Enter. The disk drive slot enters the identify state. The concurrent maintenance light for the slot begins flashing. The following figure shows the locations of the concurrent maintenance lights. Verify that the flashing light corresponds to the location for the disk drive you want to remove, and press Enter. The disk drive slot exits the identify state. 11. On the Identify and Remove Resources display, select Remove. A list of the disk drives that you can remove is shown. If the disk drive you want to remove does not appear on the list, ask your system administrator to put the disk drive in the failed state before continuing with this procedure. For information, see AIX System Management Guide: Operating System and Devices. 12. Select the disk drive you want to remove and press Enter. The disk drive slot enters the remove state and power is removed from the slot. The concurrent maintenance light for the slot begins flashing rapidly. Note: If the disk drive slot is not in the Remove state, contact your next level of support. 380 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 2. Remove the Front Cover Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, rack-specific FRU orientation, and so on. 1. Open the front rack door. 2. Remove the two thumbscrews (A) that secure the system unit (B) to the rack. 3. Push the cover release latches (C) in the direction of the arrows to release the cover from the system unit. 4. Pull the cover out and away from the system unit. 3. Remove the disk drive: 1. Unlock the disk drive handle (A) and pull the disk drive out toward you as shown. 2. Support the bottom of the disk drive as you slide it out of the system unit. Do not hold the disk drive by the handle. The concurrent maintenance light for the slot turns off when you remove the disk drive. Chapter 4. Removing and replacing FRUs 381 4. Install the disk drive: 1. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. 2. Slide the disk drive into the system until it stops. 3. Push the disk drive handle in until it locks. Note: When you install a disk drive, ensure that the drive is fully seated and all the way into the system. 5. Add the new disk drive as a resource: 1. Press Enter. The disk drive slot exits the remove state and enters the normal state. 2. Exit to the RAID Hot-Plug Devices menu. Press the F3 or ESC 3 key to return. 3. Exit to the Task selection display. 4. Select Log Repair Action. 5. Select the disk drive that you replaced and then press Enter. 6. Select Commit after you have made your selection and then press Enter. 7. Exit to the command line. 6. To rebuild data on the replacement disk drive, refer to the information in the AIX System Management Guide: Operating System and Devices. 7. Go to Chapter 5, “Verifying a repair,” on page 557. This completes this procedure.Replace the disk drive using Linux Use this procedure to replace a disk drive in a location that is controlled by a system or logical partition that is running Linux. Prepare to remove the disk drive 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 382 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown. 4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter. 5. Find the entry in the log for the disk drive you want to replace. 6. Record the location information for the disk drive. Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0 is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit). 7. Return to the command line. 8. Type the following: ls -ld /sys/class/scsi_host/host#/device where # is the SCSI host number. Press Enter. 9. Record the PCI location information. Note: The PCI location information has the form of 61:01:0:2. 10. Type iprconfig on the command line and press Enter. The IBM Power RAID Configuration Utility display is shown. 11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter. The Display Hardware Status display is shown. Kernel Messages Log Select one of the following: 1. View most recent ipr error messages 2. View ipr error messages 3. View all kernel error messages 4. View iprconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. View ipr boot time messages Selection: e=Exit Figure 12. Kernel Messages Log Chapter 4. Removing and replacing FRUs 383 12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status. 13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. 14. Type option 1 (Display hardware resource information details) next to the disk drive you want to replace. Press Enter. A Disk Hardware Resource Information Details display similar to the following is shown. 15. Record the physical location information. 16. Return to the IBM Power RAID Configuration Utility display. Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 13. Example Display Hardware Status Disk Unit Hardware Resource Information Details Manufacturer . . . . . . . . . . . . . . : IBM Product ID . . . . . . . . . . . . . . . : ST336607LC Firmware Version . . . . . . . . . . . . : 43353048 (C50H) Serial Number. . . . . . . . . . . . . . : 00006719 Capacity . . . . . . . . . . . . . . . . : 36.40 GB Resource Name. . . . . . . . . . . . . . : /dev/sdd Physical location PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0 SCSI Host Number . . . . . . . . . . . . : 2 SCSI Channel . . . . . . . . . . . . . . : 0 SCSI Id. . . . . . . . . . . . . . . . . : 4 SCSI Lun . . . . . . . . . . . . . . . . : 0 Extended Details FRU Number . . . . . . . . . . . . . . . : 00P2676 More... Press Enter to Continue e=Exit q=Cancel f=PageDn b=PageUp Figure 14. Example Disk Hardware Resource Information Details display 384 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Remove the disk drive 1. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 2. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A Concurrent Device Remove display is shown, similar to the following display. 3. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter. 4. The Verify Device Concurrent Remove display is shown. The concurrent maintenance light turns on for that disk drive slot. 5. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive you want to replace, then press Enter. The identify light turns on for the disk drive. Attention: To prevent loss of data, ensure that the disk drive is not in use. 6. The Complete Device Concurrent Remove display is shown. Concurrent Device Remove Choose a single location for remove operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- sdc 0000:41:01.0.1/0:8:0 Physical Disk Active 0000:41:01.0.1/0:5: Empty sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sda 0000:41:01.0.1/0:3:0 Physical Disk Active 0000:41:01.0.1/1:8: Empty sdf 0000:41:01.0.1/1:5:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active 0001:61:01.0.2/0:8: Empty sdh 0001:61:01.0.2/0:9:0 Physical Disk Active sdg 0001:61:01.0.2/0:3:0 Physical Disk Active Figure 15. Example Concurrent Device Remove display Chapter 4. Removing and replacing FRUs 385 Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds.7. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive as shown in Figure 16. If the handle is not all the way out, the disk drive will not slide out of the system or expansion unit. 8. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 9. Press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the disk drive.Replace the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle them with care. 2. Remove the disk drive from its protective package. 3. Unlock the disk drive handle and pull the disk drive out toward you before you install the disk drive. If the handle is not all the way out, the disk drive will not slide into the system or expansion unit. 4. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 5. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter. Figure 16. 386 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) A Concurrent Device Add display similar to the following is shown. 6. Type option1 (Select) next to the location from which you removed the disk drive. The Verify Device Concurrent Add display is shown. 7. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add display is shown. 8. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive all the way into the system or expansion unit and then lock it in place by pushing in the disk drive handle, as shown in Figure 18. 10. Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is installed. 11. Rebuild the data on the replacement disk drive. For instructions, go to “Rebuild data on a replacement disk drive using Linux” on page 390. 12. Go to Chapter 5, “Verifying a repair,” on page 557. This completes this procedure. Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 17. Example Concurrent Device Add display Figure 18. Chapter 4. Removing and replacing FRUs 387 Replace the disk drive nonconcurrently Use this procedure to remove and replace a disk drive nonconcurrently. 1. Power off the system and remove the power cords The component will not be serviced concurrently. If this system is powered on, power it off before continuing. If possible, shut down any running applications and the operating system before powering off the system. Once the system is powered off, remove all power cords from all of the power supplies. 2. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. 388 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Remove the disk drive: 1. Unlock the disk drive handle (A) and pull the disk drive out toward you as shown. 2. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 4. Install the disk drive: 1. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. 2. Slide the disk drive into the system until it stops. 3. Push the disk drive handle in until it locks. Note: When you install a disk drive, ensure that the drive is fully seated and all the way into the system. Chapter 4. Removing and replacing FRUs 389 5. If you removed the power cords, reinstall the power cords for all of the power supplies. 6. Go to Chapter 5, “Verifying a repair,” on page 557. This completes this procedure. Rebuild data on a replacement disk drive using Linux To rebuild data on the replacement disk drive, complete the steps listed here. For an unprotected disk drive If the disk drive you are replacing is in a RAID Level 0 disk array or in a failed RAID Level 5 or RAID Level 10 disk array, perform these tasks: 1. Re-create the disk array. 2. Re-create the file systems on the disk array. 3. Copy the data back to the restored disk array from your backup media. For information on these tasks, see the PCI-X SCSI RAID Controller Reference Guide for Linux, SA23-1327. 390 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Disk unit backplane Use this procedure to service the disk unit backplane. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 391 3. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. 4. Remove the slimline media or filler panel device 1. If there is a slimline media or filler panel device in the system, push the terra cotta plastic retaining tab (A) approximately 5mm to the right away from the device. 2. While depressing the terra cotta plastic retaining tab, grasp the device (B) and pull it out of the system. 392 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the operator panel 1. Release the operator panel by pushing the release tab (A) to the left. 2. Grasp the edges of the operator panel and pull it partially out of the bay. 3. Press the locking pins (C) inward and gently pull the operator panel completely out of the bay, taking care not to pull on the cables (B). 4. Rotate the operator panel downward if needed, and disconnect the operator panel cables (B) from the operator panel. 5. Record the operator panel cables orientation, for reconnecting them the same way. 6. Remove the disk units (DASD) and fillers Note: Support the bottom of the disk unit while you slide it out. Do not hold the disk unit by the handle. 1. Open the disk units handles by pinching the two surfaces of the handles together. 2. Pull all of the disk units out of the disk unit cage and backplane. 3. Repeat these steps for any disk unit fillers that are present. Chapter 4. Removing and replacing FRUs 393 7. Place into service position Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 8. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 394 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. 10. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. Chapter 4. Removing and replacing FRUs 395 11. Disconnect the media device cables, if you have a media device 1. Media device cable (A) does not need to be touched. 2. Squeeze the media device cable (B) as shown and disconnect it from its connector on the disk unit backplane. 3. Disconnect the media device cable (C) from its connector on the disk unit backplane. 12. Remove the disk unit cage and backplane 1. Rotate the handles outward in the direction shown to release the disk unit cage and backplane. 2. Slide the disk unit cage and backplane out of the system unit. 396 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 13. Install the disk unit cage and backplane 1. Slide the replacement disk unit cage and backplane into the system until you feel it engages and locks into place. 2. Rotate the handles in the direction shown to secure the disk unit cage and backplane. 14. Connect the media device cables, if you have a media device 1. Media device cable (A) does not need to be touched. 2. Connect the media device cables (B) and (C) to their connectors on the disk unit backplane. Chapter 4. Removing and replacing FRUs 397 15. Install the disk units (DASD) and fillers 1. Slide the disk units back into the disk unit cage and backplane. 2. Push in the handles of each disk unit to lock them into place. 3. Repeat these steps for any disk unit fillers that are present. 16. Install the operator panel 1. Carefully guide both operator panel cables (A) through the cables conduit in the extension of the operator panel while gently sliding the operator panel into the operator panel bay. 2. Connect the operator panel cables (A) to the operator panel. 3. Rotate the front of the operator panel upward and continue sliding it into the bay until the operator panel engages and locks into place. 398 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 17. Install the slimline media or filler panel device 1. If there was a slimline media or filler panel device present in the system, align the device (A) with the slimline media bay. 2. Push it into the system until the retaining tab (B) locks the device into place. 18. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. Chapter 4. Removing and replacing FRUs 399 19. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 20. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 400 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 21. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 22. Install the front cover 1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door. Chapter 4. Removing and replacing FRUs 401 23. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 24. Go to Chapter 5, “Verifying a repair,” on page 557. 402 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Ethernet card Use this procedure to service the Ethernet card. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 3. Disconnect the external cables 1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. Disconnect the external cables. Chapter 4. Removing and replacing FRUs 403 4. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 5. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 404 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 7. Remove the Ethernet adapter 1. Slide the latch (A) in the direction shown to release the Ethernet adapter (B). 2. Carefully grasp the Ethernet adapter by its blue handle, and pull it straight out of its connector on the system backplane. Chapter 4. Removing and replacing FRUs 405 8. Install the Ethernet adapter 1. Check the latch (A) is in the open position to let the Ethernet adapter slot accessible. 2. Carefully grasp the Ethernet adapter (B) by its blue handle, and align the card with its connector on the system backplane. 3. Press the Ethernet adapter firmly into its connector. 4. Slide the latch (A) back in the direction of the arrow to secure the Ethernet adapter. 9. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 406 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 11. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 407 12. Reconnect the external cables Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect the external cables you previously removed to the FRU you are servicing. 13. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 14. Go to Chapter 5, “Verifying a repair,” on page 557. 408 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Fan Use this procedure to service the fan. 1. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 2. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Chapter 4. Removing and replacing FRUs 409 3. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 4. Can repair be concurrent? You must determine if the repair can continue concurrently (i.e. with power on). To continue the repair concurrently, the following conditions must be true: v Fans must already be installed at the other three fan locations. v Each of these fans has a green LED and an amber LED. The LEDs (A) must be set as follows: – Green - on – Amber - offIf any of the conditions are not true, the repair can continue only after powering off the unit containing the FRU that is being repaired. If the unit is already powered off, you may proceed with a non-concurrent repair. 410 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Select an action Nonconcurrent repair. Power off the unit and continue the repair. Concurrent repair. Continue the repair with unit power on. Delay the repair. ? Go to step 9. This ends the procedure. 6. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 7. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 8. Go to step 10 on page 412. 9. Locate the hot-swap cooling fan Locate the fan you are servicing. Note: Each fan unit has two LEDs. The green LED indicates the fan is operating properly. The amber LED indicates a fan failure. Chapter 4. Removing and replacing FRUs 411 10. Remove the fan 1. Locate the failing fan to be removed. 2. Push the latch (A) upward to disengage the fan from the fan cage. 3. Pull the fan out of the fan cage. 11. Install the fan 1. Lower the fan into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan (A) by closing the latch. 412 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 12. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 13. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. Chapter 4. Removing and replacing FRUs 413 14. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. 15. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 16. Go to Chapter 5, “Verifying a repair,” on page 557. 414 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) GX adapter Use this procedure to service the GX adapter. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 415 3. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 416 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Disconnect the external cables 1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. Disconnect the external cables. 7. Remove the GX adapter 1. Slide the latch in the direction of the arrow (A) to release the GX adapter. 2. Pinch the GX adapter latches (C) and rotate them in the direction shown to release the GX adapter from the slot. 3. Gently pull the GX adapter straight up and out of the system. Chapter 4. Removing and replacing FRUs 417 8. Install the GX adapter 1. Align the GX adapter with its connector on the system backplane. 2. Gently push the GX adapter straight into the system. 3. Rotate the latches (C) in the direction shown until they latch. 4. Slide the latch back in the direction of the arrow (B) to secure all adapters. 9. Reconnect the external cables Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect the external cables you previously removed to the FRU you are servicing. 418 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 11. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. Chapter 4. Removing and replacing FRUs 419 12. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. 13. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 14. Go to Chapter 5, “Verifying a repair,” on page 557. 420 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Media device Use this procedure to service the media device. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 421 3. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. 4. Place into service position Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 422 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. Chapter 4. Removing and replacing FRUs 423 7. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. 8. Disconnect the media device cables 1. Disconnect the media device cable (A) from its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched. 424 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Remove the media device While pushing the media device release latches (A) on both the left and right sides, grasp the device (B) and pull it out of the system unit. 10. Install the media device Push the media device (A) into the system unit until you feel the latches (B) lock. Chapter 4. Removing and replacing FRUs 425 11. Connect the media device cables 1. Connect the media device cable (A) to its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched here. 12. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. 426 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 13. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 14. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Chapter 4. Removing and replacing FRUs 427 15. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 16. Install the front cover 1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door. 428 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 17. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 18. Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 429 Memory DIMM Use this procedure to service a memory DIMM. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 430 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Chapter 4. Removing and replacing FRUs 431 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. 432 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 7. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. 8. Remove the memory DIMM 1. Locate the memory DIMM you want to remove. 2. Remove the memory DIMM by pushing the tabs (A) out and then down. The tabs’ lever action forces the memory DIMM out of the connector. 3. Pull the memory DIMM (B) out of the connector. Chapter 4. Removing and replacing FRUs 433 9. Install the memory DIMM Note: Memory DIMMs are keyed to prevent a DIMM from being installed incorrectly. Note the locations of the key tabs within the DIMM connector before attempting to install the DIMM. 1. Ensure that the connector locking tabs (A) are pushed out in the unlocked position before installing a new memory DIMM. 2. Carefully grasp the memory DIMM (B) along two edges and align the connector. 3. Insert the memory DIMM firmly into the connector. 4. Secure the memory DIMM by pushing in the locking tabs (A). 10. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. 434 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 11. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 12. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Chapter 4. Removing and replacing FRUs 435 13. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 14. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. 436 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 15. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 16. Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 437 PCI adapter Use this procedure to service the PCI adapter. 1. Are you doing a concurrent repair? You may service the PCI adapter concurrently or nonconcurrently. For more information, see PCI Adapters, SA76-0152. If you are servicing the PCI adapter concurrently, continue with step 4. Otherwise, continue with the next step. 2. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 3. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 4. Disconnect the external cables 1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. Disconnect the external cables. 438 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 6. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Chapter 4. Removing and replacing FRUs 439 7. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 8. Remove the PCI adapter 1. For a long PCI adapter, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 2. Slide the latch in the direction of the arrow (A) to release the PCI adapters. 3. Locate the PCI adapter you want to remove. 4. Carefully grasp the PCI adapter (C) by its top edge or upper corners, and pull the PCI adapter up and out of its connector on the system backplane. 5. If you opened the terra cotta latch on top of the front support, close it now. 440 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Install the PCI adapter 1. Place the PCI adapter, component-side up, on a flat, static-protective surface. 2. Set any jumpers or switches on the PCI adapter as instructed by the adapter’s manufacturer. 3. For a long PCI adapter, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 4. Carefully grasp the PCI adapter (C) by its top edge, and align the PCI adapter with its connector on the system backplane. 5. Press the PCI adapter firmly into its connector. 6. Slide the latch back in the direction of the arrow (B) to secure all adapters. 7. If you opened the terra cotta latch on top of the front support, close it now. 10. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Chapter 4. Removing and replacing FRUs 441 11. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 12. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. 442 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 13. Reconnect the external cables Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect the external cables you previously removed to the FRU you are servicing. 14. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 15. Go to Chapter 5, “Verifying a repair,” on page 557. “Remove and replace a PCI adapter contained in a cassette in the system with the power on in AIX” “Remove and replace a PCI adapter contained in a cassette in the system with the power on in Linux” on page 457Remove and replace a PCI adapter contained in a cassette in the system with the power on in AIX Procedures performed on a PCI adapter with the system power on in AIX, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. To remove and replace an adapter, do the following steps: 1. Determine the location of the PCI adapter in the system. Chapter 4. Removing and replacing FRUs 443 2. Record the slot number and location of each adapter being removed. Adapter slots are numbered on the rear of the system unit. 3. Ensure that any processes or applications that might use the adapter are stopped. 4. Enter the system diagnostics by logging in as root user or as the celogin user, type diag at AIX command line. 5. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter. 6. At the FUNCTION SELECTION menu, select Task Selection, then press enter. 7. At the Task Selection list, select PCI Hot Plug Manager. 8. Select Unconfigure a Device, then press Enter. 9. Press F4 (or Esc +4) to display the Device Names menu. 10. Select the adapter you are removing in the Device Names menu. 11. Use the Tab key to answer YES to Keep Definition. Use the Tab key again to answer YES to Unconfigure Child Devices, then press Enter. 12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press F3 (or Esc +3) twice to return to the Hot Plug Manager menu. 14. Select replace/remove PCI Hot Plug adapter. 15. Select the slot that has the device to be removed from the system. 16. Select Replace. Or, If you need to remove an adapter without replacing it, select Remove. A fast-blinking amber LED located at the back of the machine near the adapter indicates that the slot has been identified. 17. Press Enter. This places the adapter in the action state, meaning it is ready to be removed from the system. 18. Label, and then disconnect all cables attached to the adapter you plan to remove. 19. Lift up on the PCI adapter EMC shield A as shown in Figure 19 on page 445 and then rotate it up and away from the cassette as shown in Figure 20 on page 446. 444 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Figure 19. Lift up on the EMC shield Chapter 4. Removing and replacing FRUs 445 20. Remove the cassette. Lift up the lower cassette handle B as shown in the following figure. Pull the PCI cassette C out of the system. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. Figure 20. Rotate the EMC shield into the open position 446 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 21. Place the cassette with the cover facing up on an approved ESD surface. The cover will have a label on it. 22. Install the replacement adapter into the PCI adapter cassette using the following instructions. If the cassette is not going to contain a replacement adapter, use this same procedure to place an adapter filler panel in the cassette. Follow these steps: a. Remove the cassette cover by doing the following steps: 1) Slide the cover latch A to disengage it from the pivot pin C as shown in the following figure. 2) Lift the cover B off of the pivot pin. 3) Slide the cover off of the cassette. Figure 21. PCI adapter cassette removed from the system unit Chapter 4. Removing and replacing FRUs 447 b. Remove the adapter from the cassette by doing the following steps: 1) Unlock the adapter retainers by rotating the retainer clip A into the horizontal position. See Figure 23 on page 449. Notes: v The edge of the adapter located at the end of the cassette that contains the cassette handles is called the adapter tailstock. v Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. v When the retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide away from the card. v If the corner support retainer is used, unlock it, and then slide the corner support retainer away from the card.2) Push the adapter retainers B away from the adapter. 3) Unlock the adapter tailstock clamp C. 4) Rotate the adapter out of the cassette by grasping the edge of the adapter opposite the tailstock, and then firmly rotate the adapter toward the bottom of the cassette. 5) Lift the adapter out of the tailstock retaining channel. Figure 22. PCI adapter cassette cover removed 448 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) c. Place the new adapter in the cassette. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. If the cassette is not going to contain a PCI adapter, use this same procedure to place an adapter filler panel in the cassette. 1) Remove any shipping handles or brackets attached to the adapter. 2) With the tailstock clamp in the open position, insert the adapter firmly into the tailstock retaining channel A. See Figure 24 on page 450. 3) Rotate the adapter toward the top of the cassette and into place. 4) Close the tailstock clamp. Figure 23. Adapter removed from the PCI adapter cassette Chapter 4. Removing and replacing FRUs 449 5) Position the adapter retainers to support the adapter, and then rotate the retainer clip B into the closed position. See Figure 24. Note: Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. When the adapter retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide toward the adapter. Place the retainers on the adapter according to the length of the adapter being used. Select the appropriate instructions: v Adapter-cassette retainer placement for large adapters v Adapter-cassette retainer placement for mid-sized adapters v Adapter-cassette retainer placement for small adaptersAdapter-cassette retainer placement for large adapters v Place and lock the retainers B. See Figure 25 on page 451. Attention: Use of the lower corner support retainer might interfere with the docking of the PCI card when positioned within the system. Ensure the retainer does not interfere with the adapter connectors on the system backplane. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 24. Adapter removed from the PCI adapter single-width cassette 450 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Adapter-cassette retainer placement for mid-sized adapters v Remove the adapter stabilizer C. See Figure 26 on page 452. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 25. Long adapter in the PCI adapter cassette with the supports and stabilizer in place Chapter 4. Removing and replacing FRUs 451 Adapter-cassette retainer placement for small adapters v Remove the adapter stabilizer C. See Figure 27 on page 453. v Place the hookarm D into the hole in the corner of the adapter. This supports the card when it is undocked from the connector on the system backplane. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 26. Medium-length adapter in the PCI adapter cassette with the supports in place 452 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) d. Replace the cassette cover by doing the following steps: 1) Slide the cover B into position on the cassette. 2) While holding the cover latch A in the open position, place the cover over the pivot pin C. 3) Release the cover latch to lock the cover into place. Figure 27. Short adapter in the PCI adapter cassette with the supports and the hookarm in place Chapter 4. Removing and replacing FRUs 453 23. At the back of the system, lift the cassette cover flap and identify the cassette slot you want to use. 24. Ensure the lower cassette handle is pressed up toward the retainer clip. This places the adapter in the correct position to be docked in the system. 25. Lift and hold the PCI adapter EMC shield in the open position. See Figure 19 on page 445 and Figure 20 on page 446. 26. Slide the cassette C into the cassette slot as shown in the following figure. 27. When the cassette is fully inserted into the system, firmly press downward on the lower cassette handle B to lock the adapter in its connector. Figure 28. PCI adapter cassette cover replaced 454 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 28. Connect the adapter cables. 29. Lower the PCI adapter EMC shield A into the closed position, close the shield latch, then close the rear rack door. Figure 29. PCI adapter cassette removed from the system unit Chapter 4. Removing and replacing FRUs 455 30. Return to the AIX screen. Choose one of the following options:. v If you are replacing an adapter, go to step 31. v If you removing an adapter without replacing it, follow these steps: a. Press enter and continue to follow the screen instructions until you receive a message that the removal is successful. Successful removal is indicated by the OK message displayed next to the Command field at the top of the screen. b. Press F10 to exit the Hot-Plug Manager. c. Run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the configuration. d. Continue at step 40 on page 457.31. Press enter and continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 32. Press the F3 (or Esc+3) key to return to the PCI Hot-Plug Manager menu. 33. Press the F3 (or Esc+3) key to return to the TASK selection list. 34. Select Log Repair Action. 35. Select the resource just replaced, press Enter, press Commit (F7 or ESC 7), then press Enter. 36. Press F3 (or Esc+3) to return to TASK Selection List. Figure 30. PCI adapter EMC shield in the closed position 456 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 37. Select Hot Plug Task, press enter. 38. Select PCI Hot Plug Manager, then select Configure a defined device, then press Enter. 39. Select the device just replaced from the list, then press Enter. The device is now configured. 40. Press the F10 key to exit the diagnostic program. Remove and replace a PCI adapter contained in a cassette in the system with the power on in Linux Procedures performed on a PCI adapter with the system power on in Linux, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. Before performing this procedure, do the following: v Ensure that the system meets the “Prerequisites for hot-plugging PCI adapters in Linux” on page 469 v “Verify that the Linux, hot-plug PCI tools are installed” on page 470.To remove and replace an adapter, do the following steps: 1. Determine the location of the PCI adapter in the system. 2. Record the slot number and location of each adapter being removed. Adapter slots are numbered on the rear of the system unit. 3. Ensure that any processes or applications that might use the adapter are stopped. 4. Run the drslot_chrp_pci command to enable an adapter to be replaced: For example, to replace the PCI adapter in slot U7879.001.DQD014E-P1-C3 run this command: drslot_chrp_pci -R -s U7879.001.DQD014E-P1-C3 Follow the instructions on the display to complete the task. 5. Label, and then disconnect all cables attached to the adapter you plan to remove. 6. Lift up on the PCI adapter EMC shield A as shown in Figure 31 on page 458 and then rotate it up and away from the cassette as shown in Figure 32 on page 459. Chapter 4. Removing and replacing FRUs 457 Figure 31. Lift up on the EMC shield 458 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 7. Remove the cassette. Lift up the lower cassette handle B as shown in the following figure. Pull the PCI cassette C out of the system. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. Figure 32. Rotate the EMC shield into the open position Chapter 4. Removing and replacing FRUs 459 8. Place the cassette with the cover facing up on an approved ESD surface. The cover will have a label on it. 9. Install the replacement adapter into the PCI adapter cassette using the following instructions. If the cassette is not going to contain a replacement adapter, use this same procedure to place an adapter filler panel in the cassette. Follow these steps: a. Remove the cassette cover by doing the following steps: 1) Slide the cover latch A to disengage it from the pivot pin C as shown in the following figure. 2) Lift the cover B off of the pivot pin. 3) Slide the cover off of the cassette. Figure 33. PCI adapter cassette removed from the system unit 460 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) b. Remove the adapter from the cassette by doing the following steps: 1) Unlock the adapter retainers by rotating the retainer clip A into the horizontal position. See Figure 35 on page 462. Notes: v The edge of the adapter located at the end of the cassette that contains the cassette handles is called the adapter tailstock. v Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. v When the retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide away from the card. v If the corner support retainer is used, unlock it, and then slide the corner support retainer away from the card.2) Push the adapter retainers B away from the adapter. 3) Unlock the adapter tailstock clamp C. 4) Rotate the adapter out of the cassette by grasping the edge of the adapter opposite the tailstock, and then firmly rotate the adapter toward the bottom of the cassette. 5) Lift the adapter out of the tailstock retaining channel. Figure 34. PCI adapter cassette cover removed Chapter 4. Removing and replacing FRUs 461 c. Place the new adapter in the cassette. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. If the cassette is not going to contain a PCI adapter, use this same procedure to place an adapter filler panel in the cassette. 1) Remove any shipping handles or brackets attached to the adapter. 2) With the tailstock clamp in the open position, insert the adapter firmly into the tailstock retaining channel A. See Figure 36 on page 463. 3) Rotate the adapter toward the top of the cassette and into place. 4) Close the tailstock clamp. Figure 35. Adapter removed from the PCI adapter cassette 462 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5) Position the adapter retainers to support the adapter, and then rotate the retainer clip B into the closed position. See Figure 36. Note: Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock. When the adapter retainer clip is in the horizontal position, the adapter retainers are unlocked and can slide toward the adapter. Place the retainers on the adapter according to the length of the adapter being used. Select the appropriate instructions: v Adapter-cassette retainer placement for large adapters v Adapter-cassette retainer placement for mid-sized adapters v Adapter-cassette retainer placement for small adaptersAdapter-cassette retainer placement for large adapters v Place and lock the retainers B. See Figure 37 on page 464. Attention: Use of the lower corner support retainer might interfere with the docking of the PCI card when positioned within the system. Ensure the retainer does not interfere with the adapter connectors on the system backplane. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 36. Adapter removed from the PCI adapter single-width cassette Chapter 4. Removing and replacing FRUs 463 Adapter-cassette retainer placement for mid-sized adapters v Remove the adapter stabilizer C. See Figure 38 on page 465. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 37. Long adapter in the PCI adapter cassette with the supports and stabilizer in place 464 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Adapter-cassette retainer placement for small adapters v Remove the adapter stabilizer C. See Figure 39 on page 466. v Place the hookarm D into the hole in the corner of the adapter. This supports the card when it is undocked from the connector on the system backplane. v Place and lock the retainers B. v Ensure the adapter edge is seated in each retainer groove A. If the shape of the adapter or the presence of a connector will not allow the adapter edge to be seated into the retainer groove, ensure the retainer is still locked firmly against that edge or connector. Figure 38. Medium-length adapter in the PCI adapter cassette with the supports in place Chapter 4. Removing and replacing FRUs 465 d. Replace the cassette cover by doing the following steps: 1) Slide the cover B into position on the cassette. 2) While holding the cover latch A in the open position, place the cover over the pivot pin C. 3) Release the cover latch to lock the cover into place. Figure 39. Short adapter in the PCI adapter cassette with the supports and the hookarm in place 466 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. At the back of the system, lift the cassette cover flap and identify the cassette slot you want to use. 11. Ensure the lower cassette handle is pressed up toward the retainer clip. This places the adapter in the correct position to be docked in the system. 12. Lift and hold the PCI adapter EMC shield in the open position. See Figure 31 on page 458 and Figure 32 on page 459. 13. Slide the cassette C into the cassette slot as shown in the following figure. 14. When the cassette is fully inserted into the system, firmly press downward on the lower cassette handle B to lock the adapter in its connector. Figure 40. PCI adapter cassette cover replaced Chapter 4. Removing and replacing FRUs 467 15. Connect the adapter cables. 16. Lower the PCI adapter EMC shield A into the closed position, close the shield latch, then close the rear rack door. Figure 41. PCI adapter cassette removed from the system unit 468 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 17. Run the lsslot command to verify that the slot is occupied. For example, Enter lsslot -c pci -s U7879.001.DQD014E-P1-C3 The following is an example of the information displayed by this command: # Slot Description Device(s) U7879.001.DQD014E-P1-C3 PCI-X capable, 64 bit, 133MHz slot 0001:40:01.0 Prerequisites for hot-plugging PCI adapters in LinuxIn the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need complete some prerequisite tasks. Use the information in this section to identify those prerequisites. Verify that the Linux, hot-plug PCI tools are installedIn the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need use the hot-plug PCI tools. Use the procedure in this section to verify that you have the hot-plug PCI tools installed.Prerequisites for hot-plugging PCI adapters in Linux In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need complete some prerequisite tasks. Use the information in this section to identify those prerequisites. Figure 42. PCI adapter EMC shield in the closed position Chapter 4. Removing and replacing FRUs 469 The Linux, system administrator needs to take the PCI adapter offline prior to removing, replacing, or installing a PCI adapter with the system power on (hot-plugging). Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. Before hot-plugging adapters for storage devices, ensure file systems on those devices are unmounted. After hot-plugging adapters for storage devices, ensure the file systems on those devices are remounted. Before hot-plugging an adapter, ensure that the server or partition is at the correct level of the Linux operating system (Linux 2.6 or later). Install the POWER™ Linux Service Aids. These service aids enable system serviceability, as well to improve system management. If you are using a Linux on POWER distribution with Linux kernel version 2.6 or greater, you can install the Service Aids that gives you access to more capabilities, which can help you diagnose problems on your system. This software is available at the Service and productivity tools for Linux on POWER Web site. (http://techsupport.services.ibm.com/server/lopdiags) Verify that the Linux, hot-plug PCI tools are installed In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux you might need use the hot-plug PCI tools. Use the procedure in this section to verify that you have the hot-plug PCI tools installed. 1. Enter the following command to verify that the hot-plug PCI tools are installed: rpm -aq | grep rpa-pci-hotplug If the command does not list any rpa-pci-hotplug packages, the PCI Hot Plug tools are not installed. 2. Enter the following command to ensure that the rpaphp driver is loaded: ls -l /sys/bus/pci/slots/ The directory should contain data. If the directory is empty, the driver is not loaded or the system does not contain hot-plug PCI slots. The following is an example of the information displayed by this command: drwxr-xr-x 15 root root 0 Feb 16 23:31 . drwxr-xr-x 5 root root 0 Feb 16 23:31 .. drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.6 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.6 If the directory does not exist, run the following command to mount the filesystem: mount -t sysfs sysfs /sys 3. Ensure the following tools are available in the /usr/sbin directory. v lsslot v drslot_chrp_pci4. Return to the procedure that sent you here. 470 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Power supply Use this procedure to service the power supply. 1. Can repair be concurrent? You must determine if the repair can continue concurrently. To continue the repair concurrently, the following conditions must be true: v Another power supply must already be installed. v Its three LEDs must be set as follows: – AC Input - on, not blinking – DC Output - on, not blinking – Fault/Identify - offIf any of the conditions are not true, the repair can continue only after powering off the unit containing the FRU that is being repaired. If the unit is already powered off, you may proceed with a non-concurrent repair. 2. Select an action: Nonconcurrent repair. Power off the unit and continue the repair. Concurrent repair. Continue the repair with unit power on. Delay the repair. ? Go to step 8 on page 473. This ends the procedure. 3. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. Chapter 4. Removing and replacing FRUs 471 4. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 5. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. 472 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Remove the power supply Attention: Do not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit. 1. Disconnect the power cable (A) from the power supply you are removing. 2. While pushing the lever (B) down, pull the power supply straight out of the system. 7. Go to step 10 on page 474. 8. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. Chapter 4. Removing and replacing FRUs 473 9. Remove the power supply Attention: Do not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit. 1. The power supply you are servicing should have its Fault/Identify LED (amber LED) blinking. 2. Disconnect the power cable (A) from the power supply you are removing. 3. While pushing the lever (B) down, pull the power supply straight out of the system. 10. Install the power supply 1. While pushing the lever (A) down, carefully slide the power supply into its location in the system. 2. Connect the power cable (B) to the power supply. 11. Are you doing a concurrent repair? Yes No 474 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Are you doing a concurrent repair? ? Go to step 13. 12. Is power supply operational? You must determine if the new power supply is operational. The new power supply is operational if its LEDs are set as follows: v AC Input - on, not blinking v DC Output - on, not blinking v Fault/Identify - blinking or offIf the LEDs are set differently, you must determine why. Make sure the power supply is plugged into a working AC power source, try installing a different power supply, or call your next level of support. 13. Install the front cover 1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door. Chapter 4. Removing and replacing FRUs 475 14. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 15. Go to Chapter 5, “Verifying a repair,” on page 557. 476 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) RAID enablement card Use this procedure to service the RAID enablement card. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 477 3. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 478 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Remove the RAID enablement card 1. Push the tab (A) in the direction shown to release the card. 2. Gently pull the card (B) straight up and out of the enclosure. Chapter 4. Removing and replacing FRUs 479 7. Install the RAID enablement card 1. Push the card (B) down in the direction shown, along the sliders of the middle divider. 2. Align the card with its connector on the system backplane. 3. Push the tab (A) in the direction shown and gently push the card down until it engages into its slot. 4. Release the tab (A) to lock the card into its slot. 8. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 480 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 10. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 481 11. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 12. Go to Chapter 5, “Verifying a repair,” on page 557. 482 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Auxiliary cache card Use this procedure to service the auxiliary cache card. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 483 3. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 484 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Remove the auxiliary cache card Note: When removing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. Squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Push the latch (B) in the direction shown to release the card. 3. Gently pull the card (C) straight up and out of the enclosure. Chapter 4. Removing and replacing FRUs 485 7. Install the auxiliary cache card Note: When installing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. Align the card (C) with its connector on the system backplane. 2. Push the latch (B) in the direction shown, and gently push the card (C) down until it engages into its slot. 3. Release the latch (B) to lock the card into its slot. 4. Close the terra cotta latch on top of the front support (A). 8. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 486 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 10. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 487 11. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 12. Go to Chapter 5, “Verifying a repair,” on page 557. 488 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Slimline media Use this procedure to service the Slimline media. 1. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. 2. Can repair be concurrent? You must determine if the repair can continue concurrently (i.e. with power on). If the top LED is green as shown in the figure on the right, you may proceed with a concurrent repair. Otherwise, the repair can continue only after powering off the unit. If the unit is already powered off, you may proceed with a non-concurrent repair. Chapter 4. Removing and replacing FRUs 489 3. Select an action: Nonconcurrent repair. Power off the unit and continue the repair. Concurrent repair. Continue the repair with unit power on. Delay the repair. ? Go to step 6 on page 491. This ends the procedure. 4. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 5. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 490 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Remove the slimline media device 1. Push the terra cotta plastic retaining tab (A) approximately 5mm to the right away from the slimline media device. 2. While depressing the terra cotta plastic retaining tab, grasp the device (B) and pull it out of the system. 7. Install the slimline media device 1. Align the replacement device (A) with the slimline media bay. 2. Push it into the system until the retaining tab (B) locks the device into place. Chapter 4. Removing and replacing FRUs 491 8. Install the front cover 1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door. 9. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 10. Go to Chapter 5, “Verifying a repair,” on page 557. 492 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) System backplane Use this procedure to replace the system backplane. 1. Access the ASMI The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI may also be referred to as the service processor menus. To access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 4. Log on to the ASMI using the administrator userid and password.When you have accessed the ASMI, continue with the next step. 2. Check and record the server settings Prior to replacing this FRU, check and record all server settings as you may need to update the system after you update this FRU. This can be done through the Advanced System Management Interface (ASMI) . 1. Check and record the server firmware level indicated on the right edge of the status frame, the area where the Log out button is located. You may need to update the system to the latest server firmware code level after you update this FRU. 2. Check and record the service processor settings you previously set using ASMI. You will need to reset these service processor settings after you update this FRU or the default settings will be used. Settings to record include the following: v Power/Restart Control settings. v System Service Aids settings. v System Configuration settings, System Name setting. v Network Services settings. v Performance Setup settings. v Login Profile settings.3. Record any service processor settings you may have set using operating system commands. 3. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. Chapter 4. Removing and replacing FRUs 493 4. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. 5. Remove the front cover Note to Tower Users: The instructions in this procedure are written based on the Rack model. Tower users may still use this procedure. However, Tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 3. Grab the cover at points (C) and pull it out and away from the system unit. 494 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Remove the power supply or both power supplies 1. Disconnect the power cable (A) from the first power supply you are removing, either E1 or E2. 2. While pushing the lever (B) down, pull the power supply straight out of the system. 3. Repeat these steps to remove the other power supply, if present. 7. Disconnect all the external cables 1. Carefully label the external cables connected to the FRU you are servicing so that they can be reconnected in exactly the same place after the repair. 2. This can include cables on the following ports: a. Serial b. SPCN c. GX d. Ethernet e. PCI f. USB3. 4. Then disconnect the external cables. Chapter 4. Removing and replacing FRUs 495 8. Place into service position Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 9. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 496 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. 11. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. Chapter 4. Removing and replacing FRUs 497 12. Disconnect the media device cables 1. Disconnect the media device cable (A) from its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched. 13. Remove the disk unit cage and backplane 1. Rotate the handles outward in the direction shown to release the disk unit cage and backplane. 2. Slide the disk unit cage and backplane out of the system unit. 498 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 14. Remove the Ethernet adapter 1. Slide the latch (A) in the direction shown to release the Ethernet adapter (B). 2. Carefully grasp the Ethernet adapter by its blue handle, and pull it straight out of its connector on the system backplane. 15. Remove the GX adapter divider Grasp the GX adapter divider by its top edge, and slide it up and out of the system. Chapter 4. Removing and replacing FRUs 499 16. Remove all GX adapters, if present Note: Record the slot location of all adapters that are going to be removed, so that they can be reinstalled in exactly the same place after the repair. 1. If the latch is in position (B), slide it in the direction of the arrow (A) to release the GX adapters. 2. Pinch the GX Adapter latches (C) and rotate them in the direction shown to release the GX adapter from the slot. 3. Gently pull the GX adapter straight up and out of the system. 4. Repeat these steps to remove a second GX adapter, if present. 17. Remove the PCI adapter dividers 1. Squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Flex the front edge (B) of the PCI adapter divider out of the bracket and toward the rest of the divider. 3. Pull the back edge of the divider away from the retention notches (C) in the system chassis. 4. Repeat these steps for the other PCI adapter dividers that need to be removed. 500 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 18. Remove all PCI adapters Note: Record the slot location of all adapters that are going to be removed, so that they can be reinstalled in exactly the same place after the repair. 1. If there is a long PCI adapter, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 2. If the latch is in position (B), slide it in the direction of the arrow (A) to release the PCI adapters. 3. Carefully grasp the PCI adapter (C) by its top edge or upper corners, and pull the PCI adapter up and out of its connector on the system backplane. 4. Store the PCI adapter in a safe place. 5. Repeat these steps until all PCI adapters are removed. 19. Remove the auxiliary cache card, if present Note: When removing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. If needed, squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Push the latch (B) in the direction shown to release the card. 3. Gently pull the card (C) straight up and out of the enclosure. Chapter 4. Removing and replacing FRUs 501 20. Remove the PCI front support 1. Loosen the blue retaining tab (A) that secures the PCI front support to its enclosure. 2. Lift the PCI front support (B) up and out of the enclosure. 21. Remove the RAID enablement card, if present 1. Push the tab (A) in the direction shown to release the card. 2. Gently pull the card (B) straight up and out of the enclosure. 502 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 22. Remove the TPMD card, if present Gently pull the TPMD card (A) straight up and out of its slot on the system backplane. 23. Remove the middle system divider 1. Loosen the two blue retaining tabs (A) that secure the middle system divider from the system chassis. 2. Grasp the middle system divider (B) and lift it out of the system. Chapter 4. Removing and replacing FRUs 503 24. Remove the VPD card 1. Press gently on the handle (A) to unlock the VPD card (B) from its slot on the system backplane. 2. Pull the VPD card out of its slot 25. Remove the voltage regulator module pairs 1. Locate the first voltage regulator module pair. 2. Push the connector tabs (A) out and then down to unlock one module. 3. Pull the module (B) out of the connector. 4. Repeat these steps to remove the other module in the pair. 5. Keep the two modules of the pair together. 6. Repeat these steps to remove the other module pair. 504 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 26. Remove the single voltage regulator modules 1. Locate the first single voltage regulator module. 2. Push the connector tabs (A) out and then down to unlock one module. 3. Pull the module (B) out of the connector. 4. Repeat these steps to remove the other voltage regulator module. 27. Remove all memory DIMMs 1. Remove the memory DIMM by pushing the tabs (A) out and then down. The tabs’ lever action forces the memory DIMM out of the connector. 2. Pull the memory DIMM (B) out of the connector. 3. Repeat these steps until all memory DIMMs have been removed from the system backplane being replaced. Chapter 4. Removing and replacing FRUs 505 28. Remove the line cord assembly 1. Loosen the upper right thumbscrew (A) located on the back of the system chassis. 2. Slide the line cord assembly (B) toward the front of the system to unlock it. 3. Without removing the line cords, grasp the line cord assembly, disengage it, pull it upward, and place it next to the system chassis. 29. Remove the system backplane Note: Take care when removing the system backplane. Do not lift the system backplane by any of the attached modules. 1. Loosen the two thumbscrews (A) located on the back of the system chassis, as shown in the figure. 2. Push the system backplane (B) toward the front of the system - about 10 mm (half inch) - to unlock it from the system chassis. 3. Grasp the system backplane by the metal frame (C) located at the back of the backplane. 4. Carefully lift the system backplane out of the chassis. 506 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 30. Install the system backplane Note: Take care when installing the system backplane. Do not lift the system backplane by any of the attached modules. 1. Carefully grasp the replacement system backplane (A) along two edges. 2. Lower the system backplane at an angle, with the front of the backplane connecting with the system chassis first. 3. When the system backplane lays at the bottom of the system chassis, slide it toward the back of the system chassis - about 10 mm (half inch) - until it locks. 4. Tighten the two thumbscrews (B) located on the back of the system chassis, as shown in the figure, to secure the backplane to the chassis. 31. Install the line cord assembly 1. Place the line cord assembly (A) into position, ensuring the slots (B) on the line cord assembly align with the pins on the system chassis. 2. Slide the line cord assembly toward the back of the system to lock it into place. 3. Insert and tighten the upper right thumbscrew (C) located on the back of the system chassis, to secure the line cord assembly. Chapter 4. Removing and replacing FRUs 507 32. Install all memory DIMMs Attention: Memory DIMMs are keyed to prevent a DIMM from being installed improperly. Note the locations of the key tabs within the DIMM connector before attempting to install the DIMM. 1. Ensure that the connector locking tabs (A) are pushed out in the unlocked position before installing a memory DIMM. 2. Carefully grasp the memory DIMM (B) along two edges and align the connector. 3. Insert the memory DIMM firmly into the connector. 4. Secure the memory DIMM by pushing in the locking tabs (A). 5. Repeat these steps until all memory DIMMs have been installed. 33. Install the two single voltage regulator modules 1. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing a new voltage regulator module. 2. Carefully grasp the module (B) along two edges and align it with the connector. 3. Insert the module into the connector. 4. Push the connector tabs into the locked position. 5. Repeat these steps to install the other single voltage regulator module. 508 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 34. Install the two voltage regulator module pairs 1. Install the voltage regulator modules, pair by pair. 2. a. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing the module (B). b. Carefully grasp the module along two edges and align it with the connector. c. Insert the module into the connector. d. Push the connector tabs into the locked position. e. Repeat these steps to install the other module in the pair.3. Repeat these steps to install the other pair of modules. 35. Install the VPD card 1. Press gently on the handle (A) to insert the VPD card (B) into its slot on the system backplane. 2. Push the VPD card into place until it is fully seated. Chapter 4. Removing and replacing FRUs 509 36. Install the middle system divider 1. Align the two blue tabs (B) that secure the middle system divider (A) with the two holes. 2. Slide the middle system divider down and into its position, and align it with the pin (C) in the middle on the system backplane. 3. Tighten the two blue tabs (B) that secure the middle system divider with the system chassis. 37. Install the TPMD card, if previously removed Gently push the TPMD card (A) straight into its slot on the system backplane. 510 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 38. Install the RAID enablement card, if previously removed 1. Push the card (B) down in the direction shown, along the sliders of the middle divider. 2. Align the card with its connector on the system backplane. 3. Push the tab (A) in the direction shown and gently push the card down until it engages into its slot. 4. Release the tab (A) to lock the card into its slot. 39. Install the PCI front support 1. Lower the PCI front support (A) into its location on the enclosure, aligning the tab with the hole on the back of the front chassis. 2. Tighten the blue tab (B) that secures the PCI front support. Chapter 4. Removing and replacing FRUs 511 40. Install the auxiliary cache card, if previously removed Note: When installing the auxiliary cache card, handle it by its edges, and not by the attached battery. 1. Squeeze the terra cotta latch (A) on top of the front support, and swing it up in the open position. 2. Align the card (C) with its connector on the system backplane. 3. Push the latch (B) in the direction shown, and gently push the card (C) down until it engages into its slot. 4. Release the latch (B) to lock the card into its slot. 41. Install all PCI adapters, if previously removed Note: Reinstall the cards in the same slot locations as before the repair. 1. If there is a long PCI adapter and if needed, squeeze the terra cotta latch (D) on top of the front support, and swing it up in the open position. 2. If needed, slide the latch in the direction of the arrow (A) to let the PCI adapter slot accessible. 3. Carefully grasp the PCI adapter (C) by its top edge, and align the PCI adapter with its connector on the system backplane. 4. Press the PCI adapter firmly into its connector. 5. Repeat these steps until all PCI adapters are installed. 512 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 42. Install the PCI adapter dividers 1. If needed, squeeze the terra cotta latch (C) on top of the front support, and swing it up in the open position. 2. Grasp the PCI adapter divider by its top edge and align the back edge of the divider with the retention notches (A) in the system chassis. 3. Slide the front edge of the divider (B) into its slot. 4. Repeat these steps for the other PCI adapter dividers that need to be installed. 5. Close the terra cotta latch (C) on top of the front support. 43. Install all GX adapters, if previously removed Note: Reinstall the cards in the same slot locations as before the repair. 1. If needed, slide the latch in the direction of the arrow (A) to let the GX adapter slot accessible. 2. Align the GX adapter with its connector on the system backplane. 3. Gently push the GX adapter straight into the system. 4. Rotate the latches (C) in the direction shown until they latch. 5. Repeat these steps to install a second GX adapter, if previously removed. Chapter 4. Removing and replacing FRUs 513 44. Install the GX adapter divider 1. Align the GX adapter divider (A) with its sliders (B) and (C). 2. Slide the GX adapter divider down and into the system. 45. Install the Ethernet adapter 1. Check the latch (A) is in the open position to let the Ethernet adapter slot accessible. 2. Carefully grasp the Ethernet adapter (B) by its blue handle, and align the card with its connector on the system backplane. 3. Press the Ethernet adapter firmly into its connector. 4. Slide the latch (A) back in the direction of the arrow to secure the Ethernet adapter. 514 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 46. Install the disk unit cage and backplane 1. Slide the replacement disk unit cage and backplane into the system until you feel it engages and locks into place. 2. Rotate the handles in the direction shown to secure the disk unit cage and backplane. 47. Connect the media device cables 1. Connect the media device cable (A) to its connector at the rear of the media device. 2. Media device cables (B) and (C) do not need to be touched here. Chapter 4. Removing and replacing FRUs 515 48. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. 49. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 516 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 50. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 51. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. Chapter 4. Removing and replacing FRUs 517 52. Install the power supply or both power supplies 1. While pushing the lever (A) down, carefully slide the power supply into its location either E1 or E2 in the system. 2. Connect the power cable (B) to the power supply. 3. Repeat these steps to install the other power supply, if present. 53. Install the front cover 1. Position the cover on the front of the system unit so that the two screws (C) align with the screw holes on the front of the system unit (A). 2. Push the cover at points (B) to attach it to the front of the system unit as shown in the following figure. 3. Tighten the two screws that secure the system unit to the rack. 4. Close the front rack door. 518 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 54. Reconnect all the external cables Attention: To prevent damaging the FRU or the system, use the following precautions before plugging cables into a connector or adapter: v Make sure that you have the right type of cable for the connector or adapter. v Make sure that the cable plug is correctly aligned with the connector or adapter. v Make sure that the tongue on the HSL/RIO cable plug matches the white location keys on the connector.External cables were labeled before disconnection so that they can be reconnected in exactly the same place after the repair. Reconnect all external cables to the rear of the machine you are servicing. This can include cables on the following ports: 1. Serial 2. SPCN 3. GX 4. Ethernet 5. PCI 6. USB 55. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 56. Restore network connectivity The service processor must connect to the network to be accessible from the HMC. How will the IP Address of the Service Processor be managed? Obtain an IP Address automatically from a DHCP server Configure a static IP Address ? Go to step58 on page 520. Chapter 4. Removing and replacing FRUs 519 57. Restore service processor settings Enable network access to the service processor by proceeding as follows: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address, and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to enable network access to the service processor. Perform the following steps: 1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is connected to the HMC port on the system backplane. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Do you have network access to the service processor? Yes No Go to step 60 on page 521. Go to step 62 on page 521 58. Restore service processor settings through the ASMI The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, controlling the system power, and setting service processor network ports. The ASMI may also be referred to as the service processor menus. The ASMI can be accessed through https. As the network connection uses static IP address assignments, perform the following: 1. To know your service processor network ports, if necessary, you can use the control panel in manual mode. Refer to service functions to get control panel documentation in that case. 2. Connect a client with a Web browser directly to the service processor network ports you previously noted. It could be something like the following URLs: v https://169.254.2.147 v https://169.254.3.1473. If not already connected, connect all system power cables by plugging them into power outlets.Note: Do not start the system at this time. 4. Log on to the ASMI with the ?admin? User ID and ?admin? default Password. 5. Change the ?admin? User ID’s Password and the ?general? User ID’s Password when prompted. 6. To configure network access perform the following: a. Click on Network Configuration under the Network Services node. b. On the right pane, configure eth0 and eth1 network interfaces, choose for Type of IP Address ’Static’, set a Host name, an IP address, a Subnet mask. 59. Restore service processor settings With the network connection now configured to use static IP address assignments, try to access the service processor network ports. Do you have network access to the service processor? Yes No ? Go to step 63 on page 522 520 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 60. Access the ASMI If you are already connected to the ASMI, click Next to continue. Otherwise to access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 61. Go to step 65 on page 522. 62. Restore service processor settings If the network connection uses DHCP to establish an IP address, but does not use an HMC as the DHCP server, perform the following: 1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address to the service processor. 2. Ensure that the service processor is connected to the existing service network by verifying that the network cable is connected to the network port. 3. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Do you have network access to the service processor? No Yes ? Go to step 60 Chapter 4. Removing and replacing FRUs 521 63. Restore service processor settings You have indicated that you are still not able to access the service network. You will need to reset the service processor by pressing the pin-hole switch (E) on the operator panel. 64. Go to step 56 on page 519. 65. Restore service processor settings Change the ?admin? User ID’s password when prompted by performing the following steps: 1. In the navigation area of the ASMI, expand Login Profile. 2. Select Change Password under Login Profile. 3. Specify the required information, and click Continue.As a security measure, you are required to enter your current user password in the Current password for current user ID field. Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the ?admin? User ID is ?admin?. After your initial login to the ASMI, the ?admin? password must be changed. 522 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 66. Restore service processor settings Set the system name. The system name can be changed to any valid ASCII string. It does NOT have to follow the initialized MachineType-Model#-Serial# format. To change the system name, perform the following: 1. In the navigation area of the ASMI, expand System Configuration. 2. Select System Name under System Configuration. 3. Enter the desired system name. 4. Click Save settings to update the system name to the new value.The new system name is displayed in the status frame, the area where the Log out button is located. If another method, such as the HMC, is used to change the system name, the status frame does not reflect the change. 67. Restore service processor settings Set the time-of-day. 1. In the navigation area of the ASMI, expand System Configuration. 2. Select Time of Day under System Configuration. 3. If the system is powered off, the right pane displays a form that shows the current date (month, day, and year) and time (hours, minutes, seconds). 4. Change either the date or the time or both, and click Save settings. 68. Restore service processor settings Reenter any of the following settings that were previously changed through the ASMI, unless you want to use the default settings. Settings to be set include the following: 1. Power/Restart Control settings. 2. System Service Aids settings. 3. System Configuration settings, if not already done. 4. Network Services settings. 5. Performance Setup settings. 6. Login Profile settings. Chapter 4. Removing and replacing FRUs 523 69. Restore service processor settings 1. Reset any service processor settings that you may have set using operating system commands. You recorded these settings at the beginning of this procedure. 2. If you choose to reset the HMC Access password, perform the following: 3. a. From the HMC GUI (preferred method): b. 1) Expand the Systems Management folder in the navigation tree pane. 2) Double click the Servers folder. 3) Use the checkbox to select a server, in the central panel. 4) Under the Operations folder, select Change Password. 5) Provide the desired information.c. d. Or, from the HMC command line, type: e. /usr/hmcrbin/chsyspwd -m managedsystem -t access --passwd --newpasswd newpassword where: v The value for managedsystem is the new service processor’s managed system name. v No value for --passwd is entered thereby allowing authentication. v The value for newpasswd is the newpassword value. 70. Restore partition data on the service processor, if you had a partition in your system Profile data stored in the managed server has been cleared or corrupted. To recover profile data using the HMC, proceed as follows: 1. Expand the Systems Management folder in the navigation tree pane. 2. Double click the Servers folder. 3. Use the checkbox to select a server, in the central panel. 4. In the Configuration folder if you have the Manage Partition Data folder, select Restore. 71. Go to Chapter 5, “Verifying a repair,” on page 557. 524 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Thermal power management device (TPMD) card Use this procedure to service the thermal power management device (TPMD) card. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 525 3. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 526 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Remove the TPMD card Gently pull the TPMD card (A) straight up and out of its slot on the system backplane. Chapter 4. Removing and replacing FRUs 527 7. Install the TPMD card Gently push the TPMD card (A) straight into its slot on the system backplane. 8. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 528 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 10. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 529 11. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 12. Go to Chapter 5, “Verifying a repair,” on page 557. 530 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Voltage regulator module pair Use this procedure to service the voltage regulator module pair. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 531 3. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 532 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. Chapter 4. Removing and replacing FRUs 533 7. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. 8. Remove the voltage regulator module pair 1. Locate the voltage regulator module pair. You will need to remove both voltage regulator modules of the pair. 2. Push the connector tabs (A) out and then down to unlock the module. 3. Pull the module (B) out of the connector. 4. Repeat these steps to remove the other module of the pair. 534 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Install a voltage regulator module pair 1. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing a new module. 2. Carefully grasp the module (B) along two edges and align it with the connector. 3. Insert the module into the connector. 4. Push the connector tabs into the locked position. 5. Repeat these steps to install the other voltage regulator module in the pair. 10. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. Chapter 4. Removing and replacing FRUs 535 11. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 12. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 536 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 13. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 14. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 537 15. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 16. Go to Chapter 5, “Verifying a repair,” on page 557. 538 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Voltage regulator module - single Use this procedure to service a single voltage regulator module. 1. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 2. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 539 3. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 540 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 6. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. Chapter 4. Removing and replacing FRUs 541 7. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. 8. Remove a single voltage regulator module 1. Locate the voltage regulator module. 2. Push the connector tabs (A) out and then down to unlock the module. 3. Pull the module (B) out of the connector. 542 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 9. Install a single voltage regulator module 1. Ensure that the connector tabs (A) are pushed out in the unlocked position before installing a new voltage regulator module. 2. Carefully grasp the module (B) along two edges and align it with the connector. 3. Insert the module into the connector. 4. Push the connector tabs into the locked position. 10. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. Chapter 4. Removing and replacing FRUs 543 11. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 12. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 544 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 13. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 14. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 545 15. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 16. Go to Chapter 5, “Verifying a repair,” on page 557. 546 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Vital product data (VPD) card Use this procedure to service the vital product data (VPD) card. 1. Save the system identifiers 1. On the ASMI Welcome pane, if you have not already logged in, specify your userid and password, and click Log in. Note: To perform this operation, your authority level must be Administrator or Authorized service provider. 2. In the navigation area, expand System Configuration and then Program Vital Product Data. 3. In the navigation area, select System Brand under Program Vital Product Data. 4. Manually record the value for the System Brand, which appears in the right pane. 5. In the navigation area, select System Keywords under Program Vital Product Data. 6. Manually record the values for the Machine type-model, System serial number and System unique ID, which appear in the right pane. 2. Shut down the partitions and power off the system This FRU cannot be serviced concurrently. Every partition must be shut down and the system powered off to continue the repair. Authorization MUST COME FROM THE CUSTOMER to shut down all running partitions and power off the system. If the customer will not allow the partitions to be shut down or the system to be powered off, delay the service procedure. If the customer is using a secondary HMC to manage the system, make sure they are NOT powering on the system at this time. 3. Disconnect the power cords 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Chapter 4. Removing and replacing FRUs 547 4. Place into service position Note to tower users: The instructions in this procedure are written based on the rack model. Tower users may still use this procedure. However, tower users should ignore specific references to rack-only features such as rack doors, rack thumbscrews, and rack-specific FRU orientation, etc. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 5. Place into service position (continued) Notes: 1. When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. 2. Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. 3. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far.While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 548 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Remove the service access cover 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit.Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 7. Remove the fans 1. Push the latch (A) upward to disengage the fan from the fan cage. 2. Pull the fan (B) out of the fan cage. 3. Repeat these steps until all fans are removed from the fan cage. Chapter 4. Removing and replacing FRUs 549 8. Remove the fan cage 1. Pull the four retaining tabs (A) that secure the fan cage to its enclosure. 2. Lift the fan cage (B) straight up and out of the enclosure. 9. Remove the VPD card 1. Press gently on the handle (A) to unlock the VPD card (B) from its slot on the system backplane. 2. Pull the VPD card out of its slot 550 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 10. Install the VPD card 1. Press gently on the handle (A) to insert the VPD card (B) into its slot on the system backplane. 2. Push the VPD card into place until it is fully seated. 11. Install the fan cage 1. Check that the four retaining tabs are in open position. 2. Lower the fan cage (A) into its location on the enclosure, aligning the blue tabs with the holes. 3. Push firmly on the fan cage so that you are sure it fits into its position. 4. Tighten the four retaining tabs (B) securing the fan cage to its enclosure. Chapter 4. Removing and replacing FRUs 551 12. Install the fans 1. Lower the fan (A) into its location in the fan cage. 2. Push down on the fan until it locks into place. 3. Secure the fan by closing the latch (B). 4. Repeat these steps until all fans are installed. 13. Install the service access cover 1. Place the service access cover (A) on top of the system, about 25 mm (1 inch) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 552 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 14. Place into operating position Note: When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (B) by sliding them toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches have locked into position. 15. Place into operating position (continued) 1. Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 2. Close the front rack door of the unit you are servicing. Chapter 4. Removing and replacing FRUs 553 16. Install the power cords If you removed the power cords, perform the following steps. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. 3. Close the back rack door. 17. Power on the system. Power on the system. 18. Access the ASMI If you are already connected to the ASMI, click Next to continue. Otherwise to access the ASMI through the Hardware Management Console (HMC), complete the following steps: 1. Ensure that the server you are working with is selected. 2. Click Tasks. 3. Click Operations, then click Advanced System Management (ASM). Note: If there is more than one service processor, you must select the primary service processor. 19. Set the system identifiers 1. On the ASMI Welcome pane, if you have not already logged in, specify your userid and password, and click Log In. Note: To perform this operation, your authority level must be Administrator or Authorized service provider. 2. In the navigation area, expand System Configuration and Program Vital Product Data. 3. Select System Brand. 4. If the system brand is not set, and only in that case, enter the correct system brand and click Continue. Note: If you do not know the correct system brand, contact your next level of support. 5. Select System Keywords under Program Vital Product Data. 6. If the backed up system keywords are correct, click Save settings. Otherwise, enter the values for the Machine type-model, System serial number, and System unique ID. Set the Reserved field to blank spaces unless directed otherwise by Level 4 technical support. Note: If you do not know the system unique ID, contact your next level of support. You cannot boot the system until valid values are entered for all fields. You can change these entries only once. 7. Click Save settings to update the system keywords and save them to the VPD. 554 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 20. Go to Chapter 5, “Verifying a repair,” on page 557. Chapter 4. Removing and replacing FRUs 555 556 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 5. Verifying a repair Use this information to verify hardware operation after making repairs to the system. Choose from the following: v To verify the repair of a system that is currently powered off, go to step 1. v To verify the repair of a system that is currently powered on with no operating system loaded, go to step 3. v To verify the repair of a system that is currently powered on and an operating system is loaded, go to step 5.1. Power on the server and all attached I/O enclosures. Did all the enclosures power on? No Yes ? Go to Step 3. 2. Choose from the following: v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate and replace the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that an enclosure would not power on and you have an isolation procedure to perform, perform the isolation procedure. v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 3. Load the operating system. Did the operating system load successfully? No Yes ? Go to Step 5. 4. Choose from the following: v If the original problem was a failing disk drive that contained the operating system software, go to step 5. v If the original problem was that the operating system would not load and you have another FRU to replace, go to your FRU locations section to locate the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that the operating system would not load and you have an isolation procedure to perform, perform the isolation procedure. v If the original problem was that the operating system would not load and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 5. Choose from the following: “Verifying the repair in AIX” on page 559 “Verifying the repair in Linux” on page 563 © Copyright IBM Corp. 2008 557 “Verifying a repair using an i5/OS system or logical partition” on page 564If you have installed a new feature or part, verify that the system recognizes the feature or part by using the i5/OS system service tools. Use the procedure in this section to accomplish this task. 558 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Verifying the repair in AIX Use this MAP to check out the server after a repair is completed. 1. Did you replace a disk drive in the root volume group? NO Go to step 3 YES Continue with the next step. 2. Run standalone diagnostics from either a CD or from a Network Installation Management (NIM) server. Did you encounter any problems? NO Reinstall the operating system and continue with step 5. YES If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning Problem Analysis. 3. Did you replace a FRU with power on and concurrently with system operations? NO Go to step 5. YES Continue with the next step. 4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? YES Go to step 6. NO Go to step 7 on page 560. Note: The AIX diagnostic service aid was used if a resource was removed using the Hot Plug task. 5. Note: If any FRUs have been removed that should be reinstalled, reinstall them now. Perform the following: 1. If the system is not powered on, power it on now. 2. Perform a slow boot. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. 4. Did you encounter any problems?NO Continue at step 6. YES If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning Problem Analysis. 6. If the Resource Repair Action menu is already displayed, go to step 9 on page 560; otherwise, perform the following: Chapter 5. Verifying a repair 559 1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with the next step. 7. Perform the following: 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the DIAGNOSTIC MODE SELECTION menu displays, select System verification. 5. When the DIAGNOSTIC SELECTION menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU.Did the RESOURCE REPAIR ACTION menu (801015) display? NO Continue with the next step. YES Go to step 9. 8. Did the TESTING COMPLETE, no trouble was found menu (801010) display? YES Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this action will set it back to the normal state. Go to step 11 on page 561. NO If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning Problem Analysis. 9. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing a FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.Note: If the system attention indicator is on, this action will set it back to the normal state. 560 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Perform the following: 1. Select the resource that has been replaced from the RESOURCE REPAIR ACTION menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections.Did another Resource Repair Action (801015) display? NO If the No Trouble Found menu displays, go to step 11. YES Continue with the next step. 10. The parent or child of the resource you just replaced may also require that you run the RESOURCE REPAIR ACTION service aid on it. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing that FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced.Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following: 1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. 3. If the No Trouble Found menu displays, continue with the next step. 11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran standalone diagnostics from CD-ROM, remove the standalone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This information does not apply to the PCI-X RAID adapter or cache. NO Go to the close of call procedure. YES Continue with the next step. 12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the following: Chapter 5. Verifying a repair 561 1. On the PCI SCSI Disk Array Manager screen, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration. Press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you get the Are You Sure selection menu, press Enter to continue. 8. If you get a Failed status message, verify that you selected the correct adapter, then repeat this procedure. When the recovery is complete, exit the operating system. 9. Go to the close of call procedure. 562 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Verifying the repair in Linux 1. Run standalone diagnostics from either a CD or from a Network Installation Management (NIM) server. For more information, see Running the eServer stand-alone diagnostics from CD-ROM. Did you encounter any problems? NO Reboot the operating system and continue with the close of call procedure. YES If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, perform problem analysis and repair the new problem. Chapter 5. Verifying a repair 563 Verifying a repair using an i5/OS system or logical partition If you have installed a new feature or part, verify that the system recognizes the feature or part by using the i5/OS system service tools. Use the procedure in this section to accomplish this task. 1. Deactivate the failing item indicator light and close the service action log entry. 2. Be sure you are signed on with at least service level authority. 3. On the command line of the i5/OS session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service Focal Point Utilities to get to the Dedicated Service Tools (DST) display. 4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Note: The service tools password is case-sensitive. 5. Select Start a service tool from the System Service Tools (SST) display. Press Enter. 6. Select Hardware service manager from the Start a Service Tool display. Press Enter. 7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager display. This option allows you to display and work with logical resources. Logical hardware resources are the functional resources of the system used by the operating system. 8. Press Enter. When you select one of the options from the Logical Hardware Resources display, you can do the following: v Display logical hardware resource status v Display or change logical hardware resource information v Display associated packaging hardware resourcesFor more information on collecting information and performing specific functions using the options and function keys from the Logical Hardware Resources display or for details on the symbols that appear next to the Resource Description field, see the online Help information. 564 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 6. Closing a service call Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable events might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the HMC online repair procedures, ensure that the original serviceable event is now closed. 1. For future reference, record the SRC or symptom and the location code of the FRU you replaced. Is the server managed by an HMC? Yes No ? Do one of the following: v If the server is managed by IVM, Go to “Closing a service call using Integrated Virtualization Manager” on page 571. v If the server is not partitioned and running AIX or Linux, go to “Closing a service call using AIX or Linux” on page 577. 2. On the HMC, open Manage Serviceable Events and examine the service action event log for any open service action events. 3. Are there any service action events that are open? Yes No ? If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. © Copyright IBM Corp. 2008 565 5. From the list of serviceable events recorded in step 4 on page 565, perform the following step (6) through step 32 on page 569 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in Step 1 on page 565? No Yes ? Go to step 11 on page 567. 8. Examine the FRU list of the service action event. Are there any FRUs listed for the service action event? Yes No ? Go to step 11 on page 567. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 565? No Yes ? Go to step 11 on page 567. 10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 565 in the list of FRUs for this service action event? Yes No ? Go to step 32 on page 569. Note: There are service action events that will remain open when you leave this MAP. Further service actions may be required to complete the repair. 566 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? Yes No ? Go to step 17. 13. Have you begun a list of ?Axx? partitions from prior service action events that you processed in this MAP? No Yes ? Go to step 15. 14. Begin a new list of ?Axx? partitions by copying the list of partitions obtained in step 11. Go to step 16. 15. Add the partition list obtained in step 11 to the existing list of ?Axx? partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of partitions obtained in step 11 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. Chapter 6. Closing a service call 567 20. Did you replace, add, or modify a FRU of the open service action event? Yes No ? Go to step 22. 21. From the FRU list, select a FRU that you need to update. Double-click on the FRU, and update the FRU information. Go to step 23. 22. Select the No FRU Replaced for this Serviceable Event option. 23. Click OK to close the service action event. 24. Is the list of all partitions you recorded in step 11 on page 567 empty? No Yes ? Go to step 32 on page 569. 25. Does the list of all partitions you recorded in step 11 on page 567 contain more than one entry? Yes No ? Go to step 32 on page 569. 26. Is the error class recorded in step 25 AIX? Yes No ? Go to step 32 on page 569. 27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 567, except the partition you were using to debug the original problem. 28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 568 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 29. When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 30. Exit from diagnostics in this partition and return to the AIX prompt. 31. Have all the partitions in the list of all partitions you recorded in step 11 on page 567 been processed? Yes No ? Go to step 24 on page 568 to process the next partition in the list you recorded in step 11 on page 567. 32. Have all the serviceable events recorded in step 4 on page 565 been processed? Yes No ? Go to step 5 on page 566 and process the next service action event in the list of serviceable events recorded in step 4 on page 565. 33. While processing all service action events, were you directed to step 14 on page 567? Yes No ? If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. Chapter 6. Closing a service call 569 34. Perform the following steps for each entry in the list of ?Axx? partitions you began recording in step 14 on page 567, except the partition you were using to debug the original problem. 35. From the HMC virtual terminal window of a partition in the list of ?Axx? partitions, type diag at the AIX command prompt. 36. When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 37. Exit from diagnostics in this partition and return to the AIX prompt. 38. Have all the partitions in the list of ?Axx? partitions you began recording in step 14 on page 567 been processed? Yes No ? Go to step 34 to process the next partition in the list you recorded in step 14 on page 567. 39. If the system attention LED is still on, turn off the LED as described in Activating and deactivating LEDs. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. “Closing a service call using Integrated Virtualization Manager” on page 571Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. “Closing a service call using AIX or Linux” on page 577If the server is not connected to an HMC and not using IVM, perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. “Activating and deactivating LEDs” on page 583Use this procedure to activate or deactivate LEDs. 570 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Closing a service call using Integrated Virtualization Manager Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1. For future reference, record the SRC or symptom and the location code of the FRU you replaced. 2. On the IVM, open Manage Serviceable Events and look at existing serviceable events. 3. Are there any service action events that are open? Yes No ? If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 575 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. Chapter 6. Closing a service call 571 7. Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in Step 1 on page 571? No Yes ? Go to step 11. 8. Examine the FRU list of the service action event. Are there any FRUs listed for the service action event? Yes No ? Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 571? No Yes ? Go to step 11. 10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 571 in the list of FRUs for this service action event? Yes No ? Go to step 30 on page 575. Note: There are service action events that will remain open when you leave this MAP. Further service actions may be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? Yes No ? Go to step 17 on page 573. 572 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 13. Have you begun a list of ?Axx? partitions from prior service action events that you processed in this MAP? No Yes ? Go to step 15. 14. Begin a new list of ?Axx? partitions by copying the list of partitions obtained in step 11 on page 572. Go to step 16. 15. Add the partition list obtained in step 11 on page 572 to the existing list of ?Axx? partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11 on page 572. If you are referred to the list of partitions obtained in step 11 on page 572 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Manage Serviceable Events window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. 20. Did you replace, add, or modify a FRU of the open service action event? Yes No ? Go to step 22 on page 574. 21. Click OK to close the service action event. Chapter 6. Closing a service call 573 22. Is the list of all partitions you recorded in step 11 on page 572 empty? No Yes ? Go to step 30 on page 575. 23. Does the list of all partitions you recorded in step 11 on page 572 contain more than one entry? Yes No ? Go to step 30 on page 575. 24. Is the error class recorded in step 23 AIX? Yes No ? Go to step 30 on page 575. 25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 572, except the partition you were using to debug the original problem. 26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 27. When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 28. Exit from diagnostics in this partition and return to the AIX prompt. 574 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 29. Have all the partitions in the list of all partitions you recorded in step 11 on page 572 been processed? Yes No ? Go to step 25 on page 574 to process the next partition in the list you recorded in step 11 on page 572. 30. Have all the serviceable events recorded in step 4 on page 571 been processed? Yes No ? Go to step 5 on page 571 and process the next service action event in the list of serviceable events recorded in step 4 on page 571. 31. While processing all service action events, were you directed to step 14 on page 573? Yes No ? If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 583. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. 32. Perform the following steps for each entry in the list of ?Axx? partitions you began recording in step 14 on page 573, except the partition you were using to debug the original problem. 33. From the IVM virtual terminal window of a partition in the list of ?Axx? partitions, type diag at the AIX command prompt. Chapter 6. Closing a service call 575 34. When the diagnostic operating instructions are displayed, do the following: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 35. Exit from diagnostics in this partition and return to the AIX prompt. 36. Have all the partitions in the list of ?Axx? partitions you began recording in step 14 on page 573 been processed? Yes No ? Go to step 32 on page 575 to process the next partition in the list you recorded in step 14 on page 573. 37. If the system attention LED is still on, turn off the LED as described in Activating and deactivating LEDs. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. 576 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Closing a service call using AIX or Linux If the server is not connected to an HMC and not using IVM, perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? No Yes ? Go to step 4 on page 578 2. Note: If the system planar or battery has been replaced and you are loading diagnostics from a server over a network, it may be necessary for the customer to set the network boot information for this system before diagnostics can be loaded. The system time and date information should also be set when the repair is completed. Do you have any FRUs (for example cards, adapters, cables, or devices) that were removed during problem analysis that you want to put back into the system? No Yes ? Reinstall all of the FRUs that were removed during problem analysis. Go to step 3 3. Is the system or logical partition that you are performing a repair action on running the AIX operating system? Yes No ? Go to step 5 on page 578 Chapter 6. Closing a service call 577 4. Does the system or logical partition you are performing a repair action on have AIX installed? Note: Answer ?NO? to this question if you have just replaced a hard disk in the root volume group. No Yes ? Go to step 7 on page 579 5. Run stand-alone diagnostics in problem determination mode from either a CD ROM or from a NIM server. Note: For instruction on running stand-alone diagnostics from a CD and not using an HMC, go to Running the eServer stand-alone diagnostics from CD on a server without an HMC attached. For instruction on running stand-alone diagnostics from a NIM server, go to Running the eServer stand-alone diagnostics from a Network Installation Management server. Did you encounter any problems? No Yes ? Go to Problem Analysis, order number SA76-0111. 6. The system hardware is functioning correctly. If the system attention LED is still on, turn off the LED as described in Activating and deactivating LEDs. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. This might require a reboot of the operating system.Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 578 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 7. 1. If the system supports slow boot (See “Performing a slow boot” on page 665), do a slow boot on the system. If the system does not support slow boot, do a normal boot. 2. Power on the system. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped.Did the AIX Login Prompt display? Yes No ? Go to Problem Analysis, order number SA76-0111. 8. If the Resource Repair Action menu is already displayed, go to 12 on page 580; otherwise, do the following: 1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue on to 9. 9. 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the DIAGNOSTIC MODE SELECTION menu displays, select Problem determination. 5. When the ADVANCED DIAGNOSTIC SELECTION menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU.Did the RESOURCE REPAIR ACTION menu (801015) display? No Yes ? Go to step 13 on page 580. 10. Did the TESTING COMPLETE, no trouble was found menu (801010) display? Yes No ? There is still a problem. Go to Problem Analysis, order number SA76-0111. Chapter 6. Closing a service call 579 11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this will set it back to the normal state. Go to step 14 on page 581. 12. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing a FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following: 1. Select the resource that has been replaced from the RESOURCE REPAIR ACTION menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? Yes No ? If the No Trouble Found menu displays, go to step 14 on page 581 13. The parent or child of the resource you just replaced may also require that you run the RESOURCE REPAIR ACTION service aid on it. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the RESOURCE REPAIR ACTION menu displays. After replacing that FRU, you must select the resource for that FRU from the RESOURCE REPAIR ACTION menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. 580 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Do the following: 1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. Did the No Trouble Found menu display? Yes No ? Go to Problem Analysis, order number SA76-0111. 14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This does not refer to the PCI-X RAID adapter or cache. Yes No ? Go to step 16 on page 582. 15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following: 1. On the PCI SCSI Disk Array Manager screen, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration. Press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you get the Are You Sure selection menu, press Enter to continue. 8. You should get an OK status message when the recover is complete. If you get a Failed status message, verify that you selected the correct adapter, then repeat this procedure. When recover is complete, exit the operating system. 9. Go to 16 on page 582 Chapter 6. Closing a service call 581 16. The system hardware is functioning correctly. Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned.Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 582 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Activating and deactivating LEDs Use this procedure to activate or deactivate LEDs. Choose from the following: v “Deactivate a system attention LED or partition LED using the HMC” v “Activate or deactivate identify LED using the HMC” v “Deactivate a system attention LED or partition LED using the ASMI” on page 584 v “Activate or deactivate identify LED using the ASMI” on page 584Deactivate a system attention LED or partition LED using the HMC You can deactivate a system attention LED or a logical partition LED. For example, you might determine that a problem is not a high priority and decide to repair the problem at a later time. However, you want to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be activated again if another problem occurs. 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select View System Attention. The system attention LED window opens. The selected system and its LED state are displayed in the upper part of the window. The logical partition and its LED state are displayed in the lower part of the window. From the system attention LED window, you can deactivate both the system attention LED and the logical partition LED. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED.7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED.Activate or deactivate identify LED using the HMC The system provides several LEDs that help identify various components, such as enclosures or field replaceable units (FRUs), in the system. For this reason, they are called Identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. Chapter 6. Closing a service call 583 To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select Identify LED. The Identify LED, Select Enclosure window opens. 6. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 7. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected ? List FRUs. 8. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off.Deactivate a system attention LED or partition LED using the ASMI The system attention indicator provides a visual signal that the system as a whole requires attention or service. Each system has a single system attention indicator. When an event occurs that either needs your intervention or that of service and support, the system attention indicator lights continuously. The system attention indicator is turned on when an entry is made in the service processor error log. The error entry is transmitted to the system level and operating system error logs. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerTo turn off the system attention indicator, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration and Service Indicators. 3. Select System Attention Indicator. 4. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error message is displayed.Activate or deactivate identify LED using the ASMI You can specify the location code of any indicator to view or modify its current state. If you provide the wrong location code, the advanced system manager attempts to go to the next higher level of the location code. The next level is the base-level location code for that field replaceable unit (FRU). For example, a user types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no other level is available. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerTo change the current state of an indicator, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration and Service Indicators. 584 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Select Indicators by Location code. 4. In the right pane, enter the location code of the FRU and click Continue. 5. Select the preferred state from the list. 6. Click Save settings. Chapter 6. Closing a service call 585 586 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Chapter 7. Common Service Procedures This chapter contains procedures that may be common to various isolation and repair procedures. 1. “Powering on and powering off” on page 588Provides information for powering on and powering off the system. 2. “Determining a primary or alternative console” on page 591The primary console is the first workstation that the system identifies. It is attached to the first input/output adapter (IOA) or input-output processor (IOP) that supports workstations. The alternative console is the workstation that functions as the console when the primary console is not operational. 3. “Determining if the system has logical partitions” on page 593 4. “Separating the 571F/575B card set and moving the cache directory card” on page 594 5. “Determining which Processor is the Secondary Service Processor” on page 600Use this procedure to determine which service processor on your system is the secondary service processor. 6. “The System Reference Code (SRC) Format Description” on page 601The format of the SRC that you see depends on the system or server. 7. “Using the Service Action Log” on page 609Use this procedure to search for an entry in the Service Action Log (SAL) that matches the time, reference code, or resource of the reported problem. 8. “Service functions” on page 652Service functions provides enough detail for the service provider to gather information about hardware problems while under the direction of the next level of support. 9. “Managing dumps” on page 653After you have performed a dump, or one occurs automatically, use this information to manage the dump, including copying, reporting, and deleting it. 10. “Performing an IBM i5/OS main storage dump” on page 663A main storage dump is a collection of data from the i5/OS’s main storage. It can be manually initiated when the system administrator suspects a system wait or loop, or automatically by the server firmware as the result of a system failure. 11. “Performing a slow boot” on page 665Learn how to perform a slow boot using the HMC, control panel, or ASMI menus depending on your configuration. 12. “Determining the primary or alternate console” on page 667Use this procedure to determine the primary or alternate console. 13. “Resetting the service processor” on page 669Use this procedure to reset the service processor. 14. “Resetting the managed system connection from the HMC” on page 670Use this procedure to reset the managed system connection from the HMC. 15. “Checking for a duplicate IP address” on page 671Use this procedure to check for a duplicate IP address. 16. “Running the online and eServer stand-alone diagnostics” on page 672Use these tools to diagnose hardware problems on your AIX or Linux systems or partitions. 17. “Concurrent PCI adapter removal and replacement using AIX” on page 682 18. “Concurrent PCI adapter removal and replacement using Linux” on page 686 © Copyright IBM Corp. 2008 587 Powering on and powering off Provides information for powering on and powering off the system. Attention: If possible, shut down all applications and partitions before powering off the system. Using either the power-on button on the control panel or entering commands at the Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data files. Also, the next time you start the system, it might take longer if all applications are not ended before stopping the system. Install the power cords Note: Before powering on a system, ensure that the power cords have been plugged in to all of the power supplies. Power on a system using an HMC To power on a managed system, complete the following steps: 1. In the Navigation area, expand the Systems Management icon. 2. Expand the Servers icon. 3. Choose the check box next to the name of the desired server. The tasks available for that server will be enabled. 4. From the Tasks drop down menu, choose Operations -> Power on. Follow any additional on-screen instructions.Power on a system without an HMC To start a system that is not managed by an HMC, follow these steps: 1. Press the power button A, as shown in the following figure, on the control panel. Observe the following after pressing the power button: v The power-on light begins to blink visibly faster. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed. v Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. v The power-on light on the control panel stops blinking and remains on, indicating that system power is on.2. On a rack-mounted system unit, open the front rack door, if necessary. On a deskside system unit, open the front door. 588 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. Before you press the power button on the control panel, ensure that power is connected to the system unit as follows: v All system power cables are connected to a power source. v The power-on light is slowly blinking. v The top of the display contains 01 V=F. 4. Press the power button A, as shown in the figure, on the control panel. 5. Observe the following after pressing the power button: v The power-on light begins to blink visibly faster. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed. v Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. The power-on light on the control panel stops blinking and remains on, indicating that system power is on.Power off a system using an HMCAttention: If possible, shut down the running logical partitions on the managed system before powering off the managed system. Powering off the managed system without shutting down the logical partitions first causes the logical partitions to shut down abnormally and can cause data loss. 1. In the Navigation area, expand the Systems Management icon. 2. Expand the Servers icon. 3. Choose the check box next to the name of the desired server. A selection of tasks available for that server will be displayed. 4. From the Tasks menu, choose Operations -> Power off. Follow any additional on-screen instructions. 5. Continue with “Remove the power cords” on page 590 Figure 43. Control Panel Chapter 7. Common Service Procedures 589 Power off a system without an HMC 1. If there is an operating system running, proceed with step 2. Otherwise, press the power button A, as shown in Figure 43 on page 589. The system power turns off. Proceed to step 4 2. Log in to the system as a user with the authority to run the shutdown command. 3. At the command line, enter one of the following commands: v If your system is running AIX, type shutdown. v If your system is running Linux, type shutdown -h now.The command stops the operating system. The system power turns off. 4. The power-on light begins to slowly blink, and the system goes into a standby state. 5. Continue with “Remove the power cords.”Remove the power cords Unplug any power cables that are attached to the unit from electrical outlets. Remove all power cords from the server. Important: The system might be equipped with a second power supply. Before continuing with the procedure, ensure that all power sources to the system have been completely disconnected. (L003) or 590 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Determining a primary or alternative console The primary console is the first workstation that the system identifies. It is attached to the first input/output adapter (IOA) or input-output processor (IOP) that supports workstations. The alternative console is the workstation that functions as the console when the primary console is not operational. A console is a workstation that allows you to view and control system operations. The primary console is the first workstation that the system identifies. It is attached to the first input/output adapter (IOA) or input-output processor (IOP) that supports workstations. The alternative console is the workstation that functions as the console when the primary console is not operational. The system can assign up to two alternative consoles. The first alternative console can only be a twinaxial workstation that is attached to the same IOP as the primary console. The next alternative console is a workstation that is attached to the next IOA or IOP that is capable of supporting workstations. The IOA or IOP that supports a console must be on the system bus (bus 1). If a workstation is not correctly attached to the first IOA or IOP that is capable of attaching workstations, the system will not assign a primary console. If it does not assign a primary console, the system displays a reference code on the control panel. If the system is set for Manual mode, it stops during the IPL. For more information on how to determine the primary and alternative consoles, see “Identifying the consoles when the system is operational.” Primary console requirements For a workstation to be the primary console, it must be operational and attached to the system bus. It must also have the correct port and address assigned. If the workstation is a personal computer, it must also have an active workstation emulation program. The workstation requirements are: v Twinaxial workstation – Port 0, Address 0, Bus 1v ASCII workstation – Port 0, Bus 1v Personal computer attached to ASCII IOP – Port 0, Bus 1 – Personal computer software to emulate a 316x or 3151 terminalv Personal computer attached to Twinaxial IOP – Port 0, Address 0, Bus 1 – 5250 emulator software active on personal computerFor more information on how to determine the primary and alternative consoles, see “Identifying the consoles when the system is operational.” Identifying the consoles when the system is operational When the system is operational, you can determine the primary and alternative consoles by performing one of the following: v Look at the display. – If the dominant operating system is i5/OS, look for a sign-on display that shows DSP01 in the upper right corner. DSP01 is the name that the system assigns to the primary console.Note: This resource name might have been changed by the customer. Chapter 7. Common Service Procedures 591 v Use system commands to assist in identifying the consoles. See the system operation information for more details on commands. v Use the Hardware Service Manager function to assist in identifying the consoles: 1. Select the System bus resources option on the Hardware Service Manager display. The System Bus Resources display allows you to view the logical hardware resources for the system bus. Look for a (<) symbol next to an IOP. The (<) symbol indicates that the console attaches to this IOP. 2. Select the Resources associated with IOP and the Display detail options to collect more information about the consoles. 592 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Determining if the system has logical partitions This procedure will help you determine if the system has logical partitions. 1. Is the system managed by an HMC? Yes: Continue with the next step. No: The system does not have logical partitions. You must have an HMC to have logical partitions. Return to the procedure that sent you here. This ends the procedure.2. Determine if the system has multiple logical partitions by performing the following from the HMC: a. Expand the Systems Management folder in the Navigation area. b. Expand the Servers folder. c. Click the name of the managed system you are working with. The partitions will be displayed.Is there more than one logical partition listed? No: The system does not have multiple logical partitions. Return to the procedure that sent you here. This ends the procedure. Yes: The system has multiple logical partitions. Return to the procedure that sent you here. This ends the procedure. Chapter 7. Common Service Procedures 593 Separating the 571F/575B card set and moving the cache directory card To complete this procedure, you will need a T-10 TORX bit or driver. Attention: To avoid loss of cache data, you must not remove the cache battery during this procedure. To separate the 571F/575B card set and move the cache directory card, do the following:Important: All cards are sensitive to electrostatic discharge. 1. Label both sides of the card before separating. 2. Go to the next step if you are not servicing a 571F/575B card set containing a light pipe assembly. If you are servicing a 571F/575B card set that contains a light pipe assembly, you will need to remove the light pipes. To remove the light pipes, do the following: a. Place the 571F/575B card set adapter on an ESD protective surface. b. Remove the light pipe retaining screw C from the 571F/575B card set. c. Slide the light pipe assembly D from between the 571F/575B cards. d. Put both the screw and the light pipe assembly in a safe place. 3. Place the 571F/575B card set adapter on an ESD protective surface and orient it as shown. 594 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 4. Disconnect battery cable A and SCSI cable B from the 571F storage adapter. Leave the other end of the cables attached to the 575B auxiliary cache adapter. 5. To prevent possible card damage, first loosen all 5 retaining screws C BEFORE removing them. After all five retaining screws have been loosened, remove the screws C from the 571F storage adapter. Chapter 7. Common Service Procedures 595 6. Carefully lift the 571F storage adapter off the standoffs and set it on the ESD protective surface. 7. Turn the 571F storage adapter over so the components are facing up and locate the cache directory card D on the 571F storage adapter. It is the small rectangular card mounted on the I/O card. 596 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 8. Unseat the connector on the cache directory card by wiggling the two corners that are farthest from the mounting pegs. To disengage the mounting pegs, pivot the cache directory card back over the mounting peg. 9. Move the cache directory card to the replacement 571F storage adapter and seat it on the connector and mounting pegs. 10. To reassemble the cards do the above procedure in reverse order. To prevent possible card damage, insert all 5 screws C before tightening any of them. Note: If you are installing the 571F/575B card set adapter into a cassette, do the following: a. Remove the adapter handle B as shown in the following illustration. b. If applicable, remove the light-pipe assembly. c. Remove the two plastic covers from the SCSI jumper cable A as shown in the following figure: Chapter 7. Common Service Procedures 597 d. Ensure that battery cable A and SCSI cable B are reconnected to the 575B auxiliary cache adapter. e. Go to Place an adapter in the PCI adapter double-wide cassette to complete the installation.11. If applicable, to replace the light pipes, do the following: a. Ensure that the 571F/575B cards are correctly assembled. b. If necessary, remove the 571F/575B card handle A from the back of the card set.Tip: If you misplace the light-pipe retaining screw C, use either of the screws that secure the card handle B to the card. c. Slide the light pipe assembly D between the 571F/575B cards. Ensure that the threaded screw hole located on the light pipe assembly is aligned with the screw clearance hole located on the card . 598 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 12. Return to the procedure that sent you here. This ends this procedure. Chapter 7. Common Service Procedures 599 Determining which Processor is the Secondary Service Processor Use this procedure to determine which service processor on your system is the secondary service processor. 1. Locate the HMC1 and HMC2 port pairs for service processor zero and for service processor one. (See Chapter 3, “Locating FRUs,” on page 335 for your specific model.) 2. Is only one service processor with cables attached to its HMC ports? No: Continue to the next step. Yes: The secondary service processor is the service processor with no cables attached to the HMC ports. This ends the procedure.3. Use the HMC to access Advanced System Management Interface (ASMI) to determine which service processor is the secondary service processor. First, verify ASMI connectivity by logging onto ASMI. Then pull all HMC cables from HMC ports for service processor zero (leaving service processor one HMC cables attached). Log onto ASMI again. Were you successfully able to log onto ASMI? No: The service processor with HMC cables still attached is the secondary service processor. Reattach the HMC cable(s) to service processor zero. This ends the procedure. Yes: The cables just removed identify the secondary service processor. Reattach the HMC cables to the secondary service processor. This ends the procedure. 600 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) The System Reference Code (SRC) Format Description The format of the SRC that you see depends on the system or server. “System Reference Code (SRC) Information”Helps you understand how SRCs work. “System reference code (SRC) address formats”Use this procedure to determine the IOP, IOA, or device address from either a control panel or a console display. “Hardware SRC formats” on page 605SRC format examples.System Reference Code (SRC) Information Helps you understand how SRCs work. Use this information to understand how SRCs work. The system reference code (SRC) is a sequence of data words (codes) that: v Identifies a system status v Describes a detected hardware, Licensed Internal Code (LIC), or software failure v Describes the unit that is reporting the failure and its locationSRCs can display on the control panel, appear in a system console message, or appear in the product activity log. The system uses the following procedure to gather information about errors: 1. The system gathers information about an error as the system detects the error. 2. The Product Activity Log (PAL) records and stores the error information. 3. An SRC encodes information used to evaluate or identify a system-detected hardware or software error, failure, or status. The failure information may include the failing condition or part (or unit) that can be exchanged or replaced and its location. 4. The SRC links to the problem determination procedure when the element detecting the failure cannot isolate the failing condition. 5. The SRC link completes the isolation process.In summary, the SRC provides the first-failure-data-capture information. The customer or support representative uses SRCs during problem determination, repair actions, and verification. The SRC provides a link to the problem determination procedures (hardcopy or softcopy). Use the SRC and the problem determination procedures to isolate the problem. System reference code (SRC) address formats Use this procedure to determine the IOP, IOA, or device address from either a control panel or a console display. Use this procedure to determine the IOP, IOA, or device address from either a control panel or a console display. 1. Was the SRC and addressing information obtained either from a control panel display, the Main Storage Dump Manager display, or the HMC Partition Properties Reference Code words display? v No: Continue with the next step. v No: Contact your next level of support. This ends the procedure. v Yes: Addressing information is in the Bus-Bus-Bus-Bus/Card-card/board-board (BBBBCcbb) format as follows: Chapter 7. Common Service Procedures 601 – If characters 7 and 8 of the top 16-character line of function 12 (word two) are 60: - The direct select address is characters 1-8 of the bottom 16-character line of function 12 (word four) in the format ?BBBBCcbb?. - The device address is characters 9-16 of the bottom 16-character line of function 12 (word five) in the format ?abcd efgh?.– If characters 7 and 8 of the top 16-character line of function 12 (word two) are 13, 17, 27, or 29: - The direct select address is characters 9-16 of the top 16-character line of function 12 (word three) in the format ?BBBBCcbb?. - The device address is characters 1-8 of the bottom 16-character line of function 12 (word four) in the format ?abcd efgh?.– If characters 7 and 8 of the top 16-character line of function 12 (word two) are 62: - The direct select address may be characters 9-16 of the top 16-character line of function 13 (word seven) in the format ?BBBBCcbb?. - The device address may be characters 1-8 of the top 16-character line of function 13 (word six) in the format ?abcd efgh?.Go to step 9 on page 604. 2. Are you using the HMC? No: Continue with the next step. Yes: Open the server with the problem, then go to the i5/OS partition with the problem and use the Service Action Log (SAL) to find the entry related to this problem (see “Using the Service Action Log” on page 609), and then go to step 7 on page 603. 3. Are you using the Service Action Log (SAL)? No: Continue with the next step. Yes: Go to step 7 on page 603. 4. Are you using i5/OS communication-diagnostic panels that are displayed by the commands VFYCMN, WRKPRB, or ANZPRB? Yes: Go to step 11 on page 605. No: Continue with the next step. 5. Addressing information is in the Bus/Board/Card format. Note: This format shows the system bus number in decimal form. If no address information is available in the console error message, check the Product Activity Log (PAL) entry that is associated with the console error message. Below is an example of a PAL display for a storage device. Other devices (for example, workstations or communications) have similar displays. For more information about the PAL, see “Using the product activity log” on page 617 in the “Service functions” on page 652. +--------------------------------------------------------------------------------+ |Display Address Information for Resource | | | | Serial Resource | |Name Type Model Number Name | |STORAGE 6718 050 83-00834 DD001 | | | |Physical Location: | | U7879.001.10ABCDE-P3-D3 | | | | | | | |Logical Address: | |PCI Bus: Storage: |Figure 44. Product Activity Log display example 602 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) |System Bus . . . . . . : 3 I/O Adapter . . . . . . : 2 | |System Board . . . . . : 0 I/O Bus . . . . . . . . : 1 | |System Card . . . . . . : 32 Controller. . . . . . . : 4 | | Device . . . . . . . . : 0 | | | +--------------------------------------------------------------------------------+ In the above example, the contents of ?Storage? are defined as follows. Table 49. Storage definitions Reference Description I/O Bus This is the bus to which the device is connected. I/O Adapter This is the adapter that the I/O bus connects to on the I/O processor. Note: For SAS I/O adapters, this field is not applicable. Controller This is the logical controller for the addressed device. Device This is the logical device that is attached to the logical device controller. 6. Use the figure below to understand how addresses in the Bus/Board/Card format relate to the Bus-Bus-Bus-Bus/Card-Card/Board-Board format. You can also use it to understand how addresses in the Bus/Board/Card format relate to the direct select addresses used in Chapter 3, “Locating FRUs,” on page 335. Note: When converting from Bus/Board/Card format to Bus-Bus-Bus-Bus/Card-Card/Board-Board format, the numbers must be converted from Decimal to Hexadecimal. Go to step 10 on page 604 for IOA and device address information. 7. Display the FRU details for the problem that you are attempting to fix. The location information in the field-replacable unit detail is only available for IOPs, IOAs, and devices. Is the FRU an IOP, IOA, or device, and is there location information in the Location field of the display? No: Continue with the next step. Yes: Use the location information and return to the procedure that sent you here. This ends the procedure. 8. There is no location information in the Location field. Are there additional words displayed at the top of the screen? Figure 45. Format conversion Chapter 7. Common Service Procedures 603 v No: Go to “Hexadecimal Product Activity Log data” on page 617 in “Service functions” on page 652 to obtain the address information. Then return here and continue with the next step. v Yes: Use the additional words to find the address information: – If characters 7 and 8 of word two are 60: - The direct select address is word four in the format ?BBBBCcbb?. - The device address is word five in the format ?abcd efgh?.– If characters 7 and 8 of word two are 13, 17, 27, or 29: - The direct select address is word three in the format ?BBBBCcbb?. - The device address is word four in the format ?abcd efgh?.– If characters 7 and 8 of word two are 62: - The direct select address may be word seven in the format ?BBBBCcbb?. - The device address may be word six in the format ?abcd efgh?. Continue with the next step. 9. The BBBBCcbb format is the hexadecimal direct select address of a card location in the system unit. The format definition is: v BBBB—System Bus v Cc—System Card v bb—System Board Continue with the next step for IOA and device address information. 10. Unit (Device) Address Definitions are explained in an ?abcd efgh? format. For example, if the unit address is 0001 00FF, the address relates to the ?abcd efgh? format as follows: The characters ?abcd efgh? can have different meanings, depending on the subsystem that reports the problem. However, they define the information that is contained in the unit (device) address as follows. Table 50. Character definitions SCSI attached disk, tape, or optical a I/O adapter b I/O bus cd Controller ef Device gh FF SAS attached disk, tape, or optical Not Applicable I/O adapter ab I/O bus cd Controller ef Device gh FF Communications a I/O bus b Adapter 0 0 0 1 0 0 F F (a b c d e f g h) Figure 46. Example of Unit (Device) Address definitions 604 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 50. Character definitions (continued) SCSI attached disk, tape, or optical c 0 d Port e 0 f Channel GH FF Workstation a I/O Bus b Adapter cd Port e 0 f Device g 0 h Session 11. i5/OS communication-diagnostic panels that are displayed by the commands VFYCMN, WRKPRB, and ANZPRB use the following address format: x1 x2 x3x4 Table 51. Decimal address values Value Description x1 = System bus The one, two, or three digit decimal representation of the hexadecimal Bus value of the Bus-Bus-Bus-Bus/Card-Card/Board-Board format Address at the top of Figure 45 on page 603. x2 = System board The one or two digit decimal representation of the hexadecimal Board value of the Bus-Bus-Bus-Bus/Card-Card/Board-Board format Address at the top of Figure 45 on page 603. x3 = System card The one or two digit decimal representation of the hexadecimal Card value of the Bus-Bus-Bus-Bus/Card-Card/Board-Board format Address at the top of Figure 45 on page 603. x4 = Adapter location The one or two digit decimal representation of the hexadecimal b Adapter value of the Communications unit address value in step 9 on page 604 (See step 10 on page 604). 12. Return to the procedure that sent you here. This ends the procedure.Note: See “Hardware SRC formats” for more formatting information.Hardware SRC formats SRC format examples. Notes: 1. Other formats exist; if necessary contact your next level of support. 2. For more information on formats 60–63, see LIC-Detected Problem Reference Code Formats. Chapter 7. Common Service Procedures 605 Table 52. SRC format examples SRC Word Number Panel function, digits x to y (from left to right) IOP SRC format 13 IOP SRC format 17 Device SRC format 27 IOP detected a failure format 29 LIC SRC Format 60 LIC SRC format 61 LIC SRC format 62 LIC SRC format 63 1 11, 1-8 TTTT RRRR TTTT RRRR tttt rrrr B0XX RRRR A6xx RRRR B6xx RRRR B6xx RRrr or B7xx Rrrr A600 RRRR 2 12, 1-8 MIGV EP13 MIGV EP17 MIGV EP27 MIGV EP29 MIGV EP60 MIGV EP61 MIGV EP62 MIGV EP63 3 12, 9-16 BBBB Ccbb BBBB Ccbb BBBB Ccbb BBBB Ccbb PPPP 0000 cccc cccc cccc cccc AAAA BBBB 4 12, 17-24 aaaa aaaa aaaa aaaa aaaa aaaa SAPP UUFF BBBB Ccbb pppp pppp pppp pppp CCCC DDDD 5 12, 25-32 TTTT LMMM TTTT LMMM tttt lmmm TTTT FMMV aaaa aaaa qqqq qqqq qqqq qqqq EEEE FFFF 6 13, 1-8 uuuu uuuu tttt lmmm TTTT LMMM tttt lmmm tttt mmmm qqqq qqqq qqqq qqqq GGGG HHHH 7 13, 9-16 uuuu uuuu ZZZZ rrrr ZZZZ RRRR zzzz rrrr ssss ssss 0000 0000 BBBB Ccbb JJJJ KKKK 8 13, 17-24 uuuu uuuu ssss ssss ssss ssss 0sss ssss NNNN 0000 0000 0000 TTTT MMMM TTTT 0MMM 9 13, 25-32 uuuu uuuu uuuu uuuu uuuu uuuu uuuu uuuu 0000 0000 0000 0000 0000 0000 SSSS SSSS Table 53. SRC format examples (continued) SRC format Description AAAA Type of I/O Bus (see “Logical address format” on page 607 — field ’A’). aaaa Unit address. BBBB System Bus Number (see “Logical address format” on page 607 — field ’B’). BBBBCcbb Bus, bus, bus, bus, card, card, and board, board address (Direct Select Address or DSA). v For PCI system buses 0001-0003 and 0018-FFFF the Cc field is a one digit multi-adapter bridge number and a one digit multi-adapter bridge function number. v For SPD system buses 0004-0017 the Cc field is a two digit card number. CCCC System Board Number (see “Logical address format” on page 607 — field ’C’). cccc cccc Component reference code. Format 62 varies by component. DDDD System Card Number (see “Logical address format” on page 607 — field ’D’). EEEE (see “Logical address format” on page 607 — Unit Address type — field ’E’). FFFF (see “Logical address format” on page 607 — Unit Address type — field ’F’). GGGG (see “Logical address format” on page 607 — Unit Address type — field ’G’). HHHH (see “Logical address format” on page 607 — Unit Address type — field ’H’). JJJJ (see “Logical address format” on page 607 — Unit Address type — field ’J’). KKKK (see “Logical address format” on page 607 — Unit Address type — field ’K’). LMMM Level indicator plus 3? digit model number (for example, 0001). 606 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 53. SRC format examples (continued) (continued) SRC format Description lmmm Level indicator plus 3? digit model number of failing device. 0MMM Model number of failing device (i.e. MMM=’001’). MIGV EP General system status. NNNN Number of disk device facilities that are missing. PPPP Cause code. pppp pppp Programming reference code. qqqq qqqq Programming reference code qualifier. RRRR Unit reference code (URC). rrrr Outboard failing unit reference code (URC). SAPP UUFF Unit address. SSSSSSSS System Reference Code — this SRC is used to determine why the primary console failed to respond. ssss Serial number of failing unit. TTTT Type number or card identification number (hex). tttt Outboard failing unit type number (i.e. 6607). uuuu Unit-specific data. ww SRC type. ZZZZ Reserved. “Logical address format”Use the logical address to identify the resource entry.Logical address format Use the logical address to identify the resource entry. Use the logical address to identify the resource entry. To sort by logical address, select the Analyze log option on the Product Activity Log display. Then select the F9 key (Sort by...) and sort by logical address. For more information on the address, use the address information function. The logical address format, A/B/C/D-E/F/G/H/J/K, has the following definitions: A Type of I/O bus (transport) B System bus number C System board number (for busses that connect card enclosures) D System card (bus unit) number - Separates the bus address from the unit address E Unit address type FGHJK Unit address dataThe values of F, G, H, J, and K vary, depending on the unit address type (E). Use the following information to determine the Unit address data (FGHJK) representation. Chapter 7. Common Service Procedures 607 Table 54. Unit address (E) definitions Unit Address Value Unit Address Definition Unit Address Data E F G H J K 0 Reserved - - - - - 1 Communications I/O bus Adapter Port Channel - 2 Storage I/O adapter I/O bus Controller Device - 3 Workstation I/O bus Adapter Port Device Session 4 Auxiliary Processor I/O bus Auxiliary Processor Adapter Port - 5 Library I/O adapter I/O bus Library Controller Device 6 Cryptography I/O bus Adapter Device - - The Logical address field contains the bus address and the unit address. The bus address describes the hardware bus, board, and card information. The unit address describes the subsystem and identifying data (see Table 54). Note: To locate a device or card that is within the system unit, use the physical address information. 608 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Using the Service Action Log Use this procedure to search for an entry in the Service Action Log (SAL) that matches the time, reference code, or resource of the reported problem. 1. On the command line, enter the Start System Service Tools command (STRSST). If you cannot get to SST, use function 21 to get to DST. 2. On the Start Service Tools Sign On display, type in a user ID with QSRV authority and password. 3. Select Start a Service Tool ? Hardware Service Manager ? Work with service action log. 4. On the Select Timeframe display, change the From: Date and Time to a date and time prior to when the customer reported having the problem. 5. Search for an entry that matches one or more conditions of the problem: v Reference code v Resource v Time v Failing item list 6. Perform the following: v Choose Display the failing item information to display the SAL entry. v Use the Display details option to display part location information. All new entries in the SAL represent problems that require a service action. It may be necessary to handle any problem in the log even if it does not match the original problem symptom. The information displayed in the date and time fields are the date and time for the first occurrence of the specific reference code for the resource displayed during the time range selected. 7. Did you find an entry in the SAL? v Yes: Continue with the next step. v No: Go to “Problems with noncritical resources” on page 610. This ends the procedure. 8. Does ?See the service information system reference code tables for further problem isolation? appear near the top of the display or are there procedures in the FRU list? v Yes: Perform the following steps: a. Go to the list of reference codes and use the reference code that is indicated in the log to find the correct reference code table and unit reference code. b. Perform all actions in the Description/Action column before exchanging failing items. Note: When exchanging failing items, the part numbers and locations found in the SAL entry should be used.This ends the procedure.v No: Display the failing item information for the SAL entry. Items at the top of the failing item list are more likely to fix the problem than items at the bottom of the list. Continue with the next step. Notes: a. Some failing items are required to be exchanged in groups until the problem is solved. b. Other failing items are flagged as mandatory exchange and must be exchanged before the service action is complete, even if the problem appears to have been repaired. c. Use the ?Part Action Code? field in the SAL display to determine if failing items are to be replaced in groups or as mandatory exchanges. d. Unless the ?Part Action Code? of a FRU indicates group or mandatory exchange, exchange the failing items one at a time until the problem is repaired. Use the help function to determine the meaning of Part Action Codes. 9. Perform the following steps to help resolve the problem: Chapter 7. Common Service Procedures 609 a. To display location information, choose the function key for Additional details. If location information is available, go to Chapter 3, “Locating FRUs,” on page 335 for the model you are working on to determine what removal and replacement procedure to perform. To turn on the failing item’s identify light, use the ?indicator on? option.Note: In some cases where the failing item does not contain a physical identify light, a higher level identify light will be activated (for example, the planar or unit containing the failing item). The location information should then be used to locate the actual failing item. b. If the failing item is Licensed Internal Code, contact your next level of support for the correct PTF to apply.10. After exchanging an item, perform the following: a. Go to Chapter 5, “Verifying a repair,” on page 557 and return here. b. If the failing item indicator was turned on during the removal and replacement procedure, use the ?indicator off? option to turn off the indicator. c. If all problems have been resolved for the partition, use the ?Acknowledge all errors? function at the bottom of the SAL display. d. Close the log entry by selecting Close a NEW entry on the SAL Report display. This ends the procedure. “Problems with noncritical resources”Use this procedure to help you determine the cause of problems with noncritical resources. “System Service Tools (SST)” on page 611 “Using the product activity log” on page 617This procedure can help you learn how to use the Product Activity Log (PAL). “Hexadecimal Product Activity Log data” on page 617Use the Display Hexadecimal Report Function or follow the steps below to display hexadecimal error log dump data. “More information from hexadecimal reports” on page 620You can get additional information for IOP 90xx reference code product activity log entries at DST by using a D-IPL. “Reclaiming IOP cache storage” on page 636 “Working with Storage Dumps” on page 636 “Dedicated Service Tools (DST)” on page 637 “Printing the System Configuration List” on page 643 “Hardware Service Manager” on page 644 “Managing the Advanced System Management Interface (ASMI)” on page 645Understand how to manage the Advanced System Management Interface (ASMI).Problems with noncritical resources Use this procedure to help you determine the cause of problems with noncritical resources. 1. Is there an SRC in an 8-character format available on the Problem Summary form? Note: If the operator has not filled out the problem summary form, go to the problem reporting procedure for the operating system in use. No: Continue with the next step. Yes: Perform problem analysis using the SRC.This ends the procedure.2. Does the problem involve a workstation resource? v No: Continue with the next step. v Yes: Perform the following steps: – Check that the workstation is operational. 610 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) – Verify that the cabling and addressing for the workstation is correct. – Perform any actions indicated in the system operator message. If you need further assistance, contact your next level of support. This ends the procedure.3. Does the problem involve a removable media resource? No: Continue with the next step. Yes: Go to “Using the product activity log” on page 617 to resolve the problem. This ends the procedure.4. Does the problem involve a communications resource? v No: Contact your next level of support. This ends the procedure. v Yes: Are there any system operator messages that indicate a communications-related problem has occurred? – No: Contact your next level of support. This ends the procedure. – Yes: Perform any actions indicated in the system operator message. If you need further assistance, contact your next level of support. This ends the procedure.System Service Tools (SST) The system service tools (SST) run one or more Licensed Internal Code (LIC) or hardware service functions under the control of the operating system. SST allows you to perform service functions concurrently with the customer’s application programs. The printer that is used for output must be an SCS-type data stream printer that is attached to the same workstation input-output processor (IOP) or storage media unit. “Work with communications trace”The Work with communications trace option is available only under System Service Tools (SST). Select this option from the Start a Service Tool display.Work with communications trace The Work with communications trace option is available only under System Service Tools (SST). Select this option from the Start a Service Tool display. This option allows you to start or stop a communications line trace on a configuration object. Once you run the trace, the data can be formatted. You can view the formatted data by printing it. You should use communications trace for the following situations: v To isolate errors that you cannot isolate using the communications verification procedure v To collect more data when the problem analysis procedures do not provide enough information on the problem v If you suspect a communications protocol violation problem or some other line problem v If you suspect line noise v When the error messages indicate that there is an SNA BIND problemRunning and interpreting the communications trace requires detailed knowledge of communications protocols. To obtain the most accurate sample of your line status, whenever possible, start the communications trace before varying on the lines . Note: Online help from the Work with Communications Trace display allows you to view a list of protocols. Chapter 7. Common Service Procedures 611 Options and Function Keys The Work with Communications Traces display has the following options and function keys: v Start trace (F6) This function key allows you to start tracing the data on a communications configuration object. The Start Trace display appears after you press this function key. For more information, see “Starting a communications trace” on page 614. v Stop trace (option 2) This option appears only on the Work with Communications Traces display. It allows you to end the trace and stop collecting data. A trace must be stopped before you can format, print, or delete the data. To ensure that the trace is stopped, press the Refresh function key and check the trace status field. v Delete trace (option 4) The trace must be stopped (option 2) before you can use this option. Select this option to delete the trace information when you no longer need the data. Traces are not automatically deleted when you leave a communications trace function. Delete trace releases system space so that you can start other traces without having to increase the maximum amount of storage provided for the communications trace. Traces that are not deleted are listed on the Work with Communications Traces display. v Format and print trace (option 6) Use this option to format and print the trace. For more information on the Format and print trace option, see page “Format and print trace” on page 615. v Display message (option 7) Use this option to view a message that indicates the status of the trace that stopped because of an error. v Restart trace (option 8) Use this option to start a trace that has been stopped. v Change size (F10) Use this function key to change the amount of storage that all traces use. You might want to increase the amount of storage, if your system has many active traces or traces that will be active for a long time. v Display buffer size or display trace status (F11) This function key allows you to select and alternate between buffer size and trace status. v Refresh (F5) This function key updates the data on the display.Status conditions After the Work with communications trace option is selected from the Start a Service Tool display, the Work with Communications Traces display appears. 612 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Details of the trace, including status, are displayed. The trace status can be one of the following: Condition Description Waiting The trace is waiting for the configuration object to be varied on (not collecting data). Active The trace data is being collected. Stopping The trace is stopping. Stopped The trace has stopped (not collecting data). Error An error occurred on the configuration object while the trace was collecting data. The data might or might not be collected. The trace stopped. Starting The trace is being started by another user. You cannot stop or delete this trace. Formatting Trace is being formatted. Work with Communications Traces Type choice, press Enter. 2=Stop trace 4=Delete trace 6=Format and print trace 7=Display message 8=Restart trace Configuration Opt Object Type Trace Description Protocol Trace Status _ LosAngeles Line Test LosAngeles Line SDLC Active _ Mpls NWI Test Mpls ISDN Stopped _ Tucson Line Test Tucson Line ASYNC Waiting F3=Exit F5=Refresh F6=Start trace F10=Change size F11=Display buffer size F12=Cancel Figure 47. Example Work with Communications Trace display Chapter 7. Common Service Procedures 613 Starting a communications trace The following is an example of a Start Trace display: Start Trace Configuration object . . . . . . . . . _________ Type . . . . . . . . . . . . . . . . . 1 1=Line 2=Network interface 3=Network server Trace description . . . . . . . . . . ____________________ Buffer size (in kilobytes) . . . . . . 1 1=128, 2=256, 3=2M, 4=4M, 5=6M, 6=8M, 7=16M, 8=32M, 9=64M Stop on buffer full . . . . . . . . . N Y=Yes, N=No Data direction . . . . . . . . . . . 3 1=Sent, 2=Received, 3=Both Number of bytes to trace Beginning bytes . . . . . . . . . . *CALC VALUE, *CALC Ending bytes . . . . . . . . . . . *CALC VALUE, *CALC Type choices, press Enter. F3=Exit F5=Refresh F12=Cancel Enter the name of a communications configuration object description in the Configuration object prompt. Describe the trace in the Trace description field. This field can help you identify the trace. You can select a buffer size to hold the communications data that the trace collects. The default buffer size is 1=128K bytes. Base the size of the buffer on the speed of the communications line and on the amount of time necessary to trace the data. For high speed lines or long periods of tracing, a larger buffer size is recommended. If you specify Yes in the field, the trace stops when the buffer is full. This option is useful for viewing the initial data that is coming across a line. If you specify No, the trace continues until you stop it. In this case, the data in the buffer will be written over each time the buffer is full. You can select the direction of data to be traced by specifying one of the three selections in the Data direction field: v Only data that the system is sending (option 1) v Only data that the system is receiving (option 2) v Both the data sent and received by the system (option 3)Note: If you specify option 1 (Sent) for lines that are in SDLC short hold mode, the trace does not include controller names. You can select how much data is traced in a frame of data. The value that is entered is the amount that is saved as part of the trace. The minimum value that is allowed for both the beginning and the end value is 36 bytes. Those 36-byte minimum values include the protocol header. The configuration object you are tracing determines the maximum value that is allowed. Notes: 1. The BSC protocol ignores the beginning and ending byte values. 2. The SDLC, high-level data link control (HDLC), X.25, frame relay, Ethernet, distributed data interface (DDI), ATM, and token-ring protocols ignore the ending byte value. 614 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 3. The minimum value that is allowed for the beginning byte value for local area network (LAN) protocols (including frame relay protocols and ATM protocols) is 72 bytes.When all required options are complete, press the Enter key. The Work with Communications Trace display appears. If the communications object being traced is in varied off status when you start the trace, the trace remains in waiting status until you vary the communications object on. If the object is varied on, the trace will immediately go to the active state, and the system will begin to collect trace data. The trace remains active until one of the following occurs: v You select the option to stop the trace on the Work with Communications Traces display. v You vary off the configuration object being traced. v The configuration object being traced has an error. v The trace buffer is full, and the option to stop on buffer full was selected.Note: You can exit the communications trace function to do other work while the trace is active. Traces are not automatically deleted when you exit a communications trace function. To return to the communications trace: 1. Select the Start a service tool option from the service tools (SST or DST) display. 2. Select the Work with communications traces option. 3. From the Work with Communications Traces display, you can check the status of the trace or select to stop the trace.Format and print trace You must select the Stop Trace option before you format or print the trace. The Format and print trace option allows you to select various formatting options and prepare the trace data for printing. The options vary for each protocol. The following is an example of the Format Trace Data display: Notes: Format Trace Data Configuration object . . . . . . TRNLINE Type . . . . . . . . . . . . . . LINE Type choices, press Enter. Controller . . . . . . . . . . *ALL *ALL, name Data representation . . . . . 3 1=ASCII, 2=EBCDIC, 3=*CALC Format RR, RNR commands . . . N Y=Yes, N=No Format Broadcast data . . . . Y Y=Yes, N=No Format MAC or SMT data only. . N Y=Yes, N=No Format UI data only . . . . . N Y=Yes, N=No Format TCP/IP data only . . . N Y=Yes, N=No F3=Exit F5=Refresh F12=Cancel Figure 48. Example Format Trace Data Display Chapter 7. Common Service Procedures 615 1. To view all the data associated with the trace, press the Enter key without changing to the defaults on the Format a Trace Data display . The information is not in any special format. 2. Select the option to format and print only the data that you want to see. 3. All options available for formatting depend on the protocol of the line that is being traced. Only valid choices appear for each protocol. 4. Not all combinations of options are valid for all protocols. Error messages appear, if you select an option that is not valid, . If an error message appears, you must change your selections. Use the F5 (Refresh) function key to restore all options to their default values. 5. For more information about these options, use online help .When the format of the trace data is complete, the output can be printed on the console printer (the printer that is attached to the same I/O processor as the console). Only SCS-type data streams are supported. The format of the trace data is: v An introduction page that contains, for example, configuration object, type, protocol, start and stop dates and times, trace options, and formatting options. v A help page, to assist in understanding the output. Help information for a specific protocol is given because the trace data differs for each communications protocol. v The formatted appears as follows: The width of the file is 132 characters. The data is in hexadecimal representation and either American National Standard Code for Information Interchange (ASCII) or EBCDIC character representation. The columns of the trace output common to all protocols are: Record number The number of the trace record. Shows if the record type is sent (S) or received (R). Notes: 1. The letter C in this column indicates that an X.21 short hold mode connection was cleared. 2. If an M appears, a modem change has occurred.Data length The amount of data, in decimal, that the record contains. Record status The protocol-dependent return code for the trace record. 00000000 is successful; no errors were found. Other return codes are listed in the functional specification for the protocol that is running or the port manager. Record Data Record Record Data Controller Number Number Poll/ Number S/R Length Status Timer Type Name/Number Command Sent Received Final ------ --- ------ -------- --------------- ------ ------------- ------- ------ -------- ----- 7 R 69 00000000 12:29:56.72963 EBCDIC ZSDLLC30 /01 XID ON Data . . . . . : 3245056150080000 0084C00000000000 01010B0000010900 00000007000E0DF4 *.&;...D{....................4* DADBCC4BDACBCDCE EFCGC5F31017F116 1101130011F9F4F0 F6F5F0F0F1F0F1F0 *ABC.ABCDEFG..1......94065001010* C4C6C44040 *DFD * 8 S 0 00000000 12:29:56.76081 EBCDIC ABCLLC30 /01 SNRM ON 9 R 0 00000000 12:29:56.78450 EBCDIC ABCLLC30 /01 UA ON 52 S 110 00000000 12:29:57.76210 EBCDIC ABCLLC30 /01 I 0 0 ON Data . . . . . : 2D00010200006B81 0031001307B0B051 3107878686870706 0200000000000000 *......,A..........GFFG..........* 944300000CDADBCC 4BEAEBCCD3D3C4F3 F02E000902E2D5C1 E2E5C3D4C7090300 *M....ABC.AABCLLD30....SNASVCMG...* 00BCF9CAD600000D 04DADBCC4BEAEBCC D3D3C4F3F00A1300 7B00BCF9CAD60000 *..9.O....ABC.ABCLLD30...#..9.O..* 000CDADBCC4BEAEB CCD3D3C4F4F0 *..ABC.ABCLLD40 * 53 R 110 00000000 12:29:57.80065 EBCDIC ABCLLC30 /01 I 0 1 ON Data . . . . . : 2F00010200006B81 0031001307B0B051 3107878686870706 0200000000000000 *......,A..........GFFG..........* 944300000CDADBCC 4BEAEBCCD3D3C4F4 F02E000902E2D5C1 E2E5C3D4C7090300 *M....ABC.ABCLLD40....SNASVCMG...* 00BCF9D56600000D 04DADBCC4BEAEBCC D3D3C4F4F00A1300 7B00BCF9D5660000 *..9N.....ABC.ABCLLD40...#..9N...* 000CDADBCC4BEAEB CCD3D3C4F3F0 *..ABC.ABCLLD30 * * * * * * E N D O F E X A M P L E P R I N T O U T * * * * * Figure 49. Example Formatted Trace Data Output for SDLC 616 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Record timer The time that each event occurs. Depending on the communications hardware that is being used, the record timer will be either: 1. A time of day value, HH:MM:SS.NNNNN (where H=hours, M=minutes, S=seconds, and N=subseconds), based on the system time when the trace was stopped. 2. A relative time in decimal seconds. This timer value provides the relative time between events.Data type Shows whether the traced data is printed in ASCII or EBCDIC character representation. If the character representation of the data is mostly periods, you might want to format the data again using the other option for data representation. Controller name/number Indicates which controller originated the frame or record. In some conditions, this data is not available, and the column remains blank.Note: The formatted trace output is not security protected. Customer passwords are shown going across the line. Using the product activity log This procedure can help you learn how to use the Product Activity Log (PAL). 1. To locate a problem, find an entry in the product activity log for the symptom you are seeing. a. On the command line, enter the Start System Service Tools command: STRSST If you cannot get to SST, select DST. Note: Do not IPL the system or partition to get to DST. b. On the Start Service Tools Sign On display, type in a User ID with service authority and password. c. From the System Service Tools display, select Start a Service Tool ? Product activity log ? Analyze log. d. On the Select Subsystem Data display, select the option to view All Logs. Note: You can change the From: and To: Dates and Times from the 24-hour default if the time that the customer reported having the problem was more than 24 hours ago. e. Use the defaults on the Select Analysis Report Options display by pressing the Enter key. f. Search the entries on the Log Analysis Report display. Note: For example, a 6380 Tape Unit error would be identified as follows: System Reference Code: 6380CC5F Class: Perm Resource Name: TAP012. Find an SRC from the product activity log that best matches the time and type of the problem the customer reported. Did you find an SRC that matches the time and type of problem the customer reported? Yes: Use the SRC information to correct the problem. This ends the procedure. No: Contact your next level of support. This ends the procedure.Hexadecimal Product Activity Log data Use the Display Hexadecimal Report Function or follow the steps below to display hexadecimal error log dump data. Chapter 7. Common Service Procedures 617 Perform the following: 1. Find an entry in the Product Activity Log for the symptom (SRC) you are using to locate the problem. a. Select System Service Tools (SST). If you cannot get to SST, select DST.Note: Do not IPL the system to get to DST. b. On the Service Tools display, select the Start a service tool option. c. Select the Product activity log option on the Start a Service Tool display. d. Select the Analyze log option on the Product Activity Log display. e. On the Select Subsystem Data display, select the option to view All Logs. Note: If the SRC you are using occurred more than 24 hours ago, change the “From:” and “To:” Dates and Times from the 24-hour default. f. Use the defaults on the Select Analysis Report Options display by pressing the Enter key. g. Find the entry in the Product Activity Log that matches that SRC.2. Select the Display Detail Report for Resource display. (See the example of this display below.) 3. Press F6 to display the hexadecimal report. The direct select address (DSA) is in the format BBBB-Cc-bb: v BBBB = hexadecimal offsets 4C and 4D v Cc = hexadecimal offset 51 v bb = hexadecimal offset 4F Display Detail Report for Resource Serial Resource Name Type Model Number Name STORAGE 6512 001 10-4294009 SI05 Log ID . . . . . . . . . : 06020132 Sequence . . . . . . . : 2248 Date . . . . . . . . . . : 05/20/95 Time . . . . . . . . . : 08:37:55 Reference code . . . . . : 3400 Secondary code . . . . : 00000000 Table ID . . . . . . . . : 65120001 IPL source/state . . . : B/7 Class . . . . . . . . . . : Permanent System Ref Code . . . . . : 65123400 I/O processor card detected device error Press Enter to continue. F3=Exit F6=Hexadecimal report F9=Address Information F10=Previous detail report F12=Cancel Figure 50. Example Hexadecimal Detail Report for Resource 618 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 4. Is offset 170 - 173 = 00000000? v Yes: Continue with step 5. v No: Continue with step 6 on page 620.5. Page forward until you locate the offset location X’000180’. If the rightmost hex digit of offset 000182 is 8, then the unit address is hexadecimal offset 18C through 18F. If the rightmost hex digit of offset 000182 is 2, then the unit address is hexadecimal offset 192 through 195 Record this address information. Return to the procedure that sent you here, if one. Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6512 001 10-4294009 SI05 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000000 C5D3F0F4 00000160 00E00180 77B7695E EL04...-.......; 000010 E8A70000 000008C8 20E00602 01320000 Y......H........ 000020 34000000 11000000 00000000 009C0000 ................ 000030 F6F5F1F2 F0F0F140 F1F060F4 F2F9F4F0 6512001 10-42940 000040 F0F9F6F5 F1F20000 00000001 00060000 096512.......... 000050 00020000 FFFFFFFF FFFFFFFF FFFF0000 ................ 000060 00000000 00000000 00000000 F4404040 ............4 000070 40000000 00000000 00000000 00000000 ............... 000080 00000000 00000000 00000000 00000000 ................ 000090 00000000 00000000 00000000 00000000 ................ 0000A0 00000000 00000000 00000000 00000000 ................ 0000B0 00000000 00000000 40404040 40404040 ........ More... Press Enter to continue. F3=Exit F12=Cancel Figure 51. Example Hexadecimal Report for ResourceDisplay Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6512 001 10-4294009 SI05 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 D0007800 65120001 34000620 1100FFFF }............... 000190 FFFFFFFF 31090000 00001100 03C00000 .............{.. 0001A0 00000000 00000000 00000000 00000000 ................ 0001B0 00000000 00000000 00000000 00000000 ................ 0001C0 00000000 00000000 00000000 E0000008 ................ 0001D0 00000000 00000000 00000000 00000000 ................ 0001E0 00000018 00000002 E00401F0 10002000 ...........0.... 0001F0 00000000 40000000 00000000 000002CC .... ........... 000200 CCCC0000 00000000 00800000 00000000 ................ 000210 08100000 00000000 F0200040 00800000 ........0.. .... 000220 00000000 04000000 08000040 E000E221 ........... ..S. 000230 800084F3 00000002 0000001B 110003C0 ...3...........{ More... Press Enter to continue. F3=Exit F12=Cancel Figure 52. Hexadecimal Report for Resource Offset Location Example (where the rightmost hex digit of offset 000182 is 8) Chapter 7. Common Service Procedures 619 6. Using hexadecimal arithmetic, add the value in offsets 170 - 173, the value in offsets 174 - 177, and 00000002. For example, 000001C8 + 00000410 + 00000002 = 000005DA If the value of the rightmost hex digit at this offset is 8: Using hexadecimal arithmetic, add 0000000A to the offset you calculated. This will be the location of the unit address in the form abcdefgh. For example, 00005DA + 0000000A = 000005E4 If the value of the rightmost hex digit at this offset is 2: Using hexadecimal arithmetic, add 00000010 to the offset you calculated. This will be the location of the unit address in the form abcdefgh. For example, 00005DA + 00000010 = 000005EA Record this address information. If you were sent here by another procedure, return to the procedure that sent you here. This ends the procedure.More information from hexadecimal reports You can get additional information for IOP 90xx reference code product activity log entries at DST by using a D-IPL. Note: The term I/O processor may indicate an IOP or an IOA. Below are examples of how to interpret hexadecimal information for a D-IPL. Note: Formatting is available (F4=Additional Information) for IPL from disk (A- or B-IPL). Formatting is not available in DST from a D-IPL. Use the information below and the example displays on the following pages to interpret the hexadecimal dump information that begins at offset hex 000180. If the rightmost hex digit of the offset 000182 is 8, get the format from offset 00091F, then use the following table to determine what figure to use. Table 55. Format number to figure usage Format number Reference the following figures 01 Figure 54 on page 622 Figure 55 on page 623 Figure 56 on page 623 Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 571A 001 10-4294009 DC01 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 F8407200 F5F7F1C1 F0F0F0F1 3400031D 8 ..571A0001.... 000190 00233104 00FFFFF4 F0F0F0F0 F0F0F0F0 .......400000000 0001A0 1104E255 F0F0F0F0 F0F0F0F0 40404040 ..S.00000000 0001B0 40404000 00FF0000 04440000 A1D0038D ..........}.. 0001C0 0713000C 00000000 00000000 70000200 ................ More... Press Enter to continue. F3=Exit F12=Cancel Figure 53. Hexadecimal Report for Resource Offset Location Example (where the rightmost hex digit of offset 000182 is 2) 620 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 55. Format number to figure usage (continued) Format number Reference the following figures 02 Figure 57 on page 624 Figure 58 on page 625 Figure 59 on page 625 03 Figure 60 on page 626 Figure 61 on page 626 Figure 62 on page 626 04 Figure 63 on page 627 Figure 64 on page 628 Figure 65 on page 628 If the rightmost hex digit of offset 000182 is 2, get the format number from offset 0001B5, then use the following table to determine what table to use. Table 56. Format number Reference the following tables 01 Table 57 on page 628 02 Table 58 on page 629 03 Table 59 on page 630 04 Table 60 on page 631 11 Table 61 on page 631 12 Table 62 on page 632 13 Table 63 on page 632 14 Table 64 on page 633 17 Table 65 on page 635 20 Table 66 on page 636 Chapter 7. Common Service Procedures 621 Device formatting log example (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6607 070 00-68753 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 F8407800 65320001 90920210 010400FF 8 .......k...... 000190 66070000 00001306 87531400 123D0001 ........g....... 0001A0 00000000 00000000 CA000003 00000000 ................ 0001B0 65320001 07050011 00000000 00000000 ................ 0001C0 00000004 00000003 00000000 00000000 ................ 0001D0 F6F6F0F7 13068753 010400FF 00000000 6607..g......... 0001E0 00000000 65320001 07050011 47000001 ................ 0001F0 07069205 C0004000 00000021 00000030 ..k.{. ......... 000200 00000000 00000000 F6F6F0F7 00071575 ........6607.... 000210 010300FF 00000000 00000000 65320001 ................ 000220 07050011 47000001 07069205 C0004000 ..........k.{. . 000230 00000000 00000000 00000000 00000000 ................ 000240 F6F6F0F7 00259DE8 010200FF 00000000 6607...Y........ 000250 00000000 65320001 07050011 47000001 ................ 000260 07069205 C0404040 00000000 00000000 ..k.{ ........ 000270 00000000 00000000 E2E5C3C4 D6C3E2C3 ........SVCDOCSC 000280 D7D7F2F9 C1F800D5 00000000 00000000 PP29A8.N........ 000290 00000000 00000000 00000000 00000000 ................ 0002A0 00000000 00000000 00000000 00000000 ................ 0002B0 00000000 00000000 00000000 00000000 ................ 0002C0 00000000 00000000 00000000 00000000 ................ 0002D0 00000000 00000000 C3D7D7F8 F9F2F9D5 ........CPP8929N 0002E0 F6F5F3F2 F9F0F9F2 F0F0F0F0 F0F0F0F0 6532909200000000 0002F0 F0F0F0F0 F0F1F6C3 F0F1F3F3 0000D5D5 0000016C0133..NN 000300 00000000 00000000 00000000 00000000 ................ Figure 54. Example Hexadecimal Report, Device Formatting Log (unformatted information) 622 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Device formatting log template Device formatting log example (as formatted information for A or B IPL) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6607 070 00-68753 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------01 0001A0 -------- -------- -------- -------- 0001B0 -------- -------- -------- -------- 0001C0 # errors # errors -------- -------- detected logged -------- -------- 0001D0 Device1 Device1 Device1 New Dev1 Type Serial UnitAddr Type 0001E0 New Dev1 IOP type IOP Cache Serial & model Serial Type/Mod 0001F0 Cache -------- -------- -------- Serial -------- -------- -------- 000200 -------- -------- Device2 Device2 -------- -------- Type Serial 000210 Device2 New Dev2 New Dev2 IOP type UnitAddr Type Serial & model 000220 IOP Cache Cache -------- Serial Type/Mod Serial -------- 000230 -------- -------- -------- -------- -------- -------- -------- -------- 000240 Device3 Device3 Device3 New Dev3 Type Serial UnitAddr Type 000250 New Dev3 IOP type IOP Cache Serial & model Serial Type/Mod 000260 Cache -------- -------- -------- Serial -------- -------- -------- Figure 55. Example Hexadecimal Report, Device Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 6607 070 00-68753 Device Errors detected . . . . . . . : 4 Device Errors logged . . . . . . . . : 3 Device . . . . . . . . . : 1 2 3 Unit Address . . . . : 010400FF 010300FF 010200FF Type . . . . . . . . : 6607 6607 6607 Serial Number . . . : 13068753 00071575 00259DE8 New Device Information: Type . . . . . . . . : **** **** **** Serial Number . . . : 00000000 00000000 00000000 I/O Processor Information: Type . . . . . . . . : 6532 6532 6532 Serial Number . . . : 07050011 07050011 07050011 Cache Adaptor Card Information: Type . . . . . . . . : 4700 4700 4700 Serial Number . . . : 07069205 07069205 07069205 Figure 56. Example Hexadecimal Report, Device Formatting (As formatted information for A or B IPL) Chapter 7. Common Service Procedures 623 Array member formatting log example (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 F4407800 65320001 90210210 0FFFFFFF 4 .............. 000190 00000000 00000000 00001400 12400002 ............. .. 0001A0 00000000 00000000 CA000003 00000000 ................ 0001B0 65320001 07050011 00000000 00000000 ................ 0001C0 F6F6F0F7 0025ED09 000200FF 000200FF 6607............ 0001D0 F6F6F0F7 00158135 000100FF 000100FF 6607..a......... 0001E0 F6F6F0F7 00259DE8 010200FF 00000000 6607...Y........ 0001F0 F6F6F0F7 00008326 010100FF 00000000 6607..c......... 000200 F6F6F0F7 00071575 010300FF 00000000 6607............ 000210 F6F6F0F7 13068753 010400FF 00000000 6607..g......... 000220 00000000 00000000 00000000 00000000 ................ 000230 00000000 00000000 00000000 00000000 ................ 000240 00000000 00000000 00000000 00000000 ................ 000250 00000000 00000000 00000000 00000000 ................ 000260 000000FF 00000004 00000000 00000000 ................ 000270 00000000 E2E5C3C4 D6C3E2C3 D7D7F2F9 ....SVCDOCSCPP29 000280 C1F800D5 00000000 00000000 00000000 A8.N............ 000290 00000000 00000000 00000000 00000000 ................ 0002A0 00000000 00000000 00000000 00000000 ................ 0002B0 00000000 00000000 00000000 00000000 ................ 0002C0 00000000 00000000 00000000 00000000 ................ 0002D0 00000000 C3D7D7C5 C1F0F2D5 F6F5F3F2 ....CPPEA02N6532 0002E0 F9F0F2F1 F0F0F0F0 F0F0F0F0 F0F0F0F0 9021000000000000 0002F0 F0F0C4F4 F0F1F3F3 0000D5D5 00000000 00D40133..NN.... 000300 00000000 00000000 00000000 00000000 ................ Figure 57. Example Hexadecimal Report, Array Member Formatting Log (unformatted information) 624 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Array member formatting log template Array member formatting log example (as formatted information for A or B IPL) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------02 0001A0 -------- -------- -------- -------- 0001B0 -------- -------- -------- -------- 0001C0 Member0 Member0 Expected Current Type Serial Address Address 0001D0 Member1 Member1 Expected Current Type Serial Address Address 0001E0 Member2 Member2 Expected Current Type Serial Address Address 0001F0 Member3 Member3 Expected Current Type Serial Address Address 000200 Member4 Member4 Expected Current Type Serial Address Address 000210 Member5 Member5 Expected Current Type Serial Address Address 000220 Member6 Member6 Expected Current Type Serial Address Address 000230 Member7 Member7 Expected Current Type Serial Address Address 000240 Member8 Member8 Expected Current Type Serial Address Address 000250 Member9 Member9 Expected Current Type Serial Address Address Figure 58. Example Hexadecimal Report, Array Member Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Array Member . . . . . . : 0 1 2 3 4 Type . . . . . . . . : 6607 6607 6607 6607 6607 Serial Number . . . : 0025ED09 00158135 00259DE8 00008326 00071575 Unit Address: Current . . . . . . : 000200FF 000100FF 00000000 00000000 00000000 Expected . . . . . . : 000200FF 000100FF 010200FF 010100FF 010300FF Array Member . . . . . . : 5 6 7 8 9 Type . . . . . . . . : 6607 **** **** **** **** Serial Number . . . : 13068753 00000000 00000000 00000000 00000000 Unit Address: Current . . . . . . : 00000000 00000000 00000000 00000000 00000000 Expected . . . . . . : 010400FF 00000000 00000000 00000000 00000000 Figure 59. Example Hexadecimal Report, Array Member Formatting (As formatted information for A or B IPL) Chapter 7. Common Service Procedures 625 Configuration formatting log example (unformatted information) Configuration formatting log template Configuration formatting log example (as formatted information for A or B IPL) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 50407800 65320001 90120210 0FFFFFFF & .............. 000190 00000000 00000000 00001400 123C0003 ................ 0001A0 00000000 00000000 CA000003 00000000 ................ 0001B0 65320001 07050011 00000000 00000000 ................ 0001C0 00000000 00000000 47000001 07069205 ..............k. 0001D0 F8F6C7F8 F1F4F9C3 D7D7F8F9 F8F100D5 86G8149CPP8981.N 0001E0 00000000 00000000 00000000 00000000 ................ 0001F0 00000000 00000000 00000000 00000000 ................ 000200 00000000 00000000 00000000 00000000 ................ 000210 00000000 00000000 00000000 00000000 ................ 000220 00000000 00000000 00000000 00000000 ................ 000230 C3D7D7C5 C1F0F2D5 F6F5F3F2 F9F0F1F2 CPPEA02N65329012 000240 F0F0F0F0 F0F0F0F0 F0F0F0F0 F0F0C4F4 00000000000000D4 000250 F0F1F3F4 0000E8D5 00000000 00000000 0134..YN........ Figure 60. Example Hexadecimal Report, Configuration Formatting log (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------03 0001A0 -------- -------- -------- -------- 0001B0 Current Current Current Current IOP type IOP Cache Cache & model serial type serial 0001C0 Expected Expected Expected Expected IOP type IOP Cache Cache & model serial type serial Figure 61. Example Hexadecimal Report, Configuration Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Configuration . . . . . . : Current Expected I/O Processor Information: Type . . . . . . . . : 6532 0000 Serial Number . . . : 07050011 00000000 Cache Adaptor Card Information: Type . . . . . . . . : 0000 4700 Serial Number . . . : 00000000 07069205 Figure 62. Example Hexadecimal Report, Configuration Formatting Log (As formatted information for A or B IPL) 626 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Array addendum formatting log example (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 2757 001 10-0322005 DC03 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000000 C5D3F0F5 000000A0 00E00180 82745D0F EL05........b.). 000010 1FD78000 00002CE0 20100084 049F0000 .P.........d.... 000020 902F0000 11000000 00000000 00C70000 .............G.. 000030 F2F7F5F7 F0F0F140 F1F060F0 F3F2F2F0 2757001 10-03220 000040 F0F50000 0000F2F7 F5F70002 00020000 05....2757...... 000050 00200000 0002FFFF FFFFFFFF 0004FFFF ................ 000060 00000000 00000000 00000000 40404040 ............ 000070 40404040 40400000 00000000 00000000 .......... 000080 00000000 00000000 00000000 00000000 ................ 000090 00000000 00000000 00000000 00000000 ................ 0000A0 00000000 00000000 00000000 00000000 ................ 0000B0 00000000 00000000 40404040 40404040 ........ 0000C0 40404040 40404040 40404040 40404040 0000D0 40404040 40404040 40404040 40404040 0000E0 F2F7F5F7 F0F0F0F1 0000E2D3 C9C3C9D6 27570001..SLICIO 0000F0 4040E5F5 D9F2D4F0 40400000 00000000 V5R2M0 ...... 000100 00B90000 00000000 000000C7 00000000 ...........G.... 000110 00000000 00000000 00000000 00000000 ................ 000120 00000000 00000000 00000000 00000000 ................ 000130 00000000 00000000 00000000 00000000 ................ 000140 C4C3F0F3 40404040 40404040 40404040 DC03 000150 D5C5C5C4 E5C1D3C9 C4D740D5 00070001 NEEDVALIDP N.... 000160 00000000 A0200007 00000000 00000000 ................ 000170 00000000 00000000 0003001E 000000A0 ................ 000180 A0407800 27570001 902F0200 4FFFFFFF . ...*......|... 000190 00000000 00000000 00001400 623E0004 ................ 0001A0 F6F6F0F7 0026121D 410500FF 410500FF 6607............ 0001B0 F6F6F0F7 00248B85 410400FF 410400FF 6607...e........ 0001C0 F6F6F0F7 0014BE1F 410300FF 410300FF 6607............ 0001D0 F6F6F0F7 0014C0F0 410200FF 410200FF 6607..{0........ 0001E0 F6F6F0F7 001403C8 410100FF 410100FF 6607...H........ 0001F0 F6F6F0F7 0014BEA4 410800FF 410800FF 6607...u........ 000200 00000000 00000000 00000000 00000000 ................ 000210 00000000 00000000 00000000 00000000 ................ Figure 63. Example Hexadecimal Report, Array Addendum Formatting Log (unformatted information) Chapter 7. Common Service Procedures 627 Array addendum log template Array addendum log example (as formatted information for A or B IPL) Table 57. Format 01 Description Hex offset Device errors detected 0001D6 - 0001D9 Device errors logged 0001DA - 0001DD Device 1 type 0001E6 - 0001E9 Device 1 serial number 0001EA - 0001ED Device 1 unit address 0001EE - 0001F1 New device 1 type 0001F2 - 0001F5 Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 001 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------04 0001A0 Member10 Member10 Expected Current Type Serial Address Address 0001B0 Member11 Member11 Expected Current Type Serial Address Address 0001C0 Member12 member12 Expected Current Type Serial Address Address 0001D0 Member13 Member13 Expected Current Type Serial Address Address 0001E0 Member14 Member14 Expected Current Type Serial Address Address 0001F0 Member15 Member15 Expected Current Type Serial Address Address 000200 Member16 Member16 Expected Current Type Serial Address Address 000210 Member17 Member17 Expected Current Type Serial Address Address Figure 64. Example Hexadecimal Report, Array Addendum Formatting Log Template Display Additional Information for Resource Serial Resource Name Type Model Number Name STORAGE 2757 001 10-0322005 DC03 Array Member . . . . . . : 10 11 12 13 14 Type . . . . . . . . : 6607 6607 6607 6607 6607 Serial Number . . . : 0026121D 00248B85 0014BE1F 0014C0F0 001403C8 Unit Address: Current . . . . . . : 410500FF 410400FF 410300FF 410200FF 410100FF Expected . . . . . . : 410500FF 410400FF 410300FF 410200FF 410100FF Array Member . . . . . . : 15 16 17 Type . . . . . . . . : 6607 **** **** Serial Number . . . : 0014BEA4 00000000 00000000 Unit Address: Current . . . . . . : 410800FF 00000000 00000000 Expected . . . . . . : 410800FF 00000000 00000000 Figure 65. Example Hexadecimal Report, Array Addendum Formatting (as formatted information for A or B IPL) 628 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 57. Format 01 (continued) Description Hex offset New device 1 serial number 0001F6 - 0001F9 Device 1 IOA type and model 0001FA - 0001FD Device 1 IOA serial number 0001FE - 000201 Device 1 Cache type and model 000202 - 000205 Device 1 Cache serial number 000206 - 000209 Device 2 type 00021E - 000221 Device 2 serial number 000222 - 000225 Device 2 unit address 000226 - 000229 New device 2 type 00022A - 00022D New device 2 serial number 00022E - 000231 Device 2 IOA type and model 000232 - 000235 Device 2 IOA serial number 000236 - 000239 Device 2 cache type and model 00023A - 00023D Device 2 cache serial number 00023E - 000241 Device 3 type 000256 - 000259 Device 3 serial number 00025A - 00025D Device 3 unit address 00025E - 000261 New device 3 type 000262 - 000265 New device 3 serial number 000266 - 000269 Device 3 IOA type and model 00026A - 00026D Device 2 IOA serial number 00026E - 000271 Device 3 cache type and model 000272 - 000275 Device 3 cache serial number 000276 - 000279 Table 58. Format 02 Description Hex offset Array member 0 type 0001D6 - 0001D9 Array member 0 serial number 0001DA - 0001DD Array member 0 expected unit address 0001DE - 0001E1 Array member 0 current unit address 0001E2 - 0001E5 Array member 1 type 0001E6 - 0001E9 Array member 1 serial number 0001EA - 0001ED Array member 1 expected unit address 0001EE - 0001F1 Array member 1 current unit address 0001F2 - 0001F5 Array member 2 type 0001F6 - 0001F9 Array member 2 serial number 0001FA - 0001FD Array member 2 expected unit address 0001FE - 000201 Array member 2 current unit address 000202 - 000205 Array member 3 type 000206 - 000209 Array member 3 serial number 00020A - 00020D Chapter 7. Common Service Procedures 629 Table 58. Format 02 (continued) Description Hex offset Array member 3 expected unit address 00020E - 000211 Array member 3 current unit address 000212 - 000215 Array member 4 type 000216 - 000219 Array member 4 serial number 00021A - 00021D Array member 4 expected unit address 00021E - 000221 Array member 4 current unit address 000222 - 000225 Array member 5 type 000226 - 000229 Array member 5 serial number 00022A - 00022D Array member 5 expected unit address 00022E - 000231 Array member 5 current unit address 000232 - 000235 Array member 6 type 000236 - 000239 Array member 6 serial number 00023A - 00023D Array member 6 expected unit address 00023E - 000241 Array member 6 current unit address 000242 - 000245 Array member 7 type 000246 - 000249 Array member 7 serial number 00024A - 00024D Array member 7 expected unit address 00024E - 000251 Array member 7 current unit address 000252 - 000255 Array member 8 type 000256 - 000259 Array member 8 serial number 00025A - 00025D Array member 8 expected unit address 00025E - 000261 Array member 8 current unit address 000262 - 000265 Array member 9 type 000266 - 000269 Array member 9 serial number 00026A - 00026D Array member 9 expected unit address 00026E - 000271 Array member 9 current unit address 000272 - 000275 Table 59. Format 03 Description Hex offset Current I/O card type 0001C6 - 0001C7 Current I/O card serial number 0001CA - 0001CD Current cache type 0001CE - 0001CF Current cache serial number 0001D2 - 0001D5 Expected I/O card type 0001D6 - 0001D7 Expected I/O card serial number 0001DA - 0001DD Expected cache type 0001DE - 0001DF Expected cache serial number 0001E2 - 0001E5 630 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 60. Format 04 Description Hex offset Array member 10 type 0001B6 - 0001B9 Array member 10 serial number 0001BA - 0001BD Array member 10 expected unit address 0001BE - 0001C1 Array member 10 current unit address 0001C2 - 0001C5 Array member 11 type 0001C6 - 0001C9 Array member 11 serial number 0001CA - 0001CD Array member 11 expected unit address 0001CE - 0001D1 Array member 11 current unit address 0001D2 - 0001D5 Array member 12 type 0001D6 - 0001D9 Array member 12 serial number 0001DA - 0001DD Array member 12 expected unit address 0001DE - 0001E1 Array member 12 current unit address 0001E2 - 0001E5 Array member 13 type 0001E6 - 0001E9 Array member 13 serial number 0001EA - 0001ED Array member 13 expected unit address 0001EE - 0001F1 Array member 13 current unit address 0001F2 - 0001F5 Array member 14 type 0001F6 - 0001F9 Array member 14 serial number 0001FA - 0001FD Array member 14 expected unit address 0001FE - 000201 Array member 14 current unit address 000202 - 000205 Array member 15 type 000206 - 000209 Array member 15 serial number 00020A - 00020D Array member 15 expected unit address 00020E - 000211 Array member 15 current unit address 000212 - 000215 Array member 16 type 000216 - 000219 Array member 16 serial number 00021A - 00021D Array member 16 expected unit address 00021E - 000221 Array member 16 current unit address 000222 - 000225 Array member 17 type 000226 - 000229 Array member 17 serial number 00022A - 00022D Array member 17 expected unit address 00022E - 000231 Array member 17 current unit address 000232 - 000235 Table 61. Format 11 Description Hex offset SCSI sense data 0001C4 - 0001E3 Worldwide ID of device with problem 0001E4 - 001EB Unit address of device with problem 0001EC - 0001EF SCSI command descriptor block 0001F4 - 000203 Problem description 000204 - 000223 Chapter 7. Common Service Procedures 631 Table 62. Format 12 Description Hex offset Current I/O card type and model 0001C4 - 0001CB Current I/O card serial number 0001CC - 0001D3 Current I/O card worldwide ID 0001D4 - 0001DB Current cache type and model 0001DC - 0001E3 Current cache serial number 0001E4 - 0001EB Current cache worldwide ID 0001EC - 0001F3 Previous I/O card type and model 0001FC - 000203 Previous I/O card serial number 000204 - 00020B Previous I/O card worldwide ID 00020C - 000213 Previous cache type and model 000214 - 00021B Previous cache serial number 00021C - 000223 Previous cache worldwide ID 000224 - 00022B Table 63. Format 13 Description Hex offset Device errors detected 0001FC - 0001FF Device errors logged 000200 - 000203 Device 1 type 000204 - 000207 Device 1 serial number 000208 - 00020F Device 1 worldwide ID 000210 - 000217 Device 1 unit address 000218 - 00021B New device 1 type 00021C - 00021F New device 1 serial number 000220 - 000227 New device 1 worldwide ID 000228 - 00022F Device 1 IOA type and model 000230 - 000237 Device 1 IOA serial number 000238 - 00023F Device 1 IOA worldwide ID 000240 - 000247 Device 1 cache type and model 000248 - 00024F Device 1 cache serial number 000250 - 000257 Device 1 cache worldwide ID 000258 - 00025F Device 2 type 000260 - 000263 Device 2 serial number 000264 - 00026B Device 2 worldwide ID 00026C - 000273 Device 2 unit address 000274 - 000277 New device 2 type 000278 - 00027B New device 2 serial number 00027C - 000283 New device 2 worldwide ID 000284 - 00028B Device 2 IOA type and model 00028C - 000293 Device 2 IOA serial number 000294 - 00029B Device 2 IOA worldwide ID 00029C - 0002A3 632 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 63. Format 13 (continued) Description Hex offset Device 2 cache type and model 0002A4 - 0002AB Device 2 cache serial number 0002AC - 0002B3 Device 2 cache worldwide ID 0002B4 - 0002BB Device 3 type 0002BC - 0002BF Device 3 serial number 0002C0 - 0002C7 Device 3 worldwide ID 0002C8 - 0002CF Device 3 unit address 0002D0 - 0002D3 New device 3 type 0002D4 - 0002D7 New device 3 serial number 0002D8 - 0002DF New device 3 worldwide ID 0002E0 - 0002E7 Device 3 IOA type and model 0002E8 - 0002EF Device 3 IOA serial number 0002F0 - 0002F7 Device 3 IOA worldwide ID 0002F8 - 0002FF Device 3 cache type and model 000300 - 000307 Device 3 cache serial number 000308 - 00030F Device 3 cache worldwide ID 000310 - 000317 Table 64. Format 14 Description Hex offset RAID number 000204 - 00020B Array member 0 type 00020C - 00020F Array member 0 serial number 000210 - 000217 Array member 0 worldwide ID 000218 - 00021F Array member 0 expected unit address 000220 - 000223 Array member 0 current unit address 000224 - 000227 Array member 1 type 000228 - 00022B Array member 1 serial number 00022C - 000233 Array member 1 worldwide ID 000234 - 00023B Array member 1 expected unit address 00023C - 00023F Array member 1 current unit address 000240 - 000243 Array member 2 type 000244 - 000247 Array member 2 serial number 000248 - 00024F Array member 2 worldwide ID 000250 - 000257 Array member 2 expected unit address 000258 - 00025B Array member 2 current unit address 00025C - 00025F Array member 3 type 000260 - 000263 Array member 3 serial number 000264 - 00026B Array member 3 worldwide ID 00026C - 000273 Array member 3 expected unit address 000274 - 000277 Array member 3 current unit address 000278 - 00027B Chapter 7. Common Service Procedures 633 Table 64. Format 14 (continued) Description Hex offset Array member 4 type 00027C - 00027F Array member 4 serial number 000280 - 000287 Array member 4 worldwide ID 000288 - 00028F Array member 4 expected unit address 000290 - 000293 Array member 4 current unit address 000294 - 000297 Array member 5 type 000298 - 00029B Array member 5 serial number 00029C - 0002A3 Array member 5 worldwide ID 0002A4 - 0002AB Array member 5 expected unit address 0002AC - 0002AF Array member 5 current unit address 0002B0 - 0002B3 Array member 6 type 0002B4 - 0002B7 Array member 6 serial number 0002B8 - 0002BF Array member 6 worldwide ID 0002C0 - 0002C7 Array member 6 expected unit address 0002C8 - 0002CB Array member 6 current unit address 0002CC - 0002CF Array member 7 type 0002D0 - 0002D3 Array member 7 serial number 0002D4 - 0002DB Array member 7 worldwide ID 0002DC - 0002E3 Array member 7 expected unit address 0002E4 - 0002E7 Array member 7 current unit address 0002E8 - 0002EB Array member 8 type 0002EC - 0002EF Array member 8 serial number 0002F0 - 0002F7 Array member 8 worldwide ID 0002F8 - 0002FF Array member 8 expected unit address 000300 - 000303 Array member 8 current unit address 000304 - 000307 Array member 9 type 000308 - 00030B Array member 9 serial number 00030C - 000313 Array member 9 worldwide ID 000314 - 00031B Array member 9 expected unit address 00031C - 00031F Array member 9 current unit address 000320 - 000323 Array member 10 type 000324 - 000327 Array member 10 serial number 000328 - 00032F Array member 10 worldwide ID 000330 - 000337 Array member 10 expected unit address 000338 - 00033B Array member 10 current unit address 00033C - 00033F Array member 11 type 000340 - 000343 Array member 11 serial number 000344 - 00034B Array member 11 worldwide ID 00034C - 000353 Array member 11 expected unit address 000354 - 000357 Array member 11 current unit address 000358 - 00035B 634 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 64. Format 14 (continued) Description Hex offset Array member 12 type 00035C - 00035F Array member 12 serial number 000360 - 000367 Array member 12 worldwide ID 000368 - 00036F Array member 12 expected unit address 000370 - 000373 Array member 12 current unit address 000374 - 000377 Array member 13 type 000378 - 00037B Array member 13 serial number 00037C - 000383 Array member 13 worldwide ID 000384 - 00038B Array member 13 expected unit address 00038C - 00038F Array member 13 current unit address 000390 - 000393 Array member 14 type 000394 - 000397 Array member 14 serial number 000398 - 00039F Array member 14 worldwide ID 0003A0 - 0003A7 Array member 14 expected unit address 0003A8 - 0003AB Array member 14 current unit address 0003AC - 0003AF Array member 15 type 0003B0 - 0003B3 Array member 15 serial number 0003B4 - 0003BB Array member 15 worldwide ID 0003BC - 0003C3 Array member 15 expected unit address 0003C4 - 0003C7 Array member 15 current unit address 0003C8 - 0003CB Array member 16 type 0003CC - 0003CF Array member 16 serial number 0003D0 - 0003D7 Array member 16 worldwide ID 0003D8 - 0003DF Array member 16 expected unit address 0003E0 - 0003E3 Array member 16 current unit address 0003E4 - 0003E7 Array member 17 type 0003E8 - 0003EB Array member 17 serial number 0003EC - 0003F3 Array member 17 worldwide ID 0003F4 - 0003FB Array member 17 expected unit address 0003FC - 0003FF Array member 17 current unit address 000400 - 000403 Table 65. Format 17 Description Hex offset Associated IOA type and model 0001C4 - 0001CB Associated IOA serial number 0001CC - 0001D3 Associated IOA worldwide ID 0001D4 - 0001DB Type of adapter connection (1 = Dual storage IOA pair, 2 = Storage IOA to auxiliary cache) 0001DC - 0001DC Problem description 0001E0 - 00021F Chapter 7. Common Service Procedures 635 Table 66. Format 20 Description Hex offset Problem description 0001C4 - 000203 Path 1 adapter port 00020A Path active (00 = Unknown, 01 = Yes, 10 = No, 11 = Unknown) 00020D, bit 0 - 1 Path state (0000 = Unknown, 0001 = Operational, 0010 = Degraded, 0011 = Failed) 00020D, bit 4 -7 Node 1 port type (0000 = Empty, 0001 = Adapter, 0010 = Expander, 0011 = Device, 0100= LUN) 000210, bit 0 - 3 Node 1 status (0000 = Operational, 0001 = Degraded, 0010 = Failed, 0011 = Suspect, 0100 = Missing, 0101 = Not valid) 000210, bit 4 - 7 Node 1 phy 000212 Node 1 information (1000 = 1.5 Gbps, 1001 = 3.0 Gbps) 000213, bit 4 - 7 Path 1 Node 1 SAS address 000214 - 00021B Additional nodes are self defining Reclaiming IOP cache storage Use this procedure to reclaim IOP cache storage. 1. If you are not already using Dedicated Service Tools, perform an IPL to DST. If you cannot perform a type A or B IPL, perform a type D IPL from removable media. Did you get to DST with a Type D IPL? v No: Continue with the next step. v Yes: Perform the following: a. Select Work with disk units ? Reclaim IOP cache storage. b. Follow the on-line instructions to reclaim cache storage. The system will automatically perform an IPL to a Disk Configuration Attention Report or a Disk Configuration Warning Report display. Select the option to display the report. Because the Reclaim Cache Storage was selected after a D-IPL, the system will now require another Reclaim Cache Storage to be performed.2. Select Work with disk units ? Work with disk unit recovery ? Reclaim IOP Cache Storage. 3. Follow the on-line instructions to reclaim cache storage. Depending on the reason for performing the Reclaim IOP Cache procedure, there may be a message requiring the ASP be cleared before performing this option, or a message stating the number of lost sectors.Note: If there are lost sectors, some objects may be damaged. The customer may have to load the damaged objects again.When the IOP performs a Reset/Reload, if there is a storage IOA attached that is running the local load source, Cxxx xxxx codes will be displayed in the system control panel. The DST console may then have a blank screen with 01B displayed on the system control panel for up to 15 minutes before it is restored. This ends the procedure. Working with Storage Dumps A main storage dump (MSD) is a process of collecting data from the system’s main storage. For more information on main storage dumps, including POWER5 platform system dumps, see Performing a platform or main storage dump. A main storage dump can be performed in the following ways: 636 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Automatically - by the service processor as the result of a system failure. v Manually - by using control panel functions when the system waits, loops, or appears to have an operating system failure.Dedicated Service Tools (DST) Dedicated service tools (DST) is used to service Licensed Internal Code (LIC), work with disk units, work with configuration and resources, verify devices and communications, and display logs on the system. DST operates in stand-alone, limited, and full paging environments. The DST tools and functions vary depending on the paging environment and the release level of the operating system. Dedicated Service Tools requirementsNote: It is a requirement to change the password the first time anyone signs on to DST or SST. The authorized service provider needs to give the new (changed) password to the customer or have the customer change the password. To use DST, you must have the following items: v The disk that contains the Licensed Internal Code (the load-source disk). Note: When you perform an alternate initial program load (IPL) to DST, you do not need the load-source disk. v An operational workstation on bus 1 as either a primary console or an operations console. v One workstation for each system partition. v A valid service tools user ID and password to sign on to DST. IBM supplies service tools user IDs that have different levels of privileges.To make a printout, attach the printer to the workstation I/O processor or storage media unit that is performing the service function. The printer that is used with DST for service tool output must be an SCS-type data stream printer. “Accessing Dedicated Service Tools”Covers how to start Dedicated Service Tools. “Start a service tool” on page 642 “IPL type, mode, and speed options” on page 642Set the IPL options from function 02 on the control panel. To permanently change the IPL speed use the CHGIPLA command.Accessing Dedicated Service Tools Covers how to start Dedicated Service Tools. You can start dedicated service tools (DST) in several ways: v Performing an IPL to DST (see “Performing an IPL to DST” on page 640). v Pressing the System Request key while the system is operational. v Selecting function 21 while the system is operational (see “Selecting function 21 from the control panel” on page 638). This is available from the primary partition only. v Performing an alternate IPL to DST (type D IPL) (see “Performing an alternate IPL to DST (type D IPL)” on page 641). After performing a type D IPL on a primary partition, the first step in DST must be configuration recovery. v Using the function key F16, which is not displayed, while in DST debug mode during a step-mode IPL. Chapter 7. Common Service Procedures 637 “Selecting function 21 from the control panel”Provides detailed instructions on selecting function 21 while the system is operational. “Control panel function codes on the HMC”Learn about the control panel function codes used to indicate status and function options for a system that is managed by a Hardware Management Console (HMC). “Performing an IPL to DST” on page 640This procedure is for the entire system, or if logical partitions are installed, for the primary partition. “Performing an alternate IPL to DST (type D IPL)” on page 641An alternate IPL to DST is called a type D IPL or a stand-alone IPL by service representatives.Selecting function 21 from the control panel: Provides detailed instructions on selecting function 21 while the system is operational. Perform the following steps: 1. Select control panel Function 21 (Make DST available). Select Manual mode, then push the Increment (?) or Decrement (?) button on the control panel until 21 appears in the Function/Data display. Press the Enter button on the control panel.Notes: a. If you enter Function 21 and the primary console is powered off or not usable, reference code A600 500x appears on the control panel. Enter Function 21 again to force the DST Sign On display to appear on one of the alternative consoles.2. Primary console or alternate console will display the DST sign on. Note: For more information on how to identify the primary console, see “Determining a primary or alternative console” on page 591. 3. Sign-on to DST. Type QSRV as the valid user ID and get the password from the customer. If the customer has changed the full DST authority user ID or password, ask the customer for the correct values. Note: v If prompted for a password, be sure to give the new (changed) password to the customer. v Since V5R1, you are limited to three sign on attempts to DST. After three unsuccessful attempts, the service tools user ID will be disabled. The system administrator with QSECOFR authority will need to reset the ID.4. The Use Dedicated Service Tools (DST) display appears. The system is in the full paging environment. This ends the procedure.Control panel function codes on the HMC: Learn about the control panel function codes used to indicate status and function options for a system that is managed by a Hardware Management Console (HMC). If you are managing the system with the Hardware Management Console (HMC), use the HMC to perform control panel functions. Servers without physical panels require the HMC to perform the equivalent control panel functions. The HMC affects the physical control panel in the following ways: v Except for some limited auto-platform override function, such as Auto Power On Restart and Timed Power On, the system operating mode value no longer has meaning. v The operating system IPL-type value is disabled in functions 01 and 02. 638 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Functions 11-19 do not display partition system reference codes (SRCs). They continue to display SRCs from the platform Licensed Internal Code (LIC). v Functions 21, 22, 34, and 65-70 are not selectable on the physical control panel.The following table lists the control panel functions that you can perform on the HMC. Function Description 01 1. Displays the currently selected IPL type (and logical key mode on some system types). 2. Displays the currently selected IPL speed override for the next IPL.This function is available in both normal and manual operating mode. 02 Performs a slow boot. This function is available in both normal and manual operating mode. 03 Performs a re-IPL. The IPL uses the selected IPL options. This function is available only in manual operating mode and when the system power is on. 04 Performs a lamp test; all displays and indicators are switched on. This function is available in both normal and manual operating mode. 05-06 Reserved. 07 Allows you to set SPCN configuration IDs. This function is available only in manual operating mode when the system is in standby. 08 Causes a fast power off. This function is available only when the system is in manual operating mode and the system power is on. 09-10 Reserved. 11-19 Displays a system reference code (SRC) on the control panel. These functions are available in both normal and manual operating mode when an SRC is available. 20 Displays the machine type, model, processor feature code, processor class indicator, and IPL path description. This function is available in both normal and manual operating mode. 21 Causes the Use Dedicated Service Tool (DST) display to appear on the system console. This function is available only in the manual operating mode and when activated by the operating system. 22 Forces a partition dump. This function is available only in the manual operating mode and when activated by the operating system. 23-24 Reserved. 25-26 Not applicable. 27-29 Reserved. 30 Displays CEC FSP IP address and location. This function is available only in the manual operating mode and when activated by the FSP. 31-33 Reserved. 34 Retries the partition dump. This function is available only in the manual operating mode and when activated by the operating system. 35-41 Reserved. 42-43 Not applicable. 44-49 Reserved. 50-52 Reserved. 53 Unconfigures an intermittently failing processor (repeat GARD function). This function is available only in the manual operating mode. 54 Reserved. 55 Platform dump override. This function is available only in the manual operating mode when activated by the FSP and when CE scroll ranges are enabled. Chapter 7. Common Service Procedures 639 Function Description 56-62 Reserved. 63 Progress indicator history. This function is available only in the manual operating mode. 64 Diagnostic status SRC trace. This function is available only in the manual operating mode. 65 Deactivates remote service. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled. 66 Activates remote service. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled. 67 Disk unit I/O processor (IOP) reset/reload is enabled only by specific disk unit SRCs. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled. 68 Turns off power domains during concurrent maintenance of IOPs and IOAs. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled. 69 Turns on power domains during concurrent maintenance of IOPs and IOAs. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled. 70 Dump service processor control storage. Initiates an I/O processor (IOP) dump; enabled by specific error conditions. This function is available only in the manual operating mode when activated by the operating system and when CE scroll ranges are enabled. Performing an IPL to DST: This procedure is for the entire system, or if logical partitions are installed, for the primary partition. Perform the following steps: 1. Select a type B IPL in Manual mode. Attention: For systems with logical partitions, performing an IPL on the primary partition causes the secondary partitions to be powered down. Failing to power down these secondary partitions will cause an abnormal power-down on the secondary partitions and possible loss of data. 2. Perform one of the following: v If there is a problem log entry after powering on the system, perform a delayed power-off. (This is the data for problem analysis.) When the system is off, press the Power button on the control panel to perform an IPL. Select Function 03 if the delayed power-off takes longer than 40 minutes to complete. Attention: Performing Function 03 may cause damage to objects, and it may also cause data loss. v If there is an attention light after powering on the system, use the Increment (?) or Decrement (?) button to select control panel Function 03. Press Enter on the control panel to perform an IPL. v When the system is powered off, press the Power button on the control panel to perform an IPL. The IPL or Install the System display appears on the primary console.3. Select the Use Dedicated Service Tools option. The DST Sign On display appears. For more information on how to identify the primary console, see “Determining a primary or alternative console” on page 591. 4. Sign on to DST with a valid user ID and password. Remember: Enter a valid user ID and get the password from the customer. If the customer has changed the full DST authority, user ID, or password, ask the customer for the correct values.Note: 640 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v If prompted for a password, be sure to give the new (changed) password to the customer. v You are limited to three sign on attempts to DST. After three unsuccessful attempts, the service tools user ID will be disabled. A user ID with a service tool security privilege must reset the ID.The Use Dedicated Service Tools (DST) display appears. Restriction: The system is in the limited paging environment. This ends the procedure.Performing an alternate IPL to DST (type D IPL): An alternate IPL to DST is called a type D IPL or a stand-alone IPL by service representatives. An alternate IPL to DST is called a type D IPL or a stand-alone IPL by service representatives. Perform the following steps: 1. To use an alternate installation device, select the DST option to Work with alternate installation device. This option is available in DST from control panel function 21 or a type D IPL. 2. Determine the removable media unit or units for an alternate IPL. Power on the removable media units. 3. Place the media that contains the Licensed Internal Code (first tape of the customer’s system save, the SAVLIC, ISMD tapes, or optical device) in the removable media units. Load the media and make the media units ready. See the device information for instructions on loading the removable media. 4. Power off the system. If the system is running, end all jobs and perform the power down procedure. 5. Select a type D IPL in Manual mode. For information on how to select IPL options, see “IPL type, mode, and speed options” on page 642. 6. Power on the console. For more information on how to identify the primary console, see “Determining a primary or alternative console” on page 591. 7. Load the media and make the media unit ready. 8. Press the Power button to power on the system. Notes: a. Some types of removable media units automatically reset. In this condition, SRC A100 1933 (Media device not ready) may be displayed. b. If the installation device is an optical device, you may need to IPL the system again. This action will make the unit ready. pwrdwnsys restart(*yes) IPL(D) c. If the installation device is a tape unit, system reference code (SRC) A1001933 is displayed until the device automatically makes itself ready. d. The removable media becomes active. There is a delay while the system loads information from the load source. While running a process, the system continuously updates SRCs on the control panel that show the status. 9. Wait for the Install Licensed Internal Code display to appear on the console. The wait varies depending on the speed of the removable media unit and the processor speed for the specific system model.Notes: a. If SRC A600 500x is displayed on the control panel, the system was not able to locate the console. Chapter 7. Common Service Procedures 641 Ensure that the console is powered on (repair if necessary). Select Function 21 on the control panel (see “Selecting function 21 from the control panel” on page 638) to make DST available. b. If an SRC other than A600 500x is displayed on the control panel, go to “Starting Point for All Problems” in the Problem Analysis information for your system.10. The Install Licensed Internal Code display appears on the console. v If you want to access DST, select the Work with Dedicated Service Tools (DST) option and continue to the next step of this procedure. v If you want to install or restore Licensed Internal Code, select the Install Licensed Internal Code option and continue with the install or restore instructions. v If you want to define the device from which the system LIC should be installed, select the Define alternate installation device option. This option also allows you to enable or disable the alternate installation device.11. Reapply PTFs. 12. Sign on to DST as QSRV. Type QSRV as the valid user ID and get the password from the customer. If the customer has changed the full DST authority user ID or password, ask the customer for the correct values. Note: a. If prompted for a password, be sure to give the new (changed) password to the customer. b. You are limited to three sign on attempts to DST. After three unsuccessful attempts, the service tools user ID will be disabled. The system administrator with QSECOFR authority will need to reset the ID.13. The Use Dedicated Service Tools (DST) display appears. Options are limited because the system is in the stand-alone environment. This ends the procedure.Start a service tool Select this option from the Use Dedicated Service (DST) Tools display. Note: The Start a service tool option is also available from the System Service Tools (SST) display. The following options are available from the Start a Service Tool display: v Display / alter dump v Licensed Internal Code log v Licensed Internal Code trace v Hardware service manager v Main storage dump manager v Product activity log v Operator panel functions v Performance data collector v (available only on systems with SST)IPL type, mode, and speed options Set the IPL options from function 02 on the control panel. To permanently change the IPL speed use the CHGIPLA command. Note: Systems managed by the Hardware Management Console (HMC) should use the HMC to perform control panel functions. v Use function 01 to display the current IPL option, including IPL speed. This is the option and speed that the system will use at the next IPL. 642 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) v Use the HDWDIAG (hardware diagnostics) parameter of the CHGIPLA command to permanently change the IPL speed. The values are: The default shipped value *MIN (minimal) for fast diagnostics and *ALL for all (full) diagnostics.IPL speed recommendations v Use CHGIPLA *MIN, or Function 02 F or V=F for most IPLs (to perform a fast IPL that uses minimum diagnostics). v Use CHGIPLA *ALL, or Function 02 S or V=S, if you change hardware or suspect an intermittent hardware failure (to perform a slow IPL that uses all diagnostics). v Use CHGIPLA *ALL, or Function 02 S or V=S during the first Install IPL. You can set this at the panel with Function 02; it affects only the one IPL because the next IPL will again pick up the value set with CHGIPLA.Printing the System Configuration List The System Configuration List printout consists of the following sections (80-character width): 1. System information v System type, model, serial number, and release2. Packaging hardware resource information v Resource name, type-model, serial number, part number, frame ID, card position, and device position3. Logical hardware resource information v Resource name, type-model, serial number, and logical address4. Legend v Descriptions of the indicators v Logical address format information 132-character width printouts consist of the following: v description v type-module v serial number v location data – frame ID – device position – card positionv logical resource name v part number v logical address You can sort the printout by location or logical address data.Print the system configuration list To print the system configuration list, perform the following procedure: 1. On the Start a Service Tool display, select the Hardware service manager option. 2. Select the Print (F6) function key on the Hardware Service Manager display to request a printout. 3. The Print Format Options display appears. You can sort the information by location or logical address.Note: The logical address sort and location sort are available only in 132–character width. 4. Look for the system configuration list printout. Notes: Chapter 7. Common Service Procedures 643 a. If you select the Print function key from a display other than the Hardware Service Manager display, only the information that appears on the display will print. b. Use the last page of the printout (the legend) to assist in interpreting the information. c. The logical and packaging hardware resources have unique resource names. d. You can also view logical address information online from the product activity log.This ends the procedure. To collect more details on the information that is listed on the system configuration list, select the Print function key from a display other than the Hardware Service Manager display. Note: The entire system configuration list will not print. Only the information that appears on the display will print. Details on the system bus, main storage, or processor For details (that include descriptions) on the system bus, main storage, or processor, perform the following procedure: 1. Select the Logical hardware resources option on the Hardware Service Manager display. 2. On the Logical Hardware Resources display, select one of the following options to display details: v System bus resources v Processor resources v Main storage resources v High-speed link resources3. Use the Print function key to print the information. This ends the procedure.Hardware Service Manager You can select the Hardware service manager option from the Start a Service Tool display. It allows you to display and work with system hardware (logical or packaging) and debug input-output processors (IOP) and devices. 644 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Note: The system power control network (SPCN) option appears only on systems with SPCN. The Hardware Service Manager display provides the following information: v System type, model, and serial number v Licensed Internal Code and operating system release information v A list of the hardware service manager options The options available vary depending on the system type and paging environment. v Function keys The function keys vary depending on the display that is shown.Managing the Advanced System Management Interface (ASMI) Understand how to manage the Advanced System Management Interface (ASMI). The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI may also be referred to as the service processor menus. Use the following information to help you manage the ASMI: v “Accessing the ASMI using a Web browser” on page 646 v “Accessing the ASMI using an ASCII terminal” on page 648 v “Accessing the ASMI using the HMC” on page 648 “Accessing the ASMI” on page 646You can access the ASMI through a Web browser or an ASCII console. “Displaying error and event logs” on page 649Display a list of all of the error and event logs in the service processor. “Setting the system enclosure type” on page 650Set values that uniquely identify the type of enclosures attached to the system. “Setting the system identifiers” on page 650Set the system-unique ID, system serial number, machine type, and machine model. Hardware Service Manager Attention: This utility is provided for service representative use only. System unit . . . . . . . : 9406-570 10-0020A Release . . . . . . . . . : V5R3M0 Select one of the following: 1. Packaging hardware resources (systems, frames, cards,...) 2. Logical hardware resources (buses, IOPs, controllers,...) 3. Locate resource by resource name 4. Failed and non-reporting hardware resources 5. System power control network (SPCN) 6. Work with service action log 7. Display label location work sheet 8. Device Concurrent Maintenance 9. Work with resources containing cache battery packs Bottom Selection F3=Exit F6=Print configuration F9=Display card gap information F10=Display resources requiring attention F12=Cancel Figure 66. Example Hardware Service Manager display (in the full and limited paging environments) Chapter 7. Common Service Procedures 645 “Viewing vital product data” on page 651View selected or all manufacturer’s VPD, such as serial numbers and part numbers. “Clearing all deconfiguration errors” on page 651Clear error records for specific or all resources in the system.Accessing the ASMI You can access the ASMI through a Web browser or an ASCII console. “Accessing the ASMI using a Web browser”Connect a PC or mobile computer to the server to access the ASMI. The Web interface to the ASMI is available during all phases of system operation including the initial program load (IPL) and run time. “Accessing the ASMI using an ASCII terminal” on page 648Connect the ASCII console to the server to access the ASMI. “Accessing the ASMI using the HMC” on page 648Access the ASMI through the HMC interface.Accessing the ASMI using a Web browser: Connect a PC or mobile computer to the server to access the ASMI. The Web interface to the ASMI is available during all phases of system operation including the initial program load (IPL) and run time. The Web interface to the Advanced System Management Interface (ASMI) is accessible through Microsoft® Internet Explorer 6.0, Netscape 7.1, or Opera 7.23 running on a PC or mobile computer connected to the service processor. The Web interface is available during all phases of system operation including the initial program load (IPL) and run time. However, some of the menu options in the Web interface are unavailable during IPL or run time to prevent usage or ownership conflicts if the system resources are in use during that phase. Note: To perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power, you must set up the ASMI. The following instructions apply to systems that are not connected to an HMC. If you are managing the server using an HMC, “Accessing the ASMI using the HMC” on page 648. To set up the Web browser for direct or remote access to the ASMI, complete the following tasks: 1. Connect the power cord from the server to a power source, and wait for the control panel to display 01. 2. Select a PC or mobile computer that has Microsoft Internet Explorer 6.0, Netscape 7.1, or Opera 7.23 to connect to your server. If you do not plan to connect your server to your network, this PC or mobile computer will be your ASMI console. If you plan to connect your server to your network, this PC or mobile computer will be temporarily connected directly to the server for setup purposes only. After setup, you can use any PC or mobile computer on your network that is running Microsoft Internet Explorer 6.0, Netscape 7.1, or Opera 7.23 as your ASMI console. 3. Connect an Ethernet cable from the PC or mobile computer to the Ethernet port labeled HMC1 on the back of the managed system. If HMC1 is occupied, connect an Ethernet cable from the PC or mobile computer to the Ethernet port labeled HMC2 on the back of the managed system. 4. Configure the Ethernet interface on the PC or mobile computer to an IP address and subnet mask within the same subnet as the server so that your PC or mobile computer can communicate with the server. Use the following table to help you determine these values: 646 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Table 67. PC or mobile computer network configuration information Server connector Subnet mask IP address HMC1 255.255.255.0 169.254.2.147 HMC2 255.255.255.0 169.254.3.147 For example, if you connected your PC or mobile computer to HMC1, the IP address for your PC or mobile computer could be 169.254.2.1 and the subnet mask would be 255.255.255.0. Set the gateway IP address to the same IP address as the PC or mobile computer. 5. Use Table 67 to determine the IP address of the Ethernet port that your PC or mobile computer is connected to, and type the IP address in the Address field of your PC’s or mobile computer’s Web browser. For example, if you connected your PC or mobile computer to HMC1, type https://169.254.2.147 in your PC’s or mobile computer’s Web browser. 6. When the login display appears, enter one of the following default user IDs and passwords: Table 68. Default user IDs and passwords for the Web interface User ID Password general general admin admin 7. Change the default password when prompted. 8. If you plan to connect your PC or mobile computer to your network, complete the following steps: a. From the navigation area, expand Network Services. b. Click Network Configuration. c. If your PC or mobile computer is connected to HMC1, fill in the section labeled Network interface eth0. If your PC or mobile computer is connected to HMC2, fill in the section labeled Network interface eth1. Ensure that the following fields are filled in correctly. Table 69. Fields and values for network configuration Field Value Configure this interface? Selected Type of IP address Static IP address This is the same address that you used in step 5. Subnet mask 255.255.255.0 d. Click Save Settings. e. Disconnect your PC or mobile computer from the server. f. Connect the server to your company network. g. Go to any PC or mobile computer on your company network that is running Microsoft Internet Explorer 6.0, Netscape 7.1, or Opera 7.23, and type the following in the Address field of your browser: https://host name.domain name where host name and domain name are the host name and domain name you entered when completing step 8c. h. Log in to the ASMI.9. Set the time of day on the server. Chapter 7. Common Service Procedures 647 Accessing the ASMI using an ASCII terminal: Connect the ASCII console to the server to access the ASMI. The ASCII interface to the ASMI provides a subset of the Web interface functions. The ASCII console is available only when the system is in platform standby state. It is not available during the initial program load (IPL) or run time. The ASMI on an ASCII console is not available during some phases of system operation, such as the initial program load (IPL) and run time. To set up the ASCII console for direct or remote access to the ASMI, complete the following tasks: 1. Complete the following steps to cable an ASCII console or equivalent to the server: Using a cable equipped with a null modem, connect the ASCII console to system connector S1 1 on the back of the server or to system port FS1 on the control panel using an RJ-45 connector. Note: Both system port 1 connections are not available simultaneously; when one is connected, the other is deactivated. 2. Connect the power cord from the server to a power source. 3. Wait for the control panel to display 01. 4. Ensure that your ASCII console is set to the following general attributes. These attributes are the default settings for the diagnostic programs. Be sure your terminal is set to these attributes before proceeding to the next step. General setup attributes 3151 /11/31/41 settings 3151 /51/61 settings 3161 /64 settings Description Line speed 19,200 19,200 19,200 Uses the 19,200 (bits per second) line speed to communicate with the system unit. Word length (bits) 8 8 8 Selects eight bits as a data word length (byte). Parity No No No Does not add a parity bit and is used together with the word length attribute to form the 8–bit data word (byte). Stop bit 1 1 1 Places a bit after a data word (byte). 5. Press a key on the ASCII console to allow the service processor to confirm the presence of the ASCII console. 6. When the login display appears, enter one of the following default user IDs and passwords: Table 70. Default user IDs and passwords for the Web interface User ID Password general general admin admin 7. Change the default password when you are prompted. 8. Set the time of day on the server.Accessing the ASMI using the HMC: Access the ASMI through the HMC interface. To access the Advanced System Management Interface (ASMI) using the Hardware Management Console (HMC), complete the following steps: 648 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 1. Ensure that the HMC is set up and configured. 2. In the navigation area, ensure that the server you are working with is selected. 3. Expand and click Operations. 4. In the content area, click Advanced System Management (ASMI). 5. Log on to the ASMI using the administrator userid and password.Displaying error and event logs Display a list of all of the error and event logs in the service processor. You can view error and event logs that are generated by various service processor firmware components. The content of these logs can be useful in solving hardware or server firmware problems. To perform this operation, your authority level must be one of the following: v General v Administrator v Authorized service providerInformational and error logs can be viewed by all authority levels. Hidden error logs can be viewed by authorized service providers. The following table shows error log types that might be displayed, the conditions that make an error log specific to that error log type, and the user authority level that will allow you to view specific types of error logs: Error log type Conditions User availability Severity Action Informational logs Informational Report to operating system (OS) but not hidden Available to all users Error logs Not informational Report to OS but not hidden Available to all users Hidden logs Not informational and informational Report to OS, hidden, or both Available only to the authorized service provider and users with higher authority. To view error and event logs in summary or full detailed format, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Service Aids and click Error/Event Logs. If log entries exist, a list of error and event log entries is displayed in a summary view. 3. To view the full detail format of any of the logs listed, select the log’s corresponding check box and click Show details. When multiple logs are selected, any action applies to each selected log. The full detail information might span several pages. The contents and layout of the full detail output is defined by the event or error logging component. 4. Click Mark as reported to mark platform error entries whose underlying causes have been resolved. By doing so, these entries are not reported to the operating system again when the system reboots. After they are marked, these errors can be overwritten by other errors logged in the service processor history log. Note: The Mark as reported button is available only when your authority level is an authorized service provider. Chapter 7. Common Service Procedures 649 Setting the system enclosure type Set values that uniquely identify the type of enclosures attached to the system. When setting the system enclosure type, ensure that the enclosure serial number field matches the original value, which can be found on a label affixed to the unit. Updating the enclosure serial field keeps the configuration and error information synchronized, and this information is used by the system when creating the location codes. This task must be done using the ASMI, not with the control panel. However, if you do not have access to the ASMI, the system will still operate without updating this information. For example, when replacing the I/O backplane, you must re-enter the original enclosure serial number into the enclosure serial number field to overwrite the serial number that is recorded for the new I/O backplane. Failure to enter the correct enclosure serial number will result in logical partition mappings being incorrect. Note: You cannot boot the system until valid values are entered for all fields in the enclosure-type information. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerTo change the system enclosure type, complete the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Program Vital Product Data. 3. Select System Enclosures. In the right pane, the current system enclosures are displayed. 4. Enter the desired settings for the Enclosure location, Feature Code/Sequence Number, and Enclosure serial number fields using the naming convention described in the ASMI help. The value of the Enclosure serial number field is different than the serial number of the system. Set the Reserved field to blank spaces unless directed otherwise by Level 3 technical support. 5. Click Save settings to update the system enclosure type information and save it to the VPD.Setting the system identifiers Set the system-unique ID, system serial number, machine type, and machine model. You can set the system-unique ID, serial number, machine type, and machine model. If you do not know the system-unique ID, contact your next level of support. To perform this operation, your authority level must be one of the following: v Administrator v Authorized service providerNote: You cannot boot the system until valid values are entered for all fields. You can change these entries only once. To set the system keywords, do the following: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration and Program Vital Product Data. 3. Select System Keywords. 4. In the right pane, enter the desired values for the system serial number, machine type, and machine model using the naming convention shown in the ASMI help. Set the Reserved field to blank spaces unless directed otherwise by service and support. 5. Click Continue. The data validation panel is displayed, which includes your entered settings. 650 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 6. Click Save settings to update the system keywords and save them to the VPD.Viewing vital product data View selected or all manufacturer’s VPD, such as serial numbers and part numbers. You can view manufacturer’s vital product data (VPD) stored from the system boot prior to the one in progress now. To perform this operation, your authority level must be one of the following: v General v Administrator v Authorized service providerTo view the VPD, complete the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Information and select Vital Product Data. 3. A list of field replaceable units (FRUs) that exist on the system and their descriptions are displayed. Select a single FRU or multiple FRUs from this list that you would like to view. 4. Click Display Details to display the details for selected FRUs, or click Display all details to display details for all VPD entries.Clearing all deconfiguration errors Clear error records for specific or all resources in the system. The ASMI allows you to clear error records for all or individual system hardware resources that include the processor, memory, L2 and L3 cache, I/O hubs, service processor card, and clock card. To clear all deconfiguration errors, your authority level must be an authorized service provider. Note: v Before performing this operation, record error messages or ensure that the error record data is no longer needed; otherwise, you will lose all error data from the hardware resources. v Upon completion of this procedure, the system must be powered off, and in standby mode, for the deconfiguration errors to be cleared.To clear all deconfiguration errors, do the following: 1. If the server is not powered off and in standby state, ask the customer to power off the server. When the server is in standby state (the power on LED on the control panel is blinking slowly), continue with the next step. 2. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 3. In the navigation area, expand System Configuration and Hardware Deconfiguration. 4. Select Clear All Deconfiguration Errors. 5. In the right pane, select the desired hardware resource from the menu. You can select All hardware resources or an individual resource. 6. Click Clear errors for selected hardware resource. 7. Power on the server after clearing the deconfiguration errors. This will cause the FSP to retest the hardware as it powers on. Chapter 7. Common Service Procedures 651 Service functions Service functions provides enough detail for the service provider to gather information about hardware problems while under the direction of the next level of support. This information provides basic information about some of the service functions that are commonly used by authorized service providers. Before using this information, you should know how to service this particular hardware model. You must also know how to safely work with electrical components. Attention: Misuse of service tools or using them with inappropriate security safeguards might result in loss of data, programs, or other objects on the system. 652 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Managing dumps After you have performed a dump, or one occurs automatically, use this information to manage the dump, including copying, reporting, and deleting it. Dumps are deleted automatically when a new dump occurs, except for log dumps. “Performing dumps”Learn more about the different types of dumps that can be performed and how to manage them, including copying, reporting, and deleting them. “Copying a dump” on page 656Use this procedure to copy a current platform dump or current main storage dump when the Main Storage Dump auto copy option is disabled. Copying the current dump allows another dump of the same type to be created. “Reporting a dump” on page 659Find out how to report a dump using the HMC, i5/OS service tools, AIX command line, or Linux command line. “Deleting a dump” on page 661Use this procedure to delete a dump when dump copies are no longer needed by your next level of support or your hardware service provider.Performing dumps Learn more about the different types of dumps that can be performed and how to manage them, including copying, reporting, and deleting them. Dumps can occur automatically after a failure or external reset, or after a manual request. The dump information provides valuable information on system status and should be reported to your next level of support or your hardware service provider. v Platform system dump: A platform system dump is a collection of data from server hardware and firmware, either after a system failure or a manual request. v Service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request.In addition, there are other dumps that generate automatically. These include: v Platform event log entry dumps (which are generated by server firmware when a platform event log entry is truncated) v Resource dumps v SMP dumps “Performing a platform system or service processor dump”A platform system dump or service processor dump is performed when your next level of support or your service provider suspects a wait or loop in the server firmware. “Performing a service processor dump” on page 655A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request.Performing a platform system or service processor dump A platform system dump or service processor dump is performed when your next level of support or your service provider suspects a wait or loop in the server firmware. Only perform a platform system or service processor dump under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. Chapter 7. Common Service Procedures 653 Note: Before performing a platform system dump, collect any reference code information from the control panel or the HMC. “Using the HMC to perform a system dump”If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the HMC to perform a system dump, which is a collection of data from the server hardware and firmware. “Using the control panel to perform a system dump”If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. If you do not have an HMC, you can use the control panel to perform a system dump, which is a collection of data from the server hardware and firmware. “Using the ASMI to perform a system dump” on page 655If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the ASMI to perform a system dump, which is a collection of data from the server hardware and firmware.Using the HMC to perform a system dump: If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the HMC to perform a system dump, which is a collection of data from the server hardware and firmware. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. Follow these steps to initiate a system dump. 1. Perform a controlled shutdown of all partitions. Note: A system dump with abnormally terminate any running partitions. 2. In the navigation area, open Systems Management. 3. Select the server and open it. 4. Select Serviceability. 5. Select Manage Dumps. 6. Select Action > Initiate System Dump. The dump is automatically saved to the HMC. 7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the control panel to perform a system dump: If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. If you do not have an HMC, you can use the control panel to perform a system dump, which is a collection of data from the server hardware and firmware. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. If you do not have an HMC, follow these steps to perform a system dump: 1. Perform a controlled shutdown of the operating system. Note: A platform system dump with abnormally terminate any running partitions. 2. On the control panel, select Manual mode. 3. Use the increment or decrement buttons to select function 42 ( system dump). 4. Press Enter on the control panel. 654 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 5. Is 00000000 or FF displayed on the control panel for more than 30 seconds? v Yes: The service processor or server firmware is not responding to a request from the control panel. Contact your next level of support or your hardware service provider. This ends the procedure. v No: An attention SRC A1xx3042 is displayed, which indicates that function 42 has been selected.6. Perform the following steps: a. Reselect function 42 and press Enter on the control panel. b. Switch back to Normal mode and wait for the dump to complete. c. Wait for the system to IPL again.7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the ASMI to perform a system dump: If your next level of support or your service provider suspects a wait or loop in the server firmware, they might request that you perform a system dump. You can use the ASMI to perform a system dump, which is a collection of data from the server hardware and firmware. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. To perform a system dump, see Initiating a system dump. Performing a service processor dump A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. Only perform a service processor dump under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. “Using the HMC to perform a service processor dump”A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the HMC to perform a service processor dump. “Using the control panel to perform a service processor dump” on page 656A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. If you do not have an HMC, you can use the control panel to perform a service processor dump. “Using the ASMI to perform a service processor dump” on page 656A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the ASMI to perform a service processor dump.Using the HMC to perform a service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the HMC to perform a service processor dump. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. Follow these steps to initiate a service processor dump. 1. In the navigation area, open Systems Management. 2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. Chapter 7. Common Service Procedures 655 5. Select Action > Initiate System Dump. The dump is automatically saved to the HMC. 6. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the control panel to perform a service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. If you do not have an HMC, you can use the control panel to perform a service processor dump. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. If you do not have an HMC, follow these steps to perform a service processor dump: 1. On the control panel, select Manual mode. 2. Use the increment or decrement buttons to select function 43 (service processor dump). 3. Press Enter on the control panel. 4. Is 00000000 or FF displayed on the control panel for more than 30 seconds? v Yes: The service processor or server firmware is not responding to a request from the control panel. Contact your next level of support or your hardware service provider. This ends the procedure. v No: An attention SRC A1xx 3043 is displayed, which indicates that function 43 has been selected.5. Perform the following steps: a. Reselect function 43 and press Enter on the control panel. b. Switch back to Normal mode, and wait for the dump to complete.6. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the ASMI to perform a service processor dump: A service processor dump is a collection of data from a service processor either after a failure, external reset, or manual request. You can use the ASMI to perform a service processor dump. Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. To perform a service processor dump, see Initiating a service processor dump. Copying a dump Use this procedure to copy a current platform dump or current main storage dump when the Main Storage Dump auto copy option is disabled. Copying the current dump allows another dump of the same type to be created. Note: If you have an HMC, you must manage your platform dumps with the HMC. “Using an HMC to copy a dump” on page 657Learn how to copy a dump using an HMC. “Using the AIX command line to copy a dump” on page 657Learn how to use the AIX command line to copy a dump. “Using the Linux command line to copy a dump” on page 657Learn how to use the Linux command line to copy a dump. “Using the IBM i5/OS service tools to copy a dump” on page 658Learn how to use the i5/OS service tools to copy a dump. 656 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Using an HMC to copy a dump Learn how to copy a dump using an HMC. Note: If you have an HMC, you must manage your platform dumps with the HMC. However, you will need to manage i5/OS main storage dumps with the IBM i5/OS service tools. 1. In the navigation area, open Systems Management. 2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. 5. Select Selected > Copy Dump. 6. Select the dump to copy. 7. Verify that HMC containing the dump is the HMC being used to copythe dump. 8. To copy the dump to DVD media, select Selected > Copy Dump File to DVD Media. 9. Load the media and follow the instructions on the display. 10. When the copy procedure is successfully completed, remove the media and process it according to the instructions of your next level of support or your hardware service provider. 11. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. This ends the procedure.Using the AIX command line to copy a dump Learn how to use the AIX command line to copy a dump. Copy the dump only if you are directed to do so by your next level of support or service provider. Note: If you have an HMC, you must manage your platform dumps with the HMC. For details, see “Using an HMC to copy a dump.” Platform dumps are saved in AIX under the following directory: /var/adm/platform Note: If the AIX partition becomes the service partition after it was installed, run the following command to create this directory: platform_dump -f jfs2 or platform_dump -f jfs This command will create the /var/adm/ras/platform file system. This is done automatically during installation if the AIX partition was marked as the service partition before it was installed. Using the Linux command line to copy a dump Learn how to use the Linux command line to copy a dump. Copy the dump only if you are directed to do so by your next level of support or service provider. Note: If you have an HMC, you must manage your platform dumps with the HMC. For details, see “Using an HMC to copy a dump.” Platform dumps are saved in Linux under the following directory: /var/log/dump/ Chapter 7. Common Service Procedures 657 Using the IBM i5/OS service tools to copy a dump Learn how to use the i5/OS service tools to copy a dump. Choose from the following options: v To copy a platform system or SP™ dump, go to “Copying a platform system or SP dump.” v To copy a main storage dump, go to “Copying a main storage dump.”Copying a platform system or SP dump: Use this procedure to copy a platform system or SP dump with the i5/OS tools. Note: If you have an HMC, you must manage your platform dumps with the HMC. For details, see “Using an HMC to copy a dump” on page 657. 1. On any command line, enter STRSST. 2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). Did option 7 (Work with current platform dump) appear on the Main Storage Dump Manager display? v Yes: There is a current platform dump. Continue with the next step. v No: There is no current platform dump. A dump might have been automatically copied to the Auxiliary Storage Pool (ASP). Go to “Reporting a dump” on page 659. This ends the procedure. 5. Select option 7 (Work with current platform dump). 6. Select option 1 (Display summary). The Platform Dump Summary display shows the platform dump type, date and time of the dump, and reference code. 7. Record the summary information and report it to your next level of support or your hardware service provider. 8. Press F12 (Cancel). 9. Select option 3 (Copy to ASP). The platform dump will be copied to disk storage. After the dump is copied, a message will be displayed indicating whether the copy completed. 10. Did a message indicate ?Copy completed normally?? v Yes: There no longer is an option 7 (Work with current platform dump) listed. This ends the procedure. v No: Continue with the next step.11. Has your next level of support or your hardware service provider requested a media copy of the platform dump? v Yes: Continue to the next step. v No: Work with your next level of support or your hardware service provider on the problem. This ends the procedure.12. If the copy to disk failed, copy a current platform dump to removable media by performing the following steps: a. Select option 2 (Copy to media). b. Load the media and follow the instructions on the display. c. When the copy procedure is successfully completed, remove the media and process it according to the instructions of your next level of support or your hardware service provider.13. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. This ends the procedure.Copying a main storage dump: Use this procedure to copy an i5/OS main storage dump using i5/OS tools. 1. On any command line, enter STRSST. 658 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). 5. From the Main Storage Dump Manager display, press Enter. Note: Copying a main storage dump (MSD) to disk storage on the system prevents the MSD from being overwritten in the event that another MSD occurs. 6. Select option 1 (Display/Print). 7. Select option 1 (MSD summary). The Main Storage Dump Summary display shows the system reference code, date and time of the MSD, and Licensed Internal Code (LIC) level. 8. Record the summary information and report it to your next level of support or your hardware service provider. 9. Press F12 (Cancel) twice to return to the Work with Current Main Storage Dump display. 10. Select option 3 (Copy to ASP). 11. Type a dump description and then press Enter to start copying the dump. After the dump is copied, a message is displayed indicating whether the MSD copy completed. 12. Did a message indicate ?Copy completed normally?? v Yes: This ends the procedure. v No: Continue with the next step.13. Has your service provider requested a media copy of the MSD? v Yes: Continue to the next step. v No: Work with your next level of support or your hardware service provider on the problem. This ends the procedure.14. To copy a current MSD to a removable media, perform the following steps: a. Select option 2 (Copy to media). b. Load the media and follow the instruction on the display. c. When the copy procedure completes successfully, remove the media and process it according to your service provider’s instructions. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. This ends the procedure. Reporting a dump Find out how to report a dump using the HMC, i5/OS service tools, AIX command line, or Linux command line. “Using an HMC to report a dump”Learn how to report a dump using an HMC. “Using the AIX command line to report a dump” on page 660Learn how to report a dump using the AIX command line. “Using the Linux command line to report a dump” on page 660Learn how to report a dump using the Linux command line. “Using the i5/OS service tools to report a main storage or platform system or SP dump” on page 660Learn how to report a dump using the i5/OS service tools.Using an HMC to report a dump Learn how to report a dump using an HMC. 1. In the navigation area, open Systems Management. 2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. 5. Select Selected > Call Home Dump. Chapter 7. Common Service Procedures 659 6. Verify that HMC containing dump is the HMC being used to report the dump. 7. Select Selected ? Call Home Dump. The dump summary display shows the date, time and size of the dump. Report the summary information to your next level of support or your hardware service provider. 8. Has your next level of support or your hardware service provider requested a copy of the dump? v Yes: Continue to the next step. v No: Contact your next level of support or your hardware service provider on the problem. 9. To copy a dump to removable media, perform the following steps: a. Select Copy Dump File to DVD Media. b. Load the media and follow the instructions on the display. c. When the copy procedure is successfully completed, remove the media and process it according to the instructions of your next level of support or your hardware service provider. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider.10. To copy a dump to another system, perform the following steps: a. Select Selected ? Copy Dump File to Other System. b. Follow the instructions on the display. c. When the copy procedure is complete, contact your next level of support or your hardware service provider.Using the AIX command line to report a dump Learn how to report a dump using the AIX command line. Report the dump only if you are directed to do so by your next level of support or service provider. Platform dumps are saved in AIX under the following directory: /var/adm/platform Note: If the AIX partition becomes the service partition after it was installed, run the following command to create this directory: platform_dump -f jfs2 or platform_dump -f jfs This command will create the /var/adm/ras/platform file system. This is done automatically during installation if the AIX partition was marked as the service partition before it was installed. Using the Linux command line to report a dump Learn how to report a dump using the Linux command line. Platform dumps are saved in AIX under the following directory: /var/log/dump/ Report the dump only if you are directed to do so by your next level of support or service provider. Using the i5/OS service tools to report a main storage or platform system or SP dump Learn how to report a dump using the i5/OS service tools. If your system has the Main Storage Dump enabled for auto copy, your system might have automatically copied the current MSD to the Auxiliary Storage Pool (ASP) using the dump description Auto Copy. Then it might have performed another IPL of the system, or automatically copied a current platform system or SP dumps to ASP using the dump type as part of the dump description. 660 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) 1. On any command line, enter STRSST. 2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). 5. Select option 2 (Work with copies of main storage dumps). 6. Find a new dump near the bottom of the list, type 5 (Display/Print) next to it, and press Enter. 7. Did the Display Main Storage Dump display appear? v Yes: This is a main storage dump. Select option 1 (MSD Summary). Continue with the next step. v No: This is a platform dump. Continue with the next step. 8. Record the summary information so that you can report it to your next level of support or your hardware service provider. The Dump Summary display shows the date and time of the dump, system reference code (SRC) data, and Licensed Internal Code (LIC) level. 9. Press F3 (Exit). 10. If the dump has a description of Auto Copy, rename it so that another auto copy and IPL can occur if necessary. a. Select option 7 (Rename). b. Type a new dump description, and press Enter.11. Has your next level of support or your hardware service provider requested a media copy of the dump? v Yes: Continue with the next step. v No: Contact your next level of support or your hardware service provider on the problem. Continue with “Deleting a dump.”12. To copy an MSD or a platform dump to removable media, perform the following steps: a. Select option 8 (Copy to media). b. Load the media and follow the instructions on the display. c. When the copy procedure completes successfully, remove the media and process it according to the instructions from your next level of support or your hardware service provider. If you encounter a problem with the copy procedure, contact your next level of support or your hardware service provider. 13. Continue with “Deleting a dump.”Deleting a dump Use this procedure to delete a dump when dump copies are no longer needed by your next level of support or your hardware service provider. “Using an HMC to delete a dump”Learn how to use an HMC to delete a dump. “Using the AIX command line to delete a dump” on page 662Learn how to delete a dump using the AIX command line. “Using the Linux command line to delete a dump” on page 662Learn how to delete a dump using the Linux command line. “Using the i5/OS service tools to delete a dump” on page 662Learn how to delete a dump using the i5/OS service tools.Using an HMC to delete a dump Learn how to use an HMC to delete a dump. Dumps are stored in the HMC’s Manage Sirviceable Events tool and are automatically deleted when room is required for new dumps. Use this procedure when you want to manually delete a dump. 1. In the navigation area, open Systems Management. Chapter 7. Common Service Procedures 661 2. Select the server and open it. 3. Select Serviceability. 4. Select Manage Dumps. 5. Select Selected > Delete Dump. 6. Select the dump to delete. 7. Verify that HMC containing the dump is the HMC being used to delete the dump. 8. Select Selected ? Delete Dump.Using the AIX command line to delete a dump Learn how to delete a dump using the AIX command line. Platform dumps are saved in AIX under the following directory: /var/adm/platform Only delete the dump if directed to do so by your next level of support or service provider. Note: If the AIX partition becomes the service partition after it was installed, run the following command to create this directory: platform_dump -f jfs2 or platform_dump -f jfs to create the /var/adm/ras/platform file system. During an installation, this directory is created automatically if the AIX partition was marked as the service partition before it was installed. Using the Linux command line to delete a dump Learn how to delete a dump using the Linux command line. Platform dumps are saved in AIX under the following directory: /var/log/dump/ Only delete the dump if directed to do so by your next level of support or service provider. Using the i5/OS service tools to delete a dump Learn how to delete a dump using the i5/OS service tools. 1. On any command line, enter STRSST. 2. Enter a valid service tools user ID and password. 3. Select option 1 (Start a service tool). 4. Select option 6 (Main storage dump manager). 5. Select option 2 (Work with copies of main storage dumps). 6. If you want to delete any dump copies, type 4 next to the dump copies, and press Enter twice. 7. To exit SST, press F3 (Exit) three times, and then press Enter. 662 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Performing an IBM i5/OS main storage dump A main storage dump is a collection of data from the i5/OS’s main storage. It can be manually initiated when the system administrator suspects a system wait or loop, or automatically by the server firmware as the result of a system failure. Note: Before performing a main storage dump, collect any reference code information from the control panel or the HMC. “Using the HMC to perform an IBM i5/OS main storage dump”Learn how to use the HMC to perform an IBM i5/OS main storage dump. “Using the control panel to perform an IBM i5/OS main storage dump”Learn how to use the control panel to perform an IBM i5/OS main storage dump.Using the HMC to perform an IBM i5/OS main storage dump Learn how to use the HMC to perform an IBM i5/OS main storage dump. Note: Before performing a or main storage dump, collect any reference code information from the control panel or the HMC. Follow these steps to perform a main storage dump from an HMC:Note: If a main storage dump occurred automatically, go to “Managing dumps” on page 653 for information on how to copy, report, or delete a dump. 1. In the navigation area, open Server and Partition. 2. Select Server Management. 3. In the contents area, expand Partitions. 4. Right-click the logical partition and select Restart Partition. 5. Select Dump. 6. Click OK. 7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653.Using the control panel to perform an IBM i5/OS main storage dump Learn how to use the control panel to perform an IBM i5/OS main storage dump. Note: Before performing a or main storage dump, collect any reference code information from the control panel or the HMC. If you do not have an HMC, follow these steps to perform a platform system dump: 1. Verify that no interactive jobs are running. 2. On the control panel, select Manual mode. 3. Use the increment or decrement buttons to select function 22 (main storage dump). 4. Press Enter on the control panel. 5. Is 00000000 or FF displayed on the control panel for more than 30 seconds? v Yes: The service processor or server firmware is not responding to a request from the control panel. Contact your next level of support or your hardware service provider. This ends the procedure. v No: An attention SRC A1xx 3022 is displayed, which indicates that function 22 has been selected. Continue with the next step.6. Perform the following steps on the control panel: a. Reselect function 22. Chapter 7. Common Service Procedures 663 b. Press Enter. c. Switch back to Normal mode and wait for the dump to complete. If function 22 was selected, the appearance of an A2D0 3000 on the console indicates that a successful user-initiated main storage dump was performed. (A B6xx xxxx reference code indicates that a successful i5/OS initiated main storage dump was performed.7. For details on how to copy, report, or delete a dump after you have completed a dump, refer to “Managing dumps” on page 653. 664 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Performing a slow boot Learn how to perform a slow boot using the HMC, control panel, or ASMI menus depending on your configuration. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. “Using the HMC to perform a slow boot”Learn how to perform a slow boot using the HMC. “Using the control panel to perform a slow boot”Learn how to use the control panel to perform a slow boot. “Using the ASMI menus to perform a slow boot” on page 666Learn how to use the ASMI menus to perform a slow boot.Using the HMC to perform a slow boot Learn how to perform a slow boot using the HMC. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. 1. Record any reference codes appearing on the control panel or HMC. 2. Shut down all logical partitions and the server. 3. In the navigation area, open Systems Management. 4. Open Servers. 5. Select the server you want to perform a slow boot on. 6. In the Tasks area, select Properties. 7. Click the Power-On Parameters tab. 8. Record the current boot setting in the Current column next to the Power-on speed override box. 9. Select slow from the list in the Power-on speed override box in the Next column. Note: These settings will apply to all future boots. After you complete the service action, change the settings back to what you recorded in step 8. 10. Click OK. 11. After you are done, return to the step in the procedure that caused you to perform a slow boot. This ends the procedure.Using the control panel to perform a slow boot Learn how to use the control panel to perform a slow boot. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. 1. Record any reference codes appearing on the control panel. 2. Stop the operating system by performing the following steps: Chapter 7. Common Service Procedures 665 a. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down System) command. b. At the command line, enter one of the following commands: Note: These commands stop the operating system. The system power turns off, the power-on light begins to slowly blink, and the system goes into a standby state. v If your system is running AIX, type shutdown v If your system is running IBM i5/OS, type pwrdwnsys *immed v If your system is running Linux, type shutdown -h now 3. On the control panel, use the increment or decrement buttons to scroll to function 02. 4. Press Enter to start function 02. The current IPL type (with pointer) and speed is displayed. 5. Use the increment or decrement buttons to scroll through the IPL types and speeds. 6. When manual is shown, press Enter to select the IPL type of manual. The current IPL type and speed (with pointer) is displayed. 7. Use the increment or decrement buttons to scroll through the IPL speeds. 8. When S is shown, press Enter to select the IPL speed of slow. 9. To exit function 02, press Enter multiple times until just 02 is displayed. 10. Power on the server by pressing the white power button. Then return to the procedure that caused you to perform a slow boot. This ends the procedure.Using the ASMI menus to perform a slow boot Learn how to use the ASMI menus to perform a slow boot. In some cases, you need to perform a slow-mode boot in order to perform extended diagnostic testing. Note: A slow-mode boot might yield a new reference code on the control panel, or new errors in the service processor error log. When the server reports a new error code, record it for use in subsequent steps. 1. Record any reference codes appearing on the control panel or HMC. 2. Shut down all logical partitions and the server. 3. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 4. Expand Power/Restart Control and select Power On/Off System. 5. Record the settings so that you can re-enter them after you complete the service actions. 6. On the Power on/off system menu, select the following: v Select Slow in the Set System Boot Speed field. v Select Running in the Boot to system server firmware field.7. Select Save Settings. Note: These settings will apply to all future boots. After you complete the service action, change the settings back to what you recorded in step 5. 8. After you are done, return to the step in the procedure that caused you to perform a slow boot. This ends the procedure. 666 Power Systems: Service Guide for the IBM Power 520 (8203-E4A, 9407-M15, and 9408-M25) Determining the primary or alternate console Use this procedure to determine the primary or alternate console. If the HMC has been configured as the primary console, the system will start the HMC. If the Operations Console has been configured as the primary console, the system will start the Operations Console. If the Operations Console has not been configured, the primary console is a workstation that is attached to the first input/output processor (IOP) that is capable of supporting workstations. In addition to the primary console, the system can assign up to two alternative consoles. The first alternative console can only be a twinax workstation that is attached to the same IOP as the primary console. The second alternative console is a workstation that is attached to the next IOP or input/output adapter (IOA) that is capable of supporting workstations. The IOP that supports the console must be on the first system bus (bus 1). If a workstation is not correctly attached to the first IOP that is capable of attaching workstations, then the system will not assign a primary console. The system will display a reference code on the operator’s panel. In addition, if the IPL (initial program load) mode is set to Manual, the system will stop. “Primary console workstation requirements”Learn about the primary console workstation requirements. “Find the primary console when the system is operational” on page 668Learn how to find the primary console when the system is operational. “Find the primary console when the system power is off” on page 668Learn how to find the primary console when the system power is off.Primary console workstation requirements Learn about the primary console workstation requirements. In order to be the primary console, the workstation must be operational and have the correct port and address. If the workstation is a PC, it must also have an active emulation program on the workstation. The workstation requirements are: v HMC v twinax workstation - Port 0 Address 0 v ASCII workstation - Port 0 v PC attached to ASCII IOP or IOA – Port 0 – PC software to emulate a 316x or 3151 terminalv PC attached to twinax IOP – Port 0 Address 0 – 5250 emulator software active on PCv PC attached to a LocalTalk IOA (6054) – SNAps 5250 Version 1.2 (or above) application – Console capable selected on MacIntosh (IOA converts to Port 0 Address 0)v PC attached to a 2609, 2612, 2699, or 2721 communications IOA – Client Access Console cable attached to the 2609 or 2612 P2 port (part number 46G0450 or 46G0479), 2699 (part number 21H3779), or 2721 (part number 44H7504) Chapter 7. Common Service Procedures 667 – Operations Console cable attached to the 2609 or 2612 (part number 97H7555), 2699 (part number 97H7556), or 2721 (part number 97H7557) - 5250 emulation or Rumba active on PCFind the primary console when the system is operational Learn how to find the primary console when the system is operational. The following methods are available to find the primary console: v Method 1: Look for a sign-on display with a DSP01 in the upper-right corner. v Method 2: If the device name (DSP01) for the console has been changed, you can verify the device name for the primary console by doing the following: 1. Enter DSPCTLD QCTL on any command line. The Display Controller Description display is displayed. 2. Find the Resource name parameter (such as CTL01) and record it. 3. Enter PRTDEVADR rrrrr on any command line, where rrrr is the resource name you recorded.Note: If the printer is active, the data will print. v Method 3: 1. Enter STRSST on any command line. The System Service Tools display is displayed. 2. Select option 1 (Start a service tool). 3. Select option 7 (Hardware service manager). 4. Select option 2 (Logical hardware resources). 5. Select option 1 (System bus resources). On the Logical Hardware Resources on System Bus display, the ? Service and Support > Hardware > Exchange FRU. 3. Select an installed enclosure type from the drop-down list. 4. From the list of FRU types for this enclosure, select a FRU type. 5. Click Next to display a list of locations for the FRU type. 6. Select the location code for a specific FRU. 7. Click Add to add the FRU location to Pending Actions. 8. Click Launch Procedure to begin replacing the FRUs listed in Pending Actions. 9. Click Finish when you have completed the FRU installation. Non-management console-based procedures Use the removal and replacement procedures when you repair, maintain, or exchange your system parts. The removal and replacement procedures are divided into field replaceable units (FRUs) and customer replaceable units (CRUs). CRUs can be removed and replaced by customers, but FRUs can only be removed and replaced by an IBM authorized service representative. Use the following procedures after the server has identified a failing part. The server information details how to activate and deactivate LEDs. After the part or parts are replaced, refer to the server's documentation to verify the repair, and close the service call. Battery Use this procedure to service the battery. 1. Access the ASMI © Copyright IBM Corp. 2010 1 The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, see Accessing the Advanced System Management Interface. When you have accessed the ASMI, continue with the next step. 2. Check and record the server firmware and service processor settings 1. Check and record the admin user ID's password. 2. Check and record the server firmware level prior to replacing this field replaceable unit (FRU). You might need to update the system to the latest server firmware code level after you replace this FRU. 3. Check the service processor settings that you previously set using Advanced System Management (ASM). This can be done by using a management console or a PC to access ASM. You need to reset these service processor settings after you replace this FRU or the default settings will be used. 4. Record any service processor settings you might have set by using operating system commands: v System Name setting v System Power Control settings v System Service Aids settings v System Configuration settings v Network Services settings v Performance Option settings v Login Profile settings v Boot List settings v Host Ethernet Adapter settings 3. Save the partition data. Use the management console to save the partition data. For more information, see Backing up partition profile data. 4. Power off the unit Power should be removed from the unit that contains the battery. To power off the unit, complete the following steps. 1. If the system is running, stop the system as described in Stopping the system or logical partition. 2. If this unit is powered on, power it off now. 5. Disconnect the power cords 2 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures To disconnect the power cords, complete the following steps: 1. Open the rear rack door. 2. Disconnect all power cords from the unit you are servicing. 6. Remove the front cover 1. Open the front rack door. 2. Remove the two thumbscrews (A) that secure the front cover to the system unit. 3. Push the cover release latches (C) in the direction of the arrow to release the cover from the system unit. 4. Pull the cover out and away from the system unit. 7. Place the unit into the service position Removal and replacement procedures for the 8233-E8B and 8236-E8C 3 Attention: When placing the model rack into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. Remove two screws (A) securing the system unit (B) to the rack. 8. Place the unit into the service position (continued) Notes: 1. Ensure that the cables at the rear of the system unit do not catch or bind as you pull the unit forward in the rack. 2. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. While holding the system-unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 9. Remove the service access cover 4 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Loosen the two thumbscrews (A) located at the rear of the cover. 2. Slide the cover (B) toward the rear of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Attention: For proper cooling and airflow, replace the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 10. Remove the front fan 1. While squeezing the latch (A) on the front right fan, grasp the fan firmly and pull the fan out of the fan cage. 2. Repeat these steps until all system fans are removed from the fan cage. 11. Remove the air baffle Removal and replacement procedures for the 8233-E8B and 8236-E8C 5 Grasp the right side air baffle and pull it out of the fan cage. 12. Remove rear fan. 1. While squeezing the latch (A) on the rear right fan, grasp the fan firmly and pull the fan out of the fan cage. 13. Remove the time-of-day battery 6 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Remove the time-of-day battery (A) from the system backplane. CAUTION: The battery contains lithium. To avoid possible explosion, do not burn or charge the battery. Do not: v Throw or immerse into water. v Heat to more than 100°C (212°F). v Repair or disassemble. Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local regulations. 14. Install the time-of-day battery CAUTION: Do not use conductive tools to insert the battery. To avoid corrosion, use latex or nitril gloves to touch the battery. If tools are needed, use plastic tools to avoid electrical shorts on the battery. Install the time-of-day battery (A) in the slot on the system backplane. 15. Install the rear fan. Removal and replacement procedures for the 8233-E8B and 8236-E8C 7 1. Lower the rear fan into its location in the fan cage. 2. Push down on the fan until it locks into place. 16. Install the air baffle 1. Lower the air baffle into its location in the fan cage, ensuring that the arrows on the air baffle point toward the front of the system. 2. Push down on the air baffle until it locks into place. 17. Install the front fan 8 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Lower the fan into its location in the fan cage. 2. Push down on the fan until it locks into place. 18. Install the service access cover 1. Place the service access cover (A) on the top of the system, about 25 mm (1 in.) from the front of the system. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the rear of the service access cover with the two holes on the rear of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 19. Place the unit in the operating position Removal and replacement procedures for the 8233-E8B and 8236-E8C 9 1. To unlock the rail safety latches (B), slide the latches toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches lock into position. Note: Ensure that the cables at the rear of the system do not catch or bind as you push the system unit back into the rack. 20. Place the unit in the operating position (continued) Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 21. Install the front cover 10 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Position the cover on the front of the system unit so that the two thumbscrews (C) align with the screw holes on the front of the system unit (A). 2. Push the cover release latches (B) in the direction of the arrows to attach the cover to the front of the system unit as shown in the figure. 3. Tighten the thumbscrews. 4. Close the front rack door. 22. Connect the power cords If you removed the power cords, connect all power cords to the unit you are servicing. 23. Power on the system 24. Restore network connectivity Removal and replacement procedures for the 8233-E8B and 8236-E8C 11 The installed service processor must connect to the network to be accessible from the management console. How will the IP address of the service processor be managed? Obtain an IP address automatically from a DHCP server Configure a static IP address ? Go to step 26. 25. Restore service processor settings Note: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address, and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to enable network access to the service processor. To enable network access to the service processor, complete the following steps. 1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is connected to the HMC port on the service processor assembly. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Can you access the service network? Yes No Go to step 28. Go to step 26. 26. Restore service processor settings If the network connection uses static IP address assignments, complete the following steps: 1. Connect a client with a Web browser directly to the service processor network port using one of the following URLs: v https://169.254.2.147 v https://169.254.3.147 Note: For additional information on IP address assignments, see Accessing the ASMI using a PC or notebook and Web browser. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. 3. Log in to Advanced System Management (ASM) with the user ID admin and the default password admin. 4. Change the admin user ID's password and the general user ID's password when prompted. 5. Configure network access using the static IP address. 27. Restore service processor settings With the network connection now configured to use static IP address assignments, access the service network. Can you access the service network? Yes No ? Go to step 29. 28. Access the ASMI 12 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, see Accessing the Advanced System Management Interface. When you have accessed the ASMI, continue with the next step. 29. Go to step 33. 30. Restore service processor settings If the network connection uses DHCP to establish an IP address, but does not use a management console as the DHCP server, do the following steps: 1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address to the service processor. 2. Ensure that the service processor is connected to the existing service network by verifying that the network cable is connected to the network port on the service processor assembly. 3. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Can you access the service network? No Yes ? Go to step 28 on page 12. 31. Restore service processor settings You have indicated that you are still not able to access the service network. You will need to reset the service processor network interfaces (and passwords) by moving the reset jumpers. 32. Restore service processor settings Go to step 24 on page 11. 33. Restore service processor settings Change the admin user ID's password when prompted by following these steps: 1. In the navigation area of ASM, expand Login Profile. 2. Select Change Password. 3. Specify the required information, and click Continue. Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the administrator ID is admin. After your initial login to ASM, the administrator passwords must be changed. As a security measure, you are required to enter the password for your user ID in the Current password for current user field. 34. Restore service processor settings Removal and replacement procedures for the 8233-E8B and 8236-E8C 13 Set the system name. The system name can be changed to any valid ASCII string. It does not have to follow the initialized machine type, model number, and serial number format. To change the system name, complete the followings steps: 1. In the navigation area of ASM, expand System Configuration. 2. Select System Name. 3. Enter the desired system name using the previous naming convention. 4. Click Save settings to update the system name to the new value. The new system name is displayed in the status frame, the area where the logout button is located. If another method, such as the management console, is used to change the system name, the status frame does not reflect the change. 35. Restore service processor settings Set the time of day by completing these steps: 1. In the navigation area of ASM, expand System Configuration. 2. Select Time of Day. If the system is powered off, the right pane displays a form that shows the current date (month, day, and year) and time (hours, minutes, and seconds). 3. Change the date value, the time value, or both, and click Update Time Of Day. 36. Restore service processor settings Reenter any of the following settings that you previously changed through ASM, unless you want to use the default settings: 1. System Power settings 2. ASMI Service Aids settings 3. System Configuration settings 4. Network Services settings 5. Login Profile settings 37. Restore service processor settings 14 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Reset any service processor settings that you might have set by using the operating system commands. You recorded these settings at the beginning of this procedure. 2. If you reset the management console access password, use one of the following methods: For HMC: a. From the HMC GUI (preferred method): 1) Expand the Systems Management folder in the navigation tree pane. 2) Double click the Servers folder. 3) Select a server in the central panel. 4) Under the Operations folder, select Change Password. 5) Provide the desired information and click OK. b. From the HMC command line, type chsyspwd -m managed system -t access --passwd --newpasswd newpassword where: v The value for managed systemis the new service processor's managed system name. v No value for --passwd is entered thereby allowing authentication. v The value for newpassword is the new password value. For SDMC: a. Go to the Resources tab and click Hosts. b. Select a server in the right pane. c. On the Actions menu, click Operations > Change Password. d. Provide the desired information and click OK. 38. Recover partition data on the service processor The profile data that is stored in the managed server has been cleared or corrupted. To recover profile data by using the HMC, complete the following steps: 1. Expand the Systems Management folder in the navigation tree pane. 2. Double click the Servers folder. 3. Use the check box to select a server, in the central panel. 4. In the Configuration folder if you have the Manage Partition Data folder, select Restore. To recover profile data using the SDMC, complete the following steps: 1. Go to the Resources tab and click Hosts. 2. Select a server in the right pane. 3. On the Actions menu, click System Configuration > Manage Partition Data > Restore. 4. Select the desired restore option and click OK. 39. Restore boot list and Host Ethernet Adapter settings. The boot list might have been erased when the battery was replaced. Verify, and set if necessary, the list of boot devices by booting into the system management services (SMS) menus. Refer to Selecting boot options for information on setting the boot device list. After the operating system is booted, the bootlist command can be used to verify, or make additional changes to, the list of boot devices for a system or partition. To restore the Host Ethernet Adapter settings, see Configuring physical ports on a Host Ethernet Adapter. 40. Verify the repair. Go to Verifying a repair. Removal and replacement procedures for the 8233-E8B and 8236-E8C 15 Removing and installing the cache battery pack To service the cache battery pack, complete the following procedure: 1. Begin with power on. To prevent data loss, the cache battery pack must be in an error state before it is removed from the system. If you have powered the system off, power it on before you continue. 2. Select your operating system: v AIX: Go to step 3 v Linux: Go to step 5 v IBM i: Go to step 7 3. Force the cache battery pack into an error state on AIX® (to prevent possible data loss), as follows: a. Navigate to the IBM SAS Disk Array Manager by using the following steps: 1) At the command prompt, type smit, and press Enter. 2) Select Devices. 3) Select Disk Array. 4) Select IBM SAS Disk Array. 5) Select IBM SAS Disk Array Manager from the menu with options for configuring and managing the IBM SAS RAID Controller. b. Select Diagnostics and Recovery Options. c. Select Controller Rechargeable Battery Maintenance. d. Select Force Controller Rechargeable Battery Error. e. Select the IBM SAS RAID Controller whose battery you want to replace. f. This option places the battery into the error state, which requires it to be replaced. g. Determine that it is safe to replace the cache battery pack. See the following Displaying Rechargeable Battery Information procedure. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced. You might need to reselect the option Display Controller Rechargeable Battery Information multiple times as it might take several minutes before it is safe to replace the cache battery pack. h. Verify that the Cache Present LED is not flashing. Displaying Rechargeable Battery Information a. Navigate to the IBM SAS Disk Array Manager by using the previous procedure. b. Select Diagnostics and Recovery Options. c. Select Controller Rechargeable Battery Maintenance. d. Select Display Controller Rechargeable Battery Information. e. Select IBM SAS RAID Controller. 4. Go to step 8. 5. Force the cache battery pack into an error state on Linux (to prevent possible data loss), as follows: a. Run the iprconfig utility by typing iprconfig. b. Select Work with disk unit recovery. c. Select Work with resources containing cache battery packs. d. Select your adapter and type 2. Then press Enter to force the battery error. Note: This option places the battery into the error state, which requires it to be replaced. e. If you are sure you want to force a battery error, type c to confirm. If you do not want to force a battery error, type q to cancel. f. Determine that it is safe to replace the cache battery pack. See Displaying Rechargeable Battery Information. 16 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures g. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced. You might need to reselect the option Display Controller Rechargeable Battery Information multiple times as it might take several minutes before it is safe to replace the cache battery pack. 6. Go to step 8. 7. Force the cache battery pack into an error state on IBM i (to prevent possible data loss), as follows on the system or partition that contains the adapter: a. Be sure that you are signed on to the system with at least service level authority. b. Type strsst on the command line and press Enter. c. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. d. Select Start a Service Tool from the System Service Tools (SST) display and press Enter. e. Select Hardware Service Manager from the Start a Service Tool display and press Enter. f. Select Work with resources containing cache battery packs from the Hardware Service Manager display and press Enter. g. Select Force battery pack into error state for the I/O card that you are working with from the Work with Resources containing cache battery packs display and press Enter. h. On the Force Battery Packs Into Error State display, verify that the correct I/O adapter has been selected and press the function key that confirms your choice. i. Return to the Work with Resources containing cache battery packs display and select Display battery information. Ensure that the field Safe to replace cache battery is set to Yes. This might take several minutes and you might need to press the refresh key to see the field update. 8. Place a rack-mounted system into service position. Go to Placing a 8233-E8B or 8236-E8C in the service position 9. Remove the service access cover. Go to Removing the service access cover from an 8233-E8B or 8236-E8C system 10. Check cache battery LED by examining the LED on the cache battery pack (C). Removal and replacement procedures for the 8233-E8B and 8236-E8C 17 Is the cache battery LED flashing? Yes, the LED is flashing No, the LED is not flashing. ? Go to step 12. 11. Stop removing the cache battery pack The flashing LED indicates that there is cached data on the battery that has not been saved. This data will be lost if the battery is removed at this time. Complete this service procedure without replacing the battery, and then retry it to ensure all cache data is saved before the battery is replaced. Go to step 18. 12. Remove the cache battery pack as follows: 18 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Attention: Do not remove the cache battery if the LED (C) is flashing because this indicates cache data exists. a. Move the cache battery lever (A) away from the connector to disengage the battery from the connector. b. Slide the cache battery pack (B) out of the mounting guides and remove it from the controller. 13. Install the cache battery pack as follows: Removal and replacement procedures for the 8233-E8B and 8236-E8C 19 Attention: Ensure that the cache battery pack is disconnected for at least 60 seconds before you connect the new battery. This is the minimum amount of time needed for the card to recognize that the battery has been replaced. a. Slide the cache battery pack (B) into the mounting guides on the controller until it is seated in the battery connector (C). b. After the battery is seated in the connector, move the lever (A) to the latched position to fully seat the battery into the connector. 14. Restart the adapter's write cache by doing the following steps: Select your operating system: v AIX: Go to step 15 v Linux: 16 v IBM i: Go to step 17 15. Perform the following steps: a. Navigate to the IBM SAS Disk Array Manager by using the information in step 3a. b. Select Diagnostics and Recovery Options. c. Select Controller Rechargeable Battery Maintenance. d. Select Start Adapter Cache. e. Select the controller with the battery you just replaced and press Enter. f. Go to step 18. 16. Perform the following steps: a. Run the iprconfig utility by typing iprconfig. b. Select Work with disk unit recovery. c. Select Work with resources containing cache battery packs. d. Start the I/O adapter cache on the adapter for the battery that you just replaced by typing 3 and press Enter. 20 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures e. Go to step 18. 17. Perform the following steps: a. Return to the Work with Resources containing Cache Battery Packs display using the information in step 7a to 7f and select the Start IOA cache. Press Enter. b. Ensure that you get the message Cache was started. c. Go to step 18. 18. Install the service access cover. Go to Installing the service access cover on an 8233-E8B or 8236-E8C system 19. Place into operating position. Go to Placing the 8233-E8B or 8236-E8C in the operating position 20. Close the back door on the unit you are servicing. 21. Verify that the new resource is working correctly, as described in Verifying the installed part. Control panel and signal cable You can remove or replace the control panel (operator panel) or signal cable in the server. Removing the control panel with the power off These procedures describe the removal of the control panel with the power off. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the control panel from the server. For instructions, see Removing a part by using the Systems Director Management Console. If you do not have an SDMC, complete this procedure to remove a control panel. If your system is managed by the Hardware Management Console (HMC), you can use the HMC to complete the steps for removing a control panel. For instructions, see Removing a part by using the Hardware Management Console. If you do not have an HMC, complete this procedure to remove a control panel. To remove the control panel with the power off, complete the following procedure: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Stop the system. For instructions, see Stopping a system or logical partition. 3. Disconnect the power source from the system by unplugging the system. Note: This system is equipped with a second power supply. Before continuing with this procedure, disconnect all the power sources to the system. 4. Unlock the control panel by pressing release button (A), and pull the control panel out of the system, as shown in the following figure. Remove the control panel by pressing the locking tabs (C) located on either side of the panel, grasping the edges of the panel, and pulling the panel out of its bay. Removal and replacement procedures for the 8233-E8B and 8236-E8C 21 5. Disconnect the control-panel signal cable (B) from the back of the control panel. 6. If you are removing the control panel as part of another procedure, return to that procedure now. 7. Replace the control panel with the power off. For instructions, see Replacing the 8233-E8B or 8236-E8C control panel with the power off. Replacing the control panel with the power off These procedures describe the replacement of the control panel with the power off. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the control panel in the server. For instructions, see Replacing a part by using the Systems Director Management Console. If you do not have an SDMC, complete this procedure to replace a control panel. If your system is managed by the Hardware Management Console (HMC), you can use the HMC to complete the steps for replacing a control panel. For instructions, see Replacing a part by using the Hardware Management Console. If you do not have an HMC, complete this procedure to replace a control panel. To replace the control panel with the power off, complete the following procedure: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Remove the control panel with power off. For more information, see Removing the 8233-E8B or 8236-E8C control panel with the power off. 3. If necessary, remove the new control panel from its packaging. 4. Connect the control-panel signal cable (A) to the back of the control panel, as shown in the following figure. 5. Carefully slide the control panel into the control panel bay until you feel the locking tabs lock in place. Figure 1. Removing a control panel from a rack-mounted system 22 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Note: If the control panel does not lock in place, the media device cable might be in the path of the control panel signal cable. You need to move the media device cable out of the path of the control panel signal cable before replacing the control panel. To access the media device cable, you must remove the fans. 6. If you replaced the control panel as part of another procedure, return to that procedure now. If you replaced the control panel because it was not operational, verify that the new resource is functional. For instructions, see Verifying the installed part. 7. Start the system. For instructions, see Starting the system or logical partition. Note: If a ball or dot appears in the middle of the control panel, toggle between function 02 and function 01 until the ball or dot disappears. Removing and replacing the control panel signal cable These procedures describe the removal and replacement of the control-panel signal cable. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove and replace the control-panel signal cable. For instructions, see Removing a part by using the Systems Director Management Console and Replacing a part by using the Systems Director Management Console. If you do not have an SDMC, complete this procedure to remove and replace a control-panel signal cable. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing a control-panel signal cable. For instructions, see Replacing a part by using the Hardware Management Console. If you do not have an HMC, complete this procedure to remove and replace a control-panel signal cable. To replace the control-panel signal cable, complete the following procedure: Figure 2. Replacing a control panel in a rack-mounted system Removal and replacement procedures for the 8233-E8B and 8236-E8C 23 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Remove the disk-drive backplane. For instructions, see Removing the 8233-E8B or 8236-E8C disk drive backplane. 3. Remove and replace the control-panel signal cable for the 2.5-inch disk drive using the following instructions: a. To remove the control panel cable, perform the following step: Important: When removing the cable, note where the connections are and how the cable is routed. You can use this information to complete the replacement procedure. 1) Disconnect the signal cable (B) from port (A) on the back of the backplane and lift the cable out of the system. b. Install the replacement control-panel signal cable into the system by performing the following steps: 1) Attach the signal cable (B) to port (A) on the back of the backplane. Figure 3. 2.5-inch disk drive Figure 4. Removing a signal cable in a rack-mounted system 24 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 2) Route the signal cable to the front of the system. 4. Replace the control panel with the power off. For instructions, see Replacing the 8233-E8B or 8236-E8C control panel with the power off. Note: When installing a new signal cable, you can replace the control panel while the disk-drive backplane is still outside the system. After you have replaced the control panel, replace the disk-drive backplane as instructed in Disk drive backplanes. Disk drive backplane Use this procedure to service the disk drive backplane. Removing the disk drive backplane If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for removing the disk drive backplane from the server. For instructions, see Removing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing the disk drive backplane from the server. For information about using the HMC to remove the disk drive backplane, see Removing a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to remove the disk drive backplane: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Do one of the following steps, as applicable: a. If you are removing the disk drive backplane for a system upgrade or as part of another procedure, continue to the next step. b. If you are removing the disk drive backplane because of a system failure, use the service action log to help identify the failing part. See Identifying a part . 3. Stop the system or logical partition. For instructions, see Stopping a system or logical partition. 4. Disconnect the power source from the system by unplugging the system. Figure 5. Installing a signal cable in a rack-mounted system Removal and replacement procedures for the 8233-E8B and 8236-E8C 25 Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 5. Remove the front cover. For instructions, see Removing the front cover from an 8233-E8B or 8236-E8C system. 6. Place the system into the service position. For instructions, see Placing a 8233-E8B or 8236-E8C in the service position. 7. Remove the service access cover. For instructions, see Removing the service access cover from an 8233-E8B or 8236-E8C system. 8. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 9. To remove the fans and the fan cage, complete the following steps: a. To remove the fans, squeeze the tab (A) and lift the fan out of the fan cage as shown in Figure 6 on page 27. Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this step. To pull the fan cage out of the system, you might need to unplug the control panel cable to 26 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures allow more space. b. To remove the air baffles, grasp and pull the air baffle out of the fan cage, as shown in Figure 7 on page 28. Figure 6. Removing the fan from the fan cage Removal and replacement procedures for the 8233-E8B and 8236-E8C 27 c. To remove the fan cage, loosen the two thumbscrews (A) and lift the fan cage (B) out of the system as shown in Figure 8. Figure 7. Removing the air baffles from the fan cage Figure 8. Removing the fan cage from the system 28 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 10. Disconnect the cables from the back of the disk drive backplane, as shown in Figure 9. If you have connected an external SAS cable, ensure that you disconnect the SAS cable from the disk drive backplane. For more information about external SAS port, refer to Installing the external SAS port on a model 8233-E8B or 8236-E8C. 11. If you need to replace the backplane because of an upgrade or system failure, complete the following steps to remove the disk drives, depending on your backplane: Note: If you remove a backplane that is populated with disk drives, the backplane will be heavy. Ensure that you can safely complete the procedure. a. To remove the disk drive from the backplane, perform the following steps: Note: Record the location of each disk drive you remove so the disk drive can be replaced into the same location again. 1) Unlock the disk drive handle (A) and pull it toward you to unseat the disk drive. 2) Place one hand under the disk drive and carefully pull the drive out of the system as shown in Figure 10 on page 30. Figure 9. Disconnecting the media device cables from the disk drive backplane Removal and replacement procedures for the 8233-E8B and 8236-E8C 29 12. Remove the media device by pressing in on the latch (A) and sliding the media device out of the system, as shown in the following figure. Figure 10. Removing the disk drive from the backplane Figure 11. Removing the media device from the disk drive backplane 30 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 13. Unlock the disk drive backplane by pulling the handles open from the center of the backplane as shown in Figure 12 and Figure 13. 14. Pull the backplane out of the server. 15. Remove the control panel and signal cable from the backplane to use in the replacement backplane. For instructions, see Control panel and signal cables. Figure 12. Removing the populated disk drive backplane Figure 13. Removing the unpopulated disk drive backplane Removal and replacement procedures for the 8233-E8B and 8236-E8C 31 16. If you removed the backplane as a part of another procedure, continue with that procedure. To replace the backplane or install a new backplane, see Replacing the 8233-E8B or 8236-E8C disk drive backplane. Replacing the disk drive backplane If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for replacing the disk drive backplane in the server. For instructions, see Replacing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing the disk drive backplane in the server. For instructions, see Exchanging a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to replace the disk drive backplane: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Install the control panel and signal cable. For instructions, see Control panel and signal cables. 3. Check for a protective cover that might be present on the lower-right connector pins. If a protective cover is present, remove the cover (A), as shown in Figure 14. 4. Carefully lift the backplane along two sides and align it with the slot in the server. Important: If you remove a backplane that is populated with disk drives, the backplane will be heavy. Ensure that you can safely perform this task before you begin. 5. Insert the backplane firmly into the server, as shown in the following figures. Figure 14. Removing the protective cover from the connector pins 32 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 6. Secure the backplane with the locking tabs. 7. Connect the cables to the backplane, as shown in the following figure. You can also connect an external SAS cable to the backplane. For more information about external SAS port, refer to Installing the external SAS port on a model 8233-E8B or 8236-E8C. Figure 15. Installing the unpopulated disk drive backplane on a rack-mounted model Figure 16. Installing the populated disk drive backplane on a rack-mounted model Removal and replacement procedures for the 8233-E8B and 8236-E8C 33 8. Replace the disk drives, if you removed them earlier. To replace the disk drives, perform the following steps: a. Hold the disk drive by the top and bottom edge as you position the drive and insert it into the expansion unit. Do not hold the disk drive by the handle. b. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in the following figure. Figure 17. Connecting the cables to the disk drive backplane 34 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 9. Replace the Slimline media device by inserting it into the slot and pushing it in until the lever (B) locks into place, as shown in the following figure. 10. Replace the fans, air baffles, and the fan cage. Figure 18. Installing the backplane disk drives Removal and replacement procedures for the 8233-E8B and 8236-E8C 35 a. To replace the fan cage (A), align the locator pins through the holes, and tighten the two thumbscrews (B), as shown in Figure 19. b. To install the air baffles, insert the air baffles into their locations in the fan cage, and push down until they lock into place, as shown in Figure 20 on page 37. Figure 19. Replacing the fan cage 36 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures c. To install the fans, insert the fans into their locations in the fan cage, and push down on each fan until it locks into place, as shown in Figure 21. Figure 20. Installing the air baffles in the fan cage Figure 21. Installing the fans in the fan cage Removal and replacement procedures for the 8233-E8B and 8236-E8C 37 11. Replace the service access cover. See Installing the service access cover on an 8233-E8B or 8236-E8C system. 12. Replace the front cover. See Installing the front cover on an 8233-E8B or 8236-E8C system. 13. Place the system in the operating position. For more information, see Placing the 8233-E8B or 8236-E8C in the operating position. 14. Start the system or logical partition. See Starting the system or logical partition. 15. Verify the installed part. For instructions, see Verifying the installed part. External cables Use this procedure to service the external cables except the service processor cable or SMP cable. 1. Remove the external cables Use the graphics in the locations table to locate the cable. Trace the cable and record the system location at the other end, and then remove the cable. 2. Install the external cables Use the graphic in the locations table to locate where the cable is to be installed. Install the cable, and then route and install the other end of the cable to the system location that was previously recorded. 3. Verify the repair Go to Verifying a repair. Fans Use this procedure to service the fan. Removing the fan For information about using the HMC to remove fans, see Removing a part by using the Hardware Management Console To remove the fan from a system that is not managed by the HMC, complete the following steps: 1. Complete the prerequisite tasks described in “Before you begin” on page 115. 2. Determine if the repair can continue concurrently. To continue the repair concurrently, the following conditions must be true: v Air moving devices (AMDs) must be installed in all four positions. Each of these four AMD units has a green LED and an amber LED. The green LED will be on for all four of the fans, but the fan with the failure on it will have the amber fault LED lit. v If only amber fault LED is on, the repair can be performed concurrently. 38 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Note: If more than one amber fault LED is on then the repair must be done nonconcurrently. 3. Select an action Nonconcurrent repair. Power off the unit and continue the repair. Concurrent repair. Continue the repair with unit power on. Delay the repair. ? Go to step 9 on page 40. This ends the procedure. 4. Stop the system, as described in Stopping a system or logical partition. 5. Disconnect all power cords from the unit you are servicing. 6. If you have a rack-mounted system, place it in the service position, as described in Placing a 8233-E8B or 8236-E8C in the service position. 7. Remove the service access cover, as described in Removing the service access cover from an 8233-E8B or 8236-E8C system. Removal and replacement procedures for the 8233-E8B and 8236-E8C 39 8. Attach the wrist strap Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 9. Remove the fan a. Locate the failing system fan to be replaced. It is the fan indicated by the amber fault LED being on. b. While squeezing the latch (A), reach through the hole of the failing fan and pull the fan out of the fan cage. If you removed the fan as part of another service action, or if you need to replace the fan that you removed, see Replacing the 8233-E8B or 8236-E8C system unit fan. Replacing the fan Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following Web sites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Fix Central. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. 40 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures If your system is managed by the HMC, use the HMC to complete the steps for replacing the fan in the server. For information about using the HMC to replace fans, see Replacing a part by using the Hardware Management Console. If you do not have an HMC, complete the following steps to replace the fan. 1. Complete the prerequisite tasks described in “Before you begin” on page 115. 2. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 3. If necessary, remove the fan from its antistatic package. 4. Install the fan a. Lower the fan into its location in the fan cage. Orient the fan so the latch is located towards the middle of the system. b. Push down on the fan until it locks into place. 5. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or 8236-E8C system. 6. Place the system in the operating position, as described in Placing the 8233-E8B or 8236-E8C in the operating position. 7. If you removed the power cords, connect all power cords to the unit you are servicing. 8. Verify the installed part, as described in Verifying the installed part. Removal and replacement procedures for the 8233-E8B and 8236-E8C 41 GX adapters Use this procedure to service the GX adapters. Removing GX adapters If your system is managed by the Hardware Management Console (HMC), use the HMC to remove an adapter. For instructions, see Removing a part by using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for removing an adapter. For instructions, see Removing a part by using the Systems Director Management Console. To remove the adapter from a system that is not managed by the HMC or SDMC, complete the following steps: 1. Complete the prerequisite tasks described in “Before you begin” on page 115; however do not disconnect the power cords at this time. 2. Stop the system, as described in Stopping a system or logical partition. 3. If you are working on a rack-mounted system unit, open the rear rack door. 4. Disconnect the power source from the system by unplugging the system. Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. If necessary, disconnect any cables connected to the adapter at the rear of the system. Tip: Label the cables before removing them from the adapter. 6. Place the system in the service position, as described in Placing a 8233-E8B or 8236-E8C in the service position. 7. Remove the access cover, as described in Removing the service access cover from an 8233-E8B or 8236-E8C system. 8. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 9. Remove the existing adapter assembly by sliding the retainers (A) into the open position. Unlatch the blue locking latches (B) on the adapter assembly by pushing them together and pulling them up into the open position, and then lift the adapter out of the system. 42 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 10. Place an adapter or a filler panel in the expansion slot of the system unit for proper airflow and cooling: v To replace the adapter, see Replacing GX adapters in an 8233-E8B or 8236-E8C with power turned off. v If you are not replacing the adapter, place a filler panel in the slot and continue with the next step. 11. Seal the expansion slot by using an expansion slot cover. 12. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or 8236-E8C system. 13. If you are working on a rack-mounted system unit, ensure that the system cables are routed correctly through the cable-management arm. 14. If you are working on a rack-mounted system unit, place the system in the operating position as described in Placing the 8233-E8B or 8236-E8C in the operating position. 15. Reconnect all power and signal cables to their respective connectors. 16. Close the back rack door or the back system door. 17. Start the system, as described in Starting the system or logical partition. 18. Verify that the new resource is working correctly, as described in Verifying the installed part. Replace the GX adapter with the power turned off. Replacing GX adapters Note: Replacing this feature is a customer task. You can perform this task yourself, or contact a service provider to perform the task for you. You might be charged a fee by the service provider for this service. If your system is managed by the Hardware Management Console (HMC), use the HMC to replace an adapter. For instructions, see Exchanging a part by using the Hardware Management Console. Figure 22. Adapter removal from a rack-mounted system Removal and replacement procedures for the 8233-E8B and 8236-E8C 43 If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for replacing an adapter. For instructions, see Replacing a part by using the Systems Director Management Console. To replace the adapter on a system that is not managed by the HMC or SDMC, complete the following steps: 1. Remove the adapter from the system unit. 2. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 3. Carefully grasp the new adapter assembly by its top edge, and align the adapter assembly with the slot and its connector on the system backplane. 4. Press the adapter assembly firmly into its connector. When you install an adapter into the system, ensure that it is completely and correctly seated in its connector. 5. Seat the adapter. Push down evenly on both of the blue locking latches (A) at the same time until the latches snap into place, indicating that the adapter assembly is seated correctly and locked into position. Slide the retainers (B) into the closed position. 6. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or 8236-E8C system. Figure 23. Adapter installation in a rack-mounted system 44 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 7. If you are working on a rack-mounted system unit, ensure that the system cables are routed through the cable-management arm correctly. 8. If you are working on a rack-mounted system unit, place the system in the operating position as described in Placing the 8233-E8B or 8236-E8C in the operating position. 9. Reconnect all power and signal cables to their respective connectors. 10. Close the rear rack door or the rear system door. 11. Start the system, as described in Starting the system or logical partition. 12. Verify that the new resource is working correctly, as described in Verifying the installed part. Host Ethernet Adapter Use this procedure to service the Host Ethernet Adapter. Removing the Host Ethernet Adapter If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing the host Ethernet adapter from the server. For instructions, see Replacing a part by using the Hardware Management Console. Note: This procedure is for the internal Host Ethernet Adapter. For PCI Ethernet adapter information, see PCI Adapters. If you do not have an HMC, complete the following steps to remove the Host Ethernet Adapter: Note: Removing the Host Ethernet Adapter is a nonconcurrent service action. 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. If you are removing the Host Ethernet Adapter for a system upgrade or as part of another procedure, continue to the next step. If you are removing the Host Ethernet Adapter because of a system failure, use the service action log to help identify the failing part. For more information, see Identifying a part. 3. Stop the system. For instructions, see Stopping a system or logical partition. 4. Disconnect the power source from the system by unplugging the system. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been disconnected. (L003) or Removal and replacement procedures for the 8233-E8B and 8236-E8C 45 5. Place the system in the service position. For more information, see Placing a 8233-E8B or 8236-E8C in the service position. 6. Remove the service access cover. For more information, see Removing the service access cover from an 8233-E8B or 8236-E8C system. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 7. Locate the Ethernet adapter you want to remove as shown in the following figures. 8. Label and remove the Ethernet cables from the back of the system. Note cable connector locations for future reference. 9. Unlock the Ethernet adapter by pushing the latch (A) out. 10. Lift the Ethernet adapter (B) out of the connector using the blue handle, as shown in the following figures. 46 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 11. If you removed the adapter as a part of another procedure, continue with that procedure. To replace the adapter or install a new adapter, see Replacing the 8233-E8B or 8236-E8C Host Ethernet Adapter with the power off. Replacing the Host Ethernet Adapter To remove the Host Ethernet Adapter, see Removing the 8233-E8B or 8236-E8C Host Ethernet Adapter with the power off. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing the host Ethernet adapter in the server. For instructions, see Replacing a part by using the Hardware Management Console. If you do not have an HMC, complete the following steps to replace the Host Ethernet Adapter: Note: Replacing the Host Ethernet Adapter is a nonconcurrent service action. 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Carefully grasp the blue adapter handle. Align the adapter with the connector. Ensure that the hook on the tail stock of the adapter slips into the slot on the back of the system. 3. Insert the adapter (A) firmly into the connector by pushing down on the blue handle, as shown in the following figure. 4. Secure the adapter with the locking tabs (B). Figure 24. Removing a host Ethernet adapter ( P1-C6) for 8233-E8B or 8236-E8C Removal and replacement procedures for the 8233-E8B and 8236-E8C 47 5. Replace the service access cover. For more information, see Installing the service access cover on an 8233-E8B or 8236-E8C system . 6. Reconnect the Ethernet cables to the back of the system. 7. Place the system in the operating position. For instructions, see Placing the 8233-E8B or 8236-E8C in the operating position. 8. Verify the installed part. For instructions, see Verifying the installed part. Media devices Use this procedure to service media devices. Select the procedure that applies to the media device that needs to be serviced. v Removing and replacing a SAS media device v Removing and replacing a Slimline media device with the power on v Removing and replacing a universal serial bus disk drive v Installing an external universal serial bus disk drive SAS media device Use this procedure to service SAS media devices. Removing a SAS media device: If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for removing media devices from the server. For instructions, see Removing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing media devices from the server. For information about using the HMC to remove media devices, see Removing a part using the Hardware Management Console. Figure 25. Install the Host Ethernet Adapter on a rack-mounted model for 8233-E8B or 8236-E8C 48 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures If you do not have an HMC or SDMC, complete the following steps to remove the media device: Before you remove a media device, perform the prerequisite tasks described in Before you begin. To remove a SAS media device, complete the following steps: 1. Stop the media device and eject any media. 2. Power off the system. See Stopping the system or logical partition. 3. Disconnect the power cords from the system. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been disconnected. (L003) or 4. Remove the front cover. See Removing the front cover. 5. Place the system into the service position. See Placing the system into the service position. 6. Remove the service access cover. See Removing the service access cover. 7. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 8. Remove the fans and fan cage. Removal and replacement procedures for the 8233-E8B and 8236-E8C 49 Note: Although the following figures show a media device, you have already removed the media device filler and your system will have an empty space until you complete all the steps. a. Remove the fan by squeezing or lifting the tab (A), depending on your model, and lifting it out of the system as shown in the following figure. b. Remove the blue air baffles as shown in the following figure. Figure 26. Fan removal 50 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures c. Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage. Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this step. To pull the fan cage out of the system, you might need to unplug the control panel cable to allow more space. Figure 27. Air baffle removal Removal and replacement procedures for the 8233-E8B and 8236-E8C 51 9. If you are replacing the media device cable, disconnect the media-device connections from the system backplane. 10. Press the release latch (A) and unseat the media device (B) from the system as shown. Note: You need to remove the media cable (C) attached at the back of the device before removing the media device. Figure 28. Fan cage removal 52 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 11. Pull the media device from the system. 12. If you removed the media device as a part of another procedure, return to that procedure. 13. If you removed the media device to replace the media device, see Installing a SAS media device in 8233-E8B. Replacing a SAS media device: If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for installing a feature in the server. For instructions, see Removing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a feature in the server. For information about using the HMC to install a feature, see Installing a feature using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to install a media device from the system or partition that controls the media device: Before you install a SAS media device, perform the prerequisite tasks described in Before you begin. To install a SAS media device, do the following steps: 1. Power off the system. See Stopping the system or logical partition. 2. Disconnect the power cords from the system. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) Removal and replacement procedures for the 8233-E8B and 8236-E8C 53 or 3. Place the system into the service position. See Placing the system into the service position. 4. Remove the front cover. See Removing the front cover for instructions. 5. Remove the service access cover. See Removing the service access cover for instructions. 6. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 7. Remove the media device filler, if present, as shown in the following figure. 54 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 8. Install the rails (A) onto the media device using the two small screws (B) included with your media device. The rails must be placed on each side of the media device. Screw the rails into the side of the media device using the lower screw hole. 9. Push the media device (A) into the system until you feel the latches (B) lock in place as shown. Note: You need to attach the media cable (C) provided at the back of the device before installing the media device. Removal and replacement procedures for the 8233-E8B and 8236-E8C 55 10. Remove the fans and fan cage. Note: Although the following figures show a media device, you have already removed the media device filler and your system will have an empty space until you complete all the steps. a. Remove the fan by squeezing or lifting the tab (A), depending on your model, and lifting it out of the system as shown in the following figure. 56 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures b. Remove the blue air baffles as shown in the following figure. c. Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage. Figure 29. Fan removal Figure 30. Air baffle removal Removal and replacement procedures for the 8233-E8B and 8236-E8C 57 Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this step. To pull the fan cage out of the system, you might need to unplug the control panel cable to allow more space. 11. Connect the SAS cable to the drive. 12. Attach the media-device connections to the system backplane. 13. Reinstall the fans and fan cage. a. Replace the fan cage by inserting it into the system and tightening the thumbscrews (B). Figure 31. Fan cage removal 58 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures b. Replace the blue air baffles as shown in the following figure. c. Replace the fans by inserting them into the fan cage until they lock into place, as shown in the following figure. Removal and replacement procedures for the 8233-E8B and 8236-E8C 59 14. Install the service access cover. See Installing the service access cover on a rack-mounted system for instructions. 15. Attach the front cover. See Installing the front cover on the rack-mounted system. 16. Connect the power cords to the system. 17. Start the system. See Starting the system or logical partition. 18. Verify that the media device is installed and working correctly. See Verifying the installed part. Removing and replacing a Slimline media device with the power on Use this procedure to service Slimline media devices. Removing a Slimline media device with the power on: Before you remove a media device, perform the prerequisite tasks described in Before you begin. To remove a Slimline media device, complete the following steps: 1. Remove any media from the drive. 2. Remove the front cover. See Removing the front cover from a rack-mounted system. 3. Press the retaining tab (A) away from the Slimline media device until it is unseated from the media bay as shown. 60 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 4. Pull the Slimline media device (B) away from the system. Replacing a Slimline media device with the power on: If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for installing a feature in the server. For instructions, see Removing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a feature in the server. For information about using the HMC to install a feature, see Installing a feature using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to install a media device from the system or partition that controls the media device: Note: The 8233-E8B and 8236-E8C support only one Slimline media device per processor. To install a Slimline media device for models 8233-E8B and 8236-E8C with the system or partition powered off, follow these steps from the system or partition that controls the media device: 1. Stop the system. For instructions, see http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/ topic/p7haj/crustopsys.htm. If you have created an IBM i logical partition on your system, see Logical partitioning. To find instructions for powering off a system with an IBM i logical partition, see Shutting down IBM i logical partitions. Attention: Media devices are fragile. Handle with care. 2. Remove the system unit front cover. For instructions, see Removing the front cover from a rack-mounted system. 3. Find the package that contains the new media device and remove it from the static-protective package. Removal and replacement procedures for the 8233-E8B and 8236-E8C 61 Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 4. Remove the media-device filler, if present. 5. Align the media device with the Slimline media bay, and support the bottom of the device as you slide it halfway into the system. 6. Push the device (A) fully into the system as shown in the following figure. To ensure that the device is held firmly in place, push in the plastic retaining tab (B). 7. Start the system. For instructions on starting the system, see Starting the system or logical partition. 8. Verify that the new resource is functional. See Verifying the installed part. 9. Replace the front cover on the system. For instructions, see Installing the front cover on the rack-mounted system. 10. Close the front rack door, if this is a rack-mounted model. Removing and replacing a USB disk drive Use this procedure to service USB media devices. Removing a USB disk drive: Before you remove a media device, perform the prerequisite tasks described in Before you begin. Figure 32. Installation of a Slimline media device 62 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures To remove a USB disk drive media device, complete the following steps: 1. Power off the system. See Stopping the system or logical partition. 2. Disconnect the power cords from the system. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been disconnected. (L003) or 3. Remove the front cover. See Removing the front cover. 4. Place the system into the service position. See Placing the system into the service position. 5. Remove the service access cover. See Removing the service access cover. 6. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 7. Remove the fans. a. Remove the fan by squeezing the tab (A) and lifting it out of the system as shown in the following figure. Removal and replacement procedures for the 8233-E8B and 8236-E8C 63 b. Remove the blue air baffles as shown in the following figure. 8. Remove the fans and fan cage. Figure 33. Fan removal Figure 34. Air baffle removal 64 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Note: Although the following figures show a media device, you have already removed the media device filler and your system will have an empty space until you complete all the steps. a. Remove the fan by squeezing or lifting the tab (A) and lifting it out of the system, as shown in the following figure. Remove the blue air baffles as shown in the following figure. Figure 35. Fan removal Removal and replacement procedures for the 8233-E8B and 8236-E8C 65 Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage. Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this step. To pull the fan cage out of the system, you might need to unplug the control panel cable to allow more space. Figure 36. Air baffle removal 66 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 9. Disconnect the media-device connections from the system backplane. Note: v When removing cables, ensure that you release any cable locks. Figure 37. Fan cage removal Removal and replacement procedures for the 8233-E8B and 8236-E8C 67 v Optional: If you do not intend to replace the USB disk drive, you can reconnect the control panel USB cable and media connections. 10. Reattach any other cables that you previously disconnected from the backplane. 11. Reinstall the fans into the 8233-E8B or 8236-E8C system. a. Replace the fan cage by inserting it into the system and tightening the thumbscrews (B). Replace the blue air baffles as shown in the following figure. 68 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Replace the fans by inserting them into the fan cage until they lock into place, as shown in the following figure. 12. Press the release latch (A) and unseat the media device (B) from the system as shown. Note: You need to remove the media cables (A) and (B) provided at the back of the device before removing the media device. Removal and replacement procedures for the 8233-E8B and 8236-E8C 69 13. Pull the media device from the system. 14. If you removed the media device as a part of another procedure, return to that procedure. 15. If you removed the media device to replace the media device, see Installing a universal serial bus disk drive in 8233-E8B . Replacing a USB disk drive: If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for installing a feature in the server. For instructions, see Removing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a feature in the server. For information about using the HMC to install a feature, see Installing a feature using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to install a USB disk drive: Before you install a USB disk drive, perform the prerequisite tasks described in http:// publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/beforebegin.htm. To install a USB disk drive, do the following steps: 1. Power off the system. See http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/ p7haj/crustopsys.htm. 2. Disconnect the power cords from the system. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been disconnected. (L003) or 70 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 3. Place the system into the service position. See Placing the system into the service position. 4. Remove the front cover. See Removing the front cover for instructions. 5. Remove the service access cover. See Removing the service access cover for instructions. 6. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 7. Remove the media device filler, if present, as shown in the following figure. 8. Remove the fans and fan cage. Note: Although the following figures show a media device, you have already removed the media device filler and your system will have an empty space until you complete all the steps. a. Remove the fan by squeezing or lifting the tab (A) and lifting it out of the system, as shown in the following figure. Removal and replacement procedures for the 8233-E8B and 8236-E8C 71 Remove the blue air baffles as shown in the following figure. Remove the fan cage (B) by unscrewing the thumbscrews (A) and lifting out the fan cage. Figure 38. Fan removal Figure 39. Air baffle removal 72 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Note: If the thumbscrews are too tight, you might need a flat-head screwdriver to complete this step. To pull the fan cage out of the system, you might need to unplug the control panel cable to allow more space. 9. Install the rails (A) onto the media device using the two small screws (B) included with your media device. Place the rails on each side of the media device. Screw the rails into the side of the media device using the lower screw hole. 10. Press the media device (A) into the system until you feel the latches (B) lock in place as shown. Figure 40. Fan cage removal Removal and replacement procedures for the 8233-E8B and 8236-E8C 73 Note: You need to attach the media cables (A) and (B) provided at the back of the device before installing the media device. 11. Attach the media-device connections (A) and (B) to the system backplane. The connection for cable (A) on the backplane is already occupied. Remove the existing cable and replace it with cable (A). Note: The USB connection on the front panel is now inoperative. 74 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 12. Reattach any other cables that you previously disconnected from the backplane. 13. Reinstall the fans into the 8233-E8B or 8236-E8C system. a. Replace the fan cage by inserting it into the system and tightening the thumbscrews (B). Replace the blue air baffles as shown in the following figure. Figure 41. Attaching the media device connections to the system backplane of a 8233-E8B or 8236-E8C system Removal and replacement procedures for the 8233-E8B and 8236-E8C 75 Replace the fans by inserting them into the fan cage until they lock into place, as shown in the following figure. 14. Install the service access cover. See Installing the service access cover on a rack-mounted model for instructions. 15. Attach the front cover. See Installing the front cover on the rack-mounted system. 16. Connect the power cords to the system. 17. Start the system. See Starting the system or logical partition. 76 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 18. Configure the removable disk drive by performing the following steps: a. For the AIX operating system, log in as root user. b. At the command line, type cfgmgr. c. Press Enter. d. To verify that the system recognizes the device, run the lsdev -Cc usbms command. Note: The Linux operating system automatically configures the drive as a disk drive with a name in the format sdx, for example, sda, sdb, and sdc. To verify if the system recognizes the device, enter lsusb. To find the device that is associated with the USB disk drive, enter lsscsi. 19. Verify that the media device is installed and working correctly. See Verifying the installed part. Installing an external USB docking station and removable disk drive with power on Use this procedure to service external USB media devices. Keep in mind the following points when you install the USB external docking station for a removable disk drive: v A rack shelf is not included with feature code 1104. v The docking station can be rested on a flat surface in a rack or on a table top next to a rack. v Ensure that the device is on a flat surface, is right side up, and is not likely to be bumped, dropped, or otherwise damaged or jolted. v If the docking station is placed by itself on a rack shelf, it has space on the sides of the device. No filler panels are provided with the docking station. v If the docking station is placed on a rack shelf, ensure that the docking station has sufficient air flow. Also ensure that it does not affect the air flow of the other systems in the rack. v Do not block the fan on the back of the docking station. v The external USB disk drive can be installed while the system is powered on and needs to be configured following the installation. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for installing a media enclosure. For instructions, see Removing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a feature in the server. For information about using the HMC for installing a media enclosure, see Installing a feature using the Hardware Management Console. If you do not have an HMC or SDMC, install the docking station by completing the following steps: To install the docking station, complete the following steps: 1. Place the docking station on a flat service. Use the previous guidelines when selecting a location. 2. Attach one end of the external USB cable (F) to the back of the external docking station. Removal and replacement procedures for the 8233-E8B and 8236-E8C 77 3. Attach the other end of the external USB cable (F) to an external, integrated USB port or to the USB ports on a four-port USB PCI Express adapter (feature code 2728). 4. Attach the power supply cord (E) to the back of the external docking station and plug it into a power source. In addition to the external power supply cord, there are also universal adapters to be used as needed. 5. After the docking station is powered on, insert the disk drive into the docking station. A green indicator light appears when the disk drive is placed into the dock correctly. 6. Configure the removable disk drive by performing the following steps: a. For the AIX operating system, log in as root user. b. At the command line, type cfgmgr. c. Press Enter. d. To verify that the system recognizes the device, run the lsdev -Cc usbms command. Note: The Linux operating system automatically configures the drive as a disk drive with a name in the format sdx, for example, sda, sdb, and sdc. To verify if the system recognizes the device, enter lsusb. To find the device that is associated with the USB disk drive, enter lsscsi. 7. Verify that the media device is installed and working correctly. See Verifying the installed part. Memory modules Use this procedure to service dual inline memory modules (DIMMs). Removing memory modules If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing memory modules from the server. For information about using the HMC to remove memory modules, see Removing a part by using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the memory module from the server. For instructions, see Removing a part by using the Systems Director Management Console. If you do not have an HMC or SDMC, complete the following procedure to remove memory modules. 1. Complete the prerequisite tasks described in “Before you begin” on page 115. 2. If you are removing the memory modules for a system upgrade or as part of another procedure, continue to the next step. If you are removing the memory modules because of a system failure, use the service action log to help identify the failing part as described in Identifying a part. Identification light-emitting diodes (LEDs) for failing memory modules are located on the top of the system processor assembly. 3. Stop the system, as described in Stopping a system or logical partition. 4. Disconnect the power source from the system by unplugging the system. 78 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 5. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 6. If you have a rack-mounted system, place it in the service position, as described in Placing a 8233-E8B or 8236-E8C in the service position. 7. Remove the service access cover, as described in Removing the service access cover from an 8233-E8B or 8236-E8C system. 8. Locate the memory module that you want to remove by observing the service indicator light-emitting diodes (LEDs). 9. Remove the system processor assembly and protective cover, as described in Removing the system processor assembly. Skip any steps that you have already completed. 10. Unlock the memory module cover by pushing the locking tab (A) down, as shown in Figure 42 on page 80. Lift the cover up to access the memory modules. Removal and replacement procedures for the 8233-E8B and 8236-E8C 79 11. Unlock the memory module by pushing the locking tabs (B) out and down, as shown in Figure 43. Lift the memory module (C) out of the connector. The lever action of the tabs forces the memory module out of the connector. Finally, replace the memory module you removed or install new memory modules. Figure 42. Removing a memory module Figure 43. Removing a memory module 80 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Replacing memory modules Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following web sites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Fix Central. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the server. For information about using the HMC to install a feature, see Installing a feature by using the Hardware Management Console. If you do not have an HMC, complete this procedure to install memory modules. If your system is managed by the HMC, use the HMC to complete the steps for replacing the memory modules in the server. For information about using the HMC to replace memory modules, see Replacing a part by using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the memory module in the server. For instructions, see Replacing a part by using the Systems Director Management Console. If you do not have an HMC or SDMC, use the following procedure to replace a memory module. Restriction: Use these instructions only as a part of removing and replacing memory modules that are identical in size, speed, and feature code. If you are installing new memory or upgrading memory, see Installing 8233-E8B or 8236-E8C memory modules. 1. Complete the prerequisite tasks described in “Before you begin” on page 115. 2. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 3. If necessary, unlock and lift the memory module cover. 4. If necessary, remove the new memory module from its antistatic package. 5. Ensure that the connector locking tabs are pressed out in the unlocked position before installing a new memory module. Attention: Memory modules are keyed to prevent a memory module from being installed incorrectly. Note the location of the key tab within the memory module connector before attempting to install the memory module. 6. Carefully grasp the memory module along two edges and align the connector. 7. Insert the memory module firmly into the connector (A), as shown in Figure 44 on page 82. 8. Secure the memory module with the locking tabs (B) located at each end of the memory module connector, as shown in Figure 44 on page 82. Removal and replacement procedures for the 8233-E8B and 8236-E8C 81 9. Close the memory module cover (C) until it locks in place, as shown in Figure 45. 10. Replace the system processor assembly. See Replacing the system processor assembly for information about replacing the system processor assembly and covers, restoring power, and starting the system. Skip any steps that you have already completed. Figure 44. Replacing a memory module Figure 45. Closing the memory module cover 82 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 11. Verify the installed part, as described in Verifying the installed part. Continue with any other service procedures you must perform. Installing, removing, and replacing PCI adapters Learn about installing, removing, and replacing Peripheral Component Interconnect (PCI), PCI-X, and PCI Express (PCIe) adapters. Model 8233-E8B and 8236-E8C PCI adapters You can remove, replace, or install PCI adapters in the 8233-E8B and 8236-E8C. If you are installing a new adapter, you will also need to refer to PCI adapter placement for slot placement information. Important: If you are installing a new feature, ensure that you have the software required to support the new feature and determine whether there are any existing PTF prerequisites to install. To do this, use the IBM Prerequisite website at http://www-912.ibm.com/e_dir/eServerPrereq.nsf . Installing a PCI adapter in the 8233-E8B and 8236-E8C server: You can install a PCI adapter. Installing a PCI adapter in the 8233-E8B and 8236-E8C server with the power off: You can install a PCI adapter with the system power off. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director Management Console. If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778, CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before installing it in the system unit. See Figure 71 on page 129. If you do not have an HMC or SDMC, complete this procedure to install a PCI adapter with the system power off: 1. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 2. Determine in which slot to place the PCI adapter. For system-specific adapter placement information, see the PCI adapter placement. 3. Perform the prerequisite tasks described in “Before you begin” on page 115. 4. Stop the system or logical partition. See Stop the system or logical partition. 5. Disconnect the power source from the system by unplugging the system. 6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 7. If necessary, remove the adapter from the antistatic package. Removal and replacement procedures for the 8233-E8B and 8236-E8C 83 Attention: Avoid touching the components and gold connectors on the adapter. 8. Place the adapter, component-side up, on a flat, antistatic surface. 9. Some PCI adapters are shipped from the manufacturer with a blue handle or support bracket along the back edge of the card. To use adapters of this type in this system, you must remove the blue handle or support bracket from the card. 10. Slide the adapter latch (A) into the open position, as shown in the following figure. 11. If necessary, remove the adapter expansion slot filler. 12. Ensure that the slot is empty. If you are installing a double-wide adapter, remove the PCI adapter divider if one is present. See “Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109. 13. Carefully grasp the adapter (A) by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. See the following figure. 14. Press the adapter firmly into its connector. 15. Slide the adapter latch (B) back into place as shown in the following figure. Figure 46. PCI adapter or filler plate removed from the rack-mounted system unit 84 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 16. Connect any adapter cables. 17. If you are servicing a rack-mounted system, route the cables through the cable-management arm. 18. Replace or close the service access cover and, if applicable, return the system to the operating position. 19. Reconnect the power source to the system. 20. Start the system or logical partition. Refer to Start the system or logical partition. 21. Verify that the new resource is functional. Installing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in an AIX environment: You can install a PCI adapter with the system power on in an AIX environment. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director Management Console. If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778, CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before installing it in the system unit. See Figure 71 on page 129. If you do not have an HMC or SDMC, complete the following steps to install a PCI adapter with the system power on in the AIX environment: 1. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. Figure 47. PCI adapter replaced in the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 85 2. Determine in which slot to place the PCI adapter. For system-specific adapter placement information, see the PCI adapter placement. 3. Perform the prerequisite tasks described in “Before you begin” on page 115. 4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 5. If necessary, remove the adapter expansion slot shield. 6. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold connectors on the adapter. 7. Place the adapter, component-side up, on a flat, antistatic surface. 8. Some PCI adapters are shipped from the manufacturer with a blue handle or support bracket along the back edge of the card. To use adapters of this type in this system, you must remove the blue handle or support bracket from the card. 9. If you are installing a double-wide adapter, remove the PCI adapter divider if one is present. See “Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109. 10. Refer to “PCI hot-plug manager access for AIX” on page 122, and follow the steps in the access procedure to select PCI Hot Plug Manager. Then return here to continue. 11. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The Add a Hot-Plug Adapter window displays. 12. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter. 13. Slide the adapter latch (A) into the open position, as shown in the following figure. 14. Remove the adapter filler plate if one is present. Figure 48. PCI adapter or filler plate removed from the rack-mounted system unit 86 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 15. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is set to the Action state. See “Component LEDs” on page 124. 16. When you are instructed to install the adapter in the adapter slot, carefully grasp the adapter by the edges and align the adapter (A) in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing a full-length adapter, ensure that both ends of the adapter engage the card guides. 17. Press the adapter firmly into its connector. 18. Slide the adapter latch (B) back into place as shown in the following figure. 19. Connect any adapter cables. 20. Replace or close the service access cover and, if applicable, return the system to the operating position. 21. Run the cfgmgr command to configure the adapter. 22. Verify that the new resource is functional. See Verify the installed part. Installing a PCI adapter in the 8233-E8B server, with the power on in an IBM i environment: You can install a PCI adapter in the i environment. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director Management Console. Figure 49. PCI adapter replaced in the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 87 If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778, CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before installing it in the system unit. See Figure 71 on page 129. Important: v If you are removing, installing or replacing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID controller, review the Concurrent maintenance procedure in the PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F, 575B) topic, before proceeding with the instructions provided here. v If you are removing, installing or replacing a PCI-X DDR 1.5 GB cache SAS RAID Adapter, review the Concurrent maintenance procedure in the PCI-X DDR 1.5 GB cache SAS RAID Adapter topic, before proceeding with the instructions provided here. v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel IOA, the external storage subsystem must be updated to use the worldwide port name of the new 2766, 2787, 280E, 576B, or 5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766, 2787, 280E, 576B, or 5774 IOA.” on page 126. v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5709, or 570B storage IOA, take note of the following: Depending on the configuration of the system, the storage IOA cache might have been disabled to allow the attachment of OEM storage that emulates a load source drive. If you are replacing a storage IOA that has its cache disabled, configure the replacement IOA the same way as the IOA that you removed. If you remove hardware from the replacement IOA, return that hardware with the failed IOA. If you do not have an HMC or SDMC, complete the following steps to install a PCI adapter with the system power on in the i environment: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. Determine in which slot to place the PCI adapter. For system-specific adapter placement information, see the PCI adapter placement. 4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 5. If necessary, remove the adapter expansion slot shield. 6. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold connectors on the adapter. 7. Place the adapter, component-side up, on a flat, antistatic surface. 8. Some PCI adapter cards are shipped from the manufacturer with a blue handle or support bracket along the back edge of the card. To use adapters of this type in this system, you must remove the blue handle or support bracket from the card. 9. If you are installing a double-wide adapter, remove the PCI adapter divider if one is present. See “Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109. 10. Type strsst on the command line of the Main menu and then press Enter. 11. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. 12. Select Start a service tool from the System Service Tools (SST) display and press Enter. 13. Select Hardware service manager from the Start a Service Tool display and press Enter. 88 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 14. Select Packaging hardware resources (system, frames, cards) from the Hardware Service Manager display. Press Enter. 15. Type 9 (Hardware contained within package) in the System Unit or Expansion Unit field of the unit where you are replacing the card. Press Enter. 16. Select the option to Include empty positions. 17. Select Concurrent Maintenance on the card position where you want to replace the card and then press Enter. 18. Select the option to Toggle LED blink off/on. A light-emitting diode (LED) flashes identifying the position you chose. Physically verify that this is the slot where you want to install the adapter. 19. Select the option to Toggle LED blink off/on to stop the blinking LED. 20. Slide the adapter latch (A) into the open position, as shown in the following figure. 21. Ensure that the slot is empty. Remove the adapter filler plate if one is present. 22. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. 23. Press the adapter firmly into its connector. 24. Slide the adapter latch (B) back into place as shown in the following figure. Figure 50. PCI adapter or filler plate removed from the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 89 25. Connect any adapter cables. 26. Select Power on domain on the Hardware Resource Concurrent Maintenance display and press Enter. 27. Select Assign to on the resource that has an asterisk (*) on the Work with Controlling Resource display. Press Enter. 28. Wait for the Hardware Resource Concurrent Maintenance display to appear with this message: Power on complete 29. Replace or close the service access cover and, if applicable, return the system to the operating position. 30. Verify that the new resource is functional. See Verify the installed part. Installing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in a Linux environment: You can install a PCI adapter with the system power on in the Linux environment. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to install a PCI adapter in the server. For instructions, see Installing a feature by using the Systems Director Management Console. If you are installing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739 or 5778, CCIN 571F and 575B), remove the lower, right standoff from between the two sides of the adapter before installing it in the system unit. See Figure 71 on page 129. If you do not have an HMC or SDMC, complete the following steps to install a PCI adapter with the system power on in the Linux environment: Figure 51. PCI adapter replaced in the rack-mounted system unit 90 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Ensure that the system meets the “Prerequisites for hot-plugging PCI adapters in Linux” on page 125. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. “Verifying that the hot-plug PCI tools are installed for Linux” on page 125. 4. Determine in which slot to place the PCI adapter. For system-specific adapter placement information, see the PCI adapter placement. 5. Perform the prerequisite tasks described in “Before you begin” on page 115. 6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 7. If necessary, remove the adapter expansion slot shield. 8. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold connectors on the adapter. 9. Place the adapter, component-side up, on a flat, antistatic surface. 10. If you are installing a double-wide adapter, remove the PCI adapter divider if one is present. See “Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109. 11. Log in to the system console as the root user. 12. Run the lsslot tool to list the hot-plug PCI slots that are available in the server or partition: lsslot -c pci -a The following is an example of the information displayed by this command: # Slot Description Device(s) U7879.001.DQD014E-P1-C1 PCI-X capable, 64 bit, 133MHz slot Empty U7879.001.DQD014E-P1-C4 PCI-X capable, 64 bit, 133MHz slot Empty U7879.001.DQD014E-P1-C5 PCI-X capable, 64 bit, 133MHz slot Empty Select the appropriate empty PCI slot from the ones listed by the command. 13. Slide the adapter latch (A) into the open position, as shown in the following figure. Removal and replacement procedures for the 8233-E8B and 8236-E8C 91 14. Ensure that the slot is empty. Remove the adapter filler plate if one is present. 15. Run the drslot_chrp_pci command to enable an adapter to be installed. For example, to install an adapter into PCI slot U7879.001.DQD014E-P1-C3, enter the following command: drslot_chrp_pci -a -s U7879.001.DQD014E-P1-C3 The following displays: The visual indicator for the specified PCI slot has been set to the identify state. Press Enter to continue or enter x to exit. 16. Press Enter. The following displays: The visual indicator for the specified PCI slot has been set to the action state. Insert the PCI card into the identified slot, connect any devices to be configured and press Enter to continue. Enter x to exit. 17. When you are instructed to install the adapter (A) in the adapter slot, carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. 18. Press the adapter firmly into its connector. Attention: When you install an adapter into the system, be sure that it is completely and correctly seated in its connector. 19. Slide the adapter latch (B) back into place as shown in the following figure. Figure 52. PCI adapter or filler plate removed from the rack-mounted system unit 92 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 20. Connect any adapter cables. 21. Run the lsslot command to verify that the slot is occupied. For example, enter lsslot -c pci -s U7879.001.DQD014E-P1-C3 The following is an example of the information displayed by this command: # Slot Description Device(s) U7879.001.DQD014E-P1-C3 PCI-X capable, 64 bit, 133MHz slot 0001:40:01.0 22. If you are servicing a rack-mounted system, route the cables through the cable-management arm. 23. Replace or close the service access cover and, if applicable, return the system to the operating position. Removing a PCI adapter from an 8233-E8B and 8236-E8C server: You can remove a PCI adapter. Important: v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel input/output adapter (IOA), the external storage subsystem must be updated to use the worldwide port name of the new 2766, 2787, 280E, 576B, or 5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766, 2787, 280E, 576B, or 5774 IOA.” on page 126. v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5703, 5709, or 570B storage IOA, take note of the following: Depending on the configuration of the system, the storage IOA might have been altered or the storage IOA cache might have been disabled to allow the attachment of OEM storage that emulates a load source drive. If you are replacing a storage IOA that has its cache disabled, configure the replacement IOA the same way as the IOA that you removed. If you remove hardware from the replacement IOA, return that hardware with the failed IOA. Removing a PCI adapter from the 8233-E8B and 8236-E8C server with the power off: Figure 53. PCI adapter replaced in the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 93 You can remove a PCI adapter with the system power off. If your system is managed by the Hardware Management Console (HMC), use the HMC to remove a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems Director Management Console. If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the system power off: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. If you are removing a failing PCI adapter, see Identifying a part. If you are removing the PCI adapter for other reasons, continue to the next step. 4. Stop the system or logical partition. 5. Disconnect the power source from the system by unplugging the system. 6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 7. Determine which adapter you plan to remove, then label and disconnect all cables attached to that adapter. 8. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system. 9. Slide the adapter latch (A) into the open position, as shown in the following figure. 94 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 10. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 11. If you are removing a PCI adapter as part of another procedure, return to that procedure. If not, continue to the next step. 12. If you plan to install another adapter into the vacated slot, go to “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server with the power off” on page 101; otherwise, continue with the next step. 13. Seal the expansion slot using an expansion-slot cover. 14. Replace or close the service access cover and, if applicable, return the system to the operating position. 15. Reconnect the power source to the system. 16. Start the system or logical partition. Refer to Start the system or logical partition. 17. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server” on page 101. Removing a PCI adapter from the 8233-E8B and 8236-E8C server, with the power on in the AIX environment: You can remove a PCI adapter with the system power on in the AIX environment. To remove a failing adapter and replace it with the same adapter, see “Removing and replacing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in the AIX environment” on page 103. If the adapter that is removed will be placed into a different slot or system, complete this removal procedure, and then install the adapter as described in “Installing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in an AIX environment” on page 85. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing a PCI adapter. Figure 54. PCI adapter or filler plate removed from the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 95 If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems Director Management Console. Note: Procedures performed on a PCI adapter with the system power on in the AIX environment, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline before performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the system power on in the AIX environment: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI adapter for other reasons, continue to the next step. 4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 5. Determine which adapters you plan to remove. 6. Record the slot number and location of each adapter being removed. Adapter slots are numbered on the rear of the system unit. 7. Ensure that any processes or applications that might use the adapter are stopped. 8. Follow these steps to place the adapter in the action state using the PCI Hot-Plug Manager: a. Enter the system diagnostics by logging in as root user or as the celogin user, type diag at the AIX command line. b. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter. c. At the FUNCTION SELECTION menu, select Task Selection, and then press Enter. d. At the Task Selection list, select PCI Hot Plug Manager. e. Select Unconfigure a Device, and then press Enter. f. Press F4 (or Esc +4) to display the Device Names menu. g. Select the adapter you are removing in the Device Names menu. h. Use the Tab key to answer NO to Keep Definition. Use the Tab key again to answer YES to Unconfigure Child Devices, and then press Enter. The ARE YOU SURE window is displayed. i. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. j. Press F4 (or Esc +4) twice to return to the Hot Plug Manager menu. k. Select Replace/remove PCI Hot Plug adapter. l. Select the slot that has the device to be removed from the system. m. Select Remove. A fast-blinking amber LED located at the back of the machine near the adapter indicates that the slot has been identified. n. Label all cables attached to the adapter you plan to remove. o. Press Enter. This places the adapter in the action state, meaning it is ready to be removed from the system. p. Disconnect all cables attached to the adapter you plan to remove. 9. Label, and then disconnect all cables attached to the adapter you plan to remove. 96 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 10. Slide the adapter latch (A) into the open position, as shown in the following figure. 11. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 12. If you plan to install another adapter into the vacated slot, go to “Installing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in an AIX environment” on page 85; otherwise, continue with the next step. 13. Seal the expansion slot using an expansion-slot cover. 14. Lower the plastic retainer seat over the PCI adapter faceplate. 15. Rotate the locking latch clockwise until it clicks into the locked position. 16. Continue to follow the screen instructions until you receive a message that the adapter removal is successful. Successful removal is indicated by the OK message displayed next to the Command field at the top of the screen. 17. If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 9 on page 96. If you do not have other adapters to remove, continue with the next step. 18. Press F10 to exit the Hot-Plug Manager. 19. Run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. 20. Replace or close the service access cover and, if applicable, return the system to the operating position. 21. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server” on page 101. Removing a PCI adapter in the 8233-E8B server, with the power on in the IBM i environment: You can remove a PCI adapter with the system power on in the IBM i environment. Figure 55. PCI adapter or filler plate removed from the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 97 If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems Director Management Console. Important: v If the adapter is the load source I/O adapter (IOA) or the load source I/O processor (IOP), or any other storage IOA or IOP with critical disk storage attached for the system or partition, follow the on-screen instructions when you use HSM to power down the IOP or IOA. Instructions to use functions 68 and 69 on the control panel will be included. v If the adapter is the console IOA or the console IOP for the system or partition, you must perform the maintenance from an IBM i operating system session connected through a different IOA or IOP, or you must power down the partition to perform maintenance. v If you are removing, installing or replacing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID controller, review the Concurrent maintenance procedure in the PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F, 575B) topic, before proceeding with the instructions provided here. v If you are removing, installing or replacing a PCI-X DDR 1.5 GB cache SAS RAID Adapter, review the Concurrent maintenance procedure in the PCI-X DDR 1.5 GB cache SAS RAID Adapter topic, before proceeding with the instructions provided here. v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel IOA, the external storage subsystem must be updated to use the worldwide port name of the new 2766, 2787, 280E, 576B, or 5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766, 2787, 280E, 576B, or 5774 IOA.” on page 126. v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5709, or 570B storage IOA, take note of the following: Depending on the configuration of the system, the storage IOA cache might have been disabled to allow the attachment of OEM storage that emulates a load source drive. If you are replacing a storage IOA that has its cache disabled, configure the replacement IOA the same way as the IOA that you removed. If you remove hardware from the replacement IOA, return that hardware with the failed IOA. If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the system power on in the IBM i environment: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI adapter for other reasons, continue to the next step. 4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 5. Type strsst on the command line of the Main Menu and press Enter. 6. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. 7. Select Start a service tool from the System Service Tools (SST) display. Press Enter. 8. Select Hardware service manager from the Start a Service Tool display and press Enter. 98 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 9. Select Packaging hardware resources (system, frames, cards) from the Hardware Service Manager display. Press Enter. 10. Type 9 (Hardware contained within package) in the System Unit or Expansion Unit field of the unit where you are removing the card, and then press Enter. 11. Select the option to Include empty positions. 12. Select Concurrent Maintenance on the card position where you want to remove the card and then press Enter. 13. Select the option to Toggle LED blink off/on. A light-emitting diode (LED) blinks identifying the position you chose. Physically verify that this is the slot where you want to remove the adapter. 14. Select the option to Toggle LED blink off/on to stop the blinking LED. 15. Select the option to Power off domain on the Hardware Resource Concurrent Maintenance display and press Enter. 16. Wait for the Hardware Resource Concurrent Maintenance display to appear with this message: Power off complete 17. Label and then disconnect all cables attached to the adapter you plan to remove. 18. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system. 19. Slide the adapter latch (A) into the open position, as shown in the following figure. 20. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 21. If you are removing a PCI adapter as part of another procedure, return to that procedure. If not, continue to the next step. 22. If you plan to install another adapter into the vacated slot, go to “Installing a PCI adapter in the 8233-E8B server, with the power on in an IBM i environment” on page 87; otherwise, continue with the next step. Figure 56. PCI adapter or filler plate removed from the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 99 23. Seal the expansion slot using an expansion-slot cover. 24. Replace or close the service access cover and, if applicable, return the system to the operating position. 25. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server” on page 101. Removing a PCI adapter from the 8233-E8B and 8236-E8C server, with the power on in the Linux environment: You can remove a PCI adapter with the system power on in the Linux environment. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the PCI adapter from the server. For instructions, see Removing a part by using the Systems Director Management Console. If you do not have an HMC or SDMC, complete the following steps to remove a PCI adapter with the system power on in the Linux environment: 1. Ensure that the system meets the “Prerequisites for hot-plugging PCI adapters in Linux” on page 125. 2. “Verifying that the hot-plug PCI tools are installed for Linux” on page 125. 3. Perform the prerequisite tasks described in “Before you begin” on page 115. 4. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 5. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI adapter for other reasons, continue to the next step. 6. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 7. Determine which adapter you plan to remove, then label and disconnect all cables attached to that adapter. 8. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system. 9. Label, and then disconnect all cables attached to the adapter you plan to remove. Note: Before performing a PCI hot-plug removal of storage devices, ensure that the file systems on those devices are unmounted. 10. Run the drslot_chrp_pci command to enable an adapter to be removed: For example, to remove the PCI adapter in slot U7879.001.DQD014E-P1-C3, run this command: drslot_chrp_pci -r -s U7879.001.DQD014E-P1-C3 Follow the instructions on the display to complete the task. 11. Slide the adapter latch (A) into the open position, as shown in the following figure. 100 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 12. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 13. If you are removing a PCI adapter as part of another procedure, return to that procedure. If not, continue to the next step. 14. If you plan to install another adapter into the vacated slot, go to “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server with the power on in the Linux environment” on page 108; otherwise, continue with the next step. 15. Seal the expansion slot using an expansion-slot cover. 16. Replace or close the service access cover and, if applicable, return the system to the operating position. 17. Reconnect the power source to the system. 18. Start the system or logical partition. Refer to Start the system or logical partition. 19. To replace the PCI adapter, see “Replacing a PCI adapter in the 8233-E8B and 8236-E8C server.” Replacing a PCI adapter in the 8233-E8B and 8236-E8C server: You can replace a PCI adapter. Replacing a PCI adapter in the 8233-E8B and 8236-E8C server with the power off: You can replace a PCI adapter with the system power off. You must have already completed the procedure “Removing a PCI adapter from the 8233-E8B and 8236-E8C server with the power off” on page 93 in order to have the slot powered off. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing a PCI adapter. Figure 57. PCI adapter or filler plate removed from the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 101 If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director Management Console. If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the system power off: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold connectors on the adapter. 4. Place the adapter, component-side up, on a flat, static-protective surface. 5. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. 6. Press the adapter (A) firmly into its connector. Attention: When you install an adapter into the system, be sure that it is completely and correctly seated in its connector. 7. Slide the adapter latch (B) back into place as shown in the following figure. 8. Connect the adapter cables. 9. If you are servicing a rack-mounted system, route the cables through the cable-management arm. 10. Replace or close the service access cover and, if applicable, return the system to the operating position. 11. Reconnect the power source to the system. 12. Start the system or logical partition. Refer to Start the system or logical partition. Figure 58. PCI adapter replaced in the rack-mounted system unit 102 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 13. Verify that the new resource is functional. See Verify the installed part. Removing and replacing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in the AIX environment: You can replace a PCI adapter with the system power on in the AIX environment. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director Management Console. Read the following notes to determine if this is the correct procedure for the task to be performed. Notes: v Use this procedure if you intend to remove a failing PCI adapter and replace it with the same type of adapter. v If you plan to remove a failing adapter and leave the slot empty, see “Removing a PCI adapter from the 8233-E8B and 8236-E8C server, with the power on in the AIX environment” on page 95. v This procedure should not be used to remove an existing adapter and install a different type of adapter. To install a different adapter, remove the existing adapter as described in “Removing a PCI adapter from the 8233-E8B and 8236-E8C server, with the power on in the AIX environment” on page 95, then install the new adapter as described in “Installing a PCI adapter in the 8233-E8B and 8236-E8C server, with the power on in an AIX environment” on page 85. v Procedures performed on a PCI adapter with the system power on in AIX, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline before performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the system power on in the AIX environment: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. If you are removing a failing PCI adapter, see Identifying a failing part. If you are removing the PCI adapter for other reasons, continue to the next step. 4. If you are installing, removing, or replacing a PCI adapter in a rack-mounted system, follow these steps to remove the service access cover. a. Open the front rack door. b. Place the system unit in the service position. c. Remove or open the service access cover. 5. Determine which adapters you plan to remove. 6. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system unit. 7. Ensure that any processes or applications that might use the adapter are stopped. 8. Enter the system diagnostics by logging in as root user or as the celogin user, type diag at AIX command line. 9. When the DIAGNOSTIC OPERATING INSTRUCTIONS menu displays, press Enter. Removal and replacement procedures for the 8233-E8B and 8236-E8C 103 10. At the FUNCTION SELECTION menu, select Task Selection, then press enter. 11. At the Task Selection list, select PCI Hot Plug Manager. 12. Select Unconfigure a Device, then press Enter. 13. Press F4 (or Esc +4) to display the Device Names menu. 14. Select the adapter you are removing in the Device Names menu. 15. Use the Tab key to answer YES to Keep Definition. Use the Tab key again to answer YES to Unconfigure Child Devices, then press Enter. 16. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 17. Press F3 (or Esc +3) twice to return to the Hot Plug Manager menu. 18. Select Replace/remove PCI Hot Plug adapter. 19. Select the slot that has the device to be removed from the system. 20. Select Replace. Note: A fast-blinking amber LED located at the back of the machine near the adapter indicates that the slot has been identified. 21. Press Enter. This places the adapter in the action state, meaning that it is ready to be removed from the system. 22. Label, and then disconnect all cables attached to the adapter you plan to remove. 23. Slide the adapter latch (A) into the open position, as shown in the following figure. 24. Carefully grasp the PCI adapter (B) by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 25. If necessary, remove the replacement adapter from the antistatic package. Figure 59. PCI adapter or filler plate removed from the rack-mounted system unit 104 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Attention: Avoid touching the components and gold connectors on the adapter. 26. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. 27. Press the adapter (A) firmly into its connector. See the following figure. Attention: When you install an adapter into the system, be sure that it is completely and correctly seated in its connector. 28. Slide the adapter latch (B) back into place as shown in the following figure. 29. Connect the adapter cables. 30. Press enter and continue to follow the instructions in the system diagnostics until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the menu. 31. Press the F3 (or Esc+3) key to return to the PCI Hot-Plug Manager menu. 32. Press the F3 (or Esc+3) key to return to the TASK selection list. 33. Select Log Repair Action. 34. Select the resource just replaced, press Enter, press Commit (F7 or ESC 7), then press Enter. 35. Press F3 (or Esc+3) to return to TASK Selection List. 36. Select Hot Plug Task, press enter. 37. Select PCI Hot Plug Manager, then select Configure a defined device, then press Enter. 38. Select the device just replaced from the list, then press Enter. The device is now configured. 39. Press the F10 key to exit the diagnostic program. Note: If you are running the stand-alone diagnostics, do not exit the program completely. 40. Verify the PCI adapter by using the following instructions: a. Did you replace the adapter with the system power on? Figure 60. PCI adapter replaced in the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 105 v Yes - Go to the next step. v No - Load the diagnostic program by doing the following: – If AIX is available, boot AIX, log in as root or CELOGIN, then enter the diag command. – If AIX is not available, boot the stand-alone diagnostics b. Type the diag command if you are not already displaying the diagnostic menus c. Select Advance Diagnostic Routines, then select Problem Determination. d. Select the name of the resource just replaced from the menu. If the resource just replaced is not shown, choose the resource associated with it. Press Enter, then press Commit ((F7 or Esc+7)). e. Did the Problem Determination identify any problems? v No: Continue to the next step. v Yes: A problem is identified – If you are a customer, record the error information, then contact your service provider. – If you are an authorized service provider, return to map 210-5. 41. Press the F10 key to exit the diagnostic program. 42. Replace or close the service access cover and, if applicable, return the system to the operating position. 43. Verify that the new resource is functional. See Verify the installed part. Replacing a PCI adapter in the 8233-E8B server, with the power on in the IBM i environment: You can replace a PCI adapter with the system power on in the IBM i environment. Attention: You must have already completed the procedure “Removing a PCI adapter in the 8233-E8B server, with the power on in the IBM i environment” on page 97 in order to have the slot powered off. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director Management Console. Important: v If the adapter is the load source I/O adapter (IOA) or the load source I/O processor (IOP), or any other storage IOA or IOP with critical disk storage attached for the system or partition, follow the on-screen instructions when you use HSM to power down the IOP or IOA. Instructions to use functions 68 and 69 on the control panel will be included. v If the adapter is the console IOA or the console IOP for the system or partition, you must perform the maintenance from an IBM i operating system session connected through a different IOA or IOP, or you must power down the partition to perform maintenance. v If you are removing, installing or replacing a PCI-X double-wide, quad-channel Ultra320 SCSI RAID controller, review the Concurrent maintenance procedure in the PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F, 575B) topic, before proceeding with the instructions provided here. v If you are removing, installing or replacing a PCI-X DDR 1.5 GB cache SAS RAID Adapter, review the Concurrent maintenance procedure in the PCI-X DDR 1.5 GB cache SAS RAID Adapter topic, before proceeding with the instructions provided here. v If you are exchanging a 2766, 2787, 280E, 576B, or 5774 Fibre Channel IOA, the external storage subsystem must be updated to use the worldwide port name of the new 2766, 2787, 280E, 576B, or 5774 IOA. For instructions, see “Updating the worldwide port name for a new 2766, 2787, 280E, 576B, or 5774 IOA.” on page 126. 106 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v If you are replacing a 2748, 2757, 2763, 2767, 2778, 2780, 2782, 5702, 5709, or 570B storage IOA, take note of the following: Depending on the configuration of the system, the storage IOA cache might have been disabled to allow the attachment of OEM storage that emulates a load source drive. If you are replacing a storage IOA that has its cache disabled, configure the replacement IOA the same way as the IOA that you removed. If you remove hardware from the replacement IOA, return that hardware with the failed IOA. If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the system power on in the IBM i environment: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold connectors on the adapter. 4. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. 5. Press the adapter (A) firmly into its connector. Attention: When you install an adapter into the system, be sure that it is completely and correctly seated in its connector. 6. Slide the adapter latch (B) back into place as shown in the following figure. 7. Connect the adapter cables. 8. Select Power on domain on the Hardware Resource Concurrent Maintenance display and press Enter. 9. Select Assign to on the resource that has an asterisk (*) on the Work with Controlling Resource display. Press Enter. Figure 61. PCI adapter replaced in the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 107 10. Wait for the Hardware Resource Concurrent Maintenance display to appear with this message: Power on complete 11. Replace or close the service access cover and, if applicable, return the system to the operating position. 12. Verify that the new resource is functional. See Verify the installed part. Replacing a PCI adapter in the 8233-E8B and 8236-E8C server with the power on in the Linux environment: You can replace a PCI adapter with the system power on in the Linux environment. You must have already completed the procedure “Removing a PCI adapter from the 8233-E8B and 8236-E8C server, with the power on in the Linux environment” on page 100. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for installing a PCI adapter. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the PCI adapter in the server. For instructions, see Replacing a part by using the Systems Director Management Console. If you do not have an HMC or SDMC, complete the following steps to replace a PCI adapter with the system power on in the Linux environment: 1. Perform the prerequisite tasks described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold connectors on the adapter. 4. Place the adapter, component-side up, on a flat, static-protective surface. 5. Run the drslot_chrp_pci command to enable an adapter to be replaced: For example, to replace the PCI adapter in slot U7879.001.DQD014E-P1-C3 run this command: drslot_chrp_pci -R -s U7879.001.DQD014E-P1-C3 Follow the instructions on the display to complete the task. 6. Press the adapter firmly into its connector. Attention: When you install an adapter into the system, be sure that it is completely and correctly seated in its connector. 7. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. 8. Press the adapter (A) firmly into its connector. Attention: When you install an adapter into the system, be sure that it is completely and correctly seated in its connector. 9. Slide the adapter latch (B) back into place as shown in the following figure. 108 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 10. Connect the adapter cables. 11. Run the lsslot command to verify that the slot is occupied. For example, Enter lsslot -c pci -s U7879.001.DQD014E-P1-C3 The following is an example of the information displayed by this command: # Slot Description Device(s) U7879.001.DQD014E-P1-C3 PCI-X capable, 64 bit, 133MHz slot 0001:40:01.0 12. If you are servicing a rack-mounted system, route the cables through the cable-management arm. 13. Replace or close the service access cover and, if applicable, return the system to the operating position. 14. Verify that the new resource is functional. See Verify the installed part. Model 8233-E8B and 8236-E8C PCI adapter dividers You might need to remove, replace, or install PCI adapter dividers. Use the procedures in this section to perform these tasks. Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server: You can remove a PCI adapter divider. The following procedure describes the removal of PCI adapter dividers with the system power off. This procedure can be done with the system power on by omitting the steps related to powering off the system. To remove a divider, do the following steps: 1. Perform prerequisite tasks as described in “Before you begin” on page 115. 2. Take appropriate precautions for avoiding electric shock and handling static-sensitive devices. For information, see “Safety notices” on page v and “Handling static-sensitive devices” on page 118. 3. Stop the system or logical partition. See Stop the system or logical partition. Figure 62. PCI adapter replaced in the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 109 4. Disconnect the power source from the system by unplugging the system. 5. Remove the unit's cover if you are installing or removing a PCI adapter divider. 6. Slide the adapter latch (A) into the open position, as shown in the following figure. 7. Flex the front edge (A) of the PCI adapter divider out of the bracket and toward the rest of the divider. 8. Pull the back edge of the divider away from the retention notches (B). Figure 63. PCI adapter or filler plate removed from the rack-mounted system unit 110 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 9. If you are removing the PCI adapter divider as part of another procedure, return to that procedure now. To replace the divider, see “Installing a PCI adapter divider in the 8233-E8B and 8236-E8C server” on page 112. To close up the system, continue to the next step. 10. Slide the adapter latch (B) back into place as shown in the following figure. Figure 64. PCI-adapter divider removed from the system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 111 11. Replace or close the service access cover and, if applicable, return the system to the operating position. 12. Reconnect the power source to the system. 13. Start the system or logical partition. Refer to Start the system or logical partition. Installing a PCI adapter divider in the 8233-E8B and 8236-E8C server: You can install a PCI adapter divider. The following procedure describes the installation of a PCI adapter divider with the system power off. This procedure can be done with the system power on by omitting the steps related to powering on the system. To install a divider, do the following steps: 1. Locate the PCI adapter divider slot that you want to use. To access the divider slots, see the first 6 steps in “Removing a PCI adapter divider from the 8233-E8B and 8236-E8C server” on page 109. 2. Slide the adapter latch (A) into the open position, as shown in the following figure. Figure 65. PCI adapter replaced in the rack-mounted system unit 112 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 3. Carefully grasp the PCI adapter divider by its top edge and align the back edge of the divider with the retention notches (A). 4. Insert the front edge of the divider (B) into the slots and then press the divider into place. Figure 66. PCI adapter or filler plate removed from the rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 113 5. Slide the adapter latch (B) back into place as shown in the following figure. Figure 67. PCI-adapter divider replaced in the system unit Figure 68. PCI adapter replaced in the rack-mounted system unit 114 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 6. Replace or close the service access cover and, if applicable, return the system to the operating position. 7. Reconnect the power source to the system. 8. Start the system or logical partition. Refer to Start the system or logical partition. Related procedures for installing and removing PCI adapters These procedures are related to installing and removing PCI adapters. Before you begin: Observe these precautions when you are installing, removing, or replacing features and parts. These precautions are intended to create a safe environment to service your system and do not provide steps for servicing your system. The installation, removal and replacement procedures provide the step-by-step processes required to service your system. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER Removal and replacement procedures for the 8233-E8B and 8236-E8C 115 Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Before you begin a replacement or installation procedure, perform these tasks: 1. If you are installing a new feature, ensure that you have the software required to support the new feature. To do this, go to the following Web site: IBM Prerequisite 2. If you are performing an installation or replacement procedure that might put your data at risk, ensure, wherever possible, that you have a current backup of your system or logical partition (including operating systems, licensed programs, and data). 3. Review the installation or replacement procedure for the feature or part. 4. Note the significance of color on your system. 116 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also indicate that the part can be removed and replaced with the system or logical partition power on. 5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of scissors. 6. If parts are incorrect, missing, or visibly damaged, do the following: v If you are replacing a part, contact the provider of your parts or next level of support. v If you are installing a feature, contact one of the following service organizations: – The provider of your parts or next level of support. – In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL) at 1–800–300–8751. In countries and regions outside of the United States, use the following Web site to locate your service and support telephone numbers: http://www.ibm.com/planetwide 7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller, or your next level of support. 8. If you are installing new hardware in a logical partition, you need to understand and plan for the implications of partitioning your system. For information, see Logical Partitioning. Avoiding electric shock: Learn about precautions you should take to avoid electric shock when working on our around a computer system. Removal and replacement procedures for the 8233-E8B and 8236-E8C 117 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Handling static-sensitive devices: Learn about precautions you should take to prevent damage to electronic components from static electricity discharge. Electronic boards, adapters, media drives, and disk drives are sensitive to static electricity discharge. These devices are wrapped in antistatic bags to prevent this damage. Take the following precautions to prevent damage to these devices from static electricity discharge. v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. v Do not remove the device from the antistatic bag until you are ready to install the device in the system. 118 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v With the device still in its antistatic bag, touch it to the metal frame of the system. v Grasp cards and boards by the edges. Avoid touching the components and gold connectors on the adapter. v If you need to lay the device down while it is out of the antistatic bag, lay it on the antistatic bag. Before picking it up again, touch the antistatic bag and the metal frame of the system at the same time. v Handle the devices carefully to prevent permanent damage. Installing or replacing a PCI adapter with the system power on in Virtual I/O Server: You can install or replace a PCI adapter in the Virtual I/O Server logical partition or in the Integrated Virtualization Manager management partition. The Virtual I/O Server includes a PCI Hot Plug Manager that is similar to the PCI Hot Plug Manager in the AIX operating system. The PCI Hot Plug Manager allows you to hot plug PCI adapters into the server and then activate them for the logical partition without having to reboot the system. Use the PCI Hot Plug Manager for adding, identifying, or replacing PCI adapters in the system that are currently assigned to the Virtual I/O Server. Getting started: Prerequisites: v If you are installing a new adapter, an empty system slot must be assigned to the Virtual I/O Server logical partition. This task can be done through dynamic logical partitioning (DLPAR) operations. – If you are using a Hardware Management Console (HMC), you must also update the logical partition profile of the Virtual I/O Server so that the new adapter is configured to the Virtual I/O Server after you restart the system. – If you are using the Integrated Virtualization Manager, an empty slot is probably already assigned to the Virtual I/O Server logical partition because all slots are assigned to the Virtual I/O Server by default. You only need to assign an empty slot to the Virtual I/O Server logical partition if you previously assigned all empty slots to other logical partitions. v If you are installing a new adapter, ensure that you have the software required to support the new adapter and determine whether there are any existing PTF prerequisites to install. To do this, use the IBM Prerequisite website at http://www-912.ibm.com/e_dir/eServerPrereq.nsf v If you need help determining the PCI slot in which to place a PCI adapter, see the PCI adapter placement. Follow these steps to access the Virtual I/O Server, PCI Hot Plug Manager: 1. If you are using the Integrated Virtualization Manager, connect to the command-line interface. 2. Use the diagmenu command to open the Virtual I/O Server diagnostic menu. The menus are similar to the AIX diagnostic menus. 3. Select Task Selection, then press Enter. 4. At the Task Selection list, select PCI Hot Plug Manager. Installing a PCI adapter: To install a PCI adapter with the system power on in Virtual I/O Server, do the following steps: 1. From the PCI Hot Plug Manager, select Add a PCI Hot Plug Adapter, then press Enter. The Add a Hot-Plug Adapter window is displayed. 2. Select the appropriate empty PCI slot from those listed, and press Enter. A fast-blinking amber LED located at the back of the server near the adapter indicates that the slot has been identified. 3. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is set to the Action state. Removal and replacement procedures for the 8233-E8B and 8236-E8C 119 a. Set the adapter LED to the action state so that the indicator light for the adapter slot flashes b. Physically install the adapter c. Finish the adapter installation task in diagmenu. 4. Run the cfgdev command to configure the device for the Virtual I/O Server. If you are installing a PCI, Fibre Channel adapter, it is now ready to be attached to a SAN and have LUNs assigned to the Virtual I/O Server for virtualization. Replacing a PCI adapter: Prerequisite: Before you can remove or replace a storage adapter, you must unconfigure that adapter. See “Unconfiguring storage adapters” for instructions. To replace a PCI adapter with the system power on in Virtual I/O Server, do the following steps: 1. From the PCI Hot Plug Manager, select Unconfigure a Device, then press Enter. 2. Press F4 (or Esc +4) to display the Device Names menu. 3. Select the adapter you are removing in the Device Names menu. 4. In the Keep Definition field, use the Tab key to answer Yes. In the Unconfigure Child Devices field, use the Tab key again to answer YES, then press Enter. 5. Press Enter to verify the information on the ARE YOU SURE screen. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 6. Press F4 (or Esc +4) twice to return to the Hot Plug Manager. 7. Select replace/remove PCI Hot Plug adapter. 8. Select the slot that has the device to be removed from the system. 9. Select replace. A fast-blinking amber LED located at the back of the machine near the adapter indicates that the slot has been identified. 10. Press Enter which places the adapter in the action state, meaning it is ready to be removed from the system. Unconfiguring storage adapters: Before you can remove or replace a storage adapter, you must unconfigure that adapter. Storage adapters are generally parent devices to media devices, such as disk drives or tape drives. Removing the parent requires that all attached child devices either be removed or placed in the define state. Unconfiguring a storage adapter involves the following tasks: v Closing all applications that are using the adapter you are removing, replacing, or moving v Unmounting file systems v Ensuring that all devices connected to the adapter are identified and stopped v Listing all slots that are currently in use or a slot that is occupied by a specific adapter v Identifying the adapter's slot location v Making parent and child devices unavailable v Making the adapter unavailable If the adapter supports physical volumes that are in use by a client logical partition, then You can perform steps on the client logical partition before unconfiguring the storage adapter. For instructions, see “Preparing the client logical partitions” on page 121. For example, the adapter might be in use because the physical volume was used to create a virtual target device, or it might be part of a volume group used to create a virtual target device. Follow these steps to unconfigure SCSI, SSA, and Fibre Channel storage adapters: 120 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Connect to the Virtual I/O Server command-line interface. 2. Use the oem_setup_env command to close all applications that are using the adapter you are unconfiguring. 3. Type lsslot-c pci to list all the hot plug slots in the system unit and display their characteristics. 4. Type lsdev -C to list the current state of all the devices in the system unit. 5. Type unmount to unmount previously mounted file systems, directories, or files using this adapter. 6. Type rmdev -l adapter -R to make the adapter unavailable. Attention: Do not use the -d flag with the rmdev command for hot plug operations because this action removes your configuration. Preparing the client logical partitions: If the virtual target devices of the client logical partitions are not available, the client logical partitions can fail or they might be unable to perform I/O operations for a particular application. If you use the HMC to manage the system, you might have redundant Virtual I/O Server logical partitions, which allow for Virtual I/O Server maintenance and avoid downtime for client logical partitions. If you are replacing an adapter on the Virtual I/O Server and your client logical partition is dependent on one or more of the physical volumes accessed by that adapter, then You can take action on the client before you unconfigure the adapter. The virtual target devices must be in the define state before the Virtual I/O Server adapter can be replaced. Do not remove the virtual devices permanently. To prepare the client logical partitions so that you can unconfigure an adapter, complete the following steps depending on your situation. Table 1. Situations and steps for preparing the client logical partitions Situation Steps You have redundant hardware on the Virtual I/O Server for the adapter. No action is required on the client logical partition. HMC-managed systems only: You have redundant Virtual I/O Server logical partitions that, in conjunction with virtual client adapters, provide multiple paths to the physical volume on the client logical partition. No action is required on the client logical partition. However, path errors might be logged on the client logical partition. HMC-managed systems only: You have redundant Virtual I/O Server logical partitions that, in conjunction with virtual client adapters, provide multiple physical volumes that are used to mirror a volume group. See the procedures for your client operating system. For example, for AIX, see Replacing a disk on the Virtual I/O Server in the IBM System p® Advanced POWER® Virtualization Best Practices Redpaper. The procedure for Linux is similar to this procedure for AIX. You do not have redundant Virtual I/O Server logical partitions. Shut down the client logical partition. For instructions, see the following topics about shutting down logical partitions: v For systems that are managed by the HMC, see “Shutting down AIX logical partitions using the HMC”, “Shutting down IBM i logical partitions using the HMC”, and “Shutting down Linux logical partitions using the HMC” in the Logical partitioning.1 v For systems that are managed by the Integrated Virtualization Manager, see “Shutting down logical partitions” on page 122. 1The Logical partitioning can be found on the Hardware Information website at http://publib.boulder.ibm.com/infocenter/systems/scope/hw/topic/p7hdx/power_systems.htm . Removal and replacement procedures for the 8233-E8B and 8236-E8C 121 Shutting down logical partitions: You can use the Integrated Virtualization Manager to shut down the logical partitions or to shut down the entire managed system. Use any role other than View Only to perform this task. The Integrated Virtualization Manager provides the following types of shutdown options for logical partitions: v Operating System (recommended) v Delayed v Immediate The recommended shutdown method is to use the client operating systems shutdown command. Use the immediate shutdown method only as a last resort because using this method causes an abnormal shutdown which might result in data loss. If you choose the Delayed shutdown method, then be aware of the following considerations: v Shutting down the logical partitions is equivalent to pressing and holding the white control-panel power button on a server that is not partitioned. v Use this procedure only if you cannot successfully shut down the logical partitions through operating system commands. When you use this procedure to shut down the selected logical partitions, the logical partitions wait a predetermined amount of time to shut down. This allows the logical partitions time to end jobs and write data to disks. If the logical partition is unable to shut down within the predetermined amount of time, it ends abnormally, and the next restart might take a long time. If you plan to shut down the entire managed system, shut down each client logical partition, then shut down the Virtual I/O Server management partition. To shut down a logical partition, complete the following steps in the Integrated Virtualization Manager: 1. In the navigation area, select View/Modify Partitions under Partition Management. The View/Modify Partitions page is displayed. 2. Select the logical partition that you want to shut down. 3. From the Tasks menu, click Shutdown. The Shutdown Partitions page is displayed. 4. Select the shutdown type. 5. Optional: Select Restart after shutdown completes if you want the logical partition to start immediately after it shuts down. 6. Click OK to shut down the partition. The View/Modify Partitions page is displayed, and the logical partition state has a value of shutdown. PCI hot-plug manager access for AIX: You might need to service PCI adapters with the system power on in AIX. Use the procedures in this section to perform this task. The instructions for servicing PCI adapters with the system power on in AIX refer you to these procedures when it is appropriate to perform them. Note: For an adapter to be serviced with the system power on, both the adapter and the system unit must support hot-plug procedures. To identify adapters that are hot-pluggable in the system you are servicing, refer to the following placement information: PCI adapter placement. Accessing hot-plug management functions: 122 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures You can use PCI Hot Plug Manager to service PCI adapters with the system power on in AIX. Use the procedures in this section to perform this task. Note: Procedures performed on a PCI adapter with the system power on in AIX, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. To access the hot-plug menus, do the following: 1. Log in as root user. 2. At the command line, type smitty. 3. Select Devices. 4. Select PCI Hot Plug Manager and press Enter. 5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. The following section describes the menu options. PCI hot-plug manager menu: You can use PCI Hot Plug Manager to service PCI adapters with the system power on in AIX. The following options are available from the PCI Hot Plug Manager menu. Note: For information about the PCI slot LED states, see “Component LEDs” on page 124. List PCI hot-plug slots Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it holds an “Unknown” device, select the Install/configure Devices added after IPL to configure the adapter in that slot. Add a PCI hot-plug adapter Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot will go into the Action state and finally into the On state. Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/configure devices added after IPL option to configure the adapter. Replace/remove a PCI hot-plug adapter Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For this option to work, the adapter must be in the Defined state (see the “Unconfigure a Device” option). You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the Action state. Identify a PCI hot-plug slot Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See “Component LEDs” on page 124. Unconfigure a device Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use. Removal and replacement procedures for the 8233-E8B and 8236-E8C 123 This step must be completed successfully before starting any removal or replacement operation. If this step fails, the customer must take action to release the device. Configure a defined device Allows a new PCI adapter to be configured into the system if software support is already available for the adapter. The selected PCI slot will go into the On state. Install/configure devices added after IPL The system attempts to configure any new devices and tries to find and install any required software from a user-selected source. The add, remove, and replace functions return information to the user indicating whether the operation was successful. If additional instructions are provided on the screen, complete the recommended actions. If the instructions do not resolve the problem, do the following: v If the adapter is listed as Unknown, perform the Install/configure devices Added After IPL option to configure the adapter. v If you receive a warning indicating that needed device packages are not installed, the system administrator must install the specified packages before you can configure or diagnose the adapter. v If you receive a failure message indicating a hardware error, the problem might be either the adapter or the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different adapter in the slot. If you determine that you have failing hardware, call your service representative. v Do not use Install/configure devices added after IPL if your system is set up to run HACMP™ clustering. Consult with your system administrator or software support to determine the correct method to configure the replacement device. Component LEDs: Individual LEDs are located on or near the failing components. Use the information in this section to interpret the LEDs. The LEDs are located either on the component itself or on the carrier of the component (for example, memory card, fan, memory module, or processor). LEDs are either green or amber. Green LEDs indicate either of the following: v Electrical power is present. v Activity is occurring on a link. (The system could be sending or receiving information.) Amber LEDs indicate a fault or identify condition. If your system or one of the components on your system has an amber LED turned on or blinking, identify the problem and take the appropriate action to restore the system to normal. Resetting the LEDs in AIX: Individual LEDs are located on or near the failing components. You can use this procedure to reset the LEDs after you have completed a repair action. After the repair action is completed, do the following: 1. Log in as root user. 2. At the command line, type diag. 3. Select Task Selection. 4. Select Log Repair Action. 124 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 5. Select the device that was repaired. 6. Press F10 to exit diagnostics. If the Attention LED remains on after you have completed the repair action and reset the LEDs, call for service support. Prerequisites for hot-plugging PCI adapters in Linux: In the course of installing, removing, or replacing a PCI adapter with the system powered on in aLinux operating system, you might need to complete some prerequisite tasks. Use the information in this section to identify those prerequisites. The Linux system administrator needs to take the PCI adapter offline prior to removing, replacing, or installing a PCI adapter with the system power on (hot-plugging). Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. Before hot-plugging adapters for storage devices, ensure that the file systems on those devices are unmounted. After hot-plugging adapters for storage devices, ensure that the file systems on those devices are remounted. Before hot-plugging an adapter, ensure that the server or partition is at the correct level of the Linux operating system (Linux 2.6 or later). Install the POWER Linux Service Aids. These service aids enable system serviceability, as well to improve system management. If you are using a Linux on POWER distribution with Linux kernel version 2.6 or later, you can install the Service Aids that gives you access to more capabilities, which can help you diagnose problems on your system. This software is available at the Service and productivity tools for Linux on POWER Web site (http://www14.software.ibm.com/webapp/set2/sas/f/lopdiags/home.html). Verifying that the hot-plug PCI tools are installed for Linux: In the course of installing, removing, or replacing a PCI adapter with the system power on in Linux operating system you might need use the hot-plug PCI tools. Use the procedure in this section to verify that you have the hot-plug PCI tools installed. 1. Enter the following command to verify that the hot-plug PCI tools are installed: rpm -aq | grep rpa-pci-hotplug If the command does not list any rpa-pci-hotplug packages, the PCI Hot Plug tools are not installed. 2. Enter the following command to ensure that the rpaphp driver is loaded: ls -l /sys/bus/pci/slots/ The directory should contain data. If the directory is empty, the driver is not loaded or the system does not contain hot-plug PCI slots. The following is an example of the information displayed by this command: drwxr-xr-x 15 root root 0 Feb 16 23:31 . drwxr-xr-x 5 root root 0 Feb 16 23:31 .. drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0000:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.4 Removal and replacement procedures for the 8233-E8B and 8236-E8C 125 drwxr-xr-x 2 root root 0 Feb 16 23:31 0001:00:02.6 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.0 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.2 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.4 drwxr-xr-x 2 root root 0 Feb 16 23:31 0002:00:02.6 If the directory does not exist, run the following command to mount the filesystem: mount -t sysfs sysfs /sys 3. Ensure the following tools are available in the /usr/sbin directory. v lsslot v drslot_chrp_pci 4. Return to the procedure that sent you here. Updating the worldwide port name for a new 2766, 2787, 280E, 576B, or 5774 IOA.: If you have exchanged a 2766, 2787, 280E, 576B, or 5774 Fibre Channel Input/Output Adapter (IOA), the IBM external storage subsystem must be updated to use the worldwide port name (WWPN) of the new 2766, 2787, 280E, 576B, or 5774 IOA. Any SAN hardware using WWPN zoning might also need updating. For instructions on how to update the external storage subsystem or SAN hardware configurations, see the documentation for those systems. The WWPN for the Fibre Channel IOA can be found using the Hardware Service Manager in SST or DST. Display detail on the 2766, 2787, 280E, 576B, or 5774 IOA Logical Hardware Resource information, and use the port worldwide name field. The 16-digit WWPN can also be determined by appending the digits "1000" to the beginning of the 12-digit IEEE address found on the tailstock label of the Fibre Channel IOA. PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller (FC 5739, 5778, 5781, 5782; CCIN 571F, 575B): Learn about the features, specifications, and installation notes for the PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller. The PCI-X double-wide, quad-channel Ultra320 SCSI RAID Controller is a high-performance SCSI adapter combined with an auxiliary-write cache adapter to form a double-wide, adapter pair. The two adapters are screwed together. This topic contains the following sections: v “Features” v “Specifications” on page 127 v “Placement information” on page 128 v “Logical partitioning” on page 128 v “Placing the adapter in a double-wide cassette” on page 128 v “Installing the cassette containing the adapter in a system” on page 128 v “Placing the double-wide adapter in a system that does not use cassettes” on page 129 v “Power LED” on page 129 v “Concurrent maintenance procedure” on page 129 Features v 3 external U320 SCSI ports v 1 internal U320 SCSI port dedicated to connecting the controller to the auxiliary-write cache adapter v Up to 36 Direct Access Storage Devices (DASD) can be externally attached per adapter v 320 MB data rate per SCSI bus 126 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v Supports low voltage differential (LVD) disk devices only v Supports RAID 5 and 6 (3–18 drive sets) v 390 MB/1.5 GB compressed write cache v 415 MB/1.6 GB compressed read cache v PCI-X DDR support (storage adapter side) v IOP or IOPless (5739 requires an IOP. 5778 does not.) (A) Battery cover (B) SCSI port 2 (C) SCSI port 0 (D) SCSI port 1 Specifications Item Description Adapter FRU number 42R6578 Cache battery FRU number 42R3965 Unit description v 64 bit, 133 MHz, 3.3 V v PCI-X 2.0 compliant v Double-wide adapter, requires 2, adjacent, long slots. The SCSI controller side of the adapter pair requires a 64-bit slot. (The controller side is the side with the external SCSI connectors.) v The auxiliary-write cache adapter contains a dual, concurrently maintainable, cache-battery pack, which maintains cache memory on both adapters in the event of an abnormal termination. Operating system or partition requirements v IBM i V5R3, V5R3M5, and V5R4 with PTFs. v AIX 5L™ Version 5.2 with the 5200-10 Technology Level, or later v AIX 5L Version 5.3 with the 5300-06 Technology Level, or later v Red Hat Enterprise Linux version 4 U4, or later v SUSE Linux Enterprise Server 10, or later Check the IBM Prerequisite Web page for further details and updates. You can find a link to that Web page at the end of this topic. Figure 69. Adapter Removal and replacement procedures for the 8233-E8B and 8236-E8C 127 Item Description Maximum number See Placement information. Placement information For placement information for this adapters, see PCI adapter placement. Logical partitioning When used in a logical partition (LPAR) environment, this double-wide adapter must have both slots of the adapter assigned to the same logical partition. When implementing dynamic logical partitioning (DLPAR), both slots of the adapter must be managed together. Placing the adapter in a double-wide cassette To place this double-wide adapter in a double-wide cassette, first remove the adapter handle (B), and the two plastic covers from the SCSI jumper cable (A), as shown in the following figure. (The two plastic covers are only found on adapters made before August 2008.) Then follow the general procedures to place a double-wide adapter in a double-wide cassette. See Placing an adapter in the PCI adapter double-wide cassette. Installing the cassette containing the adapter in a system To install the cassette containing the adapter into a system unit or expansion unit, following the procedures in Model 5796 expansion units, PCI adapters and cassettes. Figure 70. Preparing the adapter for placement in a cassette 128 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Placing the double-wide adapter in a system that does not use cassettes If you are installing the double-wide adapter in the internal PCI slots, remove the lower, right standoff shown in Figure 71. Power LED For a double-wide adapter, there is only one power LED visible for both slots. Concurrent maintenance procedure Concurrent maintenance of this double-wide adapter is not supported through the Hardware Management Console (HMC). Concurrent maintenance must be done from the Hardware Service Manager (HSM) of the system or owning partition. The HSM automatically powers off and on both PCI slots when either slot is selected. In AIX or Linux, you must manually power off and on each slot separately. Important: v Both PCI slots must be powered off when installing or removing this adapter with the system power on. v If this adapter is the load source IOA, or under the load source IOP, or any other storage IOA/IOP with critical DASD attached for the system, have a qualified service provider perform the concurrent maintenance procedure. At step 12 on page 130, the HSM directs you to use control panel functions 68 and 69 to power off the domain. The following steps are a general concurrent maintenance procedure for installing, removing, or replacing an adapter in IBM i operating system. If you were sent here by another procedure, additional specific instructions for removing and replacing a PCI adapter can be found in that procedure. 1. Start an IBM i session for the system or partition containing the adapter and sign on to the system or partition. 2. Type strsst on the command line of the Main menu and then press Enter. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display and press Enter. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tools display and press Enter. Figure 71. Removing the standoff Removal and replacement procedures for the 8233-E8B and 8236-E8C 129 6. Select Logical hardware resources (system, frames, cards) from the Hardware Service Manager display and press Enter. 7. Select System bus resources and press Enter. The display changes to the Logical Hardware Resources on System Bus display. 8. Page down and locate the IOP that controls the IOA to be serviced. If you are installing a new IOPless adapter in an empty slot, the IOP is not shown. 9. Enter 9 in the Opt field for the IOP to be serviced. 10. Locate the Storage IOA resource for the adapter to be serviced and enter 8 in the Opt field for that resource. 11. Enter 3 for Concurrent maintenance in the Opt field for the Storage IOA resource to be serviced. The display changes to the Hardware Resource Concurrent Maintenance display. 12. Press F9 to power off the domain. In some cases the card slots are already powered off if empty. 13. Press Enter to start power off. The display changes to the Hardware Resource Concurrent Maintenance Status display, which shows the status of the power off. When the power off is complete, the display returns to the Hardware Resource Concurrent Maintenance display. The display shows that the Power Status is off for Storage IOA. Leave this display on the HSM while you install, remove, or replace the adapter. 14. Check the slot power LED state for the slot in which the adapter resides to verify that the adapter is powered off. For a double-wide adapter, there is only one power LED visible for both slots. 15. Install, remove, or replace the adapter. See Model 5796 expansion units, PCI adapters and cassettes. Links to procedures for other systems can be found at Installing PCI adapters. 16. After the replacement is complete, return to the Hardware Resource Concurrent Maintenance display on the HSM. 17. If you were previously instructed to use operator panel function 69 to power on the adapter, do so now. Then go to step Installing PCI adapters, otherwise continue with next step. 18. Enter F10 to power the adapter on. 19. Press Enter to begin power on. 20. If the Work with Controlling Resource display appears, enter 7 in the Opt field for the IOP to assign the IOA to. The Hardware Resource Concurrent Maintenance Status display shows the status of the power on. 21. When the power on is complete, the display returns to the Hardware Resource Concurrent Maintenance display. The display shows that the Power Status is on for the Storage IOA. 22. Check the LED status to verify that the adapter is powered on. For a double-wide adapter, there is only one power LED visible for both slots. 23. Return to the procedure that sent you here. PCI-X DDR 1.5 GB cache SAS RAID Adapter (FC 5904 and 5908; CCIN 572F and 575C): Learn about the specifications and operating system requirements for the 5904 and 5908 adapter. Overview The PCI-X DDR 1.5 GB cache SAS RAID Adapter is a SAS disk controller with a maximum of 1.5 GB compressed write cache and a maximum 1.6 GB compressed read cache. Auxiliary write cache and concurrent battery maintenance are provided. The controller is implemented using two physical adapters that are securely connected to form a double-wide adapter. The double-wide adapter requires two adjacent Peripheral Component Interconnect-X (PCI-X) slots. The auxiliary write cache side of the 130 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures double-wide adapter contains a dual, concurrently maintainable cache battery pack which maintains cache memory on both adapters in the event of an abnormal termination. When used in a logical partition (LPAR) environment, this double-wide adapter must have both slots of the adapter assigned to the same logical partition. When implementing dynamic LPAR (DLPAR), both slots of the adapter must be managed together. Feature 5904 and 5908 are all feature codes representing the same PCI-X DDR 1.5 GB cache SAS RAID Adapter. Different feature codes indicate if a blind swap cassette is used and its type: v Feature 5904 indicates no blind swap cassette. The feature is used in enclosures that do not use blind swap cassettes. v Feature 5908 indicates a gen-3 blind swap cassette. This feature is used in the 5790 and 5796. 572F is the CCIN number on the RAID adapter side of the double-wide adapter pair. 575C is the CCIN number on the write-cache adapter. The adapter provides RAID 0, RAID 5, RAID 6, and RAID 10 for the AIX and Linux operating systems. Under the IBM i operating system, mirroring and data spreading are provided by the operating system, and RAID 5 and RAID 6 are provided by the adapter. 1 The adapter provides three mini-SAS 4x connectors for the attachment of SAS drives located in 5886 EXP 12S Expansion Drawers. The adapter supports a maximum of five 5886 EXP 12S Expansion Drawers.1 With proper cabling and configuration, multiple wide ports are used to provide redundant paths to each dual port SAS drive. The adapter manages SAS path redundancy and path switching should a SAS failure occur. This adapter supports the multi-initiator and high availability configurations. With the optional pairing of adapter sets, an even higher level of protection is provided by using a dual controller I/O configuration1 to protect against the failure of an entire adapter set. In such a high availability I/O configuration, SAS X cables are used to attach 5586 EXP 12S Expansion Drawers, and the fourth (top) mini-SAS connector on each card set is used to directly connect the card sets by using a SAS AA cable. See Serial attached SCSI cable planning for more information. Important: See the SAS RAID controller for AIX or SAS RAID controller for Linux topics for more information and important considerations for multi-initiator and high availability configurations. See also PCI-X DDR 1.5 GB cache SAS RAID Adapter (FC 5904 and 5908; CCIN 572F and 575C). The following figure shows the adapter. Removal and replacement procedures for the 8233-E8B and 8236-E8C 131 Note: Port T3 does not support any device attachment. T3 is only used in dual controller I/O configurations for adapter to adapter communication. Specifications Item Description Adapter FRU number 44V8622 (Adapter only) (Designed to comply with RoHS requirement.) 44V7627 (Adapter in a gen-2.5 blind swap cassette) Battery FRU number 42R3965 or 74Y5665 (Designed to comply with RoHS requirement.) I/O bus architecture PCI-X Slot requirement Two, long, adjacent PCI-X slots. Cables SAS device attachment requires specific cables that are provided with the subsystem or device features being attached. Special cabling is required for multi-initiator and high availability configurations. See Serial attached SCSI cable planning. Voltage 3.3V Form factor Long Maximum number See PCI adapter placement. Attributes v SAS speed: 3 Gbps v SAS, SAS Serial SCSI Protocol (SSP) and Serial Management Protocol (SMP) supported v Single controller supported with SAS y cables (3692, 3693, 3694) v Dual controller supported with SAS x cables (3661, 3662, 3663) and SAS AA cable (3681, 3682) v Single controller supports mirrored write cache data with auxiliary cache v Dual controller supports mirrored-write cache data and mirrored-RAID parity footprints between card sets Figure 72. PCI-X DDR 1.5GB cache SAS RAID Adapter 132 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v Solid-state drives (SSDs) supported v Removable media devices are not supported Operating system or partition requirements This adapter is supported for the following operating systems: v AIX: – AIX 5L Version 5.3 with the 5300-07 Technology Level and Service Pack 8, or later – AIX 5L Version 5.3 with the 5300-08 Technology Level and Service Pack 6, or later – AIX 5L Version 5.3 with the 5300-09 Technology Level and Service Pack 2, or later – AIX 5L Version 5.3 with the 5300-10 Technology Level, or later – AIX 6.1 and Service Pack 8, or later – AIX Version 6.1 with the 6100-01 Technology Level and Service Pack 4, or later – AIX Version 6.1 with the 6100-02 Technology Level and Service Pack 3, or later – AIX Version 6.1 with the 6100-03 Technology Level, or later v Linux: – Red Hat Enterprise Linux version 4, with update 7, or later – Red Hat Enterprise Linux version 5, with update 2, or later – SUSE Linux Enterprise Server 10, with service pack 2, or later v IBM i – IBM i V5R4M5 and Resave F, or later – IBM i 6.1 and Resave F, or later – IBM i 6.1.1 and Resave A, or later, is required for Dual Storage IOA support. This adapter requires the following drivers: v AIX: devices.pci.1410bd02 device driver package v Linux: – iprutils version 2.4.1 and ipr driver version 2.0.11.6 (or later) for RHEL4 kernels – iprutils version 2.4.1 and ipr driver version 2.2.0.2 (or later) for RHEL5 kernels – iprutils version 2.4.1 and ipr driver version 2.2.0.2 (or later) for SLES10 kernels If you are installing a new feature, ensure that you have the software required to support the new feature and that you determine if there are any existing prerequisites for this feature and attaching devices. To check for the prerequisites, use the IBM Prerequisite website at http://www-912.ibm.com/e_dir/eServerPrereq.nsf . Concurrent maintenance procedure Concurrent maintenance of this double-wide adapter is not supported through the Hardware Management Console (HMC). Concurrent maintenance must be done from within the partition operating system. In IBM i, the Hardware Service Manager (HSM) of the system or owning partition automatically turns off the power or turns on the power to both PCI slots when either slot is selected. In AIX or Linux, you must manually turn off the power and turn on the power to each slot separately. Important: v Both PCI slots must be powered off when installing or removing this adapter with the system power turned on. v If this adapter is the load source IOA, or any other storage IOA with critical DASD attached for the system, this concurrent maintenance procedure should be done by a qualified service provider. Removal and replacement procedures for the 8233-E8B and 8236-E8C 133 Power supply Learn about removing and replacing a power supply in a server. Learn about removing and replacing a power supply, fan, or input power distribution assembly in a 5802 or a 5877 expansion unit. Removing the power supply with the system power off If your system is managed by the Hardware Management Console (HMC), use the HMC to remove a power supply. For instructions, see Removing a part by using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove a power supply from the server. For instructions, see Removing a part by using the Systems Director Management Console. If your system is not managed by an HMC or SDMC, complete the following steps to remove a power supply from a system while the system power is turned off: 1. Perform the prerequisite tasks, as described in Before you begin. 2. Identify the failing part, as described in Identifying a failing part. 3. If the system is running, stop the system as described in Stopping the system or logical partition. 4. Disconnect the power source from the system by unplugging the system, as described in Disconnecting the power cords. Note: Before continuing with this procedure, ensure that all power sources to the system have been disconnected. (L003) or 134 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 5. Open the front rack door. 6. Remove the front cover, as described in Removing the front cover from an 8233-E8B or 8236-E8C system. 7. Disconnect the power cable (A) from the front of the power supply as shown in Figure 73. 8. Press the locking lever (B) down into the open position. 9. Pull the power supply out of the system. Figure 73. Removing the power supply from a rack-mounted model Removal and replacement procedures for the 8233-E8B and 8236-E8C 135 If you want to replace the power supply that you removed, see Replacing the 8233-E8B or 8236-E8C power supply with the power turned off. Removing the power supply with the system power on If your system is managed by the Hardware Management Console (HMC), use the HMC to remove a power supply. For instructions, see Removing a part by using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove a power supply from a server. For instructions, see Removing a part by using the Systems Director Management Console. If your system is not managed by an HMC or SDMC, complete the following steps to remove a power supply from a system while the system power is turned on: 1. Perform the prerequisite tasks, as described in Before you begin. 2. Open the front rack door. 3. Remove the front cover, as described in Removing the front cover from an 8233-E8B or 8236-E8C system. 4. Identify the failing part, as described in Identifying a failing part. 5. Disconnect the power cable (A) from the front of the power supply as shown in Figure 74 on page 137. 6. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 7. Press down on the locking latch (B) as shown in Figure 74 on page 137. Using the locking handle, pull the power supply out of the system. 136 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures If you need to replace the power supply that you removed, see Replacing the 8233-E8B or 8236-E8C power supply with the power turned on. Replacing the power supply with the system power off If your system is managed by the Hardware Management Console (HMC), use the HMC to replace a power supply. For instructions, see Exchanging a part by using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace a power supply in the server. For instructions, see Replacing a part by using the Systems Director Management Console. If your system is not managed by an HMC or SDMC, complete the following steps to replace a power supply in a system while the system power is turned off: 1. To remove the power supply, see Removing the 8233-E8B or 8236-E8C power supply with the power turned off. 2. Open the rear rack door. 3. Push the power supply into the system until the latch (A) locks in place as shown in Figure 75 on page 138. Figure 74. Removing the power supply Removal and replacement procedures for the 8233-E8B and 8236-E8C 137 4. Reconnect the power cable (B) to the front of the power supply, as shown in Figure 75. 5. Reconnect the power cable to the back of the system. 6. Start the system, as described in Starting the system or logical partition. 7. Note the state of the light-emitting diodes (LEDs) on the power supply. For more information, see Power supply LEDs . If the LEDs indicate the power supply is operating normal, that is both of the green LEDs are on solid (not flashing), continue to the next step. If not, remove the power supply from the system and repeat the procedure. If, after repeating the procedure, the power supply is not operating normally, contact your service provider. 8. Replace the front cover, as described in Installing the front cover on an 8233-E8B or 8236-E8C system. 9. Close the rear rack door. Figure 75. Replacing a power supply 138 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Replacing the power supply with the system power on If your system is managed by the Hardware Management Console (HMC), use the HMC to replace a power supply. For instructions, see Exchanging a part by using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace a power supply in the server. For instructions, see Replacing a part by using the Systems Director Management Console. If your system is not managed by an HMC or SDMC, complete the following steps to replace a power supply in a system while the system power is turned on. Important: You must have a functioning power supply in the system to use the following steps. 1. If you are replacing a redundant power supply because of a failure, remove the failing power supply. For instructions, see Removing the 8233-E8B or 8236-E8C power supply with the power turned on. 2. Perform the prerequisite tasks, as described in Before you begin. 3. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 4. Push the power supply into place until the locking latch (A) snaps into place as shown in Figure 76 on page 140. Removal and replacement procedures for the 8233-E8B and 8236-E8C 139 5. Reconnect the power cable (B) to the front of the power supply. 6. Replace the front cover, as described in Installing the front cover on an 8233-E8B or 8236-E8C system. 7. Close the front rack door. RAID enablement card Use this procedure to service the RAID enablement card. Preparing the system To prepare the system for the removal of a RAID enablement card, perform the following steps: 1. Perform prerequisite tasks as described in “Before you begin” on page 115. 2. Identify the auxiliary cache card that you need to remove. For instructions, see Identifying a part. 3. Stop the system. For instructions, see Stopping a system or logical partition. 4. Disconnect the power source from the system by unplugging the system. Attention: You must disconnect the power source from the system by disconnecting all power cords to prevent system damage during this procedure. Note: This system might be equipped with a second power supply. Before continuing with this procedure, ensure that all power sources to the system have been completely disconnected. (L003) Figure 76. Replacing a power supply 140 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures or 5. If you have a rack-mounted system, place it in the service position. See Placing a 8233-E8B or 8236-E8C in the service position. 6. Remove the access cover. See Removing the service access cover from an 8233-E8B or 8236-E8C system. 7. Attach the wrist strap Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. Removing the RAID enablement card To remove a RAID enablement card, perform the following steps: 1. Locate the RAID enablement card you want to remove. 2. Press the tab (B) and slide the card out of the system as shown in the following figures. The RAID enablement card is attached to a metal plate (A). You will slide the metal plate out of the system to remove the RAID enablement card. Removal and replacement procedures for the 8233-E8B and 8236-E8C 141 3. Replace the RAID enablement card. For instructions, see Replacing the model 8233-E8B Redundant Array of Independent Disks enablement card. Replacing the Redundant Array of Independent Disks enablement card You must have already completed the procedure Removing the model 8233-E8B Redundant Array of Independent Disks enablement card. To replace a RAID enablement card, complete the following steps: 1. Locate the slot for the RAID enablement card Figure 77. RAID enablement card removal from a rack-mounted system unit 142 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 2. Align the RAID enablement card with the slot in the system. 3. Slide the RAID enablement card fully into the system until the tab (A) locks into place. Figure 78. Slot location for the RAID enablement card installation Figure 79. RAID enablement card replacement in a rack-mounted system unit Removal and replacement procedures for the 8233-E8B and 8236-E8C 143 4. Replace the service access cover. See Installing the service access cover on an 8233-E8B or 8236-E8C system 5. Place the system in the operating position. See Placing the 8233-E8B or 8236-E8C in the operating position. 6. Start the system. For instructions, see Starting the system or logical partition. 7. If you replaced the RAID enablement card as part of another procedure, return to that procedure now. If you replaced the RAID enablement card because it was not operational, verify that the new resource is functional. For instructions, see Verifying the installed part. System backplane Use this procedure to service the system backplane. Note: Before you begin this procedure, make sure that you have authority to log in as an authorized service provider. The different authority levels are described at ASMI authority levels. Removing the system backplane 1. Access the ASMI The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, see Accessing the Advanced System Management Interface. When you have accessed the ASMI, continue with the next step. 2. Check and record the server firmware and service processor settings 1. Check and record the admin user ID's password. 2. Check and record the server firmware level prior to replacing this field replaceable unit (FRU). You might need to update the system to the latest server firmware code level after you replace this FRU. 3. Check the service processor settings that you previously set using Advanced System Management (ASM). This can be done by using a management console or a PC to access ASM. You need to reset these service processor settings after you replace this FRU or the default settings will be used. 4. Record any service processor settings you might have set by using operating system commands: v System Name setting v System Power Control settings v System Service Aids settings v System Configuration settings v Network Services settings v Performance Option settings v Login Profile settings v Boot List settings v Host Ethernet Adapter settings 3. Power off the unit If the unit that contains the backplane is powered on, power it off now. 4. Disconnect the power cords 144 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures To disconnect the power cords, complete the following steps: 1. Open the rear rack door. 2. Disconnect all power cords from the unit you are servicing. 5. Remove the front cover 1. Open the front rack door. 2. Remove the two thumbscrews (A) that secure the front cover to the system unit. 3. Push the cover release latches (C) in the direction of the arrow to release the cover from the system unit. 4. Pull the cover out and away from the system unit. 6. Disconnect the external cables Label and disconnect the external cables connected to the FRU you are servicing. 7. Place the unit into the service position Removal and replacement procedures for the 8233-E8B and 8236-E8C 145 Attention: When placing the model rack into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. Remove two screws (A) securing the system unit (B) to the rack. 8. Place the unit into the service position (continued) Notes: 1. Ensure that the cables at the rear of the system unit do not catch or bind as you pull the unit forward in the rack. 2. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. While holding the system-unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 9. Remove the service access cover 146 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Loosen the two thumbscrews (A) located at the rear of the cover. 2. Slide the cover (B) toward the rear of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Attention: For proper cooling and airflow, replace the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 10. Remove both power supplies 1. Disconnect the power cable (A) from the power supply you are removing. 2. While pushing the lever (B) down, pull the power supply straight out of the system. 3. Repeat these steps to remove the other power supply. 11. Remove the front fan Removal and replacement procedures for the 8233-E8B and 8236-E8C 147 1. While squeezing the latch (A) on the front right fan, grasp the fan firmly and pull the fan out of the fan cage. 2. Repeat these steps until all system fans are removed from the fan cage. 12. Remove the air baffle Grasp the right side air baffle and pull it out of the fan cage. 13. Remove the fan cage 148 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Loosen the two retaining thumbscrews (A) that secure the fan cage to the system board. 2. Lift the fan cage (B) up and out of the enclosure. 14. Remove the disk unit cage 1. Disconnect the cables from the rear of the disk unit backplane. 2. Rotate the handles outward in the direction shown to release the disk unit cage. 3. Slide the disk unit cage out of the system unit. 15. Remove the vital product data (VPD) card Removal and replacement procedures for the 8233-E8B and 8236-E8C 149 Remove the VPD card (B) from the VPD card guide (A) on the planar by pulling the blue handle. 16. Remove the thermal power management device (TPMD) card, if present 1. Push the tab (A) in the direction shown to release the card (B). 2. Gently pull the card straight out of its slot on the system backplane. 17. Remove the daughter RAID card, if present 150 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Push the latch (A) in the direction shown to release the card. 2. Gently pull the card (B) straight out of the enclosure. 18. Remove the base RAID card, if present 1. Push the tab (B) in the direction shown to release the card. 2. Gently pull the card (A) straight out of the enclosure. 19. Remove all PCI adapters Removal and replacement procedures for the 8233-E8B and 8236-E8C 151 1. Record the slot number and location of each adapter being removed. 2. Slide the adapter latch (A) in the direction shown to release the adapter (B). 3. Carefully grasp the PCI adapter by its top edge or upper corners, and pull the adapter straight out of its connector on the system backplane. 4. Store the adapter in a safe place. 5. Repeat these steps until all the PCI adapters are removed. 20. Remove the GX adapter, if present 1. Slide the adapter latch (A) in the direction shown to release the adapter. 2. Squeeze the latches (B) to release them from the adapter and rotate them in the direction shown. 3. Gently pull the adapter straight out of the system. 4. Repeat these steps to remove a second GX adapter, if present. 21. Remove the Ethernet card 152 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Slide the adapter latch (A) in the direction shown to release the Ethernet card (B). 2. Carefully grasp the Ethernet card by its top edge or upper corners, and pull the card straight out of its connector on the system backplane. 22. Remove the system processor assemblies and fillers 1. Move the top of the system processor assembly handles (A) in the direction shown to release the handles. 2. Rotate the system processor assembly handles to the open position (B). 3. Pull the system processor assembly (C) out by the handles. 4. Place the system processor assembly on an electrostatic discharge (ESD) surface. 5. Repeat these steps to remove all of the system processor assemblies and fillers from the enclosure. 23. Remove the PCI adapter dividers Removal and replacement procedures for the 8233-E8B and 8236-E8C 153 1. Flex the front edge (A) of the PCI adapter divider out of the bracket and toward the rest of the divider. 2. Pull the back edge of the divider away from the retention notches (B) in the system chassis. 3. Repeat these steps for the other PCI adapter dividers that need to be removed. 24. Remove the middle system divider 1. Remove the screw (A) to unfasten the middle system divider from the system backplane. 2. Grasp the middle system divider (B) and lift it out of the system. 25. Remove the line cord assembly 154 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Remove the screw (A) located on the back of the system chassis. 2. Pull down the terra cota cable retainers that hold the power supply cables. 3. Slide the line cord assembly (B) toward the front of the system to unlock it. 4. Grasp the line cord assembly and pull it out of the system. 26. Remove the front processor support 1. Remove the five screws (A) to unfasten the front processor support. 2. Grasp the front processor support (B) and lift it out of the system. 27. Remove the system backplane Removal and replacement procedures for the 8233-E8B and 8236-E8C 155 1. Label and disconnect all Ethernet cables that are present. 2. Remove the six screws (A) as shown in the illustration. 3. Push the system backplane (B) toward the front of the system. 4. Lift the system backplane up and out of the system chassis. Note: Use care when removing the system backplane. Do not lift the system backplane by any of the attached modules. Replacing the system backplane 1. Install the system backplane Note: Use care when you replace the system backplane. 1. Carefully grasp the replacement system backplane (A) along two edges. Note: Ensure the pin holes on the system backplane are lined up correctly with the pins (B) on the system chassis. 2. Lower the system backplane at an angle, with the rear of the backplane connecting with the system chassis first. 3. Slide the system backplane toward the rear of the system chassis. 4. Tighten the six screws (C), securing the system backplane to the system chassis. 5. Reconnect all Ethernet cables that were previously removed. 2. Install the front processor support 156 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Align the pin hole (A) on the front processor support with the chassis guide pin on the system backplane. 2. Insert and tighten the five screws (B) to secure the front processor support. 3. Install the line cord assembly 1. Place the line cord assembly (A) into position, ensuring that the slots (B) on the line cord assembly align with the pins on the system chassis. 2. Slide the line cord assembly toward the rear of the system to lock it into place. 3. Ensure that the terra cota cable retainers that hold the power supply cables are connected. 4. Insert and tighten the screw (C) to secure the line cord assembly. 4. Install the middle divider Removal and replacement procedures for the 8233-E8B and 8236-E8C 157 1. Align the middle system divider (A) in between the front and rear processor supports. 2. Slide the middle system divider down and into place. 3. Insert and tighten the screw (B) to secure the middle system divider. 5. Install the PCI adapter dividers 1. Carefully grasp the PCI adapter divider by its top edge and align the rear edge of the divider with the retention notches (A) in the system chassis. 2. Insert the front edge of the divider (B) into the slots, and then press the divider into place. 3. Repeat these steps for the other PCI adapter dividers that need to be installed. 6. Install the system processor assemblies and fillers 158 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Rotate the system processor assembly handles (A) to the open position. 2. Align the system processor assembly (B) with the slot in the system frame. 3. Carefully push on the system processor assembly until you feel it lock into place. 4. Rotate the handles to the locked position (C). 5. Repeat these steps to install all of the system processor assemblies and fillers. 7. Install the Ethernet card 1. Carefully grasp the Ethernet card (A) by its top edge, and align the card with the expansion slot and its connector on the system backplane. 2. Press the Ethernet card firmly into its connector. 3. Slide the adapter latch (B) back into place in the direction shown to secure the Ethernet card. 8. Install the GX adapter card, if previously removed Removal and replacement procedures for the 8233-E8B and 8236-E8C 159 1. Align the adapter with the expansion slot and its connector on the system backplane. 2. With the latches (A) in the open position, gently push the adapter straight into the system. 3. Rotate the latches in the direction shown until they latch. 4. Slide the adapter latch (B) back into place in the direction shown to secure the adapter. 5. Repeat these steps to install a second GX adapter, if previously removed. 9. Install all PCI adapters 1. Carefully grasp the adapter (A) by its top edge, and align the adapter with the expansion slot and its connector on the system backplane. 2. Press the adapter firmly into its connector. 3. Slide the adapter latch (B) in the direction shown to secure the adapter. 4. Repeat these steps until all PCI adapters are installed. 10. Install the base RAID card, if previously removed 160 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Gently push the card (B) in the direction shown straight into the enclosure. 2. Push the tab (A) in the direction shown to fit the card into its position. 11. Install the daugther RAID card, if previously removed 1. Gently push the card (A) in the direction shown straight into the enclosure. 2. Push the latch (B) in the direction shown to tighten the card into its position. 12. Install the TPMD card, if previously removed Removal and replacement procedures for the 8233-E8B and 8236-E8C 161 1. Gently push the card (A) straight into its slot on the system backplane. 2. Push the tab (B) in the direction shown to fit the card into its position. 13. Install the VPD card 1. Insert the VPD card (B) into the VPD card guide (A). 2. Push until the VPD card is fully seated. 14. Install the disk unit cage 162 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Slide the replacement disk unit cage into the system until you feel it lock. 2. Rotate the handles in the direction shown to secure the disk unit cage. 3. Reconnect the cables on the rear of the disk unit backplane. 15. Install the fan cage 1. Lower the fan cage (A) into its location on the system board, aligning the locator pins through the holes. 2. Tighten the two retaining thumbscrews (B) securing the fan cage to the system board. 16. Install the air baffle Removal and replacement procedures for the 8233-E8B and 8236-E8C 163 1. Lower the air baffle into its location in the fan cage, ensuring that the arrows on the air baffle point toward the front of the system. 2. Push down on the air baffle until it locks into place. 17. Install the front fan 1. Lower the fan into its location in the fan cage. 2. Push down on the fan until it locks into place. 18. Install both power supplies 164 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. While pushing the lever (A) down, carefully slide the power supply into its location in the system. 2. Connect the power cable (B) to the power supply. 3. Repeat these steps to install the other power supply. 19. Install the service access cover 1. Place the service access cover (A) on the top of the system, about 25 mm (1 in.) from the front of the system. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the rear of the service access cover with the two holes on the rear of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 20. Place the unit in the operating position Removal and replacement procedures for the 8233-E8B and 8236-E8C 165 1. To unlock the rail safety latches (B), slide the latches toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches lock into position. Note: Ensure that the cables at the rear of the system do not catch or bind as you push the system unit back into the rack. 21. Place the unit in the operating position (continued) Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 22. Install the front cover 166 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Position the cover on the front of the system unit so that the two thumbscrews (C) align with the screw holes on the front of the system unit (A). 2. Push the cover release latches (B) in the direction of the arrows to attach the cover to the front of the system unit as shown in the figure. 3. Tighten the thumbscrews. 4. Close the front rack door. 23. Reconnect the external cables Reconnect the external cables, which you previously removed, to the FRU you are servicing. 24. Connect the power cords If you removed the power cords, connect all power cords to the unit you are servicing. 25. Power on the system Removal and replacement procedures for the 8233-E8B and 8236-E8C 167 26. Restore network connectivity The installed service processor must connect to the network to be accessible from the management console. How will the IP address of the service processor be managed? Obtain an IP address automatically from a Dynamic Host Configuration Protocol (DHCP) server Configure a static IP address ? Go to step 28. 27. Restore service processor settings Note: If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address, and uses the Hardware Management Console (HMC) as the DHCP server, no additional configuration is necessary to enable network access to the service processor. To enable network access to the service processor, complete the following steps. 1. Ensure that the service processor is connected to the existing service network by verifying that the HMC cable is connected to the HMC port on the service processor assembly. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Can you access the service network? Yes No Go to step 30 on page 169. Go to step 33 on page 169. 28. Restore service processor settings If the network connection uses static IP address assignments, complete the following steps: 1. Connect a client with a Web browser directly to the service processor network port using one of the following URLs: v https://169.254.2.147 v https://169.254.3.147 Note: For additional information on IP address assignments, see Accessing the ASMI using a PC or notebook and Web browser. 2. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. 3. Log in to Advanced System Management (ASM) with the user ID admin and the default password admin. 4. Change the admin user ID's password and the general user ID's password when prompted. 5. Configure network access using the static IP address. 29. Restore service processor settings With the network connection now configured to use static IP address assignments, access the service network. 168 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Can you access the service network? Yes No ? Go to step 34 30. Access the ASMI The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, see Accessing the Advanced System Management Interface. When you have accessed the ASMI, continue with the next step. 31. Set System Enclosure Type. The enclosure serial number is a seven-digit number printed on a bar-coded label on the system unit. When the system backplane is replaced, the seven-digit enclosure serial number and feature code number needs to be manually inputted and stored in the new system backplane using the Advanced System Management Interface (ASMI). 32. Go to step 36. 33. Restore service processor settings If the network connection uses DHCP to establish an IP address, but does not use a management console as the DHCP server, do the following steps: 1. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address to the service processor. 2. Ensure that the service processor is connected to the existing service network by verifying that the network cable is connected to the network port on the service processor assembly. 3. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Can you access the service network? No Yes ? Go to step 30. 34. Restore service processor settings You have indicated that you are still not able to access the service network. You will need to reset the service processor network interfaces (and passwords) by moving the reset jumpers. 35. Go to step 26 on page 168. 36. Restore service processor settings Removal and replacement procedures for the 8233-E8B and 8236-E8C 169 Change the admin user ID's password when prompted by following these steps: 1. In the navigation area of ASM, expand Login Profile. 2. Select Change Password. 3. Specify the required information, and click Continue. Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the administrator ID is admin. After your initial login to ASM, the administrator passwords must be changed. As a security measure, you are required to enter the password for your user ID in the Current password for current user field. 37. Restore service processor settings Set the system name. The system name can be changed to any valid ASCII string. It does not have to follow the initialized machine type, model number, and serial number format. To change the system name, complete the followings steps: 1. In the navigation area of ASM, expand System Configuration. 2. Select System Name. 3. Enter the desired system name using the previous naming convention. 4. Click Save settings to update the system name to the new value. The new system name is displayed in the status frame, the area where the logout button is located. If another method, such as the management console, is used to change the system name, the status frame does not reflect the change. 38. Restore service processor settings Set the time of day by completing these steps: 1. In the navigation area of ASM, expand System Configuration. 2. Select Time of Day. If the system is powered off, the right pane displays a form that shows the current date (month, day, and year) and time (hours, minutes, and seconds). 3. Change the date value, the time value, or both, and click Update Time Of Day. 39. Restore service processor settings. Verify if the server firmware level is same as the value recorded prior to replacing the system backplane. If the value is different then it must be fixed. For information on how to fix the firmware, see Getting firmware fixes. 40. Restore service processor settings Reenter any of the following settings that you previously changed through ASM, unless you want to use the default settings: 1. System Power settings 2. ASMI Service Aids settings 3. System Configuration settings 4. Network Services settings 5. Login Profile settings 41. Restore service processor settings 170 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Reset any service processor settings that you might have set by using the operating system commands. You recorded these settings at the beginning of this procedure. 2. If you reset the management console access password, use one of the following methods: For HMC: a. From the HMC GUI (preferred method): 1) Expand the Systems Management folder in the navigation tree pane. 2) Double click the Servers folder. 3) Select a server in the central panel. 4) Under the Operations folder, select Change Password. 5) Provide the desired information and click OK. b. From the HMC command line, type chsyspwd -m managed system -t access --passwd --newpasswd newpassword where: v The value for managed systemis the new service processor's managed system name. v No value for --passwd is entered thereby allowing authentication. v The value for newpassword is the new password value. For SDMC: a. Go to the Resources tab and click Hosts. b. Select a server in the right pane. c. On the Actions menu, click Operations > Change Password. d. Provide the desired information and click OK. 42. Recover partition data on the service processor The profile data that is stored in the managed server has been cleared or corrupted. To recover profile data by using the HMC, complete the following steps: 1. Expand the Systems Management folder in the navigation tree pane. 2. Double click the Servers folder. 3. Use the check box to select a server, in the central panel. 4. In the Configuration folder if you have the Manage Partition Data folder, select Restore. To recover profile data using the SDMC, complete the following steps: 1. Go to the Resources tab and click Hosts. 2. Select a server in the right pane. 3. On the Actions menu, click System Configuration > Manage Partition Data > Restore. 4. Select the desired restore option and click OK. 43. Update vital product data Removal and replacement procedures for the 8233-E8B and 8236-E8C 171 If your system is running IBM i and is not managed by the management console , you must update the load source after replacing the system backplane. To update the load source without using the management console, do the following steps: 1. Load the I_BASE_01 Licensed Internal Code optical media into the device that is defined for the system and access it to perform an initial program load (IPL). Note: This IPL can take several minutes. 2. When the Select a Language Group display is shown, press Enter to select the default language feature 2924 (English). Note: To change the language feature, type the language feature that appears on the media that contains the operating system, and then press Enter. Confirm the language feature by pressing Enter. 3. From the Install Licensed Internal Code display, select option 2 (Work with Dedicated Service Tools (DST)) and press Enter. Use the Dedicated Service Tools to make sure the disk units, including the load source, are reporting correctly and that there are no errors that might prevent the IPL from disk. When all errors have been resolved and the load source disk is reporting in correctly, continue with the next step. 4. From the Work with Dedicated Service Tools (DST) display, select option 2 (Work with disk units) and press Enter. 5. Select option 8 (Update system vital product data) to identify the location of the load source disk unit. Press Enter. The Update System Vital Product Data display appears. 6. Press Enter to confirm that you want the vital product data to be written. 7. Press F12 (Previous) twice. 8. Press 1 and press Enter to exit DST. Use the system control panel to delay turning off the system. Change the IPL mode to B, and power on the system to perform an IPL from disk. 44. Restore boot list and Host Ethernet Adapter settings The boot list is not present on a new system backplane, therefore; it is necessary to input the boot devices by booting into the system management services (SMS) menus. See Selecting boot options for information on setting the boot device list. After the operating system is booted, the bootlist command can be used to verify, or make additional changes to, the list of boot devices for a system or partition. To restore the Host Ethernet Adapter settings, see Configuring physical ports on a Host Ethernet Adapter. 45. Verify the repair Go to Verifying a repair. System processor assembly Use this procedure to service the control panel or signal cable. Removing the system processor assembly If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the system processor assembly from the server. For instructions, see Removing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the system processor assembly from the server. For information about using the HMC to remove the processor voltage regulator module, see Removing a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to remove either the system processor assembly or the filler for the system processor assembly: 172 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Perform the prerequisite tasks listed in “Before you begin” on page 115. 2. If necessary, identify the system processor assembly that you need to replace. See Identifying a part. 3. Stop the system or logical partition. See Stopping a system or logical partition. 4. Place the system in the service position. See Placing a 8233-E8B or 8236-E8C in the service position. 5. Remove the service access cover. See Removing the service access cover from an 8233-E8B or 8236-E8C system. 6. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 7. Disconnect the power cables from the primary power supply and secondary power supply if present. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 8. As shown in Figure 80 on page 174, unlock the latches (A) on the system processor assembly. Using both hands, grasp the unlocked latches (B) to pull the system processor assembly (C) through the guide rails away from the system backplane assembly. Removal and replacement procedures for the 8233-E8B and 8236-E8C 173 9. Place the system processor assembly on an acceptable ESD surface. 10. If necessary, remove the memory modules to reinstall them in the new system processor assembly. See Memory modules. Replacing the system processor assembly To complete this procedure, you must have already completed the removal procedure. For the removal procedure instructions, see Removing the 8233-E8B or 8236-E8C system processor assembly. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the system processor assembly. For instructions, see Replacing a part using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), you can use the HMC to replace the system processor assembly. For information about using the HMC to replace voltage regulator modules, see Exchanging a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to replace the system processor assembly: Attention: Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge from damaging your hardware. When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. If you do not have a wrist strap prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 1. If necessary, remove the system processor assembly from its antistatic package. Figure 80. Removing system processor assembly from the backplane assembly in a rack-mounted system 174 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 2. Move the existing memory modules or memory fillers in to the new system processor assembly. Use the same configuration that you had before you removed the memory modules. See the memory module information you printed for this procedure. If you did not print the memory module topic, see Memory modules. 3. With the latches (A) in the unlocked position, correctly align the system processor assembly (B) and slide it through the guide rails into the system backplane assembly. 4. Move the latches (A) in the direction of the arrow to lock the system processor assembly into position. 5. Replace the service access cover. For instructions, see Installing the service access cover on an 8233-E8B or 8236-E8C system. 6. Place the system in the operating position. For instructions, see Placing the 8233-E8B or 8236-E8C in the operating position. 7. Restore power to the system by reconnecting the system power cables. 8. Start the system or logical partition. For instructions, see Starting the system or logical partition. 9. Verify that the system processor assembly is functional. For instructions, see Verifying the installed part. 10. Do one of the following actions: v If you received new activation codes for your feature, see Activate Capacity on Demand or Advanced POWER Virtualization for more information about entering the new code. v If you did not receive a new activation code but one is required, see Order Capacity on Demand to enter the new code order. After you receive your new activation code, see Activate Capacity on Demand or Advanced POWER Virtualization for more information about entering the new code. To search for an activation code for a specific system, see Capacity on Demand activation codes. Figure 81. Installing the system processor assembly into the backplane assembly of a rack-mounted model Removal and replacement procedures for the 8233-E8B and 8236-E8C 175 System VPD card Use this procedure to service the system vital product data (VPD) card. Attention: The system VPD card contains data that is vital to system operation: v Machine type, model, and serial number v System brand v System unique ID v Activation codes for Capacity on Demand (CoD) and Virtualization Engine technologies, if applicable. If the VPD card is replaced, this data must be programmed into the replacement VPD card. If the system is still functional prior to the replacement of the VPD card, view and record the system unique ID. New, replacement activation codes for CoD and Virtualization Engine technologies must be generated. The original activation codes for the system cannot be reused. Contact your next level of support for new activation codes. Note: If you plan to replace the VPD card and have the replacement card available, you can contact the following e-mail address from 8 a.m. to 5 p.m. U.S. Central Time, Monday through Friday, to obtain replacement activation codes prior to repair. You must supply machine type and serial number, and the part number and serial number of the replacement VPD card. For System p systems contact pcod@us.ibm.com, and for System i® systems contact icod@us.ibm.com. To exchange the VPD card, perform the following procedure: 1. Access the ASMI The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, see Accessing the Advanced System Management Interface. When you have accessed the ASMI, continue with the next step. 2. Save the system identifiers 1. On the ASM Welcome pane, specify your user ID and password, and click Log in. Note: To perform this operation, your authority level must be administrator or authorized service provider. 2. In the navigation area, expand System Configuration ? Program Vital Product Data. 3. Select System Brand. 4. In the right pane, manually record the value for the System Brand. 5. Under Program Vital Product Data, Select System Keywords. 6. In the right pane, manually record the values for the machine type and model, system serial number, system unique ID, and (if applicable) activation codes for Capacity on Demand (COD) and Virtualization Engine technologies. 3. Power off the unit If the unit that contains the VPD card is powered on, power it off now. 4. Disconnect the power cords 176 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures To disconnect the power cords, complete the following steps: 1. Open the rear rack door. 2. Disconnect all power cords from the unit you are servicing. 5. Remove the front cover 1. Open the front rack door. 2. Remove the two thumbscrews (A) that secure the front cover to the system unit. 3. Push the cover release latches (C) in the direction of the arrow to release the cover from the system unit. 4. Pull the cover out and away from the system unit. 6. Place the unit into the service position Removal and replacement procedures for the 8233-E8B and 8236-E8C 177 Attention: When placing the model rack into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. Remove two screws (A) securing the system unit (B) to the rack. 7. Place the unit into the service position (continued) Notes: 1. Ensure that the cables at the rear of the system unit do not catch or bind as you pull the unit forward in the rack. 2. When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. While holding the system-unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. 8. Remove the service access cover 178 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Loosen the two thumbscrews (A) located at the rear of the cover. 2. Slide the cover (B) toward the rear of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Attention: For proper cooling and airflow, replace the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. 9. Remove the vital product data (VPD) card Remove the VPD card (B) from the VPD card guide (A) on the planar by pulling the blue handle. 10. Install the VPD card Removal and replacement procedures for the 8233-E8B and 8236-E8C 179 1. Insert the VPD card (B) into the VPD card guide (A). 2. Push until the VPD card is fully seated. 11. Install the service access cover 1. Place the service access cover (A) on the top of the system, about 25 mm (1 in.) from the front of the system. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the rear of the service access cover with the two holes on the rear of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. 12. Place the unit in the operating position 180 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. To unlock the rail safety latches (B), slide the latches toward the front of the system. 2. Push the system unit (A) back into the rack until both system-unit release latches lock into position. Note: Ensure that the cables at the rear of the system do not catch or bind as you push the system unit back into the rack. 13. Place the unit in the operating position (continued) Replace and tighten the two screws (B) that secure the system unit (A) to the rack. 14. Install the front cover Removal and replacement procedures for the 8233-E8B and 8236-E8C 181 1. Position the cover on the front of the system unit so that the two thumbscrews (C) align with the screw holes on the front of the system unit (A). 2. Push the cover release latches (B) in the direction of the arrows to attach the cover to the front of the system unit as shown in the figure. 3. Tighten the thumbscrews. 4. Close the front rack door. 15. Connect the power cords If you removed the power cords, connect all power cords to the unit you are servicing. 16. Power on the system 17. Access the ASMI 182 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, see Accessing the Advanced System Management Interface. When you have accessed the ASMI, continue with the next step. 18. Set the system identifiers 1. On the ASM Welcome pane, if you have not already logged in, specify your user ID and password, and click Log In. Note: To perform this operation, your authority level must be administrator or authorized service provider. 2. In the navigation area, expand System Configuration ? Program Vital Product Data. 3. Click System Brand. 4. If the backed up system brand is correct, click Accept backed up setting and save. Otherwise, enter the correct system brand and click Continue. Note: If you do not know the correct system brand, contact your next level of support. 5. Click Save setting to update the system brand to the VPD. 6. Under Program Vital Product Data, click System Keywords. 7. If the backed-up system keywords are correct, click Accept backed up setting and save. Otherwise, enter the values for the machine type and model, system serial number, system unique ID, and (if applicable) activation codes for Capacity on Demand (COD) and Virtualization Engine technologies. Set the Reserved field to blank spaces unless directed otherwise by Level 4 technical support. Note: If you do not know the system unique ID, contact your next level of support. You cannot boot the system until valid values are entered for all fields. You can change these entries only once. 8. Click Save settings to update the system keywords and to save them to the VPD. 19. Verify the repair. Go to Verifying a repair. Thermal power management device Use this procedure to service the thermal power management device (TPMD). Removing the TPMD card For information about using the HMC to remove the TPMD card, see Removing a part by using the Hardware Management Console To remove the TPMD card from a system that is not managed by the HMC, complete the following steps: 1. Complete the prerequisite tasks described in “Before you begin” on page 115. 2. Stop the system, as described in Stopping a system or logical partition. 3. Disconnect all power cords from the unit you are servicing. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been disconnected. (L003) Removal and replacement procedures for the 8233-E8B and 8236-E8C 183 or 4. If you have a rack-mounted system, place it in the service position, as described in Placing a 8233-E8B or 8236-E8C in the service position. 5. Remove the service access cover, as described in Removing the service access cover from an 8233-E8B or 8236-E8C system. 6. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 7. Locate the TPMD card that is indicated by the amber fault LED being on. 8. Remove the TPMD card as follows: a. Push the tab (A) in the direction shown to release the card (B). b. Gently pull the card straight out of its slot on the system backplane. 184 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Note: The location code of the TPMD card is P1-C12 or you can identify the card with the amber fault LED on. If you removed the TPMD card as part of another service action, return to that procedure. If you need to replace the TPMD card that you removed, see Replacing the 8233-E8B or 8236-E8C TPMD card. Replacing the TPMD card Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following Web sites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Microcode downloads. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. If your system is managed by the HMC, use the HMC to complete the steps for replacing the TPMD card in the server. For information about using the HMC to replace the TPMD card, see Replacing a part by using the Hardware Management Console. To replace a TPMD card from a system that is not managed by an HMC, complete the following steps: 1. Complete the prerequisite tasks described in “Before you begin” on page 115. 2. Attach the wrist strap. Removal and replacement procedures for the 8233-E8B and 8236-E8C 185 Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 3. If necessary, remove the TPMD card from its antistatic package. 4. Install the TPMD card as follows: a. Gently push the card (A) straight into its slot on the system backplane. b. Push the tab (B) in the direction shown to fit the card into its position. 5. Replace the service access cover, as described in Installing the service access cover on an 8233-E8B or 8236-E8C system. 6. Place the system in the operating position, as described in Placing the 8233-E8B or 8236-E8C in the operating position. 7. If you removed the power cords, connect all power cords to the unit you are servicing. 8. Verify the installed part, as described in Verifying the installed part. Voltage regulators Use this procedure to service the voltage regulator. Removing the processor voltage regulator If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the processor voltage regulator module from the server. For instructions, see Removing a feature using the Systems Director Management Console. 186 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the processor voltage regulator modules from the server. For information about using the HMC to remove the processor voltage regulator module, see Removing a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to remove a memory voltage regulator module: Note: Ensure that you keep a screwdriver handy before you begin with this procedure. 1. Perform the prerequisite tasks as described in “Before you begin” on page 115. 2. Remove the front cover. For instructions, see Removing the front cover from an 8233-E8B or 8236-E8C system 3. Open the back rack door. 4. Stop the system. For instructions, see Stopping a system or logical partition. 5. Disconnect the power source from the system by unplugging the system. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 6. Place the system in the service position. For instructions, see Placing a 8233-E8B or 8236-E8C in the service position. 7. Remove the access cover. For instructions, see Removing the service access cover from an 8233-E8B or 8236-E8C system. 8. Attach the wrist strap. Removal and replacement procedures for the 8233-E8B and 8236-E8C 187 Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 9. Remove the system processor assembly. (Skip the steps you have already completed.) For instructions, see Removing the 8233-E8B or 8236-E8C system processor assembly. 10. Remove the airflow cover by lifting the clear shield from the system. a. Loosen the screws (A). b. Slide the cover (B) from under the flange (C) and remove from the system processor assembly. 11. Locate the processor voltage regulator module to be removed from the system processor assembly. Figure 82. Processor voltage regulator module location (C1) 188 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 12. To remove the processor voltage regulator module from the system processor assembly, follow these steps: a. Grasp the latch end of both of the locked mechanical arms, and pull the locking latch (A) back to unlock, as shown in the following figure. b. Lift both of the mechanical voltage regulator arms (A) upward at the same time until they are in the vertical position, as shown in Figure 84 on page 190. c. Carefully pull up on both of the arms (A), at the same time, to lift the processor voltage regulator module from the system processor assembly. Figure 83. Unlocking the processor voltage regulator module arms Removal and replacement procedures for the 8233-E8B and 8236-E8C 189 13. Place the voltage regulator module on an electrostatic discharge (ESD) surface. Replacing the processor voltage regulator If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the processor voltage regulator module. For instructions, see Replacing a feature using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), you can use the HMC to replace the processor voltage regulator module. For information about using the HMC to replace voltage regulator modules, see Exchanging a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to replace a processor voltage regulator module: 1. Remove the voltage regulator module from its antistatic package. Locate the connectors in the system processor assembly to place the module. Figure 84. Removing the processor voltage regulator module from the system 190 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 2. Open both of the locking arms on the processor voltage regulator module, at the same time, until they extend straight up from the voltage regulator module. 3. To replace the voltage regulator module, complete the following steps: Note: When reinstalling processor voltage regulator module, ensure that the DIMM latches are fully engaged and latched to the DIMMs or blanks to avoid possible damage to the DIMM latches. a. Lift the voltage regulator module by both of the arms (A). Hold the voltage regulator module above the system processor assembly, aligning the connectors of the voltage regulator module with the connectors on the system processor assembly. b. Align the holes in the processor voltage regulator module (B) with the large mechanical pins (C) on the system processor assembly. Slide the voltage regulator module down onto the system processor assembly until the arms touch the top of the pins. Figure 85. Processor voltage regulator module shown above the location in the system processor assembly Removal and replacement procedures for the 8233-E8B and 8236-E8C 191 c. Use both hands to apply downward force on all four corners of the processor voltage regulator module, as shown by the arrows. When the arms are in the half open position, the voltage regulator module is in place. d. Using one finger on each hand, apply pressure downward on both processor voltage regulator arms at the same time, to move to the close position. Press down at the end of each arm A until they reach the locked position. 192 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 4. Install the airflow cover as shown in the following figure. a. To install the airflow cover, slide the cover (A) under the flange (B). b. Tighten the screws (C). The memory voltage regulator is locked into place. 5. Replace the system processor assembly. (Skip the steps you have already completed.) For instructions, see Replacing the 8233-E8B or 8236-E8C system processor assembly 6. Replace the service access cover: For instructions, see Installing the service access cover on an 8233-E8B or 8236-E8C system. 7. Place the system in the operating position. For instructions, see Placing the 8233-E8B or 8236-E8C in the operating position. Removal and replacement procedures for the 8233-E8B and 8236-E8C 193 8. Replace the front cover. For instructions, see Installing the front cover on an 8233-E8B or 8236-E8C system. 9. Reconnect all power and signal cables to their respective connectors. 10. Close the back rack door or the back system door. 11. Start the system. For instructions, see Starting the system or logical partition. 12. Verify that the new resource is working correctly. For instructions, see Verifying the installed part. Removing the memory voltage regulator If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the memory voltage regulator module from the server. For instructions, see Removing a feature using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the memory voltage regulator modules from the server. For information about using the HMC to remove the memory voltage regulator module, see Removing a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to remove a memory voltage regulator module: Note: Ensure that you keep a screwdriver handy before you continue with this procedure. 1. Perform the prerequisite tasks as described in “Before you begin” on page 115. 2. Remove the front cover. For instructions, see Removing the front cover from an 8233-E8B or 8236-E8C system. 3. On a rack-mounted system unit, open the back rack door. 4. Stop the system. For instructions, see Stopping a system or logical partition. 5. Disconnect the power source from the system by unplugging the system. Note: This system has a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 194 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 6. Place the system in the service position. For instructions, see Placing a 8233-E8B or 8236-E8C in the service position. 7. Remove the access cover. For instructions, see Removing the service access cover from an 8233-E8B or 8236-E8C system. 8. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 9. Remove the system processor assembly. (Skip the steps you have already completed.) For more information, see Removing the 8233-E8B or 8236-E8C system processor assembly. 10. Remove the airflow cover by lifting the clear shield from the system. a. Loosen the screws (A). b. Slide the cover (B) from under the flange (C) and remove from the system processor assembly. Removal and replacement procedures for the 8233-E8B and 8236-E8C 195 11. Locate the memory voltage regulator module to be removed from the system processor assembly. Figure 86 shows the memory voltage regulator (C10) in relation to the location on the system processor assembly. 12. Remove the memory voltage regulator module (A) from the connectors, as shown in the following figure: a. Grasp the top or back edge (A) of the memory voltage regulator module to loosen from the connectors. b. Place your thumb under bottom edge (B)the voltage regulator module and lift to remove the module from the connectors. Figure 86. Memory voltage regulator module location (C10) 196 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 13. Place the voltage regulator module on an electrostatic discharge (ESD) surface. Replacing the memory voltage regulator If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the memory voltage regulator module. For instructions, see Replacing a feature using the Systems Director Management Console. If your system is managed by the Hardware Management Console (HMC), you can use the HMC to replace the memory voltage regulator module. For information about using the HMC to replace voltage regulator modules, see Exchanging a part using the Hardware Management Console. If you do not have an HMC or SDMC, complete the following steps to replace a memory voltage regulator module: 1. Remove the voltage regulator module from its antistatic package. Locate the connectors on the system processor assembly to connect the module. Figure 87. Removing the memory voltage regulator module Removal and replacement procedures for the 8233-E8B and 8236-E8C 197 2. Install the voltage regulator module by completing the following steps: a. Carefully grasp the voltage regulator module along two edges and align it with the connectors. b. Insert the voltage regulator module (A) into the connectors and firmly push down into position. 3. Install the airflow cover as shown in the following figure. a. To install the airflow cover, slide the cover (A) under the flange (B). b. Tighten the screws (C). The memory voltage regulator is locked into place. Figure 88. Memory voltage regulator module (A) shown above the voltage regulator connectors on the system processor assembly Figure 89. Installing the memory voltage regulator module 198 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 4. Replace the system processor assembly. (Skip the steps you have already completed.) For instructions, see Replacing the 8233-E8B or 8236-E8C system processor assembly 5. Replace the service access cover. For instructions, see Installing the service access cover on an 8233-E8B or 8236-E8C system 6. Place the system in the operating position. See Placing the 8233-E8B or 8236-E8C in the operating position. 7. Replace the front cover. For a rack-mounted system unit, see Installing the front cover on an 8233-E8B or 8236-E8C system. 8. Reconnect all power and signal cables to their respective connectors. 9. Close the back rack door or the back system door. 10. Start the system. See Starting the system or logical partition. 11. Verify that the new resource is working correctly. For instructions, see Verifying the installed part. Verifying a repair Use these procedures to verify hardware operation after making repairs to the system. Choose from the following options: v To verify the repair of a system that is currently powered off, go to step 1. v To verify the repair of a system that is currently powered on with no operating system loaded, go to step 3 on page 200. v To verify the repair of a system that is currently powered on and that has an operating system loaded, go to step 5 on page 200. 1. Power on the server and all attached I/O enclosures. Did all the enclosures power on? Yes: Go to step 3 on page 200.? Go to step 3 on page 200. No: Continue with the next step. Removal and replacement procedures for the 8233-E8B and 8236-E8C 199 2. Choose from the following options: v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate and replace the next field-replaceable unit (FRU). v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that an enclosure would not power on and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 3. Load the operating system. Did the operating system load successfully? Yes: Go to step 5. No: Continue with the next step. 4. Choose from the following options: v If the original problem was a failing disk drive that contained the operating system software, go to step 5. v If the original problem was that the operating system would not load and you have another FRU to replace, go to your FRU locations section to locate the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that the operating system would not load and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that the operating system would not load and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 5. Choose from the following options: v “Verifying the repair in AIX” v “Verifying the repair in Linux” on page 203 v “Verifying a repair using an IBM i system or logical partition” on page 204 Verifying the repair in AIX You can use this procedure to verify that a repair is complete using the AIX operating system. Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed. 1. Did you replace a disk drive in the root volume group? No Go to step 3 on page 201. Yes Continue with the next step. 2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM) server. 200 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Did you encounter any problems? No Reinstall the operating system and continue with step 5. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 3. Did you replace a FRU with the power turned on and concurrently with system operations? No Go to step 5. Yes Continue with the next step. 4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? Yes Go to step 6. No Go to step 7. Note: The AIX diagnostic service aid was used if a resource was removed using the Hot Plug task. 5. Note: If any FRUs have been removed that should be reinstalled, reinstall them now. Perform the following steps: 1. If the system is not powered on, power it on now. 2. Perform a slow boot. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. 4. Did you encounter any problems? No Continue at step 6. Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 6. If the Resource Repair Action menu is already displayed, go to step 9 on page 202; otherwise, perform the following steps: 1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with the next step. 7. Perform the following steps: Removal and replacement procedures for the 8233-E8B and 8236-E8C 201 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select System verification. 5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? No Continue with the next step. Yes Go to step 9. 8. Did the Testing Complete, no trouble was found menu (801010) display? Yes Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this action will set it back to the normal state. Go to step 11 on page 203. No If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on that resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? No If the No Trouble Found menu displays, go to step 11 on page 203. Yes Continue with the next step. 10. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. 202 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: 1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. 3. If the No Trouble Found menu displays, continue with the next step. 11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This information does not apply to the PCI-X RAID adapter or cache. No Go to the close of call procedure. Yes Continue with the next step. 12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the following steps: 1. On the PCI SCSI Disk Array Manager display, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. If you see a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When the recovery is complete, exit the operating system. 9. Go to the close of call procedure. Verifying the repair in Linux You can use this procedure to verify that a repair is complete using the Linux operating system. Removal and replacement procedures for the 8233-E8B and 8236-E8C 203 1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM) server. See Running the stand-alone diagnostics from CD-ROM. Did you encounter any problems? No Reboot the operating system and continue with the close of call procedure. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis and repair the new problem. Verifying a repair using an IBM i system or logical partition Use this procedure to verify a repair using the IBM i operating system. 1. Was the system powered off during the repair? Yes: Continue with the next step. No: Continue with step 3. 2. Perform the following tasks: a. Verify that the power cable is plugged into the power outlet. b. Verify that power is available at the customer's power outlet. 3. Was the partition powered off during the repair? Yes: Continue with the next step. No: Continue with step 6. 4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type mode and speed options in the Service functions). 5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the system complete the IPL? Yes: Continue with the next step. No: This might be a new problem. Go to the Start of call procedure. This ends the procedure. 6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O adapter, or storage device replaced? Yes: Continue with step 10. No: Continue with the next step. 7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any reference codes that are related to this IPL? Yes: Continue with the next step. No: If the problem was related to removable media or communications, perform the verification procedures in the Service functions to verify that the problem is corrected. Then return the system to the customer and have the customer verify the system date and time. This ends the procedure. 8. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure. This ends the procedure. 9. Are there any other failing items that remain to be replaced? Yes: Replace the next failing item listed for this reference code. This ends the procedure. No: Contact your next level of support for assistance. This ends the procedure. 10. Was concurrent maintenance performed on an optical storage unit? 204 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Yes: The product activity log and service action log, in most cases, contain a reference code for the optical storage unit when concurrent maintenance is performed. You can ignore this reference code. Perform the following: - Perform the verification procedures in the Service functions topic to verify that the problem is corrected. - Return the system to the customer and have the customer verify the system date and time. This ends the procedure. No: Continue with the next step. 11. Use the service action log to look for any new reference codes (see Using the Service Action Log). Are there any new reference codes? Yes: Continue with the next step. No: Go to step 14. 12. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure to determine the cause of the problem. This ends the procedure. 13. Are there any other failing items that need to be replaced? Yes: Replace the next failing item listed for the reference code. This ends the procedure.. No: Contact your next level of support for assistance. This ends the procedure. 14. Are you working with a tape device? Yes: Perform the verification procedures in the Service functions to verify that the problem is corrected. After the verification test is complete, the tape device description will be set to the failed state because a resource change was detected. Perform the following tasks: - Vary the tape device description off and then on. - Return the system to the customer and have the customer verify the system date and time. Then go to Verifying the repair from the HMC. This ends the procedure. No: Continue with the next step. 15. Are you working with an IOP or an IOA? Yes: Use the display hardware configuration service function to check for any missing or failed hardware: - On the command line, enter the STRSST (Start System Service Tools command). If you cannot get to SST, select DST. Do not IPL the system or partition to get to DST. - On the Start Service Tools Sign On display, enter the user ID with the service authority and password. - Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System bus resources. - Select the function key for Include nonreporting resources. - If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has not been fixed. Continue to the next failing item in the failing item list. This ends the procedure. No: Perform the verification procedures in the Service functions topics to verify that the problem is corrected. Resources that usually vary on automatically during an IPL, or that were previously varied on manually, might need to be varied on again after the verification procedures are complete. Return the system to the customer and have the customer verify the system date and time. This ends the procedure. Removal and replacement procedures for the 8233-E8B and 8236-E8C 205 Closing a service call Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable events might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and no problems require additional service actions. v If the repair was done using the management console online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. Is the server managed by a management console? v Yes: Continue with the next step. v No: Do one of the following steps: – If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using Integrated Virtualization Manager” on page 210. – If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call using AIX or Linux” on page 214. 2. On the HMC, open Manage Serviceable Events and examine the service action event log for any open service action events. 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 218. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32 on page 209 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. Is the error code that is associated with this service action event the same as recorded in Step 1? 206 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 206? v Yes: Go to step 11. v No: Continue with the next step. 10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 206 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 32 on page 209. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? v Yes: Continue with the next step. v No: Go to step 17. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16. 15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of partitions obtained in step 11 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Error Associated With This Serviceable Event window. Removal and replacement procedures for the 8233-E8B and 8236-E8C 207 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. 20. Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU information. Go to step 23. 22. Select the No FRU Replaced for this Serviceable Event option. 23. Click OK to close the service action event. 24. Is the list of all partitions you recorded in step 11 on page 207 empty? v Yes: Go to step 32 on page 209. v No: Continue with the next step. 25. Does the list of all partitions you recorded in step 11 on page 207 contain more than one entry? v Yes: Continue with the next step. v No: Go to step 32 on page 209. 26. Is the error class recorded in step 25 AIX? v Yes: Continue with the next step. v No: Go to step 32 on page 209. 27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 207, except the partition you were using to debug the original problem. 28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 29. When the diagnostic operating instructions are displayed, do the following steps: 208 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 30. Exit from diagnostics in this partition and return to the AIX prompt. 31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 207 been processed? v Yes: Continue with the next step. v No: Go to step 24 on page 208 to process the next partition in the list you recorded in step 11 on page 207. 32. Have all the serviceable events recorded in step 4 on page 206 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 206 and process the next service action event in the list of serviceable events recorded in step 4 on page 206. 33. While processing all service action events, were you directed to step 14 on page 207? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 218. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 34. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 207, except the partition you were using to debug the original problem. 35. From the management console virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 36. When the diagnostic operating instructions are displayed, do the following steps: Removal and replacement procedures for the 8233-E8B and 8236-E8C 209 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 37. Exit from diagnostics in this partition and return to the AIX prompt. 38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 207 been processed? v Yes: Continue with the next step. v No: Go to step 34 on page 209 to process the next partition in the list you recorded in step 14 on page 207. 39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 218. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Closing a service call using Integrated Virtualization Manager Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. 2. On the IVM, open Manage Serviceable Events and look at existing serviceable events. 210 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 218. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 213 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in step 1 on page 210? v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 210? v Yes: Go to step 11. v No: Continue with the next step. 10. Is the FRU you replaced and recorded in step 1 on page 210 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 30 on page 213. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? Removal and replacement procedures for the 8233-E8B and 8236-E8C 211 v Yes: Continue with the next step. v No: Go to step 17. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 211. Go to step 16. 15. Add the partition list obtained in step 11 on page 211 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11 on page 211. If you are referred to the list of partitions obtained in step 11 on page 211 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Manage Serviceable Events window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. 20. Add or update FRU information: Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. Click OK to close the service action event. 22. Is the list of all partitions you recorded in step 11 on page 211 empty? v Yes: Go to step 30 on page 213. v No: Continue with the next step. 23. Does the list of all partitions you recorded in step 11 on page 211 contain more than one entry? 212 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v Yes: Continue with the next step. v No: Go to step 30. 24. Is the error class recorded in step 23 on page 212? v Yes: Continue with the next step. v No: Go to step 30. 25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 211, except the partition you were using to debug the original problem. 26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 27. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 28. Exit from diagnostics in this partition and return to the AIX prompt. 29. Have all the partitions in the list of all partitions you recorded in step 11 on page 211 been processed? v Yes: Continue with the next step. v No: Go to step 25 to process the next partition in the list you recorded in step 11 on page 211. 30. Have all the serviceable events recorded in step 4 on page 211 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 211 and process the next service action event in the list of serviceable events recorded in step 4 on page 211. 31. While processing all service action events, were you directed to step 14 on page 212? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 218. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Removal and replacement procedures for the 8233-E8B and 8236-E8C 213 32. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 212, except the partition you were using to debug the original problem. 33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 34. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 35. Exit from diagnostics in this partition and return to the AIX prompt. 36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 212 been processed? v Yes: Continue with the next step. v No: Go to step 32 to process the next partition in the list you recorded in step 14 on page 212. 37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 218. This completes the repair. Return the system to the customer. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Closing a service call using AIX or Linux If the server is not connected to an management console and not using Integrated Virtualization Manager (IVM), perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. 214 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v Ensure that server verification has been performed and that no problems require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? v Yes: Go to step 4 v No: Continue with the next step. 2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that were removed during problem analysis that you want to put back into the system? Note: If the system planar or battery has been replaced and you are loading diagnostics from a server over a network, it might be necessary for the customer to set the network boot information for this system before diagnostics can be loaded. The system time and date information should also be set when the repair is completed. v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3 v No: Continue with the next step. 3. Is the system or logical partition that you are performing a repair action on running the AIX operating system? v Yes: Continue with the next step. v No: Go to step 5. 4. Does the system or logical partition you are performing a repair action on have AIX installed? Note: Answer no to this question if you have just replaced a hard disk in the root volume group. v Yes: Go to step 7 on page 216. v No: Continue with the next step. 5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a Network Installation Management (NIM) server. Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to Running the stand-alone diagnostics from CD on a server without an HMC attached. For instructions on running stand-alone diagnostics from a NIM server, go to Running the stand-alone diagnostics from a Network Installation Management server. Did you encounter any problems? v Yes: Go to problem analysis. v No: Continue with the next step. 6. The system hardware is functioning correctly. Removal and replacement procedures for the 8233-E8B and 8236-E8C 215 If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 218. This completes the repair. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. This might require you to reboot the operating system. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 7. Complete the following steps: 1. If the system supports slow boot (see Performing a slow boot), do a slow boot on the system. If the system does not support slow boot, do a normal boot. 2. Power on the system. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. Did the AIX Login Prompt display? v Yes: Continue with the next step. v No: Go to problem analysis. 8. If the Resource Repair Action menu is already displayed, go to 12 on page 217; otherwise, do the following steps: 1. Log into the operating system, either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an system request number (SRN) displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with 9. 9. Complete the following steps: 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select Problem determination. 5. When the Advanced Diagnostic Selection menu displays, select the All Resources option, or test the FRUs you exchanged, and any devices that are attached to the FRUs you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? 216 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures v Yes: Go to step 13. v No: Continue with the next step. 10. Did the TESTING COMPLETE, no trouble was found menu (801010) display? v Yes: Continue with the next step. v No: There is still a problem. Go to problem analysis. 11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this will set it back to the normal state. Go to step 14 on page 218. 12. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following steps: 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? v Yes: Continue with the next step. v No: If the No Trouble Found menu displays, go to step 14 on page 218. 13. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Removal and replacement procedures for the 8233-E8B and 8236-E8C 217 Do the following steps: 1. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. Did the No Trouble Found menu display? v Yes: Continue with the next step. v No: Go to problem analysis. 14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This does not refer to the PCI-X RAID adapter or cache. v Yes: Continue with the next step. v No: Go to step 16. 15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following steps: 1. On the PCI SCSI Disk Array Manager dialog, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next dialog, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating system. 9. Go to 16. 16. The system hardware is functioning correctly. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. Activating and deactivating LEDs You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the management console or the Advanced System Management Interface (ASMI). 218 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures For IBM Power Systems™ servers that contain the POWER7® processor, the LEDs can be used to identify or verify a part that you are servicing. The error and identify function (amber) LED indicates an error and corresponds to the location code in the system reference code (SRC). The LED is activated and deactivated automatically. Additionally, the following procedures can also be used to activate and deactivate LEDs. v “Deactivating a system attention LED or partition LED using the management console” v “Activating or deactivating an identify LED using the management console” v “Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface” on page 220 v “Activating or deactivating an identify LED using the Advanced System Management Interface” on page 221 Deactivating a system attention LED or partition LED using the management console You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not a high priority and you decide to repair the problem at a later time. You can do this task from the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). If you want to be alerted if another problem occurs, you must deactivate the system attention LED so that it can be activated again if another problem occurs. To deactivate a system attention LED by using HMC, complete the following steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select View System Attention. The system attention LED window opens. The selected system and its LED state are displayed in the upper part of the window. The logical partition and its LED state are displayed in the lower part of the window. From the system attention LED window, you can deactivate both the system attention LED and the logical partition LED. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED. To deactivate a system attention LED by using SDMC, complete the following steps: 1. On the Welcome page, under the Resources tab, select the server. 2. Click Actions > Service and Support> Hardware> System Attention LED. 3. Click OK. Activating or deactivating an identify LED using the management console You can activate or deactivate an identify LED for components attached to the system from the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). Removal and replacement procedures for the 8233-E8B and 8236-E8C 219 The system provides several LEDs that help identify various components, such as enclosures or field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU by using the HMC, follow these steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on. 3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens. 4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select Selected ? List FRUs. 6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. To activate or deactivate an identify LED for an enclosure or FRU by using the SDMC, complete these steps: 1. On the Welcome page, under the Resources tab, select the server. 2. Click Actions > Service and Support> Hardware> Identify LED. 3. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 4. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then click List FRUs. 5. Select one or more FRUs from the table, and then click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface You can deactivate a system attention LED or a logical partition LED using the Advanced System Management Interface (ASMI). The system attention indicator provides a visual signal that the system as a whole requires attention or service. Each system has a single system attention indicator. When an event occurs that either needs your intervention or that of service and support, the system attention indicator lights continuously. The system attention indicator is turned on when an entry is made in the service processor error log. The error entry is transmitted to the system error log and the operating system error log. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To turn off the system attention indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 220 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures 2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention Indicator. 3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error message is displayed. Activating or deactivating an identify LED using the Advanced System Management Interface You can activate or deactivate an identify LED using the Advanced System Management Interface (ASMI). You can specify the location code of any indicator to view or modify its current state. If you provide the wrong location code, the advanced system manager attempts to go to the next higher level of the location code. The next level is the base-level location code for that field replaceable unit (FRU). For example, a user types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no other level is available. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To change the current state of an indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location code. 3. In the right pane, enter the location code of the FRU and click Continue. 4. Select the preferred state from the list. 5. Click Save settings. Removal and replacement procedures for the 8233-E8B and 8236-E8C 221 222 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer's representative for information on the products and services currently available in your area. Any reference to the manufacturer's product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not grant you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to websites not owned by the manufacturer are provided for convenience only and do not in any manner serve as an endorsement of those websites. The materials at those websites are not part of the materials for this product and use of those websites is at your own risk. The manufacturer may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment. Information concerning products not produced by this manufacturer was obtained from the suppliers of those products, their published announcements or other publicly available sources. This manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products not produced by this manufacturer. Questions on the capabilities of products not produced by this manufacturer should be addressed to the suppliers of those products. All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only. The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are subject to change without notice. Dealer prices may vary. © Copyright IBM Corp. 2010 223 This information is for planning purposes only. The information herein is subject to change before the products described become available. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. If you are viewing this information in softcopy, the photographs and color illustrations may not appear. The drawings and specifications contained herein shall not be reproduced in whole or in part without the written permission of the manufacturer. The manufacturer has prepared this information for use with the specific machines indicated. The manufacturer makes no representations that it is suitable for any other purpose. The manufacturer's computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check the manufacturer's support websites for updated information and fixes applicable to the system and related software. Ethernet connection usage restriction This product is not intended to be connected directly or indirectly by any means whatsoever to interfaces of public telecommunications networks. Trademarks IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Red Hat, the Red Hat "Shadow Man" logo, and all Red Hat-based trademarks and logos are trademarks or registered trademarks of Red Hat, Inc., in the United States and other countries. Other company, product, or service names may be trademarks or service marks of others. Electronic emission notices When attaching a monitor to the equipment, you must use the designated monitor cable and any interference suppression devices supplied with the monitor. Class A Notices The following Class A statements apply to the IBM servers that contain the POWER7 processor and its features unless designated as electromagnetic compatibility (EMC) Class B in the feature information. 224 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Federal Communications Commission (FCC) statement Note: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate the equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class A digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com Warning: This is a Class A product. In a domestic environment, this product may cause radio interference, in which case the user may be required to take adequate measures. Notices 225 VCCI Statement - Japan The following is a summary of the VCCI Japanese statement in the box above: This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a domestic environment, radio interference may occur, in which case, the user may be required to take corrective actions. Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) Electromagnetic Interference (EMI) Statement - People's Republic of China Declaration: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may need to perform practical action. 226 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Electromagnetic Interference (EMI) Statement - Taiwan The following is a summary of the EMI Taiwan statement above. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information: Electromagnetic Interference (EMI) Statement - Korea Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Notices 227 EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Electromagnetic Interference (EMI) Statement - Russia Terms and conditions Permissions for the use of these publications is granted subject to the following terms and conditions. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved. You may not distribute, display or make derivative works of these publications, or any portion thereof, without the express consent of the manufacturer. Commercial Use: You may reproduce, distribute and display these publications solely within your enterprise provided that all proprietary notices are preserved. You may not make derivative works of these publications, or reproduce, distribute or display these publications or any portion thereof outside your enterprise, without the express consent of the manufacturer. 228 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either express or implied, to the publications or any information, data, software or other intellectual property contained therein. The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer, the above instructions are not being properly followed. You may not download, export or re-export this information except in full compliance with all applicable laws and regulations, including all United States export laws and regulations. THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR PURPOSE. Notices 229 230 IBM Power 750 Express (8233-E8B) and IBM Power 755 (8236-E8C) removal and replacement procedures Red Hat Cluster Suite Conguring and Managing a Cluster 86 A2 35ER 00 Red Hat Cluster Suite: Conguring and Managing a Cluster Copyright © 2000-2004 by Red Hat, Inc.Mission Critical Linux, Inc.K.M. Sorenson Red Hat, Inc. 1801 Varsity Drive Raleigh NC 27606-2072 USA Phone: +1 919 754 3700 Phone: 888 733 4281 Fax: +1 919 754 3701 PO Box 13588 Research Triangle Park NC 27709 USA rh-cs(EN)-3-Print-RHI (2004-06-04T17:42) For Part I Using the Red Hat Cluster Manager and Part III Appendixes, permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free Software Foundation. A copy of the license is available at http://www.gnu.org/licenses/fdl.html. The content described in this paragraph is copyrighted by © Mission Critical Linux, Inc. (2000), K.M. Sorenson (2000), and Red Hat, Inc. (2000-2003). This material in Part II Conguring a Linux Virtual Server Cluster may be distributed only subject to the terms and conditions set forth in the Open Publication License, V1.0 or later (the latest version is presently available at http://www.opencontent.org/openpub/). Distribution of substantively modied versions of this material is prohibited without the explicit permission of the copyright holder. Distribution of the work or derivative of the work in any standard (paper) book form for commercial purposes is prohibited unless prior permission is obtained from the copyright holder. The content described in this paragraph is copyrighted by © Red Hat, Inc. (2000-2003). Red Hat, Red Hat Network, the Red Hat "Shadow Man" logo, RPM, Maximum RPM, the RPM logo, Linux Library, PowerTools, Linux Undercover, RHmember, RHmember More, Rough Cuts, Rawhide and all Red Hat-based trademarks and logos are trademarks or registered trademarks of Red Hat, Inc. in the United States and other countries. Linux is a registered trademark of Linus Torvalds. Motif and UNIX are registered trademarks of The Open Group. XFree86 is a trademark of The XFree86 Project, Inc, and is pending registration. Intel and Pentium are registered trademarks of Intel Corporation. Itanium and Celeron are trademarks of Intel Corporation. AMD, Opteron, Athlon, Duron, and K6 are registered trademarks of Advanced Micro Devices, Inc. Netscape is a registered trademark of Netscape Communications Corporation in the United States and other countries. Java and Swing are trademarks or registered trademarks of Sun Microsystems, Inc. in the U.S. or other countries. Oracle is a registered trademark, and Oracle8i, Oracle9i, and interMedia are trademarks or registered trademarks of Oracle Corporation. Microsoft andWindows are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries. SSH and Secure Shell are trademarks of SSH Communications Security, Inc. FireWire is a trademark of Apple Computer Corporation. IBM, AS/400, OS/400, RS/6000, S/390, and zSeries are registered trademarks of International Business Machines Corporation. eServer, iSeries, and pSeries are trademarks of International Business Machines Corporation. All other trademarks and copyrights referred to are the property of their respective owners. The GPG ngerprint of the security@redhat.com key is: CA 20 86 86 2B D6 9D FC 65 F6 EC C4 21 91 80 CD DB 42 A6 0E Table of Contents Acknowledgments ................................................................................................................................ i Introduction........................................................................................................................................iii 1. How To Use This Manual ....................................................................................................iii 2. Document Conventions........................................................................................................ iv 3. More to Come ...................................................................................................................... vi 3.1. Send in Your Feedback ......................................................................................... vi 4. Sign Up for Support............................................................................................................ vii I. Using the Red Hat Cluster Manager ............................................................................................ ix 1. Red Hat Cluster Manager Overview..................................................................................... 1 1.1. Red Hat Cluster Manager Features ........................................................................ 2 2. Hardware Installation and Operating System Conguration ................................................ 5 2.1. Choosing a Hardware Conguration ..................................................................... 5 2.2. Setting Up the Members ...................................................................................... 15 2.3. Installing and Conguring Red Hat Enterprise Linux ......................................... 17 2.4. Setting Up and Connecting the Cluster Hardware............................................... 21 3. Cluster Conguration.......................................................................................................... 35 3.1. Installing the Red Hat Cluster Suite Packages..................................................... 35 3.2. Installation Notes for Red Hat Enterprise Linux 2.1 Users ................................. 37 3.3. The Cluster Conguration Tool ........................................................................ 37 3.4. Conguring the Cluster Software ........................................................................ 40 3.5. Editing the rawdevices File .............................................................................. 41 3.6. Conguring Cluster Daemons.............................................................................. 42 3.7. Adding and Deleting Members............................................................................ 46 3.8. Conguring a Power Controller Connection ....................................................... 47 3.9. Conguring a Failover Domain ........................................................................... 49 3.10. Adding a Service to the Cluster ......................................................................... 51 3.11. Checking the Cluster Conguration................................................................... 53 3.12. Conguring syslogd Event Logging ............................................................... 55 4. Service Administration ....................................................................................................... 59 4.1. Conguring a Service .......................................................................................... 59 4.2. Displaying a Service Conguration..................................................................... 62 4.3. Disabling a Service .............................................................................................. 63 4.4. Enabling a Service ............................................................................................... 63 4.5. Modifying a Service............................................................................................. 63 4.6. Relocating a Service ............................................................................................ 64 4.7. Deleting a Service ................................................................................................ 64 4.8. Handling Failed Services ..................................................................................... 64 5. Database Services ............................................................................................................... 67 5.1. Setting Up an Oracle Service............................................................................... 67 5.2. Tuning Oracle Service ......................................................................................... 72 5.3. Setting Up a MySQL Service .............................................................................. 73 6. Network File Sharing Services ........................................................................................... 77 6.1. Setting Up an NFS Service .................................................................................. 77 6.2. Using the NFS Druid .......................................................................................... 77 6.3. NFS Caveats......................................................................................................... 82 6.4. Importing the Contents of an NFS Exports File .................................................. 82 6.5. NFS Conguration: Active-Active Example ....................................................... 83 6.6. Setting Up a Samba Service................................................................................. 84 6.7. Using the Samba Druid........................................................................................ 86 6.8. Fields in the smb.conf.sharename File........................................................... 90 7. Setting Up Apache HTTP Server........................................................................................ 93 7.1. Apache HTTP Server Setup Overview ................................................................ 93 7.2. Conguring Shared Storage................................................................................. 93 7.3. Installing and Conguring the Apache HTTP Server.......................................... 94 8. Cluster Administration........................................................................................................ 97 8.1. Overview of the Cluster Status Tool.................................................................. 97 8.2. Displaying Cluster and Service Status................................................................. 97 8.3. Starting and Stopping the Cluster Software......................................................... 99 8.4. Modifying the Cluster Conguration................................................................. 100 8.5. Backing Up and Restoring the Cluster Database............................................... 100 8.6. Modifying Cluster Event Logging ..................................................................... 101 8.7. Updating the Cluster Software........................................................................... 101 8.8. Changing the Cluster Name............................................................................... 102 8.9. Disabling the Cluster Software .......................................................................... 102 8.10. Diagnosing and Correcting Problems in a Cluster........................................... 102 II. Conguring a Linux Virtual Server Cluster ........................................................................... 107 9. Introduction to Linux Virtual Server................................................................................. 109 9.1. Technology Overview........................................................................................ 109 9.2. Basic Congurations.......................................................................................... 109 10. Linux Virtual Server Overview....................................................................................... 111 10.1. A Basic LVS Conguration ............................................................................. 111 10.2. A Three Tiered LVS Conguration.................................................................. 113 10.3. LVS Scheduling Overview............................................................................... 113 10.4. Routing Methods.............................................................................................. 115 10.5. Persistence and Firewall Marks ....................................................................... 117 10.6. LVS Cluster ? A Block Diagram ................................................................... 117 11. Initial LVS Conguration................................................................................................ 121 11.1. Conguring Services on the LVS Routers ....................................................... 121 11.2. Setting a Password for the Piranha Conguration Tool ............................... 122 11.3. Starting the Piranha Conguration Tool Service.......................................... 122 11.4. Limiting Access To the Piranha Conguration Tool .................................... 123 11.5. Turning on Packet Forwarding......................................................................... 124 11.6. Conguring Services on the Real Servers ....................................................... 124 12. Setting Up a Red Hat Enterprise Linux LVS Cluster...................................................... 125 12.1. The NAT LVS Cluster ...................................................................................... 125 12.2. Putting the Cluster Together ............................................................................ 127 12.3. Multi-port Services and LVS Clustering.......................................................... 128 12.4. FTP In an LVS Cluster..................................................................................... 130 12.5. Saving Network Packet Filter Settings ............................................................ 132 13. Conguring the LVS Routers with Piranha Conguration Tool ................................. 133 13.1. Necessary Software.......................................................................................... 133 13.2. Logging Into the Piranha Conguration Tool .............................................. 133 13.3. CONTROL/MONITORING......................................................................... 134 13.4. GLOBAL SETTINGS ................................................................................... 135 13.5. REDUNDANCY............................................................................................. 137 13.6. VIRTUAL SERVERS .................................................................................... 139 13.7. Synchronizing Conguration Files .................................................................. 147 13.8. Starting the Cluster .......................................................................................... 148 III. Appendixes................................................................................................................................ 149 A. Using Red Hat Cluster Manager with Piranha................................................................. 151 B. Using Red Hat GFS with Red Hat Cluster Suite.............................................................. 153 B.1. Terminology ...................................................................................................... 153 B.2. Changes to Red Hat Cluster .............................................................................. 154 B.3. Installation Scenarios ........................................................................................ 154 C. The GFS Setup Druid....................................................................................................... 157 C.1. Cluster Name..................................................................................................... 157 C.2. LOCK_GULM parameters .................................................................................. 157 C.3. Choose Location for CCS Files......................................................................... 158 C.4. Cluster Members ............................................................................................... 159 C.5. Saving Your Conguration and Next Steps....................................................... 161 D. Supplementary Hardware Information............................................................................. 163 D.1. Setting Up Power Controllers ........................................................................... 163 D.2. SCSI Bus Conguration Requirements ............................................................ 165 D.3. SCSI Bus Termination ...................................................................................... 166 D.4. SCSI Bus Length............................................................................................... 166 D.5. SCSI Identication Numbers ............................................................................ 167 E. Supplementary Software Information .............................................................................. 169 E.1. Cluster Communication Mechanisms................................................................ 169 E.2. Failover and Recovery Scenarios ...................................................................... 170 E.3. Common Cluster Behaviors: General................................................................ 170 E.4. Common Behaviors: Two Member Cluster with Disk-based Tie-breaker ........ 172 E.5. Common Behaviors: 2-4 Member Cluster with IP-based Tie-Breaker ............. 173 E.6. Common Behaviors: 3-5 Member Cluster ........................................................ 173 E.7. Common Behaviors: Cluster Service Daemons ................................................ 174 E.8. Common Behaviors: Miscellaneous.................................................................. 175 E.9. The cluster.xml File..................................................................................... 175 F. Cluster Command-line Utilities........................................................................................ 179 F.1. Using redhat-config-cluster-cmd........................................................... 179 F.2. Using the shutil Utility................................................................................... 180 F.3. Using the clusvcadm Utility ............................................................................ 180 F.4. Using the clufence Utility .............................................................................. 181 Index................................................................................................................................................. 183 Colophon.......................................................................................................................................... 191 Acknowledgments The Red Hat Cluster Manager software was originally based on the open source Kimberlite (http://oss.missioncriticallinux.com/kimberlite/) cluster project, which was developed by Mission Critical Linux, Inc. Subsequent to its inception based on Kimberlite, developers at Red Hat have made a large number of enhancements and modications. The following is a non-comprehensive list highlighting some of these enhancements. ? Packaging and integration into the Red Hat installation paradigm to simplify the end user's experience. ? Addition of support for multiple cluster members. ? Addition of support for high availability NFS services. ? Addition of support for high availability Samba services. ? Addition of the Cluster Conguration Tool, a graphical conguration tool. ? Addition of the Cluster Status Tool, a graphical monitoring and administration tool. ? Addition of support for failover domains. ? Addition of support for Red Hat GFS, including the GFS Setup Druid and the LOCK_GULM fencing driver. ? Addition of support for using watchdog timers as a data integrity provision. ? Addition of service monitoring which automatically restart a failed application. ? Rewrite of the service manager to facilitate additional cluster-wide operations. ? A set of miscellaneous bug xes. The Red Hat Cluster Manager software incorporates STONITH compliant power switch modules from the Linux-HA project; refer to http://www.linux-ha.org/stonith/. ii Acknowledgments Introduction The Red Hat Cluster Suite is a collection of technologies working together to provide data integrity and the ability to maintain application availability in the event of a failure. Administrators can deploy enterprise cluster solutions using a combination of hardware redundancy along with the failover and load-balancing technologies in Red Hat Cluster Suite. Red Hat Cluster Manager is a high-availability cluster solution specically suited for database applications, network le servers, and World Wide Web (Web) servers with dynamic content. An Red Hat Cluster Manager system features data integrity and application availability using redundant hardware, shared disk storage, power management, and robust cluster communication and application failover mechanisms. Administrators can also deploy highly available applications services using Piranha, a load-balancing and advanced routing cluster solution based on Linux Virtual Server (LVS) technology. Using Piranha, administrators can build highly available e-commerce sites that feature complete data integrity and service availability, in addition to load balancing capabilities. Refer to Part II Conguring a Linux Virtual Server Cluster for more information. This guide assumes that the user has an advanced working knowledge of Red Hat Enterprise Linux and understands the concepts of server computing. For more information about using Red Hat Enterprise Linux, refer to the following resources: ? Red Hat Enterprise Linux Installation Guide for information regarding installation. ? Red Hat Enterprise Linux Introduction to System Administration for introductory information for new Red Hat Enterprise Linux system administrators. ? Red Hat Enterprise Linux System Administration Guide for more detailed information about con- guring Red Hat Enterprise Linux to suit your particular needs as a user. ? Red Hat Enterprise Linux Reference Guide provides detailed information suited for more experienced users to refer to when needed, as opposed to step-by-step instructions. ? Red Hat Enterprise Linux Security Guide details the planning and the tools involved in creating a secured computing environment for the data center, workplace, and home. HTML, PDF, and RPM versions of the manuals are available on the Red Hat Enterprise Linux Documentation CD and online at: http://www.redhat.com/docs/ 1. How To Use This Manual This manual contains information about setting up a Red Hat Cluster Manager system. These tasks are described in Chapter 2 Hardware Installation and Operating System Conguration and Chapter 3 Cluster Conguration. For information about conguring Red Hat Cluster Manager cluster services, refer to Chapter 4 Service Administration through Chapter 7 Setting Up Apache HTTP Server. Part II Conguring a Linux Virtual Server Cluster describes how to achieve load balancing in an Red Hat Enterprise Linux cluster by using the Linux Virtual Server. For information about deploying a Red Hat Cluster Manager cluster in conjunction with Red Hat GFS using the GFS Setup Druid, refer to Appendix C The GFS Setup Druid. Appendix D Supplementary Hardware Information contains detailed conguration information on specic hardware devices and shared storage congurations. Appendix E Supplementary Software Information contains background information on the cluster software and other related information. iv Introduction Appendix F Cluster Command-line Utilities provides usage and reference information on the command-line utilities included with Red Hat Cluster Suite. This guide assumes you have a thorough understanding of Red Hat Enterprise Linux system administration concepts and tasks. For detailed information on Red Hat Enterprise Linux system administration, refer to the Red Hat Enterprise Linux System Administration Guide. For reference information on Red Hat Enterprise Linux, refer to the Red Hat Enterprise Linux Reference Guide. 2. Document Conventions When you read this manual, certain words are represented in different fonts, typefaces, sizes, and weights. This highlighting is systematic; different words are represented in the same style to indicate their inclusion in a specic category. The types of words that are represented this way include the following: command Linux commands (and other operating system commands, when used) are represented this way. This style should indicate to you that you can type the word or phrase on the command line and press [Enter] to invoke a command. Sometimes a command contains words that would be displayed in a different style on their own (such as le names). In these cases, they are considered to be part of the command, so the entire phrase is displayed as a command. For example: Use the cat testfile command to view the contents of a le, named testfile, in the current working directory. file name File names, directory names, paths, and RPM package names are represented this way. This style should indicate that a particular le or directory exists by that name on your system. Examples: The .bashrc le in your home directory contains bash shell denitions and aliases for your own use. The /etc/fstab le contains information about different system devices and le systems. Install the webalizer RPM if you want to use a Web server log le analysis program. application This style indicates that the program is an end-user application (as opposed to system software). For example: Use Mozilla to browse the Web. [key] A key on the keyboard is shown in this style. For example: To use [Tab] completion, type in a character and then press the [Tab] key. Your terminal displays the list of les in the directory that start with that letter. [key]-[combination] A combination of keystrokes is represented in this way. For example: The [Ctrl]-[Alt]-[Backspace] key combination exits your graphical session and return you to the graphical login screen or the console. Introduction v text found on a GUI interface A title, word, or phrase found on a GUI interface screen or window is shown in this style. Text shown in this style is being used to identify a particular GUI screen or an element on a GUI screen (such as text associated with a checkbox or eld). Example: Select the Require Password checkbox if you would like your screensaver to require a password before stopping. top level of a menu on a GUI screen or window A word in this style indicates that the word is the top level of a pulldown menu. If you click on the word on the GUI screen, the rest of the menu should appear. For example: Under File on a GNOME terminal, the New Tab option allows you to open multiple shell prompts in the same window. If you need to type in a sequence of commands from a GUI menu, they are shown like the following example: Go to Main Menu Button (on the Panel) => Programming => Emacs to start the Emacs text editor. button on a GUI screen or window This style indicates that the text can be found on a clickable button on a GUI screen. For example: Click on the Back button to return to the webpage you last viewed. computer output Text in this style indicates text displayed to a shell prompt such as error messages and responses to commands. For example: The ls command displays the contents of a directory. For example: Desktop about.html logs paulwesterberg.png Mail backupfiles mail reports The output returned in response to the command (in this case, the contents of the directory) is shown in this style. prompt A prompt, which is a computer's way of signifying that it is ready for you to input something, is shown in this style. Examples: $ # [stephen@maturin stephen]$ leopard login: user input Text that the user has to type, either on the command line, or into a text box on a GUI screen, is displayed in this style. In the following example, text is displayed in this style: To boot your system into the text based installation program, you must type in the text command at the boot: prompt. replaceable Text used for examples which is meant to be replaced with data provided by the user is displayed in this style. In the following example, ?? version-number  is displayed in this style: vi Introduction The directory for the kernel source is /usr/src/  version-number  /, where  version-number  is the version of the kernel installed on this system. Additionally, we use several different strategies to draw your attention to certain pieces of information. In order of how critical the information is to your system, these items are marked as note, tip, important, caution, or a warning. For example: Note Remember that Linux is case sensitive. In other words, a rose is not a ROSE is not a rOsE. Tip The directory /usr/share/doc/ contains additional documentation for packages installed on your system. Important If you modify the DHCP conguration le, the changes will not take effect until you restart the DHCP daemon. Caution Do not perform routine tasks as root ? use a regular user account unless you need to use the root account for system administration tasks. Warning Be careful to remove only the necessary Red Hat Enterprise Linux partitions. Removing other partitions could result in data loss or a corrupted system environment. 3. More to Come This manual is part of Red Hat's growing commitment to provide useful and timely support to Red Hat Enterprise Linux users. Introduction vii 3.1. Send in Your Feedback If you spot a typo, or if you have thought of a way to make this manual better, we would love to hear from you. Please submit a report in Bugzilla (http://bugzilla.redhat.com/bugzilla/) against the component rh-cs. Be sure to mention the manual's identier: rh-cs(EN)-3-Print-RHI (2004-06-04T17:42) By mentioning this manual's identier, we know exactly which version of the guide you have. If you have a suggestion for improving the documentation, try to be as specic as possible. If you have found an error, please include the section number and some of the surrounding text so we can nd it easily. 4. Sign Up for Support If you have a variant of Red Hat Enterprise Linux 3, please remember to sign up for the benets you are entitled to as a Red Hat customer. Registration enables access to the Red Hat Services you have purchased, such as technical support and Red Hat Network. To register your product, go to: http://www.redhat.com/apps/activate/ Note You must activate your product before attempting to connect to Red Hat Network. If your product has not been activated, Red Hat Network rejects registration to channels to which the system is not entitled. Good luck, and thank you for choosing Red Hat Enterprise Linux! The Red Hat Documentation Team viii Introduction I. Using the Red Hat Cluster Manager Clustered systems provide reliability, scalability, and availability to critical production services. Using the Red Hat Cluster Manager, administrators can create high availability clusters for lesharing, Web servers, databases, and more. This part discusses the installation and conguration of cluster systems using the recommended hardware and Red Hat Enterprise Linux. This section is licensed under the GNU Free Documentation License. For details refer to the Copyright page. Table of Contents 1. Red Hat Cluster Manager Overview............................................................................................. 1 2. Hardware Installation and Operating System Conguration .................................................... 5 3. Cluster Conguration................................................................................................................... 35 4. Service Administration................................................................................................................. 59 5. Database Services.......................................................................................................................... 67 6. Network File Sharing Services..................................................................................................... 77 7. Setting Up Apache HTTP Server ................................................................................................ 93 8. Cluster Administration................................................................................................................. 97 Chapter 1. Red Hat Cluster Manager Overview Red Hat Cluster Manager allows administrators to connect separate systems (called members or nodes) together to create failover clusters that ensure application availability and data integrity under several failure conditions. Administrators can use Red Hat Cluster Manager with database applications, le sharing services, web servers, and more. To set up a failover cluster, you must connect the member systems (often referred to simply as members or nodes) to the cluster hardware, and congure the members into the cluster environment. The foundation of a cluster is an advanced host membership algorithm. This algorithm ensures that the cluster maintains complete data integrity at all times by using the following methods of inter-member communication: ? Network connections between the cluster systems for heartbeat ? Shared state on shared disk storage to hold cluster status To make an application and data highly available in a cluster, you must congure a service (such as an application and shared disk storage) as a discrete, named group of properties and resources to which you can assign an IP address to provide transparent client access. For example, you can set up a service that provides clients with access to highly-available database application data. You can associate a service with a failover domain, a subset of cluster members that are eligible to run the service. In general, any eligible member can run the service and access the service data on shared disk storage. However, each service can run on only one cluster member at a time, in order to maintain data integrity. You can specify whether or not the members in a failover domain are ordered by preference. You can also specify whether or not a service is restricted to run only on members of its associated failover domain. (When associated with an unrestricted failover domain, a service can be started on any cluster member in the event no member of the failover domain is available.) You can set up an active-active conguration in which the members run different services, or a hotstandby conguration in which a primary member runs all the services, and a backup cluster system takes over only if the primary system fails. Figure 1-1 shows an example of a cluster in an active-active conguration. Figure 1-1. Example Cluster in Active-Active Conguration 2 Chapter 1. Red Hat Cluster Manager Overview If a hardware or software failure occurs, the cluster automatically restarts the failed member's services on the functional member. This service failover capability ensures that no data is lost, and there is little disruption to users. When the failed member recovers, the cluster can re-balance the services across the members. In addition, you can cleanly stop the services running on a cluster system and then restart them on another system. This service relocation capability allows you to maintain application and data availability when a cluster member requires maintenance. 1.1. Red Hat Cluster Manager Features Cluster systems deployed with Red Hat Cluster Manager include the following features: ? No-single-point-of-failure hardware conguration Clusters can include a dual-controller RAID array, multiple network channels, and redundant uninterruptible power supply (UPS) systems to ensure that no single failure results in application down time or loss of data. Alternately, a low-cost cluster can be set up to provide less availability than a no-single-point-offailure cluster. For example, you can set up a cluster with a single-controller RAID array and only a single Ethernet channel. Certain low-cost alternatives, such as software RAID and multi-initiator parallel SCSI, are not compatible or appropriate for use on the shared cluster storage. Refer to Section 2.1 Choosing a Hardware Conguration, for more information. ? Service conguration framework Clusters allow you to easily congure individual services to make data and applications highly available. To create a service, you specify the resources used in the service and properties for the service, including the service name, application start, stop, and status script, disk partitions, mount points, and the cluster members on which you prefer the service to run. After you add a service, the cluster management software stores the information in a cluster conguration le on shared storage, where the conguration data can be accessed by all cluster members. The cluster provides an easy-to-use framework for database applications. For example, a database service serves highly-available data to a database application. The application running on a cluster member provides network access to database client systems, such as Web servers. If the service fails over to another member, the application can still access the shared database data. A networkaccessible database service is usually assigned an IP address, which is failed over along with the service to maintain transparent access for clients. The cluster service framework can be easily extended to other applications, as well. ? Failover domains By assigning a service to a restricted failover domain, you can limit the members that are eligible to run a service in the event of a failover. (A service that is assigned to a restricted failover domain cannot be started on a cluster member that is not included in that failover domain.) You can order the members in a failover domain by preference to ensure that a particular member runs the service (as long as that member is active). If a service is assigned to an unrestricted failover domain, the service starts on any available cluster member (if none of the members of the failover domain are available). ? Data integrity assurance To ensure data integrity, only one member can run a service and access service data at one time. The use of power switches in the cluster hardware conguration enables a member to power-cycle another member before restarting that member's services during the failover process. This prevents any two systems from simultaneously accessing the same data and corrupting it. Although not required, it is recommended that power switches are used to guarantee data integrity under all failure Chapter 1. Red Hat Cluster Manager Overview 3 conditions. Watchdog timers are an optional variety of power control to ensure correct operation of service failover. ? Cluster administration user interface The cluster administration interface facilitiates management tasks such as: creating, starting, and stopping services; relocating services from one member to another; modifying the cluster conguration (to add or remove services or resources); and monitoring the cluster members and services. ? Ethernet channel bonding To monitor the health of the other members, each member monitors the health of the remote power switch, if any, and issues heartbeat pings over network channels. With Ethernet channel bonding, multiple Ethernet interfaces are congured to behave as one, reducing the risk of a single-point-offailure in the typical switched Ethernet connection between systems. ? Shared storage for quorum information Shared state information includes whether the member is active. Service state information includes whether the service is running and which member is running the service. Each member checks to ensure that the status of the other members is up to date. In a two-member cluster, each member periodically writes a timestamp and cluster state information to two shared cluster partitions located on shared disk storage. To ensure correct cluster operation, if a member is unable to write to both the primary and shadow shared cluster partitions at startup time, it is not allowed to join the cluster. In addition, if a member is not updating its timestamp, and if heartbeats to the system fail, the member is removed from the cluster. Figure 1-2 shows how members communicate in a cluster conguration. Note that the terminal server used to access system consoles via serial ports is not a required cluster component. Figure 1-2. Cluster Communication Mechanisms 4 Chapter 1. Red Hat Cluster Manager Overview ? Service failover capability If a hardware or software failure occurs, the cluster takes the appropriate action to maintain application availability and data integrity. For example, if a member completely fails, another member (in the associated failover domain, if used, or in the cluster) restarts its services. Services already running on this member are not disrupted. When the failed member reboots and is able to write to the shared cluster partitions, it can rejoin the cluster and run services. Depending on how the services are congured, the cluster can re-balance the services among the members. ? Manual service relocation capability In addition to automatic service failover, a cluster allows you to cleanly stop services on one member and restart them on another member. You can perform planned maintenance on a member system while continuing to provide application and data availability. ? Event logging facility To ensure that problems are detected and resolved before they affect service availability, the cluster daemons log messages by using the conventional Linux syslog subsystem. You can customize the severity level of the logged messages. ? Application monitoring The infrastructure in a cluster can optionally monitor the state and health of an application. In this manner, should an application-specic failure occur, the cluster automatically restarts the application. In response to the application failure, the application attempts to be restarted on the member it was initially running on; failing that, it restarts on another cluster member. You can specify which members are eligible to run a service by assigning a failover domain to the service. Chapter 2. Hardware Installation and Operating System Conguration To set up the hardware conguration and install Red Hat Enterprise Linux, follow these steps: ? Choose a cluster hardware conguration that meets the needs of applications and users; refer to Section 2.1 Choosing a Hardware Conguration. ? Set up and connect the members and the optional console switch and network switch or hub; refer to Section 2.2 Setting Up the Members. ? Install and congure Red Hat Enterprise Linux on the cluster members; refer to Section 2.3 Installing and Conguring Red Hat Enterprise Linux. ? Set up the remaining cluster hardware components and connect them to the members; refer to Section 2.4 Setting Up and Connecting the Cluster Hardware. After setting up the hardware conguration and installing Red Hat Enterprise Linux, install the cluster software. Tip Refer to the Red Hat Hardware Compatibility List available at http://hardware.redhat.com/hcl/ for a list of compatible hardware. Perform a Quick Search for the term cluster to nd results for power switch and shared storage hardware certied for or compatible with Red Hat Cluster Manager. For general system hardware compatibility searches, use manufacturer, brand, and/or model keywords to check for compatibility with Red Hat Enterprise Linux. 2.1. Choosing a Hardware Conguration The Red Hat Cluster Manager allows administrators to use commodity hardware to set up a cluster conguration that meets the performance, availability, and data integrity needs of applications and users. Cluster hardware ranges from low-cost minimum congurations that include only the components required for cluster operation, to high-end congurations that include redundant Ethernet channels, hardware RAID, and power switches. Regardless of conguration, the use of high-quality hardware in a cluster is recommended, as hardware malfunction is a primary cause of system down time. Although all cluster congurations provide availability, some congurations protect against every single point of failure. In addition, all cluster congurations provide data integrity, but some congurations protect data under every failure condition. Therefore, administrators must fully understand the needs of their computing environment and also the availability and data integrity features of different hardware congurations to choose the cluster hardware that meets the proper requirements. When choosing a cluster hardware conguration, consider the following: Performance requirements of applications and users Choose a hardware conguration that provides adequate memory, CPU, and I/O resources. Be sure that the conguration chosen can handle any future increases in workload as well. 6 Chapter 2. Hardware Installation and Operating System Conguration Cost restrictions The hardware conguration chosen must meet budget requirements. For example, systems with multiple I/O ports usually cost more than low-end systems with fewer expansion capabilities. Availability requirements If a computing environment requires the highest degree of availability, such as a production environment, then a cluster hardware conguration that protects against all single points of failure, including disk, storage interconnect, Ethernet channel, and power failures is recommended. Environments that can tolerate an interruption in availability, such as development environments, may not require as much protection. Refer to Section 2.4.3 Conguring UPS Systems and Section 2.4.4 Conguring Shared Disk Storagefor more information about using redundant hardware for high availability. Data integrity under all failure conditions requirement Using power switches in a cluster conguration guarantees that service data is protected under every failure condition. These devices enable a member to power cycle another member before restarting its services during failover. Power switches protect against data corruption if an unresponsive (or hanging) member becomes responsive after its services have failed over and then issues I/O to a disk that is also receiving I/O from the other member. In addition, if a quorum daemon fails on a member, the member is no longer able to monitor the shared cluster partitions. If you are not using power switches in the cluster, this error condition may result in services being run on more than one member, which can cause data corruption. Refer to Section 2.4.2 Conguring Power Switches for more information about the benets of using power switches in a cluster. It is recommended that production environments use power switches or watchdog timers in the cluster conguration. 2.1.1. Shared Storage Requirements The operation of the cluster depends on reliable, coordinated access to shared storage. In the event of hardware failure, it is desirable to be able to disconnect one member from the shared storage for repair without disrupting the other members. Shared storage is truly vital to the cluster conguration. Testing has shown that it is difcult, if not impossible, to congure reliable multi-initiator parallel SCSI congurations at data rates above 80MB/sec using standard SCSI adapters. Further tests have shown that these congurations cannot support online repair because the bus does not work reliably when the HBA terminators are disabled, and external terminators are used. For these reasons, multiinitiator SCSI congurations using standard adapters are not supported. Either single-initiator SCSI bus adapters (connected to multi-ported storage) or Fibre Channel adapters are required. The Red Hat Cluster Manager requires that all cluster members have simultaneous access to the shared storage. Certain host RAID adapters are capable of providing this type of access to shared RAID units. These products require extensive testing to ensure reliable operation, especially if the shared RAID units are based on parallel SCSI buses. These products typically do not allow for online repair of a failed member. Only host RAID adapters listed in the Red Hat Hardware Compatibility List are supported. The use of software RAID, or software Logical Volume Management (LVM), is not supported on shared storage. This is because these products do not coordinate access from multiple hosts to shared storage. Software RAID or LVMmay be used on non-shared storage on cluster members (for example, boot and system partitions, and other le systems which are not associated with any cluster services). 2.1.2. Minimum Hardware Requirements A minimum hardware conguration includes only the hardware components that are required for cluster operation, as follows: Chapter 2. Hardware Installation and Operating System Conguration 7 ? Two servers to run cluster services ? Ethernet connection for sending heartbeat pings and for client network access ? Shared disk storage for the shared cluster partitions and service data The hardware components described in Table 2-1 can be used to set up a minimum cluster conguration. This conguration does not guarantee data integrity under all failure conditions, because it does not include power switches. Note that this is a sample conguration; it is possible to set up a minimum conguration using other hardware. Warning The minimum cluster conguration is not a supported solution and should not be used in a production environment, as it does not guarantee data integrity under all failure conditions. Hardware Description Two servers Each member includes a network interface for client access and for Ethernet connections and a SCSI adapter (termination disabled) for the shared storage connection Two network cables with RJ45 connectors Network cables connect an Ethernet network interface on each member to the network for client access and heartbeat pings. RAID storage enclosure The RAID storage enclosure contains one controller with at least two host ports. Two HD68 SCSI cables Each cable connects one HBA to one port on the RAID controller, creating two single-initiator SCSI buses. Table 2-1. Example of Minimum Cluster Conguration The minimum hardware conguration is the most cost-effective cluster conguration; however, it includes multiple points of failure. For example, if the RAID controller fails, then all cluster services become unavailable. When deploying the minimal hardware conguration, software watchdog timers should be congured as a data integrity provision. Refer to Section D.1.2.3 Conguring a Hardware Watchdog Timer for details. To improve availability, protect against component failure, and guarantee data integrity under all failure conditions, the minimum conguration can be expanded, as described in Table 2-2. Problem Solution Disk failure Hardware RAID to replicate data across multiple disks RAID controller failure Dual RAID controllers to provide redundant access to disk data Heartbeat failure Ethernet channel bonding and failover Power source failure Redundant uninterruptible power supply (UPS) systems Data corruption under all failure conditions Power switches or hardware-based watchdog timers Table 2-2. Improving Availability and Guaranteeing Data Integrity 8 Chapter 2. Hardware Installation and Operating System Conguration A no single point of failure hardware conguration that guarantees data integrity under all failure conditions can include the following components: ? At least two servers to run cluster services ? Ethernet connection between each member for heartbeat pings and for client network access ? Dual-controller RAID array to replicate shared partitions and service data ? Power switches to enable each member to power-cycle the other members during the failover process ? Ethernet interfaces congured to use channel bonding ? At least two UPS systems for a highly-available source of power The components described in Table 2-3 can be used to set up a no single point of failure cluster con- guration that includes two single-initiator SCSI buses and power switches to guarantee data integrity under all failure conditions. Note that this is a sample conguration; it is possible to set up a no single point of failure conguration using other hardware. Hardware Description Two servers (up to 8 supported) Each member includes the following hardware: Two network interfaces for: Point-to-point Ethernet connections Client network access and Ethernet heartbeat pings Three serial ports for: Remote power switch connection Connection to the terminal server One Adaptec 29160 adapter (termination enabled) for the shared disk storage connection. One network switch A network switch enables the connection of multiple members to a network. One Cyclades terminal server A terminal server allows for management of remote members from a central location. (A terminal server is not required for cluster operation.) Four network cables Network cables connect the terminal server and a network interface on each member to the network switch. Two RJ45 to DB9 crossover cables RJ45 to DB9 crossover cables connect a serial port on each member to the Cyclades terminal server. Two serial-attached power switches Power switches enable each member to power-cycle the other member before restarting its services. The power cable for each member is connected to its own power switch. Note that serial-attach power switches are supported in two-member clusters only. Two null modem cables Null modem cables connect a serial port on each member to the power switch that provides power to the other member. This connection enables each member to power-cycle the other member. FlashDisk RAID Disk Array with dual controllers Dual RAID controllers protect against disk and controller failure. The RAID controllers provide simultaneous access to all the logical units on the host ports. Chapter 2. Hardware Installation and Operating System Conguration 9 Hardware Description Two HD68 SCSI cables HD68 cables connect each host bus adapter to a RAID enclosure "in" port, creating two single-initiator SCSI buses. Two terminators Terminators connected to each "out" port on the RAID enclosure terminate both single-initiator SCSI buses. Redundant UPS Systems UPS systems provide a highly-available source of power. The power cables for the power switches and the RAID enclosure are connected to two UPS systems. Table 2-3. Example of a No Single Point of Failure Conguration Figure 2-1 shows an example of a no single point of failure hardware conguration that includes the previously-described hardware, two single-initiator SCSI buses, and power switches to guarantee data integrity under all error conditions. A "T" enclosed in a circle represents a SCSI terminator. Figure 2-1. No Single Point of Failure Conguration Example Cluster hardware congurations can also include other optional hardware components that are common in a computing environment. For example, a cluster can include a network switch or network hub, which enables the connection of the members to a network. A cluster may also include a console 10 Chapter 2. Hardware Installation and Operating System Conguration switch, which facilitates the management of multiple members and eliminates the need for separate monitors, mouses, and keyboards for each member. One type of console switch is a terminal server, which enables connection to serial consoles and management of many members from one remote location. As a low-cost alternative, you can use a KVM (keyboard, video, and mouse) switch, which enables multiple members to share one keyboard, monitor, and mouse. A KVM is suitable for congurations in which access to a graphical user interface (GUI) to perform system management tasks is preferred. When choosing a system, be sure that it provides the required PCI slots, network slots, and serial ports. For example, a no single point of failure conguration requires multiple bonded Ethernet ports. Refer to Section 2.2.1 Installing the Basic Cluster Hardware for more information. 2.1.3. Choosing the Type of Power Controller The Red Hat Cluster Manager implementation consists of a generic power management layer and a set of device-specic modules which accommodate a range of power management types. When selecting the appropriate type of power controller to deploy in the cluster, it is important to recognize the implications of specic device types. The following describes the types of supported power switches followed by a summary table. For a more detailed description of the role a power switch plays to ensure data integrity, refer to Section 2.4.2 Conguring Power Switches. Serial-attached and network-attached power switches are separate devices which enable one cluster member to power cycle another member. They resemble a power plug strip on which individual outlets can be turned on and off under software control through either a serial or network cable. Networkattached power switches differ from serial-attached in that they connect to cluster members via an Ethernet hub or switch, rather than direct connection to cluster members. A network-attached power switch can not be directly attached to a cluster member using a crossover cable, as the power switch would be unable to power cycle the other members. Watchdog timers provide a means for failed members to remove themselves from the cluster prior to another member taking over its services, rather than allowing one cluster member to power cycle another. The normal operational mode for watchdog timers is that the cluster software must periodically reset a timer prior to its expiration. If the cluster software fails to reset the timer, the watchdog triggers under the assumption that the member may have hung or otherwise failed. The healthy cluster member allows a window of time to pass prior to concluding that another cluster member has failed (by default, this window is 12 seconds). The watchdog timer interval must be less than the duration of time for one cluster member to conclude that another has failed. In this manner, a healthy member can assume, prior to taking over services, that the failed cluster member has safely removed itself from the cluster (by rebooting) and is no longer a risk to data integrity. The underlying watchdog support is included in the core Linux kernel. Red Hat Cluster Manager utilizes these watchdog features via its standard APIs and conguration mechanism. There are two types of watchdog timers: hardware-based and software-based. Hardware-based watchdog timers typically consist of system board components such as the Intel® i810 TCO chipset. This circuitry has a high degree of independence from the main system CPU. This independence is bene- cial in failure scenarios of a true system hang, as in this case it pulls down the system's reset lead resulting in a system reboot. Some PCI expansion cards provide watchdog features. Software-based watchdog timers do not have any dedicated hardware. The implementation is a kernel thread which is periodically run; if the timer duration has expired, the thread initiates a system reboot. The vulnerability of the software watchdog timer is that under certain failure scenarios, such as system hangs while interrupts are blocked, the kernel thread is not called. As a result, in such conditions it cannot be denitively depended on for data integrity. This can cause the healthy cluster member to take over services for a hung member which could cause data corruption under certain scenarios. Finally, administrators can choose not to employ a power controller at all. When a power controller is not in use, no provision exists for a cluster member to power cycle a failed member. Similarly, the failed member cannot be guaranteed to reboot itself under all failure conditions. Chapter 2. Hardware Installation and Operating System Conguration 11 Important Use of a power controller is strongly recommended as part of a production cluster environment. Conguration of a cluster without a power controller is not supported. Ultimately, the right type of power controller deployed in a cluster environment depends on the data integrity requirements weighed against the cost and availability of external power switches. Table 2-4 summarizes the types of supported power management modules and discusses their advantages and disadvantages individually. Type Notes Pros Cons Serial-attached power switches (supported for two-member clusters only) Two serial attached power controllers are used in a cluster (one per member) Affords strong data integrity guarantees ? the power controller itself is not a single point of failure as there are two in a cluster Requires purchase of power controller hardware and cables; consumes serial ports; can only be used in two-member cluster Network-attached power switches A single network attached power controller is required per cluster (depending on the number of members); however, up to three are supported for each cluster member Affords strong data integrity guarantees and can be used in clusters with more than two members Requires purchase of power controller hardware ? the power controller itself can become a single point of failure (although they are typically very reliable devices) Hardware Watchdog Timer Affords strong data integrity guarantees Obviates the need to purchase external power controller hardware Not all systems include supported watchdog hardware Software Watchdog Timer Offers acceptable data integrity provisions Obviates the need to purchase external power controller hardware; works on any system Under some failure scenarios, the software watchdog is not operational, opening a small vulnerability window No power controller No power controller function is in use Obviates the need to purchase external power controller hardware; works on any system Vulnerable to data corruption under certain failure scenarios Table 2-4. Power Switches 2.1.4. Cluster Hardware Components Use the following tables to identify the hardware components required for the cluster conguration. Table 2-5 includes the hardware required for the cluster members. 12 Chapter 2. Hardware Installation and Operating System Conguration Hardware Quantity Description Required Cluster members eight (maximum supported) Each member must provide enough PCI slots, network slots, and serial ports for the cluster hardware conguration. Because disk devices must have the same name on each member, it is recommended that the members have symmetric I/O subsystems. It is also recommended that the processor speed and amount of system memory be adequate for the processes run on the cluster members. Consult the Red Hat Enterprise Linux 3 Release Notes for specics. Refer to Section 2.2.1 Installing the Basic Cluster Hardware for more information. Yes Table 2-5. Cluster Member Hardware Table 2-6 includes several different types of power switches. A single cluster requires only one type of power switch. Hardware Quantity Description Required Serial power switches Two In a two-member cluster, use serial power switches to enable each cluster member to power-cycle the other member. Refer to Section 2.4.2 Conguring Power Switches for more information. Note, cluster members are congured with either serial power switches (supported for two-member clusters only) or network-attached power switches, but not both. Strongly recommended for data integrity under all failure conditions Null modem cable Two Null modem cables connect a serial port on a cluster member to a serial power switch. This enables each member to power-cycle the other member. Some power switches may require different cables. Only if using serial power switches Mounting bracket One Some power switches support rack mount congurations and require a separate mounting bracket. Only for rack mounting power switches Network power switch One (depends on member count) Network-attached power switches enable each cluster member to power cycle all others. Refer to Section 2.4.2 Conguring Power Switches for more information. Strongly recommended for data integrity under all failure conditions Watchdog Timer One per member Watchdog timers cause a failed cluster member to remove itself from a cluster prior to a healthy member taking over its services. Refer to Section 2.4.2 Conguring Power Switches for more information. Recommended for data integrity on systems which provide integrated watchdog hardware Chapter 2. Hardware Installation and Operating System Conguration 13 Table 2-6. Power Switch Hardware Table Table 2-8 through Table 2-10 show a variety of hardware components for an administrator to choose from. An individual cluster does not require all of the components listed in these tables. Hardware Quantity Description Required Network interface One for each network connection Each network connection requires a network interface installed in a member. Yes Network switch or hub One A network switch or hub allows connection of multiple members to a network. No Network cable One for each network interface A conventional network cable, such as a cable with an RJ45 connector, connects each network interface to a network switch or a network hub. Yes Table 2-7. Network Hardware Table Hardware Quantity Description Required Host bus adapter One per member To connect to shared disk storage, install either a parallel SCSI or a Fibre Channel host bus adapter in a PCI slot in each cluster member. For parallel SCSI, use a low voltage differential (LVD) host bus adapter. Adapters have either HD68 or VHDCI connectors. Host-bus adapter based RAID cards are only supported if they correctly support multi-host operation. At the time of publication, there were no fully tested host-bus adapter based RAID cards. Yes External disk storage enclosure At least one Use Fibre Channel or single-initiator parallel SCSI to connect the cluster members to a single or dual-controller RAID array. To use single-initiator buses, a RAID controller must have multiple host ports and provide simultaneous access to all the logical units on the host ports. To use a dual-controller RAID array, a logical unit must fail over from one controller to the other in a way that is transparent to the OS. SCSI RAID arrays that provide simultaneous access to all logical units on the host ports are recommended. To ensure symmetry of device IDs and LUNs, many RAID arrays with dual redundant controllers must be congured in an active/passive mode. Refer to Section 2.4.4 Conguring Shared Disk Storage for more information. Yes 14 Chapter 2. Hardware Installation and Operating System Conguration Hardware Quantity Description Required SCSI cable One per member SCSI cables with 68 pins connect each host bus adapter to a storage enclosure port. Cables have either HD68 or VHDCI connectors. Cables vary based on adapter type. Only for parallel SCSI congurations SCSI terminator As required by hardware conguration For a RAID storage enclosure that uses "out" ports (such as FlashDisk RAID Disk Array) and is connected to single-initiator SCSI buses, connect terminators to the "out" ports to terminate the buses. Only for parallel SCSI congurations and only as necessary for termination Fibre Channel hub or switch One or two A Fibre Channel hub or switch may be required. Only for some Fibre Channel congurations Fibre Channel cable As required by hardware conguration A Fibre Channel cable connects a host bus adapter to a storage enclosure port, a Fibre Channel hub, or a Fibre Channel switch. If a hub or switch is used, additional cables are needed to connect the hub or switch to the storage adapter ports. Only for Fibre Channel congurations Table 2-8. Shared Disk Storage Hardware Table Hardware Quantity Description Required Network interface Two for each member Each Ethernet connection requires a network interface card installed on all cluster members. No Network crossover cable One for each channel A network crossover cable connects a network interface on one member to a network interface on other cluster members, creating an Ethernet connection for communicating heartbeat. Only for a redundant Ethernet connection (use of channel-bonded Ethernet connection is preferred) Table 2-9. Point-To-Point Ethernet Connection Hardware Table Hardware Quantity Description Required UPS system One or more Uninterruptible power supply (UPS) systems protect against downtime if a power outage occurs. UPS systems are highly recommended for cluster operation. Connect the power cables for the shared storage enclosure and both power switches to redundant UPS systems. Note, a UPS system must be able to provide voltage for an adequate period of time, and should be connected to its own power circuit. Strongly recommended for availability Table 2-10. UPS System Hardware Table Chapter 2. Hardware Installation and Operating System Conguration 15 Hardware Quantity Description Required Terminal server One A terminal server enables you to manage many members from one remote location. No KVM One A KVM enables multiple members to share one keyboard, monitor, and mouse. Cables for connecting members to the switch depend on the type of KVM. No Table 2-11. Console Switch Hardware Table 2.2. Setting Up the Members After identifying the cluster hardware components described in Section 2.1 Choosing a Hardware Conguration, set up the basic cluster hardware and connect the members to the optional console switch and network switch or hub. Follow these steps: 1. In all members, install the required network adapters and host bus adapters. Refer to Section 2.2.1 Installing the Basic Cluster Hardware for more information about performing this task. 2. Set up the optional console switch and connect it to each member. Refer to Section 2.2.2 Setting Up a Console Switch for more information about performing this task. If a console switch is not used, then connect each member to a console terminal. 3. Set up the optional network switch or hub and use conventional network cables to connect it to the members and the terminal server (if applicable). Refer to Section 2.2.3 Setting Up a Network Switch or Hub for more information about performing this task. If a network switch or hub is not used, then conventional network cables should be used to connect each member and the terminal server (if applicable) to a network. After performing the previous tasks, install Red Hat Enterprise Linux as described in Section 2.3 Installing and Conguring Red Hat Enterprise Linux. 2.2.1. Installing the Basic Cluster Hardware Members must provide the CPU processing power and memory required by applications. In addition, members must be able to accommodate the SCSI or Fibre Channel adapters, network interfaces, and serial ports that the hardware conguration requires. Systems have a limited number of pre-installed serial and network ports and PCI expansion slots. Table 2-12 helps determine how much capacity the member systems employed require. Cluster Hardware Component Serial Ports Network Slots PCI Slots SCSI or Fibre Channel adapter to shared disk storage One for each bus adapter 16 Chapter 2. Hardware Installation and Operating System Conguration Cluster Hardware Component Serial Ports Network Slots PCI Slots Network connection for client access and Ethernet heartbeat pings One for each network connection Point-to-point Ethernet connection for 2-node clusters (optional) One for each connection Terminal server connection (optional) One Table 2-12. Installing the Basic Cluster Hardware Most systems come with at least one serial port. If a system has graphics display capability, it is possible to use the serial console port for a power switch connection. To expand your serial port capacity, use multi-port serial PCI cards. For multiple-member clusters, use a network power switch. Also, ensure that local system disks are not on the same SCSI bus as the shared disks. For example, use two-channel SCSI adapters, such as the Adaptec 39160-series cards, and put the internal devices on one channel and the shared disks on the other channel. Using multiple SCSI cards is also possible. Refer to the system documentation supplied by the vendor for detailed installation information. Refer to Appendix D Supplementary Hardware Information for hardware-specic information about using host bus adapters in a cluster. Figure 2-2 shows the bulkhead of a sample member and the external cable connections for a typical cluster conguration. Figure 2-2. Typical External Cabling for a Cluster Member Chapter 2. Hardware Installation and Operating System Conguration 17 2.2.2. Setting Up a Console Switch Although a console switch is not required for cluster operation, it can be used to facilitate member management and eliminate the need for separate monitors, mouses, and keyboards for each cluster member. There are several types of console switches. For example, a terminal server enables connection to serial consoles and management of many members from a remote location. For a low-cost alternative, use a KVM (keyboard, video, and mouse) switch, which enables multiple members to share one keyboard, monitor, and mouse. A KVM switch is suitable for congurations in which GUI access to perform system management tasks is preferred. Set up the console switch according to the documentation provided by the vendor. After the console switch has been set up, connect it to each cluster member. The cables used depend on the type of console switch. For example, a Cyclades terminal server uses RJ45 to DB9 crossover cables to connect a serial port on each member to the terminal server. 2.2.3. Setting Up a Network Switch or Hub A network switch or hub, although not required for operating a two-node cluster, can be used to facilitate cluster and client system network operations. Clusters of more than two members require a switch or hub. Set up a network switch or hub according to the documentation provided by the vendor. After setting up the network switch or hub, connect it to each member by using conventional network cables. A terminal server, if used, is connected to the network switch or hub through a network cable. 2.3. Installing and Conguring Red Hat Enterprise Linux After the setup of basic cluster hardware, proceed with installation of Red Hat Enterprise Linux on each member and ensure that all systems recognize the connected devices. Follow these steps: 1. Install Red Hat Enterprise Linux on all cluster members. Refer to Red Hat Enterprise Linux Installation Guide for instructions. In addition, when installing Red Hat Enterprise Linux, it is strongly recommended to do the following: ? Gather the IP addresses for the members and for the bonded Ethernet ports, before installing Red Hat Enterprise Linux. Note that the IP addresses for the bonded Ethernet ports can be private IP addresses, (for example, 10.x.x.x). ? Do not place local le systems (such as /, /etc, /tmp, and /var) on shared disks or on the same SCSI bus as shared disks. This helps prevent the other cluster members from accidentally mounting these le systems, and also reserves the limited number of SCSI identication numbers on a bus for cluster disks. ? Place /tmp and /var on different le systems. This may improve member performance. ? When a member boots, be sure that the member detects the disk devices in the same order in which they were detected during the Red Hat Enterprise Linux installation. If the devices are not detected in the same order, the member may not boot. ? When using RAID storage congured with Logical Unit Numbers (LUNs) greater than zero, it is necessary to enable LUN support by adding the following to /etc/modules.conf: options scsi_mod max_scsi_luns=255 18 Chapter 2. Hardware Installation and Operating System Conguration After modifying modules.conf, it is necessary to rebuild the initial ram disk using mkinitrd. Refer to the Red Hat Enterprise Linux System Administration Guide for more information about creating ramdisks using mkinitrd. 2. Reboot the members. 3. When using a terminal server, congure Red Hat Enterprise Linux to send console messages to the console port. 4. Edit the /etc/hosts le on each cluster member and include the IP addresses used in the cluster or ensure that the addresses are in DNS. Refer to Section 2.3.1 Editing the /etc/hosts File for more information about performing this task. 5. Decrease the alternate kernel boot timeout limit to reduce boot time for members. Refer to Section 2.3.2 Decreasing the Kernel Boot Timeout Limit for more information about performing this task. 6. Ensure that no login (or getty) programs are associated with the serial ports that are being used for the remote power switch connection (if applicable). To perform this task, edit the /etc/inittab le and use a hash symbol (#) to comment out the entries that correspond to the serial ports used for the remote power switch. Then, invoke the init q command. 7. Verify that all systems detect all the installed hardware: ? Use the dmesg command to display the console startup messages. Refer to Section 2.3.3 Displaying Console Startup Messages for more information about performing this task. ? Use the cat /proc/devices command to display the devices congured in the kernel. Refer to Section 2.3.4 Displaying Devices Congured in the Kernel for more information about performing this task. 8. Verify that the members can communicate over all the network interfaces by using the ping command to send test packets from one member to another. 9. If intending to congure Samba services, verify that the required RPM packages for Samba services are installed. 2.3.1. Editing the /etc/hosts File The /etc/hosts le contains the IP address-to-hostname translation table. The /etc/hosts le on each member must contain entries for the following: ? IP addresses and associated hostnames for all cluster members ? IP addresses and associated hostnames for the point-to-point Ethernet heartbeat connections (these can be private IP addresses) As an alternative to the /etc/hosts le, naming services such as DNS or NIS can be used to de- ne the host names used by a cluster. However, to limit the number of dependencies and optimize availability, it is strongly recommended to use the /etc/hosts le to dene IP addresses for cluster network interfaces. The following is an example of an /etc/hosts le on a member: 127.0.0.1 localhost.localdomain localhost 193.186.1.81 cluster2.example.com cluster2 10.0.0.1 ecluster2.example.com ecluster2 193.186.1.82 cluster3.example.com cluster3 10.0.0.2 ecluster3.example.com ecluster3 Chapter 2. Hardware Installation and Operating System Conguration 19 The previous example shows the IP addresses and hostnames for two members (cluster2 and cluster3), and the private IP addresses and hostnames for the Ethernet interface (ecluster2 and ecluster3) used for the point-to-point heartbeat connection on each member. Verify correct formatting of the local host entry in the /etc/hosts le to ensure that it does not include non-local systems in the entry for the local host. An example of an incorrect local host entry that includes a non-local system (server1) is shown next: 127.0.0.1 localhost.localdomain localhost server1 An Ethernet connection may not operate properly if the format of the /etc/hosts le is not correct. Check the /etc/hosts le and correct the le format by removing non-local systems from the local host entry, if necessary. Note that each network adapter must be congured with the appropriate IP address and netmask. The following example shows a portion of the output from the /sbin/ifconfig command on a cluster member: eth0 Link encap:Ethernet HWaddr 00:00:BC:11:76:93 inet addr:192.186.1.81 Bcast:192.186.1.245 Mask:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:65508254 errors:225 dropped:0 overruns:2 frame:0 TX packets:40364135 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:100 Interrupt:19 Base address:0xfce0 eth1 Link encap:Ethernet HWaddr 00:00:BC:11:76:92 inet addr:10.0.0.1 Bcast:10.0.0.245 Mask:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:0 errors:0 dropped:0 overruns:0 frame:0 TX packets:0 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:100 Interrupt:18 Base address:0xfcc0 The previous example shows two network interfaces on a cluster member: eth0 (the network interface for the member) and eth1 (the network interface for the point-to-point Ethernet connection). You may also add the IP addresses for the cluster members to your DNS server. Refer to the Red Hat Enterprise Linux Reference Guide for information on conguring DNS, or consult your network administrator. 2.3.2. Decreasing the Kernel Boot Timeout Limit It is possible to reduce the boot time for a member by decreasing the kernel boot timeout limit. During the Red Hat Enterprise Linux boot sequence, the boot loader allows for specifying an alternate kernel to boot. The default timeout limit for specifying a kernel is ten seconds. To modify the kernel boot timeout limit for a member, edit the appropriate les as follows: When using the GRUB boot loader, the timeout parameter in /boot/grub/grub.conf should be modied to specify the appropriate number of seconds for the timeout parameter. To set this interval to 3 seconds, edit the parameter to the following: timeout = 3 When using the LILO or ELILO boot loaders, edit the /etc/lilo.conf le (on x86 systems) or the elilo.conf le (on Itanium systems) and specify the desired value (in tenths of a second) for the timeout parameter. The following example sets the timeout limit to three seconds: 20 Chapter 2. Hardware Installation and Operating System Conguration timeout = 30 To apply any changes made to the /etc/lilo.conf le, invoke the /sbin/lilo command. On an Itanium system, to apply any changes made to the /boot/efi/efi/redhat/elilo.conf le, invoke the /sbin/elilo command. 2.3.3. Displaying Console Startup Messages Use the dmesg command to display the console startup messages. Refer to the dmesg(8) man page for more information. The following example of output from the dmesg command shows that two external SCSI buses and nine disks were detected on the member. (Lines with backslashes display as one line on most screens): May 22 14:02:10 storage3 kernel: scsi0 : Adaptec AHA274x/284x/294x \ (EISA/VLB/PCI-Fast SCSI) 5.1.28/3.2.4 May 22 14:02:10 storage3 kernel: May 22 14:02:10 storage3 kernel: scsi1 : Adaptec AHA274x/284x/294x \ (EISA/VLB/PCI-Fast SCSI) 5.1.28/3.2.4 May 22 14:02:10 storage3 kernel: May 22 14:02:10 storage3 kernel: scsi : 2 hosts. May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST39236LW Rev: 0004 May 22 14:02:11 storage3 kernel: Detected scsi disk sda at scsi0, channel 0, id 0, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sdb at scsi1, channel 0, id 0, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sdc at scsi1, channel 0, id 1, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sdd at scsi1, channel 0, id 2, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sde at scsi1, channel 0, id 3, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sdf at scsi1, channel 0, id 8, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sdg at scsi1, channel 0, id 9, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sdh at scsi1, channel 0, id 10, lun 0 May 22 14:02:11 storage3 kernel: Vendor: SEAGATE Model: ST318203LC Rev: 0001 May 22 14:02:11 storage3 kernel: Detected scsi disk sdi at scsi1, channel 0, id 11, lun 0 May 22 14:02:11 storage3 kernel: Vendor: Dell Model: 8 BAY U2W CU Rev: 0205 May 22 14:02:11 storage3 kernel: Type: Processor \ ANSI SCSI revision: 03 May 22 14:02:11 storage3 kernel: scsi1 : channel 0 target 15 lun 1 request sense \ failed, performing reset. May 22 14:02:11 storage3 kernel: SCSI bus is being reset for host 1 channel 0. May 22 14:02:11 storage3 kernel: scsi : detected 9 SCSI disks total. The following example of the dmesg command output shows that a quad Ethernet card was detected on the member: May 22 14:02:11 storage3 kernel: 3c59x.c:v0.99H 11/17/98 Donald Becker May 22 14:02:11 storage3 kernel: tulip.c:v0.91g-ppc 7/16/99 becker@cesdis.gsfc.nasa.gov May 22 14:02:11 storage3 kernel: eth0: Digital DS21140 Tulip rev 34 at 0x9800, \ 00:00:BC:11:76:93, IRQ 5. May 22 14:02:12 storage3 kernel: eth1: Digital DS21140 Tulip rev 34 at 0x9400, \ 00:00:BC:11:76:92, IRQ 9. May 22 14:02:12 storage3 kernel: eth2: Digital DS21140 Tulip rev 34 at 0x9000, \ 00:00:BC:11:76:91, IRQ 11. May 22 14:02:12 storage3 kernel: eth3: Digital DS21140 Tulip rev 34 at 0x8800, \ 00:00:BC:11:76:90, IRQ 10. Chapter 2. Hardware Installation and Operating System Conguration 21 2.3.4. Displaying Devices Congured in the Kernel To be sure that the installed devices, including serial and network interfaces, are congured in the kernel, use the cat /proc/devices command on each member. Use this command to also determine if there is raw device support installed on the member. For example: Character devices: 1 mem 2 pty 3 ttyp 4 ttyS 5 cua 7 vcs 10 misc 19 ttyC 20 cub 128 ptm 136 pts 162 raw Block devices: 2 fd 3 ide0 8 sd 65 sd The previous example shows: ? Onboard serial ports (ttyS) ? Serial expansion card (ttyC) ? Raw devices (raw) ? SCSI devices (sd) 2.4. Setting Up and Connecting the Cluster Hardware After installing Red Hat Enterprise Linux, set up the cluster hardware components and verify the installation to ensure that the members recognize all the connected devices. Note that the exact steps for setting up the hardware depend on the type of conguration. Refer to Section 2.1 Choosing a Hardware Conguration for more information about cluster congurations. To set up the cluster hardware, follow these steps: 1. Shut down the members and disconnect them from their power source. 2. Set up the bonded Ethernet channels, if applicable. Refer to Section 2.4.1 Conguring Ethernet Channel Bonding for more information. 3. When using power switches, set up the switches and connect each member to a power switch. Refer to Section 2.4.2 Conguring Power Switches for more information. In addition, it is recommended to connect each power switch (or each member's power cord if not using power switches) to a different UPS system. Refer to Section 2.4.3 Conguring UPS Systems for information about using optional UPS systems. 4. Set up the shared disk storage according to the vendor instructions and connect the members to the external storage enclosure. Refer to Section 2.4.4 Conguring Shared Disk Storage for more information about performing this task. 22 Chapter 2. Hardware Installation and Operating System Conguration In addition, it is recommended to connect the storage enclosure to redundant UPS systems. Refer to Section 2.4.3 Conguring UPS Systems for more information about using optional UPS systems. 5. Turn on power to the hardware, and boot each cluster member. During the boot-up process, enter the BIOS utility to modify the member setup, as follows: ? Ensure that the SCSI identication number used by the HBA is unique for the SCSI bus it is attached to. Refer to Section D.5 SCSI Identication Numbers for more information about performing this task. ? Enable or disable the onboard termination for each host bus adapter, as required by the storage conguration. Refer to Section 2.4.4 Conguring Shared Disk Storage and Section D.3 SCSI Bus Termination for more information about performing this task. ? Enable the member to automatically boot when it is powered on. 6. Exit from the BIOS utility, and continue to boot each member. Examine the startup messages to verify that the Red Hat Enterprise Linux kernel has been congured and can recognize the full set of shared disks. Use the dmesg command to display console startup messages. Refer to Section 2.3.3 Displaying Console Startup Messages for more information about using the dmesg command. 7. Verify that the members can communicate over each point-to-point Ethernet connection by using the ping command to send packets over each network interface. 8. Set up the shared cluster partitions on the shared disk storage. Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions for more information about performing this task. 2.4.1. Conguring Ethernet Channel Bonding Ethernet channel bonding in a no-single-point-of-failure cluster system allows for a fault tolerant network connection by combining two Ethernet devices into one virtual device. The resulting channel bonded interface ensures that in the event that one Ethernet device fails, the other device will become active. This type of channel bonding, called an active-backup policy allows connection of both bonded devices to one switch or can allow each Ethernet device to be connected to separate hubs or switches, which eliminates the single point of failure in the network hub/switch. Channel bonding requires each cluster member to have two Ethernet devices installed. When it is loaded, the bonding module uses the MAC address of the rst enslaved network device and assigns that MAC address to the other network device if the rst device fails link detection. To congure two network devices for channel bonding, perform the following: 1. Create a bonding devices in /etc/modules.conf. For example: alias bond0 bonding options bonding miimon=100 mode=1 This loads the bonding device with the bond0 interface name, as well as passes options to the bonding driver to congure it as an active-backup master device for the enslaved network interfaces. 2. Edit the /etc/sysconfig/network-scripts/ifcfg-ethX conguration le for both eth0 and eth1 so that the les show identical contents. For example: DEVICE=ethX USERCTL=no ONBOOT=yes MASTER=bond0 SLAVE=yes BOOTPROTO=none Chapter 2. Hardware Installation and Operating System Conguration 23 This will enslave ethX (replace X with the assigned number of the Ethernet devices) to the bond0 master device. 3. Create a network script for the bonding device (for example, /etc/sysconfig/network-scripts/ifcfg-bond0), which would appear like the following example: DEVICE=bond0 USERCTL=no ONBOOT=yes BROADCAST=192.168.1.255 NETWORK=192.168.1.0 NETMASK=255.255.255.0 GATEWAY=192.168.1.1 IPADDR=192.168.1.10 4. Reboot the system for the changes to take effect. Alternatively, manually load the bonding device and restart the network. For example: /sbin/insmod /lib/modules/`uname -r`/kernel/drivers/net/bonding/bonding.o \ miimon=100 mode=1 /sbin/service network restart For more information about channel bonding, refer to the high-availability section of the Linux Ethernet Bonding Driver Mini-Howto, available in: /usr/src/linux-2.4/Documentation/networking/bonding.txt Note You must have the kernel-source package installed in order to view the Linux Ethernet Bonding Driver Mini-Howto. 2.4.2. Conguring Power Switches Power switches enable a member to power-cycle another member before restarting its services as part of the failover process. The ability to remotely disable a member ensures data integrity is maintained under any failure condition. It is recommended that production environments use power switches or watchdog timers in the cluster conguration. Only development (test) environments should use a conguration without power switches. Refer to Section 2.1.3 Choosing the Type of Power Controller for a description of the various types of power switches. Note that within this section, the general term "power switch" also includes watchdog timers. In a cluster conguration that uses physical power switches, each member's power cable is connected to a power switch through either a serial or network connection (depending on switch type). When failover occurs, a member can use this connection to power-cycle another member before restarting its services. Power switches protect against data corruption if an unresponsive (or hanging) member becomes responsive after its services have failed over, and issues I/O to a disk that is also receiving I/O from another member. In addition, if a quorum daemon fails on a member, the member is no longer able to monitor the shared cluster partitions. If power switches or watchdog timers are not used in the cluster, then this error condition may result in services being run on more than one member, which can cause data corruption and possibly system crashes. It is strongly recommended to use power switches in a cluster. However, administrators who are aware of the risks may choose to set up a cluster without power switches. 24 Chapter 2. Hardware Installation and Operating System Conguration A member may hang for a few seconds if it is swapping or has a high system workload. For this reason, adequate time is allowed prior to concluding another member has failed (typically 15 seconds). If a member determines that a hung member is down, and power switches are used in the cluster, that member power-cycles the hung member before restarting its services. Clusters congured to use watchdog timers self-reboot under most system hangs. This causes the hung member to reboot in a clean state and prevent it from issuing I/O and corrupting service data. Hung members reboot themselves either due to a watchdog ring, failure to send heartbeat packets, or ? in the case a member has no physical power switch ? loss of quorum status. Hung members may be rebooted by other members if they are attached to a power switch. If the hung member never becomes responsive and no power switches are in use, then a manual reboot is required. When used, power switches must be set up according to the vendor instructions. However, some cluster-specic tasks may be required to use a power switch in the cluster. Refer to Section D.1 Setting Up Power Controllers for detailed information on power switches (including information about watchdog timers). Be sure to take note of any caveats or functional attributes of specic power switches. Note that the cluster-specic information provided in this manual supersedes the vendor information. When cabling power switches, take special care to ensure that each cable is plugged into the appropriate outlet. This is crucial because there is no independent means for the software to verify correct cabling. Failure to cable correctly can lead to an incorrect member being power cycled, or for one member to inappropriately conclude that it has successfully power cycled another cluster member. After setting up the power switches, perform these tasks to connect them to the members: 1. Connect the power cable for each member to a power switch. 2. Connect each member to the power switch. The cable used for the connection depends on the type of power switch. Serial-attached power switches use null modem cables, while a networkattached power switches require an Ethernet patch cable. 3. Connect the power cable for each power switch to a power source. It is recommended to connect each power switch to a different UPS system. Refer to Section 2.4.3 Conguring UPS Systems for more information. After the installation of the cluster software, test the power switches to ensure that each member can power-cycle the other member before starting the cluster. Refer to Section 3.11.2 Testing the Power Switches for information. 2.4.3. Conguring UPS Systems Uninterruptible power supplies (UPS) provide a highly-available source of power. Ideally, a redundant solution should be used that incorporates multiple UPS systems (one per server). For maximal faulttolerance, it is possible to incorporate two UPS systems per server as well as APC Automatic Transfer Switches to manage the power and shutdown management of the server. Both solutions are solely dependent on the level of availability desired. It is not recommended to use a single UPS infrastructure as the sole source of power for the cluster. A UPS solution dedicated to the cluster is more exible in terms of manageability and availability. A complete UPS system must be able to provide adequate voltage and current for a prolonged period of time. While there is no single UPS to t every power requirement, a solution can be tailored to t a particular conguration. If the cluster disk storage subsystem has two power supplies with separate power cords, set up two UPS systems, and connect one power switch (or one member's power cord if not using power switches) and one of the storage subsystem's power cords to each UPS system. A redundant UPS system conguration is shown in Figure 2-3. Chapter 2. Hardware Installation and Operating System Conguration 25 Figure 2-3. Redundant UPS System Conguration An alternative redundant power conguration is to connect the power switches (or the members' power cords) and the disk storage subsystem to the same UPS system. This is the most cost-effective conguration, and provides some protection against power failure. However, if a power outage occurs, the single UPS system becomes a possible single point of failure. In addition, one UPS system may not be able to provide enough power to all the attached devices for an adequate amount of time. A single UPS system conguration is shown in Figure 2-4. Figure 2-4. Single UPS System Conguration Many vendor-supplied UPS systems include Red Hat Enterprise Linux applications that monitor the operational status of the UPS system through a serial port connection. If the battery power is low, the monitoring software initiates a clean system shutdown. As this occurs, the cluster software is properly stopped, because it is controlled by a SysV runlevel script (for example, /etc/rc.d/init.d/clumanager). Refer to the UPS documentation supplied by the vendor for detailed installation information. 2.4.4. Conguring Shared Disk Storage In a cluster, shared disk storage is used to hold service data and two partitions (primary and shadow) that store cluster state information. Because this storage must be available to all members, it cannot be 26 Chapter 2. Hardware Installation and Operating System Conguration located on disks that depend on the availability of any one member. Refer to the vendor documentation for detailed product and installation information. There are some factors to consider when setting up shared disk storage in a cluster: ? External RAID It is strongly recommended to use use RAID 1 (mirroring) to make service data and the shared cluster partitions highly available. Optionally, parity RAID can also be employed for high-availability. Do not use RAID 0 (striping) alone for shared partitions because this reduces storage availability. ? Multi-initiator SCSI congurations Multi-initiator SCSI congurations are not supported due to the difculty in obtaining proper bus termination. ? The Red Hat Enterprise Linux device name for each shared storage device must be the same on each member. For example, a device named /dev/sdc on one member must be named /dev/sdc on the other cluster members. Using identical hardware for all members usually ensures that these devices are named the same. ? A disk partition can be used by only one cluster service. ? Do not include any le systems used in a cluster service in the member's local /etc/fstab les, because the cluster software must control the mounting and unmounting of service le systems. ? For optimal performance of shared le systems, make sure to specify a 4 KB block size with the -b option to mke2fs. A smaller block size can cause long fsck times. Refer to Section 2.4.4.6 Creating File Systems. The following list details parallel SCSI requirements, and must be adhered to when parallel SCSI buses are employed in a cluster environment: ? SCSI buses must be terminated at each end, and must adhere to length and hot plugging restrictions. ? Devices (disks, host bus adapters, and RAID controllers) on a SCSI bus must have a unique SCSI identication number. Refer to Section D.2 SCSI Bus Conguration Requirements for more information. It is strongly recommended to connect the storage enclosure to redundant UPS systems for a highlyavailable source of power. Refer to Section 2.4.3 Conguring UPS Systems for more information. Refer to Section 2.4.4.1 Setting Up a Single-initiator SCSI Bus and Section 2.4.4.2 Setting Up a Fibre Channel Interconnect for more information about conguring shared storage. After setting up the shared disk storage hardware, partition the disks and then either create le systems or raw devices on the partitions. Two raw devices must be created for the primary and the shadow shared cluster partitions. Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions, Section 2.4.4.4 Partitioning Disks, Section 2.4.4.5 Creating Raw Devices, and Section 2.4.4.6 Creating File Systems for more information. 2.4.4.1. Setting Up a Single-initiator SCSI Bus A single-initiator SCSI bus has only one member connected to it, and provides host isolation and better performance than a multi-initiator bus. Single-initiator buses ensure that each member is protected from disruptions due to the workload, initialization, or repair of the other members. When using a single- or dual-controller RAID array that has multiple host ports and provides simultaneous access to all the shared logical units from the host ports on the storage enclosure, the setup of the single-initiator SCSI buses to connect each cluster member to the RAID array is possible. If a logical unit can fail over from one controller to the other, the process must be transparent to the Chapter 2. Hardware Installation and Operating System Conguration 27 operating system. Note that some RAID controllers restrict a set of disks to a specic controller or port. In this case, single-initiator bus setups are not possible. A single-initiator bus must adhere to the requirements described in Section D.2 SCSI Bus Conguration Requirements. To set up a single-initiator SCSI bus conguration, the following is required: ? Enable the on-board termination for each host bus adapter. ? Enable the termination for each RAID controller. ? Use the appropriate SCSI cable to connect each host bus adapter to the storage enclosure. Setting host bus adapter termination is usually done in the adapter BIOS utility during member boot. To set RAID controller termination, refer to the vendor documentation. Figure 2-5 shows a conguration that uses two single-initiator SCSI buses. Figure 2-5. Single-initiator SCSI Bus Conguration Figure 2-6 shows the termination in a single-controller RAID array connected to two single-initiator SCSI buses. Figure 2-6. Single-controller RAID Array Connected to Single-initiator SCSI Buses Figure 2-7 shows the termination in a dual-controller RAID array connected to two single-initiator SCSI buses. 28 Chapter 2. Hardware Installation and Operating System Conguration Figure 2-7. Dual-controller RAID Array Connected to Single-initiator SCSI Buses 2.4.4.2. Setting Up a Fibre Channel Interconnect Fibre Channel can be used in either single-initiator or multi-initiator congurations. A single-initiator Fibre Channel interconnect has only one member connected to it. This may provide better host isolation and better performance than a multi-initiator bus. Single-initiator interconnects ensure that each member is protected from disruptions due to the workload, initialization, or repair of the other member. If employing a RAID array that has multiple host ports, and the RAID array provides simultaneous access to all the shared logical units from the host ports on the storage enclosure, set up single-initiator Fibre Channel interconnects to connect each member to the RAID array. If a logical unit can fail over from one controller to the other, the process must be transparent to the operating system. Figure 2-8 shows a single-controller RAID array with two host ports and the host bus adapters connected directly to the RAID controller, without using Fibre Channel hubs or switches. When using this type of single-initiator Fibre Channel connection, your RAID controller must have a separate host port for each cluster member. Chapter 2. Hardware Installation and Operating System Conguration 29 Figure 2-8. Single-controller RAID Array Connected to Single-initiator Fibre Channel Interconnects The external RAID array must have a separate SCSI channel for each cluster member. In clusters with more than two members, connect each member to a different SCSI channel on the RAID array, using a single-initiator SCSI bus as shown in Figure 2-8. To connect multiple cluster members to the same host port on the RAID array, use an FC hub or switch. In this case, each HBA is connected to the hub or switch, and the hub or switch is connected to a host port on the RAID controller. A Fibre Channel hub or switch is also required with a dual-controller RAID array with two host ports on each controller. This conguration is shown in Figure 2-9. Additional cluster members may be connected to either Fibre Channel hub or switch shown in the diagram. Some RAID arrays include a built-in hub so that each host port is already connected to each of the internal RAID controllers. In this case, an additional external hub or switch may not be needed. 30 Chapter 2. Hardware Installation and Operating System Conguration Figure 2-9. Dual-controller RAID Array Connected to Single-initiator Fibre Channel Interconnects 2.4.4.3. Conguring Shared Cluster Partitions Two raw devices on shared disk storage must be created for the primary shared partition and the shadow shared partition. Each shared partition must have a minimum size of 10 MB. The amount of data in a shared partition is constant; it does not increase or decrease over time. The shared partitions are used to hold cluster state information, including the following: ? Cluster lock states ? Service states ? Conguration information Periodically, each member writes the state of its services to shared storage. In addition, the shared partitions contain a version of the cluster conguration le. This ensures that each member has a common view of the cluster conguration. If the primary shared partition is corrupted, the cluster members read the information from the shadow (or backup) shared partition and simultaneously repair the primary partition. Data consistency is maintained through checksums, and any inconsistencies between the partitions are automatically corrected. If a member is unable to write to both shared partitions at startup time, it is not allowed to join the cluster. In addition, if an active member can no longer write to both shared partitions, the member removes itself from the cluster by rebooting (and may be remotely power cycled by a healthy member). The following are shared partition requirements: ? Both partitions must have a minimum size of 10 MB. ? Shared partitions must be raw devices. They cannot contain le systems. ? Shared partitions can be used only for cluster state and conguration information. The following are recommended guidelines for conguring the shared partitions: Chapter 2. Hardware Installation and Operating System Conguration 31 ? It is strongly recommended to set up a RAID subsystem for shared storage, and use RAID 1 (mirroring) to make the logical unit that contains the shared partitions highly available. Optionally, parity RAID can be used for high availability. Do not use RAID 0 (striping) alone for shared partitions. ? Place both shared partitions on the same RAID set, or on the same disk if RAID is not employed, because both shared partitions must be available for the cluster to run. ? Do not put the shared partitions on a disk that contains heavily-accessed service data. If possible, locate the shared partitions on disks that contain service data that is rarely accessed. Refer to Section 2.4.4.4 Partitioning Disks and Section 2.4.4.5 Creating Raw Devices for more information about setting up the shared partitions. Refer to Section 3.5 Editing the rawdevices File for information about editing the rawdevices le to bind the raw character devices to the block devices each time the members boot. 2.4.4.4. Partitioning Disks After shared disk storage hardware has been set up, partition the disks so they can be used in the cluster. Then, create le systems or raw devices on the partitions. For example, two raw devices must be created for the shared partitions using the guidelines described in Section 2.4.4.3 Conguring Shared Cluster Partitions. Use parted to modify a disk partition table and divide the disk into partitions. While in parted, use the p to display the partition table and the mkpart command to create new partitions. The following example shows how to use parted to create a partition on disk: ? Invoke parted from the shell using the command parted and specifying an available shared disk device. At the (parted) prompt, use the p to display the current partition table. The output should be similar to the following: Disk geometry for /dev/sda: 0.000-4340.294 megabytes Disk label type: msdos Minor Start End Type Filesystem Flags ? Decide on how large of a partition is required. Create a partition of this size using the mkpart command in parted. Although the mkpart does not create a le system, it normally requires a le system type at partition creation time. parted uses a range on the disk to determine partition size; the size is the space between the end and the beginning of the given range. The following example shows how to create two partitions of 20 MB each on an empty disk. (parted) mkpart primary ext3 0 20 (parted) mkpart primary ext3 20 40 (parted) p Disk geometry for /dev/sda: 0.000-4340.294 megabytes Disk label type: msdos Minor Start End Type Filesystem Flags 1 0.030 21.342 primary 2 21.343 38.417 primary ? When more than four partitions are required on a single disk, it is necessary to create an extended partition. If an extended partition is required, the mkpart also performs this task. In this case, it is not necessary to specify a le system type. Note Only one extended partition may be created, and the extended partition must be one of the four primary partitions. (parted) mkpart extended 40 2000 (parted) p 32 Chapter 2. Hardware Installation and Operating System Conguration Disk geometry for /dev/sda: 0.000-4340.294 megabytes Disk label type: msdos Minor Start End Type Filesystem Flags 1 0.030 21.342 primary 2 21.343 38.417 primary 3 38.417 2001.952 extended ? An extended partition allows the creation of logical partitionsinside of it. The following example shows the division of the extended partition into two logical partitions. (parted) mkpart logical ext3 40 1000 (parted) p Disk geometry for /dev/sda: 0.000-4340.294 megabytes Disk label type: msdos Minor Start End Type Filesystem Flags 1 0.030 21.342 primary 2 21.343 38.417 primary 3 38.417 2001.952 extended 5 38.447 998.841 logical (parted) mkpart logical ext3 1000 2000 (parted) p Disk geometry for /dev/sda: 0.000-4340.294 megabytes Disk label type: msdos Minor Start End Type Filesystem Flags 1 0.030 21.342 primary 2 21.343 38.417 primary 3 38.417 2001.952 extended 5 38.447 998.841 logical 6 998.872 2001.952 logical ? A partition may be removed using parted's rm command. For example: (parted) rm 1 (parted) p Disk geometry for /dev/sda: 0.000-4340.294 megabytes Disk label type: msdos Minor Start End Type Filesystem Flags 2 21.343 38.417 primary 3 38.417 2001.952 extended 5 38.447 998.841 logical 6 998.872 2001.952 logical ? After all required partitions have been created, exit parted using the quit command. If a partition was added, removed, or changed while both members are powered on and connected to the shared storage, reboot the other member for it to recognize the modications. After partitioning a disk, format the partition for use in the cluster. For example, create the le systems or raw devices for shared partitions. Refer to Section 2.4.4.5 Creating Raw Devices and Section 2.4.4.6 Creating File Systems for more information. For basic information on partitioning hard disks at installation time, refer to the Red Hat Enterprise Linux Installation Guide. 2.4.4.5. Creating Raw Devices After partitioning the shared storage disks, create raw devices on the partitions. File systems are block devices (for example, /dev/sda1) that cache recently-used data in memory to improve performance. Raw devices do not utilize system memory for caching. Refer to Section 2.4.4.6 Creating File Systems for more information. Red Hat Enterprise Linux supports raw character devices that are not hard-coded against specic block devices. Instead, Red Hat Enterprise Linux uses a character major number (currently 162) to implement a series of unbound raw devices in the /dev/raw/ directory. Any block device can have a character raw device front-end, even if the block device is loaded later at run time. Chapter 2. Hardware Installation and Operating System Conguration 33 To create a raw device, edit the /etc/sysconfig/rawdevices le to bind a raw character device to the appropriate block device to enable the raw device to be opened, read, and written. Shared partitions and some database applications require raw devices, because these applications perform their own buffer caching for performance purposes. Shared partitions cannot contain le systems because if state data was cached in system memory, the members would not have a consistent view of the state data. Raw character devices must be bound to block devices each time a member boots. To ensure that this occurs, edit the /etc/sysconfig/rawdevices le and specify the shared partition bindings. If using a raw device in a cluster service, use this le to bind the devices at boot time. Refer to Section 3.5 Editing the rawdevices File for more information. After editing /etc/sysconfig/rawdevices, the changes take effect either by rebooting or by execute the following command: service rawdevices restart Query all the raw devices by using the command raw -aq. The output should be similar to the following: /dev/raw/raw1 bound to major 8, minor 17 /dev/raw/raw2 bound to major 8, minor 18 Note that, for raw devices, no cache coherency exists between the raw device and the block device. In addition, requests must be 512-byte aligned both in memory and on disk. For example, the standard dd command cannot be used with raw devices because the memory buffer that the command passes to the write system call is not aligned on a 512-byte boundary. For more information on using the raw command, refer to the raw(8) man page. Note The same raw device names (for example, /dev/raw/raw1 and /dev/raw/raw2) must be used on all cluster members. 2.4.4.6. Creating File Systems Use the mkfs command to create an ext3 le system. For example: mke2fs -j -b 4096 /dev/sde3 For optimal performance of shared le systems, make sure to specify a 4 KB block size with the -b option to mke2fs. A smaller block size can cause long fsck times. 34 Chapter 2. Hardware Installation and Operating System Conguration Chapter 3. Cluster Conguration After installing and conguring the cluster hardware, the cluster system software and cluster conguration software can be installed. 3.1. Installing the Red Hat Cluster Suite Packages The clumanager and redhat-config-cluster packages are required to congure the Red Hat Cluster Manager. Perform the following instructions to install the Red Hat Cluster Suite on your Red Hat Enterprise Linux system. 3.1.1. Installation with the Package Management Tool 1. Insert the Red Hat Cluster Suite CD in your CD-ROM drive. If you are using a graphical desktop, the CD will automatically run the Package Management Tool. Click Forward to continue. Figure 3-1. Package Management Tool 2. Check the box for the Red Hat Cluster Suite, and click the Details link to the package descriptions. 36 Chapter 3. Cluster Conguration Figure 3-2. Choosing Package Group 3. While viewing the package group details, check the box next to the packages to install. Click Close when nished. Figure 3-3. Choosing Packages for Installation 4. The Package Management Tool shows an overview of the packages to be installed. Click Forward to install the packages. 5. When the installation is complete, click Finish to exit the Package Management Tool. 3.1.2. Installation with rpm If you are not using a graphical desktop environment, you can install the packages manually using the rpm utility at a shell prompt. Chapter 3. Cluster Conguration 37 Insert the Red Hat Cluster Suite CD into the CD-ROM drive. Log into a shell prompt, change to the RedHat/RPMS/ directory on the CD, and type the following commands as root (replacing  version  and  arch  with the version and architecture of the packages to install): rpm --Uvh clumanager-  version .  arch .rpm rpm --Uvh redhat-config-cluster-  version .noarch.rpm 3.2. Installation Notes for Red Hat Enterprise Linux 2.1 Users Warning Make sure the cluster service is not running on any members before performing the following procedure. If you are currently running cluster services on Red Hat Enterprise Linux 2.1 and you want to install Red Hat Enterprise Linux 3 on your system while preserving your cluster conguration, perform the following steps on the Red Hat Enterprise Linux 2.1 system to backup the cluster conguration before installing Red Hat Enterprise Linux 3: 1. Stop the cluster service by invoking the command service cluster stop on all cluster members. 2. Remove the cluster database le (/etc/cluster.conf) from all members, saving a copy with a different name (for example, /etc/cluster.conf.sav) to a separate running system or portable storage such as a diskette. 3. Reinstall the Red Hat Enterprise Linux 2.1 system with Red Hat Enterprise Linux 3 on all members. 4. Install the Red Hat Cluster Suite using the method described in Section 3.1 Installing the Red Hat Cluster Suite Packages. 5. On all members, install the applications to be managed as services by the cluster software. 6. Restore the cluster database le you saved in step 2 to one member, naming the le /etc/cluster.conf. 7. Invoke the /usr/sbin/cluster-convert command to convert the /etc/cluster.conf to /etc/cluster.xml. 8. Congure shared storage on the cluster member by running /usr/sbin/shutil -i which initializes the shared storage. Then run /usr/sbin/shutil -s /etc/cluster.xml to read the shared storage information from the le and write it to the shared state. 9. Start the cluster service on this member using the command /sbin/service clumanager start. 10. Copy the /etc/cluster.xml le to the other member using the scp command. 11. Start the cluster service on the other members. 38 Chapter 3. Cluster Conguration 3.3. The Cluster Conguration Tool Red Hat Cluster Manager consists of the following RPM packages: ? clumanager ? This package consists of the software that is responsible for cluster operation (including the cluster daemons). ? redhat-config-cluster ? This package contains the Cluster Conguration Tool and the Cluster Status Tool, which allow for the conguration of the cluster and the display of the current status of the cluster and its members and services. You can use either of the following methods to access the Cluster Conguration Tool: ? Select Main Menu => System Settings => Server Settings => Cluster. ? At a shell prompt, type the redhat-config-cluster command. The rst time that the application is started, the Cluster Conguration Tool is displayed. After you complete the cluster conguration, the command starts the Cluster Status Tool by default. To access the Cluster Conguration Tool from the Cluster Status Tool, select Cluster => Congure. Figure 3-4. The Cluster Conguration Tool The following tabbed sections are available within the Cluster Conguration Tool: ? Members ? Use this section to add members to the cluster and optionally congure a power controller connection for any given member. ? Failover Domains ? Use this section to establish one or more subsets of the cluster members for specifying which members are eligible to run a service in the event of a system failure. (Note that the use of failover domains is optional.) ? Services ?Use this section to congure one or more services to be managed by the cluster. As you specify an application service, the relationship between the service and its IP address, device special le, mount point, and NFS exports is represented by a hierarchical structure. The parent-child relationships in the Cluster Conguration Tool reect the organization of the service information in the /etc/cluster.xml le. Chapter 3. Cluster Conguration 39 Warning Do not manually edit the contents of the /etc/cluster.xml le. Do not simultaneously run the Cluster Conguration Tool on multiple members. (It is permissible to run the Cluster Status Tool on more than one member at a time.) The Cluster Conguration Tool stores information about the cluster service and daemons, cluster members, and cluster services in the /etc/cluster.xml conguration le. The cluster conguration le is created the rst time the Cluster Conguration Tool is started. Save the conguration at any point (using File => Save) while running the Cluster Conguration Tool. When File => Quit is selected, it prompts you to save changes if any unsaved changes to the conguration are detected. Note When you save the cluster conguration for the rst time using the Cluster Conguration Tool and exit the application, the next time you the cluster (either by choosing Main Menu => System Settings => Server Settings => Cluster or by running redhat-config-cluster from a shell prompt) the Cluster Status Tool will display by default. The Cluster Status Tool displays the status of the cluster service, cluster members, and application services, and shows statistics concerning service operation. If you need to further congure the cluster system, choose Cluster => Congure from the Cluster Status Tool menu. The Cluster Conguration Tool) is used to congure cluster members, services, and cluster daemons. The Cluster Status Tool is used to monitor, start and stop the cluster members and services on particular members, and move an application service to another member. The Cluster Conguration Tool uses a hierarchical tree structure to show relationships between components in the cluster conguration. A triangular icon to the left of a component name indicates that the component has children. To expand or collapse the portion of the tree below a component, click the triangle icon. Figure 3-5. Cluster Conguration Tool 40 Chapter 3. Cluster Conguration To display the properties of a component, select the component and click Properties. As a shortcut, you can display the properties of a component by double-clicking on the component name. To view or modify the properties of the cluster daemons, choose Cluster => Daemon Properties. 3.4. Conguring the Cluster Software Before describing the steps to congure the cluster software, the hierarchical structure of cluster members and services needs to be considered. Cluster members and services can be thought of in terms of a parent/child tree structure, as shown in Figure 3-6. Figure 3-6. Cluster Conguration Structure Each member can have power controller children. Each service can have IP address children. Each Service can also have device children. Each device can have NFS export directory children. Each NFS export directory can have client children. This structure is reected in the /etc/cluster.xml le syntax. The steps to congure the cluster software consist of the following: 1. Edit the /etc/sysconfig/rawdevicesle on all cluster members and specify the raw device special les and character devices for the primary and backup shared partitions as described in Section 2.4.4.3 Conguring Shared Cluster Partitions and Section 3.5 Editing the rawdevices File. 2. Run the Cluster Conguration Tool on one cluster member. 3. Enter a name for the cluster in the Cluster Name eld. The name should be descriptive enough to distinguish it from other clusters and systems on your network (for example, nfs_cluster or httpd_cluster). 4. Choose Cluster => Shared State, and conrm that the Raw Primary and Raw Shadow settings match the settings specied in step 1. The default for Raw Primary is /dev/raw/raw1. The default for Raw Shadow is /dev/raw/raw2. 5. Choose Cluster => Daemon Properties to edit the daemon properties. Each daemon has its own tab. Update the daemon options to suit your preferences and operating environment, and click OK when done. Refer to Section 3.6 Conguring Cluster Daemons for more details. Chapter 3. Cluster Conguration 41 6. Add member systems to the cluster by selecting the Members tab and clicking the New button. Refer to Section 3.7 Adding and Deleting Members for details. If the hardware conguration includes power controllers (power switches), then for each member connected to a power controller, congure that member's connection to the power controller by selecting the member and clicking Add Child. Refer to Section 3.8 Conguring a Power Controller Connection for more information. 7. Set up one or more failover domains, if needed, to restrict the members on which a service can run or restrict the order of members followed when a service fails over from one failover domain member to another (or both). Refer to Section 3.9 Conguring a Failover Domain for details. 8. Congure the application services to be managed by the cluster, specifying IP addresses, failover domain (if applicable) and user script that manages the service. Refer to Section 3.10 Adding a Service to the Cluster for more information. 9. Save cluster conguration changes by selecting File => Save. When you save the cluster con- guration, the command service clumanager reload command is executed to cause the cluster software to load the changed conguration le. 10. Quit the application by selecting File => Quit. Running the Cluster Conguration Tool for the rst time causes the cluster conguration le /etc/cluster.xml to be created automatically. When quiting after running it for the rst time, you are prompted to Press 'OK' to initialize Shared Storage, which uses the shared partitions to pass quorum and service state information between cluster members. Click OK to initialize the shared storage. Note Shared storage must be initialized before starting the cluster service for the rst time, or the cluster will not run properly. 11. After completing the cluster software conguration on one member, copy the conguration le /etc/cluster.xml to all other cluster members. The scp can be used to copy les from one host to another. 3.5. Editing the rawdevices File The /etc/sysconfig/rawdevices le is used to map the raw devices for the shared partitions each time a cluster member boots. As part of the cluster software installation procedure, edit the rawdevices le on each member and specify the raw character devices and block devices for the primary and backup shared partitions. This must be done prior to running the Cluster Conguration Tool. If raw devices are employed in a cluster service, the rawdevices le is also used to bind the devices at boot time. Edit the le and specify the raw character devices and block devices that you want to bind each time the member boots. To make the changes to the rawdevices le take effect without requiring a reboot, perform the following command: /sbin/service rawdevices restart The following is an example rawdevices le that designates two shared partitions: # raw device bindings # format: rawdev major minor # rawdev  blockdev # example: /dev/raw/raw1 /dev/s da1 42 Chapter 3. Cluster Conguration # /dev/raw/raw2 8 5 /dev/raw/raw1 /dev/hda5 /dev/raw/raw2 /dev/hda6 Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions for more information about setting up the shared partitions. Refer to Section 2.4.4.5 Creating Raw Devices for more information on using the raw command to bind raw character devices to block devices. Note The rawdevices conguration must be performed on all cluster members, and all members must use the same raw devices (from the previous example, /dev/raw/raw1 and/dev/raw/raw2). To check raw device conguration on the current cluster member, choose Cluster => Shared State in the Cluster Conguration Tool. The Shared State dialog is displayed, as shown in Figure 3-7. Figure 3-7. Verifying Shared State Conguration 3.6. Conguring Cluster Daemons The Red Hat Cluster Manager provides the following daemons to monitor cluster operation: ? cluquorumd? Quorum daemon ? clusvcmgrd? Service manager daemon ? clurmtabd?Synchronizes NFS mount entries in /var/lib/nfs/rmtab with a private copy on a service's mount point ? clulockd? Global lock manager (the only client of this daemon is clusvcmgrd) ? clumembd? Membership daemon Each of these daemons can be individually congured using the Cluster Conguration Tool. To access the Cluster Daemon Properties dialog box, choose Cluster => Daemon Properties. The following sections explain how to congure cluster daemon properties. However, note that the default values are applicable to most congurations and do not need to be changed. 3.6.1. Conguring clumembd On each cluster system, the clumembd daemon issues heartbeats (pings) across the point-to-point Ethernet lines to which the cluster members are connected. Chapter 3. Cluster Conguration 43 Figure 3-8. Conguring clumembd Note You can enable both broadcast heartbeating and multicast heartbeating, but at least one of these features must be used. Multicast heartbeating over a channel-bonded Ethernet interface provides good fault tolerance and is recommended for availability. You can specify the following properties for the clumembd daemon: ? Log Level ? Determines the level of event messages that get logged to the cluster log le (by default /var/log/messages). Choose the appropriate logging level from the menu. Refer to Section 3.12 Conguring syslogd Event Logging for more information. ? Failover Speed?Determines the number of seconds that the cluster service waits before shutting down a non-responding member (that is, a member from which no heartbeat is detected). To set the failover speed, drag the slider bar. The default failover speed is 10 seconds. Note Setting a faster failover speed increases the likelihood of false shutdowns. ? Heartbeating ? Enable Broadcast Heartbeating or Enable Multicast Heartbeating by clicking the corresponding radio button. Broadcast heartbeating species that the broadcast IP address is to be used by the clumembd daemon when emitting heartbeats. By default, the clumembd is congured to emit heartbeats via multicast. Multicast uses the network interface associated with the member's hostname for transmission of heartbeats. ? Multicast IP Address ? Species the IP address to be used by the clumembd daemon over the multicast channel. This eld is not editable if Enable Broadcast Heartbeating is checked. The default multicast IP address used by the cluster is 225.0.0.11. 3.6.2. Conguring cluquorumd In a two-member cluster without a specied tiebreaker IP address, the cluquorumd daemon periodically writes a time-stamp and system status to a specic area on the primary and shadow shared 44 Chapter 3. Cluster Conguration partitions. The daemon also reads the other member's timestamp and system status information from the primary shared partition or, if the primary partition is corrupted, from the shadow partition. Figure 3-9. Conguring cluquorumd You can specify the following properties for the cluquorumd daemon: ? Log Level ? Determines the level of event messages that get logged to the cluster log le (by default /var/log/messages). Choose the appropriate logging level from the menu. Refer to Section 3.12 Conguring syslogd Event Logging for more information. ? Ping Interval or Tiebreaker IP ? Ping Interval is used for a disk-based heartbeat; species the number of seconds between the quorum daemon's updates to its on-disk status. Tiebreaker IP is a network-based heartbeat used to determine quorum, which is the ability to run services. The tiebreaker IP address is only checked when the cluster has an even split in a twoor four-member cluster. This IP address should be associated with a router that, during normal operation, can be reached by all members over the Ethernet interface used by the cluster software. 3.6.3. Conguring clurmtabd The clurmtabd daemon synchronizes NFS mount entries in /var/lib/nfs/rmtab with a private copy on a service's mount point. The clurmtabd daemon runs only when a service with NFS exports is running. Figure 3-10. Conguring clurmtabd You can specify the following properties for the clurmtabd daemon: Chapter 3. Cluster Conguration 45 ? Log Level ? Determines the level of event messages that get logged to the cluster log le (by default, /var/log/messages). Choose the appropriate logging level from the menu. See Section 3.12 Conguring syslogd Event Logging for more information. ? Poll Interval?Species the number of seconds between polling cycles to synchronize the local NFS rmtab to the cluster rmtab on shared storage. 3.6.4. Conguring the clusvcmgrd daemon On each cluster system, the clusvcmgrd service manager daemon responds to changes in cluster membership by stopping and starting services. You might notice, at times, that more than one clusvcmgrd process is running; separate processes are spawned for start, stop, and monitoring operations. Figure 3-11. Conguring clusvcmgrd You can specify the following properties for the clusvcmgrd daemon: ? Log Level ? Determines the level of event messages that get logged to the cluster log le (by default, /var/log/messages). Choose the appropriate logging level from the menu. See Section 3.12 Conguring syslogd Event Logging for more information. 3.6.5. Conguring clulockd The clulockd daemon manages the locks on les being accessed by cluster members. Figure 3-12. Conguring clulockd 46 Chapter 3. Cluster Conguration You can specify the following properties for the clulockd daemon: ? Log Level ? Determines the level of event messages that get logged to the cluster log le (by default, /var/log/messages). Choose the appropriate logging level from the menu. See Section 3.12 Conguring syslogd Event Logging for more information. 3.7. Adding and Deleting Members The procedure to add a member to a cluster varies slightly, depending on whether the cluster is already running or is a newly-congured cluster. 3.7.1. Adding a Member to a New Cluster To add a member to a new cluster, follow these steps: Figure 3-13. Adding a Member to a New Cluster 1. Ensure that the Members tab is selected and click New. It prompts for a member name. 2. Enter the name or address of a system on the cluster subnet. Note that each member must be on the same subnet as the system on which you are running the Cluster Conguration Tool and must be dened either in DNS or in each cluster system's /etc/hosts le The system on which you are running the Cluster Conguration Tool must be explicitly added as a cluster member; the system is not automatically added to the cluster conguration as a result of running the Cluster Conguration Tool. 3. Leave Enable SWWatchdog checked. (A software watchdog timer enables a member to reboot itself.) 4. Click OK. 5. Choose File => Save to save the changes to the cluster conguration. 3.7.2. Adding a Member to a Running Cluster To add a member to an existing cluster that is currently in operation, follow these steps: 1. Ensure that the cluster service is not running on the new member by invoking the /sbin/service clumanager status command. Invoke the /sbin/service clumanager stop command to stop the cluster service. 2. Ensure that the cluster service is running on all active cluster members. Run /sbin/service clumanager start to start the service on the existing cluster members. Chapter 3. Cluster Conguration 47 3. On one of the running members, invoke the Cluster Conguration Tool to add the new member. Ensure that the Members tab is selected and click New. 4. It prompts for a member name. Enter the name of the new member. Note that each member must be on the same subnet as the system on which you are running the Cluster Conguration Tool and must be dened in DNS or in each cluster member's /etc/hosts le.. 5. Leave Enable SWWatchdog checked. (A software watchdog timer enables a member to reboot itself.) 6. Click OK. 7. Choose File => Save to save the changes to the cluster conguration. 8. Copy the updated /etc/cluster.xml le (containing the newly-added member) to the new member. 9. Invoke the service clumanager start command to start the cluster service on the new member. 3.7.3. Deleting a Member from a Running Cluster To delete a member from an existing cluster that is currently in operation, follow these steps: 1. Ensure that the cluster service is not running on the member to be deleted. Invoke the service clumanager stop command to stop the cluster service. 2. On one of the running members, invoke the Cluster Conguration Tool to delete the new member, as follows: a. Remove the member from any failover domains of which it is a member (see Section 3.9 Conguring a Failover Domain). b. Select the Members tab. c. Select the member to be deleted and click Delete. d. Click Yes to conrm deletion. Figure 3-14. Deleting a Member 3. Choose File => Save to save the changes to the cluster conguration. 3.8. Conguring a Power Controller Connection To ensure data integrity, only one member can run a service and access service data at one time. The use of power controllers (also called power switches) in the cluster hardware conguration enables a member to power-cycle another member before restarting that member's services during the failover process. This prevents more than one system from simultaneously accessing the same data and corrupting it. Although not required, it is recommended that you use power controllers to guarantee data 48 Chapter 3. Cluster Conguration integrity under all failure conditions. Watchdog timers are an optional variety of power control to ensure correct operation of service failover. If the hardware conguration for the cluster includes one or more power controllers, congure the connection to a power controller for any given member as follows: 1. Select the member for which you want to congure a power controller connection and click Add Child. The Power Controller dialog box is displayed as shown in Figure 3-15. Figure 3-15. Conguring Power Controllers 2. Specify the following information depending on whether the controller is connected to the member through a serial port or through the network. Note that serial connections to power controllers are valid only in a two-member cluster. Field Description Type Type of serial power controller (only the rps10 is supported in Red Hat Enterprise Linux 3). Device Physical device on the power controller's parent member that the power controller is plugged into; must be one of /dev/ttyS0 through /dev/ttyS9. Port Port number on the power controller itself that is attached to the specied device. Owner The member that controls, or owns, the device. The owner is the member that can power off this device. Automatically defaults to the name of the other member. This eld cannot be edited. Table 3-1. Conguring a Serial Power Controller Field Description Type Type of network power controller; choose from the menu containing all supported types. Chapter 3. Cluster Conguration 49 Field Description IP Address IP of the network power controller itself. Port Specic port on the power controller that this member is attached to. User Username to be used when logging in to the power controller. Password Password to be used when logging in to the power controller. Table 3-2. Conguring a Network Power Controller Users of Red Hat GFS can choose the Grand Unied Lock Manager (GULM) driver as their power controller by clicking the GULM STONITH radio button. The GULM driver is used in conjunction with the Red Hat Cluster Manager failover subsystem and features performance gains in the detection and fencing initialization of hung or failed cluster members. For more information about Red Hat GFS refer to the Red Hat GFS Administrator's Guide. 3. Click OK. 4. Choose File => Save to save the changes to the cluster conguration. 3.9. Conguring a Failover Domain A failover domain is a named subset of cluster members that are eligible to run a service in the event of a system failure. A failover domain can have the following characteristics: ? Unrestricted ? Allows you to specify that a subset of members are preferred, but that a service assigned to this domain can run on any available member. ? Restricted ? Allows you to restrict the members that can run a particular service. If none of the members in a restricted failover domain are available, the service cannot be started (either manually or by the cluster software). ? Unordered ? When a service is assigned to an unordered failover domain, the member on which the service runs is chosen from the available failover domain members with no priority ordering. ? Ordered ?Allows you to specify a preference order among the members of a failover domain. The member at the top of the list is the most preferred, followed by the second member in the list, and so on. By default, failover domains are unrestricted and unordered. In a cluster with several members, using a restricted failover domain can minimize the work to set up the cluster to run a service (such as httpd, which requires you to set up the conguration identically on all members that run the service). Instead of setting up the entire cluster to run the service, you must set up only the members in the restricted failover domain that you associate with the service. Tip To implement the concept of a preferred member, create an unrestricted failover domain comprised of only one cluster member. By doing this, a service runs on the preferred member; in the event of a failure, the service fails over to any of the other members. To add a failover domain to the cluster software conguration, follow these steps: 1. Select the Failover Domains tab and click New. The Failover Domain dialog box is displayed as shown in Figure 3-16. 50 Chapter 3. Cluster Conguration Figure 3-16. Conguring a Failover Domain 2. Enter a name for the domain in the Domain Name eld. The name should be descriptive enough to distinguish its purpose relative to other names used on your network. 3. Check Restrict failover to only these members to prevent any member other than those listed from taking over a service assigned to this domain. 4. Check Ordered Failover if you want members to assume control of a failed service in a particular sequence; preference is indicated by the member's position in the list of members in the domain, with the most preferred member at the top. 5. Click Add Members to select the members for this failover domain. The Failover Domain Member dialog box is displayed. Figure 3-17. Failover Domain Member You can choose multiple members from the list by pressing either the [Shift] key while clicking the start and end of a range of members, or pressing the [Ctrl] key while clicking on noncontiguous members. Chapter 3. Cluster Conguration 51 6. When nished selecting members from the list, click OK. The selected members are displayed on the Failover Domain list. 7. When Ordered Failover is checked, you can rearrange the order of the members in the domain by dragging the member name in the list box to the desired position. A thin, black line is displayed to indicate the new row position (when you release the mouse button). 8. When nished, click OK. 9. Choose File => Save to save the changes to the cluster conguration. To remove a member from a failover domain, follow these steps: 1. On the Failover Domains tab, double-click the name of the domain you want to modify (or select the domain and click Properties). 2. In the Failover Domain dialog box, click the name of the member you want to remove from the domain and click Delete Member. (Members must be deleted one at a time.) You are prompted to conrm the deletion. 3. When nished, click OK. 4. Choose File => Save to save the changes to the cluster conguration. 3.10. Adding a Service to the Cluster To add a service to the cluster, follow these steps: 1. Select the Services tab and click New. The Service dialog is displayed as shown in Figure 3-18. Figure 3-18. Adding a Service 2. Give the service a descriptive Service Name to distinguish its functionality relative to other services that may run on the cluster. 3. If you want to restrict the members on which this service is able to run, choose a failover domain from the Failover Domain list. (Refer to Section 3.9 Conguring a Failover Domain for instructions on how to congure a failover domain.) 4. Adjust the quantity in the Check Interval eld, which sets the interval (in seconds) that the cluster infrastructure checks the status of a service. This eld is only applicable if the service script is written to check the status of a service. 5. Specify a User Script that contains settings for starting, stopping, and checking the status of a service. 6. Specify service properties, including an available oating IP address (an address that can be transferred transparently from a failed member to a running member in the event of failover) 52 Chapter 3. Cluster Conguration and devices (which are congured as children of the service). For instructions, refer to Section 3.10.1 Adding a Service IP Address and Section 3.10.2 Adding a Service Device. 7. When nished, click OK. 8. Choose File => Save to save the changes to the cluster conguration. The following sections describe service conguration in more detail. 3.10.1. Adding a Service IP Address To specify a service IP address, follow these steps: 1. On the Services tab of the Cluster Conguration Tool, select the service you want to congure and click Add Child. 2. Select Add Service IP Address and click OK. 3. Specify an IP address (which must be resolvable by DNS but cannot be the IP address of a running service). 4. Optionally specify a netmask and broadcast IP address. 5. Choose File => Save to save the change to the /etc/cluster.xml conguration le. 3.10.2. Adding a Service Device To specify a device for a service, follow these steps: 1. On the Services tab of the Cluster Conguration Tool, select the service you want to congure and click Add Child. 2. Select Add Device and click OK. 3. Specify a Device Special File (for example, /dev/hda7) and a mount point (for example, /mnt/share). Each device must have a unique device special le and a unique mount point within the cluster. 4. Specify a Samba Share Name for the device if it is intended to be a Samba export directory. If a Samba Share Name has been entered, when the user selects File => Save, a /etc/samba/smb.conf.sharename le is created (where sharename is the name of the Samba share), and will be used by Samba when the cluster starts the service. For each Samba share you create, an /etc/samba/smb.conf.sharename is created. Copy all of these les to the other cluster members before initializing the cluster service on those members. For more information about conguring a Samba share, refer to Section 6.6 Setting Up a Samba Service. 5. Specify a directory from which to mount the device in the Mount Point eld. This directory should not be listed in /etc/fstab as it is automatically mounted by the Red Hat Cluster Manager when the service is started. 6. Choose a le system type from the FS Type list. 7. Optionally specify Options for the device. If you leave the Options eld blank, the default mount options (rw,suid,dev,exec,auto,nouser,async) are used. Refer to the mount man page for a complete description of the options available for mount. 8. Check Force Unmount to force any application that has the specied le system mounted to be killed prior to disabling or relocating the service (when the application is running on the same member that is running the disabled or relocated service). 9. When nished, click OK. 10. Choose File => Save to save the change to the /etc/cluster.xml conguration le. Chapter 3. Cluster Conguration 53 3.11. Checking the Cluster Conguration To ensure that the cluster software has been correctly congured, use the following tools located in the /usr/sbin directory: ? Test the shared partitions and ensure that they are accessible. Invoke the /usr/sbin/shutil utility with the -v option to test the accessibility of the shared partitions. See Section 3.11.1 Testing the Shared Partitions for more information. ? Test the operation of the power switches. If power switches are used in the cluster hardware conguration, run the clufence command on each member to ensure that it can remotely power-cycle the other member. Do not run this command while the cluster software is running. See Section 3.11.2 Testing the Power Switches for more information. ? Ensure that all members are running the same software version. Invoke the rpm -q clumanager command and rpm -q redhat-config-cluster command on each member to display the revision of the installed cluster software RPMs. The following section explains the cluster utilities in further detail. 3.11.1. Testing the Shared Partitions The shared partitions must refer to the same physical device on all members. Invoke the /usr/sbin/shutil utility with the -v command to test the shared partitions and verify that they are accessible. If the command succeeds, run the /usr/sbin/shutil -p /cluster/header command on all members to display a summary of the header data structure for the shared partitions. If the output is different on the members, the shared partitions do not point to the same devices on all members. Check to make sure that the raw devices exist and are correctly specied in the /etc/sysconfig/rawdevices le. See Section 2.4.4.3 Conguring Shared Cluster Partitions for more information. The following example shows that the shared partitions refer to the same physical device on cluster members clu1.example.com and clu2.example.com via the /usr/sbin/shutil -p /cluster/header command: /cluster/header is 140 bytes long SharedStateHeader { ss_magic = 0x39119fcd ss_timestamp = 0x000000003ecbc215 (14:14:45 May 21 2003) ss_updateHost = clu1.example.com All elds in the output from the /usr/sbin/shutil -p /cluster/header command should be the same when run on all cluster members. If the output is not the same on all members, perform the following: ? Examine the /etc/sysconfig/rawdevices le on each member and ensure that the raw character devices and block devices for the primary and backup shared partitions have been accurately specied. If they are not the same, edit the le and correct any mistakes. Then re-run the Cluster Conguration Tool. See Section 3.5 Editing the rawdevices File for more information. ? Ensure that you have created the raw devices for the shared partitions on each member. See Section 2.4.4.3 Conguring Shared Cluster Partitions for more information. ? To determine the bus conguration on each member, examine the system startup messages by running dmesg |less to the point where the system probes the SCSI subsystem. Verify that all members identify the same shared storage devices and assign them the same name. 54 Chapter 3. Cluster Conguration ? Verify that a member is not attempting to mount a le system on the shared partition. For example, make sure that the actual device (for example, /dev/sdb1) is not included in an /etc/fstab le. After performing these tasks, re-run the /usr/sbin/shutil utility with the -p option. 3.11.2. Testing the Power Switches If either network-attached or serial-attached power switches are employed in the cluster hardware con- guration, install the cluster software and invoke the clufence command to test the power switches. Invoke the command on each member to ensure that it can remotely power-cycle the other member. If testing is successful, then the cluster can be started. The clufence command can accurately test a power switch only if the cluster software is not running. This is due to the fact that for serial attached switches, only one program at a time can access the serial port that connects a power switch to a member. When the clufence command is invoked, it checks the status of the cluster software. If the cluster software is running, the command exits with a message to stop the cluster software. The clufence command line options are as follows: ? -d ? Turn on debugging ? -f ? Fence (power off) member ? -u ? Unfence (power on) member ? -r ? Reboot (power cycle) member ? -s ? Check status of all switches controlling member When testing power switches, the rst step is to ensure that each cluster member can successfully communicate with its attached power switch. The following output of the clufence command shows that the cluster member is able to communicate with its power switch: [27734] info: STONITH: rps10 at /dev/ttyS0, port 0 controls clumember1.example.com [27734] info: STONITH: rps10 at /dev/ttyS0, port 1 controls clumember2.example.com In the event of an error in the clufence output, check the following: ? For serial attached power switches: ? Verify that the device special le for the remote power switch connection serial port (for example, /dev/ttyS0) is specied correctly in the cluster conguration le; in the Cluster Conguration Tool, display the Power Controller dialog box to check the serial port value. If necessary, use a terminal emulation package such as minicom to test if the cluster member can access the serial port. ? Ensure that a non-cluster program (for example, a getty program) is not using the serial port for the remote power switch connection. You can use the lsof command to perform this task. ? Check that the cable connection to the remote power switch is correct. Verify that the correct type of cable is used (for example, an RPS-10 power switch requires a null modem cable), and that all connections are securely fastened. ? Verify that any physical dip switches or rotary switches on the power switch are set properly. ? For network based power switches: ? Verify that the network connection to network-based power switches is operational. Most switches have a link light that indicates connectivity. Chapter 3. Cluster Conguration 55 ? It should be possible to ping the network power switch; if not, then the switch may not be properly congured for its network parameters. ? Verify that the correct password and login name (depending on switch type) have been specied in the cluster conguration le (as established by running the Cluster Conguration Tool and viewing the properties specied in the Power Controller dialog box). A useful diagnostic approach is to verify Telnet access to the network switch using the same parameters as specied in the cluster conguration. After successfully verifying communication with the switch, attempt to power cycle the other cluster member. Prior to doing this, we recommend you verify that the other cluster member is not actively performing any important functions (such as serving cluster services to active clients). Running the command clufence -f clumember2.example.com displays the following output upon a successful shutdown and fencing operation (which means that the system does not receive power from the power switch until the system has been unfenced): [7397] info: STONITH: rps10 at /dev/ttyS0, port 0 controls clumember1.example.com [7397] info: STONITH: rps10 at /dev/ttyS0, port 1 controls clumember2.example.com [7397] notice: STONITH: clumember2.example.com has been fenced! 3.11.3. Displaying the Cluster Software Version Ensure that all members in the cluster are running the same version of the Red Hat Cluster Manager software. To display the version of the Cluster Conguration Tool and the Cluster Status Tool, use either of the following methods: ? Choose Help => About. The About dialog includes the version numbers. ? Invoke the following commands: rpm -q redhat-config-cluster rpm -q clumanager ? The version of the clumanager package can also be determined by invoking the clustat -v command. 3.12. Conguring syslogd Event Logging It is possible to edit the /etc/syslog.conf le to enable the cluster to log events to a le that is different from the /var/log/messages log le. Logging cluster messages to a separate le helps to diagnose problems more clearly. The members use the syslogd daemon to log cluster-related events to a le, as specied in the /etc/syslog.conf le. The log le facilitates diagnosis of problems in the cluster. It is recommended to set up event logging so that the syslogd daemon logs cluster messages only from the member on which it is running. Therefore, you need to examine the log les on all members to get a comprehensive view of the cluster. The syslogd daemon logs messages from the cluster daemons, which all default to severity level 4 (warning). Refer to Section 3.6 Conguring Cluster Daemons for more information on cluster daemons. The importance of an event determines the severity level of the log entry. Important events should be investigated before they affect cluster availability. The cluster can log messages with the following severity levels, listed in order of severity level: 56 Chapter 3. Cluster Conguration ? EMERG ?The member is unusable (emergency). ? ALERT ?Action must be taken immediately to address the problem. ? CRIT ?A critical condition has occurred. ? ERR ?An error has occurred. ? WARN ?A signicant event that may require attention has occurred. ? NOTICE ?A signicant, but normal, event has occurred. ? INFO ?An insignicant, but normal, cluster operation has occurred. ? DEBUG ?Diagnostic output detailing cluster operations (typically not of interest to administrators.) Examples of log le entries are as follows: Jul 18 20:24:39 clu1 clufence[7397]:  info  STONITH: rps10 at /dev/ttyS0,\ port 0 controls clu1 Jul 18 20:24:39 clu1 clufence[7397]:  info  STONITH: rps10 at /dev/ttyS0,\ port 1 controls clu2 Jul 18 20:24:53 clu1 clufence[7397]: Port 0 being turned off. Jul 18 20:24:53 clu1 clufence[7397]:  notice  STONITH: clu2 has been fenced! Jul 18 20:51:03 clu1 clufence[30780]:  info  STONITH: rps10 at/dev/ttyS0,\ port 0 controls clu1 Jul 18 20:51:03 clu1 clufence[30780]:  info  STONITH: rps10 at /dev/ttyS0,\ port 1 controls clu2 Jul 18 20:51:17 clu1 clufence[30780]: Port 0 being turned on. Jul 18 20:51:17 clu1 clufence[30780]:  notice  STONITH: clu2 is no longer fenced off. [1] [2] [3] [4] [5] Each entry in the log le contains the following information: ? [1] Date and time ? [2] Hostname ? [3] Cluster resource or daemon ? [4] Severity ? [5] Message After conguring the cluster software, optionally edit the /etc/syslog.conf le to enable the cluster to log events to a le that is different from the default log le, /var/log/messages. The cluster utilities and daemons log their messages using a syslog tag called local4. Using a cluster-specic log le facilitates cluster monitoring and problem solving. To prevent cluster events from being logged to the /var/log/messages le, add local4.none to the following line in the /etc/syslog.conf le: # Log anything (except mail) of level info or higher. # Don't log private authentication messages! *.info;mail.none;news.none;authpriv.none;local4.none /var/log/messages To direct the cluster logging facility to log cluster events in the /var/log/cluster le, add lines similar to the following to the /etc/syslog.conf le: # # Cluster messages coming in on local4 go to /var/log/cluster # local4.* /var/log/cluster To apply the previous changes, restart syslogd with the service syslog restart command. Chapter 3. Cluster Conguration 57 In addition, it is possible to modify the severity level of the events that are logged by the individual cluster daemons; refer to Section 3.6 Conguring Cluster Daemons and the man page for syslog.conf for more information. To rotate the cluster log le according to the frequency specied in the /etc/logrotate.conf le (the default is weekly), add /var/log/cluster to the rst line of the /etc/logrotate.d/syslog le. For example: /var/log/messages /var/log/secure /var/log/maillog /var/log/spooler /var/log/boot.log /var/log/cron /var/log/cluster { sharedscripts postrotate /bin/kill -HUP `cat /var/run/syslogd.pid 2> /dev/null` 2> /dev/null || true endscript } 58 Chapter 3. Cluster Conguration Chapter 4. Service Administration The following sections describe how to display, enable, disable, modify, relocate, and delete a service, and how to handle services that fail to start. Examples of setting up specic types of services are provided. 4.1. Conguring a Service After setting up disk storage and installing applications to be managed by the cluster, you can congure services to manage these applications and resources by using the Cluster Conguration Tool. To congure a service, follow these steps: 1. If applicable, create a script to start, stop, and check the status of the application used in the service. Refer to Section 4.1.2 Creating Service Scripts for information. 2. Gather information about service resources and properties. Refer to Section 4.1.1 Gathering Service Information for information. 3. Set up the le systems or raw devices that the service uses. Refer to Section 4.1.3 Conguring Service Disk Storage for information. 4. Ensure that the application software can run on each member (in either the associated failover domain, if used, or in the cluster) and that the service script, if any, can start and stop the service application. Refer to Section 4.1.4 Verifying Application Software and Service Scripts for upgrade instructions. 5. If upgrading from an existing cluster, back up the /etc/cluster.conf le. Refer to Section 3.2 Installation Notes for Red Hat Enterprise Linux 2.1 Users for upgrade instructions. 6. Start the Cluster Conguration Tool and add services, specifying the information about the service resources and properties obtained in step 2. 7. Save your conguration. Saving the settings on one cluster member propogates to other cluster members automatically. For more information about adding a cluster service, refer to the following: ? Section 5.1 Setting Up an Oracle Service ? Section 5.3 Setting Up a MySQL Service ? Section 6.1 Setting Up an NFS Service ? Section 6.6 Setting Up a Samba Service ? Chapter 7 Setting Up Apache HTTP Server 4.1.1. Gathering Service Information Before conguring a service, gather all available information about the service resources and properties. In some cases, it is possible to specify multiple resources for a service (for example, multiple IP addresses and disk devices). The service properties and resources that you can specify using the Cluster Conguration Tool are described in Table 4-1. 60 Chapter 4. Service Administration Property Description Service name Each service must have a unique name. A service name can consist of one to 63 characters and must consist of a combination of letters (either uppercase or lowercase), integers, underscores, periods, and dashes (hyphens). A service name must begin with a letter or an underscore. Failover domain Identify the members on which to run the service by associating the service with an existing failover domain. When ordered failover is enabled, if the member on which the service is running fails, the service is automatically relocated to the next member on the ordered member list. (Order of preference is established by the sequence of member names in the failover domain list). Refer to Section 3.9 Conguring a Failover Domain for more information. Check interval Species the frequency (in seconds) that the member checks the health of the application associated with the service. For example, when you specify a nonzero check interval for an NFS or Samba service, Red Hat Cluster Manager veries that the necessary NFS or Samba daemons are running. For other types of services, Red Hat Cluster Manager checks the return status after calling the status clause of the application service script. By default, check interval is 0, indicating that service monitoring is disabled. User script If applicable, specify the full path name for the script that is used to start and stop the service. Refer to Section 4.1.2 Creating Service Scripts for more information. IP address One or more Internet protocol (IP) addresses may be assigned to a service. This IP address (sometimes called a "oating" IP address) is different from the IP address associated with the host name Ethernet interface for a member, because it is automatically relocated along with the service resources when failover occurs. If clients use this IP address to access the service, they do not know which member is running the service, and failover is transparent to the clients. Note that cluster members must have network interface cards congured in the IP subnet of each IP address used in a service. Netmask and broadcast addresses for each IP address can also be specied; if they are not, then the cluster uses the netmask and broadcast addresses from the network interconnect in the subnet. Device special le Specify each shared disk partition used in a service. File system and sharing options If the service uses a le system, specify the type of le system, the mount point, and any mount options. You can specify any of the standard le system mount options as described in the mount(8) man page. It is not necessary to provide mount information for raw devices (if used in a service). ext2 and ext3 le systems are supported for a cluster. Specify whether to enable forced unmount for a le system. Forced unmount allows the cluster service management infrastructure to unmount a le system prior to relocation or failover, even if the le system is busy. This is accomplished by terminating any applications that are accessing the le system. You can also specify whether to export the le system via NFS set access permissions. Refer to Section 6.1 Setting Up an NFS Service for details. Specify whether or not to make the le system accessible to SMB clients via Samba by providing a Samba share name. Table 4-1. Service Property and Resource Information Chapter 4. Service Administration 61 4.1.2. Creating Service Scripts The cluster infrastructure starts and stops specied applications by running service-specic scripts. For both NFS and Samba services, the associated scripts are built into the cluster services infrastructure. Consequently, when running the Cluster Conguration Tool to congure NFS and Samba services, leave the User Script eld blank. For other application types it is necessary to designate a service script. For example, when conguring a database application, specify the fully-qualied pathname of the corresponding database start script. The format of the service scripts conforms to the conventions followed by the System V init scripts. This convention dictates that the scripts have a start, stop, and status clause. These scripts should return an exit status of 0 upon successful completion. When a service fails to start on one cluster member, Red Hat Cluster Manager will attempt to start the service on other cluster members. If the other cluster members fail to start the service, Red Hat Cluster Manager attempts to stop the service on all members. If it fails to stop the service for any reason, the cluster infrastructure will place the service in the Failed state. Administrators must then start the Cluster Status Tool, highlight the failed service, and click Disable before they can enable the service. In addition to performing the stop and start functions, the script is also used for monitoring the status of an application service. This is performed by calling the status clause of the service script. To enable service monitoring, specify a nonzero value in the Check Interval eld when specifying service properties with the Cluster Conguration Tool. If a nonzero exit is returned by a status check request to the service script, then the cluster infrastructure rst attempts to restart the application on the member it was previously running on. Status functions do not have to be fully implemented in service scripts. If no real monitoring is performed by the script, then a stub status clause should be present which returns success. The operations performed within the status clause of an application can be tailored to best meet the application's needs as well as site-specic parameters. For example, a simple status check for a database would consist of verifying that the database process is still running. A more comprehensive check would consist of a database table query. The /usr/share/cluster/doc/services/examples/ directory contains a template that can be used to create service scripts, in addition to examples of scripts. Refer to Section 5.1 Setting Up an Oracle Service, Section 5.3 Setting Up a MySQL Service, Chapter 7 Setting Up Apache HTTP Server, for sample scripts. 4.1.3. Conguring Service Disk Storage Prior to creating a service, set up the shared le systems and raw devices that the service is to use. Refer to Section 2.4.4 Conguring Shared Disk Storage for more information. If employing raw devices in a cluster service, it is possible to use the /etc/sysconfig/rawdevices le to bind the devices at boot time. Edit the le and specify the raw character devices and block devices that are to be bound each time the member boots. Refer to Section 3.5 Editing the rawdevices File for more information. Note that software RAID and host-based RAID cannot be used for shared disk storage. Only certied SCSI adapter-based RAID cards can be used for shared disk storage. You should adhere to the following service disk storage recommendations: ? For optimal performance, use a 4 KB block size when creating le systems. Note that some of the mkfs le system build utilities default to a 1 KB block size, which can cause long fsck times. ? To facilitate quicker failover times, it is recommended that the ext3 le system be used. Refer to Section 2.4.4.6 Creating File Systems for more information. 62 Chapter 4. Service Administration ? For large le systems, use the nocheck option to bypass code that checks all the block groups on the partition. Specifying the nocheck option can signicantly decrease the time required to mount a large le system. 4.1.4. Verifying Application Software and Service Scripts Prior to setting up a service, install any applications that are used in the service on each member in the cluster (or each member in the failover domain, if used). After installing the application on these members, verify that the application runs and can access shared disk storage. To prevent data corruption, do not run the application simultaneously on more than one member. If using a script to start and stop the service application, install and test the script on all cluster members, and verify that it can be used to start and stop the application. Refer to Section 4.1.2 Creating Service Scripts for more information. 4.2. Displaying a Service Conguration In the Cluster Conguration Tool, the following information for a given  service  element is viewable directly on the Services tab (when the service conguration is fully expanded): Figure 4-1. The Services Tab ? Service name ? Device special le name ? Service IP address ? NFS export and clients (if specied) for the device You can display the failover domain, check interval, and user script name for a  service  by accessing its properties dialog box. Chapter 4. Service Administration 63 The Samba share name, mount point, le system type, and mount options for a service are displayed as the properties for a  device  . You can view the permissions for NFS client access by displaying  client  properties. To display the properties for an element in the cluster conguration, use any of the following methods: ? Double-click the element name. ? Select the element name and click Properties. ? Select the element name and choose File => Properties. For instructions on conguring a service using the Cluster Conguration Tool, refer to Section 3.10 Adding a Service to the Cluster. To display cluster service status, refer to Section 8.2 Displaying Cluster and Service Status. 4.3. Disabling a Service A running service can be disabled to stop the service and make it unavailable. Once disabled, a service can then be re-enabled. Refer to Section 4.4 Enabling a Service for information. There are several situations in which a running service may need to be disabled: ? To modify a service?A running service must be disabled before it can be modied. Refer to Section 4.5 Modifying a Service for more information. ? To temporarily stop a service?A running service can be disabled, making it unavailable to clients without having to completely delete the service. To disable a running service, invoke the redhat-config-cluster command to display the Cluster Status Tool. In the Services tab, select the service you want to disable and click Disable. 4.4. Enabling a Service When you enable a disabled service, that service is started and becomes available on the specied cluster member. (You can start the service on another member by dragging the service icon into the Members tab of the Cluster Status Tool and dropping the icon on the member icon .) To enable a stopped service, invoke the redhat-config-cluster command to display the Cluster Conguration Tool. In the Services tab, select the service you want to enable and click Enable. 4.5. Modifying a Service You can modify the properties for a service any time after creating the service. For example, you can change the IP address or location of the user script. You can also add more resources (for example, additional le systems or IP addresses) to an existing service. Refer to Section 4.1.1 Gathering Service Information for information. Using the Cluster Status Tool, rst disable, or stop, the service before modifying its properties. If an administrator attempts to modify a running service, a warning dialog displays prompting to stop the service. Refer to Section 4.3 Disabling a Service for instructions on disabling a service. Because a service is unavailable while being modied, be sure to gather all the necessary service information before disabling it to minimize service down time. In addition, back up the cluster conguration le (/etc/cluster.xml) before modifying a service; refer to Section 8.5 Backing Up and Restoring the Cluster Database. 64 Chapter 4. Service Administration Use the Cluster Conguration Tool to modify the service properties, then save the conguration by choosing File => Save. After the modications to the service have been saved, you can restart the service using the Cluster Status Tool to enable the modications. 4.6. Relocating a Service In addition to providing automatic service failover, a cluster enables you to cleanly stop a service on one member and restart it on another member. This service relocation functionality allows you to perform maintenance on a cluster member while maintaining application and data availability. To relocate a service by using the Cluster Status Tool, drag the service icon from the Services tab onto the member icon in the Members tab. The Red Hat Cluster Manager stops the service on the member on which it was running and restarts it on the new member. 4.7. Deleting a Service A cluster service can be deleted. Note that the cluster conguration le should be backed up before deleting a service. Refer to Section 8.5 Backing Up and Restoring the Cluster Database for information. To delete a service from the cluster, follow these steps: 1. If the service is running, disable the service using the Cluster Status Tool (refer to Section 4.3 Disabling a Service). 2. On the Services tab in the Cluster Conguration Tool, select the name of the service you want to remove and click Delete. 3. You are prompted to conrm the deletion. To remove the service, click OK. 4. Choose File => Save to save the change to the /etc/cluster.xml conguration le. 4.8. Handling Failed Services The cluster puts a service into the Failed state if it is unable to successfully start the service across all members and then cannot cleanly stop the service. A Failed state can be caused by various problems, such as a misconguration as the service is running or a service hang or crash. The Cluster Status Tool displays the service as being Failed. Chapter 4. Service Administration 65 Figure 4-2. Service in Failed State Note You must disable a Failed service before you can modify or re-enable the service. Be sure to carefully handle failed services. If service resources are still congured on the owner member, starting the service on another member may cause signicant problems. For example, if a le system remains mounted on the owner member, and you start the service on another member, the le system is mounted on both members, which can cause data corruption. If the enable fails, the service remains in the Disabled state. After highlighting the service and clicking Disable, you can attempt to correct the problem that caused the Failed state. After you modify the service, the cluster software enables the service on the owner member, if possible; otherwise, the service remains in the Disabled state. The following list details steps to follow in the event of service failure: 1. Modify cluster event logging to log debugging messages. Viewing the logs can help determine problem areas. Refer to Section 8.6 Modifying Cluster Event Logging for more information. 2. Use the Cluster Status Tool to attempt to enable or disable the service on one of the cluster or failover domain members. Refer to Section 4.3 Disabling a Service and Section 4.4 Enabling a Service for more information. 3. If the service does not start or stop on the member, examine the /var/log/messages and (if congured to log separately) /var/log/cluster log les, and diagnose and correct the problem. You may need to modify the service to x incorrect information in the cluster conguration le (for example, an incorrect start script), or you may need to perform manual tasks on the owner member (for example, unmounting le systems). 4. Repeat the attempt to enable or disable the service on the member. If repeated attempts fail to correct the problem and enable or disable the service, reboot the member. 5. If still unable to successfully start the service, verify that the service can be manually restarted outside of the cluster framework. For example, this may include manually mounting the le systems and manually running the service start script. 66 Chapter 4. Service Administration Chapter 5. Database Services This chapter contains instructions for conguring Red Hat Enterprise Linux to make database services highly available. Note The following descriptions present example database conguration instructions. Be aware that differences may exist in newer versions of each database product. Consequently, this information may not be directly applicable. 5.1. Setting Up an Oracle Service A database service can serve highly-available data to a database application. The application can then provide network access to database client systems, such as Web servers. If the service fails over, the application accesses the shared database data through the new cluster system. A network-accessible database service is usually assigned an IP address, which is failed over along with the service to maintain transparent access for clients. This section provides an example of setting up a cluster service for an Oracle database. Although the variables used in the service scripts depend on the specic Oracle conguration, the example may aid in setting up a service for individual environments. Refer to Section 5.2 Tuning Oracle Service for information about improving service performance. In the example that follows: ? The service includes one IP address for the Oracle clients to use. ? The service has two mounted le systems, one for the Oracle software (/u01/) and the other for the Oracle database (/u02/), which are set up before the service is added. ? An Oracle administration account with the name oracle is created on the cluster systems that run the service before the service are actually added. ? The administration directory is on a shared disk that is used in conjunction with the Oracle service (for example, /u01/app/oracle/admin/db1). Create a consistent user/group conguration that can properly access Oracle service for each cluster system. For example: mkdir /users groupadd -g 900 dba groupadd -g 901 oinstall useradd -u 901 -g 901 -d /users/oracle -m oracle usermod -G 900 oracle The Oracle service example uses three scripts that must be placed in /users/oracle and owned by the Oracle administration account. The oracle script is used to start and stop the Oracle service. Specify this script when you add the service. This script calls the other Oracle example scripts. The startdb and stopdb scripts start and stop the database. Note that there are many ways for an application to interact with an Oracle database. 68 Chapter 5. Database Services The following is an example of the oracle script, which is used to start, stop, and check the status of the Oracle service. #!/bin/sh # # Cluster service script to start, stop, and check status of oracle # cd /users/oracle case $1 in start) su - oracle -c ./startdb ;; stop) su - oracle -c ./stopdb ;; status) status oracle ;; esac The following is an example of the startdb script, which is used to start the Oracle Database Server instance: #!/bin/sh # # # Script to start the Oracle Database Server instance. # ######################################################################## # # ORACLE_RELEASE # # Specifies the Oracle product release. # ######################################################################## ORACLE_RELEASE=9.2.0 ######################################################################## # # ORACLE_SID # # Specifies the Oracle system identifier or "sid", which is the name of # the Oracle Server instance. # ######################################################################## export ORACLE_SID=TEST ######################################################################## # # ORACLE_BASE # # Specifies the directory at the top of the Oracle software product and # administrative file structure. # ######################################################################## export ORACLE_BASE=/u01/app/oracle Chapter 5. Database Services 69 ######################################################################## # # ORACLE_HOME # # Specifies the directory containing the software for a given release. # The Oracle recommended value is $ORACLE_BASE/product/  release #  ######################################################################## export ORACLE_HOME=/u01/app/oracle/product/${ORACLE_RELEASE} ######################################################################## # # LD_LIBRARY_PATH # # Required when using Oracle products that use shared libraries. # ######################################################################## export LD_LIBRARY_PATH=${ORACLE_HOME}/lib:$LD_LIBRARY_PATH ######################################################################## # # PATH # # Verify that the users search path includes $ORACLE_HOME/bin # ######################################################################## export PATH=$PATH:${ORACLE_HOME}/bin ######################################################################## # # This does the actual work. # # Start the Oracle Server instance based on the initSID.ora # initialization parameters file specified. # ######################################################################## /u01/app/oracle/product/9.2.0/bin/sqlplus << EOF sys as sysdba spool /home/oracle/startdb.log startup pfile = /u01/app/oracle/product/9.2.0/admin/test/scripts/init.ora open; spool off quit; EOF exit The following is an example of the stopdb script, which is used to stop the Oracle Database Server instance: #!/bin/sh # # # Script to STOP the Oracle Database Server instance. # ###################################################################### # # ORACLE_RELEASE 70 Chapter 5. Database Services # # Specifies the Oracle product release. # ###################################################################### ORACLE_RELEASE=9.2.0 ###################################################################### # # ORACLE_SID # # Specifies the Oracle system identifier or "sid", which is the name # of the Oracle Server instance. # ###################################################################### export ORACLE_SID=TEST ###################################################################### # # ORACLE_BASE # # Specifies the directory at the top of the Oracle software product # and administrative file structure. # ###################################################################### export ORACLE_BASE=/u01/app/oracle ###################################################################### # # ORACLE_HOME # # Specifies the directory containing the software for a given release. # The Oracle recommended value is $ORACLE_BASE/product/  release #  ###################################################################### export ORACLE_HOME=/u01/app/oracle/product/${ORACLE_RELEASE} ###################################################################### # # LD_LIBRARY_PATH # # Required when using Oracle products that use shared libraries. # ###################################################################### export LD_LIBRARY_PATH=${ORACLE_HOME}/lib:$LD_LIBRARY_PATH ###################################################################### # # PATH # # Verify that the users search path includes $ORACLE_HOME/bin # ###################################################################### export PATH=$PATH:${ORACLE_HOME}/bin ###################################################################### # Chapter 5. Database Services 71 # This does the actual work. # # STOP the Oracle Server instance in a tidy fashion. # ###################################################################### /u01/app/oracle/product/9.2.0/bin/sqlplus << EOF sys as sysdba spool /home/oracle/stopdb.log shutdown abort; spool off quit; EOF exit 5.1.1. Oracle and the Cluster Conguration Tool To add an Oracle service using the Cluster Conguration Tool, perform the following: 1. Start the Cluster Conguration Tool by choosing Main Menu => System Settings => Server Settings => Cluster or by typing redhat-config-cluster at a shell prompt. The Cluster Status Tool appears by default. 2. Start the Cluster Conguration Tool by selecting Cluster => Congure from the Cluster Status Tool menus. 3. Click the Services tab. 4. Add the Oracle service. ? Click New. The Service dialog appears. Figure 5-1. Adding an Oracle Service ? Enter a Service Name for the Oracle service. ? Select a Failover Domain or leave it as None. ? Type a quantity (seconds) to check the health of the Oracle service through the status function of the init script. ? Enter a User Script, such as /home/oracle/oracle. ? Click OK 5. Add an IP address for the Oracle service. ? Select the Oracle service and click Add Child. 72 Chapter 5. Database Services ? Select Add Service IP Address and click OK. The Service IP Address dialog appears. ? Enter an IP Address. ? Enter a Netmask, or leave it None. ? Enter a Broadcast Address, or leave it None. ? Click OK. 6. Add a device for the Oracle service and administrative les. ? Select the Oracle service and click Add Child. ? Select Add Device and click OK. The Device dialog appears. ? Enter the Device Special File (for example, /dev/sdb5). ? In the Mount Point eld, enter /u01. ? Select the le system type in FS Type or leave it blank. ? Enter any mount point Options, including rw (read-write). ? Check or uncheck Force Unmount. ? Click OK. 7. Add a device for the Oracle database les. ? Select the Oracle service and click Add Child. ? Select Add Device and click OK. The Device dialog appears. ? Enter the Device Special File (for example, /dev/sdb6). ? In the Mount Point eld, enter /u02. ? Select the le system type in FS Type or leave it blank. ? Enter any mount point Options, including rw (read-write). ? Check or uncheck Force Unmount. ? Click OK. 8. Choose File => Save to save the Oracle service. 5.2. Tuning Oracle Service The Oracle database recovery time after a failover is directly proportional to the number of outstanding transactions and the size of the database. The following parameters control database recovery time: ? LOG_CHECKPOINT_TIMEOUT ? LOG_CHECKPOINT_INTERVAL ? FAST_START_IO_TARGET ? REDO_LOG_FILE_SIZES To minimize recovery time, set the previous parameters to relatively low values. Note that excessively low values adversely impact performance. Try different values to nd the optimal value. Chapter 5. Database Services 73 Oracle provides additional tuning parameters that control the number of database transaction retries and the retry delay time. Be sure that these values are large enough to accommodate the failover time in the cluster environment. This ensures that failover is transparent to database client application programs and does not require programs to reconnect. 5.3. Setting Up a MySQL Service A database service can serve highly-available data to a MySQL database application. The application can then provide network access to database client systems, such as Web servers. If the service fails over, the application accesses the shared database data through the new cluster system. A networkaccessible database service is usually assigned one IP address, which is failed over along with the service to maintain transparent access for clients. An example of a conguring a MySQL database service is as follows: ? The MySQL server packages are installed on each cluster system that will run the service. The MySQL database directory resides on a le system that is located on a disk partition on shared storage. This allows the database data to be accessed by all cluster members. In the example, the le system is mounted as /var/lib/mysql, using the shared disk partition /dev/sda1. ? An IP address is associated with the MySQL service to accommodate network access by clients of the database service. This IP address is automatically migrated among the cluster members as the service fails over. In the example below, the IP address is 10.1.16.12. ? The script that is used to start and stop the MySQL database is the standard init script mysqld. If general logging of connections and queries is needed, edit the mysqld script to add the option --log=/var/log/mysqld.log as the last option to the safe_mysqld command. The resulting line should appear similar to the following (Note: the forward slash (\) denotes the continuation of one line): /usr/bin/safe_mysqld --defaults-file=/etc/my.cnf --log=/var/log/mysqld.log \ >/dev/null 2>&1 & If the --log option is added to the mysqld script, the new mysqld script should be copied to the other cluster members that can run the MySQL service, so that they can log connections and queries if the MySQL service fails over to those members. ? by default, a client connection to a MySQL server times out after eight hours of inactivity. This connection limit can be modied by setting the wait_timeout variable in the /etc/my.cnf le. For example, to set timeouts to four hours, add the following line to the [mysqld] section of /etc/my.cnf: set-variable = wait_timeout=14400 Restart the MySQL service. Note that after this change is made, the new /etc/my.cnf le should be copied to all the other cluster members that can run the MySQL service. To check if a MySQL server has timed out, invoke the mysqladmin command and examine the uptime. If it has timed out, invoke the query again to automatically reconnect to the server. Depending on the Linux distribution, one of the following messages may indicate a MySQL server timeout: CR_SERVER_GONE_ERROR CR_SERVER_LOST 5.3.1. MySQL and the Cluster Conguration Tool To add a MySQL service using the Cluster Conguration Tool, perform the following: 74 Chapter 5. Database Services 1. Start the Cluster Conguration Tool by choosing Main Menu => System Settings => Server Settings => Cluster or by typing redhat-config-cluster at a shell prompt. The Cluster Status Tool appears by default. 2. Start the Cluster Conguration Tool by selecting Cluster => Congure from the Cluster Status Tool menus. 3. Click the Services tab. 4. Add the MySQL service. ? Click New. The Service dialog appears. Figure 5-2. Adding a Service ? Enter a Service Name for the MySQL service. ? Select a Failover Domain or leave it as None. ? Type a quantity (seconds) in the Check Interval box if you want to check the health of the MySQL service through the status directive of the mysqld init script. ? Enter a User Script, such as /etc/init.d/mysqld. ? Click OK. 5. Add an IP address for the MySQL service. ? Select the MySQL service and click Add Child. ? Select Add Service IP Address and click OK. The Service IP Address dialog appears. ? Enter an IP Address. ? Enter a Netmask, or leave it None. ? Enter a Broadcast Address, or leave it None. ? Click OK. 6. Add a device for the MySQL service. ? Select the MySQL service and click Add Child. ? Select Add Device and click OK. The Device dialog appears. ? Enter the Device Special File (for example, /dev/sdc3). ? In the Mount Point eld, enter /var/lib/mysql. ? Select the le system type in FS Type or leave it blank. ? Enter any mount point Options, including rw (read-write). ? Check or uncheck Force Unmount. Chapter 5. Database Services 75 ? Click OK. 7. Choose File => Save to save the MySQL service. 76 Chapter 5. Database Services Chapter 6. Network File Sharing Services This chapter contains instructions for conguring Red Hat Enterprise Linux to make network le sharing services through NFS and Samba highly available. 6.1. Setting Up an NFS Service A highly-available network le system (NFS) is one of the key strengths of the clustering infrastructure. Advantages of clustered NFS services include: ? Ensures that NFS clients maintain uninterrupted access to key data in the event of server failure. ? Facilitates planned maintenance by allowing transparent relocation of NFS services to one cluster member, allowing you to x or upgrade the other cluster member. ? Allows setup of an active-active conguration to maximize equipment utilization. Refer to Section 6.5 NFS Conguration: Active-Active Example for more information. 6.1.1. NFS Server Requirements To create highly available NFS services, there are a few requirements which must be met by each cluster member. (Note: these requirements do not pertain to NFS client systems.) These requirements are as follows: ? The NFS daemon must be running on all cluster servers. Check the status of the servers by running the following: /sbin/service nfs status NFS services will not start unless the following NFS daemons are running: nfsd, rpc.mountd, and rpc.statd. If the service is not running, start it with the following commands: /sbin/service portmap start /sbin/service nfs start To make NFS start upon reboot and when changing runlevels, run the following command: /sbin/chkconfig --level 345 nfs on ? The RPC portmap daemon must also be enabled with the following command: /sbin/chkconfig --level 345 portmap on ? File system mounts and their associated exports for clustered NFS services should not be included in /etc/fstab or /etc/exports. Rather, for clustered NFS services, the parameters describing mounts and exports are entered via the Cluster Conguration Tool. For your convenience, the tool provides a Bulk Load NFS feature to import entries from an existing le into the cluster conguration le. ? NFS cannot be congured to run over TCP with Red Hat Cluster Manager. For proper failover capabilities, NFS must run over the default UDP. For detailed information about setting up an NFS server, refer to the Red Hat Enterprise Linux System Administration Guide. 78 Chapter 6. Network File Sharing Services 6.2. Using the NFS Druid This section describes how to use the NFS Druid to quickly congure an NFS share for client access. 1. Start the Cluster Status Tool. Verify that the cluster daemons are running; if not, choose Cluster => Start Cluster Service to start the cluster daemons. 2. In the Cluster Status Tool, choose Cluster => Congure to display the Cluster Conguration Tool. 3. Start the NFS Druid by choosing Add Exports => NFS... and click Forward to continue. Figure 6-1. NFS Druid 4. Enter the Export Directory ? Specied as a child of a device, the export directory can be the same as the mount point. In this case, the entire le system is accessible through NFS. Alternatively, you can specify a portion (subdirectory) of a mounted le system to be mounted (instead of the entire le system). By exporting subdirectories of a mountpoint, different access rights can be allocated to different sets of NFS clients. Enter the Client Name?Specied as a child of an export directory, the NFS client specication identies which systems will be allowed to access the le system as NFS clients. You can specify individual systems (for example, fred) or groups of systems by using wildcards (for example, *.example.com). Entering an asterisk (*) in the Client Name eld allows any client to mount the le system. Enter any Client Options in the provided elds ? Specied as part of the NFS Export Client information, this eld denes the access rights afforded to the corresponding client(s). Examples include ro (read only), and rw (read write). Unless explicitly specied otherwise, the default export options are ro,async,wdelay, root_squash. Refer to the exports(5) manpage for more options. Chapter 6. Network File Sharing Services 79 Figure 6-2. Export and Client Options 5. If an existing service contains the device and mountpoint conguration for the directory you want to NFS export, then select that existing service. Otherwise, enter a new Service Name and Service IP Address for the NFS export directory. Service Name ? A name used to uniquely identify this service within the cluster (such as nfs_cluster or marketing.) Service IP Address?NFS clients access le systems from an NFS server which is designated by its IP address (or associated hostname). To keep NFS clients from knowing which specic cluster member is the acting NFS server, the client systems should not use the cluster member's hostname as the IP address from which a service is started. Rather, clustered NFS services are assigned oating IP addresses which are distinct from the cluster server's IP addresses. This oating IP address is then congured on whichever cluster member is actively serving the NFS export. Following this approach, the NFS clients are only aware of the oating IP address and are unaware of the fact that clustered NFS server has been deployed. Figure 6-3. Select Service for Export 6. For non-clustered le systems, the mount information is typically placed in /etc/fstab. However, clustered le systems must not be placed in /etc/fstab. This is necessary to ensure that 80 Chapter 6. Network File Sharing Services only one cluster member at a time has the le system mounted. Failure to do so will likely result in le system corruption and system crashes. If you selected an existing service, then devices for that service will be listed under Existing Device and Mountpoint. If the device and mount point for your NFS export is listed, then select it. Otherwise, select New Device and use the elds to edit the following settings. Device Special File ? Designates the disk or partition on shared storage. Device Mountpoint ? Species the directory on which the le system will be mounted. An NFS service can include more than one le system mount. In this manner, the le systems will be grouped together as a single failover unit. Figure 6-4. Select Device for Export 7. At the end of the NFS Druid, click Apply to create the service. Save the conguration by choosing File => Save from the Cluster Conguration Tool. To modify your NFS service conguration, click the Services tab in the Cluster Conguration Tool and click the triangular icon next to the NFS service to display the full child tree for the service. Double-click each child to modify options. 1. Highlight the  service  and click Properties to congure the the following options: Chapter 6. Network File Sharing Services 81 Figure 6-5. Services in the Cluster Conguration Tool ? Service Name ? A name used to uniquely identify this service within the cluster (such as nfs_cluster or marketing. ? Failover Domain?An optional property that species a subset (or ordered subset) of cluster members which are eligible to run the service in the event of a failover. You must create the failover domain before you can reference it in an NFS service conguration; see Section 3.9 Conguring a Failover Domain for more information. ? Check Interval ? An optional property that species whether or not to check the status of the NFS daemons at a regular interval (in seconds). The default value is 0 seconds, meaning the daemon status is not checked. If the service returns an error or does not respond to the status check, the cluster attempts to cleanly shut down the service and start it on another member. If at any point it fails to cleanly shut down the NFS service, the cluster will place the service in a Failed state, requiring the administrator to disable the service rst before attempting to restart it. ? For the User Script, leave the eld as None, as the cluster infrastructure handles NFS service control and status checking. 2. Choose the  service ip address  child to change the Service IP Address and to enter a Netmask and Broadcast address, which are both set as None by default. If these elds are left as None, then the cluster infrastructure will use the netmask and broadcast IP address congured on the network device of the member running the service. 3. Choose the  device  child to modify the Device Special File, Device Mount Point, FS Type, and Mount Options. You can also check or uncheck the Force Unmount. When Forced Unmount is enabled, any applications that have the specied le system mounted will be killed prior to disabling or relocating the NFS service (assuming the application is running on the same member that is running the NFS service) 4. Choose the  nfsexport  child to specify a directory name for clients to mount the exported share. 82 Chapter 6. Network File Sharing Services 5. Choose the  client  child to enter Client Name, any hosts, groups, and domains that are allowed to mount the exported shares (default is * which allows any client to mount the share) and Options for allowed client mount options (such as rw for read-write or ro for read-only). 6.2.1. NFS Client Access The NFS usage model for clients is completely unchanged from its normal approach. For example, to mount the NFS share from clu1.example.com to the client's /mnt/users/ directory, run the following command: /bin/mount -t nfs clu1.example.com:/share /mnt/users To simplify the mounting of the NFS share for clients, place the following in the client's /etc/fstab le: clu1.example.com:/share /mnt/users nfs rw,rsize=8192,wsize=8192 0 0 For additional NFS mount options, refer to the Red Hat Enterprise Linux System Administration Guide. 6.3. NFS Caveats The following points should be taken into consideration when clustered NFS services are congured. Avoid using exportfs -r File systems being NFS exported by cluster members do not get specied in the conventional /etc/exports le. Rather, the NFS exports associated with cluster services are specied in the cluster conguration le (as established by the Cluster Conguration Tool). The command exportfs -r removes any exports which are not explicitly specied in the /etc/exports le. Running this command causes the clustered NFS services to become unavailable until the service is restarted. For this reason, it is recommended to avoid using the exportfs -r command on a cluster on which highly available NFS services are congured. To recover from unintended usage of exportfs -r, the NFS cluster service must be stopped and then restarted. NFS File Locking NFS le locks are not preserved across a failover or service relocation. This is due to the fact that the Linux NFS implementation stores le locking information in system les. These system les representing NFS locking state are not replicated across the cluster. The implication is that locks may be regranted subsequent to the failover operation. 6.4. Importing the Contents of an NFS Exports File The Bulk Load NFS feature allows you to import all the entries from an /etc/exports le without having to create the entries individually as children of a device. This can be convenient for administrators who are transitioning from traditional NFS server systems to a high-availability cluster. To use the Bulk Load NFS feature, follow these steps: 1. Stop all non-clustered NFS exports (for example, /sbin/service nfs stop). Chapter 6. Network File Sharing Services 83 2. Unmount the le systems on which the exports reside (for example, /bin/umount /mnt/nfs/, where /mnt/nfs is the directory on which the NFS exported partitions are mounted). 3. Start the Cluster Conguration Tool. 4. On the Services tab, select the device(s) to be loaded by the /etc/exports le. 5. Choose File => Bulk Load NFS. 6. Select the exports le that you want to load into the selected device (by default, the Bulk Load NFS dialog attempts to load /etc/exports), and click OK. All entries in the le specied in the Bulk Load NFS dialog box are subjected to the same validation as when an NFS export directory is manually specied. Any entries in the imported le that fail validation are displayed in a message box. Only entries that pass validation are loaded into the selected device. 7. If the bulk load successfully completed, you must then remove all exports from the /etc/exports le so that it does not affect the cluster NFS services. 8. Choose File => Save to save the change to the /etc/cluster.xml conguration le. 6.5. NFS Conguration: Active-Active Example Section 6.2 Using the NFS Druid described how to congure a simple NFS service using the NFS Druid. This section shows how to congure a second NFS service on another running cluster member. The second service has its own separate IP address and failover domain. This cluster conguration, called an active-active conguration, allows multiple cluster members to simultaneously export le systems. This most effectively utilizes the capacity of cluster. In the event of a failure (or planned maintenance) on any cluster member running NFS services, those services will failover to the active cluster member. For this example, individual subdirectories of the mounted le system will be made accessible on a read-write (rw) basis by three members of a department. The names of the systems used by these three team members are ferris, denham, and brown. To make this example more illustrative, notice that each team member will only be able to NFS mount their specic subdirectory, which has already been created for them and over which they have user and group permissions. Use the Cluster Conguration Tool as follows to congure this example NFS service: 1. Verify that the cluster daemons are running in the Cluster Status Tool; if not, choose Cluster => Start Local Cluster Daemons. 2. Choose Cluster => Congure to display the Cluster Conguration Tool. 3. Choose the Services tab and, if services have already been dened, select one and click New. (If no services are dened, just click New.) a. Specify nfs_engineering in the Service Name eld. This name was chosen as a reminder of the service's intended function to provide exports to the members of the engineering department. b. In this example, assume a failover domain named clu3_domain was previously created using the Cluster Conguration Tool, consisting only of member clu3 with both Restricted failover to only these members and Ordered Failover unchecked. In this way, clu3 is designated as the preferred member for this service. (Note that the nfs_accounting service is assigned to failover domain clu4_domain.) Choose clu3_domain from the Failover Domain list. (For more information on failover domains, refer to Section 3.9 Conguring a Failover Domain.) 84 Chapter 6. Network File Sharing Services c. Specify a value of 30 in the Check Interval eld to specify that the status of the NFS daemons should be checked every 30 seconds. d. The cluster infrastructure includes support for NFS services. Consequently, there is no need to create or specify a value for User Script when conguring an NFS service. Accept the default value of None. e. Click OK to complete this portion of the service conguration. 4. In the Cluster Conguration Tool, select the service you just created, and click Add Child. On the Add Device or IP Address dialog box, choose Add Service IP Address and click OK. a. In the Service IP Address eld, enter 10.0.0.11. This example assumes a hostname of clunfseng is associated with this IP address, by which NFS clients mount the le system. Note that this IP address must be distinct from that of any cluster member. b. The default netmask address will be used, so accept the default of None. c. The default broadcast address will be used, so accept the default of None. d. Click OK to complete the service IP address conguration. 5. In the Cluster Conguration Tool, select the nfs_engineering service and click Add Child. On the Add Device or IP Address dialog box, choose Add Device and click OK. a. In the Device Special File eld, enter /dev/sdb11 which refers to the partition on the shared storage RAID box on which the le system will be physically stored. Leave the Samba Share Name eld blank. b. In the Mount Point eld, enter /mnt/users/engineering. c. From the FS Type menu, choose ext3. d. In the Options eld, enter rw,nosuid,sync. e. Leave the Force Unmount checkbox checked. f. Click OK to complete this portion of the device conguration. 6. In the Cluster Conguration Tool, select the device you just created, and click Add Child. Enter /mnt/users/engineering/ferrisin the NFS Export Directory Name eld and click OK. Repeat this step twice, adding NFS export directories named /mnt/users/engineering/denham and /mnt/users/engineering/brown. 7. In the Cluster Conguration Tool, select the NFS export for ferris and click Add Child. The NFS Export Client dialog box is displayed. In the Client Name eld, type ferris. In the Options eld, type rw. Click OK. 8. Repeat step 7 twice, specifying clients named denham and brown, respectively, each with the same permissions options (rw). 9. Save the service by selecting File => Save in the Cluster Conguration Tool. 10. Start the service from the Cluster Status Tool by highlighting the service and clicking Start. Chapter 6. Network File Sharing Services 85 6.6. Setting Up a Samba Service Highly available network le services are one of the key strengths of the clustering infrastructure. Advantages of high availibility Samba services include: ? Heterogeneous le serving capabilities to Microsoft® Windows? clients using the CIFS/SMB protocol. ? Allows the same set of le systems to be simultaneously network served to both NFS and Windows based clients. ? Ensures that Windows-based clients maintain access to key data, and can quickly reestablish connection in the event of server failure. ? Facilitates planned maintenance by allowing the transparent relocation of Samba services to one cluster member, enabling you to x or upgrade the other cluster member. ? Allows the setup of an active-active conguration to maximize equipment utilization. Note A complete explanation of Samba conguration is beyond the scope of this document. Rather, this documentation highlights aspects which are crucial for clustered operation. Refer to Red Hat Enterprise Linux System Administration Guide for more details on Samba conguration. 6.6.1. Samba Server Requirements If you intend to create highly available Samba services, each cluster member on which the services will run must meet the following requirements: ? The Samba RPM packages must be installed. Note that there have been no modications to the Samba RPMs to support high availability. ? The Samba daemons will be started and stopped by the cluster infrastructure on a per-service basis. Consequently, the Samba conguration information should not be specied in the conventional /etc/samba/smb.conf le. The Cluster Conguration Tool writes a smb.conf.sharename le to the /etc/samba/ directory for each Samba share (where sharename is the name you specied for the Samba share). ? The automated system startup of the Samba daemons smbd and nmbd should be disabled in init.d runlevels. For example: chkconfig --del smb. ? Since the cluster infrastructure stops the cluster-related Samba daemons appropriately, do not manually run the conventional Samba stop script (service smb stop) as this will terminate all clusterrelated samba daemons. ? File system mounts for clustered Samba services should not be included in /etc/fstab. Rather, for clustered services, the parameters describing mounts are entered via the Cluster Conguration Tool. ? Failover of Samba printer shares is not currently supported. 86 Chapter 6. Network File Sharing Services 6.6.2. Samba Operating Model This section provides background information describing the implementation model in support of Samba high availability services. Knowledge of this information will provide the context for understanding the conguration requirements of clustered Samba services. The conventional, non-clustered Samba conguration model consists of editing the /etc/samba/smb.conf le to designate which le systems are to be made network accessible to the specied clients. It also designates access permissions and other mapping capabilities. In the single system model, a single instance of each of the smbd and nmbd daemons are automatically started up by the /etc/rc.d/init.d/smb runlevel script. To implement high availibility Samba services, rather than having a single /etc/samba/smb.conf le, each service has its own Samba conguration le. These les are named /etc/samba/smb.conf.sharename, where sharename is the specic name of the individual conguration le associated with a Samba service. For example, if you created a share called mktg, the corresponding Samba conguration le would be /etc/samba/smb.conf.mktg. Note A Samba share must be in a service with at least one IP address. The format of the smb.conf.sharename le is identical to the conventional smb.conf format. No additional elds have been created for clustered operation. There are several elds within the smb.conf.sharename le which are required for correct cluster operation; these elds will be described in Section 6.8 Fields in the smb.conf.sharename File. When a new Samba service is created using the Cluster Conguration Tool, the corresponding smb.conf.sharename le is created based on the service-specic parameters, including appropriate client systems, specic directories to share, and read-write permissions. Copy the /etc/samba/smb.conf.sharename les onto all members in the cluster (or all members in an unrestricted failover domain, if used); refer to Section 3.9 Conguring a Failover Domain for more information. After the initial conguration, should any changes be made to any smb.conf.sharename le, you must also copy the updated version to the other members. To facilitate high-availability Samba functionality, each individual Samba service congured within the cluster (through the Cluster Conguration Tool) will have its own individual pair of smbd and nmbd daemons. Consequently, if there are more than one Samba services congured with the cluster, you may see multiple instances of these daemon pairs running on an individual cluster server. These Samba daemons smbd and nmbd are not initiated via the conventional init.d run level scripts; rather they are initiated by the cluster infrastructure based on whichever member is the active service provider. To allow a single system to run multiple instances of the Samba daemons, every pair of daemons is required to have both its own locking directory and its own process ID (pid) directory. Consequently, there will be a separate per-service Samba daemon locking and running process directory. These directories are given the name /var/cache/samba/sharename/ for lock les and /var/run/samba/sharename/ for pid les (where sharename is replaced by the Samba share name specied within the service conguration information set using the Cluster Conguration Tool). Continuing the prior example, the corresponding directories for our mktg share would be /var/cache/samba/mktg/ and /var/run/samba/mktg/. Chapter 6. Network File Sharing Services 87 6.7. Using the Samba Druid This section describes how to use the Samba Druid to quickly congure an Samba share for client access. 1. Start the Cluster Status Tool. Verify that the cluster daemons are running; if not, choose Cluster => Start Cluster Service to start the cluster daemons. 2. In the Cluster Status Tool, choose Cluster => Congure to display the Cluster Conguration Tool. 3. Start the Samba Druid by choosing Add Exports => Samba... and click Forward to continue. Figure 6-6. Samba Druid 4. Choose to create a new service with a new oating IP address. Enter a Service Name ? A name used to uniquely identify this service within the cluster. Enter a Service IP Address ? Clients access le shares from a server as designated by its IP address (or associated hostname). To abstract clients from knowing which specic cluster member is the acting Samba server, the client systems should not use the cluster member's hostname as the IP address by which a service is accessed. Rather, clustered Samba services are assigned oating IP addresses which are distinct from the cluster server's IP addresses. This oating IP address is then congured on which ever cluster member is actively serving the share. Following this approach, the clients are only aware of the oating IP address and are unaware of the fact that clustered Samba services have been deployed. 88 Chapter 6. Network File Sharing Services Figure 6-7. Select Service for Export 5. Enter the device special lename and mount point for the service. Mount information ? For non-clustered le systems, the mount information is typically placed in /etc/fstab. In contrast, clustered le systems must not be placed in /etc/fstab. This is necessary to ensure that only one cluster member at a time has the le system mounted. Failure to do so will result in le system corruption and potential system crashes. ? Device Special File ? The mount information designates the disk's device special le and the directory on which the le system will be mounted. ? Device Mount point ? A Samba service can include more than one le system mount. In this manner, the le systems will be grouped together as a single failover unit. Figure 6-8. Select Device for Export 6. Enter a Share Name ? Species the name by which clients refer to the mount point. Based on the name you specify, a corresponding /etc/samba/smb.conf.sharename le and lock directory /var/cache/samba/sharename will be created. By convention the actual Windows share name specied within the smb.conf.sharename will be set in accordance with this parameter. In practice, you can designate more than one Samba share within an individual smb.conf.sharename le. There can be at most one samba conguration specied per Chapter 6. Network File Sharing Services 89 service, which must be specied with the rst device. For example, if you have multiple disk devices (and corresponding le system mounts) within a single service, then specify a single sharename for the service. Then within the /etc/samba/smb.conf.sharename le, designate multiple individual Samba shares to share directories from the multiple devices. To disable Samba sharing of a service, the share name should be set to None. Figure 6-9. Samba Share Name 7. Click Apply to save the congurationg le (/etc/samba/smb.conf.sharename) to the cluster member. Figure 6-10. Samba Share Completion 8. Save the conguration for the Samba service by choosing File => Save from the Cluster Con- guration Tool. 9. Copy /etc/samba/smb.conf.sharename over to the other cluster members. To modify your Samba service conguration, click the Services tab in the Cluster Conguration Tool and click the triangular icon next to the Samba service to display the full child tree for the service. Double-click each child to modify options. ? Highlight the  service and click Properties to congure the following options: 90 Chapter 6. Network File Sharing Services Service Name ? A name used to uniquely identify this service within the cluster. Failover Domain ? Denes which systems are eligible to be the Samba server for this service when more than one cluster member is operational. Check Interval ? Species how often (in seconds) the cluster subsystem should verify that the Samba daemons (smbd and nmbd) associated with this service are running. In the event that either of these daemons have unexpectedly exited, they will be automatically restarted to resume services. If a value of 0 is specied, then no monitoring will be performed. For example, designating an interval of 90 seconds will result in monitoring at that interval. For the User Script, leave the eld as None, as the cluster infrastructure handles NFS service control and status checking. ? Choose the ! service ip address " child to change the Service IP Address and to enter a Netmask and Broadcast address, which are both set as None by default. If these elds are left as None, then the cluster infrastructure will use the netmask and broadcast IP address congured on the network device of the member running the service. ? Choose the ! device " child to modify the Device Special File and Samba Share Name, Mount Point, FS Type, and Mount Options. You can also check or uncheck the Force Unmount button. As part of the mount information, you can specify whether forced unmount should be enabled or not. When forced unmount is enabled, if any applications running on the cluster server have the designated le system mounted when the service is being disabled or relocated, then that application will be killed to allow the unmount to proceed. 6.7.1. Samba Considerations When running the Cluster Conguration Tool to congure Samba services: ? Correctly enter the service parameters, as the validation logic associated with Samba parameters is not robust. ? After conguring a Samba service via the Cluster Conguration Tool, remember to tune the /etc/samba/smb.conf.sharename le for each service in accordance with the clients and authorization scheme you desire. ? Remember to copy the smb.conf.sharename le over to other cluster members. ? Note that the cluster infrastructure creates Samba lock directories when it starts the service. ? If you delete a Samba service, the Cluster Conguration Tool automatically deletes this le to preserve your site-specic conguration parameters for possible later use. 6.8. Fields in the smb.conf.sharename File This section describes the elds within the smb.conf.sharename le which are most relevant to the correct operation of highly available Samba services. It is beyond the scope of this document to completely describe all of the elds within a Samba conguration le. There have been no additional eld names added in support of clustering, and the le format follows the normal Samba conventions. The following is an example smb.conf.sharename le which is automatically generated by the Cluster Conguration Tool. The example shows a share named mktg. The name of the le is /etc/samba/smb.conf.mktg. # Template samba service configuration file - please modify # to specify subdirectories and client access permissions. # Remember to copy this file over to *ALL* other cluster Chapter 6. Network File Sharing Services 91 # members. # # From a cluster perspective, the key fields are: # lock directory - must be unique per samba service. # bind interfaces only - must be present set to yes. # interfaces - must be set to service floating IP address. # path - must be the service mountpoint or subdirectory thereof. # Refer to the cluster documentation for details. [global] workgroup = RHCLUSTER pid directory = /var/run/samba/mktg lock directory = /var/cache/samba/mktg log file = /var/log/samba/%m.log encrypt passwords = yes bind interfaces only = yes interfaces = 192.168.26.11 [mktg] comment = High Availability Samba Service browsable = yes writable = no public = yes path = /share The following are descriptions of the most relevant elds, from a clustering perspective, in the /etc/samba/smb.conf.sharename le. In this example, the le is named /etc/samba/smb.conf.mktg in accordance with the share name that was specied (mktg) while running the Cluster Conguration Tool. Only the cluster-specic elds are described below. The remaining elds follow standard Samba syntax. Global Parameters These parameters pertain to all shares which are specied in the smb.conf.sharename le. It is possible to designate more than one share within this le, provided that the directories described within it are within the service's le system mounts. lock directory Dictates the name of the directory in which the Samba daemons (smbd and nmbd) will place their locking les. This must be set to /var/cache/samba/sharename, where sharename varies based on the parameter specied in the Cluster Conguration Tool. Specication of a lock directory is required to allow a separate per-service instance of smbd and nmbd. pid directory Dictates the name of the directory in which the Samba daemons (smbd and nmbd) will place their processor ID (pid) les. This must be set to /var/run/samba/sharename/, where sharename varies based on the parameter specied in the Cluster Conguration Tool tool. Specication of a pid directory is required to allow a separate per-service instance of smbd and nmbd. bind interfaces only This parameter must be set to yes to allow each smbd and nmbd pair to bind to the oating IP address associated with this clustered Samba service. 92 Chapter 6. Network File Sharing Services interfaces Species the IP address associated with the Samba service. If a netmask is specied within the service, this eld appears like the following example: interfaces = 10.0.0.10/255.255.254.0 Share-specic parameters These parameters pertain to a specic Samba share. writable By default, the share access permissions are conservatively set as non-writable. Tune this parameter according to your site-specic preferences. path Defaults to the rst le system mount point specied within the service conguration. This should be adjusted to match the specic directory or subdirectory intended to be available as a share to clients. Chapter 7. Setting Up Apache HTTP Server This chapter contains instructions for conguring Red Hat Enterprise Linux to make the Apache HTTP Server highly available. It provides an example of setting up a cluster service that fails over an Apache HTTP Server. Although the actual variables used in the service depend on the specic conguration, the example may assist in setting up a service for a particular environment. 7.1. Apache HTTP Server Setup Overview First, congure Apache HTTP Server on all members in the cluster. Consider assigning the service to a failover domain to reduce the number of systems that must be congured to run this service. Refer to Section 3.9 Conguring a Failover Domain for instructions. The cluster software ensures that only one cluster system runs the Apache HTTP Server at one time. The conguration consists of installing the httpd RPM package on all cluster members (or on members in the failover domain, if used) and conguring a shared le system to house the website's content. When installing the Apache HTTP Server on the cluster systems, do not congure the cluster systems so that the service automatically starts when the system boots by performing the following command: chkconfig --del httpd Rather than having the system startup scripts spawn httpd, the cluster infrastructure does that on the active cluster server. This ensures that the corresponding IP address and le system mounts are active on only one cluster member at a time. When adding an httpd service, a oating IP address must be assigned to the service such that the IP address will transfer from one cluster member to another in the event of failover or service relocation. The cluster infrastructure binds this IP address to the network interface on the cluster system that is currently running the Apache HTTP Server. This IP address ensures that the cluster system running the httpd service is transparent to the HTTP clients accessing the Apache HTTP Server. The le systems that contain the Web content must not be automatically mounted on shared disk storage when the cluster systems boot. Instead, the cluster software must mount and unmount the le systems as the httpd service is started and stopped on the cluster systems. This prevents the cluster systems from accessing the same data simultaneously, which may result in data corruption. Therefore, do not include the le systems in the /etc/fstab le. 7.2. Conguring Shared Storage To set up the shared le system, perform the following tasks as root on one cluster system: 1. On a shared disk, use the interactive parted utility to create a partition to use for the document root directory. Note that it is possible to create multiple document root directories on different disk partitions. Refer to Section 2.4.4.4 Partitioning Disks for more information. 2. Use the mkfs command to create an ext3 le system on the partition you created in the previous step. Specify the drive letter and the partition number. For example: mkfs -t ext3 /dev/sde3 3. Mount the le system that contains the document root directory. For example: mount /dev/sde3 /var/www/html 94 Chapter 7. Setting Up Apache HTTP Server Do not add this mount information to the /etc/fstab le because only the cluster software can mount and unmount le systems used in a service. 4. Copy all the required les to the document root directory. 5. If you have CGI les or other les that must be in different directories or in separate partitions, repeat these steps, as needed. 7.3. Installing and Conguring the Apache HTTP Server The Apache HTTP Server must be installed and congured on all members in the assigned failover domain, if used, or in the cluster. The basic server conguration must be the same on all members on which it runs for the service to fail over correctly. The following example shows a basic Apache HTTP Server installation that includes no third-party modules or performance tuning. On all member systems in the cluster (or members in the failover domain, if used), install the httpd RPM package. For example: rpm -Uvh httpd- # version $ . # arch $ .rpm On one system, perform the following tasks: 1. Edit the /etc/httpd/conf/httpd.conf conguration le and customize the le according to your conguration. For example: ? Specify the directory that contains the HTML les. Also specify this mount point when adding the service to the cluster conguration. It is only required to change this eld if the mountpoint for the website's content differs from the default setting of /var/www/html/. For example: DocumentRoot "/mnt/httpdservice/html" ? If the script directory resides in a non-standard location, specify the directory that contains the CGI programs. For example: ScriptAlias /cgi-bin/ "/mnt/httpdservice/cgi-bin/" ? Specify the path that was used in the previous step, and set the access permissions to default to that directory. For example: # Directory /mnt/httpdservice/cgi-bin" AllowOverride None $ Options None Order allow,deny Allow from all # /Directory $ Additional changes may need to be made to tune the Apache HTTP Server or add module functionality. For information on setting up other options, refer to the Red Hat Enterprise Linux System Administration Guide and the Red Hat Enterprise Linux Reference Guide. 2. The standard Apache HTTP Server start script, /etc/rc.d/init.d/httpd is also used within the cluster framework to start and stop the Apache HTTP Server on the active cluster member. Accordingly, when conguring the service, specify this script in the User Script eld of the Service dialog box. 3. Copy the conguration le over to the other members of the cluster (or members of the failover domain, if congured). Before the service is added to the cluster conguration, ensure that the Apache HTTP Server directories are not mounted. Then, on one member, invoke the Cluster Conguration Tool to add the service, as follows. This example assumes a failover domain named httpd-domain was created for Chapter 7. Setting Up Apache HTTP Server 95 this service. Figure 7-1 shows the % device & settings for conguring the shared storage and mount point where HTML les and CGI scripts will be stored. Figure 7-1. Conguring Apache HTTP Server 1. Select the Services tab and click New. The Service properties dialog box is displayed. a. Give the service a name (for example, httpd). b. Choose httpd-domain from the Failover Domain list. c. Specify a value in the Check Interval eld. d. Specify /etc/rc.d/init.d/httpd in the User Script eld. e. Click OK. 2. Select the httpd service on the Services tab and click Add Child. The Add Device or Service IP Address dialog box is displayed. a. Choose Add Device and click OK. The Device properties dialog box is displayed. b. Enter the device special le name in the Device Special File eld (for example, /dev/hda7). c. Enter the mount point in the Mount Point eld (for example, /var/www/html/). d. Choose ext3 from the FS Type list. e. Enter rw in the Options eld. f. Ensure that Force Unmount is checked, and click OK. 3. Ensure that the httpd service is still selected in the Services tab and click Add Child. The Add Device or Service IP Address dialog box is displayed. a. Choose Add Service IP Address and click OK. The Service IP Address properties dialog box is displayed. b. In the IP Address eld, specify an IP address, which the cluster infrastructure binds to the network interface on the cluster system that runs the httpd service (for example, 192.168.26.10). c. Specify a netmask of None in the Netmask eld. d. In the Broadcast eld, specify an IP address of None for broadcasting on the cluster subnet. e. Click OK. 96 Chapter 7. Setting Up Apache HTTP Server 4. Choose File => Save to save your changes. 5. To start the Apache HTTP Server within the Cluster Status Tool, highlight the service and click Enable. Chapter 8. Cluster Administration This chapter describes the various administrative tasks involved in maintaining a cluster after it has been installed and congured. 8.1. Overview of the Cluster Status Tool The Cluster Status Tool displays the status of the cluster service, cluster members, and application services, and shows statistics concerning service operation. The cluster conguration le (maintained by the Cluster Conguration Tool) is used to determine how to manage the members, services, and cluster daemons. Important Do not manually edit the cluster conguration le. Use the Cluster Status Tool to start and stop the cluster service on that particular member, restart application services, or move an application service to another member. The Cluster Conguration Tool can be displayed by choosing Cluster => Congure within the Cluster Status Tool. Figure 8-1. Cluster Status Tool 8.2. Displaying Cluster and Service Status Monitoring cluster and application service status can help identify and resolve problems in the cluster environment. The following tools assist in displaying cluster status information: ? The clustat command 98 Chapter 8. Cluster Administration ? The clufence command ? Log le messages ? The cluster monitoring GUI Note that status is representative of the cluster system on which an administrator is running a particular tool. To obtain comprehensive cluster status on all member, run the tools on all cluster systems. Important Members that are not running the cluster software cannot determine or report the status of other members of the cluster. Cluster and service status includes the following information: ? Cluster member system status ? Heartbeat channel status ? Service status and which cluster system is running the service or owns the service ? Service monitoring status of the cluster system The following tables describe how to analyze the status information shown by the Cluster Status Tool and clustat command. Member Status Description Active The member system is communicating with other member systems and accessing the quorum partitions. Inactive The member system is unable to communicate with the other member system. Table 8-1. Member Status Service Status Description Running The service resources are congured and available on the cluster system that owns the service. Pending The service has failed on a member and is pending start on another member. Disabled The service has been disabled, and does not have an assigned owner. Stopped The service not running; waiting for a member capable of starting service. Failed The service has failed to start on and the cluster cannot successfully stop the service. Refer to Section 4.8 Handling Failed Services for more information on failed services. Table 8-2. Service Status To display a snapshot of the current cluster status from a shell prompt, invoke the clustat utility. Example output is as follows: Chapter 8. Cluster Administration 99 Cluster Status - clustertest 22:15:32 Quorum: Yes, view 36 Shared State: Shared Raw Device Driver v1.0 [Min. Size=1176064] Member Status ------------------ ---------- clu1 Active clu2 Active <-- You are here Service Status Owner (Last) Last Transition Chk Restarts -------------- -------- ---------------- --------------- --- -------- nfs_clutest started clu1 22:11:28 Jul 21 0 0 smb_clutest started clu2 22:13:06 Jul 21 0 0 To monitor the cluster and display status at specic time intervals from a shell prompt, invoke clustat with the -i time option, where time species the number of seconds between status snapshots. For example: clustat -i 10 8.3. Starting and Stopping the Cluster Software To start the cluster software on a member, run the Cluster Status Tool and choose Cluster => Start Local Cluster Daemons. The cluster member's Status eld changes from Disabled to Active. Figure 8-2. Cluster Member Started You can also start the cluster software at a shell prompt by typing the following: /sbin/service clumanager start To stop a cluster member using the Cluster Status Tool, choose Cluster => Stop Local Cluster Daemons. The cluster member's Status eld changes from Cluster is running to Cluster is not running. 100 Chapter 8. Cluster Administration Figure 8-3. Cluster Member Stopped To stop the cluster software on a cluster system at a shell prompt, run the following command: /sbin/service clumanager stop Stopping the cluster services on a member causes its services to failover to an active member. 8.4. Modifying the Cluster Conguration It may be necessary to modify the cluster conguration. For example, it may be necessary to correct heartbeat channel or quorum partition entries in the cluster database, a copy of which is located in the /etc/cluster.xml le. Important Use only the Cluster Conguration Tool to modify the cluster conguration. Do not directly edit the /etc/cluster.xml le with a text editor. To modify the cluster conguration, stop the cluster software on all cluster systems, as described in Section 8.3 Starting and Stopping the Cluster Software. Start the Cluster Conguration Tool and modify the elds that you need to change. Choose File => Save to save your changes. Settings automatically transfer via shared storage to active cluster members. 8.5. Backing Up and Restoring the Cluster Database The Cluster Conguration Tool automatically saves a back-up copy of your /etc/cluster.xml le whenever you save your conguration. This is useful if the cluster does not function correctly due to misconguration and you need to return to a previous working conguration. The back-up conguration is stored in a le called /etc/cluster.xml.bak. If a cluster member becomes inoperable due to misconguration, stop cluster services on all members: /sbin/service cluster stop Next, overwrite the erroneous conguration with a previously saved conguration: Chapter 8. Cluster Administration 101 cp /etc/cluster.xml.bak /etc/cluster.xml Finally, re-initialize the shared storage: shutil -s /etc/cluster.xml 8.6. Modifying Cluster Event Logging It is possible to modify the severity level of the events that are logged by the clupowerd, cluquorumd, cluhbd, and clusvcmgrd daemons. This is done so that the daemons on the cluster systems log messages at a user-dened severity level. Figure 8-4. Cluster Logging Severity Levels To change a cluster daemon's logging level on all cluster members, use the Cluster Conguration Tool and choose Cluster => Daemon Properties. Specify the daemon using the tabs, and the severity level for each daemon using the pull down menu. The following list denes each severity level: ? EMERG - The system is unusable ? ALERT - Action must be taken immediately ? CRIT - Critical conditions ? ERR - Error conditions ? WARN - Warning conditions ? NOTICE - Normal but signicant condition ? INFO - Informational ? DEBUG - Debug-level messages Note that the cluster logs messages with the designated severity level and also messages of a higher severity. For example, if the severity level for quorum daemon messages is CRIT, then the cluster logs messages for CRIT, ALERT, and EMERG severity levels. Also note that setting the logging level to a low severity level (such as DEBUG) results in large log les over time. 8.7. Updating the Cluster Software To update the cluster software while minimizing service downtime, follow these steps: 102 Chapter 8. Cluster Administration 1. Stop the cluster software on the rst member to be upgraded by clicking Cluster => Stop Local Cluster Daemons from the Cluster Status Tool. Alternatively, run the following command at a shell prompt: /sbin/service clumanager stop 2. Make a copy of the cluster conguration le. For example, to save the le to the /root directory, run the following command: cp /etc/cluster.xml /root/ 3. Upgrade the cluster packages using the steps outlined in Section 3.1 Installing the Red Hat Cluster Suite Packages. 4. Copy the conguration le you saved back to the /etc/ directory. cp /root/cluster.xml /etc/ 5. Start the cluster software on the updated cluster system by clicking Cluster => Start Local Cluster Daemons from the Cluster Status Tool. Alternatively, run the following command at a shell prompt: /sbin/service clumanager start 6. Repeat steps 1 - 5 for each cluster member to be updated. 8.8. Changing the Cluster Name Start the Cluster Conguration Tool and type a new name into the Cluster Name eld. The cluster name is used to distinctively separate multiple clusters by function or department (such as clu_marketing or test_cluster). 8.9. Disabling the Cluster Software It may become necessary to temporarily disable the cluster software on a member system. For example, if a cluster system experiences a hardware failure, an administrator may want to reboot the system, but prevent it from rejoining the cluster to perform maintenance on the system. Use the /sbin/chkconfig command to stop the member from joining the cluster at boot-up. /sbin/chkconfig --del clumanager Once the issues with the disabled cluster member has been resolved, use the following command to allow it to rejoin the cluster: /sbin/chkconfig --add clumanager You can then reboot the system for the changes to take effect or run the following command to restart cluster services: /sbin/service clumanager start 8.10. Diagnosing and Correcting Problems in a Cluster To ensure the proper diagnosis of any problems in a cluster, event logging must be enabled. In addition, if problems arise in a cluster, be sure to set the severity level to DEBUG for the cluster daemons. This logs descriptive messages that may help solve problems. Chapter 8. Cluster Administration 103 Note Once any issues have been resolved, reset the debug level back down to its default value of WARN to avoid excessively large log message les from being generated. Refer to Section 8.6 Modifying Cluster Event Logging for more information. Use Table 8-3 to troubleshoot issues in a cluster. Problem Symptom Solution SCSI bus not terminated SCSI errors appear in the log le Each SCSI bus must be terminated only at the beginning and end of the bus. Depending on the bus conguration, it might be necessary to enable or disable termination in host bus adapters, RAID controllers, and storage enclosures. To support hot plugging, external termination is required to terminate a SCSI bus. In addition, be sure that no devices are connected to a SCSI bus using a stub that is longer than 0.1 meter. Refer to Section 2.4.4 Conguring Shared Disk Storage and Section D.3 SCSI Bus Termination for information about terminating different types of SCSI buses. SCSI bus length greater than maximum limit SCSI errors appear in the log le Each type of SCSI bus must adhere to restrictions on length, as described in Section D.4 SCSI Bus Length. In addition, ensure that no single-ended devices are connected to the LVD SCSI bus, because this causes the entire bus to revert to a single-ended bus, which has more severe length restrictions than a differential bus. SCSI identication numbers not unique SCSI errors appear in the log le Each device on a SCSI bus must have a unique identication number. Refer to Section D.5 SCSI Identication Numbers for more information. 104 Chapter 8. Cluster Administration Problem Symptom Solution SCSI commands timing out before completion SCSI errors appear in the log le The prioritized arbitration scheme on a SCSI bus can result in low-priority devices being locked out for some period of time. This may cause commands to time out, if a low-priority storage device, such as a disk, is unable to win arbitration and complete a command that a host has queued to it. For some workloads, this problem can be avoided by assigning low-priority SCSI identication numbers to the host bus adapters. Refer to Section D.5 SCSI Identication Numbers for more information. Mounted quorum partition Messages indicating checksum errors on a quorum partition appear in the log le Be sure that the quorum partition raw devices are used only for cluster state information. They cannot be used for cluster services or for non-cluster purposes, and cannot contain a le system. Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions for more information. These messages could also indicate that the underlying block device special le for the quorum partition has been erroneously used for non-cluster purposes. Service le system is unclean A disabled service cannot be enabled Manually run a checking program such as fsck. Then, enable the service. Note that the cluster infrastructure does by default run fsck with the -p option to automatically repair le system inconsistencies. For particularly egregious error types, you may be required to manually initiate le system repair options. Chapter 8. Cluster Administration 105 Problem Symptom Solution Quorum partitions not set up correctly Messages indicating that a quorum partition cannot be accessed appear in the log le Run the /sbin/shutil -t command to check that the quorum partitions are accessible. If the command succeeds, run the shutil -p command on both cluster systems. If the output is different on the systems, the quorum partitions do not point to the same devices on both systems. Check to make sure that the raw devices exist and are correctly specied in the /etc/sysconfig/rawdevices le. Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions for more information. Cluster service operation fails Messages indicating the operation failed to appear on the console or in the log le There are many different reasons for the failure of a service operation (for example, a service stop or start). To help identify the cause of the problem, set the severity level for the cluster daemons to DEBUG to log descriptive messages. Then, retry the operation and examine the log le. Refer to Section 8.6 Modifying Cluster Event Logging for more information. Cluster service stop fails because a le system cannot be unmounted Messages indicating the operation failed appear on the console or in the log le Use the fuser and ps commands to identify the processes that are accessing the le system. Use the kill command to stop the processes. Use the lsof -t file_system command to display the identication numbers for the processes that are accessing the specied le system. If needed, pipe the output to the kill command. To avoid this problem, be sure that only cluster-related processes can access shared storage data. In addition, modify the service and enable forced unmount for the le system. This enables the cluster service to unmount a le system even if it is being accessed by an application or user. Incorrect entry in the cluster database Cluster operation is impaired The Cluster Status Tool can be used to examine and modify service conguration. The Cluster Conguration Tool is used to modify cluster parameters. 106 Chapter 8. Cluster Administration Problem Symptom Solution Incorrect Ethernet heartbeat entry in the cluster database or /etc/hosts le Cluster status indicates that a Ethernet heartbeat channel is OFFLINE even though the interface is valid Examine and modify the cluster conguration by running the Cluster Conguration Tool, as specied in Section 8.4 Modifying the Cluster Conguration, and correct the problem. In addition, be sure to use the ping command to send a packet to all network interfaces used in the cluster. Loose cable connection to power switch Power switch status using clufence returns an error or hangs Check the serial cable connection. Power switch serial port incorrectly specied in the cluster database Power switch status using clufence indicates a problem Examine the current settings and modify the cluster conguration by running the Cluster Conguration Tool, as specied in Section 8.4 Modifying the Cluster Conguration, and correct the problem. Table 8-3. Diagnosing and Correcting Problems in a Cluster II. Conguring a Linux Virtual Server Cluster Building a Linux Virtual Server (LVS) system offers highly-available and scalable solution for production services using specialized routing and load-balancing techniques congured through the Piranha Conguration Tool. This part discusses the conguration of high-performance systems and services with Red Hat Enterprise Linux and LVS. This section is licensed under the Open Publication License, V1.0 or later. For details refer to the Copyright page. Table of Contents 9. Introduction to Linux Virtual Server........................................................................................ 109 10. Linux Virtual Server Overview ............................................................................................... 111 11. Initial LVS Conguration......................................................................................................... 121 12. Setting Up a Red Hat Enterprise Linux LVS Cluster............................................................ 125 13. Conguring the LVS Routers with Piranha Conguration Tool.......................................... 133 Chapter 9. Introduction to Linux Virtual Server Using Red Hat Enterprise Linux, it is possible to create highly available server clustering solutions able to withstand many common hardware and software failures with little or no interruption of critical services. By allowing multiple computers to work together in offering these critical services, system administrators can plan and execute system maintenance and upgrades without service interruption. The chapters in this part guide you through the following steps in understanding and deploying a clustering solution based on the Red Hat Enterprise Linux Linux Virtual Server (LVS) technology: ? Explains the Linux Virtual Server technology used by Red Hat Enterprise Linux to create a loadbalancing cluster ? Explains how to congure a Red Hat Enterprise Linux LVS cluster ? Guides you through the Piranha Conguration Tool, a graphical interface used for conguring and monitoring an LVS cluster 9.1. Technology Overview Red Hat Enterprise Linux implements highly available server solutions via clustering. It is important to note that cluster computing consists of three distinct branches: ? Compute clustering (such as Beowulf) uses multiple machines to provide greater computing power for computationally intensive tasks. This type of clustering is not addressed by Red Hat Enterprise Linux. ? High-availability (HA) clustering uses multiple machines to add an extra level of reliability for a service or group of services. ? Load-balance clustering uses specialized routing techniques to dispatch trafc to a pool of servers. Red Hat Enterprise Linux addresses the latter two types of clustering technology. Using a collection of programs to monitor the health of the systems and services in the cluster. Note The clustering technology included in Red Hat Enterprise Linux is not synonymous with fault tolerance. Fault tolerant systems use highly specialized and often very expensive hardware to implement a fully redundant environment in which services can run uninterrupted by hardware failures. However, fault tolerant systems do not account for operator and software errors which Red Hat Enterprise Linux can address through service redundancy. Also, since Red Hat Enterprise Linux is designed to run on commodity hardware, it creates an environment with a high level of system availability at a fraction of the cost of fault tolerant hardware. 9.2. Basic Congurations While Red Hat Enterprise Linux can be congured in a variety of different ways, the congurations can be broken into two major categories: 110 Chapter 9. Introduction to Linux Virtual Server ? High-availability clusters using Red Hat Cluster Manager ? Load-balancing clusters using Linux Virtual Servers This part explains what a load-balancing cluster system is and how to congure a load-balancing system using Linux Virtual Servers on Red Hat Enterprise Linux. 9.2.1. Load-Balancing Clusters Using Linux Virtual Servers To an outside user accessing a hosted service (such as a website or database application), a Linux Virtual Server (LVS) cluster appears as one server. In reality, however, the user is actually accessing a cluster of two or more servers behind a pair of redundant LVS routers that distribute client requests evenly throughout the cluster system. Load-balanced clustered services allow administrators to use commodity hardware and Red Hat Enterprise Linux to create continuous and consistent access to all hosted services while also addressing availability requirements. An LVS cluster consists of at least two layers. The rst layer is composed of a pair of similarly con- gured Linux machines or cluster members. One of these machine acts as the LVS routers, congured to direct requests from the Internet to the cluster. The second layer consists of a cluster of machines called real servers. The real servers provide the critical services to the end-user while the LVS router balances the load on these servers. For a detailed overview of LVS clustering, refer to Chapter 10 Linux Virtual Server Overview. Chapter 10. Linux Virtual Server Overview Red Hat Enterprise Linux LVS clustering uses a Linux machine called the active router to send requests from the Internet to a pool of servers. To accomplish this, LVS clusters consist of two basic machine classications ? the LVS routers (one active and one backup) and a pool of real servers which provide the critical services. The active router serves two roles in the cluster: ? To balance the load on the real servers. ? To check the integrity of the services on each of the real servers. The backup router's job is to monitor the active router and assume its role in the event of failure. 10.1. A Basic LVS Conguration Figure 10-1 shows a simple LVS cluster consisting of two layers. On the rst layer are two LVS routers ? one active and one backup. Each of the LVS routers has two network interfaces, one interface on the Internet and one on the private network, enabling them to regulate trafc between the two networks. For this example the active router is using Network Address Translation or NAT to direct trafc from the Internet to a variable number of real servers on the second layer, which in turn provide the necessary services. Therefore, the real servers in this example are connected to a dedicated private network segment and pass all public trafc back and forth through the active LVS router. To the outside world, the server cluster appears as one entity. Figure 10-1. A Basic LVS Conguration 112 Chapter 10. Linux Virtual Server Overview Service requests arriving at the LVS cluster are addressed to a virtual IP address or VIP. This is a publicly-routable address the administrator of the site associates with a fully-qualied domain name, such as www.example.com, and which is assigned to one or more virtual server1. Note that a VIP address migrates from one LVS router to the other during a failover, thus maintaining a presence at that IP address, also known as oating IP addresses. VIP addresses may be aliased to the same device which connects the LVS router to the Internet. For instance, if eth0 is connected to the Internet, than multiple virtual servers can be aliased to eth0:1. Alternatively, each virtual server can be associated with a separate device per service. For example, HTTP trafc can be handled on eth0:1, and FTP trafc can be handled on eth0:2. Only one LVS router is active at a time. The role of the active router is to redirect service requests from virtual IP addresses to the real servers. The redirection is based on one of eight supported loadbalancing algorithms described further in Section 10.3 LVS Scheduling Overview. The active router also dynamically monitors the overall health of the specic services on the real servers through simple send/expect scripts. To aid in detecting the health of services that require dynamic data, such as HTTPS or SSL, the administrator can also call external executables. If a service on a real server malfunctions, the active router stops sending jobs to that server until it returns to normal operation. The backup router performs the role of a standby system. Periodically, the LVS routers exchange heartbeat messages through the primary external public interface and, in a failover situation, the private interface. Should the backup node fail to receive a heartbeat message within an expected interval, it initiates a failover and assumes the role of the active router. During failover, the backup router takes over the VIP addresses serviced by the failed router using a technique known as ARP spoong ? where the backup LVS router announces itself as the destination for IP packets addressed to the failed node. When the failed node returns to active service, the backup node assumes its hot-backup role again. The simple, two-layered conguration used in Figure 10-1 is best for clusters serving data which does not change very frequently ? such as static webpages ? because the individual real servers do not automatically sync data between each node. 10.1.1. Data Replication and Data Sharing Between Real Servers Since there is no built-in component in LVS clustering to share the same data between the real servers, the administrator has two basic options: ? Synchronize the data across the real server pool ? Add a third layer to the topology for shared data access The rst option is preferred for servers that do not allow large numbers of users to upload or change data on the real servers. If the cluster allows large numbers of users to modify data, such as an ecommerce website, adding a third layer is preferable. 10.1.1.1. Conguring Real Servers to Synchronize Data There are many ways an administrator can choose to synchronize data across the pool of real servers. For instance, shell scripts can be employed so that if aWeb engineer updates a page, the page is posted to all of the servers simultaneously. Also, the cluster administrator can use programs such as rsync to replicate changed data across all nodes at a set interval. However, this type of data synchronization does not optimally function if the cluster is overloaded with users constantly uploading les or issuing database transactions. For a cluster with a high load, a three-tiered topology is the ideal solution. 1. A virtual server is a service congured to listen on a specic virtual IP. Refer to Section 13.6 VIRTUAL SERVERS for more on conguring a virtual server using the Piranha Conguration Tool. Chapter 10. Linux Virtual Server Overview 113 10.2. A Three Tiered LVS Conguration Figure 10-2 shows a typical three tiered LVS cluster topology. In this example, the active LVS router routes the requests from the Internet to the pool of real servers. Each of the real servers then accesses a shared data source over the network. Figure 10-2. A Three Tiered LVS Conguration This conguration is ideal for busy FTP servers, where accessible data is stored on a central, highly available server and accessed by each real server via an exported NFS directory or Samba share. This topography is also recommended for websites that access a central, highly available database for transactions. Additionally, using an active-active conguration with Red Hat Cluster Manager, administrators can congure one high-availability cluster to serve both of these roles simultaneously. The third tier in the above example does not have to use Red Hat Cluster Manager, but failing to use a highly available solution would introduce a critical single point of failure. 10.3. LVS Scheduling Overview One of the advantages of using an LVS cluster is its ability to perform exible, IP-level load balancing on the real server pool. This exibility is due to the variety of scheduling algorithms an administrator can choose from when conguring a cluster. LVS load balancing is superior to less exible methods, such as Round-Robin DNS where the hierarchical nature of DNS and the caching by client machines 114 Chapter 10. Linux Virtual Server Overview can lead to load imbalances. Additionally, the low-level ltering employed by the LVS router has advantages over application-level request forwarding because balancing loads at the network packet level causes minimal computational overhead and allows for greater scalability. Using scheduling, the active router can take into account the real servers' activity and, optionally, an administrator-assigned weight factor when routing service requests. Using assigned weights gives arbitrary priorities to individual machines. Using this form of scheduling, it is possible to create a group of real servers using a variety of hardware and software combinations and the active router can evenly load each real server. The scheduling mechanism for an LVS cluster is provided by a collection of kernel patches called IP Virtual Server or IPVS modules. These modules enable layer 4 (L4) transport layer switching, which is designed to work well with multiple servers on a single IP address. To track and route packets to the real servers efciently, IPVS builds an IPVS table in the kernel. This table is used by the active LVS router to redirect requests from a virtual server address to and returning from real servers in the pool. The IPVS table is constantly updated by a utility called ipvsadm ? adding and removing cluster members depending on their availability. 10.3.1. Scheduling Algorithms The structure that the IPVS table takes depends on the scheduling algorithm that the administrator chooses for any given virtual server. To allow for maximum exibility in the types of services you can cluster and how these services are scheduled, Red Hat Enterprise Linux provides the following scheduling algorithms listed below. For instructions on how to assign scheduling algorithms refer to Section 13.6.1 The VIRTUAL SERVER Subsection. Round-Robin Scheduling Distributes each request sequentially around the pool of real servers. Using this algorithm, all the real servers are treated as equals without regard to capacity or load. This scheduling model resembles round-robin DNS but is more granular due to the fact that it is network-connection based and not host-based. LVS round-robin scheduling also does not suffer the imbalances caused by cached DNS queries. Weighted Round-Robin Scheduling Distributes each request sequentially around the pool of real servers but gives more jobs to servers with greater capacity. Capacity is indicated by a user-assigned weight factor, which is then adjusted upward or downward by dynamic load information. Refer to Section 10.3.2 Server Weight and Scheduling for more on weighting real servers. Weighted round-robin scheduling is a preferred choice if there are signicant differences in the capacity of real servers in the pool. However, if the request load varies dramatically, the more heavily weighted server may answer more than its share of requests. Least-Connection Distributes more requests to real servers with fewer active connections. Because it keeps track of live connections to the real servers through the IPVS table, least-connection is a type of dynamic scheduling algorithm, making it a better choice if there is a high degree of variation in the request load. It is best suited for a real server pool where each member node has roughly the same capacity. If a group of servers have different capabilities, weighted least-connection scheduling is a better choice. Weighted Least-Connections (default) Distributes more requests to servers with fewer active connections relative to their capacities. Capacity is indicated by a user-assigned weight, which is then adjusted upward or downward by dynamic load information. The addition of weighting makes this algorithm ideal when the real Chapter 10. Linux Virtual Server Overview 115 server pool contains hardware of varying capacity. Refer to Section 10.3.2 Server Weight and Scheduling for more on weighting real servers. Locality-Based Least-Connection Scheduling Distributes more requests to servers with fewer active connections relative to their destination IPs. This algorithm is designed for use in a proxy-cache server cluster. It routes the packets for an IP address to the server for that address unless that server is above its capacity and has a server in its half load, in which case it assigns the IP address to the least loaded real server. Locality-Based Least-Connection Scheduling with Replication Scheduling Distributes more requests to servers with fewer active connections relative to their destination IPs. This algorithm is also designed for use in a proxy-cache server cluster. It differs from Locality- Based Least-Connection Scheduling by mapping the target IP address to a subset of real server nodes. Requests are then routed to the server in this subset with the lowest number of connections. If all the nodes for the destination IP are above capacity, it replicates a new server for that destination IP address by adding the real server with the least connections from the overall pool of real servers to the subset of real servers for that destination IP. The most loaded node is then dropped from the real server subset to prevent over-replication. Destination Hash Scheduling Distributes requests to the pool of real servers by looking up the destination IP in a static hash table. This algorithm is designed for use in a proxy-cache server cluster. Source Hash Scheduling Distributes requests to the pool of real servers by looking up the source IP in a static hash table. This algorithm is designed for LVS routers with multiple rewalls. 10.3.2. Server Weight and Scheduling The administrator of an LVS cluster can assign a weight to each node in the real server pool. This weight is an integer value which is factored into any weight-aware scheduling algorithms (such as weighted least-connections) and helps the LVS router more evenly load hardware with different capabilities. Weights work as a ratio relative to one another. For instance, if one real server has a weight of 1 and the other server has a weight of 5, then the server with a weight of 5 gets 5 connections for every 1 connection the other server gets. The default value for a real server weight is 1. Although adding weight to varying hardware congurations in a real server pool can help load-balance the cluster more efciently, it can cause temporary imbalances when a real server is introduced to the real server pool and the virtual server is scheduled using weighted least-connections. For example, suppose there are three servers in the real server pool. Servers A and B are weighted at 1 and the third, server C, is weighted at 2. If server C goes down for any reason, servers A and B evenly distributes the abandoned load. However, once server C comes back online, the LVS router sees it has zero connections and oods the server with all incoming requests until it is on par with servers A and B. To prevent this phenomenon, administrators can make the virtual server a quiesce server ? anytime a new real server node comes online, the least-connections table is reset to zero and the LVS router routes requests as if all the real servers were newly added to the cluster. 116 Chapter 10. Linux Virtual Server Overview 10.4. Routing Methods Red Hat Enterprise Linux uses Network Address Translation or NAT routing for LVS clustering, which allows the administrator tremendous exibility when utilizing available hardware and integrating the cluster into an existing network. 10.4.1. NAT Routing Figure 10-3, illustrates an LVS cluster utilizing NAT routing to move requests between the Internet and a private network. Figure 10-3. An LVS Cluster Implemented with NAT Routing In the example, there are two NICs in the active LVS router. The NIC for the Internet has a real IP address on eth0 and has a oating IP address aliased to eth0:1. The NIC for the private network interface has a real IP address on eth1 and has a oating IP address aliased to eth1:1. In the event of failover, the virtual interface facing the Internet and the private facing virtual interface are takenover by the backup LVS router simultaneously. All of the cluster's real servers located on the private network use the oating IP for the NAT router as their default route to communicate with the active LVS router so that their abilities to respond to requests from the Internet is not impaired. In this example, the LVS router's public LVS oating IP address and private NAT oating IP address are aliased to two physical NICs. While it is possible to associate each oating IP address to its own physical device on the LVS router nodes, having more than two NICs is not a requirement. Using this topography, the active LVS router receives the request and routes it to the appropriate server. The real server then processes the request and returns the packets to the LVS router which uses network address translation to replace the address of the real server in the packets with the LVS Chapter 10. Linux Virtual Server Overview 117 routers public VIP address. This process is called IP masquerading because the actual IP addresses of the real servers is hidden from the requesting clients. Using this NAT routing, the real servers may be any kind of machine running various operating systems. The main disadvantage is that the LVS router may become a bottleneck in large cluster deployments because it must process outgoing as well as incoming requests. 10.5. Persistence and Firewall Marks In certain situations, it may be desirable for a client to reconnect repeatedly to the same real server, rather than have an LVS load balancing algorithm send that request to the best available server. Examples of such situations include multi-screen web forms, cookies, SSL, and FTP connections. In these cases, a client may not work properly unless the transactions are being handled by the same server to retain context. LVS provides two different features to handle this: persistence and rewall marks. 10.5.1. Persistence When enabled, persistence acts like a timer. When a client connects to a service, LVS remembers the last connection for a specied period of time. If that same client IP address connects again within that period, it is sent to the same server it connected to previously ? bypassing the load-balancing mechanisms. When a connection occurs outside the time window, it is handled according to the scheduling rules in place. Persistence also allows the administrator to specify a subnet mask to apply to the client IP address test as a tool for controlling what addresses have a higher level of persistence, thereby grouping connections to that subnet. Grouping connections destined for different ports can be important for protocols which use more than one port to communicate, such as FTP. However, persistence is not the most efcient way to deal with the problem of grouping together connections destined for different ports. For these situations, it is best to use rewall marks. 10.5.2. Firewall Marks Firewall marks are an easy and efcient way to a group ports used for a protocol or group of related protocols. For instance, if an LVS cluster is deployed to run an e-commerce site, rewall marks can be used to bundle HTTP connections on port 80 and secure, HTTPS connections on port 443. By assigning the same rewall mark to the virtual server for each protocol, state information for the transaction can be preserved because the LVS router forwards all requests to the same real server after a connection is opened. Because of its efciency and ease-of-use, administrators of LVS clusters should use rewall marks instead of persistence whenever possible for grouping connections. However, administrators should still add persistence to the virtual servers in conjunction with rewall marks to ensure the clients are reconnected to the same server for an adequate period of time. 10.6. LVS Cluster ? A Block Diagram LVS routers use a collection of programs to monitor cluster members and cluster services. Figure 10-4 illustrates how these various programs on both the active and backup LVS routers work together to manage the cluster. 118 Chapter 10. Linux Virtual Server Overview Figure 10-4. Components of a Running LVS Cluster The pulse daemon runs on both the active and passive LVS routers. On the backup router, pulse sends a heartbeat to the public interface of the active router to make sure the active router is still properly functioning. On the active router, pulse starts the lvs daemon and responds to heartbeat queries from the backup LVS router. Once started, the lvs daemon calls the ipvsadm utility to congure and maintain the IPVS routing table in the kernel and starts a nanny process for each congured virtual server on each real server. Each nanny process checks the state of one congured service on one real server, and tells the lvs daemon if the service on that real server is malfunctioning. If a malfunction is detected, the lvs daemon instructs ipvsadm to remove that real server from the IPVS routing table. If the backup router does not receive a response from the active router, it initiates failover by calling send_arp to reassign all virtual IP addresses to the NIC hardware addresses (MAC address) of the backup node, sends a command to the active router via both the public and private network interfaces to shut down the lvs daemon on the active router, and starts the lvs daemon on the backup node to accept requests for the congured virtual servers. 10.6.1. Components of an LVS Cluster Section 10.6.1.1 pulse shows a detailed list of each software component in an LVS router. 10.6.1.1. pulse This is the controlling process which starts all other daemons related to LVS routers. At boot time, the daemon is started by the /etc/rc.d/init.d/pulse script. It then reads the conguration le /etc/sysconfig/ha/lvs.cf. On the active router, pulse starts the LVS daemon. On the backup router, pulse determines the health of the active router by executing a simple heartbeat at a usercon gurable interval. If the active router fails to respond after a user-congurable interval, it initiates failover. During failover, pulse on the backup router instructs the pulse daemon on the active router to shut down all LVS services, starts the send_arp program to reassign the oating IP addresses to the backup router's MAC address, and starts the lvs daemon. Chapter 10. Linux Virtual Server Overview 119 10.6.1.2. lvs The lvs daemon runs on the active LVS router once called by pulse. It reads the conguration le /etc/sysconfig/ha/lvs.cf, calls the ipvsadm utility to build and maintain the IPVS routing table, and assigns a nanny process for each congured LVS service. If nanny reports a real server is down, lvs instructs the ipvsadm utility to remove the real server from the IPVS routing table. 10.6.1.3. ipvsadm This service updates the IPVS routing table in the kernel. The lvs daemon sets up and administers an LVS cluster by calling ipvsadm to add, change, or delete entries in the IPVS routing table. 10.6.1.4. nanny The nanny monitoring daemon runs on the active LVS router. Through this daemon, the active router determines the health of each real server and, optionally, monitors its workload. A separate process runs for each service dened on each real server. 10.6.1.5. /etc/sysconfig/ha/lvs.cf This is the LVS cluster conguration le. Directly or indirectly, all daemons get their conguration information from this le. 10.6.1.6. Piranha Conguration Tool This is the Web-based tool for monitoring, conguring, and administering an LVS cluster. This is the default tool to maintain the /etc/sysconfig/ha/lvs.cf LVS cluster conguration le. 10.6.1.7. send_arp This program sends out ARP broadcasts when the oating IP address changes from one node to another during failover. Chapter 11 Initial LVS Conguration reviews important post-installation conguration steps you should take before conguring Red Hat Enterprise Linux to be an LVS router. 120 Chapter 10. Linux Virtual Server Overview Chapter 11. Initial LVS Conguration After installing Red Hat Enterprise Linux, you must take some basic steps to set up both the LVS routers and the real servers in the LVS cluster. This chapter covers these initial steps in detail. Note The LVS router node that becomes the active node once the cluster is started is also referred to as the primary node. When conguring an LVS cluster, use the Piranha Conguration Tool on the primary node. 11.1. Conguring Services on the LVS Routers The Red Hat Enterprise Linux installation program installs all of the components needed to set up an LVS cluster, but the appropriate services must be activated before conguring the cluster. For both LVS routers, set the appropriate services to start at boot time. There are three primary tools available for setting services to activate at boot time under Red Hat Enterprise Linux: the command line program chkconfig, the ncurses-based program ntsysv, and the graphical Services Conguration Tool. All of these tools require root access. Tip To attain root access, open a shell prompt and type the following command followed by the root password: su - On the LVS routers, there are three services which need to be set to activate at boot time: ? The piranha-gui service (primary node only) ? The pulse service ? The sshd service If you are clustering multi-port services or using rewall marks, you must also enable the iptables service. It is best to set these services to activate in both runlevel 3 and runlevel 5. To accomplish this using chkconfig, type the following command for each service: /sbin/chkconfig --level 35 daemon on In the above command, replace daemon with the name of the service you are activating. To get a list of services on the system as well as what runlevel they are set to activate on, issue the following command: /sbin/chkconfig --list 122 Chapter 11. Initial LVS Conguration Warning Turning any of the above services on using chkconfig does not actually start the daemon. To do this use the /sbin/service command. See Section 11.3 Starting the Piranha Conguration Tool Service for an example of how to use the /sbin/service command. For more information on runlevels and conguring services with ntsysv and the Services Conguration Tool, refer to the chapter titled "Controlling Access to Services" in the Red Hat Enterprise Linux System Administration Guide. 11.2. Setting a Password for the Piranha Conguration Tool Before using the Piranha Conguration Tool for the rst time on the primary LVS router, you must restrict access to it by creating a password. To do this, login as root and issue the following command: /usr/sbin/piranha-passwd After entering this command, create the administrative password when prompted. Warning For a password to be more secure, it should not contain proper nouns, commonly used acronyms, or words in a dictionary from any language. Do not leave the password unencrypted anywhere on the system. If the password is changed during an active Piranha Conguration Tool session, the administrator is prompted to provide the new password. 11.3. Starting the Piranha Conguration Tool Service After you have set the password for the Piranha Conguration Tool, start or restart the piranha-gui service located in /etc/rc.d/init.d/piranha-gui. To do this, type the following command as root: /sbin/service piranha-gui start or /sbin/service piranha-gui restart Issuing this command starts a private session of the Apache HTTP Server by calling the symbolic link /usr/sbin/piranha_gui -> /usr/sbin/httpd. For security reasons, the piranha-gui version of httpd runs as the piranha user in a separate process. The fact that piranha-gui leverages the httpd service means that: 1. The Apache HTTP Server must be installed on the system. 2. Stopping or restarting the Apache HTTP Server via the service command stops the piranha-gui service. Chapter 11. Initial LVS Conguration 123 Warning If the command /sbin/service httpd stop or /sbin/service httpd restart is issued on an LVS router, you must start the piranha-gui service by issuing the following command: /sbin/service piranha-gui start The piranha-gui service is all that is necessary to begin conguring an LVS cluster. However, if you are conguring the cluster remotely, the sshd service is also required. You do not need to start the pulse service until conguration using the Piranha Conguration Tool is complete. See Section 13.8 Starting the Cluster for information on starting the pulse service. 11.3.1. Conguring the Piranha Conguration Tool Web Server Port The Piranha Conguration Tool runs on port 3636 by default. To change this port number, change the line Listen 3636 in Section 2 of the piranha-gui Web server conguration le /etc/sysconfig/ha/conf/httpd.conf. To use the Piranha Conguration Tool you need at minimum a text-only Web browser. If you start a Web browser on the primary LVS router, open the location http://localhost:3636. You can reach the Piranha Conguration Tool from anywhere via Web browser by replacing localhost with the hostname or IP address of the primary LVS router. When your browser connects to the Piranha Conguration Tool, you must login to access the cluster conguration services. Enter piranha in the Username eld and the password set with piranha-passwd in the Password eld. Now that the Piranha Conguration Tool is running, you may wish to consider limiting who has access to the tool over the network. The next section reviews ways to accomplish this task. 11.4. Limiting Access To the Piranha Conguration Tool The Piranha Conguration Tool prompts for a valid username and password combination. However, because all of the data passed to the Piranha Conguration Tool is in plain text, it is recommended that you restrict access only to trusted networks or to the local machine. The easiest way to restrict access is to use the Apache HTTP Server's built in access control mechanisms by editing /etc/sysconfig/ha/web/secure/.htaccess. After altering the le you do not have to restart the piranha-gui service because the server checks the .htaccess le each time it accesses the directory. By default, the access controls for this directory allow anyone to view the contents of the directory. Here is what the default access looks like: Order deny,allow Allow from all To limit access of the Piranha Conguration Tool to only the localhost change the .htaccess le to allow access from only the loopback device (127.0.0.1). For more information on the loopback device, see the chapter titled Network Scripts in the Red Hat Enterprise Linux Reference Guide. Order deny,allow Deny from all Allow from 127.0.0.1 You can also allow specic hosts or subnets as seen in this example: 124 Chapter 11. Initial LVS Conguration Order deny,allow Deny from all Allow from 192.168.1.100 Allow from 172.16.57 In this example, only Web browsers from the machine with the IP address of 192.168.1.100 and machines on the 172.16.57/24 network can access the Piranha Conguration Tool. Caution Editing the Piranha Conguration Tool .htaccess le limits access to the conguration pages in the /etc/sysconfig/ha/web/secure/ directory but not to the login and the help pages in /etc/sysconfig/ha/web/. To limit access to this directory, create a .htaccess le in the /etc/sysconfig/ha/web/ directory with order, allow, and deny lines identical to /etc/sysconfig/ha/web/secure/.htaccess. 11.5. Turning on Packet Forwarding In order for the LVS router to forward network packets properly to the real servers, each LVS router node must have IP forwarding turned on in the kernel. Log in as root and change the line which reads net.ipv4.ip_forward = 0 in /etc/sysctl.conf to the following: net.ipv4.ip_forward = 1 The changes take effect when you reboot the system. To check if IP forwarding is turned on, issue the following command as root: /sbin/sysctl net.ipv4.ip_forward If the above command returns a 1, then IP forwarding is enabled. If it returns a 0, then you can turn it on manually using the following command: /sbin/sysctl -w net.ipv4.ip_forward=1 11.6. Conguring Services on the Real Servers If the real servers in the cluster are Red Hat Enterprise Linux systems, set the appropriate server daemons to activate at boot time. These daemons can include httpd for Web services or xinetd for FTP or Telnet services. It may also be useful to access the real servers remotely, so the sshd daemon should also be installed and running. Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster A Red Hat Enterprise Linux LVS cluster consists of two basic groups: the LVS routers and the real servers. To prevent a single point of failure, each groups should contain at least two member systems. The LVS router group should consist of two identical or very similar systems running Red Hat Enterprise Linux. One will act as the active LVS router while the other stays in hot standby mode, so they need to have as close to the same capabilities as possible. Before choosing and conguring the hardware for the real server group, you most decide what which of the three types of LVS topographies to use. 12.1. The NAT LVS Cluster The NAT topography allows for great latitude in utilizing existing hardware, but it is limited in its ability to handle large loads due to the fact that all packets going into and coming out of the cluster pass through the LVS router. Network Layout The topography for an LVS cluster utilizing NAT routing is the easiest to congure from a network layout perspective because the cluster needs only one access point to the public network. The real servers pass all requests back through the LVS router so they are on their own private network. Hardware The NAT topography is the most exible in regards to cluster hardware because the real servers do not need to be Linux machines to function correctly in the cluster. In a NAT cluster, each real server only needs one NIC since it will only be responding to the LVS router. The LVS routers, on the other hand, need two NICs each to route trafc between the two networks. Because this topography creates a network bottleneck at the LVS router, gigabit Ethernet NICs can be employed on each LVS router to increase the bandwidth the LVS routers can handle. If gigabit Ethernet is employed on the LVS routers, any switch connecting the real servers to the LVS routers must have at least two gigabit Ethernet ports to handle the load efciently. Software Because the NAT topography requires the use of iptables for some congurations, there can be a fair amount of software conguration outside of Piranha Conguration Tool. In particular, FTP services and the use of rewall marks requires extra manual conguration of the LVS routers to route requests properly. 12.1.1. Conguring Network Interfaces for a NAT LVS Cluster To set up a NAT LVS cluster, the administrator must rst congure the network interfaces for the public network and the private network on the LVS routers. In this example, the LVS routers' public interfaces (eth0) will be on the 192.168.26/24 network (I know, I know, this is not a routable IP, but let us pretend there is a rewall in front of the LVS router for good measure) and the private interfaces which link to the real servers (eth1) will be on the 10.11.12/24 network. So on the active or primary LVS router node, the public interface's network script, /etc/sysconfig/network-scripts/ifcfg-eth0, could look something like this: 126 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster DEVICE=eth0 BOOTPROTO=static ONBOOT=yes IPADDR=192.168.26.9 NETMASK=255.255.255.0 GATEWAY=192.168.26.254 The /etc/sysconfig/network-scripts/ifcfg-eth1 for the private NAT interface on the LVS router could look something like this: DEVICE=eth1 BOOTPROTO=static ONBOOT=yes IPADDR=10.11.12.9 NETMASK=255.255.255.0 In this example, the VIP for the LVS router's public interface will be 192.168.26.10 and the VIP for the NAT or private interface will be 10.11.12.10. So, it is essential that the real servers route requests back to the VIP for the NAT interface. Important The sample Ethernet interface conguration settings in this section are for the real IP addresses of an LVS router and not the oating IP addresses. To congure the public and private oating IP addresses the administrator should use the Piranha Conguration Tool, as shown in Section 13.4 GLOBAL SETTINGS and Section 13.6.1 The VIRTUAL SERVER Subsection. After conguring the primary LVS router node's network interfaces, congure the backup LVS router's real network interfaces ? taking care that none of the IP address conict with any other IP addresses on the network. Important Be sure each interface on the backup node services the same network as the interface on primary node. For instance, if eth0 connects to the public network on the primary node, it must also connect to the public network on the backup node as well. 12.1.2. Routing on the Real Servers The most important thing to remember when conguring the real servers network interfaces in a NAT cluster is to set the gateway for the NAT oating IP address of the LVS router. In this example, that address will be 10.11.12.10. Note Once the network interfaces are up on the real servers, the machines will be unable to ping or connect in other ways to the public network. This is normal. You will, however, be able to ping the real IP for the LVS router's private interface, in this case 10.11.12.8. Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster 127 So the real server's /etc/sysconfig/network-scripts/ifcfg-eth0 le could look similar to this: DEVICE=eth0 ONBOOT=yes BOOTPROTO=static IPADDR=10.11.12.1 NETMASK=255.255.255.0 GATEWAY=10.11.12.10 Warning If a real server has more than one network interface congured with a GATEWAY= line, the rst one to come up will get the gateway. Therefore if both eth0 and eth1 are congured and eth1 is used for LVS clustering, the real servers may not route requests properly. It is best to turn off extraneous network interfaces by setting ONBOOT=no in their network scripts within the /etc/sysconfig/network-scripts/ directory or by making sure the gateway is correctly set in the interface which comes up rst. 12.1.3. Enabling NAT Routing on the LVS Routers In a simple NAT LVS cluster where each clustered service uses only one port, like HTTP on port 80, the administrator needs only to enable packet forwarding on the LVS routers for the requests to be properly routed between the outside world and the real servers. See Section 11.5 Turning on Packet Forwarding for instructions on turning on packet forwarding. However, more conguration is necessary when the clustered services require more than one port to go to the same real server during a user session. For information on creating multi-port services using rewall marks, see Section 12.3 Multi-port Services and LVS Clustering. Once forwarding is enabled on the LVS routers and the real servers are set up and have the clustered services running, use the Piranha Conguration Tool to congure the cluster as shown in Chapter 13 Conguring the LVS Routers with Piranha Conguration Tool. Warning Do not congure the oating IP for eth0:1 or eth1:1 by manually editing network scripts or using a network conguration tool. Instead, use the Piranha Conguration Tool as shown in Section 13.4 GLOBAL SETTINGS and Section 13.6.1 The VIRTUAL SERVER Subsection to congure any cluster-related virtual interfaces. When nished, start the pulse service as shown in Section 13.8 Starting the Cluster. Once pulse is up and running, the active LVS router will begin routing requests to the pool of real servers. 12.2. Putting the Cluster Together After determining which of the above routing methods to use, the hardware for the LVS cluster should be linked together on the network. 128 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster Important The adapter devices on the LVS routers must be congured to access the same networks. For instance if eth0 connects to public network and eth1 connects to the private network, then these same devices on the backup LVS router must connect to the same networks. Also the gateway listed in the rst interface to come up at boot time is added to the routing table and subsequent gateways listed in other interfaces are ignored. This is especially important to consider when conguring the real servers. After physically connecting together the cluster hardware, congure the network interfaces on the primary and backup LVS routers. This can be done using a graphical application such as redhat-congnetwork or by editing the network scripts manually. For more information about adding devices using redhat-cong-network, see the chapter titled Network Conguration in the Red Hat Enterprise Linux System Administration Guide. For more information on editing network scripts by hand, see the chapter titled Network Scripts in the Red Hat Enterprise Linux Reference Guide. For the remainder of the chapter, example alterations to network interfaces are made either manually or through the Piranha Conguration Tool. 12.2.1. General LVS Networking Tips Congure the real IP addresses for both the public and private networks on the LVS routers before attempting to congure the cluster using the Piranha Conguration Tool. The sections on each topography give example network addresses, but the actual network addresses are needed. Below are some useful commands for bringing up network interfaces or checking their status. Bringing Up Real Network Interfaces The best way to bring up any real network interface is to use the following commands as root replacing N with the number corresponding to the interface (eth0 and eth1): /sbin/ifup ethN Warning Do not use the ifup scripts to bring up any oating IP addresses you may congure using Piranha Conguration Tool (eth0:1 or eth1:1). Use the service command to start pulse instead (see Section 13.8 Starting the Cluster for details). To bring a network interface down, type the following command: /sbin/ifdown ethN Again, replace N in the above command with the number corresponding to the interface you wish to bring down. Checking the Status of Network Interfaces If you need to check which network interfaces are up at any given time, type the following: /sbin/ifconfig To view the routing table for a machine, issue the following command: /sbin/route 12.3. Multi-port Services and LVS Clustering LVS routers under any topology require extra conguration when creating multi-port LVS services. Multi-port services can be created articially by using rewall marks to bundle together different, but related protocols, such as HTTP (port 80) and HTTPS (port 443), or when LVS is used to cluster true Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster 129 multi-port protocols, such as FTP. In either case, the LVS router uses rewall marks to recognize that packets destined for different ports, but bearing the same rewall mark, should be handled identically. Also, when combined with persistence, rewall marks ensure connections from the client machine are routed to the same host, as long as the connections occur within the length of time specied by the persistence parameter. For more on assigning persistence to a virtual server, see Section 13.6.1 The VIRTUAL SERVER Subsection. Unfortunately, the mechanism used to balance the loads on the real servers ?IPVS?can recognize the rewall marks assigned to a packet, but cannot itself assign rewall marks. The job of assigning rewall marks must be performed by the network packet lter, iptables, outside of Piranha Conguration Tool. 12.3.1. Assigning Firewall Marks To assign rewall marks to a packet destined for a particular port, the administrator must use iptables. This section illustrates how to bundle HTTP and HTTPS as an example, however FTP is another commonly clustered multi-port protocol. If an LVS cluster is used for FTP services, see Section 12.4 FTP In an LVS Cluster for details on how to best congure the cluster. The basic rule to remember when using rewall marks is that for every protocol using a rewall mark in Piranha Conguration Tool there must be a commensurate iptables rule to assign marks to the network packets. Before creating network packet lter rules, make sure there are no rules already in place. To do this, open a shell prompt, login as root, and type: /sbin/service iptables status If iptables is not running, the prompt will instantly reappear. If iptables is active, it displays a set of rules. If rules are present, type the following command: /sbin/service iptables stop If the rules already in place are important, check the contents of /etc/sysconfig/iptables and copy any rules worth keeping to a safe place before proceeding. Below are rules which assign the same rewall mark, 80, to incoming trafc destined for the oating IP address, n.n.n.n, on ports 80 and 443. For instructions on assigning the VIP to the public network interface, see Section 13.6.1 The VIRTUAL SERVER Subsection. Also note that you must log in as root and load the module for iptables before issuing rules for the rst time. /sbin/modprobe ip_tables /sbin/iptables -t mangle -A PREROUTING -p tcp \ -d n.n.n.n/32 --dport 80 -j MARK --set-mark 80 /sbin/iptables -t mangle-A PREROUTING -p tcp \ -d n.n.n.n/32 --dport 443 -j MARK --set-mark 80 In the above iptables commands, n.n.n.n should be replaced with the oating IP for your HTTP and HTTPS virtual servers. These commands have the net effect of assigning any trafc addressed to the VIP on the appropriate ports a rewall mark of 80, which in turn is recognized by IPVS and forwarded appropriately. 130 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster Warning The commands above will take effect immediately, but do not persist through a reboot of the system. To ensure network packet lter settings are restored upon reboot, refer to Section 12.5 Saving Network Packet Filter Settings 12.4. FTP In an LVS Cluster File Transport Protocol (FTP) is an old and complex multi-port protocol that presents a distinct set of challenges to a clustered environment. To understand the nature of these challenges, you must rst understand some key things about how FTP works. 12.4.1. How FTP Works With most other server client relationships, the client machine opens up a connection to the server on a particular port and the server then responds to the client on that port. When an FTP client connects to an FTP server it opens a connection to the FTP control port 21. Then the client tells the FTP server whether to establish an active or passive connection. The type of connection chosen by the client determines how the server responds and on what ports transactions will occur. The two types of data connections are: Active Connections When an active connection is established, the server opens a data connection to the client from port 20 to a high range port on the client machine. All data from the server is then passed over this connection. Passive Connections When a passive connection is established, the client asks the FTP server to establish a passive connection port, which can be on any port higher than 10,000. The server then binds to this high-numbered port for this particular session and relays that port number back to the client. The client then opens the newly bound port for the data connection. Each data request the client makes results in a separate data connection. Most modern FTP clients attempt to establish a passive connection when requesting data from servers. The two important things to note about all of this in regards to clustering is: 1. The client determines the type of connection, not the server. This means, to effectively cluster FTP, you must congure the LVS routers to handle both active and passive connections. 2. The FTP client/server relationship can potentially open a large number of ports that the Piranha Conguration Tool and IPVS do not know about. 12.4.2. How This Affects LVS Routing IPVS packet forwarding only allows connections in and out of the cluster based on it recognizing its port number or its rewall mark. If a client from outside the cluster attempts to open a port IPVS is not congured to handle, it drops the connection. Similarly, if the real server attempts to open a connection back out to the Internet on a port IPVS does not know about, it drops the connection. This means all connections from FTP clients on the Internet must have the same rewall mark assigned to them and all connections from the FTP server must be properly forwarded to the Internet using network packet ltering rules. Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster 131 12.4.3. Creating Network Packet Filter Rules Before assigning any iptables rules for FTP service, review the information in Section 12.3.1 Assigning Firewall Marks concerning multi-port services and techniques for checking the existing network packet ltering rules. Below are rules which assign the same rewall mark, 21, to FTP trafc. For these rules to work properly, you must also use the VIRTUAL SERVER subsection of Piranha Conguration Tool to congure a virtual server for port 21 with a value of 21 in the Firewall Mark eld. See Section 13.6.1 The VIRTUAL SERVER Subsection for details. 12.4.3.1. Rules for Active Connections The rules for active connections tell the kernel to accept and forward connections coming to the internal oating IP address on port 20 ? the FTP data port. iptables /sbin/iptables -t nat -A POSTROUTING -p tcp \ -s n.n.n.0/24 --sport 20 -j MASQUERADE In the above iptables commands, n.n.n should be replaced with the rst three values for the oating IP for the NAT interface's internal network interface dened in the GLOBAL SETTINGS panel of Piranha Conguration Tool. The command allows the LVS router to accept outgoing connections from the real servers that IPVS does not know about. 12.4.3.2. Rules for Passive Connections The rules for passive connections assign the appropriate rewall mark to connections coming in from the Internet to the oating IP for the service on a wide range of ports ? 10,000 to 20,000. Warning If you are limiting the port range for passive connections, you must also congure the VSFTP server to use a matching port range. This can be accomplished by adding the following lines to /etc/vsftpd.conf: pasv_min_port=10000 pasv_max_port=20000 You must also control the address that the server displays to the client for passive FTP connections. In a NAT routed LVS system, add the following line to /etc/vsftpd.conf to override the real server IP address to the VIP, which is what the client sees upon connection. For example: pasv_address=X.X.X.X Replace X.X.X.X with the VIP address of the LVS system. For conguration of other FTP servers, consult the respective documentation. This range should be a wide enough for most situations; however, you can increase this number to include all available non-secured ports by changing 10000:20000 in the commands below to 1024:65535. 132 Chapter 12. Setting Up a Red Hat Enterprise Linux LVS Cluster iptables /sbin/iptables -t mangle -A PREROUTING -p tcp \ -d n.n.n.n/32 \ --dport 21 -j MARK --set-mark 21 /sbin/iptables -t mangle -A PREROUTING -p tcp \ -d n.n.n.n/32 \ --dport 10000:20000 -j MARK --set-mark 21 In the above iptables commands, n.n.n.n should be replaced with the oating IP for the FTP virtual server dened in the VIRTUAL SERVER subsection of Piranha Conguration Tool. These commands have the net effect of assigning any trafc addressed to the oating IP on the appropriate ports a rewall mark of 21, which is in turn recognized by IPVS and forwarded appropriately. Warning The commands above take effect immediately, but do not persist through a reboot of the system. To ensure network packet lter settings are restored after a reboot, see Section 12.5 Saving Network Packet Filter Settings Finally, you need to be sure that the appropriate service is set to activate on the proper runlevels. For more on this, refer to Section 11.1 Conguring Services on the LVS Routers. 12.5. Saving Network Packet Filter Settings After conguring the appropriate network packet lters for your situation, save the settings so they get restored after a reboot. For iptables, type the following command: /sbin/service iptables save This saves the settings in /etc/sysconfig/iptables so they can be recalled at boot time. Once this le is written, you are able to use the /sbin/service command to start, stop, and check the status (using the status switch) of iptables. The /sbin/service will automatically load the appropriate module for you. For an example of how to use the /sbin/service command, see Section 11.3 Starting the Piranha Conguration Tool Service. Finally, you need to be sure the appropriate service is set to activate on the proper runlevels. For more on this, see Section 11.1 Conguring Services on the LVS Routers. The next chapter explains how to use the Piranha Conguration Tool to congure the LVS router and describe the steps necessary to active an LVS cluster. Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool The Piranha Conguration Tool provides a structured approach to creating the necessary conguration le for a Piranha cluster ? /etc/sysconfig/ha/lvs.cf. This chapter describes the basic operation of the Piranha Conguration Tool and how to activate the cluster once conguration is complete. Important The conguration le for the LVS cluster follows strict formatting rules. Using the Piranha Conguration Tool is the best way to prevent syntax errors in the lvs.cf and therefore prevent software failures. 13.1. Necessary Software The piranha-gui service must be running on the primary LVS router to use the Piranha Conguration Tool. To congure the cluster, you minimally need a text-only Web browser, such as links. If you are accessing the LVS router from another machine, you also need an ssh connection to the primary LVS router as the root user. While conguring the primary LVS router it is a good idea to keep a concurrent ssh connection in a terminal window. This connection provides a secure way to restart pulse and other services, congure network packet lters, and monitor /var/log/messages during trouble shooting. The next four sections walk through each of the conguration pages of the Piranha Conguration Tool and give instructions on using it to set up the LVS cluster. 13.2. Logging Into the Piranha Conguration Tool When conguring an LVS cluster, you should always begin by conguring the primary router with the Piranha Conguration Tool. To do this,verify that the piranha-gui service is running and an administrative password has been set, as described in Section 11.2 Setting a Password for the Piranha Conguration Tool. If you are accessing the machine locally, you can open http://localhost:3636 in a Web browser to access the Piranha Conguration Tool. Otherwise, type in the hostname or real IP address for the server followed by :3636. Once the browser connects, you will see the screen shown in Figure 13-1. 134 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool Figure 13-1. The Welcome Panel Click on the Login button and enter piranha for the Username and the administrative password you created in the Password eld. The Piranha Conguration Tool is made of four main screens or panels. In addition, the Virtual Servers panel contains four subsections. The CONTROL/MONITORING panel is the rst panel after the login screen. 13.3. CONTROL/MONITORING The CONTROL/MONITORING Panel presents the cluster administrator with a limited runtime status of the cluster. It displays the status of the pulse daemon, the LVS routing table, and the LVSspawned nanny processes. Note The elds for CURRENT LVS ROUTING TABLE and CURRENT LVS PROCESSES remain blank until you actually start the cluster, as shown in Section 13.8 Starting the Cluster . Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 135 Figure 13-2. The CONTROL/MONITORING Panel Auto update The status display on this page can be updated automatically at a user congurable interval. To enable this feature, click on the Auto update checkbox and set the desired update frequency in the Update frequency in seconds text box (the default value is 10 seconds). It is not recommended that you set the automatic update to an interval less than 10 seconds. Doing so may make it difcult to recongure the Auto update interval because the page will update too frequently. If you encounter this issue, simply click on another panel and then back on CONTROL/MONITORING. The Auto update feature does not work with all browsers, such as Mozilla. Update information now You can manually update the status information manually by clicking this button. CHANGE PASSWORD Clicking this button takes you to a help screen with information on how to change the administrative password for the Piranha Conguration Tool. 13.4. GLOBAL SETTINGS The GLOBAL SETTINGS panel is where the cluster administrator denes the networking details for the primary LVS router's public and private network interfaces. 136 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool Figure 13-3. The GLOBAL SETTINGS Panel The top half of this panel sets up the primary LVS router's public and private network interfaces. These are the interfaces already congured in Section 12.1.1 Conguring Network Interfaces for a NAT LVS Cluster. Primary server public IP In this eld, enter the publicly routable real IP address for the primary LVS node. Primary server private IP Enter the real IP address for an alternative network interface on the primary LVS node. This address is used solely as an alternative heartbeat channel for the backup router and does not have to correlate to the real private IP address assigned in Section 12.1.1 Conguring Network Interfaces for a NAT LVS Cluster. You may leave this eld blank, but doing so will mean there is no alternate heartbeat channel for the backup LVS router to use and therefore will create a single point of failure. Tip The primary LVS router's private IP can be congured on any interface that accepts TCP/IP, whether it be an Ethernet adapter or a serial port. Use network type Click the NAT button to select NAT routing. Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 137 The next three elds deal specically with the NAT router's virtual network interface connected the private network with the real servers. NAT Router IP Enter the private oating IP in this text eld. This oating IP should be used as the gateway for the real servers. NAT Router netmask If the NAT router's oating IP needs a particular netmask, select it from drop-down list. NAT Router device Use this text eld to dene the device name of the network interface for the oating IP address, such as eth1:1. Tip You should alias the NAT oating IP address to the Ethernet interface connected to the private network. In this example, the private network is on the eth1 interface, so eth1:1 is the oating IP address. Warning After completing this page, click the ACCEPT button to make sure you do not lose any changes when selecting a new panel. 13.5. REDUNDANCY The REDUNDANCY panel allows you to congure of the backup LVS router node and set various heartbeat monitoring options. Tip The rst time you visit this screen, it displays an "inactive" Backup status and an ENABLE button. To congure the backup LVS router, click on the ENABLE button so that the screen matches Figure 13-4. 138 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool Figure 13-4. The REDUNDANCY Panel Redundant server public IP Enter the public real IP address for the backup LVS router node. Redundant server private IP Enter the backup node's private real IP address in this text eld. If you do not see the eld called Redundant server private IP, go back to the GLOBAL SETTINGS panel and enter a Primary server private IP address and click ACCEPT. The rest of the panel is devoted to conguring the heartbeat channel, which is used by the backup node to monitor the primary node for failure. Heartbeat Interval (seconds) This eld sets the number of seconds between heartbeats ? the interval that the backup node will check the functional status of the primary LVS node. Assume dead after (seconds) If the primary LVS node does not respond after this number of seconds, then the backup LVS router node will initiate failover. Heartbeat runs on port This eld sets the port at which the heartbeat communicates with the primary LVS node. The default is set to 539 if this eld is left blank. Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 139 Warning Remember to click the ACCEPT button after making any changes in this panel to make sure you do not lose any changes when selecting a new panel. 13.6. VIRTUAL SERVERS The VIRTUAL SERVERS panel displays information for each currently dened virtual server. Each table entry shows the status of the virtual server, the server name, the virtual IP assigned to the server, the netmask of the virtual IP, the port number to which the service communicates, the protocol used, and the virtual device interface. Figure 13-5. The VIRTUAL SERVERS Panel Each server displayed in the VIRTUAL SERVERS panel can be congured on subsequent screens or subsections. To add a service, click the ADD button. To remove a service, select it by clicking the radio button next to the virtual server and click the DELETE button. To enable or disable a virtual server in the table click its radio button and click the (DE)ACTIVATE button. After adding a virtual server, you can congure it by clicking the radio button to its left and clicking the EDIT button to display the VIRTUAL SERVER subsection. 140 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 13.6.1. The VIRTUAL SERVER Subsection The VIRTUAL SERVER subsection panel shown in Figure 13-6 allows you to congure an individual virtual server. Links to subsections related specically to this virtual server are located along the top of the page. But before conguring any of the subsections related to this virtual server, complete this page and click on the ACCEPT button. Figure 13-6. The VIRTUAL SERVERS Subsection Name Enter a descriptive name to identify the virtual server. This name is not the hostname for the machine, so make it descriptive and easily identiable. You can even reference the protocol used by the virtual server, such as HTTP. Application port Enter the port number through which the service application will listen. Since this example is for HTTP services, port 80 is used. Protocol Choose between UDP and TCP in the drop-down menu. Web servers typically communicate via the TCP protocol, so this is selected in the example above. Virtual IP Address Enter the virtual server's oating IP address in this text eld. Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 141 Virtual IP Network Mask Set the netmask for this virtual server with the drop-down menu. Firewall Mark Do not enter a rewall mark integer value in this eld unless you are bundling multi-port protocols or creating a multi-port virtual server for separate, but related protocols. In this example, the above virtual server has a Firewall Mark of 80 because we are bundling connections to HTTP on port 80 and to HTTPS on port 443 using the rewall mark value of 80. When combined with persistence, this technique will ensure users accessing both insecure and secure webpages are routed to the same real server, preserving state. Warning Entering a rewall mark in this eld allows IPVS to recognize that packets bearing this rewall mark are treated the same, but you must perform further conguration outside of the Piranha Conguration Tool to actually assign the rewall marks. See Section 12.3 Multi-port Services and LVS Clustering for instructions on creating multi-port services and Section 12.4 FTP In an LVS Cluster for creating a highly available FTP virtual server. Device Enter the name of the network device to which you want the oating IP address dened the Virtual IP Address eld to bind. You should alias the public oating IP address to the Ethernet interface connected to the public network. In this example, the public network is on the eth0 interface, so eth0:1 should be entered as the device name. Re-entry Time Enter an integer value which denes the length of time, in seconds, before the active LVS router attempts to bring a real server back into the cluster after a failure. Service Timeout Enter an integer value which denes the length of time, in seconds, before a real server is considered dead and removed from the cluster. Quiesce server When the Quiesce server radio button is selected, anytime a new real server node comes online, the least-connections table is reset to zero so the active LVS router routes requests as if all the real servers were freshly added to the cluster. This option prevents the a new server from becoming bogged down with a high number of connections upon entering the cluster. Load monitoring tool The LVS router can monitor the load on the various real servers by using either rup or ruptime. If you select rup from the drop-down menu, each real server must run the rstatd service. If you select ruptime, each real server must run the rwhod service. Caution Load monitoring is not the same as load balancing and can result in hard to predict scheduling behavior when combined with weighted scheduling algorithms. Also, if you use load monitoring, the real servers in the cluster must be Linux machines. 142 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool Scheduling Select your preferred scheduling algorithm from the drop-down menu. The default is Weighted least-connection. For more information on scheduling algorithms, see Section 10.3.1 Scheduling Algorithms. Persistence If an administrator needs persistent connections to the virtual server during client transactions, enter the number of seconds of inactivity allowed to lapse before a connection times out in this text eld. Important If you entered a value in the Firewall Mark eld above, you should enter a value for persistence as well. Also, be sure that if you use rewall marks and persistence together, that the amount of persistence is the same for each virtual server with the rewall mark. For more on persistence and rewall marks, refer to Section 10.5 Persistence and Firewall Marks. Persistence Network Mask To limit persistence to particular subnet, select the appropriate network mask from the drop-down menu. Note Before the advent of rewall marks, persistence limited by subnet was a crude way of bundling connections. Now, it is best to use persistence in relation to rewall marks to achieve the same result. Warning Remember to click the ACCEPT button after making any changes in this panel. To make sure you do not lose changes when selecting a new panel. 13.6.2. REAL SERVER Subsection Clicking on the REAL SERVER subsection link at the top of the panel displays the EDIT REAL SERVER subsection. It displays the status of the physical server hosts for a particular virtual service. Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 143 Figure 13-7. The REAL SERVER Subsection Click the ADD button to add a new server. To delete an existing server, select the radio button beside it and click the DELETE button. Click the EDIT button to load the EDIT REAL SERVER panel, as seen in Figure 13-8. 144 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool Figure 13-8. The REAL SERVER Conguration Panel This panel consists of three entry elds: Name A descriptive name for the real server. Tip This name is not the hostname for the machine, so make it descriptive and easily identiable. Address The real server's IP address. Since the listening port is already specied for the associated virtual server, do not add a port number. Weight An integer value indicating this host's capacity relative to that of other hosts in the pool. The value can be arbitrary, but treat it as a ratio in relation to other real servers in the cluster. For more on server weight, see Section 10.3.2 Server Weight and Scheduling. Warning Remember to click the ACCEPT button after making any changes in this panel. To make sure you do not lose any changes when selecting a new panel. Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 145 13.6.3. EDIT MONITORING SCRIPTS Subsection Click on the MONITORING SCRIPTS link at the top of the page. The EDIT MONITORING SCRIPTS subsection allows the administrator to specify a send/expect string sequence to verify that the service for the virtual server is functional on each real server. It is also the place where the administrator can specify customized scripts to check services requiring dynamically changing data. Figure 13-9. The EDIT MONITORING SCRIPTS Subsection Sending Program For more advanced service verication, you can use this eld to specify the path to a servicechecking script. This functionality is especially helpful for services that require dynamically changing data, such as HTTPS or SSL. To use this functionality, you must write a script that returns a textual response, set it to be executable, and type the path to it in the Sending Program eld. Tip To ensure that each server in the real server pool is checked, use the special token %h after the path to the script in the Sending Program eld. This token is replaced with each real server's IP address as the script is called by the nanny daemon. 146 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool The following is a sample script to use as a guide when composing an external service-checking script: #!/bin/sh TEST=`dig -t soa example.com @$1 | grep -c dns.example.com if [ $TEST != "1" ]; then echo "OK else echo "FAIL" fi Note If an external program is entered in the Sending Program eld, then the Send eld is ignored. Send Enter a string for the nanny daemon to send to each real server in this eld. By default the send eld is completed for HTTP. You can alter this value depending on your needs. If you leave this eld blank, the nanny daemon attempts to open the port and assume the service is running if it succeeds. Only one send sequence is allowed in this eld, and it can only contain printable, ASCII characters as well as the following escape characters: ? \n for new line. ? \r for carriage return. ? \t for tab. ? \ to escape the next character which follows it. Expect Enter a the textual response the server should return if it is functioning properly. If you wrote your own sending program, enter the response you told it to send if it was successful. Tip To determine what to send for a given service, you can open a telnet connection to the port on a real server and see what is returned. For instance, FTP reports 220 upon connecting, so could enter quit in the Send eld and 220 in the Expect eld. Warning Remember to click the ACCEPT button after making any changes in this panel. To make sure you do not lose any changes when selecting a new panel. Once you have congured virtual servers using the Piranha Conguration Tool, you must copy specic conguration les to the backup LVS router. See Section 13.7 Synchronizing Conguration Files for details. Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool 147 13.7. Synchronizing Conguration Files After conguring the primary LVS router, there are several conguration les that must be copied to the backup LVS router before you start the cluster. These les include: ? /etc/sysconfig/ha/lvs.cf? the conguration le for the LVS routers. ? /etc/sysctl?the conguration le that, among other things, turns on packet forwarding in the kernel. ? /etc/sysconfig/iptables? If you are using rewall marks, you should synchronize one of these les based on which network packet lter you are using. Important The /etc/sysctl.conf and /etc/sysconfig/iptables les do not change when you congure the cluster using the Piranha Conguration Tool. 13.7.1. Synchronizing lvs.cf Anytime the LVS conguration le, /etc/sysconfig/ha/lvs.cf, is created or updated, you must copy it to the backup LVS router node. Warning Both the active and backup LVS router nodes must have identical lvs.cf les. Mismatched LVS conguration les between the LVS router nodes can prevent failover. The best way to do this is to use the scp command. Important To use scp the sshd must be running on the backup router, see Section 11.1 Conguring Services on the LVS Routers for details on how to properly congure the necessary services on the LVS routers. Issue the following command as the root user from the primary LVS router to sync the lvs.cf les between the router nodes: scp /etc/sysconfig/ha/lvs.cf n.n.n.n:/etc/sysconfig/ha/lvs.cf In the above command, replace n.n.n.n with the real IP address of the backup LVS router. 13.7.2. Synchronizing sysctl The sysctl le is only modied once in most situations. This le is read at boot time and tells the kernel to turn on packet forwarding. 148 Chapter 13. Conguring the LVS Routers with Piranha Conguration Tool Important If you are not sure whether or not packet forwarding is enabled in the kernel, see Section 11.5 Turning on Packet Forwarding for instructions on how to check and, if necessary, enable this key functionality. 13.7.3. Synchronizing Network Packet Filtering Rules If you are using iptables, you will need to synchronize the appropriate conguration le on the backup LVS router. If you alter the any network packet lter rules, enter the following command as root from the primary LVS router: scp /etc/sysconfig/iptables n.n.n.n:/etc/sysconfig/ In the above command, replace n.n.n.n with the real IP address of the backup LVS router. Next either open an ssh session to the backup router or log into the machine as root and type the following command: /sbin/service iptables restart Once you have copied these les over to the backup router and started the appropriate services (see Section 11.1 Conguring Services on the LVS Routers for more on this topic) you are ready to start the cluster. 13.8. Starting the Cluster To start the LVS cluster, it is best to have two root terminals open simultaneously or two simultaneous root open ssh sessions to the primary LVS router. In one terminal, watch the kernel log messages with the command: tail -f /var/log/messages Then start the cluster by typing the following command into the other terminal: /sbin/service pulse start Follow the progress of the pulse service's startup in the terminal with the kernel log messages. When you see the following output, the pulse daemon has started properly: gratuitous lvs arps finished To stop watching /var/log/messages, type [Ctrl]-[c]. From this point on, the primary LVS router is also the active LVS router. While you can make requests to the cluster at this point, you should start the backup LVS router before putting the cluster into service. To do this, simply repeat the process described above on the backup LVS router node. After completing this nal step, the cluster will be up and running. III. Appendixes This section is licensed under the GNU Free Documentation License. For details refer to the Copyright page. Table of Contents A. Using Red Hat Cluster Manager with Piranha....................................................................... 151 B. Using Red Hat GFS with Red Hat Cluster Suite..................................................................... 153 C. The GFS Setup Druid ................................................................................................................ 157 D. Supplementary Hardware Information ................................................................................... 163 E. Supplementary Software Information...................................................................................... 169 F. Cluster Command-line Utilities................................................................................................. 179 Appendix A. Using Red Hat Cluster Manager with Piranha A cluster can be used in conjunction with the Piranha load-balancing features to deploy a highly available e-commerce site that has complete data integrity and application availability, in addition to load balancing capabilities. Figure A-1 shows how you could use Red Hat Cluster Manager with Piranha. The gure shows a cluster with a three-tier architecture, where the top tier consists of Piranha load-balancing systems to distribute Web requests. The second tier consists of a set of Web servers to serve the requests. The third tier consists of a cluster to serve data to the Web servers. Figure A-1. Cluster in an LVS Environment In a Piranha conguration, client systems issue requests on theWorldWideWeb. For security reasons, these requests enter a Web site through a rewall, which can be a Linux system serving in that capacity or a dedicated rewall device. For redundancy, you can congure rewall devices in a failover conguration. Behind the rewall are Piranha load-balancing systems, which can be congured in an active-standby mode. The active load-balancing system forwards the requests to a set of Web servers. Each Web server can independently process an HTTP request from a client and send the response back to the client. Piranha enables an administrator to expand a Web site's capacity by adding Web servers to the load-balancing systems' set of active Web servers. In addition, if a Web server fails, it can be removed from the set. This Piranha conguration is particularly suitable if the Web servers serve only static Web content, which consists of small amounts of infrequently changing data, such as corporate logos, that can be easily duplicated on the Web servers. However, this conguration is not suitable if the Web servers serve dynamic content, which consists of information that changes frequently. Dynamic content could 152 Appendix A. Using Red Hat Cluster Manager with Piranha include a product inventory, purchase orders, or customer database, which must be consistent on all the Web servers to ensure that customers have access to up-to-date and accurate information. To serve dynamic Web content in a Piranha conguration, add a cluster behind the Web servers, as shown in the previous gure. This combination of Piranha and Red Hat Cluster Manager allows for the conguration of a high-integrity, no-single-point-of-failure e-commerce site. The cluster can run a highly-available instance of a database or a set of databases that are network-accessible to the Web servers. For example, the gure could represent an e-commerce site used for online merchandise ordering through a URL. Client requests to the URL pass through the rewall to the active Piranha loadbalancing system, which then forwards the requests to one of the three Web servers. The Red Hat Cluster Manager systems serve dynamic data to the Web servers, which forward the data to the requesting client system. Appendix B. Using Red Hat GFS with Red Hat Cluster Suite This appendix provides information about considerations to take when running Red Hat GFS 6.0 with Red Hat Cluster Suite and consists of the following sections: ? Section B.1 Terminology ? Section B.2 Changes to Red Hat Cluster ? Section B.3 Installation Scenarios B.1. Terminology You may have encountered new terms associated with Red Hat Cluster Suite. The following list provides a brief description of terms used with Red Hat GFS and Red Hat Cluster Suite: GFS Setup Druid This application is a Red Hat Cluster GUI for initial conguration of Red Hat GFS. The GUI is launched separately from the Red Hat Cluster GUI, the Cluster Conguration Tool. The GFS Setup Druid uses /etc/cluster.xml as input. If /etc/cluster.xml does not exist, the GFS Setup Druid displays a message and exits. Note You must run the Cluster Conguration Tool before running the GFS Setup Druid; the Cluster Conguration Tool creates /etc/cluster.xml. To run the GFS Setup Druid, enter the following at the command line: # redhat-config-gfscluster gulm-bridge This is a fence method available for Red Hat Cluster nodes, if and only if the Red Hat GFS RPM is installed on the node that the Cluster Conguration Tool runs on. The gulm-bridge fence method has been added to Red Hat Cluster Suite specically for the Red Hat Enterprise Linux 3 Update 3 release. Using this fence method on a Red Hat Cluster Manager member prevents it from being fenced twice. Red Hat Cluster Red Hat Cluster Manager is part of the Red Hat Cluster Suite. It provides cluster administration functionality for Red Hat Enterprise Linux 3. Red Hat Cluster Manager contains two major components: ? Red Hat Cluster Manager ? The underlying software (non-GUI) that performs Red Hat Cluster administrations services. ? Cluster Conguration Tool ? This component is the graphical user interface (GUI) for Red Hat Cluster Manager. The GUI provides a conguration interface and a status monitor for members and services in a Red Hat Cluster Manager system. The Cluster Conguration Tool accepts conguration data from a user and writes it to the /etc/cluster.xml le. The Red Hat Cluster Manager reads the conguration data from the /etc/cluster.xml le. 154 Appendix B. Using Red Hat GFS with Red Hat Cluster Suite Also, the Cluster Conguration Tool wraps several command line calls into the Red Hat Cluster Manager, such as starting and stopping services. B.2. Changes to Red Hat Cluster The following changes to Red Hat Cluster enable running it with Red Hat GFS in RHEL-U3: ? The Cluster Conguration Tool has been changed. After entering members in the conguration section of the application, if a member is highlighted, and you click Add Child, a dialog box is displayed, offering fence method options. You can select a fence method by clicking a radio button next to the fence method in the dialog box. Earlier Red Hat Cluster releases provided only two fence method options (under Power Controller Type): Serial and Network. For Red Hat Enterprise Linux 3 Update 3, if Red Hat GFS is installed on the node, then a third fence-method option, GULM-STONITH (the gulm-bridge fence method), is available. ? The Red Hat Cluster Manager now provides support for GULM-STONITH, the gulm-bridge fence method. ? A druid application, the GFS Setup Druid, provides for conguring an initial instance of Red Hat GFS by writing the three Red Hat GFS conguration les: cluster.ccs, nodes.ccs, and fence.ccs. The GFS Setup Druid requires an /etc/cluster.xml le when started. B.3. Installation Scenarios When running Red Hat GFS with Red Hat Cluster Manager, you must take into account certain considerations, according to the following circumstances: ? New installations of Red Hat GFS and Red Hat Cluster Manager ? Adding Red Hat GFS to an existing Red Hat Cluster Manager deployment ? Upgrading Red Hat GFS 5.2.1 to Red Hat GFS 6.0 B.3.1. New Installations of Red Hat GFS and Red Hat Cluster Manager When installing Red Hat GFS and Red Hat Cluster Manager for the rst time into a cluster, install and congure Red Hat Cluster Suite before installing and conguring Red Hat GFS.With the Cluster Conguration Tool, you can congure up to 16 nodes ? the maximum number of nodes allowed in Red Hat Cluster Manager system. You can add services and failover domains (and other functions) after initially conguring Red Hat GFS with the GFS Setup Druid. Note The only conguration items in Red Hat Cluster that Red Hat GFS or the GFS Setup Druid depend on are setting up Red Hat Cluster Manager members and specifying fence devices. Appendix B. Using Red Hat GFS with Red Hat Cluster Suite 155 B.3.2. Adding Red Hat GFS to an Existing Red Hat Cluster Manager Deployment Adding Red Hat GFS to an existing Red Hat Cluster Manager deployment requires running the Red Hat GFS druid application, GFS Setup Druid (also known as redhat-cong-gfscluster). As with the scenario in Section B.3.1 New Installations of Red Hat GFS and Red Hat Cluster Manager, while Red Hat GFS is scalable up to 300 nodes, a Red Hat Cluster Manager limits the total number of nodes in a cluster to 16. Therefore, in this scenario, Red Hat GFS scalability is limited. If the 16-node limit is too small for your deployment, you may want to consider using multiple Red Hat Cluster Manager clusters. 156 Appendix B. Using Red Hat GFS with Red Hat Cluster Suite Appendix C. The GFS Setup Druid The Red Hat Cluster Suite includes a graphical tool for administrators who are new to Red Hat GFS conguration. The GFS Setup Druid allows administrators to quickly migrate existing Red Hat Cluster Manager cluster systems to Red Hat GFS. The GFS Setup Druid integrates any previously con- gured cluster nodes and fencing devices from the existing /etc/cluster.xml conguration le. Administrators can then congure any new nodes and fence devices, as well as associate fence devices to nodes for proper power control. Note For more detailed information about installation, conguration, and administration of Red Hat GFS, refer to the Red Hat GFS Administrator's Guide. The GFS Setup Druid can be started by typing redhat-config-gfscluster as root from a shell prompt. C.1. Cluster Name After reading the introductory screen describing the functionality of the GFS Setup Druid, click Forward to continue. You are then prompted to type a name for the GFS cluster. You can enter any name that does not contain spaces. (such as Development1 or Engineering_Cluster). Click Forward to continue. Figure C-1. Cluster Name 158 Appendix C. The GFS Setup Druid C.2. LOCK_GULM parameters After naming the cluster, you must then congure the GFS LOCK_GULM parameters. The Grand Unied Lock Manager (GULM) is a centralized server-based locking and cluster state management for GFS le systems. Multiple lock servers can be congured for failover redundancy in the event of server failure. Figure C-2 shows the LOCK_GULM conguration screen. Figure C-2. Conguring LOCK_GULM parameters Allowed Misses species the number of responses that a node can consecutively miss before the lock server(s) declare the node expired. The default value is 2. Heartbeat Rate sets the interval (in seconds) at which heartbeats are checked by the lock server(s). Two-thirds of this interval is the rate at which nodes send heartbeat messages to the lock server(s). The default rate is set at 15 seconds. After conguring the LOCK_GULM parameters, click Forward to continue. C.3. Choose Location for CCS Files The next screen prompts you to type or browse for a directory that stores the GFS cluster conguration les created by using the GFS Setup Druid. By default, the /tmp/ directory stores the three conguration les: cluster.ccs, nodes.ccs, and fence.ccs. When a directory is chosen, click Forward to continue. Appendix C. The GFS Setup Druid 159 Figure C-3. Location to Write GFS Conguration Files C.4. Cluster Members The next screen allows you to add, delete, and congure cluster nodes. You can also congure fence devices, which are hardware- or software-based devices used to perform shutdown or reboot of nodes that are expired or that need to be serviced. Figure C-4 shows the cluster and fence device conguration screen. Figure C-4. Congure Cluster Members 160 Appendix C. The GFS Setup Druid C.4.1. Adding a New Member Click the Add a New Member Node to create a new cluster member. A window similar to Figure C-5 appears that prompts you to add the Host Name, IP Address, and Network Device of the member node. Once these values are entered, click Update to add the member to the system. Figure C-5. Add a New Cluster Member The added node appears within the Member Nodes and Fence Instances section. You can congure any member node by clicking the Edit Properties button. Clicking Delete Entry deletes the highlighted member node. C.4.2. Fence Devices To create an entry for a fence device, click the Add a Fence Device button in the Fence Device Specications section. A window appears similar to Figure C-6. Figure C-6. Add a Fence Device Enter a name (do not use spaces) in the Fence Device Name eld. Then, choose the type of fence device using the Agent Type drop down menu. Depending on the fence device chosen, enter any required parameters. Available parameters include: ? IP Address ? The address of the device if it is a network power switch or Fiber Channel switch. ? Login ? The username that is entered to access the device. ? Password ? The password used to authenticate the username that accesses the device. ? Server ? The server name of the device (required when using the Global Network Block Device (GNBD) as a fencing method. ? Hostname ? The hostname of the device. This name must be addressable to the cluster either by DNS assignment or by static entry in each nodes /etc/hosts le. Appendix C. The GFS Setup Druid 161 Note For more information about the types of fence devices in the Agent Type drop down menu, refer to the chapter titled "Creating the Cluster Conguration System Files" in the Red Hat GFS Administrator's Guide. Once a fence device is added, you can recongure the entry by clicking the Edit Properties button. To delete a device, highlight its entry and click the Delete Entry button. Finally, you can associate a fence device to a cluster member by highlighting the entry for the particular member and clicking the Add a Fence to a Node button in the Member Nodes and Fence Instances section. A pop up window appears allowing you to choose any congured fence device from a drop-down menu and associate to the chosen node. Depending on the fence device chosen, you may have to enter additional fence device specications, such as the Port to which the member node is connected on the fence device. Once congured, click Update and the GFS Setup Druid shows the member node with the associated fence device(s) below it. Note You cannot delete a fence device listed in the Fence Device Specications section if it is associated with a cluster node. You must rst delete the fence instance(s) in the Member Nodes and Fence Instances section for that fence device before you delete the entry in the Fence Device Specications section. C.5. Saving Your Conguration and Next Steps When you are satised with your entries, click Forward to continue to the nal screen, which prompts you to click Apply and save the changes to the cluster conguration les. A pop up window appears allowing you to write the les or cancel the save and go back and change the settings. Important Once you have written the conguration les, you cannot change them again using the GFS Setup Druid. You must either restart the conguration from the beginning or edit each cluster conguration le manually using a text editor such as vi or emacs. Once the les are saved, you can create the cluster conguration archive (CCA), which is a binary accessible to the cluster via shared device (also called a pool volume). Create the CCA by running the ccs_tool at a shell prompt as root. The usage of ccs_tool for creating the CCA is as follows: ccs_tool create /directory CCADevice For example, if your cluster conguration les are located in /tmp, and you created a shared device named /dev/pool/devel_cca using the GFS Pool Volume Manager (which allows you to congure pools of physical storage into logical volumes accessible to cluster nodes as if they were local storage), run the following command: 162 Appendix C. The GFS Setup Druid ccs_tool create /tmp/ /dev/pool/devel_cca This chapter briey explains the process of conguring an existing Red Hat Cluster Manager cluster to leverage Red Hat GFS using the GFS Setup Druid. However, for complete instructions on how to properly install, setup, run, and congure Red Hat GFS, refer to the Red Hat GFS Administrator's Guide. Appendix D. Supplementary Hardware Information The following sections provide additional information about conguring the hardware used in a cluster system. D.1. Setting Up Power Controllers This section discusses power controllers. For more information about power controllers and their role in a cluster environment, refer to Section 2.1.3 Choosing the Type of Power Controller. D.1.1. Power Switches For a list of serial-attached and network-attached power switches tested with and/or supported by Red Hat, Inc. for cluster power control, refer to the Red Hat Hardware Compatibility List located at the following URL: http://hardware.redhat.com/hcl/ D.1.2. Setting up Watchdog Power Switches A description of the usage model for watchdog timers as a cluster data integrity provision appears in Section 2.1.3 Choosing the Type of Power Controller. As described in that section, there are two variants of watchdog timers: hardware-based and software-based. This section details the conguration tasks required to setup watchdog timer usage in a cluster hardware conguration. Regardless of which type of watchdog timer is employed, it is necessary to create the device special le appropriate for the watchdog timer. This can be accomplished with the following commands: cd /dev ./MAKEDEV watchdog When using the Cluster Conguration Tool, each new member added to the cluster has software watchdog functionality enabled by default. D.1.2.1. Conguring the Software Watchdog Timer Any cluster system can utilize the software watchdog timer as a data integrity provision, as no dedicated hardware components are required. The cluster software automatically loads the corresponding loadable kernel module called softdog. If the cluster is congured to utilize the software watchdog timer, the cluster membership daemon (clumembd) periodically resets the timer interval. Should clumembd fail to reset the timer, the failed cluster member reboots itself. When using the software watchdog timer, there is a small risk that the system hangs in such a way that the software watchdog thread is not executed. In this unlikely scenario, the other cluster member may takeover services of the apparently hung cluster member. Generally, this is a safe operation; but in the unlikely event that the hung cluster member resumes, data corruption could occur. To further lessen the chance of this vulnerability occurring when using the software watchdog timer, administrators should also congure the NMI watchdog timer as well as an external power switch (if available). 164 Appendix D. Supplementary Hardware Information D.1.2.2. Enabling the NMI Watchdog Timer If you are using the software watchdog timer as a data integrity provision, it is also recommended to enable the Non-Maskable Interrupt (NMI) watchdog timer to enhance the data integrity guarantees. The NMI watchdog timer is a different mechanism for causing the system to reboot in the event of a hang scenario where interrupts are blocked. This NMI watchdog can be used in conjunction with the software watchdog timer. Unlike the software watchdog timer which is reset by the cluster quorum daemon (cluquorumd), the NMI watchdog timer counts system interrupts. Normally, a healthy system receives hundreds of device and timer interrupts per second. If there are no interrupts in a 5 second interval, a system hang has occurred and the NMI watchdog timer expires, initiating a system reboot. A robust data integrity solution can be implemented by combining the health monitoring of the the cluster quorum daemon with the software watchdog timer along with the low-level system status checks of the NMI watchdog. Correct operation of the NMI watchdog timer mechanism requires that the cluster members contain an APIC chip on the main system board. The NMI watchdog is enabled on supported systems by adding nmi_watchdog=1 to the kernel's command line. Here is an example /etc/grub.conf: Note The following GRUB and LILO bootloader congurations only apply to the x86 architecture of Red Hat Enterprise Linux. # grub.conf default=0 timeout=10 splashimage=(hd0,0)/grub/splash.xpm.gz title HA Test Kernel (2.4.9-10smp) root (hd0,0) # This is the kernel's command line. kernel /vmlinuz-2.4.9-10smp ro root=/dev/hda2 nmi_watchdog=1 # end of grub.conf On systems using LILO, add "nmi_watchdog=1" to the append section in /etc/lilo.conf. For example: # lilo.conf prompt timeout=50 default=linux boot=/dev/hda map=/boot/map install=/boot/boot.b lba32 image=/boot/vmlinuz-2.4.9-10smp label=linux read-only root=/dev/hda2 append="nmi_watchdog=1" # end of lilo.conf Appendix D. Supplementary Hardware Information 165 Run /sbin/lilo after editing /etc/lilo.conf for the changes to take effect. To determine if a server supports the NMI watchdog timer, rst try adding "nmi_watchdog=1" to the kernel command line as described above. After the system has booted, log in as root and type: cat /proc/interrupts The output should appear similar to the following: CPU0 0: 5623100 XT-PIC timer 1: 13 XT-PIC keyboard 2: 0 XT-PIC cascade 7: 0 XT-PIC usb-ohci 8: 1 XT-PIC rtc 9: 794332 XT-PIC aic7xxx, aic7xxx 10: 569498 XT-PIC eth0 12: 24 XT-PIC PS/2 Mouse 14: 0 XT-PIC ide0 NMI: 5620998 LOC: 5623358 ERR: 0 MIS: 0 The relevant portion of the above output is to verify that the NMI id appears on the left side. If NMI value has a value larger than zero (0), the server supports the NMI watchdog. If this approach fails, that is, NMI is zero, try passing nmi_watchdog=2 to the kernel instead of nmi_watchdog=1 in the manner described previously. Again, check /proc/interrupts after the system boots. If NMI has a value larger than zero, the NMI watchdog has been congured properly. If NMI is zero, your system does not support the NMI watchdog timer. D.1.2.3. Conguring a Hardware Watchdog Timer The kernel provides driver support for various types of hardware watchdog timers. Some of these timers are implemented directly on the system board; whereas, others are separate hardware components such as PCI cards. Hardware-based watchdog timers provide excellent data integrity provisions in the cluster because they operate independently of the system processor and are therefore fully capable of rebooting a system in the event of a system hang. Due to a lack of uniformity among low-level hardware watchdog components, it is difcult to make generalizations describing how to know if a particular system contains such components. Many lowlevel hardware watchdog components are not self-identifying. When conguring any of the supported watchdog timers supported by the kernel, it is necessary to place a corresponding entry into the /etc/modules.conf le. For example, if an Intel-810 based TCO watchdog timer is to be used, the following line should be added to /etc/modules.conf: alias wdt i810-tco D.2. SCSI Bus Conguration Requirements SCSI buses must adhere to a number of conguration requirements to operate correctly. Failure to adhere to these requirements adversely affects cluster operation and resource availability. The following are SCSI bus conguration requirements: 166 Appendix D. Supplementary Hardware Information ? Buses must be terminated at each end. Refer to Section D.3 SCSI Bus Termination for more information. ? Buses must not extend beyond the maximum length restriction for the bus type. Internal cabling must be included in the length of the SCSI bus. Refer to Section D.4 SCSI Bus Length for more information. ? All devices (host bus adapters and disks) on a bus must have unique SCSI identication numbers. Refer to Section D.5 SCSI Identication Numbers for more information. ? The Linux device name for each shared SCSI device must be the same on each cluster system. For example, a device named /dev/sdc on one cluster system must be named /dev/sdc on the other cluster system. One way to ensure that devices are named the same is by using identical hardware for both cluster systems. Use the system's conguration utility to set SCSI identication numbers and enable host bus adapter termination. When the system boots, a message is displayed describing how to start the utility. For example, the utility prompts the user to press [Ctrl]-[A], and follow the prompts to perform a particular task. To set storage enclosure and RAID controller termination, refer to the vendor documentation. Refer to Section D.3 SCSI Bus Termination and Section D.5 SCSI Identication Numbers for more information. D.3. SCSI Bus Termination A SCSI bus is an electrical path between two terminators. A device (host bus adapter, RAID controller, or disk) attaches to a SCSI bus by a short stub, which is an unterminated bus segment that usually must be less than 0.1 meter in length. Buses must have only two terminators located at opposing ends of the bus. Additional terminators, terminators that are not at the ends of the bus, or long stubs cause the bus to operate incorrectly. Termination for a SCSI bus can be provided by the devices connected to the bus or by external terminators, if the internal (onboard) device termination can be disabled. Testing has shown that external termination on HBAs that run at speeds greater than 80MB/second does not work reliably. When disconnecting a device from a single-initiator SCSI bus follow these guidelines: ? Unterminated SCSI cables must not be connected to an operational host bus adapter or storage device. ? Connector pins must not bend or touch an electrical conductor while the SCSI cable is disconnected. ? To disconnect a host bus adapter from a single-initiator bus, rst disconnect the SCSI cable from the RAID controller and then from the adapter. This ensures that the RAID controller is not exposed to any erroneous input. ? Protect connector pins from electrostatic discharge while the SCSI cable is disconnected by wearing a grounded anti-static wrist guard and physically protecting the cable ends from contact with other objects. ? Do not remove a device that is currently participating in any SCSI bus transactions. To enable or disable an adapter's internal termination, use the system BIOS utility. When the system boots, a message is displayed describing how to start the utility. For example, many utilities prompt users to press [Ctrl]-[A]. Follow the prompts for setting the termination. At this point, it is also possible to set the SCSI identication number, as needed, and disable SCSI bus resets. Refer to Section D.5 SCSI Identication Numbers for more information. To set storage enclosure and RAID controller termination, refer to the vendor documentation. Appendix D. Supplementary Hardware Information 167 D.4. SCSI Bus Length A SCSI bus must adhere to length restrictions for the bus type. Buses that do not adhere to these restrictions do not operate properly. The length of a SCSI bus is calculated from one terminated end to the other and must include any cabling that exists inside the system or storage enclosures. A cluster supports LVD (low voltage differential) buses. The maximum length of a single-initiator LVD bus is 25 meters. The maximum length of a multi-initiator LVD bus is 12 meters. According to the SCSI standard, a single-initiator LVD bus is a bus that is connected to only two devices, each within 0.1 meter from a terminator. All other buses are dened as multi-initiator buses. Do not connect any single-ended devices to an LVD bus; doing so converts the bus single-ended, which has a much shorter maximum length than a differential bus. D.5. SCSI Identication Numbers Each device on a SCSI bus must have a unique SCSI identication number. Devices include host bus adapters, RAID controllers, and disks. The number of devices on a SCSI bus depends on the data path for the bus. A cluster supports wide SCSI buses, which have a 16-bit data path and support a maximum of 16 devices. Therefore, there are sixteen possible SCSI identication numbers that can be assigned to the devices on a bus. In addition, SCSI identication numbers are prioritized. Use the following priority order to assign SCSI identication numbers: 7 - 6 - 5 - 4 - 3 - 2 - 1 - 0 - 15 - 14 - 13 - 12 - 11 - 10 - 9 - 8 The previous order species that 7 is the highest priority, and 8 is the lowest priority. The default SCSI identication number for a host bus adapter is 7, because adapters are usually assigned the highest priority. It is possible to assign identication numbers for logical units in a RAID subsystem by using the RAID management interface. To modify an adapter's SCSI identication number, use the system BIOS utility. When the system boots, a message is displayed describing how to start the utility. For example, a user may be prompted to press [Ctrl]-[A] and follow the prompts for setting the SCSI identication number. At this point, it is possible to enable or disable the adapter's internal termination, as needed, and disable SCSI bus resets. Refer to Section D.3 SCSI Bus Termination for more information. The prioritized arbitration scheme on a SCSI bus can result in low-priority devices being locked out for some period of time. This may cause commands to time out, if a low-priority storage device, such as a disk, is unable to win arbitration and complete a command that a host has queued to it. For some workloads, it is possible to avoid this problem by assigning low-priority SCSI identication numbers to the host bus adapters. 168 Appendix D. Supplementary Hardware Information Appendix E. Supplementary Software Information The information in the following sections can assist in the management of the cluster software con- guration. E.1. Cluster Communication Mechanisms A cluster uses several intra-cluster communication mechanisms to ensure data integrity and correct cluster behavior when a failure occurs. The cluster uses these mechanisms to: ? Control when a system can become a cluster member ? Determine the state of the cluster systems ? Control the behavior of the cluster when a failure occurs The cluster communication mechanisms are as follows: ? Shared (quorum) partitions Periodically, each cluster system writes a time-stamp and system status to the primary and shadow shared partitions, which are raw partitions located on shared storage. Each member reads the system status and time-stamp that were written by the other members and determines if they are up to date. The members attempt to read the information from the primary shared partition. If this partition is corrupted, the members read the information from the shadow shared partition and simultaneously repair the primary partition. Data consistency is maintained through checksums and any inconsistencies between the partitions are automatically corrected. If a member reboots but cannot write to both shared partitions, the system is not allowed to join the cluster. In addition, if an existing member can no longer write to both partitions, it removes itself from the cluster by shutting down. Shared partitions are only used as a communication mechanism in two-member clusters that have network tie-breaker disabled. ? Remote power switch monitoring Periodically, each member monitors the health of the remote power switch connection, if any. The member uses this information to help determine the status of the other cluster members. The complete failure of the power switch communication mechanism does not automatically result in a failover. If a power switch fails to power-cycle a hung system, no failover is performed as the cluster infrastructure cannot guarantee the member's present state. ? Ethernet heartbeats The members are connected together by using point-to-point Ethernet lines. Periodically, each member issues heartbeats (pings) across these lines. The cluster uses this information to help determine the status of the members and to ensure correct cluster operation. The complete failure of the heartbeat communication mechanism does not automatically result in a failover. If a member determines that a time-stamp from another member is not up-to-date, it checks the heartbeat status. If heartbeats to the member are still operating, the cluster software takes no action. If a member does not update its time-stamp after some period of time, and does not respond to heartbeat pings, it is considered down. The cluster remains operational as long as one cluster system can write to the shared partitions, even if all other communication mechanisms fail. 170 Appendix E. Supplementary Software Information Note that shared partition is only used as a back-up in some two-member congurations. The network membership algorithm is the primary determining factor as to which cluster members are active and which are not. A member that is not updating its time-stamp in this conguration never causes a failover unless clumembd reports that the member is down. E.2. Failover and Recovery Scenarios Understanding cluster behavior when signicant events occur can assist in the proper management of a cluster. Note that cluster behavior depends on whether power switches are employed in the con- guration. Power switches enable the cluster to maintain complete data integrity under all failure conditions. The following sections describe how the system responds to various failure and error scenarios. E.3. Common Cluster Behaviors: General Loss of connectivity to a power switch or failure to fence a member Common Causes: Serial power switch disconnected from controlling member. Network power switch disconnected from network. Expected Behavior: Members controlled by the power switch will not be able to be shut down or restarted. In this case, if the member hangs, services will not fail-over from any member controlled by the switch in question. Verication: Run clustat to verify that services are still marked as running on the member, even though it is inactive according to membership. Dissolution of the cluster quorum Common Causes: A majority of cluster members (for example, 3 of 5 members) go ofine Test Case: In a 3 member cluster, stop the cluster software on two members. Expected Behavior: All members which do not have controlling power switches reboot immediately. All services stop immediately and their states are not updated on the shared media (when running clustat, the service status blocks may still display that the service is running). Service managers exit. Cluster locks are lost and become unavailable. Verication: Run clustat on one of the remaining active members. Member loses participatory status in the cluster quorum but is not hung Common Causes: Total loss of connectivity to other members. Test Case: Disconnect all network cables from a cluster member. Expected Behavior: If the member has no controlling power switches, it reboots immediately. Otherwise, it attempts to stop services as quickly as possible. If a quorum exists, the set of members comprising the cluster quorum will fence the member. clumembd crashes Test Case: killall -KILL clumembd Expected Behavior: System reboot. Appendix E. Supplementary Software Information 171 clumembd hangs, watchdog in use Test Case: killall -STOP clumembd Expected Behavior: System reboot may occur if clumembd hangs for a time period greater than (failover_time - 1) seconds. Triggered externally by watchdog timer. clumembd hangs, no watchdog in use Test Case: killall -STOP clumembd Expected Behavior: System reboot may occur if clumembd hangs for a time period greater than (failover_time) seconds. Triggered internally by clumembd. cluquorumd crashes Test Case: killall -KILL cluquorumd Expected Behavior: System reboot. clusvcmgrd crashes Test Case: killall -KILL clusvcmgrd Expected Behavior: cluquorumd re-spawns clusvcmgrd, which runs the stop phase of all services. Services which are stopped are started. Verication: Consult system logs for a warning message from cluquorumd. clulockd crashes Test Case: killall -KILL clulockd Expected Behavior: cluquorumd re-spawns clulockd. Locks may be unavailable (preventing service transitions) for a short period of time. Verication: Consult system logs for a warning message from cluquorumd. Unexpected system reboot without clean shutdown of cluster services Common Causes: Any noted scenario which causes a system reboot. Test Case: reboot -fn; pressing the reset switch. Expected Behavior: If a power switch controls the rebooting member in question, the system will also be fenced (generally, power-cycled) if a cluster quorum exists. Loss of quorum during clean shutdown Test Case: Stop cluster services (service clumanager stop) on all members. Expected Behavior: Any remaining services are stopped uncleanly. Verication: Consult the system logs for warning message. Successful STONITH fencing operation Expected Behavior: Services on member which was fenced are started elsewhere in the cluster, if possible. Verication: Verify that services are, in fact, started after the member is fenced. This should only take a few seconds. Unsuccessful fencing operation on cluster member Common Causes: Power switch returned error status or is not reachable. Test Case: Disconnect power switch controlling a member and run reboot -fn on the member. 172 Appendix E. Supplementary Software Information Expected Behavior: Services on a member which fails to be fenced are not started elsewhere in the cluster. If the member recovers, services on the cluster are restarted. Because there is no way to accurately determine the member's state, it is assumed that it is now still running even if heartbeats have stopped. Thus, all services should be reported as running on the down member. Verication: Run clustat to verify that services are still marked as running on the member, even though it is inactive according to membership. Messages will be logged stating that the member is now in the PANIC state. Error reading from one of the shared partitions Test Case: Run dd to write zeros to the shared partition dd if=/dev/zero of=/dev/raw/raw1 bs=512 count=1 shutil -p /cluster/header Expected Behavior: Event is logged. The data from the good shared partition is copied to the partition which returned errors Verication: A second read pass of the same data should not produce a second error message. Error reading from both of the shared partitions Common Causes: Shared media is either unreachable or both partitions have corruption. Test Case: Unplug SCSI or Fibre Channel cable from a member. Expected Behavior: The event is logged. Congured action is taken to address loss of access to shared storage (reboot/halt/stop/ignore). Default action is to reboot E.4. Common Behaviors: Two Member Cluster with Disk-based Tie-breaker Loss of network connectivity to other member, shared media still accessible Common Causes: Network connectivity lost. Test Case: Disconnect all network cables from a member. Expected Behavior: No fail-over unless disk updates are also lost. Services will not be able to be relocated in most cases, which is due to the fact that the lock server requires network connectivity. Verication: Run clustat to verify that services are still marked as running on the member, even though it is inactive according to membership. Messages are logged stating that the member is now in the PANIC state. Loss of access to shared media Common Causes: Shared media loses power, cable connecting a member to the shared media is disconnected. Test Case: Unplug SCSI or Fibre Channel cable from a member. Expected Behavior: No failover occurs unless network is also lost. Congured action is taken to address loss of access to shared storage (reboot/halt/stop/ignore). Default is reboot. The action may subsequently cause a failover.) System hang or crash (panic) on member X Test Case: Kill the cluquorumd and clumembd daemons. killall -STOP cluquorumd clumembd Appendix E. Supplementary Software Information 173 Expected Behavior: Hung cluster member is fenced by other cluster member. Services fail over. Congured watchdog timers may be triggered. Start of Cluster Services without network connectivity Common Causes: Bad switch; one or both members are without network connectivity Test Case: Stop cluster services on all members. Disconnect all network cables from one member. Start cluster services on both members. Verication: Not all services may start, as locks require network connectivity. Because the Cluster Manager requires a fully connected subnet, this case is handled on a best-effort basis, but is technically an inoperable cluster. E.5. Common Behaviors: 2-4 Member Cluster with IP-based Tie-Breaker Network Partition Common Causes: Network switch problem. Test Case: Connect half of the members to switch A. Connect other half to switch B. Connect switch A to switch B using up-link or crossover cable. Connect device acting as tie-breaker IP address to switch A. Start cluster services. Unplug switch A from switch B. Expected Behavior: All cluster partitions which are comprised of exactly half (1/2 or 2/4) members send ping packets to the tie-breaker IP address. If a reply is received, the partition forms a quorum. In the test case, this means the half connected to switch A will form a quorum. Because the Cluster Manager requires a fully connected subnet, the case where an even-split (or split-brain) occurs when both partitions can reach the tie-breaker IP is not handled. Verication: Run clustat on the members plugged in to switch A. There should be a Cluster Quorum Incarnation number listed at the top of the output. Loss of access to shared media Common Causes: Shared media loses power; cable connecting a member to the shared media is disconnected. Test Case: Unplug SCSI or Fibre Channel cable from a member. Expected Behavior: Congured action is taken to address loss of access to shared storage (reboot/ halt/stop/ignore). Default is reboot. System hang or crash (panic) on cluster member Test Case: Kill the cluquorumd and clumembd daemons. killall -STOP cluquorumd clumembd Expected Behavior: The cluster member is fenced by another member. Services fail over. If a watchdog is in use, it may be triggered. E.6. Common Behaviors: 3-5 Member Cluster Network Partition Common Causes: Network switch problem 174 Appendix E. Supplementary Software Information Test Case: Connect a majority of members to switch A. Connect remaining members to switch B. Connect switch A to switch B using up-link or crossover cable. Start cluster services. Unplug switch A from switch B. Expected Behavior: The partition with a majority of members continues operating, and a new view of the cluster quorum is formed. Members in the minority partition are fenced, and services which were running in the minority partition are started in the majority partition, if possible. In the test case, this means that members connected to switch A will fence members connected to switch B. Verication: Run clustat on one of the members connected to switch A. There should be a Cluster Quorum Incarnation number listed near the top of the output. System hang on cluster member Test Case: Kill the clumembd daemon. killall -STOP clumembd Expected Behavior: The cluster member is fenced by another member. Services fail over. If watchdog timer is congured, it may be triggered. Loss of access to shared media Common Causes: Shared media loses power, cable connecting a member to the shared media is disconnected. Test Case: Unplug SCSI or Fibre Channel cable from a member. Expected Behavior: Congured action is taken to address loss of access to shared storage (reboot/ halt/stop/ignore). Default is reboot. E.7. Common Behaviors: Cluster Service Daemons Service status check fails Common Causes: User script reports error, clurmtabd not running on NFS service, smbd and nmbd not running for a service with a Samba share. Test Case: Create a service with an init script that returns a status output of 1. Expected Behavior: Service restarts on the current owner. Verication: Consult system logs for a service restart event. The restarts eld of the service's status information should be incremented. A member fails to start services Common Causes: User script returns error due to le system errors. Test Case: Create a service with a user script which returns 1 for the start phase only on one member. Attempt to enable the service on this member. Expected Behavior: Service is stopped and started on another member, provided the services stops successfully. Service start fails on all members Common Causes: User script returns error, le system errors. Test Case: Create a service with a user script which returns 1 for the start phase on all members. Expected Behavior: Service is placed into the disabled state. Verication: Run clustat and verify that the service is in the disabled state. Appendix E. Supplementary Software Information 175 Service stop fails on a member Common Causes: User script returns error; le system can not be unmounted. Test Case: Create a service script which returns 1 for the stop phase. Expected Behavior: Service is placed into the failed state. At this point, the administrator must intervene to determine the cause of the failure and the appropriate course of action. The service must then be disabled before it can be enabled. Verication: Run clustat and verify that the service has been placed in to the failed state. E.8. Common Behaviors: Miscellaneous /etc/cluster.xml no longer exists yet cluster is running Test Case: rm -f cluster.xml Expected Behavior: clusvcmgrd recreates /etc/cluster.xml from its cached version. Verication: Check to see that /etc/cluster.xml is present shortly after testing this action. A log message appears in the system logs. clurmtabd crashes Test Case: Kill the clurmtabd daemon. killall -KILL clurmtabd Expected Behavior: Since clurmtabd is only spawned during the start phase of a service with NFS exports, it is only checked by a service which has a Check Interval and NFS exports congured. If clurmtabd is not present (and it is congured to be), the check phase of the service returns an error and the service is restarted. Verication: Consult system logs for a service restart event. The restarts eld of the service's status information should be incremented. E.9. The cluster.xml File The cluster conguration le, /etc/cluster.xml, contains detailed information about the cluster members and services. Do not manually edit the conguration le. Instead, use the Cluster Conguration Tool to modify the cluster conguration. When you run Cluster Conguration Tool, cluster-specic information is entered in a hierarchical XML format. The following is a description of each area of conguration, from daemon and shared storage to cluster members and services. Note that the back slash (\) represents a continuation of a single line. ' ?xml version="1.0"? ' cluconfig version="3( .0" ' clumembd broadcast="no( " interval="500000" loglevel="4" multicast="yes" \ multicast_ipaddress="225.0.0.11" thread="yes" tko_count="20"/ ' cluquorumd loglevel="6" pinginterval="" tiebreaker_ip=""/ ( ' clurmtabd loglevel="4" pollinterval="4"/ ( ' clusvcmgrd loglevel="4"/ ( ' clulockd loglevel="4"/ ( ' cluster config_viewnumb( er="18" key="7a497d303feefeef0f8be9b72697aaed" name="Octane"/ ( The above elds contain versioning information and cluster daemon operation parameters such as logging levels, networking addresses, and more. For more information on conguring cluster daemon parameters, refer to Section 3.6 Conguring Cluster Daemons. 176 Appendix E. Supplementary Software Information ) cluster config_viewnumber="18" key="7a497d303feefeef0f8be9b72697aaed" \ name="Test_cluster"/ ) sharedstate driver="li* bsharedraw.so" rawprimary="/dev/raw/raw1" \ rawshadow="/dev/raw/raw2" type="raw"/ * The elds above dene the cluster quorum and shared cluster conguration parameters. The driver and raw partition information about the shared primary and backup partitions are also specied in these elds. For more information about conguring the shared partitions, refer to Section 3.5 Editing the rawdevices File. ) members ) member* id="0" name="clu1" watchdog="yes" ) powercontroller id="0" ipaddress="192.1* 68.65.51" password="apc" \ port="1:1" type="apcmaster" user="apc"/ ) /member * ) member i* d="1" name="clu2" watchdog="yes" ) powercontroller id="0" ipaddress="192.1* 68.65.52" password="baytech" \ port="1" type="baytech" user="admin"/ ) /member * ) member i* d="2" name="clu3" watchdog="yes" ) powercontroller id="0" ipaddress="192.1* 68.65.53" password="baytech" \ port="2" type="baytech" user="admin"/ ) /member * ) member i* d="3" name="clu4" watchdog="yes" ) powercontroller id="0" ipaddress="192.1* 68.65.54" password="wti" \ port="blue" type="wti_nps" user=""/ ) /member * ) /members* * The elds above dene the cluster and its individual members. Each member eld contains identication and conguration information, including cluster names, addresses, power controllers and types, and authentication details. For more information on conguring cluster members, refer to Section 3.7 Adding and Deleting Members. ) services ) service* checkinterval="0" failoverdomain="None" id="0" name="test" \ userscript="None" ) service_ipa* ddresses/ ) /service * ) service c* heckinterval="0" failoverdomain="foodomain" id="1" name="test2" \ userscript="None" ) service_ipa* ddresses/ ) /service * ) /services * * The elds above dene the services controlled by the cluster system, such as NFS, Samba, and HTTP. The parameters in these elds include service names, failover domain names, service status check intervals, and location of service init scripts (if applicable). For more information about conguring clustered services, refer to Section 3.10 Adding a Service to the Cluster. ) failoverdomains ) failoverdomain* id="0" name="fonfs" ordered="yes" restricted="yes" ) failoverdomainnode id="0" name="clu2"/ * ) failoverdomainnode id="1" name="clu3"/* ) /failoverdomain * ) failoverdomain i* d="1" name="fosamba" ordered="no" restricted="no" ) failoverdomainnode id="0" name="clu1"/ * ) failoverdomainnode id="1" name="clu3"/* ) /failoverdomain * ) /failoverdomains * * Appendix E. Supplementary Software Information 177 The elds above dene the failover domains that control the priority and order in which the cluster members queue in the event of failover. Parameters in these elds include failover domain name, restricted and ordered toggling, and node order by member name. For more information about conguring failover domains for cluster systems, refer to Section 3.9 Conguring a Failover Domain. 178 Appendix E. Supplementary Software Information Appendix F. Cluster Command-line Utilities This appendix provides reference information on the following command-line utilities provided with Red Hat Cluster Suite: ? redhat-config-cluster-cmd?Provides command-line access to the conguration features of the Cluster Conguration Tool utility ? shutil?Checks status of the quorum partitions ? clufence?Tests and controls the connections to network and serial-attached power switches F.1. Using redhat-config-cluster-cmd This section details an example of the redhat-cong-cluster-cmd utility, which allows you to congure all aspects of the cluster, and stores the information in the /etc/cluster.xml le. Usage, options, and examples of using the redhat-config-cluster-cmd command can be found in its man page. To access the manpage from a shell-prompt, type man redhat-config-cluster-cmd. The following describes an example cluster system that is congured using only the redhat-config-cluster-cmd utility. Suppose a system administrator wants to create a cluster system that will serve highly available NFS services to the engineering department of a small organization. The NFS export should only be accessible to the three members of the department (Bob, Jane, and Tom). 1. Add the service and assign it a descriptive name to distinguish its functionality from other services that may run on the cluster. redhat-config-cluster-cmd --add_service --name=nfs_engineers 2. Add a service IP address that will transfer from one member to another in the event of failover: redhat-config-cluster-cmd --service=nfs_engineers --add_service_ipaddress \ --ipaddress=10.0.0.10 3. Add a device to the service (the disk partition that serves as the NFS export): redhat-config-cluster-cmd --service=nfs_engineering --add_device --name=/dev/sdc3 4. Add a mount point for the device (note: the mount point cannot be listed in /etc/fstab): redhat-config-cluster-cmd --service=nfs_engineering --device=/dev/sdc3 --mount \ --mountpoint=/mnt/nfs/engineering/ --fstype=ext3 \ --options=rw,nosuid,sync --forceunmount=yes 5. Add the mounted directory for NFS exporting: redhat-config-cluster-cmd --service=nfs_engineering --device=/dev/sdc3 \ --add_nfs_export --name=/mnt/nfs/engineering 6. Allow Bob to access the clustered NFS export: redhat-config-cluster-cmd --service=nfs_engineering --device=/dev/sdc3 \ --nfsexport=/mnt/nfs/engineering --add_client --name=bob \ --options=rw 7. Repeat step 6 for Jane and Tom. For more information and examples of using redhat-config-cluster-cmd, refer to the man page by typing the following at a shell prompt: 180 Appendix F. Cluster Command-line Utilities man redhat-config-cluster-cmd F.2. Using the shutil Utility Test the quorum partitions and ensure that they are accessible by invoking the shutil utility with the -p option. Running /usr/sbin/shutil -p /cluster/header on all active cluster members should give the same output on each machine. For example: /cluster/header is 144 bytes long SharedStateHeader { ss_magic = 0x39119fcd ss_timestamp = 0x000000003f5d3eea (22:46:02 Sep 08 2003) ss_updateHost = lab.example.com } If the output of the shutil utility with the -p option is not the same on all cluster systems, perform the following: ? Examine the /etc/sysconfig/rawdevices le on each cluster system and ensure that the raw character devices and block devices for the primary and backup quorum partitions have been accurately specied. If they are not the same, edit the le and correct any mistakes. Then re-run the clucong utility. Refer to Section 3.5 Editing the rawdevices File for more information. ? Ensure that you have created the raw devices for the quorum partitions on each cluster system. Refer to Section 2.4.4.3 Conguring Shared Cluster Partitions for more information. ? On each cluster system, examine the system startup messages at the point where the system probes the SCSI subsystem to determine the bus conguration. Verify that both cluster systems identify the same shared storage devices and assign them the same name. ? Verify that a cluster system is not attempting to mount a le system on the quorum partition. For example, make sure that the actual device (for example, /dev/sdb1) is not included in an /etc/fstab le. F.3. Using the clusvcadm Utility The clusvcadm utility provides a command-line user interface that enables an administrator to monitor and manage the cluster systems and services. Use the clusvcadm utility to perform the following tasks: ? Disable and enable services ? Relocate and restart cluster services ? lock and unlock service state The clusvcadm command line options are as follows: -d service Disable a service. -e service Enable a service. Appendix F. Cluster Command-line Utilities 181 -e service -m member Enable a service on a specic member. -l Lock the service states. -r service -m member Relocate a service to a specic member. -q Operate quietly. -R Restart a service. -s Stop a service -u Unlock the service states. -v Displays information about the current version of clusvcadm. Refer to the clusvcadm(8) man page for more information. F.4. Using the clufence Utility If power switches are used in the cluster hardware conguration, run the clufence utility on each cluster system to ensure that it can remotely power-cycle the other cluster members. If the command succeeds, run the shutil -p command on both cluster systems to display a summary of the header data structure for the quorum partitions. If the output is different on the systems, the quorum partitions do not point to the same devices on both systems. Check to make sure that the raw devices exist and are correctly specied in the /etc/sysconfig/rawdevices le. See Section 2.4.4.3 Conguring Shared Cluster Partitions for more information. If either network- or serial-attached power switches are employed in the cluster hardware conguration, install the cluster software and invoke the clufence command to test the power switches. Invoke the command on each cluster system to ensure that it can remotely power-cycle the other cluster members. If testing is successful, then the cluster can be started. The clufence command can accurately test a power switch. The format of the clufence command is as follows: usage: clufence [-d] [-[furs] + member , ] -d Turn on debugging -f + member , Fence (power off) + member -u , + member , Unfence (power on) + member -r , + member , Reboot (power cycle) + member -s , + member , Check status of all switches controlling + member , When testing power switches, the rst step is to ensure that each cluster member can successfully communicate with its attached power switch. The following example of the clufence command output shows that the cluster member is able to communicate with its power switch: 182 Appendix F. Cluster Command-line Utilities [23750] info: STONITH: baytech at 192.168.1.31, port 1 controls clu2 [23750] info: STONITH: baytech at 192.168.1.31, port 2 controls clu3 [23750] info: STONITH: wti_nps at 192.168.1.29, port clu4 controls clu4 [23750] info: STONITH: wti_nps at 192.168.1.29, port clu5 controls clu5 Any errors in the output could be indicative of the following types of problems: ? For serial attached power switches: ? Verify that the device special le for the remote power switch connection serial port (for example, /dev/ttyS0) is specied correctly in the cluster database, as established via Cluster Conguration Tool. If necessary, use a terminal emulation package such as minicom to test if the cluster system can access the serial port. ? Ensure that a non-cluster program (for example, a getty program) is not using the serial port for the remote power switch connection. You can use the lsof command to perform this task. ? Check that the cable connection to the remote power switch is correct. Verify that the correct type of cable is used (for example, an RPS-10 power switch requires a null modem cable), and that all connections are securely fastened. ? Verify that any physical dip switches or rotary switches on the power switch are set properly. ? For network based power switches: ? Verify that the network connection to network-based switches is operational. Most switches have a link light that indicates connectivity. ? It should be possible to ping the network switch; if not, then the switch may not be properly congured for its network parameters. ? Verify that the correct password and login name (depending on switch type) have been specied in the cluster conguration database (as established by running Cluster Conguration Tool). A useful diagnostic approach is to verify Telnet access to the network switch using the same parameters as specied in the cluster conguration. After successfully verifying communication with the switch, attempt to power cycle the other cluster member. Prior to doing this, it is recommended to verify that the other cluster member is not actively performing any important functions (such as serving cluster services to active clients). By executing the following command : clufence -r clu3 The following depicts a successful power cycle operation: Successfully power cycled host clu3. Index Symbols /etc/hosts editing, 18 /etc/syscong/ha/lvs.cf le, 119 /etc/syscong/rawdevices editing the le, 41 A acknowledgments, i active router (See LVS clustering) active-active conguration, 1 Apache HTTP Server httpd.conf, 94 setting up service, 93 availability and data integrity table, 7 B backup router (See LVS clustering) C channel bonding (See Ethernet bonding) chkcong, 121 cluster (See cluster types) administration, 97 checking the conguration, 53, 180 daemons, 42 diagnosing and correcting problems, 102 disabling the cluster software, 102 displaying status, 97 name, changing, 102 using Red Hat Cluster Manager with Piranha, 151 cluster administration, 97 backing up the cluster database, 100 changing the cluster name, 102 diagnosing and correcting problems in a cluster, 102 disabling the cluster software, 102 displaying cluster and service status, 97 modifying cluster event logging, 101 modifying the cluster conguration, 100 restoring the cluster database, 100 starting and stopping the cluster software, 99 updating the cluster software, 101 cluster command-line utilities, 179 cluster communication mechanisms, 169 cluster conguration, 53 minimum example, 7 modifying, 100 reloading, 41 using the shutil utility, 180 Cluster Conguration Tool accessing, 38 and Oracle service, 71 cluster daemons, 42 membership daemon, 42 quorum daemon, 43 service manager daemon, 45 cluster database backing up, 100 restoring, 100 cluster event logging daemons cluhbd, 101 clupowerd, 101 cluquorumd, 101 clusvcmgrd, 101 severity levels, 101 cluster features administration user interface, 2 application monitoring, 2 data integrity assurance, 2 event logging facility, 2 failover domain restricted, 2 unrestricted, 2 manual service relocation capabilities, 2 multiple cluster communication methods, 2 no-single-point-of-failure hardware conguration, 2 service conguration framework, 2 service failover capabilities, 2 status monitoring agent, 2 cluster hardware connecting, 21 power controllers, 10 setting up, 21 cluster hardware tables, 11 cluster member hardware table, 11 cluster overview, 1 cluster service, 1 displaying status, 97 cluster services active-active NFS conguration, 83 administration, 59 Apache HTTP Server, setting up, 93 httpd.conf, 94 conguration, 51, 59 conguring service disk storage, 61 184 deleting a service, 64 disabling a service, 63 displaying a service conguration, 62 enabling a service, 63 gathering service information, 59 handling a service that fails to start, 64 modifying a service, 63 MySQL service, setting up, 73 NFS caveats, 82 NFS client access, 82 NFS Druid, 78 NFS server requirements, 77 NFS service, setting up, 77 Oracle service, setting up, 67 Oracle, tuning, 72 relocating a service, 64 Samba Druid, 87 Samba operating model, 86 Samba server requirements, 85 Samba service, setting up, 85 scripts, creating, 61 smb.conf.sharename le elds, 90 verifying application software and service scripts, 62 cluster software command-line utilities, 179 disabling, 102 installation and conguration, 35 steps for installing and initializing, 35 with rpm, 36 with the Package Management Tool, 35 starting and stopping, 99 steps for installing and initializing, 35 updating, 101 version display, 55 cluster software installation and conguration, 35 cluster systems, 1 cluster types compute-clustering Beowulf, 109 denition of, 109 high-availability clustering, 109 (See Also Red Hat Cluster Manager) denition of, 109 load-balance clustering, 109 (See Also LVS clustering) denition of, 109 overview of, 109 clusvcadm using, 180 common cluster behaviors 2-4 Member cluster, IP tie-breaker, 173 3-5 Member Cluster, 173 miscellaneous, 175 service daemons, 174 two member disk-based tie-breaker, 172 common cluster behaviors table, 170 components of LVS cluster, 118 compute-clustering (See cluster types) conguration Red Hat Enterprise Linux, 17 conguration le reloading, 41 conguring a service, 59 console startup messages displaying, 20 console switch, 9 setting up, 17 console switch hardware table, 15 conventions document, iv D daemons clulockd conguring, 42 clumembd conguring, 42 cluquorumd conguring, 42 clurmtabd conguring, 42 clusvcmgrd conguring, 42 database service, 2 databases MySQL setting up service, 73 Oracle oraclescript example, 68 setting up, 67 startdb script example, 68 stopdb script example, 69 tuning, 72 using Cluster Conguration Tool with, 71 deleting a service, 64 diagnosing and correcting problems in a cluster table, 103 disabling a service, 63 disk storage conguring service disk storage, 61 displaying a service conguration, 62 displaying console startup messages, 20 displaying devices congured in the kernel, 21 185 E enabling a service, 63 Ethernet channel bonding conguring, 22 event logging modifying, 101 syslog conguring, 55 examples minimum cluster conguration, 7 NFS Druid, 78 no single point of failure conguration, 8 oracle script, 68 Samba Druid, 87 startdb script, 68 stopdb script, 69 using Cluster Conguration Tool to add an Oracle service, 71 ext3, 33 F failover and recover scenarios, 170 features, cluster, 2 feedback, vii le services NFS active-active conguration, 83 caveats, 82 client access, 82 NFS Druid, 78 server requirements, 77 setting up service, 77 Samba operating model, 86 Samba Druid, 87 server requirements, 85 setting up, 85 le systems creating, 33 FTP, clustering, 130 (See Also LVS clustering) G GFS cluster archive conguration, 161 conguration with ccs_tool, 161 GFS Setup Druid, 157 and CCS les, 158 cluster name, 157 conguration, 157 conguring cluster members, 159 fence device conguration, 160 GULM settings, 158 member conguration, 160 post-conguration, 161 saving conguration, 161 Grand Unied Lock Manager (See GULM) GULM, 158 conguration with GFS Setup Druid, 158 H handling a service that fails to start, 64 hardware installing basic cluster hardware, 15 hardware conguration availability considerations, 6 choosing a conguration, 5 cost restrictions, 6 data integrity under all failure conditions, 6 minimum, 6 optional hardware, 9 performance considerations, 5 shared storage requirements, 6 hardware information, supplementary, 163 hardware installation operating system conguration, 5 hardware watchdog timer conguring, 165 hardware watchdog timers, 165 high-availability clustering (See cluster types) hot-standby conguration, 1 how to use this manual, iii HTTP services Apache HTTP Server httpd.conf, 94 setting up, 93 I installation Red Hat Enterprise Linux, 17 installing basic cluster hardware, 15 installing the basic cluster hardware, 15 introduction, iii cluster features, 2 cluster overview, 1 how to use this manual, iii other Red Hat Enterprise Linux manuals, iii iptables, 121 ipvsadm program, 119 186 J job scheduling, LVS, 113 K kernel decreasing kernel boot timeout limit, 19 displaying congured devices, 21 Kernel Boot Timeout Limit decreasing, 19 Kimberlite, i KVM (keyboard, video, mouse) switch, 10 L least connections (See job scheduling, LVS) Linux Virtual Server (See LVS clustering) load-balance clustering (See cluster types) low voltage differential (LVD), 167 LVS /etc/syscong/ha/lvs.cf le, 119 components of, 118 daemon, 119 date replication, real servers, 112 denition of, 109 initial conguration, 121 ipvsadm program, 119 job scheduling, 113 lvs daemon, 119 LVS routers conguring services, 121 necessary services, 121 primary node, 121 multi-port services, 128 FTP, 130 nanny daemon, 119 NAT routing enabling, 127 requirements, hardware, 125 requirements, network, 125 requirements, software, 125 overview of, 110, 111 packet forwarding, 124 Piranha Conguration Tool, 119 pulse daemon, 118 real servers, 110 routing methods NAT, 116 routing prerequisites, 125 scheduling, job, 113 send_arp program, 119 shared data, 112 starting the cluster, 148 synchronizing conguration les, 147 three tiered Red Hat Cluster Manager, 113 using Red Hat Cluster Manager with Piranha, 151 lvs daemon, 119 M member status table, 98 member systems (See cluster systems) members setting up, 15 membership daemon, 42 minimum cluster conguration example, 7 minimum hardware conguration, 6 Mission Critical Linux, Inc., i mkfs, 33 multi-port services, clustering, 128 (See Also LVS clustering) MySQL setting up service, 73 N nanny daemon, 119 NAT enabling, 127 routing methods, LVS, 116 network address translation (See NAT) network hardware table, 13 network hub, 9 network switch, 9 NFS active-active conguration, 83 caveats, 82 client access, 82 NFS Druid, 78 server requirements, 77 setting up service, 77 NMI watchdog timer enabling, 164 no single point of failure conguration, 8 Non-Maskable Interrupt (NMI) watchdog timers, 164 187 O operating system conguration hardware installation, 5 Oracle adding an Oracle service, 71 oracle script example, 68 setting up service, 67 startdb script example, 68 stopdb script example, 69 tuning service, 72 overview introduction, 1 P packet forwarding, 124 (See Also LVS clustering) Parallel SCSI requirements, 26 parted creating disk partitions, 31 partitioning disks, 31 Piranha using Red Hat Cluster Manager with, 151 Piranha Conguration Tool, 119 CONTROL/MONITORING, 134 EDIT MONITORING SCRIPTS Subsection, 145 GLOBAL SETTINGS, 135 limiting access to, 123, 123 login panel, 133 necessary software, 133 overview of, 133 REAL SERVER subsection, 142 REDUNDANCY, 137 setting a password, 122 VIRTUAL SERVER subsection Firewall Mark, 141 Persistence, 142 Scheduling, 141 Virtual IP Address, 140 VIRTUAL SERVER subsection, 139 VIRTUAL SERVERS, 139 piranha-gui service, 121 piranha-passwd, 122 point-to-point Ethernet connection hardware table, 14 power controller connection, conguring, 48 power controllers, 10 network-attached, 10 serial-attached, 10 watchdog timers, 10 hardware-based, 10 software-based, 10 power switch, 48 (See Also power controller) power switch hardware table, 12 power switches conguring, 23 hardware watchdog timers conguring, 165 NMI watchdog timers enabling, 164 setting up, 163 watchdog, 163 software watchdog timers conguration, 163 testing, 54, 181 troubleshooting, 54, 182 pulse daemon, 118 pulse service, 121 Q quorum daemon, 43 quorum partitions (See shared cluster partitions) testing, 180 R raw, 32 raw devices creating, 32 rawdevices editing the le, 41 real servers (See LVS clustering) conguring services, 124 Red Hat Cluster Manager, 109 and Piranha, 151 command-line utilities, 179 with Red Hat GFS, 157 Red Hat Cluster Suite, 35 installation, 35 with rpm, 36 with the Package Management Tool, 35 Red Hat Enterprise Linux, 109 installation and conguration, 17 overview of, 109 Red Hat GFS and Red Hat Cluster Suite, 157 Red Hat GFS with Red Hat Cluster Suite changes to Red Hat Cluster Suite, 154 installation scenarios, 154 terminology, 153 using, 153 using GFS Setup Druid, 157 relocating a service, 64 round robin (See job scheduling, LVS) routing 188 prerequisites for LVS, 125 S Samba operating model, 86 Samba Druid, 87 server requirements, 85 setting up service, 85 smb.conf.sharename le elds, 90 scheduling, job (LVS), 113 scripts creating service scripts, 61 oracle script example, 68 startdb script example, 68 stopdb script example, 69 verifying application software and service scripts, 62 SCSI bus conguration requirements, 165 SCSI bus length, 167 SCSI bus termination, 166 SCSI identication numbers, 167 security Piranha Conguration Tool, 123 send_arp program, 119 service conguration, 59 service failover, 1 service property and resource information table, 59 service relocation, 1 service status table, 98 services, 51 (See Also adding to the cluster conguration) shared cluster partitions conguring, 30 requirements, 30 shared disk storage conguring, 25 shared disk storage hardware table, 13 shared partitions testing, 53 shared state, 1 shared storage, 26 shared storage requirements, 6 single-initiator bre channel interconnect setting up, 28 single-initiator SCSI bus setting up, 26 software information, supplementary, 169 software watchdog timers, 163 sshd service, 121 synchronizing conguration les, 147 syslog, 55 syslog event logging conguring, 55 syslogd, 55 System V init, 99 T tables availability and data integrity, 7 cluster hardware, 11 cluster member hardware, 11 common cluster behaviors, 170 console switch hardware, 15 diagnosing and correcting problems in a cluster, 103 installing the basic cluster hardware, 15 member status, 98 minimum cluster conguration components, 7 network hardware, 13 no single point of failure conguration, 8 point-to-point Ethernet connection hardware, 14 power controller connection, conguring, 48 power switch hardware, 12 service property and resource information, 59 service status, 98 shared disk storage hardware, 13 UPS system hardware, 14 terminal server, 10 testing power switches, 54, 181 quorum partitions, 180 shared partitions, 53 troubleshooting diagnosing and correcting problems in a cluster, 102 failover and recover scenarios, 170 power switch testing, 54, 182 table, 103 U UPS system hardware table, 14 UPS systems conguring, 24 using Red Hat GFS with Red Hat Cluster Suite, 153 utilities clufence, 181 clusvcadm, 180 redhat-cong-cluster-cmd, 179 shutil, 180 189 W watchdog timers hardware conguring, 165 hardware-based, 10 NMI enabling, 164 setting up, 163 software, 163 conguration, 163 software-based, 10 weighted least connections (See job scheduling, LVS) weighted round robin (See job scheduling, LVS) Unités de disques REFERENCE 86 F1 14FG 01 ESCALA Power7 Unités de disques Cette publication concerne les modèles suivants : - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) Les références au modèle Power 755 / 8236-E8C sont sans objet. Hardware Septembre 2010 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 F1 14FG 01 L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui prohibent, sans s’y limiter, des actions comme la copie, la distribution, la modification et la création de produits dérivés. Copyright Bull SAS 2007-2010 Imprimé en France Marques déposées Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs. Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des marques et/ou des brevets. La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit des marques et/ou des brevets. Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS ne pourra pas être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce manuel, ni pour tout dommage pouvant résulter de son application. Table des matières Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . vii Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi Unités de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Unités de disques - Nouveautés . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Installation d'une unité de disques ou d'une unité SSD . . . . . . . . . . . . . . . . . . . . . 1 Unités SSD (Solid State Drive) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique AIX sous tension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Préparation du système. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 Installation d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 9 Fin de la procédure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Linux sous tension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 Installation d'une unité de disques utilisant le noyau 2.4 Linux . . . . . . . . . . . . . . . . 17 Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . 17 Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Installation d'une unité de disques utilisant le noyau 2.6 Linux . . . . . . . . . . . . . . . . 24 Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . 24 Vérification du fonctionnement de la nouvelle unité de disques . . . . . . . . . . . . . . . 29 Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Virtual I/O Server sous tension. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . . 32 Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension . 37 Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Installation d'une unité de disques. . . . . . . . . . . . . . . . . . . . . . . . . . 40 Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 Retrait d'une unité de disques ou d'une unité SSD . . . . . . . . . . . . . . . . . . . . . . 45 Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 45 Remplacement d'une unité de disques sur un système ou une partition logique AIX sous tension . . . . . 46 Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 Remplacement d'une unité de disque. . . . . . . . . . . . . . . . . . . . . . . . . 51 Fin de la procédure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Remplacement d'une unité de disques sur un système ou une partition logique Linux sous tension . . . . . 55 Remplacement d'une unité de disques utilisant le noyau 2.4 Linux . . . . . . . . . . . . . . . 57 Préparation du retrait de l'unité de disques. . . . . . . . . . . . . . . . . . . . . . 57 Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 59 Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . 63 Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 Remplacement d'une unité de disques utilisant le noyau 2.6 Linux . . . . . . . . . . . . . . . 67 Préparation du retrait de l'unité de disques. . . . . . . . . . . . . . . . . . . . . . 67 Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . 69 Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . 71 Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 Remplacement d'une unité de disques sur un système ou une partition logique hors tension . . . . . . . 73 Préparation du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74 Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . 78 Fin de la procédure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 Remplacement d'une unité de disques SCSI . . . . . . . . . . . . . . . . . . . . . . . 81 iii Retrait d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Remplacement d'une unité de disques . . . . . . . . . . . . . . . . . . . . . . . . 83 Suivez la procédure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86 Retrait ou installation d'un obturateur d'unité de disques . . . . . . . . . . . . . . . . . . . . 86 Retrait d'un obturateur d'unité de disques . . . . . . . . . . . . . . . . . . . . . . . . 86 Installation d'un obturateur d'unité de disques . . . . . . . . . . . . . . . . . . . . . . 88 Emplacements des unités de disque et voyants de maintenance simultanée . . . . . . . . . . . . . . 90 Emplacements de l'adaptateur PCIe RAID et SSD SAS . . . . . . . . . . . . . . . . . . . . 90 Emplacements des unités de disques et voyants de maintenance simultanée pour les systèmes 02E/4B et 05F/6B. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 Emplacements des unités de disque et voyants de maintenance simultanée pour 31E/2B . . . . . . . . 92 Emplacements des unités de disques et voyants de maintenance simultanée sur les modèles 33E/8B et 36E/8C 94 Emplacements des unités de disques et voyants de maintenance simultanée pour les modèles 17M/MB et 79M/HB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 Emplacements des unités de disques et voyants de maintenance simultanée sur le boîtier du disque 58/86 . . 96 Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers d'unité SCSI 31D/24 et 31T/24 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Emplacements des unités de disques et voyants de maintenance simultanée sur une unité d'extension 58/02 100 Informations connexes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 Préparation d'une unité de disques dans un système ou une partition logique AIX . . . . . . . . . . 100 Préparation du retrait d'une unité de disques d'une unité centrale ou d'extension contrôlée par le système d'exploitation AIX. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 Régénération des données d'une unité de disques de remplacement sous AIX. . . . . . . . . . . . 101 Régénération des données d'une unité de disques de remplacement sous Linux . . . . . . . . . . . 101 Régénération de données sur une unité de disques utilisant le noyau 2.4 Linux . . . . . . . . . . 101 Régénération de données sur une unité de disques utilisant le noyau 2.6 Linux . . . . . . . . . . 102 Partage d'unités de disques internes sur le modèle 02E/4B ou 05F/6B . . . . . . . . . . . . . . . 104 Partage d'unités de disques internes sur un modèle 33E/8B ou 36E/8C . . . . . . . . . . . . . . . 105 Partage d'unités de disques internes sur un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . 107 Installation du port SAS externe sur un modèle 02E/4B ou 05F/6B . . . . . . . . . . . . . . . . 109 Installation du port SAS externe sur un modèle 31E/2B . . . . . . . . . . . . . . . . . . . . 111 Installation du port SAS externe des modèles 33E/8B ou 36E/8C . . . . . . . . . . . . . . . . . 113 Installation du port SAS externe sur un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . 121 Remise en place d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension . . . . . . . . 123 Retrait d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension . . . . . . . . . . . 131 Remise en place d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension . . . . . . . . 140 Retrait d'une carte d'extension SAS d'une unité d'extension 58/02 hors tension . . . . . . . . . . . . 141 Procédures communes pour les fonctions installables . . . . . . . . . . . . . . 143 Avant de commencer. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143 Identification d'un composant défaillant . . . . . . . . . . . . . . . . . . . . . . . . . 146 Voyants du panneau de commande . . . . . . . . . . . . . . . . . . . . . . . . . . 146 Identification d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . . 148 Localisation d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . 148 Activation du voyant associé au composant défaillant. . . . . . . . . . . . . . . . . . . 148 Désactivation du voyant d'un composant défaillant. . . . . . . . . . . . . . . . . . . . 148 Identification d'un composant défaillant sur un système Linux ou une partition logique . . . . . . . . 149 Localisation d'un composant défaillant sur un système Linux ou une partition logique. . . . . . . . 149 Localisation d'un composant défaillant sur un système ou une partition logique Linux. . . . . . . . 149 Activation du voyant associé au composant défaillant. . . . . . . . . . . . . . . . . . . 150 Désactivation du voyant d'un composant défaillant. . . . . . . . . . . . . . . . . . . . 150 Localisation d'un composant défaillant sur un système Virtual I/O Server ou une partition logique . . . . 150 Identification d'un composant à l'aide du Virtual I/O Server . . . . . . . . . . . . . . . . 151 Démarrage du système ou de la partition logique . . . . . . . . . . . . . . . . . . . . . . 151 Démarrage d'un système non géré par une console HMC . . . . . . . . . . . . . . . . . . 151 Démarrage d'un système ou d'une partition logique via la console HMC (Hardware Management Console) 153 Arrêt d'un système ou d'une partition logique . . . . . . . . . . . . . . . . . . . . . . . 153 Arrêt d'un système non géré par une console HMC . . . . . . . . . . . . . . . . . . . . 153 Arrêt d'un système à l'aide de la console HMC . . . . . . . . . . . . . . . . . . . . . . 155 Retrait et remplacement du volet ou panneau d'unité d'extension . . . . . . . . . . . . . . . . . 156 Retrait du panneau avant sur un modèle 14G/30 . . . . . . . . . . . . . . . . . . . . . 156 iv Unités de disques Installation du panneau avant sur un modèle 14G/30. . . . . . . . . . . . . . . . . . . . 156 Carters système . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157 Retrait et remplacement des carters du système 02E/4B ou 05F/6B . . . . . . . . . . . . . . . 157 Retrait du carter avant d'un système monté en armoire 02E/4B ou 05F/6B. . . . . . . . . . . . 157 Retrait du carter avant sur un système autonome 02E/4B ou 05F/6B. . . . . . . . . . . . . . 158 Installation du carter avant sur un système monté en armoire 02E/4B ou 05F/6B . . . . . . . . . 159 Installation du carter avant et du volet avant sur un système autonome 02E/4B ou 05F/6B . . . . . . 160 Retrait du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B . . . . . . . . . . . 161 Retrait du capot d'accès d'un système autonome 02E/4B ou 05F/6B . . . . . . . . . . . . . . 162 Installation du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B . . . . . . . . . 164 Installation du capot d'accès d'un système autonome 02E/4B ou 05F/6B . . . . . . . . . . . . 164 Retrait et remplacement des carters du système 31E/2B . . . . . . . . . . . . . . . . . . . 166 Retrait du capot d'accès du système 31E/2B . . . . . . . . . . . . . . . . . . . . . . 166 Installation du capot d'accès du système 31E/2B . . . . . . . . . . . . . . . . . . . . 167 Retrait et installation des carters sur le système 33E/8B ou 36E/8C . . . . . . . . . . . . . . . 168 Retrait du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . 168 Installation du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . 169 Retrait du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . 170 Installation du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . 171 Retrait et remplacement des carters du système 17M/MB ou 79M/HB . . . . . . . . . . . . . . 172 Retrait du capot avant des modèles 17M/MB ou 79M/HB . . . . . . . . . . . . . . . . . 172 Installation du panneau avant du modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . 173 Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C . . . . . . . . . 174 Mise en position de maintenance d'un système monté en armoire 31E/2B . . . . . . . . . . . . . 174 Mise en position de fonctionnement d'un système monté en armoire 31E/2B . . . . . . . . . . . . 175 Mise en position de maintenance d'un système monté en armoire 02E/4B ou 05F/6B . . . . . . . . . . 176 Mise en position de fonctionnement du système monté en armoire 02E/4B ou 05F/6B . . . . . . . . . . 178 Positions de maintenance et de fonctionnement . . . . . . . . . . . . . . . . . . . . . . . 179 Mise en position de fonctionnement du système 02E/4B ou 05F/6B . . . . . . . . . . . . . . . 179 Mise en position de maintenance d'un système monté en armoire 02E/4B ou 05F/6B . . . . . . . . 179 Mise en position de fonctionnement du système monté en armoire 02E/4B ou 05F/6B . . . . . . . . 181 Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C . . . . . . . . 182 Mise en position de maintenance d'un système monté en armoire 31E/2B . . . . . . . . . . . . 182 Mise en position de fonctionnement d'un système monté en armoire 31E/2B . . . . . . . . . . . 183 Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C . . . . . . . . 184 Mise en position de maintenance de l'unité 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . 184 Placement de 33E/8B ou de 36E/8C en position de fonctionnement . . . . . . . . . . . . . . 187 Cordons d'alimentation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189 Déconnexion des cordons d'alimentation d'un système 02E/4B ou 05F/6B . . . . . . . . . . . . . 189 Connexion des cordons d'alimentation d'un système 02E/4B ou 05F/6B. . . . . . . . . . . . . . 190 Déconnexion des cordons d'alimentation du système 31E/2B . . . . . . . . . . . . . . . . . 191 Connexion des cordons d'alimentation au système 31E/2B . . . . . . . . . . . . . . . . . . 192 Installation d'un dispositif via la console HMC (Hardware Management Console) . . . . . . . . . . . 193 Retrait d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . . . 194 Remplacement d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . 194 Vérification du composant installé . . . . . . . . . . . . . . . . . . . . . . . . . . . 195 Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX 195 Vérification d'un composant installé sur un système Linux ou une partition logique. . . . . . . . . . 198 Vérification d'un composant installé via les programmes de diagnostic autonomes . . . . . . . . . . 198 Vérification d'un composant installé via la console HMC (Hardware Management Console) . . . . . . . 199 Activation et désactivation de voyants . . . . . . . . . . . . . . . . . . . . . . . . 200 Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 200 Activation ou désactivation de voyants d'identification . . . . . . . . . . . . . . . . . 201 Visualisation d'événements réparables . . . . . . . . . . . . . . . . . . . . . . . . 202 Vérification de la présence d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à l'aide des outils Virtual I/O Server. . . . . . . . . . . . . . . . . . . . 202 Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207 Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208 Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208 Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208 Table des matières v Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212 Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214 vi Unités de disques Avis aux lecteurs canadiens Le présent document a été traduit en France. Voici les principales différences et particularités dont vous devez tenir compte. Illustrations Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la France. Terminologie La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au besoin. IBM France IBM Canada ingénieur commercial représentant agence commerciale succursale ingénieur technico-commercial informaticien inspecteur technicien du matériel Claviers Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier français-canadien, de type QWERTY. OS/2 - Paramètres canadiens Au Canada, on utilise : v les pages de codes 850 (multilingue) et 863 (français-canadien), v le code pays 002, v le code clavier CF. Nomenclature Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre clavier. vii Recommandations à l'utilisateur Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur (instructions d'utilisation, manuels de référence et manuels d'entretien). Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou plusieurs des mesures suivantes : v Réorienter l'antenne réceptrice ; v Déplacer l'équipement par rapport au récepteur ; v Éloigner l'équipement du récepteur ; v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent sur des circuits distincts ; v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien serrés ; v Vérifier la mise en place des obturateurs sur les connecteurs libres. Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des câbles blindés mis à la terre, à travers des filtres si nécessaire. En cas de besoin, adressez-vous à votre détaillant. Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des modifications non autorisées ont été effectuées sur l'équipement. viii Unités de disques L'obligation de corriger de telles interférences incombe à l'utilisateur. Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus amples renseignements. Brevets Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos demandes de renseignements relatives aux permis d'utilisation au directeur général des relations commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7. Assistance téléphonique Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des publications IBM, contactez IBM direct au 1 800 465-1234. Avis aux lecteurs canadiens ix x Unités de disques Consignes de sécurité Les consignes de sécurité peuvent être imprimées tout au long de ce guide. v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles. v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de certaines circonstances réunies. v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une unité, un système ou des données. Consignes de sécurité relatives au commerce international Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez claires pour vous. Informations sur les appareils à laser Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des lasers ou des diodes électroluminescentes (LED). Conformité aux normes relatives aux appareils à laser Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique. xi DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) DANGER xii Unités de disques Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Consignes de sécurité xiii ATTENTION : Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment, procédez comme suit : v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette configuration, procédez comme suit : – Retirez toutes les unités de la position 32U et plus. – Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire. – Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en dessous du niveau 32U. v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier. v Vérifiez l'itinéraire envisagé pour éliminer tout risque. v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre armoire. v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm. v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles sont bien fixés. v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute. v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement. v N'utilisez pas de rampe inclinée à plus de dix degrés. v Dès que l'armoire est à son nouvel emplacement, procédez comme suit : – Abaissez les quatre vérins de mise à niveau. – Installez des équerres de stabilisation sur l'armoire. – Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie inférieure à la partie supérieure de l'armoire. v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez l'armoire à celle-ci. (R002) (L001) (L002) xiv Unités de disques (L003) ou Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité. ATTENTION : Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM, DVD-RAM ou module à laser. Notez les informations suivantes : v Ne retirez pas les carters. En ouvrant le produit à laser, vous vous exposez au rayonnement dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur. v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation des commandes, ainsi que les procédures décrites dans le présent manuel. (C026) Consignes de sécurité xv ATTENTION : Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027) ATTENTION : Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028) ATTENTION : Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez connaissance des informations suivantes. Rayonnement laser lorsque le carter est ouvert. Evitez toute exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide d'instruments optiques. (C030) Informations sur l'alimentation électrique et sur le câblage relatives au document GR-1089-CORE du NEBS (Network Equipment-Building System) Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document GR-1089-CORE du NEBS (Network Equipment-Building System) : Cet équipement peut être installé : v dans des infrastructures de télécommunications réseau v aux endroits préconisés dans les directives NEC (National Electrical Code). Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie métallique au câblage du réseau extérieur. Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités. Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif externe de protection contre les surtensions (SPD). Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne de retour de la batterie en courant continu ne doit pas être connectée à la masse. xvi Unités de disques Unités de disques Informations sur l'installation, le retrait et le remplacement d'une unité de disque. Unités de disques - Nouveautés Prenez connaissance des nouveautés et des modifications apportées aux rubriques Unités de disques depuis leur dernière mise à jour. Installation d'une unité de disques ou d'une unité SSD Vous pouvez installer une unité de disques SCSI (Small Computer System Interface), SAS (serial-attached SCSI) ou SSD (Solid State Drive) sur le serveur ou l'unité d'extension. Les procédures de cette rubrique permettent d'effectuer cette tâche. Votre système ou partition logique peut être indifféremment hors tension ou sous tension lorsque vous installez l'unité de disques. Identifiez le système pour effectuer l'installation ou d'une mise à niveau. Allumez le voyant d'identification système (BLEU). Unités SSD (Solid State Drive) Pour installer et configurer des unités SSD, vous devez vérifier les détails relatifs au type de machine, à la carte et à la configuration pris en charge. Pourquoi et quand exécuter cette tâche Les unités SSD, également appelées unités flash, observent la plupart des règles appliquées à une unité de disque dur (HDD) normal. Par exemple, les unités SSD ressemblent physiquement aux unités de disque dur, s'installent de la même manière et dans la plupart des mêmes emplacements que ces derniers. Cependant, certaines restrictions et règles de configuration applicables aux unités SSD diffèrent de celles des unités de disque dur. Les tableaux suivants peuvent vous aider à déterminer les règles de configuration du système d'exploitation, de l'adaptateur, du type machine et de l'unité applicables à votre système. Après avoir examiné les règles de configuration, vous pouvez installer l'unité de la même manière que pour tout autre unité. Pour plus d'informations, voir «Installation d'une unité de disques ou d'une unité SSD». Restriction : Les unités SSD doivent faire partie intégrante d'une grappe RAID ou d'un miroitage du système (). Choisissez dans la liste suivante le système d'exploitation du serveur ou de la partition sur lequel vous souhaitez installer les disques : v Règles relatives aux unités SSD pour le système d'exploitation AIX v Règles relatives aux unités SSD pour le système d'exploitation Linux® v Règles relatives aux unités SSD pour le système d'exploitation Avertissement : Avant d'installer une unité SSD sur votre système, vous devez vous assurer de disposer de toutes les mises à jour préalablement requises pour l'unité SSD et les adaptateurs. . 1 Que faire ensuite Tableau 1. Règles relatives aux unités SSD pour le système d'exploitation AIX Modèle du type de machine Carte Règles d'association 58/86 v (5904) - Carte SAS RAID PCI-X DDR 1,5 Go de cache v (5906 et 5908) - Carte SAS RAID PCI-X DDR 1,5 Go de cache (BSC) v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et les unités de disque dur ne peuvent pas être combinées dans ce boîtier d'unité. v Jusqu'à huit unités sont prises en charge dans cette configuration. v Un seul 58/86 peut être raccordé à la carte, ou paire de cartes, lorsque des unités SSD sont installées. v Les mêmes règles s'appliquent à l'utilisation de cartes SAS RAID à double canal connectées à 5886. 33E/8B et 36E/8C v (5679) - Activation de SAS RAID : carte interne v (5904) - Carte SAS RAID PCI-X DDR 1,5 Go de cache v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent jamais être combinées dans la même grappe RAID ni le même groupe de mise en miroir. v Elles peuvent être associées dans le fond de panier de l'unité de disques du système dès lors que ce dernier n'est pas divisé. v Les unités SSD et HDD peuvent coexister dans un fond de panier divisé si chaque côté du fond de panier inclut uniquement l'unité SSD ou HDD. Restriction : Les fonds de panier divisés pour ces modèles ne sont pas pris en charge avec les cartes SAS RAID à double canal. A faire : L'unité SSD doit être située d'un côté du fond de panier divisé et l'unité HDD, de l'autre côté. Ces types d'unité ne peuvent pas être combinés au sein d'un jeu d'unités à contrôle de parité intégré. Toutefois, si vous décidez de le faire, assurez-vous de disposer d'un nombre suffisant d'unités de chaque type pour mener à bien votre programme de protection. 2 Unités de disques Tableau 1. Règles relatives aux unités SSD pour le système d'exploitation AIX (suite) Modèle du type de machine Carte Règles d'association 17M/MB et 79M/HB v (5652) - Fond de panier du disque/du support de stockage v (5662) - Carte d'activation Cache RAID de 175 Mo - Adaptateur d'E-S double stockage v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent pas être combinées dans la même grappe RAID ni le même groupe de mise en miroir. v Elles peuvent être associées dans le fond de panier de l'unité de disques système. A faire : Vous ne devez pas combiner ces types d'unité au sein d'un jeu d'unités à contrôle de parité intégré. Toutefois, si vous décidez de le faire, assurez-vous de disposer d'un nombre suffisant d'unités de chaque type pour mener à bien votre programme de protection. 58/02 (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent pas être combinées dans un même fond de panier d'unité de disque. v Les unités SSD et HDD peuvent coexister dans un fond de panier divisé si chaque côté du fond de panier inclut uniquement l'unité SSD ou HDD. L'unité SSD doit être située d'un côté du fond de panier divisé et l'unité HDD, de l'autre côté. v Les unités SSD et HDD ne peuvent pas être associées sur une paire de cartes 5903. v Une paire de cartes 5903 peut prendre en charge un maximum de 9 SSD. Tableau 2. Règles relatives aux unités SSD pour le système d'exploitation Linux. Modèle du type de machine Carte Règles d'association 58/86 v (5904) - Carte SAS RAID PCI-X DDR 1,5 Go de cache v (5906 et 5908) - Carte SAS RAID PCI-X DDR 1,5 Go de cache (BSC) v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et les unités de disque dur ne peuvent pas être combinées dans ce boîtier d'unité. v Jusqu'à huit unités sont prises en charge dans cette configuration. v Un seul 58/86 peut être raccordé à la carte, ou paire de cartes, lorsque des unités SSD sont installées. v Les mêmes règles s'appliquent à l'utilisation de cartes SAS RAID à double canal connectées à 5886. Unités de disques 3 Tableau 2. Règles relatives aux unités SSD pour le système d'exploitation Linux. (suite) Modèle du type de machine Carte Règles d'association 33E/8B et 36E/8C v (5679) - Activation de SAS RAID : carte interne v (5904) - Carte SAS RAID PCI-X DDR 1,5 Go de cache v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent jamais être combinées dans la même grappe RAID ni le même groupe de mise en miroir. v Elles peuvent être associées dans le fond de panier de l'unité de disques du système dès lors que ce dernier n'est pas divisé. v Les unités SSD et HDD peuvent coexister dans un fond de panier divisé si chaque côté du fond de panier inclut uniquement l'unité SSD ou HDD. Restriction : Les fonds de panier divisés pour ces modèles ne sont pas pris en charge avec les cartes SAS RAID à double canal. A faire : L'unité SSD doit être située d'un côté du fond de panier divisé et l'unité HDD, de l'autre côté. Ces types d'unité ne peuvent pas être combinés au sein d'un jeu d'unités à contrôle de parité intégré. Toutefois, si vous décidez de le faire, assurez-vous de disposer d'un nombre suffisant d'unités de chaque type pour mener à bien votre programme de protection. 17M/MB et 79M/HB v (5652) - Fond de panier du disque/du support de stockage v (5662) - Carte d'activation Cache RAID de 175 Mo - Adaptateur d'E-S double stockage v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent pas être combinées dans la même grappe RAID ni le même groupe de mise en miroir. v Elles peuvent être associées dans le fond de panier de l'unité de disques système. A faire : Vous ne devez pas combiner ces types d'unité au sein d'un jeu d'unités à contrôle de parité intégré. Toutefois, si vous décidez de le faire, assurez-vous de disposer d'un nombre suffisant d'unités de chaque type pour mener à bien votre programme de protection. 4 Unités de disques Tableau 2. Règles relatives aux unités SSD pour le système d'exploitation Linux. (suite) Modèle du type de machine Carte Règles d'association 58/02 (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent pas être combinées dans un même fond de panier d'unité de disque. v Les unités SSD et HDD peuvent coexister dans un fond de panier divisé si chaque côté du fond de panier inclut uniquement l'unité SSD ou HDD. L'unité SSD doit être située d'un côté du fond de panier divisé et l'unité HDD, de l'autre côté. v Les unités SSD et HDD ne peuvent pas être associées sur une paire de cartes 5903. v Une paire de cartes 5903 peut prendre en charge un maximum de 9 SSD. Tableau 3. Règles relatives aux unités SSD pour le système d'exploitation Modèle du type de machine Carte Règles d'association 58/86 v (5904) - Carte SAS RAID PCI-X DDR 1,5 Go de cache v (5906 et 5908) - Carte SAS RAID PCI-X DDR 1,5 Go de cache (BSC) v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et les unités de disque dur ne peuvent pas être combinées dans ce boîtier d'unité. v Jusqu'à huit unités sont prises en charge dans cette configuration. v Un seul 58/86 peut être raccordé à la carte, ou paire de cartes, lorsque des unités SSD sont installées. v Les mêmes règles s'appliquent à l'utilisation de cartes SAS RAID à double canal connectées à 5886. 33E/8B et 36E/8C v (5679) - Activation de SAS RAID : carte interne v (5904) - Carte SAS RAID PCI-X DDR 1,5 Go de cache v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent pas être combinées dans la même grappe RAID ni le même groupe de mise en miroir. v Elles peuvent être associées dans le fond de panier de l'unité de disques système. A faire : Vous ne devez pas combiner ces types d'unité au sein d'un jeu d'unités à contrôle de parité intégré. Toutefois, si vous décidez de le faire, assurez-vous de disposer d'un nombre suffisant d'unités de chaque type pour mener à bien votre programme de protection. Unités de disques 5 Tableau 3. Règles relatives aux unités SSD pour le système d'exploitation (suite) Modèle du type de machine Carte Règles d'association 17M/MB et 79M/HB v (5652) - Fond de panier du disque/du support de stockage v (5662) - Carte d'activation Cache RAID de 175 Mo - Adaptateur d'E-S double stockage v (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent pas être combinées dans la même grappe RAID ni le même groupe de mise en miroir. v Elles peuvent être associées dans le fond de panier de l'unité de disques système. A faire : Vous ne devez pas combiner ces types d'unité au sein d'un jeu d'unités à contrôle de parité intégré. Toutefois, si vous décidez de le faire, assurez-vous de disposer d'un nombre suffisant d'unités de chaque type pour mener à bien votre programme de protection. 58/02 (5903) - Carte SAS RAID PCIe à double canal x4 3 Go de cache de 380 Mo v Les unités SSD et HDD ne peuvent pas être combinées dans un même fond de panier d'unité de disque. v Les unités SSD et HDD peuvent coexister dans un fond de panier divisé si chaque côté du fond de panier inclut uniquement l'unité SSD ou HDD. L'unité SSD doit être située d'un côté du fond de panier divisé et l'unité HDD, de l'autre côté. v Les unités SSD et HDD ne peuvent pas être associées sur une paire de cartes 5903. v Une paire de cartes 5903 peut prendre en charge un maximum de 9 SSD. 6 Unités de disques Tâches associées «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique AIX sous tension» Vous pouvez installer une unité de disques ou une unité SSD sur un système ou une unité d'extension alors que le système d'exploitation ou la partition logique AIX qui contrôle l'emplacement de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche. «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Linux sous tension», à la page 15 Il peut être nécessaire d'installer une unité de disques ou une unité SSD sur un système ou une unité d'extension alors que le système d'exploitation ou la partition logique Linux qui contrôle l'emplacement de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche. «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension», à la page 37 Découvrez comment installer une unité de disques sur un système ou une partition logique qui contrôle l'emplacement de l'unité de disques une fois l'alimentation coupée. La procédure décrite dans cette information permet d'effectuer cette tâche. Information associée Câblage du sous-système du disque 5802 Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique AIX sous tension Vous pouvez installer une unité de disques ou une unité SSD sur un système ou une unité d'extension alors que le système d'exploitation ou la partition logique AIX qui contrôle l'emplacement de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche. Avant de commencer Avertissement : Si vous effectuez la maintenance d'un composant défaillant, consultez les procédures de maintenance pour «Remplacement d'une unité de disques sur un système ou une partition logique AIX sous tension», à la page 46. Cette procédure s'applique à l'installation ou à la mise à niveau d'un nouveau drive de mémoire. Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir «Installation d'un dispositif via la console HMC (Hardware Management Console)», à la page 193. Si vous ne disposez pas d'une console HMC, procédez comme suit pour installer une unité de disque du système ou de la partition qui contrôle l'unité de disques. Si vous installez des unités SSD, lisez les règles de configuration puis revenez à cette section. Pour plus d'informations, voir «Unités SSD (Solid State Drive)», à la page 1. Unités de disques 7 Pourquoi et quand exécuter cette tâche DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) Avertissement : La procédure avec le système sous tension est un ensemble complexe d'étapes qui doivent être exécutées avec précision pour éviter des pertes de données ou un endommagement du système. Vous pouvez également effectuer la procédure hors tension, plus simple et moins risquée en termes de pertes de données ou d'endommagement du système. Pour plus d'informations, voir «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension», à la page 37. Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. 8 Unités de disques Préparation du système Procédure 1. Effectuez les tâches préalablement requises décrites dans la section «Avant de commencer», à la page 143. 2. Le cas échéant, retirez le volet de l'unité centrale, celui de l'unité d'extension, ou ouvrez l'armoire. 3. Le cas échéant, retirez le panneau avant. 4. Identifiez le premier emplacement d'unité de disques libre de l'unité centrale ou de l'unité d'extension. Pour plus d'informations, voir «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. Remarque : Si le système est doté d'une unité d'extension, comblez d'abord les emplacements de l'unité centrale. Toutefois, vous pouvez choisir une autre position pour l'unité de disques, en fonction de votre stratégie de protection des données. 5. Inscrivez l'emplacement d'installation de la nouvelle unité. Par exemple, la nouvelle unité de disques peut être placée en P3-D4. 6. Pour installer une unité de disques, procédez comme suit. Installation d'une unité de disques Pourquoi et quand exécuter cette tâche 1. Connectez-vous en tant que superutilisateur. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Appuyez sur Entrée pour continuer. 4. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches. 5. Sur l'écran de sélection des tâches, sélectionnez Remplacement à chaud. 6. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud. 7. Sélectionnez Connexion d'une unité à une unité SES. La liste des emplacements vides du boîtier de connexion à chaud SCSI s'affiche. 8. Sélectionnez l'emplacement dans lequel vous allez installer l'unité de disques, puis appuyez sur Entrée. L'emplacement de l'unité de disques affiche l'état Identification. Le voyant de maintenance simultanée de l'emplacement commence à clignoter. Pour afficher l'emplacement des voyants de maintenance simultanée, choisissez le modèle sur lequel vous souhaitez installer une unité de disques. Pour une liste des modèles, voir «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. 9. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape Insertion de l'unité de disque. Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques. Important : . Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement approprié (voir figure 1, à la page 10). Unités de disques 9 Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 10. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir figure suivante). Figure 1. Retrait d'un obturateur d'unité de disques 10 Unités de disques 11. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) du bord avant de l'obturateur. 12. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 13. Retirez l'unité de disques de son emballage antistatique. 14. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 15. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à l'étape suivante. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante). Figure 2. Retrait du panneau frontal d'un obturateur d'unité de disques Unités de disques 11 16. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à l'étape suivante. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal (A) sur l'unité de disques. 17. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale. Ne la tenez pas par la poignée. 18. Faites glisser complètement l'unité de disques dans l'unité centrale (voir figure 4, à la page 13). 19. Poussez complètement la poignée de l'unité de disques. Le voyant de l'unité de disques s'allume. Figure 3. Installation d'un panneau frontal sur une unité de disques 12 Unités de disques Figure 4. Installation d'une unité de disques sur un modèle Figure 5. Installation d'une unité de disques à encombrement réduit Unités de disques 13 20. Appuyez sur Entrée. 21. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 9 à 20 jusqu'à installation de toutes les unités. 22. Annulez pour revenir au menu Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud. 23. Sélectionnez Configuration des unités ajoutées/remplacées. 24. Retournez à la ligne de commande. Fin de la procédure Pourquoi et quand exécuter cette tâche 1. Remettez en place le panneau avant. 2. Installez ou fermez le volet du système, de l'unité d'extension ou de l'armoire. 3. Vérifiez le composant installé. Pour obtenir des instructions, consultez la rubrique «Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX», à la page 195. 4. Pour préparer l'utilisation de la nouvelle unité de disques installée, voir «Préparation d'une unité de disques dans un système ou une partition logique AIX», à la page 100. Figure 6. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 14 Unités de disques Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Linux sous tension Il peut être nécessaire d'installer une unité de disques ou une unité SSD sur un système ou une unité d'extension alors que le système d'exploitation ou la partition logique Linux qui contrôle l'emplacement de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche. Avant de commencer Avertissement : Si vous effectuez la maintenance d'un composant défaillant, consultez les procédures de maintenance pour «Remplacement d'une unité de disques sur un système ou une partition logique Linux sous tension», à la page 55. Cette procédure s'applique à l'installation ou à la mise à niveau d'un nouveau drive de mémoire. Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir «Installation d'un dispositif via la console HMC (Hardware Management Console)», à la page 193. Si vous ne disposez pas d'une console HMC, procédez comme suit pour installer une unité de disque du système ou de la partition qui contrôle l'unité de disques. Si vous installez des unités SSD, lisez les règles de configuration puis revenez à cette section. Pour plus d'informations, voir «Unités SSD (Solid State Drive)», à la page 1. Unités de disques 15 DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) Avertissement : Avant de commencer, assurez-vous d'avoir pris connaissance des présentes instructions et de bien maîtriser la complexité de la procédure utilisant le mode Sous tension. Si vous êtes dans l'un des cas suivants, voir «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension», à la page 37. v Vous installez une unité de disques qui sera connectée à l'adaptateur SCSI 6203 ou 6204. v Vous ne vous sentez pas à l'aise avec la procédure d'installation en mode sous tension. Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. 16 Unités de disques Pourquoi et quand exécuter cette tâche Préparation du système Procédure 1. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143). 2. Le cas échéant, retirez le volet de l'unité centrale, de l'unité d'extension ou de l'armoire. 3. Le cas échéant, retirez le panneau avant. v Si vous avez un panneau avant d'un autre modèle, retirez-le. 4. Placez-vous devant l'unité centrale ou l'unité d'extension et identifiez le premier emplacement d'unité de disques disponible. Pour plus d'informations, voir «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. Remarque : Si le système est doté d'une unité d'extension, comblez d'abord les emplacements de l'unité centrale. Toutefois, vous pouvez choisir une autre position pour l'unité de disques, en fonction de votre stratégie de protection des données. 5. Inscrivez l'emplacement d'installation de la nouvelle unité. Par exemple, la nouvelle unité de disques peut être positionnée en P3–D4. Résultats Installez une unité de disques SCSI dans une unité centrale ou une unité d'extension via un noyau 2.4 Linux ou un noyau 2.6 Linux. Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Installation d'une unité de disques utilisant le noyau 2.4 Linux Si vous installez une unité de disques à un emplacement contrôlé par le noyau 2.4 Linux, appliquez la procédure décrite dans cette section pour effectuer cette tâche. Installation d'une unité de disques : Procédure 1. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape 7, à la page 20. Unités de disques 17 Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. a. Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement (voir figure 7). b. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir figure suivante). Figure 7. Retrait du panneau obturateur sur un modèle 18 Unités de disques c. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) de l'unité de disques du bord avant de l'obturateur. 2. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 3. Retirez l'unité de disques de son emballage antistatique. Inscrivez le numéro de série (les quatre derniers chiffres) de l'unité de disques. 4. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 5. Si votre unité de disques n'est pas dotée d'un panneau frontal, procédez comme suit pour l'installer : a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante). Figure 8. Retrait du panneau frontal de l'unité de disques Unités de disques 19 b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de disques. 6. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 7. Faites glisser l'unité de disques dans le système. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis bloquez-la dans son emplacement en poussant la poignée (voir figure 10, à la page 21). Figure 9. Installation d'un panneau frontal sur une unité de disques 20 Unités de disques Figure 10. Installation d'une unité de disques sur un modèle Figure 11. Installation d'une unité de disques à encombrement réduit Unités de disques 21 Remarque : Le voyant de maintenance simultanée ne clignote plus et reste allumé lorsque l'unité de disques est correctement installée. Pour afficher l'emplacement des voyants de maintenance simultanée, sélectionnez le modèle approprié dans la liste suivante : v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers d'unité SCSI 31D/24 et 31T/24», à la page 98 v «Emplacements des unités de disques et voyants de maintenance simultanée sur le boîtier du disque 58/86», à la page 96 v «Emplacements des unités de disques et voyants de maintenance simultanée sur une unité d'extension 58/02», à la page 100 8. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 17 à 7, à la page 20. 9. Connectez-vous en tant que superutilisateur. 10. Après avoir installé l'unité de disques, attendez 30 secondes, puis tapez iprconfig sur la ligne de commande de la session Linux. Appuyez sur Entrée. L'écran Work with Disk Units apparaît. 11. Sélectionnez Display disk hardware status dans l'écran Work with Disk Units. Appuyez sur Entrée. L'écran Display Hardware Status apparaît (voir figure suivante). Figure 12. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 22 Unités de disques 12. Localisez l'unité de disques que vous avez installée. Le numéro de série peut être 0. Si l'unité de disques apparaît en étant définie comme disque de modèle 50, vous avez terminé. Si tel n'est pas le cas, procédez comme suit : a. Recherchez le nom /dev/sdx associé à l'unité de disques que vous avez installée. b. Retournez à la ligne de commande. c. Tapez sg_map sur la ligne de commande et appuyez sur Entrée. d. Utilisez la sortie de la commande sg_map pour comparer l'unité /dev/sdx à une unité /dev/sgx. e. Tapez sg_inq /dev/sgx sur la ligne de commande en utilisant la valeur /dev/sgx de l'étape précédente. Appuyez sur Entrée. f. Tapez iprconfig sur la ligne de commande et appuyez sur Entrée. g. Sélectionnez Display disk hardware status dans l'écran Work with Disk Units. Appuyez sur Entrée. h. Le numéro de série de la nouvelle unité de disques s'affiche dans l'écran Display Hardware Status. 13. Notez le nom de ressource lorsque vous configurez l'unité de disques. Suivez la procédure : Procédure 1. Remettez en place le panneau avant. 2. Installez ou fermez le volet avant du système, de l'unité d'extension ou de l'armoire. 3. Configurez la nouvelle unité de disques en fonction de votre environnement. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. Display Disk Hardware Status Serial Vendor Product Resource Hardware Number ID ID Model Name Status 03132041 IBM 2780001 001 /dev/ipr3 Operational 000015A5 IBM ST373453LC 079 DPY/Active E3V1WEAB IBM IC35L036UCDY10-0 070 DPY/Active E3V0J55B IBM IC35L036UCDY10-0 079 DPY/Active E3V1W1UB IBM IC35L036UCDY10-0 070 DPY/Active 04B81193 IBM 3C98D203 210 /dev/sdi Operational 000227CE IBM ST336605LC 020 /dev/sdh Operational 00000001 IBM 570B001 001 /dev/ipr2 Operational 00006719 IBM ST336607LC 020 /dev/sdd Operational E3V0E77B IBM IC35L036UCDY10-0 020 /dev/sde Operational 0007E0C4 IBM ST373453LC 020 /dev/sdf Operational 00000001 IBM 5702001 001 /dev/ipr1 Operational 03060038 IBM 5703001 001 /dev/ipr0 Operational More... Press Enter to continue. e=Exit q=Cancel r=Refresh f=PageDn b=PageUp d=Display disk unit details p=Display device parity status Figure 13. Ecran Display Disk Hardware Status Unités de disques 23 Concepts associés Retrait et remplacement de volets et carters Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Installation d'une unité de disques utilisant le noyau 2.6 Linux Si vous installez une unité de disques à un emplacement contrôlé par le noyau 2.6 Linux, appliquez la procédure décrite dans cette section pour effectuer cette tâche. Installation d'une unité de disques : Procédure 1. Connectez-vous en tant que superutilisateur. 2. Tapez lsslot sur la ligne de commande de la session Linux et appuyez sur Entrée. Un écran similaire à la figure suivante apparaît. 3. Notez le nom Linux de l'emplacement de la carte, sous lequel vous voulez installer l'unité de disques. 4. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran Configuration Utility apparaît. 5. Sélectionnez Work with disk unit recovery sur l'écran Configuration Utility. Appuyez sur Entrée. 6. Sélectionnez Concurrent add device dans l'écran Work with Disk Unit Recovery. Appuyez sur Entrée. Un écran Concurrent Device Add similaire à la figure suivante apparaît. 7. Tapez l'option 1 à côté de l'emplacement dans lequel vous voulez installer l'unité de disques, puis appuyez sur Entrée. x0537p1:~ # lsslot # Slot Description Linux Name Device(s) U787A.001.DNZ00XV-P1-C1 Logical I/O Slot 0000:00:02.0 ethernet U787A.001.DNZ00XV-P1-C2 Logical I/O Slot 0000:00:02.2 pci U787A.001.DNZ00XV-P1-C3 Logical I/O Slot 0001:00:02.2 pci1069,b166 U787A.001.DNZ00XV-P1-C4 Logical I/O Slot 0000:00:02.6 pci1069,b166 U787A.001.DNZ00XV-P1-C5 Logical I/O Slot 0001:00:02.6 pci1014,0180 U787A.001.DNZ00XV-P1-T5 Logical I/O Slot 0000:00:02.4 ethernet U787A.001.DNZ00XV-P1-T7 Logical I/O Slot 0000:00:02.3 usb U787A.001.DNZ00XV-P1-T10 Logical I/O Slot 0001:00:02.4 scsi U787A.001.DNZ00XV-P1-T12 Logical I/O Slot 0001:00:02.3 ide U9111.520.10D3C6C-V1-C0 Virtual I/O Slot 30000000 vty Figure 14. Nom Linux correspondant à l'emplacement d'installation approprié pour l'unité de disques. Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 15. Exemple d'écran Concurrent Device Remove 24 Unités de disques L'écran Verify Device Concurrent Add apparaît. Un voyant de maintenance simultanée indique l'emplacement où l'unité peut être installée. 8. Appuyez sur Entrée dans l'écran Verify Device Concurrent Add pour vérifier l'emplacement. 9. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape 16, à la page 27. Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. a. Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement (voir figure 16). b. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir figure suivante). Figure 16. Retrait du panneau obturateur sur un modèle Unités de disques 25 c. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) de l'unité de disques du bord avant de l'obturateur. 10. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 11. Retirez l'unité de disques de son emballage antistatique. Inscrivez le numéro de série (les quatre derniers chiffres) de l'unité de disques. 12. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 13. Si votre unité de disques n'est pas dotée d'un panneau frontal, procédez comme suit pour l'installer : a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante). Figure 17. Retrait du panneau frontal sur l'unité de disques. 26 Unités de disques b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de disques. 14. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 15. L'écran Complete Device Concurrent Add apparaît. 16. Faites glisser l'unité de disques dans le système. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis bloquez-la dans son emplacement en poussant la poignée (voir figure 19, à la page 28). Figure 18. Fixation du panneau frontal à l'unité de disques Unités de disques 27 Figure 19. Installation d'une unité de disques sur un modèle Figure 20. Installation d'une unité de disques à encombrement réduit 28 Unités de disques 17. Appuyez sur Entrée. Le voyant de maintenance simultanée ne clignote plus et reste allumé lorsque l'unité de disques est correctement installée. Pour afficher l'emplacement des voyants de maintenance simultanée, sélectionnez le modèle approprié dans la liste suivante : v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers d'unité SCSI 31D/24 et 31T/24», à la page 98 v «Emplacements des unités de disques et voyants de maintenance simultanée sur le boîtier du disque 58/86», à la page 96 v «Emplacements des unités de disques et voyants de maintenance simultanée sur une unité d'extension 58/02», à la page 100 18. Si vous installez plusieurs unités de disques, répétez les étapes 2, à la page 24 à 17 jusqu'à installation de toutes les unités. Vérification du fonctionnement de la nouvelle unité de disques : Pourquoi et quand exécuter cette tâche Pour vérifier si l'unité de disques fonctionne, procédez comme suit : 1. Connectez-vous en tant que superutilisateur. 2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée.L'écran Configuration Utility apparaît. 3. Sélectionnez Display hardware status. L'écran Display Hardware Status apparaît (voir figure suivante). Figure 21. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 Unités de disques 29 4. Vérifiez que l'unité de disques installée apparaît à l'écran. Suivez la procédure : Pourquoi et quand exécuter cette tâche 1. Remettez en place le panneau avant. 2. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire. 3. Configurez la nouvelle unité de disques en fonction de votre environnement. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Virtual I/O Server sous tension Vous pouvez installer une unité de disques ou une unité SSD sur un système ou une unité d'extension alors que le système d'exploitation ou la partition logique Virtual I/O Server qui contrôle l'emplacement de l'unité de disques est sous tension. La procédure décrite dans cette information permet d'effectuer cette tâche. Avant de commencer Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir «Installation d'un dispositif via la console HMC (Hardware Management Console)», à la page 193. Si vous ne disposez pas Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 22. Exemple d'écran Display Hardware Status 30 Unités de disques d'une console HMC, procédez comme suit pour installer une unité de disque du système ou de la partition qui contrôle l'unité de disques. Pourquoi et quand exécuter cette tâche DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) Avertissement : La procédure avec le système sous tension est un ensemble complexe d'étapes qui doivent être exécutées avec précision pour éviter des pertes de données ou un endommagement du système. Vous pouvez également effectuer la procédure hors tension, plus simple et moins risquée en termes de pertes de données ou d'endommagement du système. Pour plus d'informations, voir «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension», à la page 37. Unités de disques 31 Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Préparation du système Procédure 1. Effectuez les tâches prérequises décrites dans «Avant de commencer», à la page 143. 2. Retirez les capots avant. Pour plus d'informations, voir «Carters système», à la page 157 3. Identifiez le premier emplacement d'unité de disques libre de l'unité centrale ou de l'unité d'extension. Pour plus d'informations, voir «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. Remarque : Si le système est doté d'une unité d'extension, comblez d'abord les emplacements de l'unité centrale. Toutefois, vous pouvez sélectionner une autre position pour l'unité de disques, en fonction de votre stratégie de protection des données. 4. Inscrivez l'emplacement d'installation de la nouvelle unité. Par exemple, la nouvelle unité de disques peut être placée en P3-D4. 5. Pour installer une unité de disques, procédez comme suit. Installation d'une unité de disques Pourquoi et quand exécuter cette tâche 1. Connectez-vous en tant que superutilisateur. 2. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée. 3. Appuyez sur Entrée pour continuer. 4. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches. 5. Sur l'écran de sélection des tâches, sélectionnez Remplacement à chaud. 6. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud. 7. Sélectionnez Connexion d'une unité à une unité SES. La liste des emplacements vides du boîtier de connexion à chaud SCSI s'affiche. 8. Sélectionnez l'emplacement dans lequel vous allez installer l'unité de disques, puis appuyez sur Entrée. L'emplacement de l'unité de disques affiche l'état Identification. Le voyant de maintenance simultanée de l'emplacement commence à clignoter. Pour afficher l'emplacement des voyants de maintenance simultanée, choisissez le modèle sur lequel vous souhaitez installer une unité de disques. Pour une liste des modèles, voir «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. 9. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape Insertion de l'unité de disque. Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques. Important : . Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement approprié (voir figure 23, à la page 33). 32 Unités de disques Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 10. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir figure suivante). Figure 23. Retrait d'un obturateur d'unité de disques Unités de disques 33 11. Si l'emplacement en question contient un obturateur d'unité de disques, procédez comme suit. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) du bord avant de l'obturateur. 12. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 13. Retirez l'unité de disques de son emballage antistatique. 14. Déverrouillez la poignée de l'unité de disques en la tirant vers vous. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 15. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à l'étape suivante. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante). Figure 24. Retrait du panneau frontal d'un obturateur d'unité de disques 34 Unités de disques 16. Si un panneau frontal n'est pas préinstallé sur l'unité de disques que vous installez, procédez à l'étape suivante. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal (A) sur l'unité de disques. 17. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale. Ne la tenez pas par la poignée. 18. Faites glisser complètement l'unité de disques dans l'unité centrale (voir figure 26, à la page 36). 19. Poussez complètement la poignée de l'unité de disques. Le voyant de l'unité de disques s'allume. Figure 25. Installation d'un panneau frontal sur une unité de disques Unités de disques 35 Figure 26. Installation d'une unité de disques sur un modèle Figure 27. Installation d'une unité de disques à encombrement réduit 36 Unités de disques 20. Appuyez sur Entrée. 21. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 32 à 20 jusqu'à installation de toutes les unités. 22. Annulez pour revenir au menu Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud. 23. Sélectionnez Configuration des unités ajoutées/remplacées. 24. Retournez à la ligne de commande. Suivez la procédure Pourquoi et quand exécuter cette tâche 1. Réinstallez le panneau avant. Pour plus d'informations sur le remplacement du capot, voir «Carters système», à la page 157. 2. Vérifiez le composant installé. Pour plus d'informations, voir «Vérification du composant installé», à la page 195. Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension Découvrez comment installer une unité de disques sur un système ou une partition logique qui contrôle l'emplacement de l'unité de disques une fois l'alimentation coupée. La procédure décrite dans cette information permet d'effectuer cette tâche. Avant de commencer Avertissement : Si vous effectuez la maintenance d'un composant défaillant, consultez les procédures de maintenance pour «Remplacement d'une unité de disques sur un système ou une partition logique hors tension», à la page 73. Cette procédure s'applique à l'installation ou à la mise à niveau d'un nouveau drive de mémoire. Figure 28. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 Unités de disques 37 Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez celle-ci pour effectuer la procédure de remplacement du ventilateur dans le serveur. Pour obtenir des informations sur l'utilisation de la console HMC pour l'installation d'un dispositif, voir http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/hmcinstall.htm. Si vous ne disposez pas d'une console HMC, procédez comme suit pour installer une unité de disque du système ou de la partition qui contrôle l'unité de disques. Si vous installez des unités SSD, lisez les règles de configuration puis revenez à cette section. Pour plus d'informations, voir «Unités SSD (Solid State Drive)», à la page 1. 38 Unités de disques Pourquoi et quand exécuter cette tâche DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) Préparation du système Procédure 1. Effectuez les tâches préalablement requises décrites dans la section «Avant de commencer», à la page 143. Remarque : Ne coupez pas l'alimentation lorsque vous arrêtez le système ou la partition logique afin d'installer une unité de disque ou une unité SSD. 2. Le cas échéant, retirez le volet de l'unité centrale, celui de l'unité d'extension, ou ouvrez l'armoire. 3. Le cas échéant, retirez le panneau avant. Unités de disques 39 4. Identifiez le premier emplacement d'unité de disques libre de l'unité centrale ou de l'unité d'extension. Pour plus d'informations, voir «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. Remarque : Si le système est doté d'une unité d'extension, vous devez d'abord combler les emplacements de l'unité centrale. Toutefois, vous pouvez sélectionner une autre position pour l'unité de disques, en fonction de votre stratégie de protection des données. 5. Arrêtez le système ou la partition qui contrôle l'emplacement de l'unité de disques. Pour obtenir des instructions, voir http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/ crustopsys.htm. Installation d'une unité de disques Pourquoi et quand exécuter cette tâche 1. Si l'emplacement que vous avez choisi comporte un obturateur d'unité de disque, conservez celui-ci pour une utilisation ultérieure ou mettez-le au rebut puis passez à l'étape Insertion du disque. Remarque : L'obturateur est une pièce séparée que ne se fixe pas sur la nouvelle unité de disques. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. a. Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement approprié (voir figure 29). b. Retirez le loquet (A) du panneau frontal de l'unité de disques du bord avant de l'obturateur (voir figure suivante). Figure 29. Retrait d'un obturateur d'unité de disques d'un modèle 40 Unités de disques c. Dévissez et retirez la vis moletée (C) pour dégager le panneau frontal (B) du bord avant de l'obturateur. 2. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 3. Retirez l'unité de disques de son emballage protecteur. Inscrivez le numéro de série (les quatre derniers chiffres) de l'unité de disques. 4. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 5. Si un panneau frontal n'est pas préinstallé sur l'unité de disques, procédez comme suit pour l'installer sur la nouvelle unité : a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante). Figure 30. Retrait du panneau frontal d'une unité de disques Unités de disques 41 b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de disques. 6. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 7. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension. Remarque : Lors de l'installation de l'unité de disques, vérifiez qu'elle est complètement et parfaitement insérée dans le système. 8. Poussez complètement la poignée de l'unité de disques (voir figure 32, à la page 43). Figure 31. Installation d'un panneau frontal sur une unité de disques 42 Unités de disques Figure 32. Installation d'une unité de disques sur un modèle Figure 33. Installation d'une unité de disques à encombrement réduit Unités de disques 43 9. Si vous installez plusieurs unités de disques, répétez les étapes 1, à la page 40 à 8, à la page 42 jusqu'à installation de toutes les unités. Suivez la procédure Pourquoi et quand exécuter cette tâche 1. Remettez en place le panneau avant. 2. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire. 3. Si les câbles d'alimentation système ont été retirés, rebranchez-les. 4. Démarrez le système ou la partition logique. Pour plus d'informations, voir «Démarrage du système ou de la partition logique», à la page 151. 5. Pour configurer la nouvelle unité de disques, sélectionnez la procédure correspondant au système d'exploitation qui contrôle l'unité de disques. v «Préparation d'une unité de disques dans un système ou une partition logique AIX», à la page 100 v Pour plus d'informations sur la configuration de la nouvelle unité de disques dans un environnement Linux, voir PCI-X SCSI RAID Controller Reference Guide for Linux. Figure 34. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 44 Unités de disques Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Retrait d'une unité de disques ou d'une unité SSD Il peut être nécessaire de retirer une unité de disques ou une unité SSD du serveur ou de l'unité d'extension. La présente section fournit des informations sur le boîtier d'unités de disques et les procédures permettant d'effectuer ces tâches. Pour retirer et remplacer une unité de disques défaillante, voir «Remplacement d'une unité de disques». Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer l'unité de disques. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 194. Cette rubrique explique comment retirer une unité de disques sur une unité centrale ou une unité d'extension contrôlée par différents systèmes d'exploitation. «Remplacement d'une unité de disques sur un système ou une partition logique AIX sous tension», à la page 46 «Remplacement d'une unité de disques sur un système ou une partition logique Linux sous tension», à la page 55 Remplacement d'une unité de disques Il peut être nécessaire de remplacer une unité de disques SCSI (Small Computer System Interface) ou SAS (serial-attached Attached SCSI) sur le serveur ou l'unité d'extension. Les procédures de cette section permettent d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Remarque : Ces instructions (les trois premières sections) ne concernent que le remplacement d'une unité de disques SAS. Pour retirer et remplacer une unité d'extension dans un boîtier différent, utilisez les informations du boîtier. Suivez cette procédure pour retirer une unité de disques défaillante et la remplacer par le même type. Si vous envisagez de retirer une unité de disques et d'installer un type différent, retirez l'unité existante (voir «Retrait d'une unité de disques ou d'une unité SSD») et installez la nouvelle unité de disques (voir «Installation d'une unité de disques ou d'une unité SSD», à la page 1). Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour remplacer l'unité de disques. Pour plus d'informations, voir «Remplacement d'un composant via la console HMC», à la page 194. Votre système ou votre partition peut être indifféremment hors tension ou sous tension lorsque vous remplacez l'unité de disques. Si l'unité de disques remplacée se trouve dans le groupe de volumes racine (rootvg) AIX ou Linux et n'est pas protégée, mettez le système hors tension pour remplacer le disque (voir la procédure appropriée). Unités de disques 45 Remplacement d'une unité de disques sur un système ou une partition logique AIX sous tension Il peut être nécessaire de remplacer une unité de disques SCSI (Small Computer System Interface) ou SAS (serial-attached Attached SCSI) sur un système ou une unité d'extension alors que le système d'exploitation ou la partition logique AIX qui contrôle l'emplacement de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche. Avant de commencer Avertissement : Si vous installez une unité de disque nouvelle ou mise à niveau, voir «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique AIX sous tension», à la page 7. Si vous retirez une unité de disque dans le cadre d'une procédure de maintenance, continuez de suivre la procédure suivante. 46 Unités de disques Pourquoi et quand exécuter cette tâche DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) Avertissement : La procédure avec le système sous tension est un ensemble complexe d'étapes qui doivent être exécutées avec exactitude pour éviter des pertes de données ou un endommagement du système. Vous pouvez également effectuer la procédure hors tension, plus simple et moins risquée en termes de pertes de données ou d'endommagement du système. Si le système ou la partition logique qui contrôle l'unité de disques à remplacer est sur la ligne de commande AIX, vous pouvez utiliser cette procédure avec l'alimentation sous tension. Si l'unité de disques à remplacer se trouve dans le groupe de volumes racine (rootvg) et qu'elle n'est pas protégée par la technologie RAID (Redundant Array of Independent Disks) ou la mise en miroir, ou pour utiliser la procédure hors tension, voir «Remplacement d'une unité de disques sur un système ou une partition logique hors tension», à la page 73. Unités de disques 47 Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Préparation du système Procédure 1. Effectuez les tâches de configuration prérequises décrites dans la section «Avant de commencer», à la page 143. 2. Préparez-vous à retirer l'unité de disques de l'unité centrale ou de l'unité d'extension (voir «Préparation du retrait d'une unité de disques d'une unité centrale ou d'extension contrôlée par le système d'exploitation AIX», à la page 100). 3. Le cas échéant, retirez le volet de l'unité centrale, de l'unité d'extension ou de l'armoire. 4. Le cas échéant, retirez le panneau avant. v Si vous avez un panneau avant d'un autre modèle, retirez-le. Résultats Pour remplacer une unité de disques JBOD (Just a Bunch of Disks) connectée à une carte SCSI ou SCSI RAID, voir «Remplacement d'une unité de disques SCSI», à la page 81. Retrait d'une unité de disques Procédure 1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes suivantes. 2. Connectez-vous en tant que superutilisateur. 3. A l'invite, tapez diag et appuyez sur Entrée. 4. Appuyez sur Entrée pour continuer. 5. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches. 6. Sélectionnez Remplacement à chaud. 7. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud ou Unités RAID remplaçables à chaud, en fonction de l'adaptateur auquel votre unité est connectée. Conseil : L'une, l'autre ou ces deux options peuvent être utilisées, en fonction de la configuration de votre système. Si l'unité que vous désirez remplacer ne figure pas dans la liste lorsque vous suivez la procédure du Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud, passez à l'étape 8. v Si une unité de disques RAID est connectée à une carte PCI SCSI RAID 2498, passez à l'étape 8. v Si une unité de disques JBOD est connectée à une carte SCSI ou SCSI RAID, voir «Remplacement d'une unité de disques SCSI», à la page 81. v Si des unités RAID sont connectées à d'autres cartes, voir PCI-X SCSI RAID Controller Reference Guide for AIX. 8. Sélectionnez Unités RAID remplaçables à chaud. 9. Dans la liste de sélection des ressources, sélectionnez la carte connectée à l'ensemble contenant l'unité de disques à remplacer. Appuyez sur Entrée. 10. Sélectionnez Validation. 11. Dans le menu Identification et retrait de ressources, sélectionnez Retrait. Appuyez sur Entrée. La liste des unités de disques pouvant être retirées apparaît. Si l'unité de disques à retirer ne figure pas dans la liste, demandez à l'administrateur système de la placer dans l'état Incident avant de poursuivre cette procédure. Pour plus d'informations, voir AIX System Management Guide: Operating System and Devices. 48 Unités de disques 12. Sélectionnez l'unité à retirer et appuyez sur Entrée. Le voyant de maintenance simultanée correspondant à l'emplacement clignote rapidement. Avertissement : Si ce n'est pas le cas, laissez l'unité à sa place. Pour afficher l'emplacement des voyants de maintenance simultanée, consultez «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. 13. Déverrouillez la poignée de l'unité de disques en appuyant dessus et en la tirant vers vous (voir figure 35). Figure 35. Retrait d'une unité de disques d'un modèle Unités de disques 49 14. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. Figure 36. Retrait d'une unité de disques à encombrement réduit Figure 37. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 50 Unités de disques Le voyant de maintenance simultanée de l'emplacement s'éteint lorsque vous retirez l'unité de disques. 15. Votre nouvelle unité de disques est-elle déjà équipée d'un panneau frontal ? v Non : continuez la procédure à ce stade. v Oui : Passez à l'étape 6, à la page 52 de la procédure de remplacement. Important : Si le panneau frontal de l'unité de disques n'est pas doté d'une vis moletée, il s'agit d'une unité de disques SAS (serial-attached Attached SCSI) non amovible. L'unité de remplacement doit alors être pourvue d'un panneau frontal fixé. Dévissez et retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques, comme illustré dans la figure suivante. Remplacement d'une unité de disque Procédure 1. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 2. Retirez l'unité de disques de son emballage protecteur. 3. Déverrouillez la poignée de l'unité de disques de remplacement en la tirant vers vous. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans le système. 4. Si la nouvelle unité de disques n'est pas déjà équipée d'un panneau frontal, procédez comme suit : Fixez le panneau frontal (A) à l'unité de disques de remplacement, comme illustré dans la figure suivante. 5. Si la nouvelle unité de disques n'est pas déjà équipée d'un panneau frontal, procédez comme suit : Figure 38. Retrait du panneau frontal d'une unité de disques Unités de disques 51 Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de disques. 6. Installez l'unité de disques (voir figure 40, à la page 53). a. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale. Ne la tenez pas par la poignée. b. Faites glisser complètement l'unité de disques dans l'unité centrale. Important : Lorsque vous installez une unité de disques, vous devez vérifier que l'unité est complètement et parfaitement insérée dans le système. c. Poussez complètement la poignée de l'unité de disques. Figure 39. Installation d'un panneau frontal sur une unité de disques 52 Unités de disques Figure 40. Installation d'une unité de disques sur un modèle Figure 41. Installation d'une unité de disques à encombrement réduit Unités de disques 53 7. Appuyez sur Entrée. L'emplacement de l'unité de disques passe de l'état Retrait à l'état Normal. 8. Sélectionnez le menu Unités RAID remplaçables à chaud. Appuyez sur F3 pour revenir à l'écran précédent. 9. Passez à l'écran de sélection des tâches. 10. Sélectionnez Consignation d'action corrective. 11. Sélectionnez l'unité de disques remplacée et appuyez sur Entrée. 12. Sélectionnez Validation après la sélection et appuyez sur Entrée. 13. Retournez à la ligne de commande. Fin de la procédure Procédure 1. Remettez en place le panneau avant. 2. Installez ou fermez le volet avant de l'unité centrale, de l'unité d'extension ou de l'armoire. 3. Pour préparer l'unité de disques, voir «Régénération des données d'une unité de disques de remplacement sous AIX», à la page 101. Figure 42. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 54 Unités de disques Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Remplacement d'une unité de disques sur un système ou une partition logique Linux sous tension Il peut être nécessaire de remplacer une unité de disques SCSI (Small Computer System Interface) ou SAS (serial-attached Attached SCSI) sur un système ou une unité d'extension alors que le système d'exploitation ou la partition logique Linux qui contrôle l'emplacement de l'unité de disques est sous tension. Exécutez la procédure décrite dans cette section pour effectuer cette tâche. Avant de commencer Avertissement : Si vous installez une unité de disque nouvelle ou mise à niveau, voir «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique Linux sous tension», à la page 15. Si vous retirez une unité de disque dans le cadre d'une procédure de maintenance, continuez de suivre la procédure suivante. Unités de disques 55 DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) Avertissement : Avant de commencer, assurez-vous d'avoir pris connaissance des présentes instructions et de bien maîtriser la complexité de la procédure utilisant le mode Sous tension. Si le système ou la partition logique qui contrôle l'unité de disques à remplacer est sur la ligne de commande Linux, vous pouvez utiliser cette procédure avec l'alimentation sous tension. Si vous êtes dans l'un des cas suivants, voir «Remplacement d'une unité de disques sur un système ou une partition logique hors tension», à la page 73 : v Vous installez une unité de disques qui sera connectée à l'adaptateur SCSI 6203 ou 6204 v Vous ne vous sentez pas à l'aise avec la procédure d'installation sur un système ou une partition logique sous tension. 56 Unités de disques Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Pourquoi et quand exécuter cette tâche Préparation du système Procédure 1. Effectuez les tâches de configuration prérequises décrites dans la section «Avant de commencer», à la page 143. 2. L'unité de disques à remplacer se trouve-t-elle dans une batterie de disques RAID (Redundant Array of Independent Disk) de niveau 0 ou dans une batterie de disques RAID de niveau 5 ou 10 défaillante ? v Si oui, vous devez supprimer le module avant de remplacer le disque. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. v Si non, votre unité est configurée comme JBOD (Just a Bunch of Disks) ou dans un module de disques endommagé. Passez à l'étape suivante. 3. Si l'unité de disques à remplacer se trouve dans un module de disques RAID (Redundant Array of Independent Disk) de niveau 0 ou dans un module de disques RAID de niveau 5 ou 10 défaillant, vous devez supprimer le module de disques avant de remplacer le disque. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. 4. Le cas échéant, retirez le volet de l'unité centrale, celui de l'unité d'extension, ou ouvrez le volet avant de l'armoire. 5. Le cas échéant, retirez le panneau avant. v Si vous avez un panneau avant d'un autre modèle, retirez-le. Résultats Pour remplacer une unité de disques sur un système ou une partition logique Linux qui contrôle l'emplacement de cette unité sous tension, sélectionnez la procédure appropriée dans la liste suivante : Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Remplacement d'une unité de disques utilisant le noyau 2.4 Linux Les procédures suivantes expliquent comment remplacer une unité de disques à un emplacement contrôlé par un système ou une partition logique qui exécute le noyau 2.4 Linux. Préparation du retrait de l'unité de disques : Procédure 1. Connectez-vous en tant que superutilisateur. 2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran Work with Disk Units apparaît. 3. Dans l'écran Work with Disk Units, sélectionnez Analyze log. Appuyez sur Entrée. L'écran Kernel Messages Log similaire à l'écran suivant apparaît. Unités de disques 57 4. Dans l'écran Kernel Messages Log, sélectionnez Use vi to view most recent error messages. Appuyez sur Entrée. 5. Recherchez l'entrée de journal de l'unité à remplacer. 6. Notez les informations d'emplacement de l'unité. Les informations d'emplacement ont le format P1–I2/Z1–A5. 7. Revenez à l'écran Work with Disk Units. 8. Sélectionnez Display disk hardware status dans l'écran Work with Disk Units. Appuyez sur Entrée. 9. Notez l'état matériel de l'unité de disques à remplacer. 10. Tapez d (affichage des informations sur l'unité de disques) à côté de l'unité de disques à remplacer. 11. Dans l'écran Display Disk Unit Details, tapez 5 (affichage des informations sur l'unité de disques) à côté de l'unité de disques à remplacer. L'écran Disk Unit Hardware Resource Information Details, similaire à l'écran suivant, apparaît. 12. Notez les informations d'emplacement physique. 13. Revenez à l'écran Work with Disk Units. 14. Si l'unité de disques à remplacer n'est pas protégée ou si elle est en cours d'utilisation, transférez les données de l'unité de disques avant de poursuivre la procédure. Kernel Messages Log Select one of the following: 1. Use vi to view most recent error messages 2. Use vi to view Storage error messages 3. Use vi to view all kernel error messages 4. Use vi to view sisconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. Use vi to view Storage boot time messages Selection: e=Exit Figure 43. Kernel Messages Log Disk Unit Hardware Resource Information Details Manufacturer . . . . . . . . : IBM Machine Type and Model. . . : IC35L073UCDY10-0 Firmware Version . . . . . . : 53323846 Model . . . . . . . . . . . : 020 Serial Number. . . . . . . : E6VKUNPD Capacity . . . . . . . . . . : 73,41 Go Resource Name. . . . . . . : /dev/sda Physical location: Location . . . . . . . : U1.9-P1/Z1-A8 PCI Bus. . . . . . . . . . : 257 PCI Device . . . . . . . . : 1 SCSI Host Number . . . . . : 0 SCSI Channel . . . . . . . : 0 SCSI Id. . . . . . . . . . : 8 SCSI Lun . . . . . . . . . : 0 Press Enter to continue. e=Exit q=Cancel v=Display Extended VPD Figure 44. Ecran Disk Unit Hardware Resource Information Details 58 Unités de disques Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. Retrait d'une unité de disques : Procédure 1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes suivantes. 2. Dans l'écran Work with Disk Units, sélectionnez Work with disk unit recovery. Appuyez sur Entrée. 3. Dans l'écran Work with Disk Unit Recovery, sélectionnez Device Concurrent maintenance. Appuyez sur Entrée. Un écran Device Concurrent Maintenance similaire à l'écran suivant apparaît. 4. Entrez les informations d'emplacement physique ou l'emplacement PCI ou SCSI de l'unité de disques à remplacer. Appuyez sur Entrée. 5. Spécifiez 1 (Remove device) pour que le système effectue l'action. 6. Saisissez un délai en minutes ; soit le temps nécessaire pour retirer l'unité de disques. Le voyant de maintenance simultanée indique que vous pouvez retirer l'unité de disques. Pour afficher l'emplacement des voyants de maintenance simultanée, sélectionnez le modèle approprié dans la liste suivante : v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers d'unité SCSI 31D/24 et 31T/24», à la page 98 7. Appuyez sur Entrée. Un écran Device Concurrent Maintenance Action similaire à l'écran suivant apparaît. Device Concurrent Maintenance Type the choices, then press Enter. Specify either Physical Location or PCI/SCSI location. Physical Location: U0.1-P1-I4/Z1-A8 OU PCI/SCSI . . . PCI Bus: PCI Device: SCSI Channel: SCSI Id: SCSI Lun: Specify action as 1=Remove device 2=Install device Action à exécuter. . . . . . . . . : 1 Enter a time value between 01 and 19. Time needed in minutes . . . . . . . . . : 1 Or leave blank and press Enter to cancel Figure 45. Exemple d'écran Device Concurrent Maintenance Unités de disques 59 8. Entrez c pour confirmer que vous voulez retirer l'unité de disques dans l'emplacement défini. Un écran Device Concurrent Maintenance Action In Progress similaire à l'écran suivant apparaît. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 9. Appuyez sur la poignée de l'unité de disques et tirez-la vers vous pour retirer l'unité de disques (voir figure 48, à la page 61). Si la poignée n'est pas complètement sortie, vous ne pouvez pas retirer l'unité de disques de l'unité centrale ou de l'unité d'extension. Device Concurrent Maintenance Action Removal of device: Location: U0.1-P1-I4/Z1-A8 You have 1 minute to perform the operation During this time, your system can seem unresponsive c=Confirm q=Cancel Figure 46. Exemple d'écran Device Concurrent Maintenance Action Device Concurrent Maintenance Action In Progress Please remove the device located in: Location: U0.1-P1-I4/Z1-A8 Figure 47. Exemple d'écran Device Concurrent Maintenance Action In Progress 60 Unités de disques Figure 48. Retrait d'une unité de disques d'un modèle ou 11D/20 Figure 49. Retrait d'une unité de disques à encombrement réduit Unités de disques 61 10. Retirez l'unité de disques en la maintenant par-dessous. Ne la tenez pas par la poignée. Remarque : Le voyant de maintenance simultanée de l'emplacement s'arrête de clignoter et reste éteint lorsque vous retirez l'unité de disques. 11. Appuyez sur Entrée. 12. Votre unité de disques est-elle déjà équipée d'un panneau frontal ? v Non : continuez la procédure à ce stade. v Oui : passez à l'étape 5, à la page 64 de la procédure de remplacement. Dévissez et retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques (voir figure suivante). Figure 50. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 62 Unités de disques Remplacement d'une unité de disques : Procédure 1. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 2. Retirez l'unité de disques de son emballage protecteur. 3. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme suit pour l'installer : a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante). Figure 51. Retrait du panneau frontal d'une unité de disques Unités de disques 63 b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de disques. 5. Sélectionnez Device Concurrent maintenance dans l'écran Work with Disk Unit Recovery. Appuyez sur Entrée. 6. Entrez les informations d'emplacement physique ou l'emplacement PCI/SCSI de l'unité de disques que vous avez retirée. Appuyez sur Entrée. 7. Définissez l'action 2 (installation d'unité). 8. Saisissez un délai en minutes ; soit le temps nécessaire pour installer la nouvelle unité de disques. Le voyant de maintenance simultanée de l'emplacement indique que vous pouvez installer l'unité de disques. 9. Appuyez sur Entrée. Un écran Device Concurrent Maintenance Action similaire à l'écran suivant apparaît. Figure 52. Fixation d'un panneau frontal à une unité de disques 64 Unités de disques 10. Entrez c pour confirmer que vous voulez installer l'unité de disques dans l'emplacement défini. Un écran Device Concurrent Maintenance Action In Progress similaire à l'écran suivant apparaît. 11. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 12. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis bloquez-la dans son emplacement en poussant la poignée (voir figure 55). Device Concurrent Maintenance Action Installation of device: Location: U0.1-P1-I4/Z1-A8 You have 1 minute to perform the operation During this time, your system can seem unresponsive c=Confirm q=Cancel Figure 53. Exemple d'écran Device Concurrent Action Maintenance Device Concurrent Maintenance Action In Progress Please insert the device located in: Location: U0.1-P1-I4/Z1-A8 Figure 54. Exemple d'écran Device Concurrent Maintenance Action In Progress Figure 55. Installation d'une unité de disques sur un modèle ou 11D/20 Unités de disques 65 Le voyant de maintenance simultanée de l'emplacement ne clignote plus et reste allumé lorsque l'unité de disques est correctement installée. Figure 56. Installation d'une unité de disques à encombrement réduit Figure 57. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 66 Unités de disques Suivez la procédure : Procédure 1. Régénérez les données sur la nouvelle unité de disques. Pour plus d'informations, voir «Régénération des données d'une unité de disques de remplacement sous Linux», à la page 101. 2. Remettez en place le panneau avant. 3. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire. Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Remplacement d'une unité de disques utilisant le noyau 2.6 Linux Les procédures suivantes expliquent comment remplacer une unité de disques à un emplacement contrôlé par un système ou une partition logique qui exécute le noyau 2.6 Linux. Préparation du retrait de l'unité de disques : Procédure 1. Connectez-vous en tant que superutilisateur. 2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran Configuration Utility apparaît. 3. Sélectionnez Analyze log. Appuyez sur Entrée. L'écran Kernel Messages Log apparaît. 4. Sélectionnez View most recent ipr error messages dans l'écran the Kernel Messages Log. Appuyez sur Entrée. 5. Recherchez l'entrée de journal de l'unité de disques à remplacer. 6. Notez les informations d'emplacement de l'unité. Remarque : Les informations d'emplacement ont le format 2:0:8:0. Dans cet exemple, 2 correspond au numéro d'hôte SCSI, 0 au bus SCSI, 8 à l'ID cible SCSI et 0 à l'unité logique. 7. Retournez à la ligne de commande. 8. Entrez la commande suivante : Kernel Messages Log Select one of the following: 1. View most recent ipr error messages 2. View ipr error messages 3. View all kernel error messages 4. View iprconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. View ipr boot time messages Selection: e=Exit Figure 58. Kernel Messages Log Unités de disques 67 ls -ld /sys/class/scsi_host/host#/device Où # correspond au numéro d'hôte SCSI. Appuyez sur Entrée. 9. Notez les informations d'emplacement PCI. Remarque : Les informations d'emplacement PCI ont le format 61:01:0:2. 10. Tapez iprconfig sur la ligne de commande et appuyez sur Entrée. L'écran Configuration Utility apparaît. 11. Sélectionnez Display hardware status dans l'écran Configuration Utility. Appuyez sur Entrée. L'écran Display Hardware Status apparaît. 12. Recherchez l'unité de disques dans l'emplacement PCI que vous avez noté. L'unité de disques peut avoir l'état Incident. 13. Si l'unité de disques à remplacer n'est pas protégée ou si elle est en cours d'utilisation, transférez les données de l'unité de disques avant de poursuivre la procédure. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. 14. Tapez l'option 1 (affichage des informations de ressource matérielle) à côté de l'unité de disques à remplacer. Appuyez sur Entrée. Un écran Disk Hardware Resource Information Details similaire à l'exemple suivant apparaît. Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 59. Exemple d'écran Display Hardware Status 68 Unités de disques 15. Notez les informations d'emplacement physique. 16. Revenez à l'écran Configuration Utility. Retrait d'une unité de disques : Procédure 1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes suivantes. 2. Dans l'écran Configuration Utility, sélectionnez Work with disk unit recovery. Appuyez sur Entrée. 3. Dans l'écran Work with Disk Unit Recovery, sélectionnez Concurrent remove device. Appuyez sur Entrée. Un écran Concurrent Device Remove similaire à l'écran suivant apparaît. 4. Tapez l'option 1 (sélection) à côté de l'unité de disques à remplacer. Appuyez sur Entrée. 5. L'écran Verify Device Concurrent Remove apparaît. Le voyant de maintenance simultané correspondant à l'emplacement choisi s'allume. Pour afficher l'emplacement des voyants de maintenance simultanée, sélectionnez le modèle approprié dans la liste suivante : Disk Unit Hardware Resource Information Details Manufacturer . . . . . . . . . . . . . . : IBM Product ID . . . . . . . . . . . . . . . : ST336607LC Firmware Version . . . . . . . . . . . . : 43353048 (C50H) Serial Number. . . . . . . . . . . . . . : 00006719 Capacity . . . . . . . . . . . . . . . . : 36.40 GB Resource Name. . . . . . . . . . . . . . : /dev/sdd Physical location PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0 SCSI Host Number . . . . . . . . . . . . : 2 SCSI Channel . . . . . . . . . . . . . . : 0 SCSI Id. . . . . . . . . . . . . . . . . : 4 SCSI Lun . . . . . . . . . . . . . . . . : 0 Extended Details FRU Number . . . . . . . . . . . . . . . : 00P2676 More... Press Enter to Continue e=Exit q=Cancel f=PageDn b=PageUp Figure 60. Exemple d'écran Disk Hardware Resource Information Concurrent Device Remove Choose a single location for remove operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- sdc 0000:41:01.0.1/0:8:0 Physical Disk Active 0000:41:01.0.1/0:5: Empty sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sda 0000:41:01.0.1/0:3:0 Physical Disk Active 0000:41:01.0.1/1:8: Empty sdf 0000:41:01.0.1/1:5:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active 0001:61:01.0.2/0:8: Empty sdh 0001:61:01.0.2/0:9:0 Physical Disk Active sdg 0001:61:01.0.2/0:3:0 Physical Disk Active Figure 61. Exemple d'écran Concurrent Device Remove Unités de disques 69 v «Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers d'unité SCSI 31D/24 et 31T/24», à la page 98 6. Dans l'écran Verify Device Concurrent Remove, vérifiez que l'unité de disques correspond bien à celle à remplacer et appuyez sur Entrée. Le voyant d'identification de l'unité de disques s'allume. Remarque : Vérifiez que l'unité de disques n'est pas en cours d'utilisation afin de ne pas perdre des données. 7. L'écran Device Concurrent Remove apparaît. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Appuyez sur la poignée de l'unité de disques et tirez-la vers vous pour retirer l'unité de disques (voir figure 62). Si la poignée n'est pas complètement sortie, vous ne pouvez pas retirer l'unité de disques de l'unité centrale ou de l'unité d'extension. 9. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 10. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme suit : Dévissez et retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques (voir figure suivante). Figure 62. Retrait d'une unité de disques d'un modèle ou 11D/20 70 Unités de disques 11. Appuyez sur Entrée dans l'écran Configuration Utility pour indiquer que vous avez retiré l'unité de disques. Remplacement d'une unité de disques : Procédure 1. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 2. Retirez l'unité de disques de son emballage protecteur. 3. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme suit pour l'installer : a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure 64, à la page 72). b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de disques. Figure 63. Retrait du panneau frontal d'une unité de disques Unités de disques 71 5. Dans l'écran Configuration Utility, sélectionnez Work with disk unit recovery. Appuyez sur Entrée. 6. Dans l'écran Work with Disk Unit Recovery, sélectionnez Concurrent add device. Appuyez sur Entrée. Un écran Concurrent Device Add similaire à l'exemple suivant apparaît. 7. Tapez l'option 1 (sélection) à côté de l'emplacement de l'unité de disques que vous avez retirée. L'écran Verify Device Concurrent Add apparaît. 8. Appuyez sur Entrée dans l'écran Verify Device Concurrent Add. L'écran Complete Device Concurrent Add apparaît. 9. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 10. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis bloquez l'unité dans son emplacement en poussant la poignée (voir figure 66, à la page 73). Figure 64. Fixation d'un panneau frontal à une unité de disques Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 65. Exemple d'écran Concurrent Device Add 72 Unités de disques 11. Appuyez sur Entrée dans l'écran Complete Device Concurrent Add pour indiquer que l'unité de disques est installée. Suivez la procédure : Procédure 1. Régénérez les données sur la nouvelle unité de disques. Pour plus d'informations, voir «Régénération des données d'une unité de disques de remplacement sous Linux», à la page 101. 2. Remettez en place le panneau avant. 3. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire. Concepts associés Retrait et remplacement de portes et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Remplacement d'une unité de disques sur un système ou une partition logique hors tension Il peut être nécessaire d'installer une unité de disques SCSI (Small Computer System Interface) ou SAS (serial-attached Attached SCSI) sur un système ou une partition logique contrôlant l'emplacement de l'unité de disques une fois l'alimentation coupée. Exécutez la procédure décrite dans cette section pour effectuer cette tâche. Avant de commencer Avertissement : Si vous installez une unité de disque nouvelle ou mise à niveau, voir «Installation d'une unité de disques ou d'une unité SSD sur un système ou une partition logique hors tension», à la page 37. Si vous retirez une unité de disque dans le cadre d'une procédure de maintenance, continuez de suivre la procédure suivante. Figure 66. Installation d'une unité de disques sur un modèle ou 11D/20 Unités de disques 73 Pourquoi et quand exécuter cette tâche DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) Préparation du système Procédure 1. Effectuez les tâches de configuration prérequises décrites dans la section «Avant de commencer», à la page 143. 2. Si vous ne pouvez pas démarrer le système ou la partition logique qui contrôle l'emplacement de l'unité de disques ou que le système ou la partition logique est hors tension, voir étape 6, à la page 75. 3. Recherchez l'unité de disques à remplacer et notez les informations d'emplacement. Pour plus d'informations, voir «Identification d'un composant défaillant», à la page 146. 74 Unités de disques 4. Préparez-vous à retirer l'unité de disques de l'unité centrale ou de l'unité d'extension, comme suit : v Si le système ou la partition logique qui contrôle l'emplacement de l'unité de disques exécute AIX, exécutez la procédure de la section «Préparation du retrait d'une unité de disques d'une unité centrale ou d'extension contrôlée par le système d'exploitation AIX», à la page 100. v Si le système ou la partition logique qui contrôle l'emplacement de l'unité de disques exécute Linux et que l'unité de disques à remplacer est protégée ou en cours d'utilisation, transférez les données de l'unité de disques avant de poursuivre la procédure. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. 5. Arrêtez le système ou la partition qui contrôle l'emplacement de l'unité de disques. Pour obtenir des instructions, voir «Arrêt d'un système ou d'une partition logique», à la page 153. 6. Si vous avez arrêté la partition logique, ne déconnectez pas la source d'alimentation. Si vous avez arrêté le système, débranchez-le de la source d'alimentation. Remarque : Le système est peut-être équipé d'un second bloc d'alimentation. Avant de continuer la procédure, vérifiez que toutes les sources d'alimentation ont été débranchées du système. (L003) ou 7. Retirez le volet de l'unité centrale ou de l'unité d'extension ou ouvrez le volet avant de l'armoire. Pour plus d'informations, sélectionnez la procédure appropriée dans la liste suivante : v «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 8. Retirez le panneau avant. Retrait d'une unité de disques Procédure 1. Si vous retirez une unité SSD située sur un adaptateur PCIe RAID et SSD SAS, reportez-vous à Retrait et remplacement de l'adaptateur PCIe RAID et SSD SASRetrait et remplacement de l'adaptateur PCIe RAID et SSD SAS et suivez la procédure, puis revenez ici et effectuez les étapes suivantes. Unités de disques 75 2. Déverrouillez la poignée de l'unité de disques en appuyant dessus et en la tirant vers vous (voir figure 67). Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. Figure 67. Retrait d'une unité de disques sur un modèle ou 11D/20 76 Unités de disques 3. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme suit : Dévissez puis retirez la vis moletée (B) pour dégager le panneau frontal (C) de l'unité de disques (voir figure suivante). Important : Si le panneau frontal de l'unité de disques n'est pas doté d'une vis moletée, il s'agit d'une unité de disques SAS non amovible. L'unité de remplacement doit alors être pourvue d'un panneau frontal fixé. Figure 68. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 Unités de disques 77 Remplacement d'une unité de disques Procédure 1. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 2. Retirez l'unité de disques de son emballage protecteur. 3. Déverrouillez la poignée de l'unité de disques en la tirant vers vous avant d'installer l'unité de disques. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans l'unité centrale ou l'unité d'extension. 4. Si l'unité de disques de remplacement n'est pas déjà équipée d'un panneau frontal, procédez comme suit pour l'installer : a. Fixez le panneau frontal (A) sur l'unité de disques (voir figure suivante). b. Appuyez sur la vis moletée (B), puis serrez la vis pour maintenir le panneau frontal sur l'unité de disques. Figure 69. Retrait du panneau frontal d'une unité de disques 78 Unités de disques 5. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. 6. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension. Remarque : Lors de l'installation d'une unité de disques, il est important de s'assurer que l'unité est complètement et parfaitement insérée dans le système. 7. Poussez complètement la poignée de l'unité de disques (voir figure 71, à la page 80). Figure 70. Fixation d'un panneau frontal à une unité de disques Unités de disques 79 Fin de la procédure Procédure 1. Remettez en place le panneau avant. 2. Installez ou fermez le volet avant du système, de l'unité d'extension ou de l'armoire. 3. Rebranchez tous les câbles d'alimentation système. 4. Reconnectez le système à l'alimentation. Figure 71. Installation d'une unité de disques sur un modèle ou 11D/20 Figure 72. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 80 Unités de disques 5. Démarrez le système ou la partition logique. Pour obtenir des instructions, voir «Démarrage du système ou de la partition logique», à la page 151. 6. Si le système ou la partition logique qui contrôle l'unité de disques remplacée fonctionne sous AIX, vérifiez que l'unité fonctionne (voir «Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX», à la page 195. Si vous ne pouvez pas effectuer de diagnostics en ligne, par exemple, si vous avez remplacé l'unité de disques du groupe de volumes racine (rootvg), réalisez des diagnostics autonomes (voir «Vérification d'un composant installé via les programmes de diagnostic autonomes», à la page 198. 7. Pour régénérer les données sur la nouvelle unité de disques, sélectionnez le système d'exploitation du système ou de la partition logique qui contrôle l'unité de disques : v «Régénération des données d'une unité de disques de remplacement sous AIX», à la page 101 v «Régénération des données d'une unité de disques de remplacement sous Linux», à la page 101 Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Remplacement d'une unité de disques SCSI Découvrez comment remplacer une unité de disques SCSI (Small Computer System Interface) dans une unité centrale ou d'extension. La procédure décrite dans cette information permet d'effectuer cette tâche. Retrait d'une unité de disques Procédure 1. Connectez-vous en tant que superutilisateur. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Appuyez sur Entrée pour continuer. 4. Dans l'écran de sélection de fonction, sélectionnez Sélection des tâches. 5. Dans l'écran de sélection des tâches, sélectionnez Remplacement à chaud. 6. Sélectionnez Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud. 7. Sélectionnez Identification d'une unité connectée à une unité SES. 8. Sélectionnez l'unité de disques à remplacer et appuyez sur Entrée. L'emplacement de l'unité de disques passe en mode Identification, indiqué par le voyant clignotant de maintenance simultanée. Pour afficher l'emplacement des voyants de maintenance simultanée, consultez «Emplacements des unités de disque et voyants de maintenance simultanée», à la page 90. 9. Revenez à l'écran Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud. 10. Dans cet écran, sélectionnez Remplacement/Retrait d'une unité connectée à une unité SES. La liste de tous les emplacements contenant une unité de disques pouvant être retirée s'affiche. 11. Sélectionnez l'emplacement qui contient l'unité de disques à retirer. L'emplacement de l'unité de disques passe en mode Retrait et son alimentation est coupée. Le voyant de maintenance simultanée de l'emplacement commence à clignoter rapidement. 12. Déverrouillez l'unité de disques. Déverrouillez la poignée de l'unité de disques en appuyant dessus et en la tirant vers vous (voir figures suivantes). Unités de disques 81 13. Maintenez l'unité de disques par-dessous lorsque vous la faites glisser hors de l'unité centrale ou de l'unité d'extension. Ne la tenez pas par la poignée. Remarque : Le voyant de maintenance simultanée de l'emplacement s'éteint lorsque vous retirez l'unité de disques. 14. Dévissez et retirez la vis moletée B pour dégager le panneau frontal C de l'unité de disques. Figure 73. Retrait d'une unité de disques sur un modèle ou 11D/20 Figure 74. Retrait d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 82 Unités de disques Remplacement d'une unité de disques Procédure 1. Prenez l'emballage contenant la nouvelle unité de disques. Avertissement : Les unités de disques sont fragiles. Manipulez-les avec précaution. 2. Retirez l'unité de disques de son emballage protecteur. 3. Déverrouillez la poignée de l'unité de disques de remplacement en la tirant vers vous. Si la poignée n'est pas complètement sortie, vous ne pouvez pas insérer l'unité de disques dans le système. 4. Fixez le panneau frontal de l'unité de disques A sur l'unité de disques de remplacement (voir figure suivante). Figure 75. Retrait du panneau frontal d'une unité de disques Unités de disques 83 5. Appuyez sur la vis moletée B, puis serrez la vis pour maintenir le panneau frontal A en place sur l'unité de disques. 6. Pour installer l'unité de disques, procédez comme suit : a. Maintenez l'unité de disques par-dessous lorsque vous l'alignez sur les rails de guidage de l'unité centrale. Ne la tenez pas par la poignée. b. Faites glisser complètement l'unité de disques dans l'unité centrale. Remarque : Lors de l'installation d'une unité de disques, il est important de s'assurer que l'unité est complètement et parfaitement insérée dans le système. c. Poussez complètement la poignée de l'unité de disques (voir figure 77, à la page 85). Figure 76. Fixation d'un panneau frontal à une unité de disques 84 Unités de disques 7. Appuyez sur Entrée. 8. Sélectionnez le menu Gestionnaire d'unités SCSI et RAID SCSI remplaçables à chaud. 9. Sélectionnez Configuration des unités ajoutées/remplacées. Appuyez sur Entrée. 10. Sélectionnez l'unité de disques remplacée et appuyez sur Entrée. L'unité de disques est désormais configurée. 11. Passez à l'écran de sélection des tâches. Figure 77. Installation d'une unité de disques sur un modèle ou 11D/20 Figure 78. Installation d'une unité de disques à encombrement réduit sur l'unité d'extension 5803 Unités de disques 85 12. Sélectionnez Consignation d'action corrective. 13. Sélectionnez l'unité de disques remplacée et appuyez sur Entrée. 14. Sélectionnez Validation après la sélection et appuyez sur Entrée. 15. Retournez à la ligne de commande. Suivez la procédure Procédure 1. Remettez en place le panneau avant. 2. Installez ou fermez le volet avant de l'unité système, de l'unité d'extension ou de l'armoire. 3. Vérifiez si l'unité de disques fonctionne. Pour obtenir des instructions, consultez la rubrique «Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX», à la page 195. Si vous ne pouvez pas effectuer de diagnostics AIX en ligne, par exemple, si vous avez remplacé l'unité de disques du groupe de volumes racine (rootvg), réalisez des diagnostics autonomes (voir «Vérification d'un composant installé via les programmes de diagnostic autonomes», à la page 198. 4. Pour préparer l'unité de disques, voir «Régénération des données d'une unité de disques de remplacement sous AIX», à la page 101. Concepts associés Retrait et remplacement de volets et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Retrait ou installation d'un obturateur d'unité de disques Il se peut que vous deviez retirer ou installer un obturateur d'unité de disques SCSI (Small Computer System Interface) dans le serveur ou l'unité d'extension. La présente section explique comment effectuer ces tâches. Remarque : Installez une autre unité de disques ou un obturateur d'unité de disques dans les emplacements des unités de disques. Le fait de remplir l'emplacement d'unité de disques permet d'assurer une ventilation correcte pour le refroidissement et de garantir une conformité EMI optimale. Retrait d'un obturateur d'unité de disques Informations sur le retrait d'un obturateur d'unité de disques d'un serveur ou d'une unité d'extension. La procédure décrite dans cette information permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Remarque : Certaines figures accompagnant cette procédure peuvent ne pas correspondre exactement à votre unité centrale ou à votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Procédure 1. Retirez le volet de l'unité centrale ou de l'unité d'extension ou ouvrez l'armoire. Pour plus d'informations, voir «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156. 2. Retirez le panneau avant. Pour plus d'informations sur le retrait du capot, voir «Carters système», à la page 157.Si vous avez un panneau avant d'un autre modèle, retirez-le. 3. Identifiez l'obturateur d'unité de disques à retirer. 86 Unités de disques 4. Tirez l'obturateur d'unité de disques de son emplacement. Tirez l'obturateur d'unité de disques pour le détacher de l'emplacement approprié (voir figure 79 et figure 80, à la page 88). Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. Figure 79. Retrait d'un obturateur d'unité de disques d'un modèle 33E/8B ou 36E/8C Unités de disques 87 Installation d'un obturateur d'unité de disques Il se peut que vous deviez installer un obturateur d'unité de disques dans le serveur ou l'unité d'extension. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Remarque : Certaines figures accompagnant cette procédure peuvent ne pas correspondre exactement à votre unité centrale ou à votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Procédure 1. Retirez le volet de l'unité centrale ou de l'unité d'extension ou ouvrez l'armoire. Pour plus d'informations, voir «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156. 2. Retirez le carter avant. Pour plus d'informations sur le retrait du capot, voir «Carters système», à la page 157. Si vous avez un panneau avant d'un autre modèle, retirez-le. 3. Installez l'obturateur d'unité de disques dans l'unité centrale ou l'unité d'extension (voir figures suivantes). Figure 80. Retrait d'un obturateur d'unité de disques d'un modèle 17M/MB ou 79M/HB 88 Unités de disques 4. Réinstallez le panneau avant. 5. Installez ou fermez le volet avant du système, de l'unité d'extension ou de l'armoire. Figure 81. Installation d'un obturateur d'unité de disques d'un modèle 33E/8B ou 36E/8C Figure 82. Installation d'un obturateur d'unité de disques d'un modèle 17M/MB ou 79M/HB Unités de disques 89 Concepts associés Retrait et remplacement de portes et capots Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Référence associée «Retrait et remplacement du volet ou panneau d'unité d'extension», à la page 156 Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Emplacements des unités de disque et voyants de maintenance simultanée Cette rubrique présente les emplacements des unités de disque et les voyants de maintenance simultanée sur différents modèles. Repérez votre unité centrale ou votre unité d'extension pour identifier les emplacements des unités et des voyants de maintenance simultanée. Emplacements de l'adaptateur PCIe RAID et SSD SAS Informations sur les emplacements de l'adaptateur PCIe RAID et SSD SAS sur l'emplacement des voyants de maintenance simultanée sur l'adaptateur PCIe RAID et SSD SAS La figure figure 83 montre les emplacements de l'adaptateur PCIe RAID et SSD SAS. Emplacements des unités de disques et voyants de maintenance simultanée pour les systèmes 02E/4B et 05F/6B Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée sur le système. Figure 83. Emplacements de l'adaptateur PCIe RAID et SSD SAS 90 Unités de disques Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance simultanée pour les modèles 02E/4B ou 05F/6B. Les voyants de maintenance simultanée sont situés au-dessus de la poignée de verrouillage des unités de disques SAS. Figure 84. Emplacements des unités de disques sur le modèle d'armoire Figure 85. Emplacements des unités de disques sur le modèle de tour Unités de disques 91 Emplacements des unités de disque et voyants de maintenance simultanée pour 31E/2B Informations sur les emplacements des unités de disques et des voyants de maintenance simultanée sur le système. Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance simultanée pour le modèle 31E/2B. Les voyants de maintenance simultanée sont situés au-dessus de la Figure 86. Emplacement des voyants de maintenance simultanée sur le modèle d'armoire Figure 87. Emplacement des voyants de maintenance simultanée sur le modèle de tour 92 Unités de disques poignée de verrouillage des unités de disques SAS. Figure 88. Emplacements des unités de disque sur un modèle de base avec six baies SFF Figure 89. Emplacements des unités de disque sur un modèle avec trois baies SFF (facultatif) Figure 90. Emplacements des voyants de maintenance simultanée sur un modèle de base avec six baies SFF Unités de disques 93 Emplacements des unités de disques et voyants de maintenance simultanée sur les modèles 33E/8B et 36E/8C Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée sur le système. Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance simultanée sur les modèles 33E/8B et 36E/8C. Les voyants de maintenance simultanée sont situés au-dessus de la poignée de verrouillage des unités de disques SAS. Figure 91. Emplacements des voyants de maintenance simultanée sur un modèle de base avec trois baies SFF (facultatif) 94 Unités de disques Emplacements des unités de disques et voyants de maintenance simultanée pour les modèles 17M/MB et 79M/HB Cette rubrique décrit les emplacements des unités de disques dans le système. Les figures suivantes illustrent les emplacements des unités de disques et les voyants de maintenance simultanée pour les modèles 17M/MB et 79M/HB. Figure 92. Emplacements des unités de disques Figure 93. Emplacement des voyants de maintenance simultanée Unités de disques 95 Emplacements des unités de disques et voyants de maintenance simultanée sur le boîtier du disque 58/86 Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée sur le boîtier du disque. La figure suivante illustre les emplacements des unités de disques sur le modèle 58/86. Figure 94. Emplacements des unités de disques Figure 95. Emplacement des voyants de maintenance simultanée 96 Unités de disques La figure suivante représente les emplacements des voyants de maintenance simultanée des unités de disques sur le boîtier d'unité de disques 58/86. Figure 96. Emplacements des unités de disques pour le boîtier du modèle 58/86 Figure 97. Vue de face des voyants de maintenance simultanée sur le boîtier d'unité de disques 58/86 Unités de disques 97 Emplacements des unités de disques et voyants de maintenance simultanée pour les boîtiers d'unité SCSI 31D/24 et 31T/24 Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée sur le boîtier de l'unité de disque. La disponibilité des emplacements d'unités de disques dépend du nombre de cartes d'interface installées dans le boîtier d'unités de disques SCSI. Si l'emplacement disponible suivant d'unité de disques correspond au premier emplacement dans un boîtier, tel que D21, D26 ou D46, vérifiez pour le premier contrôleur que le boîtier est connecté à un port de bus SCSI. L'extrémité du câble du contrôleur mentionne clairement le boîtier d'unités de disques auquel il se connecte. Si le boîtier n'est pas connecté à un contrôleur d'unité de disques, installez l'unité de disques dans un autre boîtier ou connectez le boîtier à un port de bus SCSI sur un contrôleur. Vous devrez peut-être ajouter un contrôleur d'unité de disques pour pouvoir ajouter des unités de disques à l'unité d'extension. Les figures suivantes illustrent les emplacements des unités de disques à l'avant et à l'arrière des boîtiers d'unités de disques SCSI 31D/24 et 31T/24. Figure 98. Vue de face des boîtiers d'unités de disques SCSI 31D/24 et 31T/24 98 Unités de disques Les figures suivantes illustrent l'emplacement des voyants de maintenance simultanée A du boîtier d'unités de disques SCSI 31D/24 et 31T/24. Figure 99. Vue arrière des boîtiers d'unités de disques SCSI 31D/24 et 31T/24 Figure 100. Vue de face des voyants de maintenance simultanée sur les boîtiers d'unités de disques 31D/24 et 31T/24 Figure 101. Vue arrière des voyants de maintenance simultanée des boîtiers d'unités de disques SCSI 31D/24 et 31T/24 Unités de disques 99 Emplacements des unités de disques et voyants de maintenance simultanée sur une unité d'extension 58/02 Cette section décrit les emplacements des unités de disques et des voyants de maintenance simultanée sur l'unité d'extension. La figure suivante illustre les emplacements des unités de disques sur l'unité d'extension 58/02. Les voyants de maintenance simultanée se trouvent au bas de la poignée de verrouillage de chaque unité de disques. Information associée Câblage du sous-système du disque 5802 Informations connexes Cette section apporte d'autres informations et instructions associées aux unités de disques dont vous pourriez avoir besoin. Préparation d'une unité de disques dans un système ou une partition logique AIX Configurez la nouvelle unité de disques en fonction de votre environnement. Pourquoi et quand exécuter cette tâche Préparation du retrait d'une unité de disques d'une unité centrale ou d'extension contrôlée par le système d'exploitation AIX Les procédures suivantes expliquent comment retirer correctement une unité de disques. Pourquoi et quand exécuter cette tâche Avant de retirer une unité de disques sur une unité centrale ou une unité d'extension contrôlée par le système d'exploitation AIX, vérifiez que toutes les données sur cette unité ou sur le module où elle figure sont sauvegardées et que l'unité se trouve à l'état Defined. Figure 102. Emplacements des unités de disques 100 Unités de disques Régénération des données d'une unité de disques de remplacement sous AIX Pour régénérer des données sur la nouvelle unité de disques, consultez les informations du contrôleur auquel l'unité de disques est connectée. Pourquoi et quand exécuter cette tâche v Si l'unité de disques est connectée à un contrôleur RAID PCI-X SCSI, voir PCI-X SCSI RAID Controller Reference Guide for AIX. v Si l'unité de disques est connectée à une carte PCI SCSI RAID (code dispositif 2498), voir PCI 4–Channel Ultra3 SCSI RAID Adapter Reference Guide. Pour plus d'informations, voir AIX System Management Guide: Operating System and Devices. Régénération des données d'une unité de disques de remplacement sous Linux Utilisez cette procédure pour régénérer les données de l'unité de disque de remplacement. Pourquoi et quand exécuter cette tâche Pour reconstruire les données sur l'unité de disque de remplacement, effectuez les étapes ci-après. Pour une unité de disques non protégée Si l'unité de disques que vous remplacez se trouve dans une grappe de disques RAID de niveau 0 ou dans une grappe de disques RAID de niveau 5 ou 10 défaillant, procédez comme suit : Procédure 1. Recréez le module de disques. 2. Recréez les systèmes de fichiers dans le module de disques. 3. Copiez les données vers le module de disques restaurés depuis le support de sauvegarde. Résultats Pour plus d'informations sur ces tâches, voir PCI-X SCSI RAID Controller Reference Guide for Linux. . Pour une unité de disques dans une grappe de disques RAID de niveau 5 ou 10, sélectionnez la procédure appropriée : Régénération de données sur une unité de disques utilisant le noyau 2.4 Linux Effectuez les étapes ci-après pour régénérer les données. Pourquoi et quand exécuter cette tâche Si un disque de secours était disponible et protégeait le module de disques pendant toute la durée de la défaillance de l'unité de disques, vous pouvez configurer l'unité de disques que vous venez d'installer comme disque de secours. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. . Si la régénération des données sur un disque de secours n'a pas été initialisée par le contrôleur, vous devez la lancer sur la nouvelle unité de disques installée. Pour ce faire, procédez comme suit : Unités de disques 101 Procédure 1. Connectez-vous en tant que superutilisateur. 2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran Work with Disk Units apparaît. 3. Dans l'écran Work with Disk Units, sélectionnez Work with disk unit recovery. L'écran Work with Disk Unit Recovery apparaît. 4. Dans l'écran Work with Disk Unit Recovery, sélectionnez Rebuild disk unit data. Un écran Rebuild Disk Unit Data similaire à la figure suivante apparaît. 5. Tapez l'option 1 (régénération) à côté de l'unité de disques à reconstruire et appuyez sur Entrée. Remarque : La régénération sur une unité de disques remplace les données qui figure sur l'unité. Un écran Confirm Rebuild Disk Unit Data similaire à la figure suivante apparaît. 6. Pour confirmer que vous voulez régénérer les données sur l'unité de disques, appuyez sur Entrée. Le message annonçant que la régénération a commencé apparaît au bas de l'écran Work with Disk Unit Recovery. L'opération de régénération peut prendre plusieurs minutes. Régénération de données sur une unité de disques utilisant le noyau 2.6 Linux Les procédures suivantes expliquent comment régénérer des données sur une unité de disques. Rebuild Disk Unit Data Select the units to be rebuilt Type choice, press Enter. 1=Rebuild Vendor Product Serial PCI PCI SCSI SCSI SCSI Option ID ID Number Bus Dev Bus ID Lun IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0 e=Exit q=Cancel f=PageDn b=PageUp Figure 103. Exemple d'écran Rebuild Disk Unit Data Confirm Rebuild Disk Unit Data Rebuilding the disk unit data could take several minutes for each unit selected. Press Enter to confirm having the data rebuilt. Press q=Cancel to return and change your choice. Vendor Product Serial PCI PCI SCSI SCSI SCSI Option ID ID Number Bus Dev Bus ID Lun 1 IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0 q=Cancel f=PageDn b=PageUp Figure 104. Exemple d'écran Confirm Rebuild Disk Unit Data 102 Unités de disques Pourquoi et quand exécuter cette tâche Si un disque de secours était disponible et protégeait le module de disques pendant toute la durée de la défaillance de l'unité de disques, vous pouvez configurer l'unité de disques que vous venez d'installer comme disque de secours. Pour plus d'informations, voir PCI-X SCSI RAID Controller Reference Guide for Linux. Si la régénération des données sur un disque de secours n'a pas été initialisée par le contrôleur, vous devez la lancer sur la nouvelle unité de disques installée. Pour ce faire, procédez comme suit : Procédure 1. Connectez-vous en tant que superutilisateur. 2. Tapez iprconfig sur la ligne de commande de la session Linux et appuyez sur Entrée. L'écran Configuration Utility apparaît. 3. Sélectionnez Work with disk unit recovery sur l'écran Configuration Utility. Appuyez sur Entrée. L'écran Work with Disk Unit Recovery apparaît. 4. Dans l'écran Work with Disk Unit Recovery, sélectionnez Rebuild disk unit data. Un écran Rebuild Disk Unit Data similaire à la figure suivante apparaît. 5. Tapez l'option 1 (régénération) à côté de l'unité de disques à reconstruire et appuyez sur Entrée. Remarque : La régénération sur une unité de disques remplace les données qui figure sur l'unité. Un écran Confirm Rebuild Disk Unit Data similaire à la figure suivante apparaît. Rebuild Disk Unit Data Select the disks to be rebuilt Type choice, press Enter. 1=Rebuild OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ------------- 1 0000:58:01.0.0/0:4:0 RAID Array Member Failed e=Exit q=Cancel t=Toggle Figure 105. Exemple d'écran Rebuild Disk Unit Data Confirm Rebuild Disk Unit Data Rebuilding the disk unit data might take several minutes for each disk selected. Press Enter to confirm having the data rebuilt. q=Cancel to return and change your choice. OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- --------------- 1 0000:58:01.0.0/0:4:0 RAID Array Member Failed q=Cancel t=Toggle Figure 106. Exemple d'écran Confirm Rebuild Disk Unit Data Unités de disques 103 6. Pour confirmer que vous voulez régénérer les données sur l'unité de disques, appuyez sur Entrée. Le message annonçant que la régénération a commencé apparaît au bas de l'écran Work with Disk Unit Recovery. L'opération de régénération peut prendre plusieurs minutes. Partage d'unités de disques internes sur le modèle 02E/4B ou 05F/6B Découvrez comment diviser des disques internes installés dans le boîtier de l'unité centrale en groupes pouvant être gérés séparément. Utilisez les informations suivantes pour exécuter la tâche. Avant de commencer Le fond de panier DASD prend en charge le mode de division DASD. La carte d'activation de division DASD FC 5631 est utilisée pour mettre le fond de panier DASD en mode de division DASD dans lequel 3 SFF DASD sur la gauche (vue de face) sont affectés à la carte d'activation de division DASD (CCIN 2BE0) et 3 SFF DASD sur la droite sont affectés au contrôleur de stockage intégré (CCIN 572C). Un adaptateur SAS PCIe ou PCIX (tel que FC 5901, FC5904 ou FC5912) peut accéder aux 3 SFF DASD via un câble SAS AI externe, comme indiqué dans figure 107, à la page 105. Important : Cette configuration n'est pas prise en charge avec le code dispositif de fond de panier 5630. Utilisez les informations suivantes une fois l'adaptateur installé. Pour plus d'informations, voir Adaptateurs PCI. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143). Remarque : Un câble interne FC 3670 pour modèle 02E/4B ou 05F/6B doit être installé pour activer la configuration DASD partagé. Pour obtenir des instructions d'installation, voir «Installation du port SAS externe sur un modèle 02E/4B ou 05F/6B», à la page 109. Pourquoi et quand exécuter cette tâche Les tâches suivantes permettent de diviser des disques internes installés dans le boîtier de l'unité centrale en groupes pouvant être gérés séparément. Procédure 1. Arrêtez et mettez hors tension l'alimentation au système, voir Arrêt du système ou de la partition logique. 2. Câblez le système en connectant le câble SAS à la carte, puis à la connexion SAS interne (voir figure 107, à la page 105). 104 Unités de disques 3. Démarrez le système. Voir la rubrique «Démarrage du système ou de la partition logique», à la page 151. 4. Vérifiez que le composant est bien installé et fonctionne correctement. Voir la rubrique «Vérification du composant installé», à la page 195. Exemple Après l'exécution de cette procédure, trois disques du boîtier système sont gérés par la carte d'activation de division DASD et trois disques sont gérés par le système interne. Que faire ensuite Vous pouvez à présent gérer les disques de la même manière que n'importe quels autres disques. Partage d'unités de disques internes sur un modèle 33E/8B ou 36E/8C Cette rubrique vous explique la fonction de cette carte et comment la connecter. Suivez les instructions données ci-après pour effectuer ces tâches. Avant de commencer Le fond de panier DASD prend en charge le mode de division DASD. Un câble SAS interne FC 3669 est utilisé pour mettre le fond de panier DASD en mode de division DASD dans lequel 4 SFF DASD sur la gauche (vue de face) sont assignés au contrôleur SAS intégré et 4 SFF DASD sur la droite sont assignés au port SAS arrière interne. Un adaptateur SAS PCIe ou PCIX (tel que FC 5901, FC5904 ou FC5912) peut accéder aux 4 SFF DASD via un câble SAS AI externe, comme indiqué dans la figure qui suit. Remarques : Le second connecteur de l'adaptateur peut permettre de connecter un tiroir d'extension de support ou d'extension de disque. Figure 107. Connexion d'un adaptateur SAS aux emplacements de disque SAS internes via un port SAS externe système Unités de disques 105 Les informations fournies ci-après sont destinées à être utilisées une fois que la carte a été installée. Installez la carte, puis revenez à cette rubrique. Pour plus d'informations, consultez la rubrique relative aux cartes PCI (Cartes PCI). Si vous ne l'avez pas encore fait, passez en revue les tâches décrites dans la section «Avant de commencer», à la page 143. Remarque : Le câble interne FC3669 du modèle 33E/8B ou 36E/8C être installé pour activer cette configuration. Pour plus d'informations sur l'installation, voir Installation du port SAS externe sur un modèle 33E/8B ou 36E/8C Pourquoi et quand exécuter cette tâche Cette carte permet de diviser des disques internes installés dans le boîtier de l'unité centrale en groupes pouvant être gérés séparément. Procédure 1. Si ce n'est pas déjà fait, arrêtez et mettez le système hors tension. Voir la rubrique «Arrêt d'un système ou d'une partition logique», à la page 153. 2. Câblez le système en connectant le câble SAS à la carte, puis à la connexion SAS interne (voir figure suivante). 3. Démarrez le système. Voir la rubrique «Démarrage du système ou de la partition logique», à la page 151. Remarques : v Le second connecteur de l'adaptateur peut permettre de connecter un tiroir d'extension de support ou d'extension de disque. Figure 108. Connexion d'un adaptateur SAS aux emplacements de disque SAS internes via un port SAS externe système 106 Unités de disques 4. Vérifiez que le composant est bien installé et fonctionne correctement. Voir la rubrique «Vérification du composant installé», à la page 195. Exemple Une fois la procédure terminée, quatre des disques du boîtier système seront gérés par le contrôleur SAS PCI et quatre seront gérés par le contrôleur SAS intégré de l'unité centrale. Que faire ensuite Les disques peuvent à présent être gérés de la même manière que n'importe quels autres disques. Information associée Contrôleurs SAS RAID pour AIX sur les modèles 8233-E8B et 8236-E8C Partage d'unités de disques internes sur un modèle 17M/MB ou 79M/HB Cette rubrique vous explique la fonction de cette carte et comment la connecter. Suivez les instructions données ci-après pour effectuer ces tâches. Avant de commencer Passez en revue les modes des unités de disque et les emplacements d'unités pris en charge, comme indiqué dans le Tableau 1. La figure suivante présente l'emplacement des unités de disques sur les modèles 17M/MB et 79M/HB. Tableau 4. Modes des unités de disque et emplacements des unités prises en charge Mode de l'unité de disque Carte Port SAS Carte Câble SAS Emplacement de l'unité de disque Disques internes RAID Aucune Contrôleur RAID intégré double canal Aucune D1, D2, D3, D4, D5, et D6 Figure 109. Emplacements des unités de disques Unités de disques 107 Tableau 4. Modes des unités de disque et emplacements des unités prises en charge (suite) Mode de l'unité de disque Carte Port SAS Carte Câble SAS Emplacement de l'unité de disque Disques RAID internes et externes Dispositif de port externe SAS (FC1819) - Ensemble de câbles SAS pour port SAS Contrôleur RAID intégré double canal v Câble YI (FC3686) - Système au boîtier SAS, Contrôleur unique / Double accès - 1,5 mètre v Câble YI (FC3687) - Système au boîtier SAS, Contrôleur unique / Double accès - 3 mètres D1, D2, D3, D4, D5, et D6 Fond de panier deux sections Aucune v Contrôleur SAS intégré (SAS A) v Contrôleur SAS intégré (SAS B) Aucune v D1, D2 et D3 v D4, D5 et D6 Fond de panier trois sections Dispositif de port interne SAS (FC1815) - Câble SAS pour fond de panier trois sections v Contrôleur SAS intégré (SAS A) v Contrôleur SAS intégré (SAS B) v (FC5901) - Carte SAS PCIe Dual-x4 Câble AI(FC3679) - Carte à disque interne - 1 mètre v D1 et D4 v D2 et D5 v D3 et D6 Les informations suivantes sont destinées à être utilisées une fois que la carte de stockage FC5901 SAS a été installée. Installez la carte, puis revenez à cette rubrique. Pour plus d'informations, consultez la rubrique relative aux cartes PCI (Cartes PCI). Si vous ne l'avez pas encore fait, passez en revue les tâches décrites dans la section «Avant de commencer», à la page 143. Pourquoi et quand exécuter cette tâche Cette carte permet de diviser des disques internes installés dans le boîtier de l'unité centrale en groupes pouvant être gérés séparément. Procédure 1. Si ce n'est pas déjà fait, arrêtez et mettez le système hors tension. Voir la rubrique «Arrêt d'un système ou d'une partition logique», à la page 153. 2. Connectez un boîtier d'unité centrale en procédant comme suit : a. Branchez le câble au port SAS situé sur la cloison arrière du boîtier de l'unité centrale, au port supérieur de la carte de stockage SAS, comme illustré dans la figure suivante. Restriction : Le partage d'unité de disque interne n'est possible que lorsque le dispositif pour câble interne FC1815 connecte le fond de panier DASD et la cloison arrière du boîtier de l'unité centrale. En outre, la carte d'activation Cache RAID FC662 175 Mo - Adaptateur d'E-S double ne doit pas être installée. enablement card must not be installed. La carte de stockage SAS peut être installé dans n'importe quel autre emplacement la prenant en charge. 108 Unités de disques b. Fixez tout câble supplémentaire. 3. Démarrez le système. Voir la rubrique «Démarrage du système ou de la partition logique», à la page 151. 4. Vérifiez que le composant est bien installé et fonctionne correctement. Voir la rubrique «Vérification du composant installé», à la page 195. Exemple Lorsque cette fonction est installée, 2 des 6 disques du boîtier système peuvent être gérés par la carte de stockage SAS. Remarque : Le support de stockage amovible est toujours contrôlé par un contrôleur SAS distinct incorporé à la carte système. Que faire ensuite Les disques peuvent à présent être gérés de la même manière que n'importe quels autres disques. Installation du port SAS externe sur un modèle 02E/4B ou 05F/6B Suivez cette procédure pour installer un port SAS externe dans le cadre de la mise à niveau de votre fond de panier d'unité de disques ou pour activer le partage de disque interne. Avant de commencer Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143). Pourquoi et quand exécuter cette tâche Suivez les étapes ci-après pour installer un port SAS externe dans le serveur : Unités de disques 109 Procédure 1. Arrêtez le système. Pour obtenir des instructions pour l'arrêt du système, voir «Arrêt d'un système ou d'une partition logique», à la page 153. 2. Retirez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur d'adaptateur PCI. 3. Si nécessaire, retirez le câble SAS existant ou l'obturateur (voir figure 110). a. Appuyez sur les taquets latéraux et retirez le capuchon SAS (A) de la connexion SAS sur le fond de panier système. b. Ouvrez le serre-câble (B) sur la paroi latérale. Le serre-câble s'ouvre à partir du haut. c. Retirez le câble SAS (D) du système. 4. Installez le nouveau câble SAS (voir figure 111, à la page 111). a. Connectez le port externe à l'arrière du système jusqu'à ce qu'il s'enclenche. b. Ouvrez le serre-câble (C) sur la paroi latérale, et placez-y le câble (A). Le serre-câble s'ouvre à partir du haut. c. Refermez le serre-câble (C) et continuez à faire glisser le câble vers la connexion SAS sur le fond de panier système. d. Poussez le capuchon SAS (D) dans la connexion SAS sur le fond de panier système jusqu'à ce qu'il s'enclenche. Figure 110. Retrait du câble SAS 110 Unités de disques 5. Remplacez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur d'adaptateur PCI. 6. Démarrez le système. Voir http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/ crustartsys.htm Résultats Vous pouvez à présent associer d'autres fonctions SAS prises en charge à votre système. Installation du port SAS externe sur un modèle 31E/2B Suivez cette procédure pour installer un port SAS externe dans le cadre de la mise à niveau de votre fond de panier d'unité de disques ou pour activer le partage de disque interne. Avant de commencer Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143). Pourquoi et quand exécuter cette tâche Suivez les étapes ci-après pour installer le port SAS externe dans le serveur : Procédure 1. Arrêtez le système. Pour obtenir des instructions pour l'arrêt du système, voir «Arrêt d'un système ou d'une partition logique», à la page 153. 2. Retirez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur d'adaptateur PCI. 3. Si nécessaire, retirez le câble SAS existant ou l'obturateur (voir figure 112, à la page 112). Figure 111. Remplacement du câble SAS Unités de disques 111 a. Appuyez sur les taquets latéraux et retirez le capuchon SAS de la connexion SAS sur le fond de panier système. b. Ouvrez le serre-câble sur la paroi latérale et sortez le câble (B). Le serre-câble s'ouvre à partir du haut. c. Enfoncez le port SAS externe (A), puis sortez le câble SAS du système. 4. Installation du nouveau câble SAS. Pour plus d'informations, voir figure 113, à la page 113. a. Connectez le port externe à l'arrière du système (B) jusqu'à ce qu'il s'enclenche. b. Si nécessaire, ouvrez le serre-câble sur la paroi latérale, et placez-y le câble. Le serre-câble s'ouvre à partir du haut. c. Refermez le serre-câble et continuez à faire glisser le câble (A) sur la connexion SAS sur le fond de panier système. d. Poussez le capuchon SAS dans la connexion SAS (B) sur le fond de panier système jusqu'à ce qu'il s'enclenche. Figure 112. Retrait du câble SAS 112 Unités de disques 5. Remplacez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur d'adaptateur PCI. 6. Démarrez le système. Voir http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/ crustartsys.htm Résultats Vous pouvez à présent associer d'autres fonctions SAS prises en charge à votre système. Installation du port SAS externe des modèles 33E/8B ou 36E/8C Il peut être nécessaire d'installer un port SAS externe dans le cadre de la mise à niveau de votre fond de panier d'unité de disques ou d'activer le partage de disque interne. La procédure décrite dans cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Si votre système est géré par la console HMC, utilisez cette dernière pour installer la fonctionnalité SAS 3699 sur le serveur. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 194. Si vous ne disposez pas d'une console HMC, suivez les étapes ci-après pour installer la fonctionnalité SAS 3699 dans le serveur : Procédure 1. Effectuez les tâches de configuration prérequises (voir «Avant de commencer», à la page 143). 2. Arrêtez le système. Pour obtenir des instructions, voir «Arrêt d'un système ou d'une partition logique», à la page 153. Figure 113. Remplacement du câble SAS Unités de disques 113 3. Mettez le système en position de maintenance s'il est monté en armoire. Pour plus d'informations, voir «Mise en position de maintenance de l'unité 33E/8B ou 36E/8C», à la page 184. 4. Retirez le capot d'accès. Pour plus d'informations, voir «Retrait du capot d'accès des modèles 33E/8B ou 36E/8C», à la page 170. 5. Retirez le bloc processeur ou l'obturateur de bloc processeur situé à l'emplacement P1-C16 en soulevant la poignée A, puis en extrayant le bloc du système (voir figure suivante). Remarque : Il vous suffit de retirer un assemblage ou un obturateur en P1-C16. Pour retirer un obturateur, saisissez-le par son milieu et extrayez-le du système. 6. Retirez les ventilateurs et le capot de ventilation en procédant comme suit : a. Pour retirer les ventilateurs, appuyez sur la patte (A) et soulevez le ventilateur (B) du capot de ventilation, comme illustré dans la figure suivante. 114 Unités de disques b. Pour retirer les déflecteurs d'air, saisissez et ôtez-les du boîtier de ventilation, comme illustré dans la figure suivante. Figure 114. Retrait du ventilateur du boîtier de ventilation 33E/8B ou 36E/8C Figure 115. Retrait des déflecteurs d'air du boîtier de ventilation 33E/8B ou 36E/8C Unités de disques 115 c. Pour retirer le boîtier de ventilation, desserrez les deux vis moletées (A) et ôtez le boîtier de ventilation (B) du système, comme illustré dans la figure suivante. 7. Faites passer la partie avant du câble doté du connecteur mâle à travers l'ouverture entre le boîtier du processeur et le côté du système. Conseil : Pour effectuer les étapes suivantes, reportez-vous à la figure 117, à la page 117. 8. Insérez les broches (A) situées sur le côté du système dans les emplacements (B) de l'ensemble de câble SAS. 9. Faites de nouveau glisser l'ensemble de câble jusqu'à ce que les broches (A) soient insérées et le loquet (C) en place. Conseil : Le port situé à l'arrière de l'ensemble de câble doit être inséré dans l'emplacement à l'arrière du système. 10. Fixez le support de câble (H) en plaçant l'encoche du support au-dessus de la colonne située sur le côté du système, alignez le trou de la vis (I) avec celui qui se trouve sur le côté du système, puis fixez le support à l'aide de la vis. Remarque : Le support doit reposer confortablement sur l'ensemble de câble SAS. Figure 116. Retrait du boîtier de ventilation du système 33E/8B ou 36E/8C 116 Unités de disques 11. Selon le fond de panier de l'unité de disques, effectuez l'une des opérations suivantes : Conseil : Pour effectuer cette étape, reportez-vous à la figure 118, à la page 118. v Si le raccord SAS est visible au centre du fond de panier de l'unité de disques (D), fixez le câble au raccord et passez à l'étape 12, à la page 118. v Si le capuchon de protection du câble SAS (E) ressort du fond de panier de l'unité de disques, passez à l'étape 11a. Figure 117. Fixation du câble SAS Unités de disques 117 a. Recherchez le capuchon de protection du câble SAS (E) sur le fond de panier de l'unité de disques, puis retirez du système la partie arrière du capuchon pour dégager celui-ci. b. Retirez le capuchon du port SAS. c. Fixez le capuchon au connecteur SAS mâle (F) de l'ensemble de câble en la faisant glisser par-dessus le connecteur de câble (F) depuis l'arrière. Le connecteur SAS du câble doit tenir dans les deux emplacements situés à l'avant du capuchon, et il doit ressortir de celui-ci afin de pouvoir être inséré dans le port. d. Faites glisser le câble et le capuchon de protection (G) dans le port SAS jusqu'à ce qu'ils soient solidement fixés. 12. Reposez les ventilateurs et le boîtier de ventilation en procédant comme suit : a. Pour installer le boîtier de ventilation (A) dans le système, alignez les broches coudées avec les orifices, puis serrez les deux vis moletées (B), comme illustré dans la figure suivante. Figure 118. Raccordement du câble SAS 118 Unités de disques b. Pour installer les déflecteurs d'air, mettez-les en place dans le boîtier de ventilation, puis appuyez dessus jusqu'à ce qu'ils s'enclenchent, comme illustré dans la figure suivante. Figure 119. Installation du boîtier de ventilation dans le système 33E/8B ou 36E/8C Figure 120. Installation des déflecteurs d'air dans le boîtier de ventilation 33E/8B ou 36E/8C Unités de disques 119 c. Pour installer les ventilateurs, mettez-les en place dans le boîtier de ventilation, et appuyez dessus jusqu'à ce qu'ils s'enclenchent, comme illustré dans la figure suivante. 13. Installez le processeur système (B) et fermez les deux loquets de sécurité (C) comme indiqué à la figure suivante. Figure 121. Installation des ventilateurs dans le boîtier de ventilation 33E/8B ou 36E/8C 120 Unités de disques 14. Reposez le capot d'accès. Pour plus d'informations, consultez la rubrique «Installation du capot d'accès des modèles 33E/8B ou 36E/8C.», à la page 171. 15. Remettez le système en position de fonctionnement. Pour plus d'informations, voir «Placement de 33E/8B ou de 36E/8C en position de fonctionnement», à la page 187. Résultats Vous pouvez à présent associer les fonctions SAS prises en charge à votre système. Revenez à la procédure qui vous a amené vers cette page ou démarrez le système. Pour plus d'informations, reportez-vous à la rubrique «Démarrage du système ou de la partition logique», à la page 151. Installation du port SAS externe sur un modèle 17M/MB ou 79M/HB Suivez cette procédure pour installer un port SAS externe dans le cadre de la mise à niveau de votre fond de panier d'unité de disques ou pour activer le partage de disque interne. Pourquoi et quand exécuter cette tâche Suivez les étapes ci-après pour installer le port SAS externe dans le serveur : Procédure 1. Effectuez les tâches de configuration prérequises (voir «Avant de commencer», à la page 143). 2. Arrêtez le système. Voir «Arrêt d'un système ou d'une partition logique», à la page 153. 3. Retirez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur d'adaptateur PCI. 4. Si nécessaire, retirez le câble SAS existant ou l'obturateur. a. Appuyez sur les taquets latéraux et retirez le capuchon SAS (A) de la connexion SAS sur le fond de panier système. b. Ouvrez le serre-câble sur la paroi latérale et sortez le câble. Le serre-câble s'ouvre à partir du haut. c. Enfoncez le port SAS externe (B), puis sortez le câble SAS du système. Unités de disques 121 5. Installation du nouveau câble SAS. a. Connectez le port externe à l'arrière du système (B) jusqu'à ce qu'il s'enclenche. b. Si nécessaire, ouvrez le serre-câble sur la paroi latérale, et placez-y le câble. Le serre-câble s'ouvre à partir du haut. c. Refermez le serre-câble et continuez à faire glisser le câble sur la connexion SAS sur le fond de panier système. d. Poussez le capuchon SAS (A) dans la connexion SAS sur le fond de panier système jusqu'à ce qu'il s'enclenche. Figure 122. Retrait du câble SAS 122 Unités de disques 6. Remplacez toutes les cassettes d'adaptateur PCI. Voir Cassette de quatrième génération simple largeur d'adaptateur PCI. 7. Démarrez le système. Voir «Démarrage du système ou de la partition logique», à la page 151 Résultats Vous pouvez à présent associer d'autres fonctions SAS prises en charge à votre système. Remise en place d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension Les procédures suivantes expliquent comment remettre en place une carte d'extension SAS dans une unité d'extension 58/02 hors tension. Avant de commencer Examinez les conditions préalablement requises décrites dans la section «Avant de commencer», à la page 143. Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour remplacer la carte. Pour obtenir des instructions, voir «Retrait d'un composant via la console HMC», à la page 194 Procédure 1. Arrêtez le système. Pour obtenir des instructions, voir «Arrêt d'un système ou d'une partition logique», à la page 153. 2. Débranchez le système pour l'isoler de la source d'alimentation. Figure 123. Remplacement du câble SAS Unités de disques 123 Avertissement : Le système est peut-être équipé d'un second bloc d'alimentation. Vous devez couper l'alimentation du système en débranchant tous les cordons d'alimentation pour empêcher tout endommagement du système pendant cette opération. (L003) ou 3. Placez le connecteur de la carte (C) sur le raccord du fond de panier central (D) en vous aidant du guide (B) pour assurer un raccordement adéquat. Relevez ensuite le levier (A) pour insérer et fixer la carte d'extension SAS dans le fond de panier central, comme illustré dans la figure suivante. 124 Unités de disques 4. Reposez le fond de panier central et la carte d'extension SAS dans le système en procédant comme suit : a. Alignez le fond de panier central avec le guide et assurez-vous que l'emplacement (A) du fond de panier est fixé à la colonne (B) du système. b. Serrez les vis moletées (C) et (D) de la carte d'extension SAS. c. Serrez les vis moletées (E) de part et d'autre du fond de panier central. Figure 124. Installation de la carte d'extension SAS dans le fond de panier central Unités de disques 125 5. Fixez le cache du cordon d'alimentation et serrez la vis moletée (A) du système. Figure 125. Installation de la carte d'extension SAS et du fond de panier central Figure 126. Installation de la plaque d'alimentation 126 Unités de disques 6. Insérez avec précaution le contrôleur de gestion du boîtier dans le système et fermez le loquet (A) comme indiqué dans la figure suivante. 7. Soulevez avec précaution la carte d'entrée-sortie et alignez-la avec l'emplacement situé à l'arrière de l'unité d'extension. 8. Insérez la carte d'entrée-sortie fermement dans le serveur, comme indiqué dans la figure suivante. 9. Fixez la carte d'entrée-sortie avec les languettes de verrouillage (A) comme indiqué dans la figure suivante. Figure 127. Remplacement du contrôleur de gestion du boîtier Unités de disques 127 10. Pour reposer le bloc d'alimentation, insérez-le dans l'unité d'extension en veillant à ce que le levier de verrouillage (A) soit en position ouverte, comme illustré dans la figure suivante. 11. Fermez le levier de verrouillage (A) jusqu'à ce que le bloc d'alimentation soit maintenu en place. Figure 128. Installation de la carte d'entrée-sortie sur un modèle monté en armoire 128 Unités de disques 12. Pour remplacer le fond de panier de l'unité de disques, soulevez le fond de panier sur les deux côtés et alignez-le avec les rails de guidage situés dans l'unité d'extension. ATTENTION : Le fond de panier peut être lourd à porter. Assurez-vous de pouvoir effectuer cette tâche en toute sécurité avant de commencer. 13. Faites glisser le fond de panier dans l'unité d'extension en le tenant par le bas, comme illustré dans la figure suivante. 14. Fixez le fond de panier en amenant les pattes de fixation de la position ouverte (B) à la position verrouillée (A). Figure 129. Remplacement d'un bloc d'alimentation Unités de disques 129 15. Reposez toutes les unités de disques et les obturateurs, si vous les avez retirés précédemment. a. Alignez l'unité de disques avec les rails de guidage du fond de panier en la tenant par le bas. b. Faites glisser l'unité de disques complètement dans le fond de panier. c. Appuyez sur le levier (A) vers l'unité de disques pour le verrouiller. Figure 130. Installation du fond de panier sur un modèle monté en armoire 130 Unités de disques 16. Remettez en place l'ensemble des cartes d'extension et obturateurs, si vous les aviez retirés précédemment. a. Alignez la carte d'extension avec les rails de guidage du fond de panier en la tenant par le bas. b. Faites glisser la carte d'extension complètement dans le fond de panier. c. Appuyez sur le levier (A) vers la carte pour le verrouiller. 17. Démarrez le système. Pour obtenir des instructions, voir «Démarrage du système ou de la partition logique», à la page 151. Que faire ensuite Procédez à toutes les autres tâches de maintenance nécessaires. v Pour plus d'informations sur le raccordement du système, voir Connexion des unités d'extension v Pour plus d'informations sur l'installation des adaptateurs PCI sur la carte, voir Unités d'extension, cartes PCI et cassettes - Modèles 58/02 et 58/77. Retrait d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension Les procédures suivantes expliquent comment retirer une carte d'extension SAS d'une unité d'extension (montée en armoire)58/02 pour remplacer un composant défectueux nécessitant une mise hors tension. Avant de commencer Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la carte d'extension SAS. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 194. Procédure 1. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143). Unités de disques 131 2. Identifiez la carte que vous souhaitez retirer. Pour obtenir des instructions, voir «Identification d'un composant défaillant», à la page 146. 3. Arrêtez le système. Pour obtenir des instructions, voir «Arrêt d'un système ou d'une partition logique», à la page 153. 4. Retirez les unités de disques et les obturateurs, comme illustré dans la figure suivante. a. Appuyez sur le loquet de l'unité de disques et tirez sur la poignée (A) vers vous pour dégager les unités. b. Faites glisser l'unité de disques à l'extérieur du fond de panier en la tenant par le bas. 5. Retirez les cartes de port et les obturateurs, comme illustré dans la figure ci-dessous. a. Saisissez l'extrémité du levier de verrouillage (A) et tirez-le vers le haut pour déverrouiller la carte de port. b. Retirez la carte de port du fond de panier en la tenant par le bas. c. Posez la carte de port sur une surface de protection contre les décharges électrostatiques. Figure 131. Retrait des unités de disques du fond de panier 132 Unités de disques 6. Retirez le fond de panier, comme illustré dans la figure suivante. ATTENTION : Le fond de panier peut être lourd à porter. Assurez-vous de pouvoir effectuer cette procédure en toute sécurité. a. Appuyez sur les loquets bleus (A) pour déverrouiller les leviers. b. Tirez les leviers (B) vers l'extérieur dans le sens indiqué pour libérer le fond de panier. c. Retirez le fond de panier du boîtier en le tenant par le bas. Figure 132. Retrait de la carte de port du fond de panier Unités de disques 133 7. Débranchez les cordons d'alimentation des deux ensembles de conversion hors ligne (OCA) en procédant comme suit : a. A l'avant des deux ensembles de conversion hors ligne, faites glisser les supports de cordon d'alimentation (A) vers la gauche. b. Etiquetez et débranchez les cordons d'alimentation (B) de chaque ensemble de conversion hors ligne. Figure 133. Retrait du fond de panier du boîtier 134 Unités de disques 8. Retirez les ensembles de conversion hors ligne en procédant comme suit : a. Tirez le levier (A) à fond vers le bas dans le sens indiqué pour dégager l'ensemble de conversion hors ligne. b. Saisissez l'ensemble par les extrémités et retirez-le du boîtier. Remarque : Tenez l'ensemble des deux mains pour l'extraire. c. Répétez ces opérations pour l'autre ensemble de conversion hors ligne. Figure 134. Débranchement des cordons d'alimentation des ensembles de conversion hors ligne Unités de disques 135 9. Retirez le contrôleur de gestion du boîtier en procédant comme suit : a. Etiquetez et débranchez tous les câbles de la carte du contrôleur de gestion du boîtier. b. Pincez le loquet (A) pour déverrouiller le levier (B). c. Tirez le levier vers l'extérieur dans le sens indiqué pour dégager la carte du contrôleur de gestion du boîtier. d. Faites glisser la carte du contrôleur de gestion hors du boîtier. Figure 135. Retirer les ensembles de conversion hors ligne 136 Unités de disques 10. Retirez la plaque d'alimentation en tournant la vis moletée (A) et en ôtant la plaque du système. 11. Déconnectez les câbles de l'arrière de la carte d'entrée-sortie. Figure 136. Retirer le contrôleur de gestion du boîtier Figure 137. Retrait de la plaque d'alimentation Unités de disques 137 12. Appuyez sur les loquets de déverrouillage (A) et tirez sur les leviers (B) pour dégager la carte du système, comme illustré dans la figure suivante. 13. Retirez la carte du système. 14. Retirez la carte d'extension SAS et le fond de panier central du boîtier en procédant comme suit : a. Tirez les verrous éclipsables à ressort (A) vers le haut et tournez-les d'un quart vers la gauche. b. Tournez les vis imperdables (B) et (C) dans le sens inverse des aiguilles d'une montre jusqu'à ce que qu'elles soient complètement desserrées. c. Retirez la carte d'extension SAS et le fond de panier central du boîtier. Figure 138. Retrait de la carte d'entrée-sortie 138 Unités de disques 15. Appuyez sur les leviers (A) de la carte d'extension pour dégager cette dernière du fond de panier central, puis retirez la carte. Figure 139. Retrait de la carte d'extension SAS et du fond de panier central Figure 140. Retrait de la carte d'extension SAS du fond de panier central Unités de disques 139 Que faire ensuite Poursuivez la procédure de remplacement de la carte d'extension SAS ou revenez à la procédure qui vous a amené vers cette page. Remise en place d'une carte d'extension SAS dans une unité d'extension 58/02 hors tension Les procédures suivantes expliquent comment remplacer une carte d'extension SAS dans une unité d'extension 58/02 hors tension. Avant de commencer Examinez les conditions préalables requises décrites dans la section «Avant de commencer», à la page 143. Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la carte. Pour obtenir des instructions, voir «Retrait d'un composant via la console HMC», à la page 194 Procédure 1. Avant d'installer ou de réinstaller une carte d'extension SAS, examinez les connecteurs de signal afin de vous assurer qu'ils ne sont pas recourbés ou endommagés. Avertissement : Les connecteurs recourbés risquent d'endommager le réceptacle du fond de panier. Veillez à ne pas toucher ni endommager les connecteurs de signal. 2. Insérez complètement la carte dans l'emplacement et poussez le levier (A) vers le bas pour maintenir la carte en place, comme illustré dans la figure suivante. Figure 141. Installation de la d'extension SAS 140 Unités de disques 3. Démarrez le système. Pour obtenir des instructions, voir «Démarrage du système ou de la partition logique», à la page 151. 4. Vérifiez que la nouvelle carte est correctement installée. Pour plus d'informations, voir «Vérification du composant installé», à la page 195. Que faire ensuite Procédez à toutes les autres tâches de maintenance nécessaires. Retrait d'une carte d'extension SAS d'une unité d'extension 58/02 hors tension Les procédures suivantes expliquent comment retirer une carte d'extension SAS d'une unité d'extension (montée en armoire) 58/02 pour remplacer un composant défectueux nécessitant une mise hors tension. Avant de commencer Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la carte d'extension SAS. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 194. Procédure 1. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 143). 2. Identifiez la carte que vous souhaitez retirer. Pour obtenir des instructions, voir «Identification d'un composant défaillant», à la page 146. 3. Soulevez le levier (A) et retirez la carte du système, comme illustré dans la figure suivante. Figure 142. Retrait de la carte d'extension SAS Unités de disques 141 Que faire ensuite Poursuivez la procédure de remplacement de la carte d'extension SAS ou revenez à la procédure qui vous a amené vers cette page. 142 Unités de disques Procédures communes pour les fonctions installables Cette section contient toutes les procédures communes concernant l'installation, le retrait et le remplacement de fonctionnalités. Avant de commencer Respectez les précautions suivantes lorsque vous installez, retirez et remplacez des dispositifs et des composants. Pourquoi et quand exécuter cette tâche Ces précautions sont destinées à créer un environnement sûr pour la maintenance de votre système et ne sont pas des étapes pour la maintenance de votre système. Les procédures d'installation, de retrait et de remplacement fournissent les étapes requises pour la maintenance de votre système. 143 DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) DANGER 144 Unités de disques Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Avant de commencer toute procédure de remplacement ou d'installation, procédez comme suit : Procédure 1. Si vous installez un nouveau dispositif, vous devez disposer des logiciels requis pour la prise en charge de ce nouveau dispositif. 2. Si vous exécutez une procédure d'installation ou de remplacement qui risque d'affecter la sécurité des données, vous devez disposer, dans la mesure du possible, d'une copie de sauvegarde récente de votre système ou de la partition logique (systèmes d'exploitation, logiciels sous licence et données). Procédures communes pour les fonctions installables 145 3. Passez en revue la procédure d'installation ou de remplacement relative au dispositif ou au composant. 4. Notez la correspondance des couleurs utilisée sur le système. La couleur bleue ou ocre sur les composants matériels indique un point de contact que vous pouvez utiliser pour retirer ou installer le composant sur le système, ouvrir ou fermer un levier, etc. La couleur ocre peut également désigner un composant que vous pouvez retirer ou remplacer alors que le système ou la partition logique est sous tension. 5. Vous devez vous munir d'un tournevis à lame plate moyen, d'un tournevis cruciforme et d'une paire de ciseaux. 6. Si certains composants sont incorrects, manquants ou visiblement endommagés, procédez comme suit : v Si vous remplacez un composant, contactez votre fournisseur de composants ou le support technique. v Si vous installez un dispositif, contactez l'un des fournisseurs de services suivants : – Votre fournisseur de composants ou le support technique. 7. Si vous rencontrez des difficultés durant l'installation, contactez votre fournisseur de services, ou le support technique. 8. Si vous installez du nouveau matériel dans une partition logique, vous devez planifier et comprendre les implications inhérentes au partitionnement du système. Pour plus d'informations, consultez la rubrique Partitionnement logique. Identification d'un composant défaillant Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un système ou une unité d'extension en utilisant la méthode propre à votre système. Pourquoi et quand exécuter cette tâche Pour les serveurs systèmes dotés d'un processeur POWER7, les voyants peuvent être utilisés pour identifier ou vérifier un composant en cours de maintenance. Le voyant d'erreur (ambre) signale une erreur et correspond au code d'emplacement du code SRC. Le voyant est activé et désactivé automatiquement. Si vous avez besoin d'utiliser la fonction d'identification, utilisez les procédures suivantes. Voyants du panneau de commande Ces informations peuvent s'utiliser comme un guide sur les voyants et boutons du panneau de commande. Le panneau de commande est doté de voyants qui indiquent les différents états du système. 146 Unités de disques v A : Bouton de mise sous-tension v B: Voyant de système sous tension – Un voyant fixe indique que l'alimentation système de l'unité est complète. – Un voyant clignotant indique que l'unité est sous alimentation de mode veille. Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe. Durant cette période de transition, le voyant peut clignoter plus rapidement. v C : Voyant d'identification de boîtier – Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce. – L'absence de lumière indique un fonctionnement normal du système. v D : Voyant d'informations système – L'absence de lumière indique un fonctionnement normal du système. – Un voyant signifie qu'une intervention est requise sur le système. v E : Port USB v F: Voyant de panne de boîtier – Un voyant fixe indique une panne dans l'unité centrale – L'absence de lumière indique un fonctionnement normal du système. v G : Ecran Fonctions/Données v H : Bouton de décrémentation v I : Bouton Entrée v J : Bouton d'incrémentation v K : Bouton de réinitialisation Figure 143. Panneau de commande Procédures communes pour les fonctions installables 147 Concepts associés Identification d'un composant défaillant Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un système ou une unité d'extension en utilisant la méthode propre à votre système. Identification d'un composant défaillant sur un système AIX ou une partition logique Les instructions permettent de savoir comment rechercher un composant défaillant, puis d'activer le voyant de ce composant sur un système ou une partition logique exécutant le système d'exploitation AIX. Localisation d'un composant défaillant sur un système AIX ou une partition logique Il se peut que vous deviez utiliser les outils AIX, avant d'activer le voyant dans le cadre de la recherche d'un composant défaillant. Procédure 1. Connectez-vous en tant qu'utilisateur root ou celogin-. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée. 5. Dans les précédents diagnostics, choisissez Affichage du journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics qui contient une liste chronologique des événements s'affiche. 6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur Entrée. 7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal. 8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée. 9. Retournez à la ligne de commande. Que faire ensuite Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce dernier. «Activation du voyant associé au composant défaillant». Activation du voyant associé au composant défaillant Ces instructions permettent d'identifier physiquement l'emplacement d'un composant pour lequel vous intervenez. Procédure 1. Connectez-vous en tant que superutilisateur. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. 6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant. 7. Retournez à la ligne de commande. Désactivation du voyant d'un composant défaillant Cette procédure permet d'éteindre un voyant allumé dans le cadre d'une opération de maintenance. 148 Unités de disques Pourquoi et quand exécuter cette tâche Pour le désactiver, procédez comme suit : Procédure 1. Connectez-vous en tant que superutilisateur. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Voyants d'identification et d'avertissement, puis appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. Lorsque le voyant d'un composant défaillant est activé, le caractère I précède le code d'emplacement. 6. Sélectionnez Validation. Le programme désactive l'intervention du système ainsi que le voyant associé au composant défaillant. 7. Retournez à la ligne de commande. Identification d'un composant défaillant sur un système Linux ou une partition logique Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous pouvez activer ou désactiver les voyants pour localiser un composant ou effectuer une opération de maintenance. Localisation d'un composant défaillant sur un système Linux ou une partition logique Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous devez activer les voyants pour localiser un composant. Pourquoi et quand exécuter cette tâche Pour l'activer, procédez comme suit : Procédure 1. Connectez-vous en tant que superutilisateur. 2. A partir de la ligne de commande, tapez /usr/sbin/usysident -s identify -l et appuyez sur Entrée. 3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant. Localisation d'un composant défaillant sur un système ou une partition logique Linux Pour retrouver le code d'emplacement d'un composant défaillant si vous ne le connaissez pas, utilisez la procédure de cette rubrique. Pourquoi et quand exécuter cette tâche Pour localiser le composant défaillant sur un système ou une partition logique, procédez comme suit : Procédure 1. Connectez-vous en tant que superutilisateur. 2. Sur la ligne de commande, tapez grep diagela /var/log/platform et appuyez sur Entrée. 3. Recherchez l'entrée la plus récente contenant un code SRC (System Reference Code). 4. Notez les informations d'emplacement. Procédures communes pour les fonctions installables 149 Activation du voyant associé au composant défaillant Si vous connaissez le code d'emplacement du composant défaillant, activez le voyant pour vous aider à rechercher le composant à remplacer. Pourquoi et quand exécuter cette tâche Pour l'activer, procédez comme suit : Procédure 1. Connectez-vous en tant que superutilisateur. 2. A partir de la ligne de commande, tapez /usr/sbin/usysident -s identify -l et appuyez sur Entrée. 3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant. Résultats Désactivation du voyant d'un composant défaillant Après avoir terminé la procédure de retrait et de remplacement, vous devez désactiver le voyant du composant défaillant. Pourquoi et quand exécuter cette tâche Pour le désactiver, procédez comme suit : Procédure 1. Connectez-vous en tant que superutilisateur. 2. A partir de la ligne de commande, tapez /usr/sbin/usysident -s normal -l et appuyez sur Entrée. Localisation d'un composant défaillant sur un système Virtual I/O Server ou une partition logique Il se peut que vous deviez utiliser les outils Virtual I/O Server (VIOS), avant d'activer le voyant dans le cadre de la recherche d'un composant défaillant. Procédure 1. Connectez-vous en tant qu'utilisateur root ou celogin-. 2. A l'invite, tapez diagmenu et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée. 5. Dans l'écran Affichage des résultats de tests de diagnostic précédents, sélectionnez Affichage du journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics apparaît. Il contient la liste des événements triée dans l'ordre chronologique. 6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur Entrée. 7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal. 8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée. 9. Retournez à la ligne de commande. 150 Unités de disques Résultats Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce dernier. Pour plus d'informations, voir «Identification d'un composant à l'aide du Virtual I/O Server». Identification d'un composant à l'aide du Virtual I/O Server Ces instructions permettent d'activer le voyant afin de repérer l'emplacement physique d'un composant à l'aide du Virtual I/O Server (VIOS). Procédure 1. Connectez-vous en tant que superutilisateur. 2. A l'invite, tapez diagmenu et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche. Appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement. Appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. 6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant. 7. Retournez à la ligne de commande. Démarrage du système ou de la partition logique Apprenez comment démarrer un système ou une partition logique après avoir effectué une procédure de maintenance ou une mise à niveau du système. Démarrage d'un système non géré par une console HMC Le bouton d'alimentation ou l'interface ASMI permet de démarrer un système non géré par une console HMC (Hardware Management Console). Pourquoi et quand exécuter cette tâche Pour démarrer un système non géré par une console HMC (Hardware Management Console), procédez comme suit : Procédure 1. Le cas échéant, ouvrez le volet avant de l'armoire. 2. Avant d'appuyer sur le bouton d'alimentation du panneau de commande, vérifiez que le système est bien relié à une source d'alimentation : v Tous les cordons d'alimentation système doivent être reliés à une source d'alimentation. v Le voyant d'alimentation clignote lentement, comme indiqué dans la figure suivante. v Comme indiqué dans la figure ci-après, la partie supérieure de l'écran doit afficher la mention 01 V=F. 3. Appuyez sur le bouton d'alimentation (A) du panneau de commande (voir figure suivante). Procédures communes pour les fonctions installables 151 v A : Bouton de mise sous-tension v B : Voyant d'alimentation – Un voyant fixe indique que l'unité est sous alimentation système complète. – Un voyant clignotant indique que l'unité est sous alimentation de mode veille. Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe. Durant cette période de transition, le clignotement peut s'accélérer. v $C : Voyant d'identification de boîtier – Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce. – L'absence de lumière indique un fonctionnement normal du système. v D : Voyant d'incident – L'absence de lumière indique un fonctionnement normal du système. – Un voyant clignotant signifie qu'une intervention est requise sur le système. v E : Port USB v F: Voyant de panne de boîtier – Un voyant fixe indique une panne dans le système. – L'absence de lumière indique un fonctionnement normal du système. v G : Ecran Fonctions/Données v H : Bouton de décrémentation v I : Bouton Entrée v J : Bouton d'incrémentation v K : Bouton de réinitialisation 4. Une fois que vous avez appuyé sur le bouton d'alimentation, notez les éléments suivants : v Le voyant d'alimentation commence à clignoter plus rapidement. v Les ventilateurs système sont activés après environ 30 secondes et commencent à tourner plus rapidement. Figure 144. Panneau de commande 152 Unités de disques v L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du système. Le voyant de mise sous tension arrête de clignoter et reste fixe, indiquant que le système est sous tension. Que faire ensuite Conseil : Si le système ne démarre pas alors que vous avez appuyé sur le bouton d'alimentation, utilisez la procédure suivante pour démarrer le système à l'aide de l'interface ASMI (Advanced System Management Interface) : 1. Lancez l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Accès à l'interface ASMI. 2. Démarrez le système à l'aide de l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Mise sous tension et arrêt du système. Démarrage d'un système ou d'une partition logique via la console HMC (Hardware Management Console) Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour démarrer le système ou la partition logique après avoir installé les câbles requis et relié les cordons d'alimentation à une source d'alimentation. Pourquoi et quand exécuter cette tâche Pour plus d'informations sur l'utilisation de la console HMC, reportez-vous à la rubrique Gestion de la console HMC (Hardware Management Console). Pour plus d'informations sur le démarrage d'une partition logique, reportez-vous à la rubrique Partitionnement logique. Pour plus d'informations sur le démarrage du système, reportez-vous à la rubrique Mise sous tension du système géré. L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du système. Lorque le voyant du panneau de commande cesse de clignoter et reste allumé, le système est sous tension. Arrêt d'un système ou d'une partition logique La présente section explique comment démarrer un système ou une partition logique dans le cadre d'une mise à niveau du système ou d'une procédure de maintenance. Pourquoi et quand exécuter cette tâche Avertissement : Si vous utilisez le bouton de mise sous tension ou les commandes de la console HMC (Hardware Management Console) pour arrêter le système, les fichiers de données risquent d'être endommagés. Par ailleurs, le système risque de mettre plus de temps à démarrer la prochaine fois si toutes les applications n'ont pas été fermées avant l'arrêt du système. Pour arrêter le système ou la partition logique, sélectionnez la procédure appropriée. Tâches associées Arrêt de partitions logiques à l'aide du gestionnaire Integrated Virtualization Manager Cette procédure permet d'arrêter une partition logique ou l'ensemble d'un système géré par le gestionnaire Integrated Virtualization Manager. Arrêt d'un système non géré par une console HMC Pour effectuer une autre tâche, vous devez arrêter le système. Ces instructions indiquent comment arrêter le système à l'aide du bouton d'alimentation ou de l'interface ASMI. Procédures communes pour les fonctions installables 153 Avant de commencer Avant d'arrêter le système, procédez comme suit : 1. Vérifiez que tous les travaux sont terminés et fermez toutes les applications. 2. Assurez-vous que le système d'exploitation est arrêté. Avertissement : Si vous ne le faites pas, vous risquez de perdre des données. 3. Si une partition logique I/O Server (VIOS) est en cours d'exécution, assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pourquoi et quand exécuter cette tâche La procédure ci-après explique comment arrêter un système non géré par une console HMC (Hardware Management Console). Procédure 1. Connectez-vous au système sous un nom d'utilisateur bénéficiant des droits d'accès nécessaires pour exécuter la commande shutdown ou pwrdwnsys (mettre le système hors tension). 2. A l'invite, tapez la commande appropriée : v Si le système exécute le système d'exploitation AIX, tapez shutdown. v Si le système exécute le système d'exploitation Linux, tapez shutdown -h now. v Si votre système exécute le système d'exploitation , tapez PWRDWNSYS. Si votre système est partitionné, utilisez la commande PWRDWNSYS pour arrêter chaque partition secondaire. Ensuite, utilisez la commande PWRDWNSYS pour arrêter la partition principale. La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de mise sous tension commence à clignoter lentement et le système passe en veille. 3. Pour restaurer l'état du système à l'issue de l'opération d'installation ou de remplacement, notez le type et le mode de l'IPL. 4. Mettez les interrupteurs d'alimentation des unités reliées au système sur Off. 5. Débranchez tous les cordons d'alimentation connectés à l'unité des socles de prise de courant. Vérifiez que vous débranchez également les cordons d'alimentation des périphériques (imprimantes et unités d'extension, par exemple). Important : Le système est équipé d'un second bloc d'alimentation. Avant de continuer la procédure, vérifiez que toutes les sources d'alimentation ont été débranchées du système. (L003) ou 154 Unités de disques Arrêt d'un système à l'aide de la console HMC Pourquoi et quand exécuter cette tâche Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour arrêter le système ou la partition logique. Pour effectuer cette tâche, procédez comme suit. Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous arrêtez sa dernière partition logique en cours d'exécution. Si vous définissez les propriétés du système géré sur la console HMC de façon à empêcher la mise hors tension automatique du système, vous devez utiliser cette procédure pour mettre hors tension le système géré. Avertissement : Dans la mesure du possible, arrêtez les partitions logiques en cours d'exécution avant de mettre le système géré hors tension. Si vous n'arrêtez pas d'abord les partitions logiques, vous risquez de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique I/O Server (VIOS), assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants : v Super administrateur v Technicien de maintenance v Opérateur v Responsable produit au laboratoire Procédure 1. Dans la zone de navigation, développez le dossier Gestion de systèmes. 2. Cliquez sur l'icône Serveurs. 3. Dans la zone de contenu, sélectionnez le système géré. 4. Sélectionnez Tâches, Opérations, puis Mise hors tension 5. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK. Procédures communes pour les fonctions installables 155 Information associée Arrêt et redémarrage de partitions logiques Retrait et remplacement du volet ou panneau d'unité d'extension Il se peut que vous deviez retirer, remplacer ou installer des panneaux ou des volets sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Retrait du panneau avant sur un modèle 14G/30 Il se peut que vous deviez retirer le panneau pour effectuer des opérations de maintenance du système. Pourquoi et quand exécuter cette tâche Pour retirer le panneau avant, procédez comme suit : Procédure 1. Retirez les deux vis moletées (B) figurant sur la partie gauche et à l'arrière du panneau. 2. Appuyez sur les loquets (C) situés de part et d'autre du panneau pour libérer celui-ci. 3. Soulevez le panneau (A) et retirez-le du châssis. Conseil : Si un bloc de ventilation est présent dans une partie vide du châssis, ne le retirez que si vous installez des composants dans cette partie du châssis. Installation du panneau avant sur un modèle 14G/30 Cette procédure permet d'installer le panneau avant après avoir installé le système ou effectué une procédure de maintenance. Figure 145. Retrait du panneau avant sur un modèle 14G/30 156 Unités de disques Avant de commencer Si des composants ne sont installés que d'un seul côté du châssis, assurez-vous que le bloc de ventilation est présent du côté vide. Procédure 1. Insérez les colonnes situées à l'arrière du panneau (A) dans les crochets du châssis. 2. Appuyez sur les loquets (C) situés de part et d'autre du panneau avant. 3. Insérez le panneau dans le châssis et ouvrez les loquets. Le panneau s'enclenche dans le logement. 4. Remettez en place les deux vis moletées (B) dans les emplacements gauche et droite du panneau avant. Carters système Suivez ces procédures pour retirer et remettre en place les carters sur les serveurs systèmes contenant le processeur POWER7. Retrait et remplacement des carters du système 02E/4B ou 05F/6B Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des composants ou effectuer une opération de maintenance. Retrait du carter avant d'un système monté en armoire 02E/4B ou 05F/6B Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de maintenance. Figure 146. Panneau avant d'un modèle 14G/30 Procédures communes pour les fonctions installables 157 Pourquoi et quand exécuter cette tâche Pour retirer le carter avant, procédez comme suit : Procédure 1. Retirez les deux vis qui maintiennent le système fixé à l'armoire. 2. Retirez le carter du système. Le carter est doté d'un renfoncement permettant de le tenir plus facilement. Retrait du carter avant sur un système autonome 02E/4B ou 05F/6B Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour retirer le carter avant, procédez comme suit : Procédure 1. Ouvrez le volet avant (A) en le faisant pivoter vers la gauche, comme indiqué dans la figure suivante. 2. Appuyez sur le taquet de verrouillage (B) et retirez le carter du système. Le carter est doté d'un renfoncement permettant de le tenir plus facilement. Figure 147. Retrait du carter avant 158 Unités de disques Installation du carter avant sur un système monté en armoire 02E/4B ou 05F/6B Cette procédure permet d'installer le carter permettant d'accéder aux composants ou d'effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le carter avant, procédez comme suit : Procédure 1. Poussez le carter sur le système. 2. Poussez doucement le carter jusqu'à ce que les quatre taquets de déverrouillage soient positionnés dans leurs montants de fixation comme indiqué dans la figure suivante. Le carter avant se met en place ; il est doté d'un renfoncement permettant de le tenir plus facilement. Figure 148. Retrait du carter avant Procédures communes pour les fonctions installables 159 Installation du carter avant et du volet avant sur un système autonome 02E/4B ou 05F/6B Cette procédure permet d'installer le carter permettant d'accéder aux composants ou d'effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le carter avant et le volet avant, procédez comme suit. Procédure 1. Alignez le carter (A) jusqu'à ce que les deux pattes du couvercle soient placées dans les encoches de plaque de base (B), comme illustré dans la figure suivante. 2. Faites pivoter le carter vers le haut et vers le système, jusqu'à ce que le taquet de déverrouillage soit inséré dans son emplacement. Le carter est doté d'un renfoncement permettant de le tenir plus facilement. Figure 149. Retrait du carter avant 160 Unités de disques 3. Maintenez le volet avant à un angle d'environ 120 degrés vers le système comme indiqué dans la figure suivante. Enclenchez les taquets. Retrait du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B Cette procédure décrit comment retirer le capot d'accès. Procédure 1. Desserrez les deux vis moletées (A) situées à l'arrière du capot. Figure 150. Installation du carter avant Figure 151. Installation du volet avant Procédures communes pour les fonctions installables 161 2. Faites glisser le capot vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Avertissement : Pour assurer un refroidissement et une circulation de l'air corrects, replacez le capot avant la mise sous tension du système. Vous risquez d'endommager les composants système si vous utilisez le système sans son capot pendant plus de 30 minutes. Retrait du capot d'accès d'un système autonome 02E/4B ou 05F/6B Cette procédure décrit comment retirer le capot d'accès. Procédure 1. Desserrez la vis de serrage à main (A) située à l'arrière du capot. 2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Figure 152. Retrait du capot d'accès 162 Unités de disques 3. Desserrez les deux vis moletées (A) situées à l'arrière du capot intérieur. 4. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Figure 153. Installation du capot d'accès extérieur Figure 154. Retrait du capot d'accès intérieur Procédures communes pour les fonctions installables 163 Installation du capot d'accès d'un système monté sur armoire 02E/4B ou 05F/6B Cette procédure décrit comment installer le capot d'accès. Procédure 1. Placez le capot d'accès sur le haut du système, à environ 25 mm du rebord du châssis supérieur. 2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 3. Alignez les deux vis moletées (A) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du boîtier du système. 4. Serrez les vis moletées afin de fixer le capot d'accès. Installation du capot d'accès d'un système autonome 02E/4B ou 05F/6B Cette procédure décrit comment installer le capot d'accès. Procédure 1. Placez le capot d'accès interne (A) sur le haut du système, à environ 25 mm du rebord du châssis supérieur. 2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 3. Alignez les deux vis moletées (B) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du boîtier du système. 4. Serrez les vis moletées afin de fixer le capot d'accès. Figure 155. Installation du capot d'accès 164 Unités de disques 5. Placez le capot d'accès extérieur (A) sur le haut du système. 6. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 7. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du système. 8. Serrez la vis moletée afin de fixer le capot d'accès. Figure 156. Installation du capot d'accès intérieur Procédures communes pour les fonctions installables 165 Retrait et remplacement des carters du système 31E/2B Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des composants ou effectuer une opération de maintenance. Retrait du capot d'accès du système 31E/2B Cette procédure décrit comment retirer le capot d'accès. Procédure 1. Desserrez la vis moletée (A) située à l'arrière du capot. 2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Figure 157. Installation du capot d'accès extérieur 166 Unités de disques Avertissement : Pour assurer un refroidissement et une circulation de l'air corrects, replacez le capot avant la mise sous tension du système. Vous risquez d'endommager les composants système si vous utilisez le système sans son capot pendant plus de 30 minutes. Installation du capot d'accès du système 31E/2B Cette procédure décrit comment installer le capot d'accès. Procédure 1. Placez le capot d'accès (A) sur le haut du système, à environ 25 mm du rebord du châssis supérieur. 2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 3. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du système. 4. Serrez les vis moletées afin de fixer le capot d'accès. Figure 158. Retrait du capot d'accès Procédures communes pour les fonctions installables 167 Retrait et installation des carters sur le système 33E/8B ou 36E/8C Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des composants ou effectuer une opération de maintenance. Retrait du carter avant des modèles 33E/8B ou 36E/8C. Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour retirer le carter avant, procédez comme suit : Procédure 1. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (B) (voir figure suivante). 2. Retirez le carter du système. Le carter est doté d'un renfoncement permettant de le tenir plus facilement. Figure 159. Installation du capot d'accès 168 Unités de disques Installation du carter avant des modèles 33E/8B ou 36E/8C Cette procédure permet de remettre en place le carter après avoir accédé à des composants ou effectué une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le carter avant, procédez comme suit : Procédure 1. Alignez le carter face au système. 2. Poussez doucement le carter jusqu'à ce que les quatre taquets de déverrouillage soient positionnés dans leurs montants de fixation respectifs (B) sur le système. 3. Remettez en place les deux vis (C) qui maintiennent le système fixé à l'armoire (A). Figure 160. Retrait du carter avant Procédures communes pour les fonctions installables 169 Retrait du capot d'accès des modèles 33E/8B ou 36E/8C Cette procédure permet de retirer le capot d'accès pour effectuer des opérations de maintenance ou accéder à des composants internes. Pourquoi et quand exécuter cette tâche Pour retirer le capot d'accès, procédez comme suit : Procédure 1. Placez le système en position de maintenance. Pour plus d'informations, voir «Mise en position de maintenance de l'unité 33E/8B ou 36E/8C», à la page 184. 2. Desserrez les deux vis moletées (A) situées à l'arrière du capot. 3. Placez le capot (B) à l'arrière de l'unité centrale. Une fois l'avant du capot d'accès dégagé du bord supérieur, soulevez et retirez le capot. Résultats Avertissement : Pour garantir un niveau de refroidissement et de ventilation approprié, installez le capot avant de démarrer le système. Vous risquez d'endommager les composants système si vous utilisez le système sans son capot pendant plus de 30 minutes. Figure 161. Installation du carter avant 170 Unités de disques Installation du capot d'accès des modèles 33E/8B ou 36E/8C. Cette procédure permet d'installer le capot d'accès après avoir effectué une opération de maintenance ou accédé aux composants internes. Pourquoi et quand exécuter cette tâche Pour installer le capot d'accès, procédez comme suit : Procédure 1. Placez le capot d'accès (A) sur le haut du système, à environ 25 mm du panneau avant. 2. Tout en maintenant le capot d'accès contre l'unité centrale, faites glisser le capot vers l'avant de l'unité centrale. Les taquets du capot d'accès doivent se trouver en-dessous du bord supérieur et les deux vis doivent être alignées avec les trous situés à l'arrière de l'unité centrale. Important : Assurez-vous que les câble LED du ventilateur ne sont pas coincés par l'arête avant du capot d'accès lorsque vous le déplacez vers l'avant. 3. Serrez les deux vis (B) situées à l'arrière du capot. Figure 162. Retrait du capot d'accès Procédures communes pour les fonctions installables 171 Retrait et remplacement des carters du système 17M/MB ou 79M/HB Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des composants ou effectuer une opération de maintenance. Retrait du capot avant des modèles 17M/MB ou 79M/HB Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour retirer le panneau avant, procédez comme suit : Procédure 1. Le cas échéant, ouvrez le volet avant de l'armoire. 2. Tirez les taquets (A) situés de chaque côté du capot (voir figure suivante). Figure 163. Installation du capot d'accès 172 Unités de disques 3. Tirez le panneau (B) pour l'ôter de l'unité centrale. Installation du panneau avant du modèle 17M/MB ou 79M/HB Cette procédure permet d'installer le capot après avoir accédé à des composants ou effectué une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le panneau avant, procédez comme suit : Procédure 1. Placez le panneau (A) sur la face avant de l'unité centrale en alignant les quatre goupilles sur les quatre emplacements correspondants à l'arrière du panneau. Figure 164. Retrait du capot avant Procédures communes pour les fonctions installables 173 2. Appuyez sur les taquets (B) pour enclencher le panneau dans son emplacement. 3. Fermez le volet avant de l'armoire. Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le cadre d'une opération de maintenance ou pour accéder à des composants internes. Mise en position de maintenance d'un système monté en armoire 31E/2B Utilisez cette procédure pour placer le système monté en armoire en position de maintenance. Avant de commencer Remarques : v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait qu'une seule unité centrale en position de maintenance à la fois. v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez celle-ci de l'armoire. v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent, ce qui empêche l'unité de trop sortir de son emplacement. Procédure 1. Ouvrez le volet avant de l'armoire. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. Figure 165. Installation du panneau avant 174 Unités de disques 3. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale, tirez l'unité centrale hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et verrouillées. Mise en position de fonctionnement d'un système monté en armoire 31E/2B Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement. Avant de commencer Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire. Procédure 1. Déverrouiller les taquets de sécurité en les faisant glisser vers l'avant du système. 2. Poussez l'unité centrale dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité soient verrouillés. Figure 166. Placement du système en position de maintenance Procédures communes pour les fonctions installables 175 3. Fermez le volet avant de l'unité dont vous effectuez la maintenance. Mise en position de maintenance d'un système monté en armoire 02E/4B ou 05F/6B Utilisez cette procédure pour placer le système monté en armoire en position de maintenance. Avant de commencer Remarques : v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait qu'une seule unité centrale en position de maintenance à la fois. v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez celle-ci de l'armoire. v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent, ce qui empêche l'unité de trop sortir de son emplacement. Procédure 1. Ouvrez le volet avant de l'armoire. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Retirez les deux vis (A) (si elles sont présentes) fixant l'unité centrale (B) à l'armoire. Figure 167. Mise en position de fonctionnement du système 176 Unités de disques 4. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale (A), tirez l'unité centrale (B) hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et verrouillées. Figure 168. Placement du système en position de maintenance Procédures communes pour les fonctions installables 177 Mise en position de fonctionnement du système monté en armoire 02E/4B ou 05F/6B Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement. Avant de commencer Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire. Procédure 1. Déverrouillez les taquets de sécurité (A) en les faisant glisser vers le haut du système. Les taquets de sécurité sont situés près du milieu du système. 2. Poussez l'unité centrale (B) dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité soient verrouillés. 3. Replacez et serrez les deux vis (A) fixant l'unité centrale (B) à l'armoire. 4. Fermez le volet avant de l'unité dont vous effectuez la maintenance. Figure 169. Mise en position de fonctionnement du système 178 Unités de disques Positions de maintenance et de fonctionnement Suivez ces procédures pour placer les serveurs systèmes en position de maintenance ou de fonctionnement. Mise en position de fonctionnement du système 02E/4B ou 05F/6B Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le cadre d'une opération de maintenance ou pour accéder à des composants internes. Mise en position de maintenance d'un système monté en armoire 02E/4B ou 05F/6B Utilisez cette procédure pour placer le système monté en armoire en position de maintenance. Avant de commencer Remarques : v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait qu'une seule unité centrale en position de maintenance à la fois. v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez celle-ci de l'armoire. v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent, ce qui empêche l'unité de trop sortir de son emplacement. Procédure 1. Ouvrez le volet avant de l'armoire. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Retirez les deux vis (A) (si elles sont présentes) fixant l'unité centrale (B) à l'armoire. Figure 170. Mise en position de fonctionnement du système Procédures communes pour les fonctions installables 179 4. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale (A), tirez l'unité centrale (B) hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et verrouillées. Figure 171. Placement du système en position de maintenance 180 Unités de disques Mise en position de fonctionnement du système monté en armoire 02E/4B ou 05F/6B Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement. Avant de commencer Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire. Procédure 1. Déverrouillez les taquets de sécurité (A) en les faisant glisser vers le haut du système. Les taquets de sécurité sont situés près du milieu du système. 2. Poussez l'unité centrale (B) dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité soient verrouillés. 3. Replacez et serrez les deux vis (A) fixant l'unité centrale (B) à l'armoire. 4. Fermez le volet avant de l'unité dont vous effectuez la maintenance. Figure 172. Mise en position de fonctionnement du système Procédures communes pour les fonctions installables 181 Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le cadre d'une opération de maintenance ou pour accéder à des composants internes. Mise en position de maintenance d'un système monté en armoire 31E/2B Utilisez cette procédure pour placer le système monté en armoire en position de maintenance. Avant de commencer Remarques : v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait qu'une seule unité centrale en position de maintenance à la fois. v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez celle-ci de l'armoire. v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent, ce qui empêche l'unité de trop sortir de son emplacement. Procédure 1. Ouvrez le volet avant de l'armoire. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale, tirez l'unité centrale hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et verrouillées. Figure 173. Mise en position de fonctionnement du système 182 Unités de disques Mise en position de fonctionnement d'un système monté en armoire 31E/2B Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement. Avant de commencer Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire. Procédure 1. Déverrouiller les taquets de sécurité en les faisant glisser vers l'avant du système. 2. Poussez l'unité centrale dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité soient verrouillés. Figure 174. Placement du système en position de maintenance Procédures communes pour les fonctions installables 183 3. Fermez le volet avant de l'unité dont vous effectuez la maintenance. Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le cadre d'une opération de maintenance ou pour accéder à des composants internes. Mise en position de maintenance de l'unité 33E/8B ou 36E/8C Cette procédure permet d'effectuer des opérations de maintenance ou d'accéder à des composants internes en plaçant l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance. Pourquoi et quand exécuter cette tâche Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Figure 175. Mise en position de fonctionnement du système 184 Unités de disques DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) DANGER Procédures communes pour les fonctions installables 185 Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Pour mettre l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance, procédez comme suit : Procédure 1. Le cas échéant, ouvrez le volet avant de l'armoire. 2. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (voir figure suivante). 3. Débloquez les taquets de sécurité (B) situés sur les deux côtés de l'unité (voir figure suivante). 186 Unités de disques 4. Si vous devez sortir l'unité centrale ou l'unité d'extension de l'armoire, veillez à débrancher auparavant les câbles situés à l'arrière de l'unité. Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de l'unité d'extension en tirant l'unité hors de l'armoire. 5. Tirez doucement l'unité centrale ou l'unité d'extension hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et bloquées. Vérifiez que les glissières sont complètement étendues. Une fois les glissières complètement étendues, les taquets de sécurité s'enclenchent. Ils évitent de sortir l'unité centrale ou l'unité d'extension de manière excessive. Placement de 33E/8B ou de 36E/8C en position de fonctionnement Cette procédure permet de mettre l'unité centrale (le système) ou l'unité d'extension en position de fonctionnement pour pouvoir l'utiliser. Pourquoi et quand exécuter cette tâche Pour mettre le système en position de fonctionnement, procédez comme suit : Conseil : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre système ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Procédure 1. Libérez simultanément les taquets de sécurité bleus (B) situés vers l'avant de chaque taquet, puis poussez le système ou l'unité d'extension dans l'armoire (voir la figure suivante). Remarque : Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de le système ou de l'unité d'extension lorsque vous poussez l'unité dans l'armoire. Figure 176. Libération des loquets de sécurité Procédures communes pour les fonctions installables 187 2. Réinstallez et serrez les deux vis moletées (C) qui maintiennent le système ou l'unité d'extension (A) à l'armoire (voir figure suivante). 3. Fermez le volet avant de l'armoire. Figure 177. Déblocage des taquets de sécurité du rail Figure 178. Remise en place des vis moletées 188 Unités de disques Cordons d'alimentation Suivez ces procédures pour déconnecter et reconnecter les cordons d'alimentation les serveurs systèmes contenant le processeur POWER7. Déconnexion des cordons d'alimentation d'un système 02E/4B ou 05F/6B Utilisez ces procédures pour déconnecter les cordons d'alimentation d'un système 02E/4B ou 05F/6B. Procédure 1. Pour un système monté en armoire, ouvrez le volet arrière de l'armoire sur l'unité sur laquelle vous effectuez la maintenance et identifiez l'unité centrale. 2. Pour un système monté en armoire et un système autonome, débranchez tous les cordons d'alimentation de l'unité sur laquelle vous effectuez la maintenance. Le système est peut-être équipé de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de remplacement, vérifiez que toutes les deux sources d'alimentation ont été débranchées du système. Figure 179. Retrait des cordons d'alimentation pour un système monté en armoire Procédures communes pour les fonctions installables 189 Connexion des cordons d'alimentation d'un système 02E/4B ou 05F/6B Cette procédure permet de connecter les cordons d'alimentation. Procédure 1. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 2. Connectez tous les cordons d'alimentation à l'unité dont vous effectuez la maintenance. Vérifiez que les cordons d'alimentation passent par les poignées. Figure 180. Retrait des cordons d'alimentation pour un système autonome 190 Unités de disques 3. Fermez le volet arrière de l'armoire. Déconnexion des cordons d'alimentation du système 31E/2B Utilisez ces procédures pour déconnecter les cordons d'alimentation du système 31E/2B. Procédure 1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Le système est peut-être équipé de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de remplacement, vérifiez que toutes les deux sources d'alimentation ont été débranchées du système. Figure 181. Connexion des cordons d'alimentation Procédures communes pour les fonctions installables 191 Connexion des cordons d'alimentation au système 31E/2B Utilisez ces procédures pour connecter les cordons d'alimentation au système 31E/2B. Procédure 1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Vérifiez que les cordons d'alimentation passent par les poignées. Figure 182. Retrait des cordons d'alimentation 192 Unités de disques Installation d'un dispositif via la console HMC (Hardware Management Console) Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de maintenance, y compris l'installation d'un nouveau dispositif ou composant. Avant de commencer Pour utiliser l'interface utilisateur de la console HMC (Hardware Management Console) afin d'installer un dispositif ou un composant sur une unité centrale ou une unité d'extension gérée par une console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes ? Serveurs. 2. Sélectionnez le système géré sur lequel vous allez installer le composant. 3. Dans la zone des tâches, développez Maintenabilité ? Matériel. 4. Sélectionnez Ajout d'une FRU (unité remplaçable sur site, Field Replaceable Unit). 5. Dans la fenêtre Installation/ajout/suppression de matériel, sélectionnez le système ou le boîtier dans lequel vous installez le dispositif. 6. Sélectionnez dans la liste le type de dispositif que vous souhaitez installer et cliquez sur Suivant. 7. Sélectionnez le code d'emplacement pour l'installation du dispositif et cliquez sur Ajouter. 8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'installation du dispositif. Remarque : La console HMC peut afficher des instructions expliquant comment installer le dispositif. Dans ce cas, suivez ces instructions pour installer le dispositif. Figure 183. Connexion des cordons d'alimentation Procédures communes pour les fonctions installables 193 Retrait d'un composant via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Pour utiliser l'interface utilisateur de la console HMC (Hardware Management Console) afin de retirer un composant d'une unité centrale ou d'une unité d'extension gérée par une console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes ? Serveurs. 2. Sélectionnez le système géré sont vous souhaitez retirer un composant. 3. Dans la zone des tâches, développez Maintenabilité ? Matériel. 4. Sélectionnez Retrait d'une FRU. 5. Dans la fenêtre Installation/ajout/suppression de matériel - Retrait de FRU, sélectionnez le type de FRU, sélectionnez le système ou le boîtier duquel vous souhaitez retirer le composant. 6. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant. 7. Sélectionnez l'emplacement du composant que vous retirez et cliquez sur Ajouter. 8. Une fois que le composant figure dans la liste des actions en attente, cliquez sur Lancer la procédure et suivez les instructions de retrait du composant. Remarque : La console HMC peut afficher des instructions de l'Information Center expliquant comment retirer le composant. Dans ce cas, suivez ces instructions pour retirer le composant. Remplacement d'un composant via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de maintenance, y compris l'échange d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez ces instructions. Si vous échangez un composant dans le cadre d'une autre procédure à l'aide de la version 7 ou d'une version ultérieure de la console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes ? Serveurs. 2. Sélectionnez le système géré dont vous souhaitez remplacer un composant. 3. Dans la zone des tâches, développez Maintenabilité ? Matériel. 4. Sélectionnez Echange de FRU. 5. Dans la fenêtre Remplacement de matériel - Remplacement de FRU, Sélectionnez le type de FRU, sélectionnez dans la liste le système ou le boîtier dans lequel vous souhaitez échanger un composant. 6. Sélectionnez dans la liste le type de composant que vous souhaitez échanger et cliquez sur Suivant. 7. Sélectionnez dans la liste le code d'emplacement du composant que vous souhaitez échanger et cliquez sur Ajouter. 8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'échange du dispositif. Remarque : La console HMC peut afficher des instructions concernant le replacement du composant. Dans ce cas, suivez les instructions pour remplacer le composant. 194 Unités de disques Vérification du composant installé Vous pouvez vérifier un composant nouvellement installé ou remplacé sur votre système, partition logique ou unité d'extension à l'aide du système d'exploitation, des diagnostics autonomes ou de la console HMC. Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils AIX pour vérifier que ce dispositif ou composant est reconnu par le système ou la partition logique. Pourquoi et quand exécuter cette tâche Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé, sélectionnez la procédure appropriée : v Vérification du dispositif installé via AIX v Vérification du composant de rechange via AIX Vérifiez le dispositif installé via AIX : 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée. 4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures suivantes : v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des ressources et appuyez sur Entrée. v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les ressources et appuyez sur Entrée. 6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en répondant aux invites éventuelles. 7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun incident n’a été détecté ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est correctement installé. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement logique (LPAR), notez la partition logique sur laquelle vous avez installé la fonctionnalité. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et revenez en mode de fonctionnement normal. Vérifiez le composant de rechange via AIX Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé, procédez comme suit : 1. Avez-vous remplacé le composant à l'aide du système d'exploitation AIX ou des opérations de remplacement à chaud simultané du service de diagnostic en ligne ? Non : Passez à l'étape 2, à la page 196. Procédures communes pour les fonctions installables 195 Oui : Passez à l'étape 5. 2. Le système est-il hors tension ? Non : Passez à l'étape 4. Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus d'informations, voir Effectuer une initialisation lente. 3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation AIX s'affiche ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête. Avez-vous vu l'invite de connexion AIX ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite de connexion ne s'affiche pas, voir la rubrique relative aux Incidents de chargement et de démarrage du système d'exploitation. Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le composant. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Passez à l'étape 4. 4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite s'affiche, passez à l'étape 5. Si le menu de sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez comme suit : a. Sélectionnez la ressource, puis appuyez sur Entrée. b. Sélectionnez Validation. c. Suivez les instructions affichées. d. Si le message Souhaitez-vous revoir l'erreur affichée précédemment ? cliquez sur Oui et appuyez sur Entrée. e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services pour obtenir de l'aide.. f. Si aucun code SRN ne s'affiche, passez à l'étape 5. 5. Testez le composant en effectuant les opérations suivantes : a. A l'invite, tapez diag et appuyez sur Entrée. b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester uniquement le composant remplacé ou les périphériques qui y sont raccordés et appuyez sur Entrée. Le menu Action corrective sur ressource s'est-il affiché ? Non : Passez à l'étape 6. Oui : Passez à l'étape 7, à la page 197. 6. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est terminée. v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la 196 Unités de disques réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. Conseil : Le voyant du composant passe de l'état Incident à l'état normal. Passez à l'étape 9. 7. Dans le menu Action de réparation de ressource, sélectionnez la ressource du composant remplacé. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Procédez comme suit pour que le journal des erreurs AIX se mette à jour et indique qu'un composant détectable par le système a été remplacé. Remarque : Sur les systèmes équipés d'un voyant associé au composant défaillant, le voyant passe à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource s'est-il affiché de nouveau ? Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. Oui : Passez à l'étape 8. 8. Dans le menu Action de réparation de ressource, sélectionnez au besoin le parent ou l'enfant de la ressource pour le composant remplacé. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Procédez comme suit pour que le journal des erreurs AIX se mette à jour et indique qu'un composant détectable par le système a été remplacé. Remarque : Le voyant du composant passe de l'état Incident à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. 9. Si vous avez modifié les paramètres du processeur de maintenance ou du réseau (expliqué dans les procédures précédentes), restaurez la valeur initiale des paramètres. 10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ? Non : Passez à l'étape 11. Oui : Passez à l'étape 12. 11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu démarrer le système d'exploitation ? Non : Contactez votre fournisseur de services. La procédure est terminée. Oui : Passez à l'étape 12. 12. Les voyants sont-ils encore allumés ? v Non. La procédure est terminée. v Oui : Désactivez les voyants. Pour obtenir des instructions: voir Modification des voyants de maintenance. Procédures communes pour les fonctions installables 197 Vérification d'un composant installé sur un système Linux ou une partition logique Si vous avez installé un nouveau composant, utilisez les instructions de cette section pour savoir comment vérifier que le système reconnaît ce composant. Pourquoi et quand exécuter cette tâche Pour vérifier un composant que vous venez d'installer ou de remplacer consultez la rubrique «Vérification d'un composant installé via les programmes de diagnostic autonomes». Vérification d'un composant installé via les programmes de diagnostic autonomes Si vous avez installé ou remplacé un composant, vérifiez que le système le reconnaît. Vous pouvez utiliser les programmes de diagnostic autonomes pour vérifier un composant installé sur un système Linux, une unité d'extension ou une partition logique. Avant de commencer v Si le serveur est directement connecté à un autre serveur ou à un réseau, vérifiez que la communication avec les autres serveurs est interrompue. v Les programmes de diagnostic autonomes doivent utiliser l'ensemble des ressources de la partition logique. Aucune autre activité ne peut être exécutée sur la partition. v Les programmes de diagnostic autonomes doivent accéder à la console système. Pourquoi et quand exécuter cette tâche Pour accéder aux programmes de diagnostic, utilisez le CD-ROM approprié ou le serveur NIM (Network Installation Management). La procédure suivante explique comment utiliser les programmes de diagnostic depuis le CD-ROM. Pour obtenir plus d'informations sur l'exécution de diagnostics à partir du serveur NIM, reportez-vous à la rubrique Exécution de diagnostics autonomes à partir d'un serveur NIM (Network Installation Management). Pour utiliser les programmes de diagnostic autonomes, procédez comme suit : Procédure 1. Arrêtez les travaux et les applications, puis arrêtez le système d'exploitation sur le système ou la partition logique. 2. Retirez les bandes, les disquettes et les CD-ROM. 3. Mettez l'unité centrale hors tension. L'étape suivante permet d'initialiser le serveur ou la partition logique à partir du CD-ROM des programmes de diagnostic autonomes. Si l'unité de disques optique n'est pas configurée comme unité d'amorçage sur le serveur ou la partition logique que vous utilisez, procédez comme suit : a. Lancez l'interface ASMI. Pour plus d'informations sur l'utilisation de l'interface ASMI, reportez-vous à la rubrique Accès à l'interface ASMI. b. Dans le menu principal de l'interface ASMI, cliquez sur Contrôle d'alimentation/redémarrage. c. Cliquez sur Mettre le système sous/hors tension. d. Dans le menu déroulant des modes d'initialisation de la partition logique AIX ou Linux, sélectionnez l'option Amorçage du mode service à partir de la liste par défaut. e. Cliquez sur Sauvegarder les paramètres et mettre sous tension. Dès que l'unité de disques optique est sous tension, insérez le CD-ROM des programmes de diagnostic autonomes. f. Passez à l'étape 5, à la page 199. 198 Unités de disques 4. Mettez l'unité centrale sous tension et insérez immédiatement le CD-ROM de programme de diagnostic autonome dans l'unité de disque optique. 5. Après l'affichage du voyant POST clavier sur la console système et avant l'affichage du dernier voyant POST (haut-parleur), appuyez sur la touche numérique 5 de la console système pour indiquer qu'une initialisation en mode service doit être effectuée au moyen de la liste des unités d'amorçage en mode service par défaut. 6. Tapez le mot de passe requis. 7. Dans l'écran Diagnostic Operating Instructions, appuyez sur Entrée. Conseil : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Remarque : Si le système a généré un code SRN ou un autre code de référence alors que vous tentiez de le démarrer, contactez votre fournisseur de service pour obtenir de l'aide. 8. Si le type de terminal est requis, sélectionnez l'option Initialisation du terminal du menu Sélection des fonctions pour initialiser le système d'exploitation. 9. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. 10. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 11. Lorsque le menu Sélection des tests de diagnostic, mode étendu s'affiche, sélectionnez Toutes les ressources ou testez uniquement le composant remplacé ou les unités qui y sont rattachées en sélectionnant les programmes de diagnostic appropriés puis en appuyant sur Entrée. 12. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. v Oui : Passez à l'étape 13. 13. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures précédentes), restaurez la valeur initiale des paramètres. 14. Si les voyants sont toujours allumés, procédez comme suit : a. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement pour désactiver les voyants d'identification et d'avertissement et appuyez sur Entrée. b. Sélectionnez Set System Attention Indicator to NORMAL, puis appuyez sur Entrée. c. Sélectionnez Set All Identify Indicators to NORMAL, puis appuyez sur Entrée. d. Sélectionnez Validation. Remarque : Les voyants d'identification et d'incident passent de l'état Incident à l'état Normal. e. Retournez à la ligne de commande. Vérification d'un composant installé via la console HMC (Hardware Management Console) Si vous avez installé ou remplacé un composant, utilisez la console HMC (Hardware Management Console) pour mettre à jour les enregistrements de la console HMC une fois que vous avez effectué une opération de maintenance sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure. Pourquoi et quand exécuter cette tâche Pour vérifier un composant installé, procédez comme suit : Procédures communes pour les fonctions installables 199 Procédure 1. Sur la console HMC, recherchez d'éventuels événements d'action de maintenance ouverts dans le journal des événements de maintenance. Pour plus d'informations, reportez-vous à la rubrique «Visualisation d'événements réparables», à la page 202. 2. Des événements d'action de maintenance sont-ils ouverts ? Non : Si le voyant d'avertissement système ou d'identification est encore allumé, utilisez la console HMC pour le désactiver. Consultez la rubrique «Activation et désactivation de voyants». La procédure est terminée. Oui : Passez à l'étape suivante. 3. Notez la liste des événements d'action de maintenance ouverts. 4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils les mêmes que ceux que vous avez collectés auparavant ? v Non : Sélectionnez l'une des options suivantes : – Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond et passez à l'étape suivante. – Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez votre fournisseur de services. v Oui : Passez à l'étape suivante. 5. Dans la fenêtre Error Associated With This Serviceable Event, sélectionnez et mettez en évidence l'événement d'action de maintenance. 6. Cliquez sur Close Event. 7. Ajoutez des commentaires sur l'événement réparable. Incluez toute autre information propre à cet événement. Cliquez sur OK. 8. Avez-vous remplacé, ajouté ou modifié une unité remplaçable sur site (FRU) de l'événement d'action de maintenance ouvert ? v Non : Sélectionnez l'option No FRU Replaced for this Serviceable Event et cliquez sur OK pour fermer l'événement d'action de maintenance. v Oui : Procédez comme suit : a. Dans la liste des unités remplaçables sur site, sélectionnez une unité à mettre à jour. b. Cliquez deux fois sur l'unité, puis mettez à jour les informations correspondantes. c. Cliquez sur OK pour refermer l'événement d'action de maintenance. 9. Si les incidents persistent, contactez votre fournisseur de services. Activation et désactivation de voyants Cette procédure permet d'activer ou de désactiver les voyants à l'aide des utilitaires SFP (Service Focal Point) pour la console HMC. Pourquoi et quand exécuter cette tâche Choisissez l'une des options suivantes : v «Désactivation d'un voyant d'avertissement système ou de partition» v «Activation ou désactivation de voyants d'identification», à la page 201 Désactivation d'un voyant d'avertissement système ou de partition : Pourquoi et quand exécuter cette tâche Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement. Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient. 200 Unités de disques Procédure 1. Dans la zone de navigation, ouvrez le dossier Gestion de systèmes. 2. Ouvrez Serveurs et sélectionnez le système approprié. 3. Dans la zone de contenu, cochez la case correspondant à la partition concernée. 4. Sélectionnez Tâches, Opérations, puis Gestion des voyants d'avertissement système. 5. Sélectionnez la partition appropriée. 6. Sélectionnez Désactivation du voyant d'avertissement système dans le menu Action. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Un message de vérification indique que le voyant voyant d'avertissement système a été désactivé. v Un message indique que le système présente encore des incidents non résolus. v Un message indique que le voyant d'avertissement système ne peut pas être activé. 7. Sélectionnez l'une des partitions logiques dans le tableau du bas, puis sélectionnez Désactivation du voyant de partition dans le menu Opérations sur la partition. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Une vérification indique que le voyant de la partition logique a été désactivé. v Un message indique que la partition logique présente encore des incidents non résolus. v Un message indique que le voyant de la partition logique ne peut pas être activé. Activation ou désactivation de voyants d'identification : Pourquoi et quand exécuter cette tâche Le système comporte plusieurs voyants d'identification de composants, tels que les boîtiers ou les unités remplaçables sur site (FRU). On les appelle donc voyants d'identification. Vous pouvez activer ou désactiver les types de voyants d'identification suivants : v Voyant d'identification d'un boîtier Si vous souhaitez ajouter une carte dans un tiroir précis (boîtier), vous devez connaître le type et le modèle de l'ordinateur ainsi que le numéro de série (MTMS) du tiroir. Pour déterminer si vous disposez du MTMS approprié du tiroir nécessitant la nouvelle carte, vous pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le nouveau matériel. v Voyant d'identification d'une unité FRU associée à un boîtier Pour connecter un câble à une carte d'E-S, vous pouvez activer le voyant de la carte s'il s'agit d'une unité FRU, puis vérifier sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports ouverts sur plusieurs cartes. Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme suit : Procédure 1. Dans la zone de navigation, ouvrez l'élément Gestion de systèmes. 2. Sélectionnez Serveurs. 3. Dans la zone de contenu, cochez la case correspondant au système concerné. 4. Sélectionnez Tâches, Opérations, Statut du voyant, puis Voyant d'identification. 5. Pour activer ou désactiver le voyant d'identification d'un boîtier, sélectionnez le boîtier dans le tableau, puis cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. 6. Pour activer ou désactiver le voyant d'identification d'une unité FRU, choisissez un boîtier dans le tableau, puis sélectionnez Sélectionné ? Liste des FRU. 7. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. Procédures communes pour les fonctions installables 201 Visualisation d'événements réparables Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et historique de maintenance. Pourquoi et quand exécuter cette tâche Pour pouvoir consulter des événements réparables et les informations qui leur sont associées, vous devez avoir l'un des rôles suivants : v Super administrateur v Technicien de maintenance v Opérateur v Responsable produit v Visionneur Pour afficher des événements réparables, procédez comme suit : Procédure 1. Dans la zone de navigation, sélectionnez Gestion de la maintenance. 2. Sélectionnez Gestion des événements réparables. 3. Sélectionnez les critères des événements réparables à visualiser, puis cliquez sur OK. La fenêtre Présentation d'un événement réparable s'affiche. La liste comporte les événements réparables correspondant à vos critères de sélection. La barre de menus vous propose des actions réalisables sur les événements réparables. 4. Sélectionnez une ligne dans la fenêtre Présentation d'un événement réparable, puis choisissez Sélectionné ? Affichage des détails La fenêtre Détails de l'événement réparable affiche des informations détaillées sur l'événement réparable. Le tableau du haut comporte des informations telles que le numéro d'incident et le code de référence. Le tableau du bas indique les unités remplaçables sur site (FRU) associées à cet événement. 5. Sélectionnez l'erreur pour laquelle vous souhaitez consulter les commentaires et l'historique, puis procédez comme suit : a. Sélectionnez Actions ? Visualisation des commentaires. b. Pour quitter les commentaires, cliquez sur Fermeture. c. Sélectionnez Actions ? Affichage de l'historique de maintenance. La fenêtre Historique de maintenance comporte l'historique de maintenance associé à l'erreur sélectionnée. d. Pour quitter l'historique de maintenance, cliquez sur Fermeture. 6. Quand vous avez terminé, cliquez sur Annulation dans les fenêtres Détails de l'événement réparable et Présentation de l'événement réparable pour les refermer. Vérification de la présence d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à l'aide des outils Virtual I/O Server Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils dans Virtual I/O Server (VIOS) pour vérifier que ce dispositif ou ce composant est reconnu par le système ou la partition logique. Pourquoi et quand exécuter cette tâche Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé, sélectionnez la procédure appropriée : v Vérification du dispositif installé via VIOS 202 Unités de disques v Vérification du composant de rechange via VIOS Vérifiez le dispositif installé via VIOS : 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez diagmenu puis appuyez sur la touche Entrée. 3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée. 4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures suivantes : v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des ressources et appuyez sur Entrée. v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les ressources et appuyez sur Entrée. 6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en répondant aux invites éventuelles. 7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun incident n’a été détecté ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et revenez en mode de fonctionnement normal. Vérifiez le composant de rechange via VIOS: Pour vérifier le fonctionnement d'un dispositif ou d'un composant de rechange récemment installé, procédez comme suit : 1. Avez-vous remplacé le composant à l'aide d'VIOS ou des opérations de remplacement à chaud du service de diagnostic en ligne ? Non : Passez à l'étape 2. Oui : Passez à l'étape 5, à la page 204. 2. Le système est-il hors tension ? Non : Passez à l'étape 4, à la page 204. Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus d'informations, voir Effectuer une initialisation lente. 3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation VIOS s'affiche ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête. Avez-vous vu l'invite de connexion VIOS ? v No : Si un code SRN ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien fixé. Passez en revue les procédures de remplacement pour vérifier que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite de connexion ne s'affiche pas, voir la rubrique relative aux Incidents de chargement et de démarrage du système d'exploitation. Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le composant. Pour obtenir de l'aide, contactez votre fournisseur de services. Procédures communes pour les fonctions installables 203 v Oui : Passez à l'étape 4. 4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite s'affiche, passez à l'étape 5. Si le menu de sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez comme suit : a. Sélectionnez la ressource, puis appuyez sur Entrée. b. Sélectionnez Validation. c. Suivez les instructions affichées. d. Si un message vous demande si vous souhaitez revoir l'erreur affichée précédemment, cliquez sur Oui et appuyez sur Entrée. e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services pour obtenir de l'aide. f. Si aucun code SRN ne s'affiche, passez à l'étape 5. 5. Testez le composant en effectuant les opérations suivantes : a. Sur la ligne de commande, tapez diagmenu puis appuyez sur la touche Entrée. b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester uniquement le composant remplacé ou les périphériques qui y raccordées et appuyez sur Entrée. Le menu Action corrective sur ressource s'est-il affiché ? Non : Passez à l'étape 6. Oui : Passez à l'étape 7. 6. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est terminée. v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. Conseil : Le voyant du composant passe de l'état Incident à l'état normal. Passez à l'étape 9, à la page 205. 7. Dans le menu Action de réparation de ressource, sélectionnez la ressource du composant remplacé. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Procédez comme suit pour que le journal des erreurs AIX se mette à jour et indique qu'un composant détectable par le système a été remplacé. Remarque : Sur les systèmes équipés d'un voyant associé au composant défaillant, le voyant passe à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource s'est-il affiché de nouveau ? 204 Unités de disques Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. Oui : Passez à l'étape 8. 8. Dans le menu Action de réparation de ressource, sélectionnez au besoin le parent ou l'enfant de la ressource pour le composant remplacé. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Procédez comme suit pour que le journal des erreurs AIX se mette à jour et indique qu'un composant détectable par le système a été remplacé. Remarque : Le voyant du composant passe de l'état Incident à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. 9. Si vous avez modifié les paramètres du processeur de maintenance ou du réseau (expliqué dans les procédures précédentes), restaurez la valeur initiale des paramètres. 10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ? Non : Passez à l'étape 11. Oui : Passez à l'étape 12. 11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu démarrer le système d'exploitation ? Non : Contactez votre fournisseur de services. La procédure est terminée. Oui : Passez à l'étape 12. 12. Les voyants sont-ils encore allumés ? v Non. La procédure est terminée. v Oui : Désactivez les voyants. Pour plus d'informations, consultez l'une des rubriques suivantes : Changement des voyants de maintenance. Procédures communes pour les fonctions installables 205 206 Unités de disques Remarques Le présent document peut contenir des informations ou des références concernant certains produits, logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier lui-même le fonctionnement de tout produit, programme ou service. Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences, veuillez en faire la demande par écrit au fabricant. Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas applicable. Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement. Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels décrits dans ce document. Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites relève de votre seule responsabilité. Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation de sa part, tout ou partie des informations qui lui seront fournies. Les données de performance indiquées dans ce document ont été déterminées dans un environnement contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont applicables à leur environnement d'exploitation. Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces produits. Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif. 207 Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans préavis. Les tarifs appliqués peuvent varier selon les revendeurs. Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées avant la mise à disposition des produits décrits. Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite. Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur n'apparaissent pas à l'écran. Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites, même partiellement, sans l'autorisation écrite d'IBM. Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est exploitable dans aucun autre but. Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension, de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés. Marques IBM, le logo IBM et ibm.com sont des marques d'International Business Machines Corp. aux Etats-Unis et/ou dans certains autres pays. Les autres noms de produits et de services peuvent appartenir à IBM ou des tiers. Une liste à jour des marques IBM est disponible sur Internet : Copyright and trademark information, à l'adresse www.ibm.com/legal/copytrade.shtml. Linux est une marque de Linus Torvalds aux Etats-Unis et/ou dans certains autres pays. Les autres noms de sociétés, de produits et de services peuvent appartenir à des tiers. Bruits radioélectriques Remarques sur la classe A Les avis de conformité de classe A suivants s'appliquent aux serveurs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes. 208 Unités de disques Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. IBM® ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones commerciales et industrielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus : Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI (Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle, il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Remarques 209 Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - République populaire de Chine Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Avis d'interférences électromagnétiques (EMI) - Taïwan Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus. Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer. 210 Unités de disques Avis d'interférences électromagnétiques (EMI) - Corée Cet équipement a obtenu l'agrément EMC en vue de son utilisation commerciale. S'il a été vendu ou acheté par erreur, veuillez l'échanger avec du matériel conforme à un usage domestique. Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Remarques 211 Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Avis d'interférences électromagnétiques (EMI) - Russie Remarques sur la classe B Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones résidentielles. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière. Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet incident en appliquant une ou plusieurs des mesures suivantes : v Réorienter ou repositionner l'antenne de réception. v Eloigner l'appareil du récepteur. v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct. v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de l'aide. Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. 212 Unités de disques Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones résidentielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - Corée Remarques 213 Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Dispositions Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes. Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise, ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. 214 Unités de disques Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données, logiciels ou droits de propriété intellectuelle contenus dans ces publications. Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les instructions susmentionnées n'ont pas été respectées. Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à l'exportation. LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Activation SAS (Serial-Attached SCSI) RAID pour les modèles E2-700, E2-700T, E2-705, E2-705T, E4-700, E4-700T ou E4-705 REFERENCE 86 F1 90FF 03 ESCALA Power7 Activation SAS (Serial-Attached SCSI) RAID pour les modèles E2-700, E2-700T, E2-705, E2-705T, E4-700, E4-700T ou E4-705 Les publications pour ESCALA Power7 concernent les modèles suivants : - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M6-705 (Power 770 / 9117-MMC) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala M7-705 (Power 780 / 9179-MHC) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E1-705 (Power 710 / 8231-E1C) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E3-705 (Power 730 / 8231-E2C) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) - Bull Escala E4-705 (Power 740 / 8205-E6C) Les références au modèle Power 755 / 8236-E8C sont sans objet. Hardware October 2011 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 F1 90FF 03 L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui prohibent, sans s’y limiter, des actions comme la copie, la distribution, la modification et la création de produits dérivés. Copyright Bull SAS 2011 Imprimé en France Marques déposées Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs. Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des marques et/ou des brevets. La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit des marques et/ou des brevets. Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS ne pourra pas être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce manuel, ni pour tout dommage p ouvant résulter de son application. Table des matières Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . . v Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Activation SAS RAID et bloc de batteries de cache pour le modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C. . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Activation SA RAID - Nouveautés. . . . . . . . . . . . . . . . . . . . . . . . . 3 Installation du fond de panier de stockage - 6 baies SFF/DVD SATA (FC 5618) . . . . . 5 Installation des baies d'unité partagée 3x3 sans prise en charge RAID 5/6 (FC 5631 et FC EJ02) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) . . . . . . . . . . . . . . . . . . . . . . . . . . 15 Retrait et remise en place des adaptateurs et batteries SAS RAID . . . . . . . . . . 23 Retrait et remplacement du fond de panier d'unités de disque . . . . . . . . . . . . . . . . . . 23 Retrait du fond de panier d'unités de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C. . . . . 24 Remplacement du fond de panier d'unités de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . 29 Retrait et remplacement du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . 33 Retrait du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . . . . . . 33 Remise en place du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. . . . . . . . . . 44 Considérations sur la maintenance du sous-système SAS 02E/4B ou 05F/6B. . . . . . . . . . . . . . 60 Considérations de maintenance pour le bloc de batterie de cache . . . . . . . . . . . . . . . . 60 Retrait de la carte d'activation RAID 02E/4B ou 05F/6B . . . . . . . . . . . . . . . . . . . . 62 Remplacement de la carte d'activation RAID 02E/4B ou 05F/6B. . . . . . . . . . . . . . . . . . 65 Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . . . . 67 Remplacement du contrôleur de stockage RAID et cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . 70 Retrait et remise en place de la carte de batterie de cache . . . . . . . . . . . . . . . . . . . . 71 Retrait et installation du bloc de batterie de cache . . . . . . . . . . . . . . . . . . . . . . 77 Procédures communes aux dispositifs installables . . . . . . . . . . . . . . . . 81 Avant de commencer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Identification d'un composant défaillant. . . . . . . . . . . . . . . . . . . . . . . . . . 84 Voyants du panneau de commande . . . . . . . . . . . . . . . . . . . . . . . . . . 84 Identification d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . . 86 Localisation d'un composant défaillant sur un système AIX ou une partition logique. . . . . . . . . 86 Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 86 Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 86 Identification d'un composant défaillant sur un système Linux ou une partition logique. . . . . . . . . 87 Localisation d'un composant défaillant sur un système Linux ou une partition logique . . . . . . . . 87 Localisation d'un composant défaillant sur un système ou une partition logique Linux . . . . . . . . 87 Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 88 Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 88 Localisation d'un composant défaillant sur un système ou une partition logique Virtual I/O Server . . . . . 88 Identification d'un composant à l'aide du Virtual I/O Server . . . . . . . . . . . . . . . . . 89 Démarrage du système ou de la partition logique . . . . . . . . . . . . . . . . . . . . . . 89 Démarrage d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . 89 Démarrage d'un système ou d'une partition logique via la console HMC. . . . . . . . . . . . . . 91 Démarrage d'un système ou d'un serveur virtuel via la console SDMC . . . . . . . . . . . . . . 91 iii Arrêt d'un système ou d'une partition logique. . . . . . . . . . . . . . . . . . . . . . . . 91 Arrêt d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . . . 92 Arrêt d'un système via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . . 93 Arrêt d'un système via la console SDMC . . . . . . . . . . . . . . . . . . . . . . . . 94 Retrait et remplacement des capots sur le système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. . . . . . . . . . 94 Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . . . 95 Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B . . . . . . . . . . . . 95 Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . . 96 Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome. . . . . . . . 97 Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . . . 98 Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B . . . . . . . . . . . . 99 Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . . 101 Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B . . . . . . . . . . 102 Mise en position de maintenance ou de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . 104 Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire. . . . 104 Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire . . . 106 Déconnexion des cordons d'alimentation du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . 107 Connexion des cordons d'alimentation au système 02E/4B, 02E/4C, 05F/6B ou 05E/6C . . . . . . . . . 109 Installation d'un dispositif via la console HMC . . . . . . . . . . . . . . . . . . . . . . . 109 Retrait d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . 110 Remplacement d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . 110 Vérification du composant installé . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX 111 Vérification d'un composant installé sur un système Linux ou une partition logique. . . . . . . . . . 114 Vérification d'un composant installé via les programmes de diagnostic autonome . . . . . . . . . . 114 Vérification d'un composant installé à l'aide de la console HMC (Hardware Management Console). . . . . 116 Activation/Désactivation des voyants via la console HMC . . . . . . . . . . . . . . . . . 117 Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 117 Activation ou désactivation d'un voyant d'identification . . . . . . . . . . . . . . . . . 117 Affichage des événements réparables à l'aide de la console HMC . . . . . . . . . . . . . . . 118 Vérification du composant installé via la console SDMC . . . . . . . . . . . . . . . . . . . 119 Activation/Désactivation de voyants via la console de gestion Systems Director . . . . . . . . . . 119 Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 119 Activation ou désactivation de voyants d'identification via la console SDMC . . . . . . . . . . 120 Visualisation d'événements réparables via la console de gestion Systems Director . . . . . . . . . 120 Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à l'aide des outils Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . . . . . . 121 Installation d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . . 124 Remplacement d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . 124 Retrait d'un composant via la console SDMC. . . . . . . . . . . . . . . . . . . . . . . . 125 Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127 Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128 Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128 Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132 Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135 iv Activation SAS (Serial-attached SCSI) RAID Avis aux lecteurs canadiens Le présent document a été traduit en France. Voici les principales différences et particularités dont vous devez tenir compte. Illustrations Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la France. Terminologie La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au besoin. IBM France IBM Canada ingénieur commercial représentant agence commerciale succursale ingénieur technico-commercial informaticien inspecteur technicien du matériel Claviers Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier français-canadien, de type QWERTY. OS/2 - Paramètres canadiens Au Canada, on utilise : v les pages de codes 850 (multilingue) et 863 (français-canadien), v le code pays 002, v le code clavier CF. Nomenclature Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre clavier. v Recommandations à l'utilisateur Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur (instructions d'utilisation, manuels de référence et manuels d'entretien). Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou plusieurs des mesures suivantes : v Réorienter l'antenne réceptrice ; v Déplacer l'équipement par rapport au récepteur ; v Éloigner l'équipement du récepteur ; v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent sur des circuits distincts ; v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien serrés ; v Vérifier la mise en place des obturateurs sur les connecteurs libres. Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des câbles blindés mis à la terre, à travers des filtres si nécessaire. En cas de besoin, adressez-vous à votre détaillant. Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des modifications non autorisées ont été effectuées sur l'équipement. vi Activation SAS (Serial-attached SCSI) RAID L'obligation de corriger de telles interférences incombe à l'utilisateur. Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus amples renseignements. Brevets Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos demandes de renseignements relatives aux permis d'utilisation au directeur général des relations commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7. Assistance téléphonique Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des publications IBM, contactez IBM direct au 1 800 465-1234. Avis aux lecteurs canadiens vii viii Activation SAS (Serial-attached SCSI) RAID Consignes de sécurité Les consignes de sécurité peuvent être imprimées tout au long de ce guide. v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles. v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de certaines circonstances réunies. v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une unité, un système ou des données. Consignes de sécurité relatives au commerce international Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez claires pour vous. Informations sur les appareils à laser Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des lasers ou des diodes électroluminescentes (LED). Conformité aux normes relatives aux appareils à laser Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique. ix DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les cordons d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les cordons d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) x Activation SAS (Serial-attached SCSI) RAID DANGER Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Consignes de sécurité xi ATTENTION : Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment, procédez comme suit : v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette configuration, procédez comme suit : – Retirez toutes les unités de la position 32U et plus. – Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire. – Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en dessous du niveau 32U. v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier. v Vérifiez l'itinéraire envisagé pour éliminer tout risque. v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre armoire. v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm. v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles sont bien fixés. v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute. v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement. v N'utilisez pas de rampe inclinée à plus de dix degrés. v Dès que l'armoire est à son nouvel emplacement, procédez comme suit : – Abaissez les quatre vérins de mise à niveau. – Installez des équerres de stabilisation sur l'armoire. – Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie inférieure à la partie supérieure de l'armoire. v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez l'armoire à celle-ci. (R002) (L001) (L002) xii Activation SAS (Serial-attached SCSI) RAID (L003) ou Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité. ATTENTION : Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM, DVD-RAM ou module à laser. Notez les informations suivantes : v Ne retirez pas les capots. En ouvrant le produit à laser, vous vous exposez au rayonnement dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur. v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation des commandes, ainsi que les procédures décrites dans le présent manuel. (C026) Consignes de sécurité xiii ATTENTION : Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027) ATTENTION : Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028) ATTENTION : Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez connaissance des informations suivantes. Rayonnement laser lorsque le capot est ouvert. Evitez toute exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide d'instruments optiques. (C030) ATTENTION : Cette pile contient du lithium. Pour éviter tout risque d'explosion, n'essayez pas de la recharger et ne la faites pas brûler. Ne pas : v ___ la jeter à l'eau v ___ l'exposer à une température supérieure à 100 °C v ___ chercher à la réparer ou à la démonter Ne la remplacez que par une pile agréée. Pour le recyclage ou la mise au rebut, reportez-vous à la réglementation en vigueur. (C003a) Informations sur l'alimentation électrique et sur le câblage relatives au document GR-1089-CORE du NEBS (Network Equipment-Building System) Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document GR-1089-CORE du NEBS (Network Equipment-Building System) : Cet équipement peut être installé : v dans des infrastructures de télécommunications réseau v aux endroits préconisés dans les directives NEC (National Electrical Code). Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie métallique au câblage du réseau extérieur. Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités. Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif externe de protection contre les surtensions (SPD). Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne de retour de la batterie en courant continu ne doit pas être connectée à la masse. xiv Activation SAS (Serial-attached SCSI) RAID Activation SAS RAID et bloc de batteries de cache pour le modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C Vous pouvez être amené à configurer des disques RAID (Redundant Array of Independent Disks) sur les unités de disque SAS (serial-attached SCSI) du boîtier système. Suivez les instructions de cette rubrique pour installer sur le système les dispositifs d'activation RAID. Les instructions suivantes permettent d'installer des cartes d'activation SAS RAID afin de préparer le système à l'utilisation d'unités de disque SAS dans une configuration RAID. 1 2 Activation SAS (Serial-attached SCSI) RAID Activation SA RAID - Nouveautés Prenez connaissance des nouveautés et des modifications significatives apportées à "Activation SAS RAID" depuis la dernière mise à jour de cet ensemble de rubriques. Octobre 2011 v Ajout d'informations relatives aux serveurs 02E/4C et 05E/6C. 3 4 Activation SAS (Serial-attached SCSI) RAID Installation du fond de panier de stockage - 6 baies SFF/DVD SATA (FC 5618) Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de stockage - 6 baies SFF/DVD SATA. Cette procédure s'applique à l'installation de nouveaux dispositifs. Avant de commencer Avertissement : Si vous effectuez la maintenance d'une pièce défaillante, consultez la procédure de maintenance suivante : v Pour le contrôleur de stockage RAID, voir «Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 67 et «Remplacement du contrôleur de stockage RAID et cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 70. Pourquoi et quand exécuter cette tâche Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Si votre système est géré par la console HMC, utilisez la console HMC pour effectuer la procédure d'installation d'un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 109. Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour exécuter la procédure d'installation d'un dispositif sur le serveur. Pour des instructions, voir «Remplacement d'un composant via la console SDMC», à la page 124. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91). 3. Retirez le capot avant. v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 95. 5 v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 95. 4. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans «Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 104. 5. Retirez le capot d'accès. v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98. v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 99. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 7. Attachez le bracelet antistatique. 6 Activation SAS (Serial-attached SCSI) RAID Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remplacement du fond de panier des unités de disque 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 9. Installez les unités de disque. v Pour l'unité de disque interne, voir Remplacement de l'unité de disque interne dans un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD. 10. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage extra-plate sur le modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 11. Réinstallez le capot d'accès. v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 13. Remettez en place le capot avant. v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. 14. Rebranchez la source d'alimentation. 15. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89). 16. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Installation du fond de panier de stockage - 6 baies SFF/DVD SATA (FC 5618) 7 8 Activation SAS (Serial-attached SCSI) RAID Installation des baies d'unité partagée 3x3 sans prise en charge RAID 5/6 (FC 5631 et FC EJ02) Utilisez cette procédure pour installer les composants inclus avec le dispositif de baies d'unité partagée 3x3 sans prise en charge RAID 5/6. Avant de commencer Remarque : FC 5631 s'applique à 02E/4B ou 05F/6B et FC EJ02 s'applique à 02E/4C ou 05E/6C. Avertissement : Cette procédure s'applique à l'installation d'une capacité de baies d'unité partagée. Pourquoi et quand exécuter cette tâche Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Si votre système est géré par la console HMC, utilisez la console HMC pour effectuer la procédure d'installation d'un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 109 Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour exécuter la procédure d'installation d'un dispositif sur le serveur. Pour des instructions, voir «Remplacement d'un composant via la console SDMC», à la page 124. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91). 3. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans «Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 104. 4. Retirez le capot d'accès. v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98. 9 v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 99. 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 84. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 7. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Identifiez l'emplacement du contrôleur de stockage RAID-10. 10 Activation SAS (Serial-attached SCSI) RAID 9. Alignez le contrôleur de stockage RAID-10 (A) sur l'emplacement dans le système. Figure 1. Emplacements de contrôleur de stockage RAID-10 (P1 à C19) pour 02E/4B ou 05F/6B Figure 2. Emplacements de contrôleur de stockage RAID-10 (P1 à C19) pour 02E/4C ou 05E/6C Installation des baies d'unité partagée 3x3 sans prise en charge RAID 5/6 (FC 5631 et FC EJ02) 11 10. Faites glisser le contrôleur de stockage RAID-10 dans le système jusqu'au verrouillage du loquet (B), comme indiqué dans l'illustration. 11. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Retrait du fond de panier des unités de disque 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 12. Installez les unités de disque. v Pour l'unité de disque interne, voir Remplacement de l'unité de disque interne dans un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD. 13. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage extra-plate sur le modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 14. Installez le port SAS externe. Pour plus d'informations, voir Installation du port SAS externe sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 15. Réinstallez le capot d'accès. v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 16. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 17. Remettez en place le capot avant. v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. Figure 3. Modèle monté en armoire : remplacement du contrôleur de stockage RAID-10 12 Activation SAS (Serial-attached SCSI) RAID 18. Rebranchez la source d'alimentation. 19. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89). 20. Vérifiez le composant installé ou réparé. v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Installation des baies d'unité partagée 3x3 sans prise en charge RAID 5/6 (FC 5631 et FC EJ02) 13 14 Activation SAS (Serial-attached SCSI) RAID Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double. Cette procédure s'applique à l'installation de nouveaux dispositifs. Avant de commencer Remarque : FC 5630 s'applique à 02E/4B ou 05F/6B et FC EJ01 s'applique à 02E/4C ou 05E/6C. Avertissement : Si vous effectuez la maintenance d'un composant défaillant, voir les procédures de maintenance suivantes : v Pour la carte d'activation, voir Retrait de la carte d'activation et Remise en place de la carte d'activation. v Pour les contrôleurs de stockage RAID et Cache, voir Retrait du contrôleur de stockage et Remise en place du contrôleur de stockage. Pourquoi et quand exécuter cette tâche Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC, utilisez la console HMC pour effectuer la procédure d'installation d'un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 109. Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour exécuter la procédure d'installation d'un dispositif sur le serveur. Pour des instructions, voir «Remplacement d'un composant via la console SDMC», à la page 124. Si vous ne disposez pas d'une console ou SDMC, procédez comme suit. Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81. 2. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 84. 3. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91). 15 4. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans «Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 104. 5. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 6. Retirez le capot d'accès. v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98. v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 99. 7. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 16 Activation SAS (Serial-attached SCSI) RAID 8. Pour le système 02E/4B ou 05F/6B, localisez la carte d'activation RAID (P1–C13), le contrôleur de stockage RAID/cache (P1-C19) et la carte de batterie de cache (P1-C14) et passez à l'étape 10, à la page 18. 9. Pour le système 02E/4C ou 05E/6C, localisez le contrôleur de stockage RAID/cache (P1-C19) et la carte de batterie de cache (P1-C14) et passez à l'étape 11, à la page 18. Figure 4. Emplacement de la carte d'activation RAID, des contrôleurs de stockage RAID et cache et de la carte de batterie de cache Figure 5. Emplacement du contrôleur de stockage RAID et cache et de la carte de batterie de cache Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) 17 10. Installez la carte d'activation RAID pour 02E/4B ou 05F/6B en procédant comme suit : Remarque : Le système 02E/4C ou 05E/6C ne dispose pas de la carte d'activation. a. Alignez la carte d'activation RAID (A) sur l'emplacement (B) dans le système. b. Faites glisser la carte d'activation RAID dans le système jusqu'au verrouillage des loquets, comme indiqué dans l'illustration. 11. Installez les contrôleurs de stockage RAID et cache en procédant comme suit : a. Alignez les contrôleurs de stockage RAID et cache (A) sur l'emplacement dans le système. b. Faites glisser les contrôleurs de stockage RAID et cache dans le système jusqu'au verrouillage du loquet (B), comme indiqué dans l'illustration. Figure 6. Modèle monté en armoire : remplacement de la carte d'activation RAID 18 Activation SAS (Serial-attached SCSI) RAID 12. Installez la carte de batterie de cache en procédant comme suit : a. Faites glisser la carte de batterie de cache (A) dans les glissières de montage du contrôleur jusqu'à ce qu'elle s'insère dans le connecteur de batterie. b. Lorsque la batterie est placée dans le connecteur, déplacez le levier vers la position de verrouillage afin d'insérer complètement la batterie dans le connecteur. Figure 7. Modèle monté en armoire : remplacement des contrôleurs de stockage RAID et cache Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) 19 13. Retirez les unités de disque. v Pour les unités de disque internes, voir Retrait de l'unité de disque interne dans un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. v Pour les unités SSD, voir Retrait d'une unité de disque ou d'une unité SSD. 14. Retirez le fond de panier des unités de disque [CCIN 2BD5]. Pour des instructions, voir Retrait du fond de panier des unités de disque 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 15. Installez le fond de panier de l'unité de disque à haute fonctionnalité [CCIN 2BD6]. Pour des instructions, voir Remplacement du fond de panier des unités de disque 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 16. Remplacez les unités de disque dans le fond de panier d'unité de disque à haute fonctionnalité. v Pour les unités de disque internes, voir Remplacement de l'unité de disque interne dans un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. v Pour les unités SSD, voir Installation d'une unité de disque ou d'une unité SSD. 17. Remettez en place le capot d'accès de maintenance. Voir v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 18. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 19. Remettez en place le capot avant. v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. Figure 8. Modèle monté en armoire : remplacement de la carte de batterie de cache RAID 20 Activation SAS (Serial-attached SCSI) RAID 20. Rebranchez la source d'alimentation. 21. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89). 22. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Installation du fond de panier de stockage - 8 baies SFF/RAID 175 Mo/Adaptateur d'E-S double (FC 5630 et FC EJ01) 21 22 Activation SAS (Serial-attached SCSI) RAID Retrait et remise en place des adaptateurs et batteries SAS RAID Il se peut que vous deviez retirer ou remettre en place les adaptateurs SAS RAID et les batteries. Cette section comprend les procédures relatives à ces tâches. Remarque : Le remplacement du dispositif incombe à l'utilisateur. Vous pouvez soit exécuter la tâche vous-même, soit faire appel à un prestataire agréé. Il vous imputera probablement les frais correspondants. Retrait et remplacement du fond de panier d'unités de disque Utilisez ces procédures pour retirer, remplacer ou installer le fond de panier d'unités de disques sur les serveurs 7/20 (02E/4B et 02E/4C) et 7/40 (05F/6B et 05E/6C). Les figures suivantes représentent les fonds de panier d'unité de disque d'un système monté en armoire et d'un système autonome. Figure 9. Fond de panier d'unités de disques d'un système monté en armoire 23 Retrait du fond de panier d'unités de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C Cette procédure permet de retirer un fond de panier d'unités de disque du serveur. Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour retirer le fond de panier d'unité de disque du serveur. Pour obtenir des informations sur l'utilisation de la console HMC en vue de retirer le fond de panier d'unité de disque, voir Retrait d'un composant via la console HMC. Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour retirer le fond de panier d'unité de disque du serveur. Pour des instructions, voir Retrait d'un composant via la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de panier d'unité de disque : Avertissement : Veillez à retirer le panneau de commande avant le fond de panier d'unité de disque pour ne pas endommager les câbles du panneau de commande. Procédure 1. Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de commande. Figure 10. Fond de panier d'unités de disques d'un système autonome 24 Activation SAS (Serial-attached SCSI) RAID 2. Utilisez les voyants des indicateurs de service pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 84. 3. Effectuez les tâches prérequises (voir «Avant de commencer», à la page 81). 4. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91). 5. Retirez le capot avant : v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 95. v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 95. 6. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué à la section «Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 104. 7. Débranchez le système pour l'isoler de la source d'alimentation, comme décrit dans «Déconnexion des cordons d'alimentation du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 107. Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 8. Retirez le capot d'accès : v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98. v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 99. 9. Attachez le bracelet antistatique. Retrait et remise en place des adaptateurs et batteries SAS RAID 25 Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 10. Si vous devez remplacer le fond de panier en vue d'une mise à niveau ou d'un incident système, procédez comme suit afin de retirer les unités de disques, en fonction de votre fond de panier. Remarque : Si vous retirez un fond de panier comportant des unités de disques, le fond de panier est lourd. Assurez-vous de pouvoir effectuer cette procédure en toute sécurité. a. Pour retirer l'unité de disque du fond de panier, procédez comme suit: Remarque : Notez l'emplacement de chaque unité de disque retirée, de manière à remettre en place les unités de disque à leur emplacement d'origine. 1) Déverrouillez la poignée de l'unité de disques (A) et tirez-la vers vous pour la dégager. 2) Placez une main sous l'unité de disques et tirez avec précaution l'unité de disques du système, comme indiqué dans la figure 11. 11. Retirez l'unité de stockage en appuyant sur le loquet (A) et en faisant glisser l'unité en dehors du système, comme indiqué dans la figure suivante. Figure 11. Retrait de l'unité de disque du fond de panier 26 Activation SAS (Serial-attached SCSI) RAID Pour plus d'informations sur le retrait des différents types d'unités de stockage, voir Retrait et remplacement d'unités de stockage sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 12. Débranchez les câbles situés à l'arrière du fond de panier d'unités de disque. Si vous avez connecté un câble SAS (serial attached SCSI), pensez à débrancher ce câble du fond de panier d'unités de disques. Pour plus d'informations, voir Installation du port SAS externe sur un modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 13. Déverrouillez le fond de panier de l'unité de disques en tirant les poignées (A) afin de les mettre en position ouverte, au centre du fond de panier, comme indiqué dans la figure 13, à la page 28 et figure 14, à la page 28. Figure 12. Retrait de l'unité de stockage du fond de panier d'unité de disque Retrait et remise en place des adaptateurs et batteries SAS RAID 27 14. Tirez le fond de panier en dehors du serveur. Figure 13. Retrait du fond de panier comportant des unités de disque Figure 14. Retrait du fond de panier ne comportant pas d'unités de disque 28 Activation SAS (Serial-attached SCSI) RAID Que faire ensuite Si vous avez retiré le fond de panier dans le cadre d'une autre procédure, reprenez cette procédure. Pour remplacer le fond de panier ou en installer un nouveau, voir Remplacement du fond de panier d'unités de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C. Remplacement du fond de panier d'unités de disques sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C Utilisez cette procédure pour remplacer un fond de panier d'unités de disque. Pourquoi et quand exécuter cette tâche Si vous remplacez un fond de panier d'unités de disques dans le cadre d'une procédure de maintenance et souhaitez le retirer, voir sur les modèles sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C. Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur. Pour plus d'informations, voir la rubrique Remplacement d'un composant via la console HMC. Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur. Pour obtenir des instructions, voir la rubrique Remplacement d'un composant via la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de panier d'unité de disque : Procédure 1. Vérifiez la présence éventuelle d'un capot de protection sur les broches pour la partie inférieure médiane à l'arrière du fond de panier. Si un capot de protection est installé, retirez-le. 2. Vérifiez les taquets de verrouillage (A) visibles dans les figures ci-dessous. Si les taquets sont en position verrouillée, ouvrez-les. 3. Soulevez délicatement le fond de panier sur les deux côtés et alignez-le avec son emplacement dans le serveur. 4. Insérez fermement le fond de panier dans le serveur. 5. Fixez le fond de panier à l'aide des taquets de verrouillage (A), comme indiqué dans les figures suivantes. Retrait et remise en place des adaptateurs et batteries SAS RAID 29 6. Réinstallez les unités de disques si vous les avez préalablement retirées. Utilisez les informations que vous avez enregistrées lors du retrait des unités pour les installer dans les emplacements appropriés. Pour remplacer les unités de disque, procédez comme suit : Figure 15. Installation d'un fond de panier sans unités de disque dans un modèle monté en armoire Figure 16. Installation d'un fond de panier doté d'unités de disque dans un modèle monté en armoire 30 Activation SAS (Serial-attached SCSI) RAID a. Tenez l'unité de disque par les bords hauts et bas pour insérer le lecteur dans l'unité d'extension. Ne la tenez pas par la poignée. b. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis verrouillez l'unité dans son emplacement en poussant la poignée, comme indiqué dans la figure suivante. 7. Réinstallez l'unité de stockage extra-plate en l'insérant dans l'emplacement et en la poussant jusqu'à ce que le levier (B) soit verrouillé, comme l'illustre la figure suivante. Pour plus d'informations sur le remplacement des différents types d'unités de stockage, voir Retrait et remplacement d'unités de stockage sur les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C. Figure 17. Installation des unités de disque d'un fond de panier Retrait et remise en place des adaptateurs et batteries SAS RAID 31 8. Le cas échéant, connectez le câble au fond de panier. Pour plus d'informations, voir Installation du port SAS externe sur un modèle 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 9. Le cas échéant, installez la carte d'activation RAID. Pour plus d'informations, voir Activation SAS RAID et bloc de batteries de cache pour les modèles 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 10. Remettez le capot d'accès en place : v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 11. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 12. Connectez les cordons d'alimentation au système, comme décrit dans Connexion des cordons d'alimentation. 13. Remettez le capot avant en place : v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. 14. Démarrez le système ou la partition logique. Voir «Démarrage du système ou de la partition logique», à la page 89. 15. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Figure 18. Installation de l'unité de stockage 32 Activation SAS (Serial-attached SCSI) RAID Retrait et remplacement du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C Découvrez comment retirer ou remplacer le fond de panier système. Pourquoi et quand exécuter cette tâche Les procédures suivantes expliquent comment retirer et remplacer le fond de panier système. Retrait du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C Prenez connaissance des instructions de retrait du fond de panier système d'un serveur en vue du remplacement d'un composant défaillant ou dans le cadre d'une autre procédure de maintenance. Avant de commencer Remarques : v Avant de commencer cette procédure, vérifiez que vous disposez des droits de connexion en tant que fournisseur de services agréé. Les différents niveaux d'autorisation sont décrits dans Niveaux d'autorisation ASMI. v Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour de plus amples informations, voir Voyant du panneau de commande. Pourquoi et quand exécuter cette tâche Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer le fond de panier système. Pour plus d'informations, voir la rubrique Remplacement d'un composant à l'aide de la console HMC. Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez-la pour retirer le fond de panier système. Pour obtenir des instructions, voir Remplacement d'un composant à l'aide de la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de panier système : Procédure 1. Accédez à l'interface ASMI. L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI, passez à l'étape suivante. 2. Vérifiez et enregistrez le type de boîtier et le code dispositif. Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre placée sur l'unité centrale. Remarque : Le code dispositif de fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C est 78AA-001. 3. Vérifiez et enregistrez les paramètres de l'unité carte Ethernet hôte (HEA) serveur. Retrait et remise en place des adaptateurs et batteries SAS RAID 33 v Pour la console HMC : a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation. b. Cliquez deux fois sur le dossier Serveurs. c. Sélectionnez un serveur dans le panneau central. d. Dans le menu Tâches, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La fenêtre Cartes Ethernet hôte apparaît. e. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du port physique de la carte HEA apparaît. f. Enregistrez la valeur dans la zone Pending Port Group Multi-Core Scaling value. v Pour la console SDMC : a. Allez dans l'onglet Resources et cliquez sur Hosts. b. Sélectionnez un serveur dans le panneau de droite. c. Dans le menu Actions, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La fenêtre Cartes Ethernet hôte apparaît. d. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du port physique de la carte HEA apparaît. e. Enregistrez la valeur dans la zone Pending Port Group Multi-Core Scaling value. 4. Vérifiez et notez les paramètres du serveur. Avant de remplacer cette unité remplaçable sur site (FRU), vérifiez et enregistrez tous les paramètres du serveur étant donné que vous devrez peut-être mettre à jour le système après avoir remplacé l'unité FRU. Cela peut être effectué via l'interface ASMI (Advanced System Management Interface). a. Recherchez et notez le niveau de microprogramme du serveur indiqué sur le bord droit de la fenêtre d'état, la zone dans laquelle se trouve le bouton Déconnexion. Vous devrez peut-être mettre à jour le système avec le niveau de code de microprogramme de serveur le plus récent après avoir remplacé l'unité FRU. b. Vérifiez et notez les paramètres du processeur de service que vous avez précédemment définis à l'aide de l'interface ASMI. Vous devrez réinitialiser les paramètres du processeur de service après la mise à jour de cette unité FRU, faute de quoi les paramètres par défaut seront utilisés. Vous devez noter les paramètres suivants : v Paramètre de nom de système v Paramètres de contrôle de l'alimentation système v Paramètres de l'aide à la maintenance système v Paramètres de configuration système v Paramètres des services réseau v Paramètres des options de performances v Paramètres de profil de connexion v Paramètres de liste des unités d'amorçage v Paramètres de carte Ethernet hôte c. Notez tous les paramètres du processeur de service que vous avez éventuellement définis à l'aide des commandes du système d'exploitation. 5. Arrêtez le système ou la partition logique. Pour obtenir des instructions, voir Arrêt du système ou de la partition logique. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système est équipé en option d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) 34 Activation SAS (Serial-attached SCSI) RAID ou 7. Etiquetez et débranchez les câbles externes connectés à l'unité remplaçable sur site (FRU) dont vous effectuez la maintenance. 8. Placez le système en position de maintenance. Pour des instructions, voir Mise en position de maintenance d'un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire. 9. Retirez le capot d'accès de maintenance : v Pour un système monté en armoire, voir Retrait du capot d'accès sur un système monté en armoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C. v Pour un système autonome, voir Retrait du capot d'accès sur un système autonome 02E/4B, 02E/4C ou 05F/6B. 10. Retirez la carte d'extension de mémoire. Pour des instructions, voir Retrait de la carte d'extension de mémoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 11. Retirez les cartes GX. Pour des instructions, voir Retrait des adaptateurs GX pour 02E/4B, 02E/4C, 05F/6B, ou 05E/6C. 12. Retirez les cartes PCI. Pour des instructions, voir Retrait hors tension d'une carte PCI sur le serveur 02E/4B, 02E/4C, 05F/6B ou 05E/6C.. 13. Retirez la carte Ethernet hôte (02E/4B ou 05F/6B). Pour obtenir des instructions, voir Retrait hors tension de l'adaptateur Ethernet hôte 02E/4B ou 05F/6B. 14. Retirez tous les régulateurs de tension. Pour des instructions, voir Retrait du régulateur de tension 02E/4B, 02E/4C, 05F/6B, ou 05E/6C. 15. Retirez les ventilateurs intermédiaires. Pour des instructions, voir Retrait des ventilateurs intermédiaires 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 16. Retirez les blocs d'alimentation. Pour des instructions, voir Retrait hors tension des blocs d'alimentation 02E/4B, 02E/4C, 05F/6B, ou 05E/6C ou Retrait sous tension des blocs d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 17. Débranchez les câbles internes du fond de panier. Retrait et remise en place des adaptateurs et batteries SAS RAID 35 Retirez le câble SPCN à l'arrière du châssis (02E/4C ou 05E/6C). 18. Retirez le boîtier de l'unité de disque : a. Saisissez les loquets de l'unité de disque et tirez dessus. b. Saisissez les poignées, puis tirez le boîtier de l'unité de disque vers vous. Figure 19. Débranchement des câbles internes 36 Activation SAS (Serial-attached SCSI) RAID 19. Retirez le répartiteur d'air de la carte PCI. Saisissez fermement le répartiteur d'air de la carte PCI et soulevez-le. Figure 20. Retrait du boîtier de l'unité de disque Figure 21. Retrait du répartiteur d'air de la carte PCI Retrait et remise en place des adaptateurs et batteries SAS RAID 37 20. Retirez la carte PCI (en option) : a. Desserrez les deux vis moletées (A) en les tournant dans le sens inverse des aiguilles d'une montre. b. Déverrouillez les taquets de la carte PCI (B) en appuyant sur leurs extrémités et en les soulevant. c. Retirez la carte PCI en la soulevant. 21. Retirez le bouclier du boîtier de l'unité de disque, le régulateur de tension et le répartiteur d'air. a. Retirez les cinq vis qui maintiennent le bouclier du boîtier de l'unité de disque, le répartiteur d'air et le régulateur de tension. b. Retirez le bouclier du boîtier de l'unité de disque, le régulateur de tension et le répartiteur d'air en les soulevant. Figure 22. Retrait de la carte PCI 38 Activation SAS (Serial-attached SCSI) RAID 22. Retirez le logement de barrette DIMM : a. Retirez les quatre vis qui maintiennent le boîtier de barrette DIMM (vue A). Figure 23. Retrait du boîtier de l'unité de disque (02E/4B ou 05F/6B) Figure 24. Retrait du boîtier de l'unité de disque (02E/4C ou 05E/6C) Retrait et remise en place des adaptateurs et batteries SAS RAID 39 b. Retirez le boîtier de barrette DIMM en le soulevant. 23. Débranchez le câble du ventilateur avant. Figure 25. Retrait du boîtier de barrette DIMM 40 Activation SAS (Serial-attached SCSI) RAID 24. Retirez le boîtier du ventilateur intermédiaire : a. Retirez les vis (B) qui maintiennent le boîtier de ventilation. b. Retirez les deux vis (A) qui maintiennent le boîtier de ventilation à l'extérieur du boîtier du système. c. Retirez le boîtier de ventilation en le soulevant. Figure 26. Retrait du câble du ventilateur avant Retrait et remise en place des adaptateurs et batteries SAS RAID 41 25. Retirez le commutateur de verrouillage d'alimentation. Pour des instructions, voir Retrait du commutateur de verrouillage d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 26. Retrait du module du processeur système. Pour des instructions, voir Retrait du module processus système 02E/4B, 02E/4C, 05F/6B, ou 05E/6C. 27. Retirez le support métallique du régulateur de tension. a. Localisez les trois vis qui maintiennent le support sur le côté du système. b. Retirez les trois vis. 28. Retirez le fond de panier système. Remarque : Retirez le fond du panier système avec précaution. Ne le soulevez pas à l'aide des modules fixés. a. Etiquetez et débranchez tous les câbles Ethernet qui sont présents. b. Retirez les quatre vis (A). Remarque : Ne retirez aucune des vis Torx. Retirez uniquement les quatre vis à tête hexagonale et à fente. Figure 27. Retrait du boîtier du ventilateur intermédiaire 42 Activation SAS (Serial-attached SCSI) RAID c. Saisissez le fond de panier système par les deux bras de chargement du dissipateur thermique (B). Figure 28. Retrait du fond de panier système (02E/4B ou 05F/6B) Figure 29. Retrait du fond de panier système Retrait et remise en place des adaptateurs et batteries SAS RAID 43 d. Soulevez l'avant du fond de panier système pour dégager les chevilles de positionnement (6,35 mm environ). e. Faites coulisser le fond de panier vers l'avant du système de façon à ce que ses connecteurs laissent apparaître le châssis arrière. f. Une fois les connecteurs dégagés du châssis, soulevez l'arrière du fond de panier système et retirez ce dernier en l'inclinant. Remise en place du fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C Informations sur le remplacement du fond de panier système dans le cadre d'une opération de maintenance. Pourquoi et quand exécuter cette tâche Avant d'effectuer cette procédure, vous devez avoir effectué le retrait. Pour connaître les instructions de la procédure de retrait, voir Retrait du fond de panier système 02E/4B, 02E/4C, 05F/6B, ou 05E/6C. Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour remplacer le fond de panier système. Pour plus d'informations, voir la rubrique Remplacement d'un composant à l'aide de la console HMC. Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez-la pour remplacer le fond de panier système. Pour obtenir des instructions, voir Remplacement d'un composant à l'aide de la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de panier système : Avertissement : Attachez un bracelet antistatique à une surface métallique non peinte de votre matériel afin d'empêcher d'éventuelles décharges électrostatiques d'endommager celui-ci. Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer le produit de son emballage antistatique et d'installer ou de remettre en place le matériel. Procédure 1. Remplacez le fond de panier système. Remarque : Remplacez le fond de panier système avec précaution. a. Saisissez avec précaution le fond de panier système de remplacement par les deux bras de chargement du dissipateur thermique (A). b. Abaissez le fond de panier système vers l'avant du boîtier du système. c. Abaissez l'arrière du fond de panier système, puis faites-le glisser vers l'arrière du système jusqu'à ce que les connecteurs laissent apparaître les ouvertures dans le boîtier (voir illustration). d. Abaissez l'avant du fond de panier système sur les broches d'alignement (B). 44 Activation SAS (Serial-attached SCSI) RAID e. Serrez les quatre vis (C) qui maintiennent le fond de panier système au boîtier du système. f. Rebranchez tous les câbles Ethernet qui ont été précédemment retirés. Figure 30. Installation du fond de panier système (02E/4B ou 05F/6B) Figure 31. Installation du fond de panier système Retrait et remise en place des adaptateurs et batteries SAS RAID 45 2. Installez le support métallique du régulateur de tension. a. Alignez les trois trous du support sur les trois trous du côté du système. b. Serrez les trois vis. 3. Remplacez le module processeur système. Pour des instructions, voir Remplacement du module processus système 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 4. Remplacez le commutateur de verrouillage d'alimentation. Pour des instructions, voir Remplacement du commutateur de verrouillage d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 5. Remplacez le boîtier de ventilation : a. Abaissez le boîtier de ventilation pour le mettre en position, en vous assurant que la rainure d'alignement (A) et la broche d'alignement (B) sont correctement alignés. b. Serrez les deux vis (D) qui maintiennent le boîtier de ventilation au boîtier système. c. Serrez les vis (C) qui maintiennent le boîtier de ventilation. 6. Connectez le câble du ventilateur avant. Figure 32. Installation du boîtier de ventilation 46 Activation SAS (Serial-attached SCSI) RAID 7. Remplacez le logement de barrette DIMM : a. Abaissez le boîtier de barrette DIMM pour le mettre en place. b. Insérez et serrez les quatre vis qui maintiennent le boîtier de barrette DIMM (Vue A). Figure 33. Connexion du câble du ventilateur avant Retrait et remise en place des adaptateurs et batteries SAS RAID 47 8. Remplacez le guide plastique du régulateur de tension (A). a. Abaissez le guide plastique pour le mettre en place. b. Insérez et serrez les trois vis qui maintiennent le guide. 9. Remplacez le boîtier de l'unité de disque (B). a. Abaissez le boîtier de l'unité de disque pour le mettre en place. b. Insérez et serrez les deux vis qui maintiennent le boîtier de l'unité de disque. Figure 34. Remplacement du boîtier de barrette DIMM 48 Activation SAS (Serial-attached SCSI) RAID 10. Remplacez la carte de connexion PCI : a. Abaissez la carte de connexion PCI pour la mettre en place. Figure 35. Remplacement du boîtier de l'unité de disque (02E/4B ou 05F/6B) Figure 36. Remplacement du boîtier de l'unité de disque (02E/4C ou 05E/6C) Retrait et remise en place des adaptateurs et batteries SAS RAID 49 b. Appuyez sur les loquets de la carte de connexion PCI (A) jusqu'à ce qu'ils s'enclenchent. c. Serrez les deux vis moletées (D). 11. Remplacez le répartiteur d'air de la carte de connexion PCI. a. Abaissez le répartiteur d'air de la carte de connexion PCI et appuyez dessus pour le mettre en place. Figure 37. Remplacement de la carte de connexion PCI 50 Activation SAS (Serial-attached SCSI) RAID 12. Remplacez le boîtier de l'unité de disque : a. Les loquets de l'unité de disque étant ouverts, poussez le boîtier de l'unité de disque à l'intérieur du système. b. Poussez les loquets de l'unité de disque vers le système jusqu'à ce qu'ils soient verrouillés. Figure 38. Remplacement du répartiteur d'air de la carte de connexion PCI Retrait et remise en place des adaptateurs et batteries SAS RAID 51 13. Connectez les câbles internes au fond de panier. Replacez le câble SPCN à l'arrière du châssis (02E/4C ou 05E/6C). Figure 39. Remplacement du boîtier de l'unité de disque Figure 40. Connexion des câbles internes 52 Activation SAS (Serial-attached SCSI) RAID 14. Remplacez les blocs d'alimentation. Pour des instructions, voir Remplacement hors tension des blocs d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C ou Remplacement sous tension des blocs d'alimentation 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 15. Remplacez les ventilateurs. Pour des instructions, voir Remplacement des ventilateurs 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 16. Remplacez la carte d'extension de mémoire. Pour des instructions, voir Remplacement de la carte d'extension de mémoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 17. Remplacez tous les régulateurs de tension. Pour des instructions, voir Remplacement du régulateur de tension 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 18. Remplacez la carte Ethernet hôte (02E/4B ou 05F/6B). Pour obtenir des instructions, voir Remplacement hors tension de la carte Ethernet hôte02E/4B ou 05F/6B. 19. Remplacez les adaptateurs GX. Pour des instructions, voir Remplacement des adaptateurs GX pour 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 20. Remplacez les cartes PCI. Pour des instructions, voir Remplacement hors tension d'une carte PCI sur le serveur 02E/4B, 02E/4C, 05F/6B ou 05E/6C. 21. Réinstallez le capot d'accès : v Pour un système monté en armoire, voir Installation du capot d'accès sur un système monté en armoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C. v Pour un système autonome, voir Installation du capot d'accès sur un système autonome 02E/4B, 02E/4C ou 05F/6B. 22. Remettez en place le capot avant : v Pour un système monté en armoire, voir Installation du capot avant sur un système monté en armoire 02E/4B, 02E/4C, 05F/6B ou 05E/6C. v Pour un système autonome, voir Installation du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B. 23. Placez le système en position de fonctionnement. Pour des instructions, voir Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire. 24. Démarrez le système ou la partition logique. Voir Démarrage du système ou de la partition logique. 25. Restaurez la connectivité réseau. Le processeur de service doit être connecté au réseau pour être accessible à partir des console de gestion. Comment l'adresse IP du processeur de service sera-t-elle gérée ? Obtenir automatiquement une adresse IP à partir d'un serveur DHCP (Dynamic Host Configuration Protocol) Passez à l'étape suivante. Configurer une adresse IP statique Passez à l'étape 27, à la page 54. 26. Restaurez les paramètres du processeur de service. Activez l'accès réseau sur le processeur de service. Si la connexion réseau utilise le protocole DHCP (Dynamic Host Configuration Protocol) pour établir une adresse IP, et utilise les Consoles de gestion comme serveur DHCP, aucune configuration supplémentaire n'est nécessaire pour activer l'accès réseau sur le processeur de service. Procédez comme suit : a. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant que le câble console de gestion est connecté au port console de gestion du fond de panier système. b. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises. Remarque : ne démarrez pas le système à ce stade. Pouvez-vous accéder au réseau de maintenance ? Oui Non Retrait et remise en place des adaptateurs et batteries SAS RAID 53 Passez à l'étape 29. Passez à l'étape 32, à la page 55. 27. Restaurez les paramètres du processeur de service. L'interface ASMI (Advanced System Management Interface) est l'interface d'accès au processeur de service qui est requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du produit, la configuration du processeur de service, le contrôle de l'alimentation système et la configuration des ports réseau du processeur de service. L'interface ASMI est également désignée comme l'ensemble des menus du processeur de service. L'interface ASMI est accessible via HTTPS. La connexion réseau utilisant des affectations d'adresses IP statiques, procédez comme suit. a. Pour identifier les ports réseau du processeur de service, vous pouvez utiliser le panneau de commande en mode manuel. Reportez-vous aux fonctions de maintenance pour savoir comment utiliser le panneau de commande dans ce cas. b. Connectez un client disposant d'un navigateur Web directement aux ports réseau du processeur de service que vous avez notés précédemment. Il peut s'agir d'une URL similaire aux exemples suivants : v https://169.254.2.147 v https://169.254.3.147 c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises. Remarque : ne démarrez pas le système à ce stade. d. Connectez-vous à l'interface ASMI à l'aide de l'ID utilisateur admin et du mot de passe par défaut admin. e. A l'invite, modifiez le mot de passe de l'ID administrateur et le mot de passe de l'ID utilisateur général. f. Pour configurer l'accès réseau, procédez comme suit. 1) Cliquez sur Configuration de réseau sous le noeud Services réseau. 2) Dans le panneau de droite, configurez les interfaces réseau eth0 et eth1, sélectionnez Statique comme Type d'adresse IP, puis définissez un nom d'hôte, une adresse IP et un masque de sous-réseau. 28. Restaurez les paramètres du processeur système. La connexion réseau étant maintenant configurée pour utiliser des affectations d'adresses IP statiques, essayez d'accéder aux ports réseau du processeur de service. Pouvez-vous accéder au réseau de maintenance ? Oui Non Passez à l'étape suivante. Passez à l'étape 33, à la page 55. 29. Lancez l'interface ASMI. L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI, passez à l'étape suivante. 30. Définissez le type de boîtier et le code dispositif. Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre placée sur l'unité centrale. Lorsque le fond de panier système est remplacé, le numéro de série à sept chiffres du boîtier et le code dispositif doivent être entrés manuellement et stockés dans le nouveau fond de panier système à l'aide de l'interface ASMI (Advanced System Management Interface). 54 Activation SAS (Serial-attached SCSI) RAID Remarque : Le code dispositif de fond de panier du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C est 78AA-001. Remarque : Pour effectuer cette opération, vous devez disposer du niveau d'autorisation Prestataire de services agréé (ID : celogin). Pour entrer le numéro de série du système : a. Dans la fenêtre d'accueil de l'interface ASMI, indiquez votre ID utilisateur et votre mot de passe, et cliquez sur Connexion. b. Dans la zone de navigation, développez Configuration système et Données techniques essentielles du programme. c. Sélectionnez Boîtiers du système. Dans le panneau de droite, les boîtiers du système actuel s'affichent. d. Sur le système concerné, enregistrez le numéro de série du boîtier imprimé l'étiquette code barre apposée sur l'unité centrale. e. Entrez le numéro de série du boîtier enregistré dans la zone Numéro de série du boîtier de l'unité centrale. f. Entrez le numéro du code dispositif dans la zone Code dispositif/Numéro de séquence. g. Cliquez sur Sauvegarder les paramètres pour mettre à jour et enregistrer le numéro de série du boîtier et le code dispositif. 31. Passez à l'étape 35, à la page 56. 32. Restaurez les paramètres du processeur de service. Si la connexion réseau utilise DHCP pour établir une adresse IP, mais n'utilise pas de en tant que serveur DHCP, procédez comme suit : a. Effectuez toutes les configurations de réseau requises pour permettre au serveur DHCP d'identifier une adresse IP et de l'affecter au processeur de service. b. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant que le câble réseau est connecté au port réseau. c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises. Remarque : Ne démarrez pas le système à ce stade. Pouvez-vous accéder au réseau de maintenance ? Oui Non Accéder à 29, à la page 54 Passez à l'étape suivante. 33. Restaurez les paramètres du processeur de service. Vous avez indiqué que vous n'êtes toujours pas en mesure d'accéder au réseau de service. Vous devrez réinitialiser le processeur de service en appuyant sur le commutateur (E) du panneau de commande. Retrait et remise en place des adaptateurs et batteries SAS RAID 55 34. Passez à l'étape 25, à la page 53. 35. Restaurez les paramètres du processeur de service. Modifiez le mot de passe de l'ID utilisateur d'administrateur lorsque vous y êtes invité en procédant de la manière suivante. a. Dans la zone de navigation de l'interface ASMI, développez Profil de connexion. b. Sélectionnez Modifier le mot de passe sous Profil de connexion. c. Indiquez les informations requises, puis cliquez sur Continuer. Par mesure de sécurité, vous êtes invité à entrer votre mot de passe d'utilisateur en cours dans le champ Mot de passe en cours pour l'ID utilisateur en cours. Remarque : Les mots de passe peuvent être une combinaison quelconque de 64 caractères alphanumériques au maximum. Le mot de passe par défaut pour l'ID utilisateur d'administrateur est admin. Après votre première connexion à l'interface ASMI, le mot de passe admin doit être modifié. Figure 41. Commutateur du panneau de commande 56 Activation SAS (Serial-attached SCSI) RAID Par mesure de sécurité, vous êtes invité à entrer le mot de passe de votre ID utilisateur en cours dans le champ Mot de passe en cours de l'utilisateur en cours. 36. Restaurez les paramètres du processeur de service. Définissez le nom du système. Le nom du système peut être modifié pour toute chaîne ASCII autorisée. Il ne doit pas nécessairement correspondre au format numéro de série_modèle_type de machine. Pour modifier le nom du système, procédez comme suit. a. Dans la zone de navigation de l'interface, développez Configuration système. b. Sélectionnez Nom du système sous Configuration système. c. Entrez le nom souhaité. d. Cliquez sur Sauvegarder les paramètres pour mettre à jour le nom du système avec la nouvelle valeur. Le nouveau nom du système s'affiche dans le cadre d'état, zone dans laquelle le bouton Déconnexion se trouve. Si une autre méthode, telle que la console de gestion est utilisée pour modifier le nom du système, le cadre d'état ne reflète pas la modification. 37. Restaurez les paramètres du processeur de service. Pour définir l'heure, procédez comme suit. a. Dans la zone de navigation de l'interface, développez Configuration système. b. Sélectionnez Heure locale sous Configuration système. c. Si le système est hors tension, le volet droit affiche un formulaire qui indique la date (mois, jour et année) et l'heure (heures, minutes, secondes). d. Modifiez la date ou l'heure (ou les deux) et cliquez sur Sauvegarder les paramètres. 38. Restaurez les paramètres du processeur de service. Vérifiez que le niveau du microprogramme de serveur est identique à la valeur enregistrée avant le remplacement du fond de panier système. Si la valeur est différente, elle doit être corrigée. Pour plus d'informations sur la correction du microprogramme, voir Obtention des correctifs de microprogramme. 39. Restaurez les paramètres de l'unité carte Ethernet hôte (HEA) serveur. v Pour la console HMC : a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation. b. Cliquez deux fois sur le dossier Serveurs. c. Sélectionnez un serveur dans le panneau central. d. Dans le menu Tâches, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La fenêtre Cartes Ethernet hôte apparaît. e. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du port physique de la carte HEA apparaît. f. Entrez la valeur dans la zone Pending Port Group Multi-Core Scaling value et cliquez sur OK. v Pour la console SDMC : a. Allez dans l'onglet Resources et cliquez sur Hosts. b. Sélectionnez un serveur dans le panneau de droite. c. Dans le menu Actions, cliquez sur Informations sur le matériel > Cartes > Ethernet hôte. La fenêtre Cartes Ethernet hôte apparaît. d. Sélectionnez le port physique requis et cliquez sur Configurer. La fenêtre de configuration du port physique de la carte HEA apparaît. e. Entrez la valeur dans la zone Pending Port Group Multi-Core Scaling value et cliquez sur OK. 40. Restaurez les paramètres du processeur de service. Retrait et remise en place des adaptateurs et batteries SAS RAID 57 Entrez à nouveau les paramètres suivants qui ont été précédemment modifiés via l'interface ASMI (Advanced System Management Interface), sauf si vous souhaitez utiliser les paramètres par défaut. v Paramètre de nom de système v Paramètres de contrôle de l'alimentation système v Paramètres de l'aide à la maintenance système v Paramètres de configuration système v Paramètres des services réseau v Paramètres des options de performances v Paramètres de profil de connexion v Paramètres de liste des unités d'amorçage v Paramètres de carte Ethernet hôte 41. Restaurez les paramètres du processeur de service. a. Redéfinissez les paramètres du processeur de service que vous aviez éventuellement définis via les commandes du système d'exploitation. Vous avez enregistré ces paramètres au début de cette procédure. b. Si vous choisissez de réinitialiser le mot de passe d'accès aux Consoles de gestion, procédez comme suit : Pour modifier le mot de passe d'accès à la console HMC, choisissez l'une des méthodes suivantes ; 1) A partir de l'interface graphique de la console HMC (méthode recommandée) : a) Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation. b) Cliquez deux fois sur le dossier Serveurs. c) Sélectionnez un serveur dans le panneau central. d) Sous le dossier Opérations, sélectionnez Modifier le mot de passe. e) Indiquez les informations souhaitées et cliquez sur OK. 2) Sur la ligne de commande de la console HMC, saisissez : /usr/hmcrbin/chsyspwd -m système_géré -t access --passwd --newpasswd nouveau_mot_de_passe où : v La valeur de système_géré correspond au nom du système géré du nouveau processeur de service. v Aucune valeur n'est indiquée pour passwd-- de façon à permettre l'authentification. v La valeur de newpasswd est celle de nouveau_mot_de_passe. Pour modifier le mot de passe d'accès à la console SDMC, procédez comme suit : 1) Allez dans l'onglet Resources et cliquez sur Hosts. 2) Sélectionnez un serveur dans le panneau de droite. 3) Dans le menu Actions, cliquez sur Operations > Change Password. 4) Indiquez les informations souhaitées et cliquez sur OK. 42. Récupérez les données de partition sur le processeur de service. Les données de profil stockées sur le serveur géré ont été supprimées ou endommagées. Pour restaurer les données de profil à l'aide de la console HMC, procédez comme suit. a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation. b. Cliquez deux fois sur le dossier Serveurs. c. Dans le panneau central, sélectionnez un serveur en cochant la case correspondante. d. Dans le dossier Configuration, si vous avez accès au dossier Gestion des données de partition, sélectionnez Restaurer. Pour restaurer les données de profil à l'aide de la console SDMC, procédez comme suit : 58 Activation SAS (Serial-attached SCSI) RAID a. Allez dans l'onglet Resources et cliquez sur Hosts. b. Sélectionnez un serveur dans le panneau de droite. c. Dans le menu Actions, cliquez sur System Configuration > Manage Partition Data > Restore. d. Sélectionnez l'option de restauration souhaitée et cliquez sur OK. 43. Mettez à jour les données de produit essentielles. Si votre système exécute et n'est pas géré par les Consoles de gestion, vous devez mettre à jour la source de chargement après avoir remplacé le fond de panier système. Pour mettre à jour la source de chargement sans l'aide des Consoles de gestion, procédez comme suit : a. Chargez le support optique du microcode sous licence I_BASE_01 dans l'unité qui est définie pour le système et accédez-y pour effectuer un IPL (procédure de chargement initial). Remarque : Cet IPL peut prendre plusieurs minutes. b. Une fois l'écran de sélection de groupe de langues affiché, appuyez sur la touche Entrée pour sélectionner la langue par défaut 2924 (anglais). Remarque : Pour modifier la langue, tapez celle qui apparaît sur le support du système d'exploitation, puis appuyez sur la touche Entrée. Confirmez la langue en appuyant sur la touche Entrée. c. A partir de l'écran d'installation du microcode sous licence, sélectionnez l'option 2 (utilisation des outils de maintenance en mode dédié) et appuyez sur la touche Entrée. Utilisez les outils de maintenance en mode dédié pour vous assurer que les unités de disque, y compris la source de chargement, génèrent correctement des rapports et qu'il n'y a pas d'erreurs susceptibles d'empêcher l'exécution de l'IPL à partir du disque. Lorsque toutes les erreurs ont été résolues et que le disque source de chargement génère correctement des rapports, passez à l'étape suivante. d. Dans l'écran d'utilisation des outils de maintenance en mode dédié (DST), sélectionnez l'option 2 (utilisation des unités de disque) et appuyez sur la touche Entrée. e. Sélectionnez l'option 8 (mise à jour des données produit essentielles du système) pour identifier l'emplacement de l'unité de disque source de chargement. Appuyez sur Entrée. L'écran de mise à jour des données produit essentielles du système s'affiche. f. Appuyez sur la touche Entrée pour confirmer que vous souhaitez que les données produit essentielles soient enregistrées. g. Appuyez deux fois sur F12 (Précédent). h. Tapez 1 et appuyez sur la touche Entrée pour quitter l'écran des outils de maintenance en mode dédié. Utilisez le panneau de configuration système pour retarder l'arrêt du système. Passez le mode IPL sur B et démarrez le système pour effectuer un IPL à partir du disque. 44. Restaurez les paramètres de la liste des unités d'amorçage et de la carte Ethernet hôte. La liste des unités d'amorçage n'est pas présente sur un nouveau fond de panier système. Par conséquent, il est nécessaire de créer les unités d'amorçage en démarrant les menus des services SMS. Voir Sélection des options d'amorçage pour obtenir des informations sur la configuration de la liste des unités d'amorçage. Une fois le système d'exploitation démarré, vous pouvez utiliser la Commande bootlist pour vérifier la liste des unités d'amorçage d'un système ou d'une partition, ou y apporter des modifications supplémentaires. Pour restaurer les paramètres de la carte Ethernet hôte, voir Configuration de ports physiques sur une carte Ethernet hôte. 45. Vérifiez la réparation. Pour plus d'informations, voir Vérification d'une réparation. Retrait et remise en place des adaptateurs et batteries SAS RAID 59 Considérations sur la maintenance du sous-système SAS 02E/4B ou 05F/6B Il existe plusieurs éléments à prendre en compte pour la maintenance du sous-système SAS du système 02E/4B ou 05F/6B. Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS si le voyant des données en cache sur la carte de P1-C19 ou P1-C14 clignote. Pour voir ce voyant, vous devez arrêter le système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche sur le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la perte de données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des données en cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant du sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batterie de cache lorsque Yes est affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour les systèmes Linux et IBM® i) ou l'écran de statut de la commande (pour les systèmes AIX). Avant d'effectuer la maintenance, vérifiez les configurations d'ES de chaque partition logique sur le système. Il est important de comprendre les différences entre les diverses configurations et les considérations qui en découlent en matière de maintenance. En outre, vous devez tenir compte des incidences possibles pour les domaines suivants. v Partitions logiques et leurs affectations d'adaptateur d'E-S v Configuration des grappes de disques RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Considérations de maintenance pour le bloc de batterie de cache Pour éviter la perte de données, suivez les procédures appropriées avant de remplacer le bloc de batterie de cache sur la carte Cache RAID. Remarque : La maintenance simultanée de la batterie de l'adaptateur n'est pas prise en charge. Même en cas de batteries défaillantes ou manquantes, la carte associée à la batterie doit être retirée en premier. Pour plus d'informations, voir Retrait et remplacement des adaptateurs RAID SAS et des batteries. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS et l'accessibilité des données du système. Avertissement : Ne tentez pas de supprimer les composants associés au sous-système SAS si le voyant des données en cache sur la carte de C19-C19 ou P1-C14 clignote. Pour voir ce voyant, vous devez arrêter le système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche sur le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la perte de données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des données en cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant du sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batterie de cache lorsque Yes est affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour les systèmes Linux et IBM i) ou l'écran de statut de la commande (pour les systèmes AIX). La figure suivante indique l'emplacement du voyant de présence de données en cache dans les sous-systèmes SAS 02E/4B ou 05F/6B. 60 Activation SAS (Serial-attached SCSI) RAID Le tableau suivant décrit les états du voyant de présence des données en cache et les éléments à prendre en considération avant de tenter une opération de maintenance. Tableau 1. Indications et actions de voyants Carte de batterie de cache (Un-P1-C14) Contrôleur de stockage RAID/cache (Un-P1-C19) Indication Action désactivée désactivée La mémoire cache n'est pas active. Les actions de maintenance peuvent s'effectuer. désactivée clignote La mémoire cache est active sur C19. La mémoire cache peut avoir été vidée sur C14 en cas de défaillance de C19. Essayez de mettre le système sous tension et de l'éteindre correctement avant d'effectuer toute action de maintenance. clignote désactivée La mémoire cache est active sur C14. La mémoire cache peut avoir été vidée sur C19 en cas de défaillance de C14. Essayez de mettre le système sous tension et de l'éteindre correctement avant d'effectuer toute action de maintenance. clignote clignote La mémoire cache est active sur les contrôleurs doubles. Essayez de mettre le système sous tension et de l'éteindre correctement avant d'effectuer toute action de maintenance. Si cela ne parvient pas à effacer les voyants de cache actifs, contactez le support. Voir Contacter le support et la maintenance IBM. Figure 42. Voyant des données en cache Retrait et remise en place des adaptateurs et batteries SAS RAID 61 Retrait de la carte d'activation RAID 02E/4B ou 05F/6B Utilisez cette procédure pour retirer une carte d'activation RAID (Redundant Array of Independent Disks) avant d'installer une carte d'activation nouvelle ou mise à niveau. Avant de commencer Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour retirer la carte du serveur. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 110. Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour retirer la carte du serveur. Pour des instructions, voir «Retrait d'un composant via la console SDMC», à la page 125. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer la carte d'activation : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91). 3. Vérifiez que le système est hors tension. Sur le panneau de commande, vérifiez que le voyant d'alimentation clignote. 4. Retirez le capot avant. v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 95. v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 95. 5. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans «Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 104. 6. Retirez le capot d'accès. v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98. v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 99. 62 Activation SAS (Serial-attached SCSI) RAID 7. Vérifiez l'état des voyants verts du cache d'écriture des cartes placées dans P1-C19 et P1-C14 de la Figure 1. . v Si les deux voyants verts du cache clignotent, remettez en place le capot système et suivez la procédure de «Démarrage du système ou de la partition logique», à la page 89 avant de revenir à l'étape actuelle. v Si les deux voyants sont éteints, passez à l'étape suivante. v S'il s'avère qu'un seul voyant du cache clignote, contactez votre fournisseur de services pour obtenir de l'aide. 8. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 9. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 10. Identifiez la carte d'activation RAID à retirer. Retrait et remise en place des adaptateurs et batteries SAS RAID 63 Remarque : Si une unité de bande en option est installée, déplacez-la pour pouvoir accéder à la carte d'activation RAID. 11. Appuyez sur le loquet et faites glisser la carte en dehors du système, comme illustré. La carte d'activation RAID est fixée à une plaque en plastique. Vous devez faire glisser cette plaque en dehors du système pour retirer la carte d'activation RAID. Figure 43. Emplacement d'installation de la carte d'activation RAID (P1-C13) 64 Activation SAS (Serial-attached SCSI) RAID Pour remplacer la carte d'activation RAID, voir «Remplacement de la carte d'activation RAID 02E/4B ou 05F/6B». 12. Remettez en place le capot d'accès de maintenance. v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 13. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 14. Remettez en place le capot avant. v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. 15. Rebranchez la source d'alimentation. 16. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89). 17. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Remplacement de la carte d'activation RAID 02E/4B ou 05F/6B Utilisez cette procédure pour remplacer une carte d'activation RAID (Redundant Array of Independent Disks. Figure 44. Retrait d'une carte d'activation RAID d'une unité centrale montée en armoire Retrait et remise en place des adaptateurs et batteries SAS RAID 65 Pourquoi et quand exécuter cette tâche Vous devez avoir exécuté la procédure «Retrait de la carte d'activation RAID 02E/4B ou 05F/6B», à la page 62. Pour remplacer une carte d'activation RAID, procédez comme suit : Procédure 1. Identifiez l'emplacement de la carte d'activation RAID. Remarque : Si une unité de bande en option est installée, déplacez-la pour pouvoir accéder à la carte d'activation RAID. 2. Alignez la carte d'activation RAID sur l'emplacement dans le système. 3. Faites glisser complètement la carte d'activation RAID dans le système jusqu'au verrouillage du loquet (A). 4. Remettez le capot d'accès en place : v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 5. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 6. Remettez en place le capot avant : v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. Figure 45. Remplacement d'une carte d'activation RAID sur une unité centrale montée en armoire 66 Activation SAS (Serial-attached SCSI) RAID v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. 7. Rebranchez la source d'alimentation. 8. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89). 9. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C Utilisez cette procédure pour retirer un contrôleur de stockage RAID (Redundant Array of Independent Disks) avant d'installer un contrôleur de stockage nouveau ou mis à niveau. Vous pouvez également utiliser cette procédure pour retirer un contrôleur de stockage RAID-10. Avant de commencer Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de commande», à la page 84. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour retirer les contrôleurs de stockage RAID et cache du serveur. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 110. Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour retirer les contrôleurs de stockage RAID et cache du serveur. Pour des instructions, voir «Retrait d'un composant via la console SDMC», à la page 125. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le contrôleur de stockage RAID/cache : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 81. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 91). 3. Retirez le capot avant. Retrait et remise en place des adaptateurs et batteries SAS RAID 67 v Pour un système monté en armoire, voir «Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 95. v Pour un système autonome, voir «Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 95. 4. Pour un système monté en armoire, mettez-le en position de maintenance, comme indiqué dans «Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 104. 5. Retirez le capot d'accès. v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98. v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 99. 6. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 84. 7. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 8. Attachez le bracelet antistatique. 68 Activation SAS (Serial-attached SCSI) RAID Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 9. Identifiez les contrôleurs de stockage RAID et cache à retirer. 10. Tirez sur le loquet (A) et retirez la carte (B) du système en la faisant glisser, comme l'illustre la figure suivante. Pour remplacer les contrôleurs de stockage RAID et cache, voir «Remplacement du contrôleur de stockage RAID et cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 70. 11. Remettez le capot d'accès en place : v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 13. Remettez en place le capot avant : v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. Figure 46. Retrait de contrôleurs de stockage RAID et cache d'une unité centrale montée en armoire Retrait et remise en place des adaptateurs et batteries SAS RAID 69 v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. 14. Rebranchez la source d'alimentation. 15. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89). 16. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Remplacement du contrôleur de stockage RAID et cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C Utilisez cette procédure pour remplacer un contrôleur de stockage RAID (Redundant Array of Independent Disks). Vous pouvez également utiliser cette procédure pour remplacer un contrôleur de stockage RAID-10. Pourquoi et quand exécuter cette tâche Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de commande», à la page 84. Vous devez avoir exécuté la procédure «Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 67. Pour remplacer des contrôleurs de stockage RAID et cache, procédez comme suit : Procédure 1. Identifiez l'emplacement des contrôleurs de stockage RAID et cache. 2. Alignez les contrôleurs de stockage RAID et cache sur l'emplacement dans le système. 3. Faites glisser les contrôleurs de stockage RAID et cache (A) entièrement dans le système jusqu'au verrouillage du loquet (B). 70 Activation SAS (Serial-attached SCSI) RAID 4. Remettez le capot d'accès en place : v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 5. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 6. Remettez en place le capot avant. v Pour un système monté en armoire, voir «Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 96. v Pour un système autonome, voir «Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome», à la page 97. 7. Rebranchez la source d'alimentation. 8. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 89). 9. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Retrait et remise en place de la carte de batterie de cache Pour effectuer la maintenance de la carte de batterie de cache, procédez comme suit : Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Figure 47. Remplacement des contrôleurs de stockage RAID et cache sur une unité centrale montée en armoire Retrait et remise en place des adaptateurs et batteries SAS RAID 71 Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Procédez comme suit pour retirer et installer la carte de batterie de cache : 1. Commencez avec le système sous tension. Pour éviter la perte de données, la carte de batterie de cache doit être à l'état d'erreur avant d'être retirée du système. Si votre système est hors tension, rallumez-le avant de continuer. 2. Choisissez votre système d'exploitation : v AIX : Passez à l'étape 3. v Linux : Passez à l'étape 5. v IBM i : Passez à l'étape 7. 3. Sous AIX, mettez la carte de batterie de cache à l'état d'erreur (afin d'éviter d'éventuelles pertes de données) en procédant comme suit : a. Accédez au gestionnaire de grappes de disques SAS IBM, comme indiqué ci-dessous : 1) A l'invite de commande, tapez smit, puis appuyez sur Entrée. 2) Sélectionnez Devices. 3) Sélectionnez Disk Array. 4) Sélectionnez IBM SAS Disk Array. 5) Sélectionnez IBM SAS Disk Array Manager dans le menu comprenant les options de configuration et de gestion du contrôleur IBM SAS RAID. b. Sélectionnez Diagnostics and Recovery Options. c. Sélectionnez l'option Controller Rechargeable Battery Maintenance. d. Sélectionnez l'option Force Controller Rechargeable Battery Error. e. Sélectionnez le contrôleur SAS RAID IBM dont vous souhaitez remplacer la batterie. Cette option met la batterie à l'état d'erreur, ce qui nécessite son remplacement. f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir la procédure Affichage des informations relatives aux batteries rechargeables. Vous pouvez remplacer la carte de batterie de cache sans danger si la mention Yes apparaît en regard de l'option Battery pack can be safely replaced. Vous devrez peut-être sélectionner à plusieurs reprises l'option permettant d'afficher les informations relatives à la batterie rechargeable du contrôleur car plusieurs minutes peuvent s'écouler avant que le remplacement de la carte de batterie de cache puisse s'effectuer sans danger. g. Vérifiez que le voyant Cache Present ne clignote pas. Affichage des informations relatives aux batteries rechargeables a. Accédez au gestionnaire de grappes de disques SAS IBM en suivant la procédure précédente. b. Sélectionnez Diagnostics and Recovery Options. c. Sélectionnez l'option Controller Rechargeable Battery Maintenance. d. Sélectionnez l'option Displaying Rechargeable Battery Information. e. Sélectionnez le contrôleur IBM SAS RAID. 4. Passez à l'étape 8. 72 Activation SAS (Serial-attached SCSI) RAID 5. Placez la carte de batterie de cache à l'état d'erreur sur le système d'exploitation Linux (afin d'éviter d'éventuelles pertes de données) en procédant comme suit : a. Exécutez l'utilitaire iprconfig en saisissant iprconfig. b. Sélectionnez Work with disk unit recovery. c. Sélectionnez Work with resources containing cache battery packs. d. Sélectionnez votre carte et tapez 2. Appuyez ensuite sur Entrée pour mettre la batterie à l'état d'erreur. Remarque : Cette option permet de mettre la batterie à l'état d'erreur, ce qui nécessite son remplacement. e. Si vous êtes sûr de devoir mettre la batterie à l'état d'erreur, tapez c pour confirmer. Si vous ne souhaitez pas mettre la batterie à l'état d'erreur, tapez q pour annuler. f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir Affichage des informations relatives aux batteries rechargeables. Vous pouvez remplacer la carte de batterie de cache sans danger si la mention Yes apparaît en regard de l'option Battery pack can be safely replaced. Vous devrez peut-être sélectionner à plusieurs reprises l'option permettant d'afficher les informations relatives à la batterie rechargeable du contrôleur car plusieurs minutes peuvent s'écouler avant que le remplacement de la carte de batterie de cache puisse s'effectuer sans danger. 6. Passez à l'étape 8. 7. Mettez la carte de batteries de cache à l'état d'erreur sous (afin d'éviter d'éventuelles pertes de données) en appliquant la procédure suivante au système ou à la partition logique qui comporte l'adaptateur : a. Vous devez avoir ouvert une session avec au minimum les droits d'accès aux outils de maintenance. b. Tapez strsst sur la ligne de commande et appuyez sur Entrée. c. Sur l'écran System Service Tools (SST) Sign On, tapez votre ID utilisateur et votre mot de passe d'accès aux outils de maintenance. Appuyez sur Entrée. d. Sur l'écran System Service Tools (SST), sélectionnez Start a Service Tool et appuyez sur Entrée. e. Dans l'écran Start a Service Tool, sélectionnez Hardware Service Manager et appuyez sur la touche Entrée. f. Sélectionnez Work with resources containing cache battery packs dans l'écran Hardware Service Manager. g. Sélectionnez Force battery pack into error state pour l'adaptateur d'entrée-sortie que vous utilisez dans l'écran Work with Resources containing cache battery packs, puis appuyez sur Entrée. h. Dans l'écran Force Battery Packs Into Error State, vérifiez que l'adaptateur d'entrée-sortie approprié est sélectionné et appuyez sur la touche de fonction correspondant à votre choix pour confirmer. i. Revenez à l'écran Work with Resources containing cache battery packs et sélectionnez Display battery information. Assurez-vous que la zone Safe to replace cache battery a la valeur Yes. Cette opération peut prendre plusieurs minutes et vous devrez peut-être appuyer sur le bouton d'actualisation pour que la zone soit mise à jour. 8. Placez le système monté en armoire en position de fonctionnement. Voir «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 9. Retirez le capot d'accès : v Pour un système monté en armoire, voir «Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 98. v Pour un système autonome, voir «Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 99. Retrait et remise en place des adaptateurs et batteries SAS RAID 73 10. Vérifiez le voyant de présence de données en cache (A) en examinant le voyant sur la carte de batterie de cache. v Si le voyant clignote, passez à l'étape suivante. v Si le voyant ne clignote pas, passez à l'étape 12. 11. Ne retirez pas la carte de batterie de cache si le voyant clignote. Lorsque le voyant clignote, cela indique qu'il existe des données en cache non sauvegardées sur la batterie. Ces données seront perdues si la batterie est retirée à ce stade. Effectuez cette procédure de maintenance sans remplacer la batterie, puis effectuez-la de nouveau pour vous assurer que toutes les données en cache sont enregistrées sur le disque avant de remplacer la batterie. Passez à l'étape 18. 12. Retirez la carte de batterie de cache en procédant comme suit : Avertissement : Ne retirez pas la carte de batterie de cache si le voyant de présence de données en cache clignote, car il indique que des données existent dans le cache. a. Dégagez la carte de batterie de cache (B) du connecteur (A). b. Faites glisser la carte de batterie de cache sur les glissières de montage afin de l'extraire du système. c. Une fois la carte extraite du système, retirez le bloc de batterie de la carte de batterie : 1) Déconnectez le bloc de batterie de la carte de batterie. 2) Saisissez les pattes de fixation et serrez-les pour pouvoir extraire le bloc de batterie des pattes. Figure 48. Voyant de présence de données en cache 74 Activation SAS (Serial-attached SCSI) RAID Si vous remplacez le bloc de batterie de cache, passez à la section «Retrait et installation du bloc de batterie de cache», à la page 77, puis revenez à cette section et poursuivez avec l'étape suivante. 13. Remplacez la carte de batteries de cache en procédant comme suit : Avertissement : Laissez la carte de batterie de cache déconnectée pendant au moins 60 secondes avant de connecter la nouvelle batterie. Il s'agit du temps minimum nécessaire à l'adaptateur pour détecter que la batterie a été remplacée. a. Faites glisser la carte de batterie de cache dans les glissières de montage du contrôleur jusqu'à ce qu'elle s'insère dans le connecteur de batterie. Figure 49. Retrait de la carte de batterie de cache Retrait et remise en place des adaptateurs et batteries SAS RAID 75 14. Mettez le système sous tension et redémarrez le cache d'écriture de l'adaptateur en procédant comme suit : Choisissez votre système d'exploitation : v AIX : Passez à l'étape 15. v Linux : Passez à l'étape 16. v IBM i : Passez à l'étape 17. 15. Procédez comme suit : a. Accédez au gestionnaire de grappes de disques IBM SAS en suivant la procédure de la section 3a. b. Sélectionnez Diagnostics and Recovery Options. c. Sélectionnez Controller Rechargeable Battery Maintenance. d. Sélectionnez Start Adapter Cache. e. Sélectionnez le contrôleur dont vous venez de remplacer la batterie et appuyez sur Entrée. f. Passez à l'étape 18. 16. Procédez comme suit : a. Exécutez l'utilitaire iprconfig en saisissant iprconfig. b. Sélectionnez Work with disk unit recovery. c. Sélectionnez Work with resources containing cache battery packs. d. Démarrez le cache de l'adaptateur d'E-S sur l'adaptateur correspondant à la batterie que vous venez de remplacer en saisissant 3, puis en appuyant sur Entrée. e. Passez à l'étape 18. 17. Procédez comme suit : a. Revenez à l'écran Work with Resources containing Cache Battery Packs en utilisant les informations obtenues à l'étape 7a à 7f et sélectionnez Start IOA cache. Appuyez sur Entrée. b. Vérifiez que vous obtenez bien le message Cache was started. Figure 50. Installation de la carte de batterie de cache 76 Activation SAS (Serial-attached SCSI) RAID c. Passez à l'étape 18. 18. Installez le capot d'accès : v Pour un système monté en armoire, voir «Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 101. v Pour un système autonome, voir «Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B», à la page 102. 19. Mettez-le en position de fonctionnement. Voir «Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire», à la page 106. 20. Fermez le volet arrière de l'unité dont vous effectuez la maintenance. 21. Vérifiez que la nouvelle ressource fonctionne correctement, comme indiqué dans la rubrique «Vérification du composant installé», à la page 111. Retrait et installation du bloc de batterie de cache Cette procédure permet de remplacer le bloc de batterie de cache sur une carte de batterie. Avant de commencer Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de commande», à la page 84. Avertissement : Avant de poursuivre ces procédures, déterminez si vous pouvez remplacer le bloc de batterie de cache sans risque. Voir Affichage des informations sur la batterie rechargeable. Vous pouvez remplacer le bloc de batterie de cache en toute sécurité si la mention Oui apparaît en regard de l'option indiquant que le bloc de batterie peut être remplacé en toute sécurité. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Pourquoi et quand exécuter cette tâche Procédez comme suit pour remplacer un bloc de batterie n'acceptant pas la maintenance simultanée : Procédure 1. Retirez le contrôleur du système. Pour plus d'informations, voir «Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 67. 2. Placez le contrôleur sur une surface protégée contre les décharges électrostatiques. 3. Débranchez le connecteur de batterie (A) de son connecteur sur la carte, en tirant la languette de verrouillage tout en tirant doucement sur la fiche. La fiche est connectée à la carte mère d'une seule façon ; elle ne peut donc être insérée de manière incorrecte lors de la procédure de remplacement. Retrait et remise en place des adaptateurs et batteries SAS RAID 77 Remarque : Laissez le bloc de batterie de cache déconnecté pendant au moins 60 secondes avant de connecter la nouvelle batterie. Il s'agit du temps minimum nécessaire à la carte pour détecter que la batterie a été remplacée. Connecteur de la batterie (A) Bloc de batterie de cache (B) 4. La batterie est montée sur une plaque en plastique noir avec deux taquets, maintenue par des vis de butée en plastique blanc à l'aide de goupilles de retenue. Pour déposer le bloc de batterie, retirez la plaque de montage des vis de butée, une vis à la fois. 5. Appuyez sur la goupille de retenue de la vis de butée et tirez sur le taquet de montage jusqu'à ce que la vis de butée apparaisse dans le trou du taquet de montage. Répétez cette opération pour la deuxième vis de butée. Remarque : Si la goupille de retenue ne s'enfonce pas correctement, faites pivoter la vis de butée blanche afin que sa goupille de retenue soit opposée au bloc de batterie. 6. Installez la batterie sur les supports de façon à ce que les loquets s'enclenchent. Figure 51. Retrait de la batterie de cache 78 Activation SAS (Serial-attached SCSI) RAID 7. Réintroduisez les broches dans les rivets à l'arrière de la carte. 8. Connectez le connecteur du bloc de batterie de cache (A) à la carte. La fiche est connectée à la carte mère d'une seule façon ; elle ne peut donc être insérée de manière incorrecte. 9. Réinstallez l'adaptateur et redémarrez le cache. Pour plus d'informations, voir les procédures suivantes : v «Retrait et remise en place de la carte de batterie de cache», à la page 71 v «Retrait du contrôleur de stockage RAID/cache 02E/4B, 02E/4C, 05F/6B ou 05E/6C», à la page 67. Figure 52. Remplacement de la batterie de cache Retrait et remise en place des adaptateurs et batteries SAS RAID 79 80 Activation SAS (Serial-attached SCSI) RAID Procédures communes aux dispositifs installables La présente section contient l'ensemble des procédures communes concernant l'installation, le retrait et le remplacement de dispositifs. Avant de commencer Respectez ces précautions lorsque vous installez, retirez ou remplacez des dispositifs ou des composants. Pourquoi et quand exécuter cette tâche Ces précautions visent à créer un environnement sûr pour la maintenance de votre système, mais ne proposent pas de procédure de maintenance du système. Les procédures d'installation, de retrait et de remplacement fournissent les processus étape par étape pour la maintenance de votre système. 81 DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les cordons d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les cordons d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) 82 Activation SAS (Serial-attached SCSI) RAID DANGER Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Avant de commencer toute procédure de remplacement ou d'installation, procédez comme suit : Procédure 1. Si vous installez un nouveau dispositif, vous devez disposer des logiciels requis pour la prise en charge de ce nouveau dispositif. Procédures communes aux dispositifs installables 83 2. Si vous exécutez une procédure d'installation ou de remplacement qui risque d'affecter la sécurité des données, vous devez disposer, dans la mesure du possible, d'une copie de sauvegarde récente de votre système ou de la partition logique (systèmes d'exploitation, logiciels sous licence et données). 3. Passez en revue la procédure d'installation ou de remplacement relative au dispositif ou au composant. 4. Notez la correspondance des couleurs utilisées sur le système. La couleur bleue ou ocre sur les composants matériels indique un point de contact que vous pouvez utiliser pour retirer ou installer le composant sur le système, ouvrir ou fermer un levier, etc. La couleur ocre peut également désigner un composant que vous pouvez retirer ou remplacer alors que le système ou la partition logique est sous tension. 5. Vous devez vous munir d'un tournevis à lame plate moyen, d'un tournevis cruciforme et d'une paire de ciseaux. 6. Si certains composants sont incorrects, manquants ou visiblement endommagés, procédez comme suit : v Si vous remplacez un composant, contactez votre fournisseur de services ou le support technique. v Si vous installez une fonctionnalité, contactez l'un des services suivants : – Le fournisseur de services ou le support technique 7. Si vous rencontrez des difficultés lors de l'installation, contactez votre fournisseur de services, ou le support technique. 8. Si vous installez du nouveau matériel dans une partition logique, vous devez planifier et comprendre les implications inhérentes au partitionnement du système. Pour plus d'informations, consultez la rubrique Partitionnement logique. Identification d'un composant défaillant Ces instructions permettent d'apprendre à localiser et identifier un composant défaillant ou à retirer, ou encore de localiser l'emplacement d'installation d'un nouveau composant sur votre système ou unité d'extension en utilisant une méthode adaptée à votre système. Pourquoi et quand exécuter cette tâche Pour les serveurs systèmes dotés du processeur POWER7, les voyants peuvent être utilisés pour identifier ou vérifier un composant à retirer, installer ou en cours de maintenance. La combinaison de la fonction d'identification et du voyant d'incident (couleur ambre) montre l'emplacement d'une unité remplaçable sur site (unité FRU). Lorsque vous retirez une unité FRU, vérifiez d'abord que vous travaillez sur l'unité FRU correcte à l'aide de la fonction d'identification dans la console de gestion ou dans une autre interface utilisateur. Lorsque vous retirez une unité FRU dans la console de gestion du matériel, la fonction d'identification est activée et désactivée automatiquement aux moments opportuns. La fonction d'identification produit le clignotement du voyant jaune. Lorsque vous désactivez la fonction d'identification, le voyant revient à son état précédent. Pour les composants dotés d'un bouton de maintenance bleu, la fonction d'identification définit les informations du voyant pour le bouton de maintenance de sorte que lorsque le bouton est activé, les voyants corrects du composant clignotent. Si vous avez besoin d'utiliser la fonction d'identification, utilisez les procédures suivantes. Voyants du panneau de commande Ces informations peuvent s'utiliser comme un guide sur les voyants et boutons du panneau de commande. Le panneau de commande est doté de voyants qui indiquent les différents états du système. 84 Activation SAS (Serial-attached SCSI) RAID v A : Bouton de mise sous tension v B : Voyant d'alimentation – Un voyant fixe indique que l'unité est sous alimentation système complète. – Un voyant clignotant indique que l'unité est sous alimentation de mode veille. Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe. Durant cette période de transition, le clignotement peut s'accélérer. v C : Voyant d'identification du boîtier – Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce. – L'absence de lumière indique un fonctionnement normal du système. v D: Voyant d'information système – L'absence de lumière indique un fonctionnement normal du système. – Un voyant allumé signifie qu'une intervention est requise sur le système. v E : Port USB v F : Voyant de panne de boîtier – Un voyant fixe indique une panne dans l'unité centrale. – L'absence de lumière indique un fonctionnement normal du système. v G : Ecran Fonctions/Données v H : Bouton de décrémentation v I : Bouton Entrée v J : Bouton d'incrémentation v K : Bouton de réinitialisation Figure 53. Panneau de commande Procédures communes aux dispositifs installables 85 Concepts associés Identification d'un composant défaillant Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un système ou une unité d'extension en utilisant la méthode propre à votre système. Identification d'un composant défaillant sur un système AIX ou une partition logique Les instructions permettent de savoir comment rechercher un composant défaillant, puis d'activer le voyant de ce composant sur un système ou une partition logique exécutant le système d'exploitation AIX. Localisation d'un composant défaillant sur un système AIX ou une partition logique Il se peut que vous deviez utiliser les outils AIX, avant d'activer le voyant dans le cadre de la recherche d'un composant défaillant. Procédure 1. Connectez-vous en tant qu'utilisateur root ou celogin-. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée. 5. Dans les précédents diagnostics, choisissez Affichage du journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics qui contient une liste chronologique des événements s'affiche. 6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur Entrée. 7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal. 8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée. 9. Retournez à la ligne de commande. Que faire ensuite Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce dernier. Pour plus d'informations, voir «Activation du voyant associé au composant défaillant». Activation du voyant associé au composant défaillant Ces instructions permettent d'identifier physiquement l'emplacement d'un composant pour lequel vous intervenez. Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. 6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant. 7. Retournez à la ligne de commande. Désactivation du voyant d'un composant défaillant Cette procédure permet d'éteindre un voyant allumé dans le cadre d'une opération de maintenance. 86 Activation SAS (Serial-attached SCSI) RAID Pourquoi et quand exécuter cette tâche Pour le désactiver, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. Lorsque le voyant d'un composant défaillant est activé, le caractère I précède le code d'emplacement. 6. Sélectionnez Validation. Le programme désactive l'intervention du système ainsi que le voyant associé au composant défaillant. 7. Retournez à la ligne de commande. Identification d'un composant défaillant sur un système Linux ou une partition logique Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous pouvez activer ou désactiver les voyants pour localiser un composant ou effectuer une opération de maintenance. Localisation d'un composant défaillant sur un système Linux ou une partition logique Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous devez activer les voyants pour localiser un composant. Pourquoi et quand exécuter cette tâche Pour l'activer, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et appuyez sur Entrée. 3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant. Localisation d'un composant défaillant sur un système ou une partition logique Linux Pour retrouver le code d'emplacement d'un composant défaillant si vous avez oublié cette information, utilisez la procédure décrite dans cette rubrique. Pourquoi et quand exécuter cette tâche Pour localiser le composant défaillant sur un système ou une partition logique, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez grep diagela /var/log/platform et appuyez sur Entrée. 3. Recherchez l'entrée la plus récente contenant un code SRC (System Reference Code). 4. Notez les informations d'emplacement. Procédures communes aux dispositifs installables 87 Activation du voyant associé au composant défaillant Si vous connaissez le code d'emplacement du composant défaillant, activez le voyant pour vous aider à rechercher le composant à remplacer. Pourquoi et quand exécuter cette tâche Pour l'activer, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et appuyez sur Entrée. 3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant. Résultats Désactivation du voyant d'un composant défaillant Après avoir terminé la procédure de retrait et de remplacement, vous devez désactiver le voyant du composant défaillant. Pourquoi et quand exécuter cette tâche Pour le désactiver, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez /usr/sbin/usysident -s normal -l et appuyez sur Entrée. Localisation d'un composant défaillant sur un système ou une partition logique Virtual I/O Server Vous pouvez utiliser les outils Virtual I/O Server (VIOS) avant d'activer le voyant, pour rechercher un composant défaillant. Procédure 1. Connectez-vous en tant qu'utilisateur root ou celogin-. 2. A l'invite, tapez diagmenu et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée. 5. Dans l'écran Affichage des résultats de tests de diagnostic précédents, sélectionnez Affichage du journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics apparaît. Il contient la liste des événements triée dans l'ordre chronologique. 6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur Entrée. 7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal. 8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée. 9. Retournez à la ligne de commande. 88 Activation SAS (Serial-attached SCSI) RAID Résultats Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce dernier. Pour plus d'informations, voir «Identification d'un composant à l'aide du Virtual I/O Server». Identification d'un composant à l'aide du Virtual I/O Server Ces instructions permettent d'activer le voyant afin de repérer l'emplacement physique d'un composant à l'aide du Virtual I/O Server (VIOS). Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diagmenu et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche. Appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement. Appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. 6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant. 7. Retournez à la ligne de commande. Démarrage du système ou de la partition logique La présente section explique comment démarrer un système ou une partition logique après avoir effectué une action de maintenance ou une mise à niveau du système. Démarrage d'un système non géré par une console HMC ou une console SDMC Vous pouvez utiliser le bouton d'alimentation ou l'interface ASMI pour démarrer un système qui n'est pas géré par une console HMC (Hardware Management Console) ni par une console Systems Director Management Console (SDMC). Pourquoi et quand exécuter cette tâche Pour démarrer un système non géré par une console HMC ou SDMC, procédez comme suit : Procédure 1. Si nécessaire, ouvrez le volet avant de l'armoire. 2. Avant d'appuyer sur le bouton d'alimentation du panneau de commande, vérifiez que le système est bien relié à une source d'alimentation : v Tous les cordons d'alimentation système doivent être reliés à une source d'alimentation. v Le voyant d'alimentation clignote lentement, comme indiqué dans la figure suivante. v Comme indiqué dans la figure ci-après, la partie supérieure de l'écran doit afficher la mention 01 V=F. 3. Appuyez sur le bouton d'alimentation (A) du panneau de commande (voir figure suivante). Procédures communes aux dispositifs installables 89 v A : Bouton de mise sous tension v B : Voyant d'alimentation – Un voyant fixe indique que l'unité est sous alimentation système complète. – Un voyant clignotant indique que l'unité est sous alimentation de mode veille. Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe. Durant cette période de transition, le clignotement peut s'accélérer. v C : Voyant d'identification de boîtier – Une lumière fixe indique l'état d'identification du boîtier ou d'une ressource du boîtier. – Aucune lumière signifie qu'aucune ressource du boîtier n'est identifiée. v D : Voyant d'incident – L'absence de lumière indique un fonctionnement normal du système. – Un voyant fixe indique qu'une intervention est requise sur le système. v E : Port USB v F : Voyant de panne de boîtier – Un voyant fixe correspond à un voyant de panne actif sur le système. – L'absence de lumière indique un fonctionnement normal du système. v G : Ecran Fonctions/Données v H : Bouton de décrémentation v I : Bouton Entrée v J : Bouton d'incrémentation v K : Bouton de réinitialisation 4. Une fois que vous avez appuyé sur le bouton d'alimentation, notez les éléments suivants : v Le voyant d'alimentation commence à clignoter plus rapidement. v Les ventilateurs système sont activés après environ 30 secondes et commencent à tourner plus rapidement. Figure 54. Panneau de commande 90 Activation SAS (Serial-attached SCSI) RAID v L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du système. Le voyant de mise sous tension arrête de clignoter et reste fixe, indiquant que le système est sous tension. Que faire ensuite Conseil : Si le système ne démarre pas alors que vous avez appuyé sur le bouton d'alimentation, utilisez la procédure suivante pour démarrer le système à l'aide de l'interface ASMI (Advanced System Management Interface) : 1. Lancez l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Accès à l'interface ASMI. 2. Démarrez le système à l'aide de l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Mise sous tension et arrêt du système. Démarrage d'un système ou d'une partition logique via la console HMC Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour démarrer le système ou la partition logique une fois les câbles requis installés et connectés à une source d'alimentation. Pourquoi et quand exécuter cette tâche Pour obtenir des instructions sur l'utilisation de la console HMC, voir Gestion de la console HMC (Hardware Management Console). Pour plus d'informations sur le démarrage d'une partition logique, reportez-vous à la rubrique Partitionnement logique. Pour plus d'informations sur le démarrage du système, reportez-vous à la rubrique Mise sous tension du système géré. L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du système. Lorsque le voyant d'alimentation s'arrête de clignoter et reste fixe, cela signifie que le système est sous tension. Démarrage d'un système ou d'un serveur virtuel via la console SDMC Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console) pour démarrer le système ou le serveur virtuel une fois les câbles requis installés et les cordons d'alimentation connectés à une alimentation. Pourquoi et quand exécuter cette tâche Pour obtenir des instructions sur l'utilisation de la console SDMC, voir Gestion et configuration de la console SDMC. Pour des instructions sur le démarrage d'un serveur virtuel, voir Gestion de serveurs virtuels. Pour des instructions sur l'arrêt et le redémarrage de serveurs virtuels, voir Arrêt et redémarrage de serveurs virtuels. L'écran du panneau de commande affiche les jauges, ou points de contrôle, pendant le démarrage du système. Lorsque le voyant de mise sous tension du panneau de commande cesse de clignoter et reste fixe, le système est sous tension. Arrêt d'un système ou d'une partition logique La présente section explique comment arrêter un système ou une partition logique dans le cadre d'une mise à niveau du système ou d'une procédure de maintenance. Procédures communes aux dispositifs installables 91 Pourquoi et quand exécuter cette tâche Avertissement : Si vous utilisez le bouton de mise sous tension ou les commandes de la console HMC (Hardware Management Console) pour arrêter le système, les fichiers de données risquent d'être endommagés. Par ailleurs, le système risque de mettre plus de temps à démarrer la prochaine fois si toutes les applications n'ont pas été fermées avant l'arrêt du système. Pour arrêter le système ou la partition logique, sélectionnez la procédure appropriée. Arrêt d'un système non géré par une console HMC ou une console SDMC Pour effectuer une autre tâche, vous devez arrêter le système. Ces instructions indiquent comment arrêter le système à l'aide du bouton d'alimentation ou de l'interface ASMI. Avant de commencer Avant d'arrêter le système, procédez comme suit : 1. Vérifiez que tous les travaux sont terminés et fermez toutes les applications. 2. Assurez-vous que le système d'exploitation est arrêté. Avertissement : Si vous ne le faites pas, vous risquez de perdre des données. 3. Si une partition logique Virtual I/O Server (VIOS) est en cours d'exécution, assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pourquoi et quand exécuter cette tâche La procédure ci-dessous explique comment arrêter un système non géré par une console HMC (Hardware Management Console). Procédure 1. Connectez-vous au système sous un nom d'utilisateur bénéficiant des droits d'accès nécessaires pour exécuter la commande shutdown ou pwrdwnsys (mettre le système hors tension). 2. A l'invite, tapez la commande appropriée : v Si le système exécute le système d'exploitation AIX, tapez shutdown. v Si votre système exécute le système d'exploitation Linux, tapez shutdown -h now. v Si votre système exécute le système d'exploitation , tapez PWRDWNSYS. Si votre système est partitionné, utilisez la commande PWRDWNSYS pour arrêter chaque partition secondaire. Ensuite, utilisez la commande PWRDWNSYS pour arrêter la partition principale. La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de mise sous tension commence à clignoter lentement et le système passe en veille. 3. Sur la ligne de commande Linux, tapez shutdown -h now. La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de mise sous tension commence à clignoter lentement et le système passe en veille. 4. Sur la ligne de commande Linux, tapez shutdown -h now. La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de mise sous tension commence à clignoter lentement et le système passe en veille. 5. Pour restaurer l'état du système à l'issue de l'opération d'installation ou de remplacement, notez le type et le mode de l'IPL. 6. Mettez les interrupteurs d'alimentation des unités reliées au système sur Off. 7. Débranchez tous les cordons d'alimentation connectés à l'unité des socles de prise de courant. Vérifiez que vous débranchez également les cordons d'alimentation des périphériques (imprimantes et unités d'extension, par exemple). 92 Activation SAS (Serial-attached SCSI) RAID Important : Le système est équipé d'un second bloc d'alimentation. Avant de continuer la procédure, vérifiez que toutes les sources d'alimentation ont été débranchées du système. (L003) ou Arrêt d'un système via la console HMC Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour arrêter le système ou une partition logique. Pourquoi et quand exécuter cette tâche Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous arrêtez sa dernière partition logique en cours d'exécution. Si vous définissez les propriétés du système géré sur la console HMC de façon à empêcher la mise hors tension automatique du système, vous devez utiliser cette procédure pour mettre hors tension le système géré. Avertissement : Dans la mesure du possible, arrêtez les partitions logiques en cours d'exécution avant de mettre le système géré hors tension. Si vous n'arrêtez pas d'abord les partitions logiques, vous risquez de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual I/O Server (VIOS), assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants : v Administrateur central v Technicien de maintenance v Opérateur v Responsable produit Procédures communes aux dispositifs installables 93 Procédez comme suit pour arrêter le système via la console HMC. Procédure 1. Dans la zone de navigation, développez le dossier Gestion de systèmes. 2. Cliquez sur l'icône Serveurs. 3. Dans la zone de contenu, sélectionnez le système géré. 4. Sélectionnez Tâches, Opérations, puis Mise hors tension 5. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK. Information associée Arrêt et redémarrage de partitions logiques Arrêt d'un système via la console SDMC Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console) pour arrêter le système ou un serveur virtuel. Pourquoi et quand exécuter cette tâche Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous arrêtez le dernier serveur virtuel en cours d'exécution. Si vous définissez les propriétés du système géré sur la console SDMC de façon à empêcher la mise hors tension automatique du système, vous devez utiliser cette procédure pour mettre hors tension le système géré. Avertissement : Si possible, arrêtez les serveurs en cours d'exécution sur le système géré avant de mettre ce dernier hors tension. Si vous n'arrêtez pas d'abord les serveurs virtuels, vous risquez de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual I/O Server (VIOS) assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants : v Administrateur central v Technicien de maintenance v Opérateur v Responsable produit Procédez comme suit pour arrêter le système via la console SDMC. Procédure 1. Dans la zone des ressources Power Systems sélectionnez le système géré que vous voulez mettre hors tension. 2. Dans le menu Actions, sélectionnez Opérations > Mise hors tension. 3. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK. Retrait et remplacement des capots sur le système 02E/4B, 02E/4C, 05F/6B ou 05E/6C Les présentes instructions permettent de retirer, remettre en place ou installer des capots pour accéder à des composants ou effectuer des opérations de maintenance. 94 Activation SAS (Serial-attached SCSI) RAID Retrait du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour retirer le capot avant, procédez comme suit : Procédure 1. Retirez les deux vis de transport (si elles sont présentes) fixant l'unité centrale à l'armoire. Remarque : La remise en place des vis de transport est facultative mais doit être effectuée dans le cas de zones géographiques soumises à une activité sismique. 2. Retirez le capot du système. Le capot est doté d'un renfoncement permettant de le tenir plus facilement. Retrait du capot avant sur un système autonome 02E/4B, 02E/4C ou 05F/6B Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour retirer le capot avant, procédez comme suit : Procédure 1. Ouvrez le volet avant (A) en le faisant pivoter vers la gauche, comme indiqué dans la figure suivante. Figure 55. Retrait du capot avant Procédures communes aux dispositifs installables 95 2. Appuyez sur le taquet de verrouillage (B) et retirez le capot du système. Le capot est doté d'un renfoncement permettant de le tenir plus facilement. Installation du capot avant sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire Cette procédure permet d'installer le capot permettant d'accéder aux composants ou d'effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le capot avant, procédez comme suit : Procédure 1. Poussez le capot sur le système. 2. Poussez doucement le capot jusqu'à ce que les quatre taquets de déverrouillage soient positionnés dans leurs montants de fixation comme indiqué dans la figure suivante. Le capot avant se met en place ; il est doté d'un renfoncement permettant de le tenir plus facilement. Remarque : La remise en place des vis de transport est facultative mais doit être effectuée dans le cas de zones géographiques soumises à une activité sismique. Figure 56. Retrait du capot avant 96 Activation SAS (Serial-attached SCSI) RAID Installation des capot et volet avant sur un système 02E/4B, 02E/4C ou 05F/6B autonome Cette procédure permet d'installer le capot permettant d'accéder aux composants ou d'effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le capot et le volet avant, procédez comme suit. Procédure 1. Alignez le capot (A) jusqu'à ce que les deux taquets du couvercle soient placées dans les encoches de plaque de base (B), comme illustré dans la figure suivante. 2. Faites pivoter le capot vers le haut et vers le système, jusqu'à ce que le taquet de déverrouillage soit inséré dans son emplacement. Le capot est doté d'un renfoncement permettant de le tenir plus facilement. Figure 57. Retrait du capot avant Procédures communes aux dispositifs installables 97 3. Maintenez le volet avant à un angle d'environ 120 degrés par rapport au système comme l'illustre la figure suivante. Engagez les taquets. Retrait du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire Cette procédure décrit comment retirer le capot d'accès. Procédure 1. Desserrez les deux vis moletées (A) situées à l'arrière du capot. Figure 58. Installation du capot avant Figure 59. Installation du volet avant 98 Activation SAS (Serial-attached SCSI) RAID 2. Faites glisser le capot vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Avertissement : Lorsque le capot d'accès pour la maintenance est retiré, le système est mis hors tension. Retrait du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B Cette procédure décrit comment retirer le capot d'accès. Procédure 1. Desserrez la vis moletée (A) située à l'arrière du capot. 2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Figure 60. Retrait du capot d'accès Procédures communes aux dispositifs installables 99 3. Desserrez les deux vis moletées (A) situées à l'arrière du capot intérieur. 4. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Avertissement : Lorsque le capot d'accès pour la maintenance est retiré, le système est mis hors tension. Figure 61. Installation du capot d'accès extérieur 100 Activation SAS (Serial-attached SCSI) RAID Installation du capot d'accès sur un système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire Cette procédure décrit comment installer le capot d'accès. Procédure 1. Placez le capot d'accès sur le haut du système, à environ 25 mm du rebord du châssis supérieur. 2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 3. Alignez les deux vis moletées (A) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du boîtier du système. 4. Serrez les vis moletées afin de fixer le capot d'accès. Figure 62. Retrait du capot d'accès intérieur Procédures communes aux dispositifs installables 101 Installation du capot d'accès d'un système autonome 02E/4B, 02E/4C ou 05F/6B Cette procédure décrit comment installer le capot d'accès. Procédure 1. Placez le capot d'accès interne (A) sur le haut du système, à environ 25 mm du rebord du châssis supérieur. 2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 3. Alignez les deux vis moletées (B) situées à l'arrière du capot d'accès avec les deux trous à l'arrière du boîtier du système. 4. Serrez les vis moletées afin de fixer le capot d'accès. Figure 63. Installation du capot d'accès 102 Activation SAS (Serial-attached SCSI) RAID 5. Placez le capot d'accès extérieur (A) sur le haut du système. 6. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 7. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du système. 8. Serrez la vis moletée afin de fixer le capot d'accès. Figure 64. Installation du capot d'accès intérieur Procédures communes aux dispositifs installables 103 Mise en position de maintenance ou de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le cadre d'une opération de maintenance ou pour accéder à des composants internes. Mise en position de maintenance du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire Utilisez cette procédure pour placer en position de maintenance le système monté en armoire. Avant de commencer Remarques : v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait qu'une seule unité centrale en position de maintenance à la fois. v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez celle-ci de l'armoire. v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent, ce qui empêche l'unité de trop sortir de son emplacement. Procédure 1. Ouvrez le volet avant de l'armoire. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Retirez les deux vis (A) (si elles sont présentes) fixant l'unité centrale (B) à l'armoire. Figure 65. Installation du capot d'accès extérieur 104 Activation SAS (Serial-attached SCSI) RAID Remarque : La remise en place des vis de transport est facultative mais doit être effectuée dans le cas de zones géographiques soumises à une activité sismique. 4. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale (A), tirez l'unité centrale (B) hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et verrouillées. Figure 66. Placement du système en position de maintenance Procédures communes aux dispositifs installables 105 Mise en position de fonctionnement du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C monté en armoire Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement. Avant de commencer Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire. Procédure 1. Déverrouillez les taquets de sécurité bleus (A) en les levant vers le haut. Les taquets de sécurité sont situés près du milieu du système. 2. Poussez l'unité centrale (B) dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité soient verrouillés. 3. Le cas échéant, replacez et serrez les deux vis (A) fixant l'unité centrale (B) à l'armoire. Remarque : La remise en place des deux vis de transport est facultative mais doit être effectuée dans le cas de zones géographiques soumises à une activité sismique. 4. Fermez le volet avant de l'unité dont vous effectuez la maintenance. Figure 67. Mise en position de fonctionnement du système 106 Activation SAS (Serial-attached SCSI) RAID Déconnexion des cordons d'alimentation du système 02E/4B, 02E/4C, 05F/6B ou 05E/6C Utilisez ces procédures pour déconnecter les cordons d'alimentation du système. Procédure 1. Pour un système monté en armoire, ouvrez le volet arrière de l'armoire sur l'unité sur laquelle vous effectuez la maintenance et identifiez l'unité centrale. 2. Pour un système monté en armoire et un système autonome, débranchez tous les cordons d'alimentation de l'unité sur laquelle vous effectuez la maintenance. Le système est peut-être équipé de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de remplacement, vérifiez que toutes les deux sources d'alimentation ont été débranchées du système. Figure 68. Mise en position de fonctionnement du système Procédures communes aux dispositifs installables 107 Figure 69. Retrait des cordons d'alimentation pour un système monté en armoire Figure 70. Retrait des cordons d'alimentation pour un système autonome 108 Activation SAS (Serial-attached SCSI) RAID Connexion des cordons d'alimentation au système 02E/4B, 02E/4C, 05F/6B ou 05E/6C Cette procédure permet de connecter les cordons d'alimentation au système. Procédure 1. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 2. Connectez tous les cordons d'alimentation à l'unité dont vous effectuez la maintenance. Vérifiez que les cordons d'alimentation passent par les poignées. 3. Fermez le volet arrière de l'armoire. Installation d'un dispositif via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de maintenance, y compris l'installation d'un nouveau dispositif ou composant. Avant de commencer Pour employer l'interface utilisateur de la console HMC pour installer un dispositif ou un composant dans une unité centrale ou une unité d'extension gérée par une console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs. 2. Sélectionnez le système géré sur lequel vous allez installer le composant. 3. Dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel. 4. Sélectionnez Ajout d'une FRU (unité remplaçable sur site). Figure 71. Connexion des cordons d'alimentation Procédures communes aux dispositifs installables 109 5. Dans la fenêtre Installation/ajout/suppression de matériel, sélectionnez le système ou le boîtier dans lequel vous installez le dispositif. 6. Sélectionnez dans la liste le type de dispositif que vous souhaitez installer et cliquez sur Suivant. 7. Sélectionnez le code d'emplacement pour l'installation du dispositif et cliquez sur Ajouter. 8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'installation du dispositif. Remarque : La console HMC peut afficher des instructions expliquant comment installer le dispositif. Dans ce cas, suivez ces instructions pour installer le dispositif. Retrait d'un composant via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer de nombreuses opérations de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Pour employer l'interface utilisateur de la console HMC afin de retirer un composant dans une unité centrale ou une unité d'extension gérée par une console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs. 2. Sélectionnez le système géré sont vous souhaitez retirer un composant. 3. Sélectionnez l'une des options suivantes. v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel. v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des tâches, développez, Maintenabilité > Matériel > Retrait de FRU. 4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Retrait de FRU. 5. Dans la fenêtre Installation/ajout/suppression de matériel - Retrait de FRU, sélectionnez le type de FRU, sélectionnez le système ou le boîtier duquel vous souhaitez retirer le composant. 6. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant. 7. Sélectionnez l'emplacement du composant que vous retirez et cliquez sur Ajouter. 8. Une fois que le composant figure dans la liste des actions en attente, cliquez sur Lancer la procédure et suivez les instructions de retrait du composant. Remarque : La console HMC peut afficher des instructions de l'Information Center expliquant comment retirer le composant. Dans ce cas, suivez ces instructions pour retirer le composant. Remplacement d'un composant via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez ces instructions. Si vous échangez un composant dans le cadre d'une autre procédure à l'aide de la version 7 ou d'une version ultérieure de la console HMC, procédez comme suit : 110 Activation SAS (Serial-attached SCSI) RAID Procédure 1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs. 2. Sélectionnez le système géré dont vous souhaitez remplacer un composant. 3. Sélectionnez l'une des options suivantes. v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel. v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des tâches, développez, Maintenabilité > Matériel > Remplacement de FRU. 4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Remplacement de FRU. 5. Dans la fenêtre Remplacement de matériel - Remplacement de FRU, Sélectionnez le type de FRU, sélectionnez dans la liste le système ou le boîtier dans lequel vous souhaitez échanger un composant. 6. Dans le menu, sélectionnez le type du composant et cliquez sur Suivant. 7. Dans le menu, sélectionnez le code emplacement du composant et cliquez sur Ajouter. 8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'échange du dispositif. Remarque : Il se peut que la console HMC ouvre une fenêtre d'instructions pour le remplacement du composant. Dans ce cas, suivez les instructions pour remplacer le composant. Vérification du composant installé Vous pouvez vérifier un nouveau composant installé ou remplacé sur votre système, partition logique ou unité d'extension à l'aide du système d'exploitation, de diagnostics autonomes ou de la console HMC (Hardware Management Console). Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils dans le système d'exploitation AIX pour vérifier que ce dispositif ou composant est reconnu par le système ou la partition logique. Pourquoi et quand exécuter cette tâche Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé, sélectionnez la procédure appropriée : v Vérification du dispositif installé via AIX v Vérification du composant de remplacement via AIX Vérification du dispositif installé via le système d'exploitation AIX : 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. 4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures suivantes : v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des ressources et appuyez sur Entrée. Procédures communes aux dispositifs installables 111 v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les ressources et appuyez sur Entrée. 6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en répondant aux invites éventuelles. 7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun incident n’a été détecté ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et revenez en mode de fonctionnement normal. Vérification du composant de remplacement via le système d'exploitation AIX : Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé, procédez comme suit : 1. Avez remplacé le composant à l'aide du système d'exploitation AIX ou des opérations de remplacement à chaud du service de diagnostic en ligne ? Non : Passez à l'étape 2. Oui : Passez à l'étape 5, à la page 113. 2. Le système est-il hors tension ? Non : Passez à l'étape 4. Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus d'informations, consultez la rubrique Exécution d'une initialisation lente. 3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation AIX s'affiche ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête. Avez-vous vu l'invite de connexion AIX ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement et de démarrage du système d'exploitation. Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le composant. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Passez à l'étape 4. 4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite s'affiche, passez à l'étape 5, à la page 113. Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez comme suit : a. Sélectionnez la ressource, puis appuyez sur Entrée. b. Sélectionnez Validation. c. Suivez les instructions affichées. d. Si le message Souhaitez-vous revoir l'erreur affichée précédemment ? s'affiche, cliquez sur Oui et appuyez sur Entrée. 112 Activation SAS (Serial-attached SCSI) RAID e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services. f. Si aucun code SRN ne s'affiche, passez à l'étape 5. 5. Testez le composant en procédant comme suit : a. A l'invite, tapez diag et appuyez sur Entrée. b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. d. Sélectionnez Toutes les ressources ou sélectionnez les programmes de diagnostic du composant pour tester uniquement le composant remplacé ou les unités qui y raccordées et appuyez sur Entrée. Le menu Action corrective sur ressource s'est-il affiché ? Non : Passez à l'étape 6. Oui : Passez à l'étape 7. 6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est terminée. v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. Conseil : Le voyant du composant passe de l'état Incident à l'état normal. Passez à l'étape 9, à la page 114. 7. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant passe à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource s'est-il affiché de nouveau ? Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9, à la page 114. Oui : Passez à l'étape 8. 8. Dans le menu Action corrective sur ressource, le composant parent ou enfant de la ressource que vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Le voyant du composant passe de l'état Incident à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, Procédures communes aux dispositifs installables 113 sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. 9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures précédentes), restaurez la valeur initiale des paramètres. 10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ? Non : Passez à l'étape 11. Oui : Passez à l'étape 12. 11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu démarrer le système d'exploitation ? Non : Contactez votre fournisseur de services. La procédure est terminée. Oui : Passez à l'étape 12. 12. Les voyants sont-ils encore allumés ? v Non. Cela marque la fin de la procédure. v Oui : Désactivez les voyants. Pour plus d'informations, consultez la rubrique suivante : Modification des indicateurs de service. Vérification d'un composant installé sur un système Linux ou une partition logique Si vous avez installé un nouveau composant, utilisez les instructions de cette section pour savoir comment vérifier que le système reconnaît ce composant. Pourquoi et quand exécuter cette tâche Pour vérifier un composant que vous venez d'installer ou de remplacer consultez la rubrique «Vérification d'un composant installé via les programmes de diagnostic autonome». Vérification d'un composant installé via les programmes de diagnostic autonome Si vous avez installé ou remplacé un composant, vérifiez que le système le reconnaît. Vous pouvez utiliser les programmes de diagnostic autonomes pour vérifier un composant installé sur un système AIX ou Linux, une unité d'extension ou une partition logique. Avant de commencer v Si le serveur est directement connecté à un autre serveur ou à un réseau, vérifiez que la communication avec les autres serveurs est interrompue. v Les programmes de diagnostic autonomes doivent utiliser l'ensemble des ressources de la partition logique. Aucune autre activité ne peut être exécutée sur la partition. v Les programmes de diagnostic autonomes doivent accéder à la console système. Pourquoi et quand exécuter cette tâche Pour accéder aux programmes de diagnostic, utilisez le CD-ROM approprié ou le serveur NIM (Network Installation Management). La procédure suivante explique comment utiliser les programmes de diagnostic depuis le CD-ROM. Pour obtenir plus d'informations sur l'exécution de diagnostics à partir du serveur NIM, voir Exécution de diagnostics autonomes à partir d'un serveur NIM. Pour utiliser les programmes de diagnostic autonomes, procédez comme suit : 114 Activation SAS (Serial-attached SCSI) RAID Procédure 1. Arrêtez les travaux et les applications, puis arrêtez le système d'exploitation sur le système ou la partition logique. 2. Retirez les bandes, les disquettes et les CD-ROM. 3. Mettez l'unité centrale hors tension. L'étape suivante permet d'initialiser le serveur ou la partition logique à partir du CD-ROM des programmes de diagnostic autonomes. Si l'unité de disques optique n'est pas configurée comme unité d'amorçage sur le serveur ou la partition logique que vous utilisez, procédez comme suit : a. Lancez l'interface ASMI. Pour plus d'informations sur l'utilisation de l'interface ASMI, reportez-vous à la rubrique Accès à l'interface ASMI. b. Dans le menu principal de l'interface ASMI, cliquez sur Contrôle d'alimentation/redémarrage. c. Cliquez sur Mettre le système sous/hors tension. d. Dans le menu déroulant des modes d'initialisation de la partition logique AIX ou Linux sélectionnez l'option Amorçage du mode service à partir de la liste par défaut. e. Cliquez sur Sauvegarder les paramètres et mettre sous tension. Dès que l'unité de disques optique est sous tension, insérez le CD-ROM des programmes de diagnostic autonomes. f. Passez à l'étape 5. 4. Mettez l'unité centrale sous tension et insérez immédiatement le CD-ROM de programme de diagnostic autonome dans l'unité de disque optique. 5. Après l'affichage du voyant POST clavier sur la console système et avant l'affichage du dernier voyant POST (haut-parleur), appuyez sur la touche numérique 5 de la console système pour indiquer qu'une initialisation en mode service doit être effectuée au moyen de la liste des unités d'amorçage en mode service par défaut. 6. Tapez le mot de passe requis. 7. Dans l'écran des instructions d'exécution des tests de diagnostic, appuyez sur Entrée. Conseil : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Remarque : Si le système a généré un code SRN ou un autre code de référence alors que vous tentiez de le démarrer, contactez votre fournisseur de service pour obtenir de l'aide. 8. Si le type de terminal est requis, sélectionnez l'option Initialisation du terminal du menu Sélection des fonctions pour initialiser le système d'exploitation. 9. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. 10. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 11. Lorsque le menu Sélection des tests de diagnostic, mode étendu s'affiche, sélectionnez Toutes les ressources ou testez uniquement le composant remplacé ou les unités qui y sont rattachées en sélectionnant les programmes de diagnostic appropriés puis en appuyant sur Entrée. 12. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. v Oui : Passez à l'étape 13. 13. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures précédentes), restaurez la valeur initiale des paramètres. 14. Si les voyants sont toujours allumés, procédez comme suit : a. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement pour désactiver les voyants d'identification et d'avertissement et appuyez sur Entrée. b. Sélectionnez Mettre l'indicateur d'avertissement système à NORMAL, puis appuyez sur Entrée. Procédures communes aux dispositifs installables 115 c. Sélectionnez Mettre tous les indicateurs d'identification à NORMAL, puis appuyez sur Entrée. d. Sélectionnez Validation. Remarque : Les voyants d'identification et d'incident passent de l'état Incident à l'état Normal. e. Retournez à la ligne de commande. Vérification d'un composant installé à l'aide de la console HMC (Hardware Management Console) Si vous avez installé ou remplacé un composant, utilisez la console HMC (Hardware Management Console) pour mettre à jour les enregistrements de la console HMC une fois que vous avez effectué une opération de maintenance sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure. Pourquoi et quand exécuter cette tâche Pour vérifier un composant installé, procédez comme suit : Procédure 1. Sur la console HMC, recherchez d'éventuels événements d'action de maintenance ouverts dans le journal des événements de maintenance. Pour plus d'informations, reportez-vous à la rubrique «Affichage des événements réparables à l'aide de la console HMC», à la page 118. 2. Des événements d'action de maintenance sont-ils ouverts ? Non : Si le voyant d'avertissement système est encore allumé, utilisez la console HMC pour le désactiver. Consultez la rubrique «Activation/Désactivation des voyants via la console HMC», à la page 117. La procédure est terminée. Oui : Passez à l'étape suivante. 3. Notez la liste des événements d'action de maintenance ouverts. 4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils les mêmes que ceux que vous avez collectés auparavant ? v Non : Sélectionnez l'une des options suivantes : – Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond et passez à l'étape suivante. – Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez votre fournisseur de services. v Oui : Passez à l'étape suivante. 5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence l'événement d'action de maintenance. 6. Cliquez sur Fermer l'événement. 7. Ajoutez des commentaires sur l'événement réparable. Incluez toute autre information propre à cet événement. Cliquez sur OK. 8. Avez-vous remplacé, ajouté ou modifié une unité remplaçable sur site (FRU) de l'événement d'action de maintenance ouvert ? v Non : Sélectionnez l'option Aucune FRU remplacée pour cet événement réparable et cliquez sur OK pour fermer l'événement d'action de maintenance. v Oui : Procédez comme suit : a. Dans la liste des unités remplaçables sur site, sélectionnez une unité à mettre à jour. b. Cliquez deux fois sur l'unité, puis mettez à jour les informations correspondantes. c. Cliquez sur OK pour refermer l'événement d'action de maintenance. 9. Si les incidents persistent, contactez votre fournisseur de services. 116 Activation SAS (Serial-attached SCSI) RAID Activation/Désactivation des voyants via la console HMC Utilisez cette procédure pour activer ou désactiver des voyants via le Point focal de service pour la console HMC (Hardware Management Console). Pourquoi et quand exécuter cette tâche Choisissez l'une des options suivantes : v «Désactivation d'un voyant d'avertissement système ou de partition» v «Activation ou désactivation d'un voyant d'identification» Désactivation d'un voyant d'avertissement système ou de partition : Pourquoi et quand exécuter cette tâche Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement. Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient. Procédure 1. Dans la zone de navigation, ouvrez le dossier Gestion de systèmes. 2. Ouvrez Serveurs et sélectionnez le système approprié. 3. Dans la zone de contenu, cochez la case correspondant à la partition concernée. 4. Sélectionnez Tâches, Opérations, puis Gestion des voyants d'avertissement système. 5. Sélectionnez la partition appropriée. 6. Sélectionnez Désactivation du voyant d'avertissement système dans le menu Action. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Un message de vérification indique que le voyant d'avertissement système a été désactivé. v Un message indique que le système présente encore des incidents non résolus. v Un message indique que le voyant d'avertissement système ne peut pas être activé. 7. Sélectionnez l'une des partitions logiques dans le tableau du bas, puis sélectionnez Désactivation du voyant de partition dans le menu Opérations sur la partition. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Une vérification indique que le voyant de la partition logique a été désactivé. v Un message indique que la partition logique présente encore des incidents non résolus. v Un message indique que le voyant de la partition logique ne peut pas être activé. Activation ou désactivation d'un voyant d'identification : Pourquoi et quand exécuter cette tâche Le système comporte plusieurs voyants d'identification de composants, tels que les boîtiers ou les unités remplaçables sur site (FRU). On les appelle donc voyants d'identification. Vous pouvez activer ou désactiver les types de voyants d'identification suivants : v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique (boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le nouveau matériel. v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports ouverts sur plusieurs adaptateurs. Procédures communes aux dispositifs installables 117 Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme suit : Procédure 1. Dans la zone de navigation, ouvrez l'élément Gestion de systèmes. 2. Sélectionnez Serveurs. 3. Dans la zone de contenu, cochez la case correspondant au système concerné. 4. Sélectionnez Tâches, Opérations, Statut du voyant, puis Voyant d'identification. 5. Pour activer ou désactiver le voyant d'identification d'un boîtier, sélectionnez le boîtier dans le tableau, puis cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. 6. Pour activer ou désactiver le voyant d'identification d'une unité FRU, choisissez un boîtier dans le tableau, puis sélectionnez Sélectionné > Liste des FRU. 7. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. Affichage des événements réparables à l'aide de la console HMC Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et historique de maintenance. Pourquoi et quand exécuter cette tâche Pour pouvoir consulter des événements réparables et les informations qui leur sont associées, vous devez avoir l'un des rôles suivants : v Administrateur central v Technicien de maintenance v Opérateur v Responsable produit v Visionneur Pour afficher des événements réparables, procédez comme suit : Procédure 1. Dans la zone de navigation, sélectionnez Gestion de la maintenance. 2. Sélectionnez Gestion des événements réparables. 3. Sélectionnez les critères des événements réparables à visualiser, puis cliquez sur OK. La fenêtre Présentation d'un événement réparable s'affiche. La liste comporte les événements réparables correspondant à vos critères de sélection. La barre de menus vous propose des actions réalisables sur les événements réparables. 4. Sélectionnez une ligne dans la fenêtre Présentation d'un événement réparable, puis choisissez Sélectionné > Affichage des détails La fenêtre Détails de l'événement réparable affiche des informations détaillées sur l'événement réparable. Le tableau du haut comporte des informations telles que le numéro d'incident et le code de référence. Le tableau du bas indique les unités remplaçables sur site (FRU) associées à cet événement. 5. Sélectionnez l'erreur pour laquelle vous souhaitez consulter les commentaires et l'historique, puis procédez comme suit : a. Sélectionnez Actions > Visualisation des commentaires. b. Pour quitter les commentaires, cliquez sur Fermeture. c. Sélectionnez Actions > Affichage de l'historique de maintenance. La fenêtre Historique de maintenance comporte l'historique de maintenance associé à l'erreur sélectionnée. d. Pour quitter l'historique de maintenance, cliquez sur Fermeture. 118 Activation SAS (Serial-attached SCSI) RAID 6. Quand vous avez terminé, cliquez sur Annulation dans les fenêtres Détails de l'événement réparable et Présentation de l'événement réparable pour les refermer. Vérification du composant installé via la console SDMC Si vous avez installé ou remplacé un composant, utilisez la console Systems Director Management Console (SDMC) pour mettre à jour vos enregistrements SDMC lorsqu'une opération de maintenance est terminée sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure. Pourquoi et quand exécuter cette tâche Pour vérifier un composant installé, procédez comme suit : Procédure 1. Depuis la console SDMC, recherchez dans le journal des événements les opérations de maintenance ouvertes. Pour plus de détails, voir «Visualisation d'événements réparables via la console de gestion Systems Director», à la page 120. 2. Des événements d'action de maintenance sont-ils ouverts ? Non : Si le voyant d'avertissement système est encore allumé, utilisez la console SDMC pour l'éteindre. Pour plus d'informations, voir «Activation/Désactivation de voyants via la console de gestion Systems Director». La procédure est terminée. Oui : Passez à l'étape suivante. 3. Notez la liste des événements d'action de maintenance ouverts. 4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils les mêmes que ceux que vous avez collectés auparavant ? v Non : Sélectionnez l'une des options suivantes : – Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond et passez à l'étape suivante. – Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez votre fournisseur de services. v Oui : Passez à l'étape suivante. 5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence l'événement d'action de maintenance. 6. Cliquez sur Supprimer ou Ignorer. Remarque : Ces options ne sont disponibles qu'à partir du journal des événements. Activation/Désactivation de voyants via la console de gestion Systems Director Utilisez cette procédure pour activer ou désactiver des voyants via la console Systems Director Management Console. Pourquoi et quand exécuter cette tâche Choisissez l'une des options suivantes : v «Désactivation d'un voyant d'avertissement système ou de partition» v «Activation ou désactivation de voyants d'identification via la console SDMC», à la page 120 Désactivation d'un voyant d'avertissement système ou de partition : Pourquoi et quand exécuter cette tâche Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement. Procédures communes aux dispositifs installables 119 Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient. Procédure 1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié. 2. Sélectionnez Actions > Service et support > Matériel > Voyant d'avertissement système. 3. Sélectionnez Désactivation du voyant d'avertissement système. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Un message de vérification indique que le voyant d'avertissement système a été désactivé. v Un message indique que le système présente encore des incidents non résolus. v Un message indique que le voyant d'avertissement système ne peut pas être activé. 4. Sélectionnez l'un des serveurs virtuels, puis sélectionnez Désactivation du voyant d'avertissement système. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Un message de vérification indique que le voyant d'avertissement système a été désactivé. v Un message indique que la partition logique présente encore des incidents non résolus. v Un message indique que le voyant du serveur virtuel ne peut pas être activé. Activation ou désactivation de voyants d'identification via la console SDMC : Pourquoi et quand exécuter cette tâche Le système comporte différents voyants d'identification de composants, tels que les boîtiers ou les unités remplaçables sur site (FRU). On les appelle donc voyants d'identification. Vous pouvez activer ou désactiver les types de voyants d'identification suivants : v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique (boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le nouveau matériel. v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports ouverts sur plusieurs adaptateurs. Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme suit : Procédure 1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié. 2. Sélectionnez Actions > Service et support > Matériel > Voyant d'identification. 3. Sous Voyant d'identification, sélectionnez la fenêtre Boîtier puis l'unité centrale ou le boîtier. 4. Pour activer ou désactiver un voyant d'identification, cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. 5. Pour activer ou désactiver le voyant d'identification d'une unité FRU, sélectionnez un système ou un boîtier dans le tableau, puis sélectionnez Liste des unités FRU. 6. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. Visualisation d'événements réparables via la console de gestion Systems Director Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et historique de maintenance. 120 Activation SAS (Serial-attached SCSI) RAID Pourquoi et quand exécuter cette tâche Pour afficher des événements réparables, procédez comme suit : Procédure 1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié. 2. Sélectionnez Actions > Etat et santé du système > Journal des événements. 3. Facultatif : Vous pouvez limiter les critères de recherche d'événement à l'aide du menu de filtrage des événements. 4. Sélectionnez une ligne dans la fenêtre des événements puis sélectionnez Actions > Propriétés La fenêtre Propriétés s'ouvre, affichant les informations détaillées relatives à l'événement réparables. Le tableau indique des informations telles que le numéro d'incident, le code de référence et les unités remplaçables sur site (FRU) associées à l'événement. Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à l'aide des outils Virtual I/O Server Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils Virtual I/O Server (VIOS) pour vérifier que ce dispositif ou composant est reconnu par le système ou la partition logique. Pourquoi et quand exécuter cette tâche Pour vérifier le fonctionnement d'un dispositif récemment installé ou d'un composant de remplacement, sélectionnez la procédure appropriée : v Vérification du dispositif installé via VIOS v Vérification du composant de remplacement via VIOS Vérifiez le dispositif installé via VIOS : 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée. 3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée. 4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures suivantes : v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des ressources et appuyez sur Entrée. v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les ressources et appuyez sur Entrée. 6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en répondant aux invites éventuelles. 7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun incident n’a été détecté ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide, contactez votre fournisseur de services. Procédures communes aux dispositifs installables 121 v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et revenez en mode de fonctionnement normal. Vérifiez le composant de remplacement via VIOS : Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé, procédez comme suit : 1. Avez-vous remplacé le composant à l'aide de VIOS ou des opérations de remplacement à chaud du service de diagnostic en ligne ? Non : Passez à l'étape 2. Oui : Passez à l'étape 5. 2. Le système est-il hors tension ? Non : Passez à l'étape 4. Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus d'informations, reportez-vous à la rubrique Exécution d'une initialisation lente. 3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation VIOS s'affiche ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête. Avez-vous vu l'invite de connexion VIOS ? v Non : Si un code SRN ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou un câble ne soit pas bien connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement et de démarrage du système d'exploitation. Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le composant. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Passez à l'étape 4. 4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite s'affiche, passez à l'étape 5. Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez comme suit : a. Sélectionnez la ressource, puis appuyez sur Entrée. b. Sélectionnez Validation. c. Suivez les instructions affichées. d. Si un message vous demande si vous souhaitez revoir l'erreur affichée précédemment, cliquez sur Oui et appuyez sur Entrée. e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services pour obtenir de l'aide. f. Si aucun code SRN ne s'affiche, passez à l'étape 5. 5. Testez le composant en effectuant les opérations suivantes : a. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée. b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester uniquement le composant remplacé ou les périphériques qui y sont raccordées et appuyez sur Entrée. Le menu Action corrective sur ressource s'est-il affiché ? 122 Activation SAS (Serial-attached SCSI) RAID Non : Passez à l'étape 6. Oui : Passez à l'étape 7. 6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est terminée. v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. Conseil : Le voyant du composant passe de l'état Incident à l'état normal. Passez à l'étape 9. 7. Dans le menu Action corrective sur ressource, sélectionnez la ressource du composant remplacé. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant passe à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource s'est-il affiché de nouveau ? Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. Oui : Passez à l'étape 8. 8. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource que vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Le voyant du composant passe de l'état Incident à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. 9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures précédentes), restaurez la valeur initiale des paramètres. 10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ? Non : Passez à l'étape 11. Oui : Passez à l'étape 12, à la page 124. 11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu démarrer le système d'exploitation ? Non : Contactez votre fournisseur de services. La procédure est terminée. Oui : Passez à l'étape 12, à la page 124. Procédures communes aux dispositifs installables 123 12. Les voyants sont-ils encore allumés ? v Non. La procédure est terminée. v Oui : Désactivez les voyants. Pour plus d'informations, consultez l'une de ces rubriques : Modification des indicateurs de service Installation d'un composant via la console SDMC Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses tâches de maintenance, y compris l'installation d'une nouvelle unité remplaçable sur site (FRU) ou d'un nouveau composant. Avant de commencer Pour utiliser l'interface utilisateur de la console SDMC pour installer un composant sur un système ou une unité d'extension géré(e) par une console SDMC, procédez comme suit : Procédure 1. Dans la zone des ressources Power Systems sélectionnez le système sur lequel vous voulez installer un composant. 2. Dans le menu Actions, développez Service and Support > Matériel > Tâches de modification de matériel > Ajout de FRU. 3. Dans l'onglet d'ajout de FRU, sélectionnez le système ou le type de boîtier. 4. Sélectionnez dans le menu le type de FRU que vous souhaitez installer et cliquez sur Suivant. 5. Sélectionnez le code d'emplacement pour l'installation du composant et cliquez sur Ajouter. 6. Une fois l'unité remplaçable sur site (FRU) placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'installation du composant. Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour l'installation du composant. Dans ce cas, suivez les instructions pour installer le composant. Remplacement d'un composant via la console SDMC Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses tâches de maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez ces instructions. Procédure 1. Dans la zone des ressources Power Systems, sélectionnez le système sur lequel vous souhaitez remplacer un composant. 2. Sélectionnez l'une des options suivantes : v Si vous remplacez un composant en dehors du cadre d'une opération de maintenance, depuis le menu Actions, développez Service and Support > Matériel > Remplacement de FRU. v Si vous procédez à un échange de composant dans le but de corriger un événement réparable, voir Démarrage d'une réparation. 3. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Remplacement de FRU. 124 Activation SAS (Serial-attached SCSI) RAID 4. Dans l'onglet Remplacement de FRU, sélectionnez l'unité centrale ou d'extension sur laquelle vous souhaitez remplacer un composant. 5. Sélectionnez dans le menu le type de composant que vous souhaitez remplacer et cliquez sur Suivant. 6. Sélectionnez dans le menu le code d'emplacement du composant que vous souhaitez remplacer et cliquez sur Ajouter. 7. Une fois l'unité remplaçable sur site (FRU) placée dans la fenêtre des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'installation du composant. Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le remplacement du composant. Dans ce cas, suivez les instructions pour remplacer le composant. Retrait d'un composant via la console SDMC Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses tâches de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Pour utiliser l'interface utilisateur de la console SDMC pour retirer un composant sur un système ou une unité d'extension géré(e) par une console SDMC, procédez comme suit : Procédure 1. Dans la zone des ressources Power Systems, sélectionnez le système duquel vous souhaitez retirer un composant. 2. Sélectionnez l'une des options suivantes : v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, depuis le menu Actions développez Service and Support > Matériel > Tâches de modification de matériel. v Si vous retirez un composant dans le cadre d'une opération de maintenance, depuis le menu Actions développez Service and Support > Matériel > Remplacement de FRU. 3. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Remplacement de FRU. 4. Dans la fenêtre d'ajout ou de remplacement de FRU, sélectionnez le système ou l'unité d'extension duquel ou de laquelle vous souhaitez retirer le composant. 5. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant. 6. Sélectionnez l'emplacement du composant, puis cliquez sur Ajouter. 7. Une fois que le composant figure dans la fenêtre des actions en attente, cliquez sur Lancer la procédure et suivez les instructions de retrait du composant. Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le retrait du composant. Dans ce cas, suivez ces instructions pour retirer le composant. Procédures communes aux dispositifs installables 125 126 Activation SAS (Serial-attached SCSI) RAID Remarques Le présent document peut contenir des informations ou des références concernant certains produits, logiciels ou services du fabricant non annoncés dans ce pays. Le présent document peut contenir des informations ou des références concernant certains produits, logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier lui-même le fonctionnement de tout produit, programme ou service. Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences, veuillez en faire la demande par écrit au fabricant. Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas applicable. Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement. Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels décrits dans ce document. Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites relève de votre seule responsabilité. Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation de sa part, tout ou partie des informations qui lui seront fournies. Les données de performance indiquées dans ce document ont été déterminées dans un environnement contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont applicables à leur environnement d'exploitation. Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces produits. 127 Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif. Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans préavis. Les tarifs appliqués peuvent varier selon les revendeurs. Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées avant la mise à disposition des produits décrits. Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite. Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur n'apparaissent pas à l'écran. Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites, même partiellement, sans l'autorisation écrite d'IBM. Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est exploitable dans aucun autre but. Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension, de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés. Restrictions liées à l'utilisation de connexion Ethernet Ce produit n'est pas destiné à être connecté, directement ou indirectement, par quelque moyen que ce soit, à des interfaces de réseaux de télécommunications publiques. Marques IBM, le logo IBM et ibm.com sont des marques d'International Business Machines aux Etats-Unis et/ou dans certains autres pays. Les autres noms de produits et de services peuvent être des marques d'IBM ou d'autres sociétés. La liste actualisée de toutes les marques d'IBM est disponible sur la page Web Copyright and trademark information à http://www.ibm.com/legal/copytrade.shtml. Linux est une marque déposée de Linus Torvalds aux Etats-Unis et/ou dans certains autres pays. Les autres noms de produits et de services peuvent être des marques d'IBM ou d'autres sociétés. Bruits radioélectriques Lorsque vous connectez un moniteur à l'équipement, vous devez utiliser le câble fourni à cet effet, ainsi que toute unité de suppression des interférences. 128 Activation SAS (Serial-attached SCSI) RAID Remarques sur la classe A Les avis de conformité de classe A suivants s'appliquent aux serveurs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes. Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. IBM ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones commerciales et industrielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Remarques 129 Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus : Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI (Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle, il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - République populaire de Chine Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. 130 Activation SAS (Serial-attached SCSI) RAID Avis d'interférences électromagnétiques (EMI) - Taïwan Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus. Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer. Avis d'interférences électromagnétiques (EMI) - Corée Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Remarques 131 Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Avis d'interférences électromagnétiques (EMI) - Russie Remarques sur la classe B Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones résidentielles. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière. Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet incident en appliquant une ou plusieurs des mesures suivantes : v Réorienter ou repositionner l'antenne de réception. v Eloigner l'appareil du récepteur. 132 Activation SAS (Serial-attached SCSI) RAID v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct. v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de l'aide. Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones résidentielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Remarques 133 Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - Corée Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 134 Activation SAS (Serial-attached SCSI) RAID IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Dispositions Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes. Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise, ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données, logiciels ou droits de propriété intellectuelle contenus dans ces publications. Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les instructions susmentionnées n'ont pas été respectées. Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à l'exportation. LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Bull ESCALA PL 3200R Service Guide 86 A1 82EF 03 ORDER REFERENCE Bull ESCALA PL 3200R Service Guide Hardware March 2004 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 82EF 03 ORDER REFERENCE The following copyright notice protects this book under the Copyright laws of the United States of America and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull S.A. 1992, 2004 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. AIX is a registered trademark of International Business Machines Corporation, and is being used under licence. UNIX is a registered trademark in the United States of America and other countries licensed exclusively through the Open Group. Linux is a registered trademark of Linus Torvalds. The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material. Contents Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .vii Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .vii Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .vii Laser Safety Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .ix Data Integrity and Verification . . . . . . . . . . . . . . . . . . . . . . . . . . .xi About This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii Highlighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiii Related Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiv Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .xiv Chapter 1. Reference Information . . . . . . . . . . . . . . . . . . . . . . . . . .1 406/90 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 Hardware Management Console (HMC) . . . . . . . . . . . . . . . . . . . . . . . . .3 Powering the System On and Off . . . . . . . . . . . . . . . . . . . . . . . . . .10 406/90 Data Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11 Power-On Self-Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 POST Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 POST Keys . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .12 406/90 System Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . .13 406/1D I/O Subsystem Locations . . . . . . . . . . . . . . . . . . . . . . . . . . .24 Power Subsystem Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . .29 Power Distribution Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . .30 I/O Subsystem Power Distribution Cabling . . . . . . . . . . . . . . . . . . . . . . .33 Integrated Battery Feature Locations . . . . . . . . . . . . . . . . . . . . . . . . .35 Subsystem Positioning and Cabling . . . . . . . . . . . . . . . . . . . . . . . . . .36 System Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .60 I/O Subsystem Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .62 Logical and Physical Locations . . . . . . . . . . . . . . . . . . . . . . . . . . .64 Physical Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64 AIX Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .129 Service Inspection Guide . . . . . . . . . . . . . . . . . . . . . . . . . . . . .142 Preventive Maintenance . . . . . . . . . . . . . . . . . . . . . . . . . . . . .142 Chapter 2. Diagnostics Overview . . . . . . . . . . . . . . . . . . . . . . . . .145 Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . . . . . .145 Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .145 FRU Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .147 FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .147 Running Diagnostics on the Native Serial Port on a Full Machine Partition with an HMC Attached 148 Chapter 3. Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . .149 Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .149 Quick Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .151 MAP 1020: Problem Determination . . . . . . . . . . . . . . . . . . . . . . . . .158 MAP 1230: Linux Platform Error Problem Resolution . . . . . . . . . . . . . . . . . . .163 MAP 1235: System Array Self-Repair Problem Resolution for Linux . . . . . . . . . . . . .168 MAP 1320: Service Focal Point . . . . . . . . . . . . . . . . . . . . . . . . . . .169 MAP 1321: Quick Entry MAP for Systems with Service Focal Point . . . . . . . . . . . . . .170 MAP 1322: End of Call MAP for Systems with Service Focal Point . . . . . . . . . . . . . .183 iii MAP 1420: Recovery Procedures for Hot-Pluggable PCI Adapters . . . . . . . . . . . . . .188 MAP 1520: Power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .192 MAP 1521: The System Will Not Power On And No Error Codes Are Available . . . . . . . . .196 MAP 1522: UEPO Switch On The BPC Is In The Bypass Position . . . . . . . . . . . . . .199 MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault . . . . . . . . . . .200 MAP 1524: An Open Room EPO Switch Has Been Detected From One Side . . . . . . . . . .201 MAP 1525: There Is a 350 Volt Bulk Failure . . . . . . . . . . . . . . . . . . . . . .203 MAP 1526: There Is An Integrated Battery Feature (IBF) Failure . . . . . . . . . . . . . . .206 MAP 1527: An Airflow Loss Has Been Detected . . . . . . . . . . . . . . . . . . . . .209 MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault . . . . . . . . . . .211 MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure . . . . . . . . . .212 MAP 152a: Loss of ac Power or Phase Missing . . . . . . . . . . . . . . . . . . . . .214 MAP 152b: 2.5 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .216 MAP 152c: 1.8 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .223 MAP 152d: 1.5 V Auxiliary Current/Voltage Problem in Processor Subsystem . . . . . . . . . .231 MAP 152e: 1.5 V CPU Current/Voltage Problem in Processor Subsystem . . . . . . . . . . .240 MAP 152f: 3.3 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .245 MAP 152g: 5.0 V Current/Voltage Problem in Processor Subsystem . . . . . . . . . . . . .253 MAP 152h: 3.3V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .259 MAP 152i: 5.0 V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .267 MAP 152j: 2.5 V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .276 MAP 152k: 12.0 V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . .284 MAP 152l: -12.0V Current/Voltage Problem in I/O Subsystem . . . . . . . . . . . . . . . .293 MAP 152m: Cable Problem in Power Subsystem . . . . . . . . . . . . . . . . . . . .300 MAP 152n: DASD Subsystem Power Problem . . . . . . . . . . . . . . . . . . . . .304 MAP 1540: Problem Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . .308 MAP 1541: JTAG Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . .310 MAP 1542: I/O Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . . .318 MAP 1543: MCM Module Problem Isolation . . . . . . . . . . . . . . . . . . . . . .337 MAP 1544: Inner L3 Module Problem Isolation . . . . . . . . . . . . . . . . . . . . .340 MAP 1545: Outer L3 Module Problem Isolation . . . . . . . . . . . . . . . . . . . . .342 MAP 1546: Memory Book Problem Isolation . . . . . . . . . . . . . . . . . . . . . .345 MAP 1547: I/O Book Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .348 MAP 1548: Memory and Processor Problem Isolation . . . . . . . . . . . . . . . . . . .350 MAP 1549: Attention Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .353 MAP 154A: I2C Bus Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . .361 MAP 154B: Insufficient Hardware Resources Problem Isolation . . . . . . . . . . . . . . .365 Chapter 4. Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . .369 IPL Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .369 Service Processor Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . .372 Firmware Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .379 Chapter 5. Error Code to FRU Index . . . . . . . . . . . . . . . . . . . . . . . .413 Performing Slow Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .413 Confirming Initial Error Code . . . . . . . . . . . . . . . . . . . . . . . . . . . .414 Four-Character Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . .415 Replacing the Media Subsystem Operator Panel . . . . . . . . . . . . . . . . . . . .415 Replacing the Network Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . .415 Determining Location Code . . . . . . . . . . . . . . . . . . . . . . . . . . . .415 Checkpoints and Error Codes Index . . . . . . . . . . . . . . . . . . . . . . . . .416 Operator Panel Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . .419 SPCN Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .420 Firmware/POST Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . .488 Service Processor Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . .516 System Firmware Update Messages . . . . . . . . . . . . . . . . . . . . . . . . .736 iv 406/90 Service Guide Capacity Upgrade on Demand (CUoD) Messages . . . . . . . . . . . . . . . . . . . .736 Scan Dump Messages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .736 Common Firmware Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . .737 Problem Determination-Generated Error Codes . . . . . . . . . . . . . . . . . . . . .757 Hypervisor Dump Retrieval Procedure . . . . . . . . . . . . . . . . . . . . . . . .757 Chapter 6. Using the Online and Standalone Diagnostics . . . . . . . . . . . . . . . .759 Operating Considerations for Online and Standalone Diagnostics . . . . . . . . . . . . . .759 Online Diagnostics Modes of Operation . . . . . . . . . . . . . . . . . . . . . . . .760 Standalone Diagnostic Operation . . . . . . . . . . . . . . . . . . . . . . . . . .764 Chapter 7. Using the Service Processor . . . . . . . . . . . . . . . . . . . . . . .769 Service Processor Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . .769 General User Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .771 Privileged User Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .772 Service Processor Parameters in Service Mode (Full System Partition) . . . . . . . . . . . .801 System Power-On Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . .802 Service Processor Reboot/Restart Recovery . . . . . . . . . . . . . . . . . . . . . .803 System Firmware Updates . . . . . . . . . . . . . . . . . . . . . . . . . . . . .805 Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed . . . . . . . . . .809 Installing Corrective Service on the Frame . . . . . . . . . . . . . . . . . . . . . . .810 Reconfiguration Procedure for SNI Adapters . . . . . . . . . . . . . . . . . . . . . .811 Configuring and Deconfiguring Processors or Memory . . . . . . . . . . . . . . . . . .812 Service Processor System Monitoring - Surveillance . . . . . . . . . . . . . . . . . . .813 Call-Out (Call-Home) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .815 Console Mirroring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .816 Service Processor Error Logs . . . . . . . . . . . . . . . . . . . . . . . . . . .817 LCD Progress Indicator Log . . . . . . . . . . . . . . . . . . . . . . . . . . . .818 Resetting the Service Processor . . . . . . . . . . . . . . . . . . . . . . . . . .819 Service Processor Operational Phases . . . . . . . . . . . . . . . . . . . . . . . .820 Chapter 8. Using System Management Services . . . . . . . . . . . . . . . . . . .823 Select Language . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .824 Change Password Options . . . . . . . . . . . . . . . . . . . . . . . . . . . .825 View Error Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .825 Setup Remote IPL (Initial Program Load) . . . . . . . . . . . . . . . . . . . . . . .826 Change SCSI Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .828 Select Console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .828 Select Boot Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .829 Select Boot Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .831 Display Current Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . .833 Restore Default Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . .834 Multiboot Startup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .834 Exiting System Management Services . . . . . . . . . . . . . . . . . . . . . . . .834 Chapter 9. Removal and Replacement Procedures . . . . . . . . . . . . . . . . . . .835 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .837 Powering Off and Powering On the System . . . . . . . . . . . . . . . . . . . . . .838 Hot-Pluggable FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .842 Power Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Processor Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .853 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .913 Media Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .998 Chapter 10. Parts Information . . . . . . . . . . . . . . . . . . . . . . . . . .1009 406/1R Rack Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1010 Contents v 406/81 Processor Subsystem (Front) . . . . . . . . . . . . . . . . . . . . . . . .1012 406/81 Processor Subsystem (Front) . . . . . . . . . . . . . . . . . . . . . . . .1014 406/81 Processor Subsystem (Rear) . . . . . . . . . . . . . . . . . . . . . . . .1018 406/81 Media Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . .1021 Power and SCSI Cables to the Media Subsystem . . . . . . . . . . . . . . . . . . .1023 406/1D I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1025 Power Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1027 RIO Cables and I/O Power Cables . . . . . . . . . . . . . . . . . . . . . . . . .1029 Keyboards and Mouse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1031 System Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1032 Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1033 Chapter 11. SRNs for Linux . . . . . . . . . . . . . . . . . . . . . . . . . . .1035 Appendix A. Environmental Notices . . . . . . . . . . . . . . . . . . . . . . . .1049 Product Recycling and Disposal . . . . . . . . . . . . . . . . . . . . . . . . . .1049 Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . . . . . . .1054 Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1055 Appendix C. Service Processor Setup and Test . . . . . . . . . . . . . . . . . . .1057 Service Processor Setup Checklist . . . . . . . . . . . . . . . . . . . . . . . . .1057 Testing the Service Processor Setup . . . . . . . . . . . . . . . . . . . . . . . .1058 Appendix D. Modem Configurations . . . . . . . . . . . . . . . . . . . . . . . .1061 Sample Modem Configuration Files . . . . . . . . . . . . . . . . . . . . . . . . .1061 Configuration File Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . .1062 Transfer of a Modem Session . . . . . . . . . . . . . . . . . . . . . . . . . . .1066 Modem Configuration Sample Files . . . . . . . . . . . . . . . . . . . . . . . . .1068 Appendix E. Ground Path . . . . . . . . . . . . . . . . . . . . . . . . . . . .1089 Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1091 vi 406/90 Service Guide Safety Notices A danger notice indicates the presence of a hazard that has the potential of causing death or serious personal injury. Danger notices appear on the following pages: v viii v 192 v 837 v 861 v 987 A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor personal injury. Caution notices appear on the following pages: v viii v ix v 133 v 192 v 837 v 859 v 908 v 998 v 935 v 946 For a translation of the safety notices contained in this book, see the System Unit Safety Information, order number SA23-2652. Rack Safety Instructions v Do not install this unit in a rack where the internal rack ambient temperatures will exceed 35 degrees C. v Do not install this unit in a rack where the airflow is compromised. Any side, front or back of the unit used for air flow through the unit must not be in direct contact with the rack. v Care should be taken to ensure that a hazardous condition is not created due to uneven mechanical loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before installing or removing this unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to the rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement for the supply circuit. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock.Electrical Safety Observe the following safety instructions any time you are connecting or disconnecting devices attached to the system. vii DANGERAn electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines.D06 CAUTION: This product is equipped with a four-wire (three-phase and ground) power cable for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock.C27 DANGERTo prevent electrical shock hazard, disconnect all power cables from the electrical outlet before relocating the system.D01 CAUTION: This unit has more than one power supply cord. Follow procedures for removal of power from the system when directed.C28 CAUTION: Energy hazard, remove power before servicing.C22 CAUTION: Energy hazard, remove all jewelry before servicing.C29 viii 406/90 Service Guide Laser Safety Information CAUTION: This product may contain a CD-ROM, DVD-ROM, or laser module on a PCI card, which are class 1 laser products.C30 Laser Compliance All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with the IEC 825 (first edition 1984) as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: All laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation, user maintenance, or prescribed service conditions. Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. Only trained service personnel should perform the inspection or repair of optical fiber cable assemblies and receptacles.C25, C26 Preface ix x 406/90 Service Guide Data Integrity and Verification These computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check our support websites for updated information and fixes applicable to the system and related software. xi xii 406/90 Service Guide About This Book This book provides maintenance information that is specific to the 406/90, adapters, and attached devices that do not have their own service information. It also contains Maintenance Analysis Procedures (MAPs) that are not common to other systems. MAPs that are common to all systems are contained in the Diagnostic Information for Multiple Bus Systems. This book is used by the service representative to repair system failures. This book assumes that the service representative has had training on the system. ISO 9000 ISO 9000 registered quality systems were used in the development and manufacturing of this product. Highlighting The following highlighting conventions are used in this book: Bold Identifies commands, subroutines, keywords, files, structures, directories, and other items whose names are predefined by the system. Also identifies graphical objects such as buttons, labels, and icons that the user selects. Italics Identifies parameters whose actual names or values are to be supplied by the user. Monospace Identifies examples of specific data values, examples of text similar to what you might see displayed, examples of portions of program code similar to what you might write as a programmer, messages from the system, or information you should actually type. xiii In this manual, 406/90 stands for ESCALA PL 3200R. Related Publications The following publications are available: v The System Unit Safety Information, order number 86 X1 11WD contains translations of safety information used throughout this book. v The Hardware Management Console Operations Guide, order number SA23-1251, contains information on how to set up and cable the hardware management console and verify system operation. v The Hardware Management Console Maintenance Guide, order number SA23-1273, contains MAPS, removal and replacement procedures, error codes, and parts information that help trained service representatives diagnose and repair the hardware management console. v The 45H/W4 Switch Planning, Installation, and Service Guide, order number GA22-7954, contains information on installing and servicing the pSeries High Performance Switch (machine type 7045-SW4), which is an option on this system. v The 406/90, order number SA23-1248, contains information on how to set up and cable the system, install and remove options, and verify system operation. v The 406/90, order number SA23-1249, contains information on how to use the system, use diagnostics, use service aids, and operations. v The Diagnostic Information for Multiple Bus Systems, order number SA23-2769, contains common diagnostic procedures, error codes, service request numbers, and failing function codes. This manual is intended for trained service technicians. v The Adapters, Devices, and Cable Information for Multiple Bus Systems, order number SA23-2778, contains information about adapters, external devices, and cabling. This manual is intended to supplement information found in the Diagnostic Information for Multiple Bus Systems. v The PCI Adapter Placement Reference, order number SA23-2504, contains information regarding slot restrictions for adapters that can be used in this system. v The Site and Hardware Planning Information, order number SA38-0508, contains information to help you plan your installation. v The AIX Installation Guide and Reference, order number SC23-4389, describes how to install systems, and to use a network server to do functions such as installing the operating system or diagnostics on systems connected to the network.Trademarks The following terms are trademarks of International Business Machines Corporation in the United States, other countries, or both: v AIX v Electronic Service Agent v IBM® v LANstreamerOther company, product, and service names may be trademarks or service marks of others. xiv 406/90 Service Guide Chapter 1. Reference Information This chapter provides an overview of the 406/90, including a logical description and a physical overview of the system. Additional details pertaining to the server are also provided. They include the following: v Memory overview and ordering rules v General description of the operator panel v Cabling rules v System location rules and descriptions v Powering on and powering off the system v Power flow v Data flow406/90 Overview The 406/90 server is a multiprocessor, multibus system composed of the following subsystems: v 406/1R, a 24-inch wide and 42-EIA-high rack unit that also contains the following: – An 8-EIA unit power subsystem, consisting of two redundant bulk power assemblies that distribute bulk power at 350 V to each of the subsystems. – An optional 2 EIA-unit integrated battery feature (IBF), which is available in either a redundant or non-redundant configuration.v 406/81 Processor Subsystem, a 17 EIA-unit processor subsystem, which contains the processors, L3 cache, memory, and service processor and a 1 EIA-unit media subsystem containing a diskette drive, a CD-ROM, and an operator panel, with space for up to three additional optional media devices. There is an option to expand the number of RIO (in this case RIO-2) ports to . This allows up to RIO-2 I/O subsystems, each with 2 GB-per-second transfers on each planar. Processor speeds up to 1.5 GHz are also available. Additional memory cards that operate at the higher 567 MHz bus speed are available. v 406/1D I/O Subsystem, 4 EIA-units, which can have up to 20 PCI adapters and up to 16 DASD disk drives. This subsystem may also be purchased to allow 10 or 20 PCI-X adapters, for use with the RIO-2 cabling option. Additionally, there is an option that allows 1 GB RIO-2 on each remote I/O port. RIO-2 allows up to 2 GB I/O transfers when cabled as a single planar loop (four CEC RIO-2 ports per I/O subsystem). v A Hardware Management Console (HMC), which provides a standard user interface for configuring and operating partitioned systems. One HMC can support multiple partitions on multiple systems. 1 The basic system consists of the redundant bulk power subsystem, one processor subsystem, a media subsystem, and one I/O subsystem in the same rack, and one HMC, as shown in the following illustration: 1 Hardware Management Console 5, 6* 406/1R primary Integrated Battery Feature (IBF), optional for the primary power subsystem. Redundant IBF, also optional, and installs from the rear. 2 406/1R Bulk Power Subsystem 7 406/1D I/O Subsystem 3 406/81 Processor Subsystem 8 406/1D I/O Subsystem (Optional) 4 406/81 Media Subsystem 9 406/1D I/O Subsystem (Optional) * 406/1D I/O Subsystem (Optional, If features 5 and 6 are not installed) You can expand the system by adding I/O subsystems for a total of eight in a minimum of two racks. The preceding figure shows the single rack. Additional I/O drawers and integrated battery features are installed in a second rack. A number of cables connect the power, processor, and I/O subsystems. These cables include the following: v UPIC (Universal Power Interface Controller) cables v RIO (Remote Input Output) cablesThe ac power cords are connected to the bulk power subsystem, which distributes the power throughout the system. Each I/O subsystem contains two I/O backplanes. Each backplane is capable of supporting the following: v Up to 10 adapters each (20 per subsystem) v Up to 8 disk drives (16 per subsystem) in two DASD 4-packs (4 per subsystem) v One distributed converter assembly (DCA) (2 per subsystem) v Two I/O subsystem fans (4 per subsystem)On each I/O backplane, PCI bus slots 1 through 7 are 3.3 V dc connectors running at 66 MHz, supporting 64-bit PCI adapters. Slots 8 through 10 are 64-bit 5 V dc slots, running at 33 MHz. 2 406/90 Service Guide The media subsystem provides space for up to three media devices (diskette drive, CD-ROM drive, and optional tape drive or DVD-RAM) in the front and two additional optional devices in the rear. Hardware Management Console (HMC) The Hardware Management Console (HMC) supports your system with features that allow you to manage configuration and operation of partitions in a system, as well as add and remove hardware without interrupting system operation. In this book, a system that is managed by the HMC is referred to as the managed system. The HMC uses its serial connection to the managed system to perform various functions. The HMC’s main functions include the following: v Creating and maintaining a multiple partition environment v Detecting, reporting, and storing changes in hardware conditions v Acting as a service focal point for service representatives to determine an appropriate service strategyPartitioning the system is similar to partitioning a hard drive. When you partition a hard drive, you divide a single hard drive so that the operating system recognizes it as a number of separate hard drives. The same holds true for the HMC’s partitioning capabilities, except the HMC allows you to divide the system’s processors, memory, and I/O. On each of these divisions, you can install an operating system and use each partition as you would a separate physical machine. This division of system resources is called a logical partition. The objective of partitioning is to provide users with the capability to split a single system into several independent systems, each capable of running applications in multiple, independent environments simultaneously. For example, partitioning makes it possible for a user to run a single application using different sets of data on separate partitions, as if it were running independently on separate physical systems. Service representatives use Service Focal Point (SFP), an application that runs on the HMC, to start and end their service calls. SFP provides service representatives with the serviceable event, vital product data (VPD), and diagnostic information. The HMC is a closed system. Additional applications cannot be installed on the HMC. All the tasks needed to maintain the platform, the underlying operating system, and the HMC application code are available by using the HMC’s management applications. Partition Standby and Full System Partition Power-On Options Booting your system in partition standby is markedly different from booting your system in a traditional single-machine full system partition. In partition standby, the system sets aside system memory for partition management. Partition Standby Memory Issues Unique issues are associated with assigning memory to each partition created in partition standby. In partition standby, the HMC allocates a portion of each assigned memory block to the system. Each partition requires a minimum of 1 gigabyte (GB) of system memory. When you start creating partitions, the system sets aside 256 megabytes (MB) of contiguous memory for its own use, and allocates another 256 MB of contiguous memory for each 16 GB allocated. Page Table Memory Usage: Partition page tables are additional memory required for a partition to operate, and is in addition to the total logical memory size of a partition. The partition table is outside of a partition’s accessible memory. The partition page table must be constructed with contiguous real system memory segments. Use the following table to help you keep track of the system’s page-table memory usage: Chapter 1. Reference Information 3 Partition Memory Size (256 MB increments) Partition Page Table Size (4 16-byte entries per 4 K real page) Partition Page Table Alignment Assigned Memory Segments (256 MB) 1 GB 16 MB 16 MB 1 1 GB - 2 GB 32 MB 32 MB 1 2 GB - 3 GB 64 MB 64 MB 1 4 GB - 7 GB 128 MB 128 MB 1 8 GB - 15 GB 256 MB 256 MB 1 16 GB - 31 GB 512 MB 512 MB 2 32 GB - 63 GB 1 GB 1 GB 4 64 GB - 96 GB 2 GB 2 GB 8 In a full system partition, the operating system uses all of the installed memory; the system does not set aside contiguous memory for its own use. Keep this in mind when you record the amount of memory used when you are completing the planning worksheets in the Hardware Management Console Operations Guide, order number SA23-1251. Partition Requirements To activate a partition, you need a minimum of the following: v 1 GB of available system memory v One available system processor v One available network adapter (for error reporting)If you attempt to activate a partition and the resources that you specified are not available at that time, activation fails. It is important to keep track of your system’s resources and your activation times to avoid failures. Service Focal Point This chapter provides information about using the Service Focal Point application. The Service Focal Point application is used to help the service representative diagnose and repair problems on partitioned systems. Service representatives use the HMC as the starting point for all service issues. The HMC groups various system management issues at one control point, allowing service representatives to use the Service Focal Point application to determine an appropriate service strategy. Traditional service strategies become more complicated in a partitioned environment. Each partition runs on its own, unaware that other partitions exist on the same system. If one partition reports an error for a shared resource, such as a managed system power supply, other active partitions report the same error. The Service Focal Point application enables service representatives to avoid long lists of repetitive call-home information by recognizing that these errors repeat, and by filtering them into one error code. The following types of errors are reported to Service Focal Point: v Permanent hardware errors (detected by the managed system or operating system) v LAN surveillance errors detected by Service Focal Point v Hardware boot failure errorsThe following errors are not reported to Service Focal Point: v Software errors v Temporary hardware errors v Undetermined hardware errors 4 406/90 Service Guide v Informational hardware errorsErrors that require service are reported to the HMC as serviceable events. Because the HMC stores these serviceable events for 90 days and then discards them, it is important to have the partition and HMC date and time set correctly. For instance, if the date on a partition’s software is set 90 days behind the HMC’s set time, the serviceable events reported by this partition are immediately discarded. For more information about setting the HMC’s date and time, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251. To set the partition’s date and time, see the documentation provided with the operating system that is running on that partition. Getting Started When you are setting up Service Focal Point, keep the following in mind: v If the time configured on a partition is 90 days older than time configured on the HMC, serviceable events cannot be reported. v Verify that the HMC host names are defined. For more information about using fully qualified and short host names, see Hardware Management Console Installation and Operations Guide, order number SA23-1251. v If you need to add or change a partition name, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251.Testing Error Reporting To ensure that Service Focal Point is configured correctly, generate a test error by doing the following: 1. In the partition, run diagnostics to test the managed system’s operator panel. 2. When the diagnostics window asks you if you see 0000 on the managed system’s operator panel, select NO. This action generates an error. 3. In the SRN window, press Enter to proceed. 4. When the system asks you if you want the error sent to Service Focal Point, select YES. 5. Type F3 to exit diagnostics. 6. Wait for one minute while the managed system sends the error to Service Focal Point. 7. Check the Serviceable Event window to ensure that the error was sent to Service Focal Point and that Service Focal Point reported the error. For more information about working with serviceable events, see “Working With Serviceable Events” on page 7.Service Focal Point Settings The Service Focal Point Settings task in the HMC Contents area allows you to configure your Service Focal Point application. Note: The Advanced Operator, Operator, and Viewer roles have read-only access to the following tasks. Automatic Call-Home Feature: You can configure the HMC to automatically call an appropriate service center when it identifies a serviceable event. To enable or disable the call-home feature, you must be a member of one of the following roles: v System Administrator v Service RepresentativeTo enable or disable the call-home feature for one or more systems, do the following:Note: It is strongly recommended that you not disable the call-home feature. When you disable the call-home feature, serviceable events are not automatically reported to the service representative. 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Service Focal Point Settings. Chapter 1. Reference Information 5 4. The Service Focal Point Settings window opens. Select the CEC Call Home tab at the top of the window. 5. Click on the managed system you want to enable or disable. 6. Click Enable to enable call-home for the selected system, or click Disable to disable call-home for the selected system. 7. Click OK.Setting Up Surveillance: Service Focal Point surveillance generates serviceable events when it detects communication problems between the HMC and its managed systems. You can configure how you want the HMC to survey the following: v The number of disconnected minutes that are considered an outage v The number of connected minutes you want the HMC to consider a recovery v The number of minutes between outages that are considered a new incidentTo set up surveillance, you must be a member of one of the following roles: v System Administrator v Service RepresentativeTo set up surveillance, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Service Focal Point Settings. 4. The Service Focal Point Settings window opens. Select the Surveillance Setup tab on the top of the window. 5. In the first field, select the number of minutes you want the HMC to wait before sending a disconnection error message. 6. In the second field, select the amount of connection time that the HMC is considered to be recovered. This amount is expressed in minutes. 7. In the third field, select the number of minutes between outages that you want the HMC to wait before sending a new incident report. 8. Select one or more managed systems from the table in the lower part of the window, then click Enable or Disable. Surveillance is then either enabled or disabled for the selected managed systems.Enabling Surveillance Notifications: You can enable or disable surveillance-error notification from this HMC to connected managed systems. Enabling this notification causes errors to be passed to the Service Agent application for notification. Note: You must further configure Service Agent to handle notifications sent by Service Focal Point. For more information about Service Agent, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251. To set up surveillance, you must be a member of one of the following roles: v System Administrator v Service RepresentativeTo set up surveillance-error notification, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Service Focal Point Settings. 6 406/90 Service Guide 4. The Service Focal Point Settings window opens. Select the Surveillance Notification tab at the top of the window. 5. Select one or more managed systems from the list, and click Enable or Disable. Surveillance notification is then either enabled or disabled for the selected managed systems.Working With Serviceable Events You can view, add, or update serviceable event information, including error details. Viewing Serviceable Events: To view serviceable events, you must be a member of one of the following roles: v System Administrator v Service Representative v Advanced Operator v Operator v ViewerTo view a serviceable event, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Select Serviceable Event. 4. Designate the set of serviceable events you want to view. When you are finished, click OK. 5. The Serviceable Event Overview window opens, and the entries displayed are ordered by time stamp. Each line in the Serviceable Event Overview window corresponds to one error within a serviceable event. On this window, designate the set of serviceable events you want to view by specifying your search criteria (such as event status or error class). Note: Only events that match all of the criteria that you specify are shown. 6. When you are finished, click OK.When you select a line in the Serviceable Event Overview window, all lines in the same serviceable event are selected. To open the Serviceable Event Details window for the selected event, select the event and click Event Details. Viewing Serviceable Event Details: To view serviceable event details, do the following: 1. Perform the steps in “Viewing Serviceable Events.” 2. The Serviceable Event Details window opens, showing extended serviceable event information, including the following: v Status v Earliest original time stamp of any managed object v AIX error log. (The Linux system error log does not place entries into Service Focal Point.) v Should this error ever get called home? v Error was called home v Pointer to extended error-data collection on the HMCThe window’s lower table displays all of the errors associated with the selected serviceable event. The information is shown in the following sequence: v Failing device system name v Failing device machine type/model/serial v Error class v Descriptive error text Chapter 1. Reference Information 7 Viewing Serviceable Event Error Details: To view serviceable event error details, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Select an error in the lower table, and click Error Details.Viewing Service Processor Error Details: To view service processor error details, do the following: 1. Perform the steps in “Viewing Serviceable Event Error Details.” 2. If the serviceable event error details you are viewing are for a service processor-class error, the lower table on the resulting window contains service processor errors. Select a service processor error from the lower table, and click Service Processor Error Details to see further details.Saving and Managing Extended Error Data: To save extended error (EE) data, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click Save EE Data. To save extended error data for only one error associated with the serviceable event (rather than for the entire serviceable event), select the error from the lower table, and click Error Details. In the next menu, click Manage EE Data.Viewing and Adding Serviceable Event Comments: To add comments to a serviceable event, you must be a member of the Service Representative or System Administrator roles. To add comments to a serviceable event, do the following: Note: You cannot edit or delete previous comments. 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Select the error to which you want to add comments to and click Comments. If you want to close the event and add comments, click Close Event from this window. The Serviceable Event Comments window opens. 3. Type your name and add comments as appropriate. You can also review previous comments, but you cannot edit this information. 4. If you clicked Comments on the Serviceable Event Details window, clicking OK commits your entry and returns you to the Serviceable Event Details window.If you clicked Close Event on the Serviceable Event Details window, clicking OK commits all changes and opens the Update FRU Information window. For more information about updating field replaceable unit information, see “Updating Field Replaceable Unit (FRU) Information.” Closing a Serviceable Event: To close a serviceable event, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click Close Event from this window. The Serviceable Event Comments window opens. 3. Click OK to commit your comments. The Update FRU Information window displays. See “Updating Field Replaceable Unit (FRU) Information” for information on completing this window. Click OK on the Update FRU Information window to close the serviceable event.Note: You must close a serviceable event after it has been serviced to ensure that if a similar error is reported later, it is called home. If an old problem remains open, the new similar problem is reported as a duplicate. Duplicate errors are neither reported nor called home to a service center. Close a serviceable event when the partition that reports the error is active. Closing the event causes the new status of the serviceable event to be correctly sent to the partition.Updating Field Replaceable Unit (FRU) Information: This task allows you to update the FRU information you changed or modified as a result of this serviceable event. From this panel, you can also activate and deactivate LEDs and search for other serviceable events that contain the same FRU entries. To update FRU information, do the following: 8 406/90 Service Guide 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click FRU Information. The Update FRU Information window opens. The lower table shows any parts that you have replaced or added during your current update session but that have not been committed to the serviceable event. The changes from the lower table are committed by clicking OK or Apply.From this window, you can also activate and deactivate LEDs and search for other serviceable events that contain the same FRU entries. Replacing an Existing FRU: To replace a part already listed for this serviceable event, do the following: 1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 8. 2. In the upper table, double-click the part you want to replace. 3. If the FRU has a new part number, type it in the New FRU Part Number field. 4. Click Replace FRU. The Update FRU Information window displays the FRU replacement information in the lower table. Click OK or Apply to commit the changes to the serviceable event.Adding a New FRU: You can add a part to the serviceable event that was not listed in the upper table of the Update FRU Information window. To add a new FRU for this serviceable event, do the following: 1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 8. 2. Click Add New FRU. 3. Type the FRU’s location code and its part number in the appropriate fields. 4. Click Add to List. The Update FRU Information window opens and displays the newly added FRU in the lower table. 5. Click OK or Apply to commit these changes to the serviceable event. Note: After you click OK or Apply, you cannot change this information. If you clicked the Close Event button in the Serviceable Event Details window, then clicking OK also completes the close dialog and changes the status of the serviceable event to Closed.Viewing Serviceable Event Partition Information: You can view partition information associated with this serviceable event. This information includes each affected partition’s state and resource use. 1. Perform the steps in “Viewing Serviceable Event Details” on page 7. 2. Click Partition Information.Activating and Deactivating FRU LEDs This task allows you to activate or deactivate a managed system’s system attention LED or any FRU LED. FRU LEDs are helpful in determining which FRUs need servicing. To activate or deactivate a managed system’s System Attention LED, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Hardware Service Functions. The LED Management window opens. 4. In the LED Management window, select one or more managed systems from the table. 5. Select either Activate LED or Deactivate LED. The associated System Attention LED is then either turned on or off.To activate or deactivate a FRU associated with a particular managed system, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Hardware Service Functions. The LED Management window opens. 4. In the LED Management window, select one managed system from the table. Chapter 1. Reference Information 9 5. Click the List FRUs... button. The list of FRU slot indexes and their respective current LED states display. 6. Select one or more FRU slot indexes. 7. Click either the Activate LED or the Deactivate LED button.The associated FRU LEDs are now either enabled (blinking) or off. Powering the System On and Off For complete procedures to power on and power off the system in various system configurations running various operating systems, see “Powering the System On” on page 841. 10 406/90 Service Guide 406/90 Data Flow The following figure shows the data flow for the system. Chapter 1. Reference Information 11 Power-On Self-Test After power is turned on and before the operating system is loaded, the partition does a power-on self-test (POST). This test performs checks to ensure that the hardware is functioning correctly before the operating system is loaded. During the POST, a POST screen displays, and POST indicators display on the virtual terminal. The next section describes the POST indicators and functions that can be accessed during the POST. POST Indicators POST indicators indicate tests that are being performed as the partition is preparing to load the operating system. The POST indicators are words that display on the virtual terminal. Each time that the firmware starts another step in the POST, a POST indicator word displays on the console. The POST screen displays the following words: Memory Memory test Keyboard Initialize the keyboard and mouse. The time period for pressing a key to access the System Management Services, or to initiate a service mode boot is now open. See “POST Keys” for more information. Network Self-test on network adapters SCSI Adapters are being initialized Speaker Sounds an audible tone at the end of POSTPOST Keys The POST keys, if pressed after the keyboard POST indicator displays and before the last POST indicator (speaker) displays, cause the system to start services or boot modes used for configuring the system and diagnosing problems. The system will beep to remind you to press the POST key (if desired) at the appropriate time. The keys are described as follows: 1 Key The numeric 1 key, when pressed during POST, starts the System Management Services (SMS) interface. 5 Key The numeric 5 key, when pressed during POST, initiates a partition boot in service mode using the default service mode boot list. This mode attempts to boot from the first device of each type found in the list. It does not search for other bootable devices of that type if the first device is not bootable. Instead, it continues to the next device type in the list. The firmware supports up to five entries in the boot list. The default boot sequence is: 1. Diskette 2. CD-ROM 3. Hard file 4. Tape drive (if installed) 5. Network a. Token ring b. Ethernet6 Key The numeric 6 key works like the numeric 5 key, except that firmware uses the customized service mode boot list, which can be set in the AIX service aids. 12 406/90 Service Guide 8 Key To enter the open firmware command line, press the numeric 8 key after the word keyboard displays and before the last word (speaker) displays during startup. After you press the 8 key, the remaining POST indicators display until initialization completes. This option is used only by service personnel to collect additional debug information. To exit from the open firmware command prompt, type reset-all or power off the system and reboot. 406/90 System Locations The 406/90 system consists of a minimum of four subsystems in one rack as follows: v Processor subsystem v Media subsystem v One I/O subsystem v Power subsystemThese components are connected by cables that transmit data and control signals. I/O subsystems can be added if further expansion of the system is required (up to seven additional I/O subsystems, using two racks). Chapter 1. Reference Information 13 In each of the following figures, the minimum system configuration is shown on the left. The minimum system configuration with the integrated battery feature (IBF) is shown on the right in each of the figures. 13326547 1 Rack 5 Media Subsystem 2 Bulk Power Assembly 6 Optional Integrated Battery Feature 3 UEPO Switch 7 First I/O subsystem 4 Memory Books 14 406/90 Service Guide 1 BPA Fan (U1.35-P1-F1) 4 Processor Subsystem Fan 1, (U1.18-F1) 2 BPA Fan (U1.35-P2-F1) 5 Processor Subsystem Fan 3, (U1.18-F3) 3 Processor Subsystem Fan 2, (U1.18-F2) 6 Processor Subsystem Fan 4, (U1.18-F4) Note: The I/O subsystem fan assemblies are inside the I/O subsystems, positioned from left to right behind the DASD 4-packs in order of fan assembly 1, fan assembly 2, fan assembly 3, and fan assembly 4. Chapter 1. Reference Information 15 UEPO Switch The UEPO (Unit Emergency Power Off) switch is accessible through the external cover. The switch contains a room emergency power off (EPO) interlock connector and bypass switch, a dual element rack service amber LED (one element driven per bulk power controller, a service complete button (white) and start service button (green), an off button and four thermistors for BPC ambient temperature sensing. The UEPO switch is shown in the following figure. 41235667891011 1 UEPO Front Panel 7 J02 Connector 2 UEPO Side Panel 8 Power Switch 3 UEPO Rear Panel 9 System Fault LED 4 J00 Connector Front BPA (Back) 10 Start Service Button (Green) 5 J01 Connector Rear BPA (Back) 11 Service Complete Button (White) 6 Room EPO Bypass Interlock The UEPO switch disables 350 V bulk power conversion and isolates all battery power to the battery compartments, without having to disconnect the heavy power cords under the customer’s raised floor. At the system level, the UEPO switch is connected to each of the two BPCs through two interface connectors. The switch can be concurrently replaced by forcing the BPCs into UEPO bypass mode, using the small slide switches on the face of the BPCs. The switch is mounted in the side pocket area of the rack and remains stationary when the front cover of the system is opened. The service buttons are used by service personnel to repair the power subsystem. If the system is off when the service complete button is pressed, the system will power on. A sliding plastic cover over the red switch prevents it from being accidentally activated in the customer environment. 16 406/90 Service Guide 406/81 Processor Subsystem Locations This section provides a front view and a rear view of the processor subsystem with descriptions and locations for each part of the processor subsystem. Detailed location information for processor subsystem I/O books is also included. 406/81 Processor Subsystem Front View Note: For slot assignments, see “Memory to Processor Relationships” on page 60. 13333222244445555677 1 Processor Subsystem Chassis 5 Fan Controller 2 Outer Memory Book 6 Fan Chassis 3 Inner Memory Book 7 Fan Filter 4 Fan Chapter 1. Reference Information 17 406/81 Processor Subsystem Rear View 1 I/O Book – Slot 1 8 Capacitor Book 2 DCA Book 1 9 DCA Book 6 (32-way, 1.7 GHz MCMs) 3 DCA Book 2 10 I/O book - Slot 0 4 Capacitor Book 11 I/O Book 3 - Slot 3 5 DCA Book 3 12 I/O Book 2 - Slot 2 (Inner Connectors) 6 DCA Book 4 13 I/O Book 2 - Slot 2 (Outer Connectors) 7 DCA Book 5 18 406/90 Service Guide I/O Book Connector Locations 1 Primary I/O Book, Slot 0 (U1.18-P1-H2) 19 I/O Book Slot 2 (D0) (U1.18–P1–H3/Q6)I/O Book Slot 1 (D0) (U1.18–P1–H1/Q6) 2 Secondary I/O Book, Slot 1 (U1.18-P1-H1), Slot 2 (U1.18-P1-H3) 20 I/O Book Slot 2 (C1) (U1.18-P1-H3/Q7)I/O Book Slot 1 (C1) (U1.18-P1-H1/Q7) 3 Third I/O Book, Slot 3 (U1.18-P1-H4) 21 I/O Book Slot 2 (C0) (U1.18-P1-H3/Q8)I/O Book Slot 1 (C0) (U1.18-P1-H1/Q8) 4 I/O Book Slot 0 (A0) (U1.18-P1-H2/Q1) 22 I/O Book Slot 2 (A0) (U1.18-P1-H3/Q4)I/O Book Slot 1 (A0) (U1.18-P1-H1/Q4) 5 I/O Book Slot 0 (A1) (U1.18-P1-H2/Q2) 23 I/O Book Slot 2 (A1) (U1.18-P1-H3/Q3)I/O Book Slot 1 (A1) (U1.18-P1-H1/Q3) 6 Operator Panel (U1.18-P1-H2/Q7) 24 Indicator LEDs 7 BPC Y-cable* connector U1.18-P1-H2/Q8 or U1.18-P1-H2/Q9 25 I/O Book Slot 2 (B0) (U1.18-P1-H3/Q2)I/O Book Slot 1 (B0) (U1.18-P1-H1/Q2) 8 I/O Book Slot 0 (B0) (U1.18-P1-H2/Q3) 26 Camming Latches 9 I/O Book Slot 0 (B1) (U1.18-P1-H2/Q4) 27 I/O Book Slot 2 (B1) (U1.18-P1-H3/Q1)I/O Book Slot 1 (B1) (U1.18-P1-H1/Q1) 10 Diskette Drive (U1.18-P1-H2/Q10) 28 I/O Book Slot 3 (D1) (U1.18-P1-H4/Q4) 11 HMC 1 (U1.18-P1-H2/S3) 29 I/O Book Slot 3 (D0) (U1.18-P1-H4/Q3) 12 HMC 2 (U1.18-P1-H2/S4) 30 I/O Book Slot 3 (C1) (U1.18-P1-H4/Q2) 13 Serial Port 1 (U1.18-P1-H2/S1) 31 I/O Book Slot 3 (C0) (U1.18-P1-H4/Q1) 14 Serial Port 2 (U1.18-P1-H2/S2) 32 I/O Book Slot 3 (A0) (U1.18-P1-H4/Q5) 15 SPCN 0 (Manufacturing use only) (U1.18-P1-H2/Q5) 33 I/O Book Slot 3 (A1) (U1.18-P1-H4/Q6) 16 SPCN 1 (Manufacturing use only) (U1.18-P1-H2/Q6) 34 I/O Book Slot 3 (B0) (U1.18-P1-H4/Q7) 17 Debug (Manufacturing use only) 35 I/O Book Slot 3 (B1) (U1.18-P1-H4/Q8) 18 I/O Book Slot 2 (D1) (U1.18-P1-H3/Q5)I/O Book Slot 1 (D1) (U1.18-P1-H1/Q5) * The Y-cable that attaches to this connector terminates at BPC-A connector U1.35-P1-X4/Q10 and BPC-B connector U1.35-P2-X4/Q10. Chapter 1. Reference Information 19 Media Subsystem Locations The following figures show locations for devices installed in the media subsystem, and the SCSI IDs for the media devices.Note: The SCSI IDs shown for media indicate how installed devices are set when shipped from the factory. Media Subsystem Front View 2SCSISCSI314 1 Diskette Drive 3 Optional Media Drive (4 mm Tape, CD-ROM, or DVD-RAM), SCSI Address 06 2 CD-ROM Drive, SCSI Address 05 4 Operator Panel Media Subsystem Rear View Note: DVD-RAM and 4-mm tape drives can be located in the rear of the media subsystem only if the room is maintained at a nominal room temperature of 24C (75.2F).28361SCSISCSI754 1 Optional Media (4-mm Tape, CD-ROM, or DVD-RAM), SCSI Address 05 5 Front SCSI Power Connection 2 Optional Media (4-mm Tape, CD-ROM, or DVD-RAM), SCSI Address 06 6 Front SCSI Connection 3 Rear SCSI Power Connection 7 Diskette Drive Connection 4 Rear SCSI Connection 8 Operator Panel Connection 20 406/90 Service Guide Media Subsystem Operator Panel R125346! 1 Power on/off button 4 Reset button 2 Power on/off LED 5 Service processor reset button (Service use only) 3 Operator panel display 6 Disturbance or system attention LED Note: You must activate the service processor reset button very carefully. Using an insulated paper clip is recommended. Unbend the clip so that it has a straight section about two inches long. Insert the clip straight into the hole, keeping the clip perpendicular to the plastic bezel. When you engage the test reset switch, you should feel the detent of the switch. After you press the switch, the service processor resets and then shuts down the system. Disturbance or System Attention LED The system attention LED lights on solid when an event occurs that either needs customer intervention or service. The system attention LED is turned on when an entry is made in the service processor error log. The error entry is transmitted to the following: v System-level error logs v the operating system error log v As an entry in the service action event log in the Service Focal Point application. For example, the loss of surveillance from the HMC to a logical partition.Continue to the following appropriate system error interrogation section. Accessing System Log Error Information Note: The system attention LED can be reset by following the procedures described in “Resetting the System Attention LED” on page 22. When an error or potential problem is detected by the system, the attention LED on the operator panel is turned on. Information about the error or potential problem is stored in error logs. The following sections explain how to access the logs that are available on a system. The error logs are accessed from the console that is used to manage the system. From the following console descriptions, go to the instructions under the heading that best describes the system on which you are working: v “Accessing Errors when a System is Attached to an HMC.” v “Accessing Errors when a System is Running AIX and the Console is Not an HMC” on page 22. v “Accessing Errors when a System is Running Linux and the Console is Not an HMC” on page 22.Accessing Errors when a System is Attached to an HMC If the system attention LED comes on and the system is attached to an HMC for its console, do the following to access the error logs: 1. Open the Service Action Event Log in the Service Focal Point application on the HMC. Chapter 1. Reference Information 21 2. Check the open service events using the procedure described in ?Working with Serviceable Events? in the Hardware Management Console Installation and Operations Guide.Note: For information on these Service Focal Point settings, see ?Setting Up Surveillance and Enabling Surveillance Notifications? in the Hardware Management Console Installation and Operations Guide. Events requiring customer intervention are marked Call Home Candidate? NO. For each of these events, examine the description in the serviceable event error details. If actions are listed in the description, perform those actions. If the error indicates a loss of surveillance between the HMC and a partition, check the status of the partition, the network, and the cabling between the HMC and the partition. Correct any problems found. If a surveillance problem is still indicated, call service support. If the service action event is labeled Call Home Candidate? YES, an error code is supplied to identify the problem. If the system is configured to automatically call home on error, the request for service is placed. If the system is not configured to automatically call home, you should place a call to service support. If the system attention LED comes on and an HMC is used to manage the system, do one of the following: v If you performed “Step 1321-43” on page 182 in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, return to “Step 1321-43” on page 182 to process any other error codes or symptoms. v If you have not performed any of the steps in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, go to step “Step 1321-1” on page 170 and perform the procedures in the MAP.Accessing Errors when a System is Running AIX and the Console is Not an HMC If the system attention LED comes on and the system is running AIX and the console is not an HMC, do the following to access the error logs: 1. If the system is operational, examine the AIX error log to see if user intervention is required. 2. If the system is powered off, examine the service processor error log for entries. 3. If the AIX error log indicates service support is required, or there are entries in the service processor error log, go to “Entry MAP” on page 149 with the information noted from the error log.Accessing Errors when a System is Running Linux and the Console is Not an HMC If the system attention LED comes on and the system is running Linux and is not attached to an HMC, do one of the following: v If you performed “Step 1321-43” on page 182 in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, return to “Step 1321-43” on page 182 to process any other error codes or symptoms. v If you have not performed any of the steps in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, go to step “Step 1321-1” on page 170 and perform the procedures in the MAP.Resetting the System Attention LED Choose one of the following procedures to reset your attention LED. Resetting the LED when a System is Attached To an HMC To reset the system attention LED on HMC-attached systems, do the following: 1. On the HMC graphical interface, click Service Applications. 2. Double-click Service Focal Point. 3. In the Contents area of the screen, select Hardware Service Functions. The LED Management window opens. 22 406/90 Service Guide 4. In the LED Management window, select one or more managed systems from the table. 5. Select Deactivate LED. The associated system attention LED is turned off.For more information about the virtual operator panel on the HMC, see the Hardware Management Console Installation and Operations Guide. Resetting the LED when a System is Running AIX and the Console is not an HMC As a user with root authority, type diag on the AIX command line, and do the following: 1. Select Task Selection. 2. On the Task Selection Menu, select Identify and Attention Indicators. 3. When the list of LEDs displays, use the cursor to highlight Set System Attention Indicator to Normal. 4. Press Enter, and then press F7 to commit. This action turns off the LED.If the system is powered off, access the service processor menus. From the service processor main menu, do the following: 1. Select System Information Menu. 2. Select LED Control Menu. 3. Select Clear System Attention Indicator. This action turns off the LED.Resetting the LED when a System is Running Linux and the Console is not an HMC The system must be powered off to access the service processor menus. From the service processor main menu, do the following: 1. Select System Information Menu. 2. Select LED Control Menu. 3. Select Clear System Attention Indicator. This action turns off the LED. Chapter 1. Reference Information 23 406/1D I/O Subsystem Locations This section shows the locations for the I/O subsystem. A front view and a rear view with descriptions and locations for each part of the I/O subsystem are shown. 406/1D I/O Subsystem Front View 111123668557479981010111312141516 1 DASD 4-Pack 9 Drawer Fault/Identify LED (Amber) 2 DASD Disk Drive 10 Power Good Out LED (Green) 3 Disk Activity LED (Green) 11 Fan Fault LED (Amber)Ux.y-F1 4 Disk Drive Fault/Identify LED (Amber) 12 Fan Fault LED (Amber)Ux.y-F3 5 BPD 1 (Front) Cable In 13 Fan Fault LED (Amber)Ux.y-F2 6 BPD 2 (Rear) Cable In 14 Fan Fault LED (Amber)Ux.y-F4 7 Power Good In LED (Green) 15 I/O Subsystem Left DCAUx.y-V1 8 DCA Fault LED (Amber) 16 I/O Subsystem Right DCAUx.y-V2 Note: The fan fault LEDs shown in the preceding illustration indicate if an internal fan has a fault. The fan assemblies inside are positioned from left to right behind the DASD 4-packs in order of fan assembly 1, fan assembly 2, fan assembly 3, and fan assembly 4. 24 406/90 Service Guide 406/1D I/O Subsystem Rear View 1399811111212456271010 1 I/O port connector 1 (Ux.y-P1/Q2) 7 I/O Card Power On LED (Green) 2 I/O port connector 0 (Ux.y-P1/Q1) 8 I/O Adapter Fault/Identify LED (Amber/Bottom) 3 Media subsystem power connector (Ux.y-P1-V1/Q3) 9 Auxiliary Power Good (Green) 4 I/O port connector 1 (Ux.y-P2/Q2) 10 I/O Subsystem Backplane Fault (Amber) 5 I/O port connector 0 (Ux.y-P2/Q1) 11 I/O Subsystem Backplane Power On (Green) 6 Media subsystem power connector (Ux.y-P2-V1/Q3) 12 I/O LED (Currently Unused) Chapter 1. Reference Information 25 I/O Subsystem DASD Locations The following figures show the DASD drive positions in an I/O subsystem.Note: The SCSI IDs are set when the drive is installed into the DASD 4-pack. 406/1D I/O Subsystem DASD Locations 21368574910111312141516 1 Ux.y-P2/Z2-A8 9 Ux.y-P1/Z2-A8 2 Ux.y-P2/Z2-A9 10 Ux.y-P1/Z2-A9 3 Ux.y-P2/Z2-Aa 11 Ux.y-P1/Z2-Aa 4 Ux.y-P2/Z2-Ab 12 Ux.y-P1/Z2-Ab 5 Ux.y-P2/Z1-A8 13 Ux.y-P1/Z1-A8 6 Ux.y-P2/Z1-A9 14 Ux.y-P1/Z1-A9 7 Ux.y-P2/Z1-Aa 15 Ux.y-P1/Z1-Aa 8 Ux.y-P2/Z1-Ab 16 Ux.y-P1/Z1-Ab Notes: 1. In the preceding table, x is equal to the rack number and y is equal to the I/O subsystem position in the rack. The SCSI ID for each drive is set when the drive is installed. Each I/O subsystem can have up to four DASD 4-packs installed. The SCSI IDs for the drives in each 4-pack are 8, 9, a, and b, from left to right. 2. The DASD attachment locations are as follows: v DASD 1, 2, 3, 4 are attached to Ux.y-P3 DASD backplane. v DASD 5, 6, 7, 8 are attached to Ux.y-P4 DASD backplane. v DASD 9, 10, 11, 12 are attached to Ux.y-P5 DASD backplane. v DASD 13, 14, 15, 16 are attached to Ux.y-P6 DASD backplane. 26 406/90 Service Guide I/O Subsystem PCI and PCI-X PHB and PCI and PCI-X Slot Locations I/O and Media Subsystem Connections RIO BusPCI- PCIBridgeRIO-PCIBridgePCI- PCIBridgePCI Slot - 64 bit 66/33mhz 3.3VPCI Slot - 64 bit 33mhz 5.0VPCI- PCIBridgeUltra3SCSIUltra3SCSI3210 slots10 slotsMedia SubsystemPowerSCSI4 DASDBays4 DASDBaysSESSESI/OSubsystemRearMedia Subsystem FrontI/O PlanarInternal SCSIControllers Chapter 1. Reference Information 27 I/O Subsystem Internal SCSI Distribution RearP1P2PPRPPISISPPRPPISISDASD4 packDASD4 packDASD4 packDASD4 packP=PCI-PCI bridgeRP=RIO-PCI bridgeIS=Internal SCSI Ultra 3 I/O Subsystem Power Distribution RearP1P2PPRPPISISPPRPPISISDCA 1Power SupplyDCA 2Power SupplyMidplaneBPABPAFan 2Fan 3Fan 1Fan 4P=PCI-PCI BridgeRP=RIO-PCI BridgeIS=Internal SCSI Ultra 3 28 406/90 Service Guide Power Subsystem Locations Note: Use the table located in “I/O Subsystem Power Distribution Cabling” on page 33 to identify the connector ids for BPD 2 and BPD 3.1235781191064I/O-1AJ00I/O-5AJ00I/O-5AJ00DCA-4J08DCA-5J09I/O-8BJ09I/O-8BJ09I/O-3AJ04I/O-2AJ02I/O-1BJ01J01J01J02J02J03J03J04J04J05J05J06J06J07J07J08J08I/O-4BJ07I/O-2BJ03DCA-1J08DCA-3J10DCA-2J09MDA-3J06MDA-1J04BPFJ03J00BPC-BPCSYSTEMJ01P02MDA-4J07MDA-2J05I/O-3BJ05BPD(1)BPD(2)BPD(3)BPCI/O-4AJ06J02EPOUPSooooooooUEPO Note: BPD (2) connectors J04 and J05 (I/O-4A/J04 and I/O-4B/J05 in the preceding figure) are for I/O subsystem 4 when the IBF is installed. 1 BPD Power Good LED 6 ac Power In Connector 2 Bulk Power Distributor (BPD) 7 Connectors to IBF 3 Bulk Power Controller (BPC) 8 UEPO Power (PWR) LED (Top LED)UEPO Power Complete (CMPLT) LED 4 Bulk Power Fan (BPF) 9 BPC Power Good LED 5 Bulk Power Regulators (BPR) 10 BPD for 32-way 1.7 GHz MCM systems 11 System Power (SP) LED(Top LED) Standby Power Note: For 32-way 1.7 GHz MCM systems a third BPD was added to supply the additional power needed by the CEC. To get the correct connector location numbers, refer to the table located in “Power Distribution Cabling” on page 30. Chapter 1. Reference Information 29 Power Distribution Cabling Power to the I/O subsystems, DCAs, Fans, and IBFs is connected from the bulk power distribution connectors to each subsystem through cables. The cable connectors are labeled on each end. The following describes the cables that connect from the BPC BPDs, and BPRs to the DCAs, fans, power supplies, I/O subsystems, and integrated battery features. 30 406/90 Service Guide Key From To Description Label Color 1 Bulk Power BPE A Side Bulk power enclosure on the A side of the system 2 Bulk Power BPE B Side Bulk power enclosure on the B side of the system 3 A35-BPC-J03 A35-P1FAN (BPF-A) A35-BPC-P03 to Bulk Power Fan P1-F1 Red with white stripe 4 B35-BPC-J03 B35-P2FAN (BPF-B) B35-BPC-P03 to Bulk Power Fan P2-F1 Red 5 B35-BPC-J08 DCA-1 J01 B35-BPC-P08 to DCA-1-P01 Red 6 A35-BPC-J08 DCA-1 J00 A35-BPC-P08 to DCA-1-P00 Red with white stripe 7 B35-BPC-J09 DCA-2 J01 B35-BPC-P09 to DCA-2-P01 Red 8 A35-BPC-J09 DCA-2 J00 A35-BPC-P09 to DCA-2-P00 Red with white stripe 9 B35-BPC-J10 DCA-3 J01 B35-BPC-P10 to DCA-3-P01 Red 10 A35-BPC-J10 DCA-3 J00 A35-BPC-P10 to DCA-3-P00 Red with white stripe 11 B35-BPD-J08 DCA-4 J01 B35-BPD1-P08 to DCA-4-P01 Red 12 A35-BPD-J08 DCA-4 J00 A35-BPD1-P08 to DCA-4-P00 Red with white stripe 13 B35-BPD-J09 DCA-5 J01 B35-BPD1-P09 to DCA-5-P01 Red 14 A35-BPD-J09 DCA-5 J00 A35-BPD1-P09 to DCA-5-P00 Red with white stripe 14a A35-BPD-J06 DCA-6-J00* A35-BPD1-P06 to DCA-6-P01 (32-way, 1.7 GHz) Red with white stripe 14b A35-BPD-J06 DCA-6-J01* A35-BPD1-P06 to DCA-6-P00(32-way, 1.7 GHz) Red 15, 17 B17-Media-J02 I/O Subsystem 1B-J01 I/OB9-P01 to Media P02 16, 18 B17-Media-J00 I/O Subsystem 1B-J00 I/OB9-P00 to Media P00 19, 20, 21, 22 B35-BPDx-JxxSee “I/O Subsystem Power Distribution Cabling” on page 33 for I/O subsystem power cabling. I/O SubsystemSee “I/O Subsystem Power Distribution Cabling” on page 33 for I/O subsystem power cabling. B35-BPDx-P0x to I/O Subsystem #x 23 B35-BPC-J05 A09-MDA2-J02 A35-BPC-P05 to (AMD2-P01) J02 Red 24 A35-BPC-J05 A09-MDA2-J01 B35-BPC-P05 to (AMD2-P00) J01 Red with white stripe 25 B35-BPC-J04 A09-MDA1-J02 A35-BPC-P04 to (AMD1-P01) J02 Red Chapter 1. Reference Information 31 Key From To Description Label Color 26 A35-BPC-J04 A09-MDA1-J01 A35-BPC-P04 to (AMD1-P00) J01 Red with white stripe 27 A35-BPC-J06 A18-MDA3-J01 A35-BPC-P06 to (AMD3-P00) J01 Red with white stripe 28 B35-BPC-J06 A18-MDA3-J02 B35-BPC-P06 to (AMD3-P01) J02 Red 29 B35-BPC-J07 A18-MDA4-J02 B35-BPC-P07 to (AMD4-P00) J02 Red 30 A35-BPC-J07 A18-MDA4-J01 B35-BPC-P07 to (AMD4-P01) J01 Red with white stripe 31 B35-BPC-J02 UEPO Switch J01 B35-BPC-P02 to UEPO J01 Red 32 A35-BPC-J02 UEPO Switch J00 A35-BPC-P02 to UEPO J00 Red with white stripe 33 A35-BPR-J00 A13-IBF1-J01 A35-BPR-P00 to A13-IBF1-P00 B35-BPR-J00 B13-IBF1-J01 B35-BPR-P00 to B13-IBF1-P00 34 A35-BPR-J00 A13-IBF2-J01 A35-BPR-P00 to A13-IBF2-P00 B35-BPR-J00 B13-IBF2-J01 B35-BPR-P00 to B13-IBF2-P00 35 A35-BPR-J00 A17-IBF3-J01(Secondary Frame) A35-BPR-P00 to A-IBF3-P00 (Secondary Frame) B35-BPR-J00 B17-IBF3-J01(Secondary Frame) B35-BPR-P00 to B-IBF3-P00 (Secondary Frame) Note: * DCA 6 is populated only for the 32-way 1.7 GHz MCMs. When DCA 6 is installed, BPD ports A35-BPD1-P07 and B35-BPD1-P07 are not used. Attention: For systems configured with the 1.5 GHz MCM, the total number of CEC DCAs installed must be one more than the number of 1.5 GHz MCMs installed. For systems configured with the 1.7 GHz MCMs, the total number of CEC DCAs installed must be two more than the number of 1.7 GHz MCMs installed. 32 406/90 Service Guide I/O Subsystem Power Distribution Cabling Power to the I/O subsystems is connected from the bulk power distribution connectors to each subsystem through cables. The cable connectors are labeled on each end. The following table describes the cables that connect from the BPDs to the I/O Subsystem DCAs (power supplies). From To(A=Primary)(Z=Secondary) Description Label Color A35B-BPD1A-P00 A09B-DCA1-P01 BPD1A to I/O Subsystem #1 Purple With Stripes A35P-BPD1B-P00 A09B-DCA1-P00 BPD1B to I/O Subsystem #1 Purple A35B-BPD1A-P01 A09B-DCA2-P01 BPD1A to I/O Subsystem #1 Purple With Stripes A35P-BPD1B-P01 A09B-DCA2-P00 BPD1B to I/O Subsystem #1 Purple A35B-BPD1A-P02 A05B-DCA1-P01 BPD1A to I/O Subsystem #2 Yellow With Stripes A35P-BPD1B-P02 A05B-DCA1-P00 BPD1B to I/O Subsystem #2 Yellow A35B-BPD1A-P03 A05B-DCA2-P01 BPD1A to I/O Subsystem #2 Yellow With Stripes A35P-BPD1B-P03 A05B-DCA2-P00 BPD1B to I/O Subsystem #2 Yellow A35B-BPD1A-P04 A01B-DCA1-P01 BPD1A to I/O Subsystem #3 Green With Stripes A35P-BPD1B-P04 A01B-DCA1-P00 BPD1B to I/O Subsystem #3 Green A35B-BPD1A-P05 A01B-DCA2-P01 BPD1A to I/O Subsystem #3 Green With Stripes A35P-BPD1B-P05 A01B-DCA2-P00 BPD1B to I/O Subsystem #3 Green The following 4 rows apply to systems that are not configured with 32-way with 1.7 GHz MCMs. A35B-BPD1A-P06 A13B-DCA1-P01 BPD1A to I/O Subsystem #4 (no IBF) Blue With Stripes A35P-BPD1B-P06 A13B-DCA1-P00 BPD1B to I/O Subsystem #4 (no IBF) Blue A35B-BPD1A-P07 A13B-DCA2-P01 BPD1A to I/O Subsystem #4 (no IBF) Blue With Stripes A35P-BPD1B-P07 A13B-DCA2-P00 BPD1B to I/O Subsystem #4 (no IBF) Blue The following 4 rows apply to systems that are configured with 32-way with 1.7 GHz MCMs. A35B-BPD1A-P06 Goes to CEC DCA-6 (see item 14b in the previous table) BPD1A to CEC DCA-6 (see item 14b in the previous table) Blue With Stripes A35P-BPD1B-P06 Goes to CEC DCA-6 (see item 14a in the previous table) BPD1B to CEC DCA-6 (see item 14b in the previous table) Blue A35B-BPD1A-P07 No Connection Chapter 1. Reference Information 33 From To(A=Primary)(Z=Secondary) Description Label Color A35P-BPD1B-P07 No Connection A35B-BPD2A-P04 Z09B-DCA1-P01 BPD1A to I/O Subsystem #4 (with IBF) Yellow With Stripes A35P-BPD2B-P04 Z09B-DCA1-P00 BPD1B to I/O Subsystem #4 (with IBF) Yellow A35B-BPD2A-P05 Z09B-DCA2-P01 BPD1A to I/O Subsystem #4 (with IBF) Yellow With Stripes A35P-BPD2B-P05 Z09B-DCA2-P00 BPD1B to I/O Subsystem #4 (with IBF) Yellow A35B-BPD2A-P00 Z01B-DCA1-P01 BPD2A to I/O Subsystem #5 Red With Stripes A35P-BPD2B-P00 Z01B-DCA1-P00 BPD2B to I/O Subsystem #5 Red A35B-BPD2A-P01 Z01B-DCA2-P01 BPD2A to I/O Subsystem #5 Red With Stripes A35P-BPD2B-P01 Z01B-DCA2-P00 BPD2B to I/O Subsystem #5 Red A35B-BPD2A-P02 Z05B-DCA1-P01 BPD2A to I/O Subsystem #6 Purple With Stripes A35P-BPD2B-P02 Z05B-DCA1-P00 BPD2B to I/O Subsystem #6 Purple A35B-BPD2A-P03 Z05B-DCA2-P01 BPD2A to I/O Subsystem #6 Purple With Stripes A35P-BPD2B-P03 Z05B-DCA2-P00 BPD2B to I/O Subsystem #6 Purple A35B-BPD2A-P06 Z13B-DCA1-P01 BPD2A to I/O Subsystem #7 Yellow With Stripes A35P-BPD2B-P06 Z13B-DCA1-P00 BPD2B to I/O Subsystem #7 Yellow A35B-BPD2A-P07 Z13B-DCA2-P01 BPD2A to I/O Subsystem #7 Yellow With Stripes A35P-BPD2B-P07 Z13B-DCA2-P00 BPD2B to I/O Subsystem #7 Yellow The following 4 rows apply to systems that are not configured with 32-way with 1.7 GHz MCMs. A35B-BPD2A-P08 Z19B-DCA1-P01 BPD2A to I/O Subsystem #8 Blue With Stripes A35P-BPD2B-P08 Z19B-DCA1-P00 BPD2B to I/O Subsystem #8 Blue A35B-BPD2A-P09 Z19B-DCA2-P01 BPD2A to I/O Subsystem #8 Blue With Stripes A35P-BPD2B-P09 Z19B-DCA2-P00 BPD2B to I/O Subsystem #8 Blue The following 8 rows apply to systems that are configured with 32-way with 1.7 GHz MCMs. A35B-BPD2A-P08 A13B-DCA1-P01 BPD2A to I/O Subsystem #4 (no IBF) Blue With Stripes 34 406/90 Service Guide From To(A=Primary)(Z=Secondary) Description Label Color A35P-BPD2B-P08 A13B-DCA1-P00 BPD2B to I/O Subsystem #4 (no IBF) Blue A35B-BPD2A-P09 A13B-DCA2-P01 BPD2A to I/O Subsystem #4 (no IBF) Blue With Stripes A35P-BPD2B-P09 A13B-DCA2-P00 BPD2B to I/O Subsystem #4 (no IBF) Blue A35B-BPD3A-P08 Z19B-DCA1-P01 BPD3A to I/O Subsystem #8 Blue With Stripes A35P-BPD3B-P08 Z19B-DCA1-P00 BPD3B to I/O Subsystem #8 Blue A35B-BPD3A-P09 Z19B-DCA2-P01 BPD3A to I/O Subsystem #8 Blue With Stripes A35P-BPD3B-P09 Z19B-DCA2-P00 BPD3B to I/O Subsystem #8 Blue Integrated Battery Feature Locations 1 1 Connector to BPR Chapter 1. Reference Information 35 Subsystem Positioning and Cabling This section describes subsystem positioning and cabling for the 406/90. Operator Panel Cable and Diskette Drive Cable to the Media Subsystem (Rear View) 1 Primary I/O Book 3 Diskette Drive Data Cable 2 Cable Connecting the Operator Panel to the Media Subsystem 4 Media Subsystem 36 406/90 Service Guide I/O Subsystem Positions, Fully Populated Primary and Secondary Rack (Rear View without IBF) C1 Primary Rack 3 Third I/O Subsystem C2 Secondary Rack 4 Fourth I/O Subsystem C3 Blank Space 5 Fifth I/O Subsystem C4 Blank Space 6 Sixth I/O Subsystem 1 First I/O Subsystem 7 Seventh I/O Subsystem 2 Second I/O Subsystem 8 Eighth I/O Subsystem Chapter 1. Reference Information 37 I/O Subsystem Positions, Fully Populated Primary and Secondary Rack (Rear View with IBFs) C1 Primary Rack 4B Four IBFs (primary rack), Position 4 C2 Primary I/O Book (Book 0) 5 Fifth I/O Subsystem C3 Secondary Rack 6 Sixth I/O Subsystem 1 First I/O Subsystem 7 Seventh I/O Subsystem 2 Second I/O Subsystem 2B Two IBFs (secondary rack) 3 Third I/O Subsystem 8 Eighth I/O Subsystem 4 Fourth I/O Subsystem 38 406/90 Service Guide Placement of Subsystem 4 in a Two-Rack Configuration (Rear View with IBF) C1 Primary Rack 3 Third I/O Subsystem C2 Secondary Rack 4 Fourth I/O Subsystem 1 First I/O Subsystem 4B Four IBFs 2 Second I/O Subsystem Note: The physical location shown for I/O Subsystem 4 is populated only when IBFs are present and four or more I/O subsystems are installed. Chapter 1. Reference Information 39 RIO and RIO-2 Cabling Rules If the CEC RIO ports are standard RIO, thumbscrew-style connectors, then connect a two-planar I/O loop to the RIO ports of the CEC in the following order: 1. Primary I/O Book Slot 0 (1, 2, 3, or 4 MCMs): RIO 1 A0&A1, RIO 2 B0&B1 2. Secondary I/O Book Slot 2: v 2 MCMs: RIO 3 C0&C1, RIO 4 D0&D1 v 3 or 4 MCMs: RIO 3 C0&C1, RIO 4 D0&D1, RIO 5 A0&A1, RIO 6 B0&B13. Secondary I/O Book Slot 3 (4 MCMs): RIO 7 C0&C1, RIO 8 D0&D1Notes: 1. Secondary I/O Book Slot 1 is not allowed with CEC standard RIO connections. 2. One I/O planar loops are not allowed with CEC standard RIO connections.If the CEC RIO ports are gigabyte RIO, or RIO-2, connectors, then connect either a one I/O planar loop or a two planar I/O loop to the first 16 RIO ports of the CEC in the following order: 1. Primary I/O Book Slot 0 (1, 2, 3, or 4 MCMs): RIO 1 A0&A1, RIO 2 B0&B1 2. Secondary I/O Book Slot 2: v 2 MCMs: RIO 3 C0&C1, RIO 4 D0&D1 v 3 or 4 MCMs: RIO 3 C0&C1, RIO 4 D0&D1, RIO 5 A0&A1, RIO 6 B0&B13. Secondary I/O Book Slot 3: v 2 MCMs: RIO 5 A0&A1, RIO 6 B0&B1 v 3 or 4 MCMs: RIO 7 A0&A1, RIO 8 B0&B1Although two I/O planar loops are not permitted on the last 12 RIO ports of the CEC, you can connect additional one I/O planar loops to the next 12 RIO ports of the CEC in the following order: 1. Secondary I/O Book Slot 3 (4 MCMs): RIO 9 C0&C1, RIO 10 D0&D1 2. Secondary I/O Book Slot 1: v 3 MCMs: RIO 9 C0&C1, RIO 10 D0&D1, RIO 11 A0&A1, RIO 12 B0&B1 v 4 MCMs: RIO 11 C0&C1, RIO 12 D0&D1, RIO 13 A0&A1, RIO 14 B0&B1Note: Use required pairs of RIO ports without omitting any. The following RIO port speeds are available: v Standard RIO uses a thumbscrew retention physical connector and runs at 500 MB per second. v Gigabyte RIO, or RIO-2, uses a bayonette retention physical connector and runs at 1 GB per second.Two I/O planars are available, one for RIO and one for RIO-2, that may be placed in either side of an I/O subsystem. They can be identified by their RIO connector style. Rules for connecting RIO and RIO-2 CEC ports to I/O subsystem RIO and RIO-2 ports are the following: v CEC RIO ports must not be connected to I/O subsystem RIO-2 ports, even with the available RIO-2 to RIO cables. v RIO-2 to RIO cables must only be used to connect CEC RIO-2 ports to I/O subsystem RIO ports. v RIO to RIO cables must only be used to connect CEC RIO ports to I/O subsystem RIO ports. v RIO-2 to RIO-2 cables must only be used to connect CEC RIO-2 ports to I/O subsystem RIO-2 ports.Rules for connecting RIO and RIO-2 I/O planars to the CEC RIO and RIO-2 ports are the following: v CEC RIO ports must be connected to 2 RIO planars per loop. Up to 8 I/O subsystems can be connected. 40 406/90 Service Guide v CEC RIO-2 ports can be connected to RIO and RIO-2 I/O planars with appropriate cables in one planar loop or two planar loops in the following order: 1. Cable any I/O subsystems with RIO I/O planars in both the left and the right positions with 2 I/O planar loop connections. The maximum data rate for each RIO I/O planar is 500 MB per second. 2. Cable any I/O subsystems with RIO-2 I/O planars in both the left and right positions with 2 I/O planar loop connections. The maximum data rate for each RIO-2 I/O planar is 1 GB per second. 3. Cable any I/O subsystems with a RIO I/O planar in one side and a RIO-2 I/O planar in the other side with 1 I/O planar loop connection. The maximum data rate for the RIO I/O planar is 500 MB per second, and for the RIO-2 I/O planar, the maximum data rate is 2 GB per second. However, in the event of a cable failure, there is no redundant path for the 2 GB per second RIO-2 I/O planar. Restoring the redundant path for the RIO-2 planar reduces the data rate to 1 GB per second. I/O subsystems with a RIO I/O planar in one side and a RIO-2 I/O planar in the other side with 2 I/O planar loop connections are not allowed. 4. Cable any I/O subsystems with RIO-2 I/O planars in both the left and the right positions with one I/O planar loop connections. The maximum data rate for each RIO-2 I/O planar is 1 GB per second with data path redundancy, or 2 GB per second without data path redundancy.Note: If all I/O planars are connected to the CEC with one I/O planar loops, the maximum number of I/O subsystems connected to the system is reduced to seven. Chapter 1. Reference Information 41 I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.A0C0A1C1B0000000001114356711111B1D1D012 1 Primary Rack 5 First I/O Subsystem 2 Primary I/O Book 6 Fourth I/O Subsystem 3 Third I/O Subsystem 7 Secondary I/O Book 4 Second I/O Subsystem 42 406/90 Service Guide I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.ShownWithoutIBFs C1 Primary Rack 5 Fifth I/O Subsystem C2 Secondary Rack 6 Sixth I/O Subsystem C3 Secondary I/O Book Note: RIO cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs. Chapter 1. Reference Information 43 I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections. C1 Primary Rack 4 Fourth I/O Subsystem C2 Secondary Rack 4B Two IBFs (maximum), Position 4 C3 Third I/O Book (Book 3) 5 Fifth I/O Subsystem 1 First I/O Subsystem 6 Sixth I/O Subsystem 2 Second I/O Subsystem 7 Seventh I/O Subsystem 3 Third I/O Subsystem 8 Eighth I/O Subsystem Note: RIO cabling to I/O subsystems 7 and 8 is the same for configurations with or without the IBFs. 44 406/90 Service Guide I/O Subsystems (1 through 3 with IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.A0C0A1C1B00000001143561111B112 1 Primary Rack A0 Connection to I/O Book (Book 0) 2 Primary I/O Book A1 Connection to I/O Book (Book 0) 3 Third I/O Subsystem B0 Connection to I/O Book (Book 0) 4 Second I/O Subsystem B1 Connection to I/O Book (Book 0) 5 First I/O Subsystem C0 Connection to I/O Book (Book 2) 6 Secondary I/O Book C1 Connection to I/O Book (Book 2) Chapter 1. Reference Information 45 I/O Subsystems (4 through 6 with IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO, or thumbscrew retention, connections.C1C2C353214B4B6478B0A0A1D1D0B1 C1 Secondary Rack 6 Sixth I/O Subsystem C2 Primary Rack 7 Seventh I/O Subsystem C3 Secondary I/O Book in I/O Slot 3 8 Eighth I/O Subsystem C4 Secondary I/O Book in Slot 2 A0 Connection to I/O Book (Book 2) 1 First I/O Subsystem A1 Connection to I/O Book (Book 2) 2 Second I/O Subsystem B0 Connection to I/O Book (Book 2) 3 Third I/O Subsystem B1 Connection to I/O Book (Book 2) 4 Fourth I/O Subsystem D0 Connection to I/O Book (Book 2) 4B Integrated Battery Feature (IBF) D1 Connection to I/O Book (Book 2) 5 Fifth I/O Subsystem 46 406/90 Service Guide I/O Subsystems (1 Through 4 without IBFs) RIO-2 Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.A0C0A1C1B0000000001114356711111B1D1D012 1 Primary Rack 5 First I/O Subsystem 2 Primary I/O Book 6 Fourth I/O Subsystem 3 Third I/O Subsystem 7 Secondary I/O Book 4 Second I/O Subsystem Chapter 1. Reference Information 47 I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 2 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. C1 Primary Rack 5 Fifth I/O Subsystem C2 Secondary Rack 6 Sixth I/O Subsystem C3 Secondary I/O Book Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs. 48 406/90 Service Guide I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections.ShownWithoutIBFs C1 Primary Rack 5 Fifth I/O Subsystem C2 Secondary Rack 6 Sixth I/O Subsystem C3 Secondary I/O Book Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs. Chapter 1. Reference Information 49 I/O Subsystems (7 and 8) RIO-2 Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention connections. C1 Primary Rack 5 Fifth I/O Subsystem C2 Secondary Rack 6 Sixth I/O Subsystem C3 Third I/O Book (Book 3) 7 Seventh I/O Subsystem 1 First /O Subsystem 8 Eighth I/O Subsystem 2 Second Subsystem A0 Connection to I/O Book (Book 3) 3 Third I/O Subsystem A1 Connection to I/O Book (Book 3) 4 Fourth I/O Subsystem B0 Connection to I/O Book (Book 3) 4B Two IBFs (maximum), Position 4 B1 Connection to I/O Book (Book 3) 50 406/90 Service Guide I/O Subsystems (1 Through 4 without IBFs) RIO-2 Cabling to I/O Books, One I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. 1 Primary Rack 5 First I/O Subsystem 2 Primary I/O Book 6 Fourth I/O Subsystem 3 Third I/O Subsystem 7 Secondary I/O Book 4 Second I/O Subsystem Chapter 1. Reference Information 51 I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, One I/O Planar Loop Configuration, 3 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. C1 Primary Rack 5 Fifth I/O Subsystem C2 Secondary Rack 6 Sixth I/O Subsystem C3 Secondary I/O Book Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs. 52 406/90 Service Guide I/O Subsystems (5 and 6) RIO-2 Cabling to the Secondary I/O Book, One I/O Planar Loop Configuration, 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. C1 Primary Rack 5 Fifth I/O Subsystem C2 Secondary Rack 6 Sixth I/O Subsystem C3 Secondary I/O Book Note: RIO-2 cabling to I/O Subsystems 5 and 6 is the same for configurations with or without the IBFs. Chapter 1. Reference Information 53 I/O Subsystems (7) RIO-2 Cabling to the Fourth I/O Book, One I/O Planar Loop Configuration, 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. C1 Primary Rack 4 Fourth I/O Subsystem C2 Secondary Rack 4B Two IBFs (maximum), Position 4 C3 Third I/O Book (Book 3) 5 Fifth I/O Subsystem 1 First I/O Subsystem 6 Sixth I/O Subsystem 2 Second I/O Subsystem 7 Seventh I/O Subsystem 3 Third I/O Subsystem 8 Eighth I/O Subsystem Notes: 1. RIO-2 cabling to I/O subsystem 7 is the same for configurations with or without the IBFs. 2. The maximum number of I/O subsystems allowed is reduced to seven when all I/O planars are connected to their own RIO-2 port pairs. 54 406/90 Service Guide I/O Subsystems (1 through 3 with IBFs) RIO-2 Cabling to I/O Books, One I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. 1 Primary Rack A0 Connection to I/O Book (Book 0) 2 Primary I/O Book A1 Connection to I/O Book (Book 0) 3 Third I/O Subsystem B0 Connection to I/O Book (Book 0) 4 Second I/O Subsystem B1 Connection to I/O Book (Book 0) 5 First I/O Subsystem C0 Connection to I/O Book (Book 2) 6 Secondary I/O Book C1 Connection to I/O Book (Book 2) Chapter 1. Reference Information 55 I/O Subsystems (1 through 3 with IBFs) RIO-2 Cabling to I/O Books, One I/O Planar Loop Configuration, 2 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. 1 Primary Rack A0 Connection to I/O Book (Book 0) 2 Primary I/O Book A1 Connection to I/O Book (Book 0) 3 Third I/O Subsystem B0 Connection to I/O Book (Book 0) 4 Second I/O Subsystem B1 Connection to I/O Book (Book 0) 5 First I/O Subsystem C0 Connection to I/O Book (Book 2) 6 Secondary I/O Book C1 Connection to I/O Book (Book 2) 56 406/90 Service Guide I/O Subsystem (4 with IBFs Installed) RIO-2 Cabling to I/O Book 3, One I/O Planar Loop Configuration, 3 or 4 MCMs Only Note: The following figure applies to CECs with RIO-2, or bayonette retention, connections. C1 Secondary Rack 6 Sixth I/O Subsystem C2 Primary Rack 7 Seventh I/O Subsystem C3 Secondary I/O Book in I/O Slot 3 8 Eighth I/O Subsystem C4 Secondary I/O Book in Slot 2 A0 Connection to I/O Book (Book 2) 1 First I/O Subsystem A1 Connection to I/O Book (Book 2) 2 Second I/O Subsystem B0 Connection to I/O Book (Book 2) 3 Third I/O Subsystem B1 Connection to I/O Book (Book 2) 4 Fourth I/O Subsystem D0 Connection to I/O Book (Book 2) 4B Integrated Battery Feature (IBF) D1 Connection to I/O Book (Book 2) 5 Fifth I/O Subsystem Chapter 1. Reference Information 57 Power and SCSI Cables to the Media Subsystem 1235476 1 Primary Rack 5 SCSI Cable for Media Subsystem Rear Drives 2 Primary I/O Book 6 SCSI Cable for Media Subsystem Front Drives 3 Power Cable for Media Subsystem Front Drives 7 Power Cable for Media Subsystem Rear Drives 4 Rear of Media Subsystem 58 406/90 Service Guide Hardware Management Console (HMC) to the Primary I/O Book To connect your HMC to the 406/90, connect the serial cable into serial port 1 located on the back of the HMC. Connect the other end of the serial cable into the HMC1 connector located on the primary I/O book in the managed system. The following illustration shows the location of the serial ports on the back of the HMC and the HMC connectors located on the processor subsystem. 1 Primary Rack 4 Primary HMC Connected to HMC #1 in Primary I/O Book (HMC #1 may be labeled HSC #1) 2 Primary I/O Book 5 Optional Second HMC Connected to HMC #2 in Primary I/O Book (HMC #2 may be labeled HSC #2) 3 Power Plug to External Power Source (wall plug) Optional 8-port or 128 Port Async Adapters Up to two optional async adapters can be installed in the HMC. The adapters have breakout boxes that allow connection to serial cables that connect to the managed system’s HMC 1 serial ports. For information about installing these adapters in the HMC, see the 406/90, order number SA23-1248. Chapter 1. Reference Information 59 System Memory All of the main memory for the 406/90 consists of pluggable memory books in reserved slots with two memory controllers per book. The memory books are available in various sizes. Memory modules are not removable from the memory books. A minimum of 8 GB of memory must be installed for the system to operate. Eight slots, located on the front of the processor subsystem, are available for system memory books. These slots are shown in “406/81 Processor Subsystem Front View” on page 17. Memory to Processor Relationships The following illustration shows the relationship between the memory riser cards, L3 cache, and the MCM modules: MemorySlot0(800-PIN)BusBarMemorySlot7(800-PIN)MemorySlot3(800-PIN)MemorySlot6(800-PIN)MemorySlot4(800-PIN)MemorySlot1(800-PIN)MemorySlot5(800-PIN)MemorySlot2(800-PIN)GXSlot0(680-PIN)GXSlot3(680-PIN)GXSlot1(680-PIN)GXSlot2680-PIN)BusBarBusBarBusBarMCMModule0MCMModule3MCMModule2MCMModule1clockACADBL3L3L3L3L3L3L3L3L3L3L3L3L3L3L3L3CABDABCDABCDOR0OR1OR3OR4OR5OR7OR6OR15OR14OR13OR12OR11OR10OR9OR8OR21301234567891011121514 60 406/90 Service Guide Memory Requirements Note: The configuration of a system that was ordered with memory Capacity Upgrade on Demand (CUoD) features will appear slightly different than the configuration in this discussion. When the system is ordered, the installed memory is positioned according to the following guidelines: v The 406/90 has eight memory slots. Four memory slots use inward-facing memory cards and four use outward-facing memory cards. The inward-facing memory slots are used by the first and second processor module positions, while the outward-facing memory slots support the third and fourth 8-way processor module positions. v A 406/90 system with only one processor MCM position populated has access to only two inward-facing memory slots. v Minimum system memory is 8 GB. Maximum system memory is 512 GB. v Memory is available in 4 GB, 8 GB, 16 GB, 32 GB, and 64 GB inward-facing and outward-facing increments. v The following memory configuration considerations are recommended to ensure the optimum performance is obtained for the system: – Memory should be installed in identical pairs. – All available memory locations should be populated with memory cards. – Memory sizes should be balanced as closely as possible across all populated MCM locations.v The number of active memory book slots in a system is related to the number of populated MCM locations. Two memory book slots are activated with each MCM that is installed. – The first MCM installed activates the first and second inward-facing memory positions. – The second MCM installed activates the third and fourth inward-facing memory positions. – The third MCM installed activates the first and second outward-facing memory positions. – The fourth MCM installed activates the third and fourth outward-facing memory positions.v It is recommended that all systems have access to a 4-mm or 8-mm tape drive for submission of system dump information, if required. This function can be accomplished through locally attached or network-attached devices as appropriate. Chapter 1. Reference Information 61 I/O Subsystem Features I/O subsystems have two redundant power supplies. One of these power supplies is capable of providing the necessary voltages and currents, independent of the other power supply. The left and right power-supply output voltages are connected and monitored by the power distribution board contained in the I/O subsystem. The left and right power supplies are hot-pluggable and may be changed one at a time while the system is operational. Each power supply provides 5V dc, 3.3V dc, 12V dc, and 5V dc standby. I/O to Processor Relationships The following illustration shows the relationship between the I/O books, L3 cache, and the MCM modules: MemorySlot0BusBarMemorySlot7MemorySlot3MemorySlot6MemorySlot4MemorySlot1MemorySlot5MemorySlot2GXSlot0(680-PIN)GXSlot3(680-PIN)GXSlot1(680-PIN)GXSlot2680-PIN)BusBarBusBarBusBarMCMModule0MCMModule3MCMModule2MCMModule1clockACADBL3L3L3L3L3L3L3L3L3L3L3L3L3L3L3L3CABDABCDABCDGX0AGX0CGX3DGX3BGX3AGX3CGX2DGX2BGX0BGX0DGX1CGX1AGX1BGX1DGX2CGX2A3012456789101112131415(0)(1)(3)(2)(5)(4)(7)(6)(9)(11)(10)(13)(NC)(NC)(12)(8) 62 406/90 Service Guide I/O Subsystem Power Supply (DCA) LED Status Use the following table to determine the condition of the I/O subsystem power supply (DCA) based on the status of the power supply LEDs. Status of LED Operator Panel LED Right Power Supply LED Left Power Supply LED Off No power connected No power connected or system power connected, not turned on, power supply detects no faults No power connected or system power connected, not turned on, power supply detects no faults On, fast blinking green, on for one second, off for one second System power connected, power on initiated System power connected, not turned on, power supply detected fault System power connected, not turned on, power supply detected fault On, slow blinking green, on for two seconds, off for two seconds System power connected, not turned on System power connected, not turned on, power supply detected system power control network fault System power connected, not turned on, power supply detected system power control network fault On, steady green System power connected and turned on System power connected and turned on System power connected and turned on Chapter 1. Reference Information 63 Logical and Physical Locations The 406/90 system uses physical location codes in conjunction with AIX location codes to provide mapping of the failing field replaceable units (FRUs). The location codes are produced by the system’s firmware and the operating system. There are no Linux-specific location codes, but physical location codes may appear in the Linux error logs. Physical Location Codes Physical location codes provide a mapping of logical functions in a platform (or expansion sites for logical functions, such as connectors or ports) to their specific locations within the physical structure of the platform. Location Code Format The format for the location code is a string of alphanumeric characters separated by a dash (-), slash (/), pound sign (#) or period (.) character. The base location is all of the information preceding the slash (/) or pound sign (#). The base location identifies a device that is connected to or plugged into the parent. Extended location information follows the slash (/). Extended location information identifies a device that is part of the parent, a connector, or a cable. Cable information follows the pound sign (#). Cable information identifies a cable that is connected to parent. The following are examples: v P1-C1 identifies a processor card C1 (MCM) plugged into planar (backplane) P1. v P1-M1 identifies a memory card (book) M1 plugged into planar (backplane) P1. v P1-H2/S1 identifies serial port 1 controller on planar (backplane) P1, the connector for serial port 1, the cable attached to serial port 1 would be represented by a # sign after the S1. v P1-I2/E3 identifies a Ethernet controller 3 on the card in slot 2 (I2) on planar (backplane) P1, the connector for Ethernet controller 3, or the cable attached to Ethernet controller 3. v P1-I2/E3# identifies the cable attached to Ethernet controller 3 on the card in slot 2 (I2) on planar (backplane) P1.The period (.) identifies sublocations (SCSI addresses, cables). The following are examples: v P1-C1.1 identifies processor 1 on processor card (MCM) plugged into planar (backplane) P1. v P2-Z1-A3.1 identifies a SCSI device with SCSI address of LUN 1 at SCSI ID 3 attached to SCSI bus 1 from planar (backplane) 2. v P1-I2/E3.2# identifies the second in a series of cables attached to Ethernet controller 3 on the card in slot 2 (I2) on planar (backplane) P1.Depending on the AIX and firmware levels, AIX diagnostics may include the extended location information when identifying a planar (backplane) or card. The extended location information or cable information is always included when identifying a cable or connector. Location codes with extended location information that are displayed without a description identifying the devices always identify the cable attached to the port. 64 406/90 Service Guide Multiple FRU Callout Instructions If an eight-digit error code appears in the operator panel display or in Chapter 5, “Error Code to FRU Index,” on page 413, a location code for a failing part may also be specified. If the location code includes a blank space followed by a lowercase x followed by a number, this is an error code with multiple FRU callouts. This error can typically happen with memory books or processors, and may involve mixed types of parts. In this case, check the system’s configuration for FRU part numbers to determine the appropriate set of FRUs. You can determine the FRU part numbers of the electronic assemblies in the entire system by using the service processor menus. From the General User menu, select Read VPD Image from Last System Boot, then enter 90 to display detailed Vital Product Data (VPD). You can determine the FRU part numbers of the electronic assemblies in a partition by using the lscfg -vp | pg command on the AIX command line to display the detailed VPD of all assemblies. Notice that the FRU part number information for processors and memory books may be at the bottom of the command output. FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to: v Rack identify LED v Processor subsystem drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – Processor subsystem FRUs – I/O subsystem FRUsv I/O adapter identify LED v DASD identify LEDThe identify LEDs are arranged hierarchically with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding processor subsystem or I/O drawer identify LED, and the corresponding rack-identify LED to locate the failing FRU more easily. Any identify LED in the system may be flashed (when the processor subsystem is in the failed state and power is on) by using the service processor LED Control Menu contained in the System Information Menu of the Privileged User Menus. For information about using the LED Control Menu, see page 788. Any identify LED in the system can also be flashed by using the Identify and Attention Indicators task in diagnostics. The procedure to use the Identify and Attention Indicators task in diagnostics is outlined in Diagnostic Information for Multiple Bus Systems. If the service processor menus and AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of the following procedures: v If the system is configured as a full partition, the system may be booted to the open firmware prompt and the command FRU-LED-MENU entered. A menu displays that allows you to enable the desired FRU identify LED. See “System Power Control Menu” on page 778 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. v If the system is logically partitioned, the HMC must be attached. You can use the HMC to enable any FRU identify LED to be flashed. See the Hardware Management Console Operations Guide for instructions on activating and deactivating a FRU identify LED. Chapter 1. Reference Information 65 AIX Location Codes Note: AIX location codes are not available if Linux is the operating system. The basic formats of the AIX location codes are as follows: v For non-SCSI devices/drives: AB-CD-EF-GHv For SCSI devices/drives: AB-CD-EF-G,HNon-SCSI Devices For planars (backplanes), cards, and non-SCSI devices, the location code is defined as follows: AB-CD-EF-GH | | | | | | | Device/FRU/Port ID | | Connector ID | devfunc Number, Adapter Number or Physical Location Bus Type or PCI Parent Bus v The AB value identifies a bus type or PCI parent bus as assigned by the firmware. v The CD value identifies adapter number, adapter’s devfunc number, or physical location. The devfunc number is defined as the PCI device number times 8, plus the function number. v The EF value identifies a connector. v The GH value identifies a port, address, device, or FRU.Adapters and cards are identified only with AB-CD. The possible values for CD depend on the adapter/card. For pluggable PCI adapters/cards, CD is the device’s devfunc number (PCI device number times 8, plus the function number). The C and D are characters in the range of 0-9, and A-F (hex numbers). The location codes therefore uniquely identify multiple adapters on individual PCI cards. EF is the connector ID, used to identify the adapter’s connector to which a resource is attached. GH is used to identify a port, device, or FRU. For example: v For asynchronous devices, GH defines the port on the fanout box. The values are 00 to 15. v For a diskette drive, H identifies either diskette drive 1 or 2. G is always 0. v For all other devices, GH is equal to 00.For integrated adapter, EF-GH is the same as the definition for a pluggable adapter. For example, the location code for a diskette drive is 01-D1-00-00. A second diskette drive is 01-D1-00-01. 66 406/90 Service Guide SCSI Devices For SCSI devices, the location code is defined as follows: AB-CD-EF-G,H | | | | | | | | | Logical Unit address of the SCSI Device | | | Control Unit Address of the SCSI Device | | Connector ID | devfunc Number, Adapter Number or Physical Location Bus Type or PCI Parent Bus Where: v AB-CD-EF are the same as non-SCSI devices. v G defines the control unit address of the device. Values of 0 to 15 are valid. v H defines the logical unit address of the device. Values of 0 to 255 are valid.A bus location code is also generated as ’00-XXXXXXXX’, where XXXXXXXX is equivalent to the node’s unit address. Examples of physical location codes and AIX location codes are: PCI adapter in first I/O subsystem, slot 1 (primary rack): – Physical location code U1.9-P1-I1 – AIX location code 2V-08 PCI adapter in fifth I/O subsystem, slot 1 (secondary rack): – Location Code U2.1-P1-I1 – AIX location Code 8V-08 Chapter 1. Reference Information 67 AIX and Physical Location Code Reference Tables Attention: AIX location codes are not available when the operating system is Linux. The following illustrations show the location codes that are used to identify the major functional units in the 406/90 primary and secondary racks. The following figure shows the primary rack locations. V7U1.18-P1-V8REARFRONTU1.35-P1-X2U1.35-P2-X2U1.35-P1-X4U1.35-P2-X4U1.17U1.17U1.9U1.9U1.1U1.1U1.5U1.5U1.13-P1-U1.15-P1-V1orU1.13V2orU1.13U1.13-P2-V3orU1.13U1.15-P2-V4orU1.13U1.18-F1U1.18-X1U1.18-F2U1.18-X2U1.35-P1-F1U1.35-P2-F1U1.18-P1-M3U1.18-P1-M4U1.18-P1-M2U1.18-P1-M1U1.18-P1-M7U1.18-P1-M8U1.18-P1-M6U1.18-P1-M5U1.18-F3U1.18-X3U1.18-X4U1.18-F4U1.35-P1-X1UnusedU1.35-P2-X1UnusedU1.35-P1-U1.35-P1-V3U1.35-P1-V2V1U1.35-P2-U1.35-P2-V2U1.35-P2-V3V1U1.35-P1-X3U1.35-P2-X3U1.18-P1-V3U1.18-P1-V6U1.18-P1-V5U1.18-P1-H2U1.18-P1-H4U1.18-P1-H3U1.18-P1-H1U1.18-P1-V4U1.18-P1-U1.18-P1-V1U1.18-P1-V2 68 406/90 Service Guide The following figure shows the secondary rack locations. REARFRONTU2.17-P1-V5U2.13U2.13U2.17-P2-V6U2.9U2.1U2.1U2.5U2.5U2.9U2.19U2.19 Chapter 1. Reference Information 69 The following illustration shows the location codes that are used to identify the MCM modules and L3 modules (located behind fans 3--U1.18-P1-F3 and 4--U1.18-P1-F4). AA123456789101112131415161718192021 1 L3, Location: U1.18-P1-C5 12 L3, Location: U1.18-P1-C16 2 L3, Location: U1.18-P1-C6 13 L3, Location: U1.18-P1-C20 3 L3, Location: U1.18-P1-C10 14 L3, Location: U1.18-P1-C19 4 MCM 0, Location: U1.18-P1-C1 15 MCM 2, Location: U1.18-P1-C4 5 Clock Card, Location: U1.18-P1-X5 16 L3, Location: U1.18-P1-C18 6 L3, Location: U1.18-P1-C7 17 L3, Location: U1.18-P1-C17 7 L3, Location: U1.18-P1-C8 18 L3, Location: U1.18-P1-C13 8 L3, Location: U1.18-P1-C12 19 L3, Location: U1.18-P1-C14 9 L3, Location: U1.18-P1-C11 20 MCM 3, Location: U1.18-P1-C2 10 MCM 1, Location: U1.18-P1-C3 21 L3, Location: U1.18-P1-C9 11 L3, Location: U1.18-P1-C15 70 406/90 Service Guide The following tables contain location codes that are used to identify functional units in the 406/90 systems. Each table shows the locations for a physical part of the system. FRU Name Location Code AIX Location Code Physical Connection Logical Connection System Rack (MT/M Serial#) U1 Bulk Power Subsystem 1 Locations Bulk Power Enclosure A (front) U1.35 BPE Backplane A U1.35-P1 AC to BPE (A) Line Cord Connector U1.35-P1/Q1 AC to BPE (A) Line Cord U1.35-P1/Q1# Bulk Power Reg 1 U1.35-P1-V1 BPR 1 Connector U1.35-P1-V1/Q1 BPR 1 Cable to IBF 1A U1.35-P1-V1/Q1# Bulk Power Distribution 3 (Unused) U1.35-P1-X1 SPARE U1.35-P1-X1/Q1 SPARE Cable U1.35-P1-X1/Q1# SPARE U1.35-P1-X1/Q2 SPARE Cable U1.35-P1-X1/Q2# SPARE U1.35-P1-X1/Q3 SPARE Cable U1.35-P1-X1/Q3# SPARE U1.35-P1-X1/Q4 SPARE Cable U1.35-P1-X1/Q4# SPARE U1.35-P1-X1/Q5 SPARE Cable U1.35-P1-X1/Q5# SPARE U1.35-P1-X1/Q6 SPARE Cable U1.35-P1-X1/Q6# SPARE U1.35-P1-X1/Q7 SPARE Cable U1.35-P1-X1/Q7# SPARE U1.35-P1-X1/Q8 SPARE Cable U1.35-P1-X1/Q8# SPARE U1.35-P1-X1/Q9 SPARE Cable U1.35-P1-X1/Q9# SPARE U1.35-P1-X1/Q10 SPARE Cable U1.35-P1-X1/Q10# Bulk Power Reg 2 (BPR) U1.35-P1-V2 BPR 2 (BPR) Connector U1.35-P1-V2/Q1 BPR 2 Cable to IBF 2A U1.35-P1-V2/Q1# Bulk Power Distribution 1 U1.35-P1-X3 Chapter 1. Reference Information 71 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem -1A Connector 0 I/O Subsystem 1 (DCA 1) U1.35-P1-X3/Q1 I/O Subsystem -1A Cable U1.35-P1-X3/Q1# I/O Subsystem -1B Connector 1 I/O Subsystem 1 (DCA 2) U1.35-P1-X3/Q2 I/O Subsystem -1B Cable U1.35-P1-X3/Q2# I/O Subsystem -2A Connector 2 I/O Subsystem 2 (DCA 1) U1.35-P1-X3/Q3 I/O Subsystem -2A Cable U1.35-P1-X3/Q3# I/O Subsystem -2B Connector 3 I/O Subsystem 2 (DCA 2) U1.35-P1-X3/Q4 I/O Subsystem -2B Cable U1.35-P1-X3/Q4# I/O Subsystem -3A Connector 4 I/O Subsystem 3 (DCA 1) U1.35-P1-X3/Q5 I/O Subsystem -3A Cable U1.35-P1-X3/Q5# I/O Subsystem -3B Connector 5 I/O Subsystem 3 (DCA 2) U1.35-P1-X3/Q6 I/O Subsystem -3B Cable U1.35-P1-X3/Q6# I/O Subsystem -4A Connector 6 I/O Subsystem 4 (DCA 1) U1.35-P1-X3/Q7# I/O Subsystem -4A Cable U1.35-P1-X3/Q7# I/O Subsystem -4B Connector 7 I/O Subsystem 4 (DCA 2) U1.35-P1-X3/Q8 I/O Subsystem -4B Cable U1.35-P1-X3/Q8# DCA-4 Connector U1.35-P1-X3/Q9 DCA-4 Cable U1.35-P1-X3/Q9# DCA-5 Connector U1.35-P1-X3/Q10 DCA-5 Cable U1.35-P1-X3/Q10# Bulk Power Distribution 2 U1.35-P1-X2 I/O Subsystem -5A Connector 0 I/O Subsystem 5 (DCA 1) U1.35-P1-X2/Q1 I/O Subsystem -5A Cable U1.35-P1-X2/Q1# I/O Subsystem -5B Connector 1 I/O Subsystem 5 (DCA 2) U1.35-P1-X2/Q2 I/O Subsystem -5B Cable U1.35-P1-X2/Q2# 72 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem -6A Connector 2 I/O Subsystem 6 (DCA 1) U1.35-P1-X2/Q3 I/O Subsystem -6A Cable U1.35-P1-X2/Q3# I/O Subsystem -6B Connector 3 I/O Subsystem 6 (DCA 2) U1.35-P1-X2/Q4 I/O Subsystem -6B Cable U1.35-P1-X2/Q4# I/O Subsystem -7A Connector 4 I/O Subsystem 7 (DCA 1) U1.35-P1-X2/Q5 I/O Subsystem -7A Cable U1.35-P1-X2/Q5# I/O Subsystem -7B Connector 5 I/O Subsystem 7 (DCA 2) U1.35-P1-X2/Q6 I/O Subsystem -7B Cable U1.35-P1-X2/Q6# SPARE (Future Subsystem 8) U1.35-P1-X2/Q7 SPARE cable U1.35-P1-X2/Q7# SPARE 5 (Future Subsystem 8) U1.35-P1-X2/Q8 SPARE cable U1.35-P1-X2/Q8# SPARE connector U1.35-P1-X2/Q9 SPARE cable U1.35-P1-X2/Q9# DCA-6 Connector U1.35-P1-X2/Q10 DCA-6 Cable U1.35-P1-X2/Q10# Bulk Power Regulator 3 U1.35-P1-V3 BPR 3 Connector U1.35-P1-V3/Q1 BPR 3 Cable to IBF 3A U1.35-P1-V3/Q1# Bulk Power Fan (BPF) U1.35-P1-F1 BPF TO BPC Connector U1.35-P1-F1/Q1 BPF TO BPC Cable U1.35-P1-F1/Q1# Bulk Power Controller A U1.35-P1-X4 BPC A - MDA-1 Connector U1.35-P1-X4/Q1 BPC A - MDA-1 Cable U1.35-P1-X4/Q1# BPC A - MDA-2 Connector U1.35-P1-X4/Q2 BPC A - MDA-2 Cable U1.35-P1-X4/Q2# BPC A - MDA-3 Connector U1.35-P1-X4/Q3 BPC A - MDA-3 Cable U1.35-P1-X4/Q3# Chapter 1. Reference Information 73 FRU Name Location Code AIX Location Code Physical Connection Logical Connection BPC A - MDA-4 Connector U1.35-P1-X4/Q4 BPC A - MDA-4 Cable U1.35-P1-X4/Q4# BPC A - DCA-1 Connector U1.35-P1-X4/Q5 BPC A - DCA-1 Cable U1.35-P1-X4/Q5# BPC A - DCA-2 Connector U1.35-P1-X4/Q6 BPC A - DCA-2 Cable U1.35-P1-X4/Q6# BPC A - DCA-3 Connector U1.35-P1-X4/Q7 BPC A - DCA-3 Cable U1.35-P1-X4/Q7# BPC A TO BPC B Connector U1.35-P1-X4/Q8 BPC A TO BPC B Cable U1.35-P1-X4/Q8# BPC A To Primary I/O Book Connector U1.18-P1-H2/Q8 U1.35-P1-X4/Q10 BPC A ?Y? Cable To Primary I/O Book Connector U1.18-P1-H2/Q8 U1.35-P1-X4/Q10# BPC A TO BPF A Connector U1.35-P1-X4/Q11 BPC A TO BPF A Cable U1.35-P1-X4/Q11# UEPO Switch (attached to frame) U1-V1 UEPO Connector J00 U1-V1/Q1 UEPO Cable to BPC A U1-V1/Q1# UEPO Connector J01 U1-V1/Q2 UEPO Cable to BPC B U1-V1/Q2# Bulk Power Enclosure B (back) U1.35 BPE Backplane B U1.35-P2 AC TO BPE (A) Line Cord Connector U1.35-P2/Q1 AC TO BPE (A) Line Cord U1.35-P2/Q1# Bulk Power Regulator (BPR) 1 U1.35-P2-V1 BPR 1 Connector U1.35-P2-V1/Q1 BPR 1 cable to IBF 1A U1.35-P2-V1/Q1# Bulk Power Distribution 2 U1.35-P2-X2 74 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem -5A Connector 0 I/O Subsystem 5 (DCA 1) U1.35-P2-X2/Q1 I/O Subsystem -5A Cable U1.35-P2-X2/Q1# I/O Subsystem -5B Connector 1 I/O Subsystem 5 (DCA 2) U1.35-P2-X2/Q2 I/O Subsystem -5B Cable U1.35-P2-X2/Q2# I/O Subsystem -6A Connector 2 I/O Subsystem 6 (DCA 1) U1.35-P2-X2/Q3 I/O Subsystem -6A Cable U1.35-P2-X2/Q3# I/O Subsystem -6B Connector 3 I/O Subsystem 6 (DCA 2) U1.35-P2-X2/Q4 I/O Subsystem -6B Cable U1.35-P2-X2/Q4# I/O Subsystem -7A Connector 4 I/O Subsystem 7 (DCA 1) U1.35-P2-X2/Q5 I/O Subsystem -7A cable U1.35-P2-X2/Q5# I/O Subsystem -7B Connector 5 I/O Subsystem 7 (DCA 2) U1.35-P2-X2/Q6 I/O Subsystem -7B cable U1.35-P2-X2/Q6# SPARE (Future Drawer 8) U1.35-P2-X2/Q7 SPARE cable U1.35-P2-X2/Q7# SPARE 5 (Future Drawer 8) U1.35-P2-X2/Q8 SPARE cable U1.35-P2-X2/Q8# SPARE U1.35-P2-X2/Q9 SPARE cable U1.35-P2-X2/Q9# DCA-6 Connector / F UTURE USE U1.35-P2-X2/Q10 DCA-6 Cable / FUTURE USE U1.35-P2-X2/Q10# Bulk Power Regulator 2 U1.35-P2-V2 BPR 2 Connector U1.35-P2-V2/Q1 BPR 2 Cable to IBF 2A U1.35-P2-V2/Q1# Bulk Power Distribution 1 U1.35-P2-X3 I/O Subsystem -1A Connector 0 I/O Subsystem 1 (DCA 1) U1.35-P2-X3/Q1 I/O Subsystem -1A Cable U1.35-P2-X3/Q1# Chapter 1. Reference Information 75 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem -1B Connector 1 I/O Subsystem 1 (DCA 2) U1.35-P2-X3/Q2 I/O Subsystem -1B Cable U1.35-P2-X3/Q2# I/O Subsystem -2A Connector 2 I/O Subsystem 2 (DCA 1) U1.35-P2-X3/Q3 I/O Subsystem -2A Cable U1.35-P2-X3/Q3# I/O Subsystem -2B Connector 3 I/O Subsystem 2 (DCA 2) U1.35-P2-X3/Q4 I/O Subsystem -2B Cable U1.35-P2-X3/Q4# I/O Subsystem -3A Connector 4 I/O Subsystem 3 (DCA 1) U1.35-P2-X3/Q5 I/O Subsystem -3A Cable U1.35-P2-X3/Q5# I/O Subsystem -3B Connector 5 I/O Subsystem 3 (DCA 2) U1.35-P2-X3/Q6 I/O Subsystem -3B Cable U1.35-P2-X3/Q6# I/O Subsystem -4A Connector 6 I/O Subsystem 4 (DCA 1) U1.35-P2-X3/Q7 I/O Subsystem -4A Cable U1.35-P2-X3/Q7# I/O Subsystem -4B Connector 7 I/O Subsystem 4 (DCA 2) U1.35-P2-X3/Q8 I/O Subsystem -4B Cable U1.35-P2-X3/Q8# DCA-4 Connector U1.35-P2-X3/Q9 DCA-4 Cable U1.35-P2-X3/Q9# DCA-5 Connector U1.35-P2-X3/Q10 DCA-5 Cable U1.35-P2-X3/Q10# Bulk Power Regulator 3 U1.35-P2-V3 BPR 2 Connector U1.35-P2-V3/Q1 BPR 2 Cable to IBF 3A U1.35-P2-V3/Q1# Bulk Power Distribution (Unused) U1.35-P2-X1 SPARE U1.35-P2-X1/Q1 SPARE cable U1.35-P2-X1/Q1# SPARE U1.35-P2-X1/Q2 SPARE cable U1.35-P2-X1/Q2# SPARE U1.35-P2-X1/Q3 SPARE cable U1.35-P2-X1/Q3# SPARE U1.35-P2-X1/Q4 76 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SPARE cable U1.35-P2-X1/Q4# SPARE U1.35-P2-X1/Q5 SPARE cable U1.35-P2-X1/Q5# SPARE U1.35-P2-X1/Q6 SPARE cable U1.35-P2-X1/Q6# SPARE U1.35-P2-X1/Q7 SPARE cable U1.35-P2-X1/Q7# SPARE U1.35-P2-X1/Q8 SPARE cable U1.35-P2-X1/Q8# SPARE U1.35-P2-X1/Q9 SPARE cable U1.35-P2-X1/Q9# SPARE U1.35-P2-X1/Q10 SPARE cable U1.35-P2-X1/Q10# Bulk Power Fan U1.35-P2-F1 BPF TO BPC Connector U1.35-P2-F1/Q1 BPF TO BPC Cable U1.35-P2-F1/Q1# Bulk Power Controller B U1.35-P2-X4 BPC B - MDA-1 Connector U1.35-P2-X4/Q1 BPC B - MDA-1 Cable U1.35-P2-X4/Q1# BPC B - MDA-2 Connector U1.35-P2-X4/Q2 BPC B - MDA-2 Cable U1.35-P2-X4/Q2# BPC B - MDA-3 Connector U1.35-P2-X4/Q3 BPC B - MDA-3 Cable U1.35-P2-X4/Q3# BPC B - MDA-4 Connector U1.35-P2-X4/Q4 BPC B - MDA-4 Cable U1.35-P2-X4/Q4# BPC B - DCA-1 Connector U1.35-P2-X4/Q5 BPC B - DCA-1 Cable U1.35-P2-X4/Q5# BPC B - DCA-2 Connector U1.35-P2-X4/Q6 BPC B - DCA-2 Cable U1.35-P2-X4/Q6# BPC B - DCA-3 Connector U1.35-P2-X4/Q7 BPC B - DCA-3 Cable U1.35-P2-X4/Q7# BPC TO BPC Connector U1.35-P2-X4/Q8 BPC TO BPC Cable U1.35-P2-X4/Q8# Chapter 1. Reference Information 77 FRU Name Location Code AIX Location Code Physical Connection Logical Connection BPC TO EPOC Connector U1.35-P2-X4/Q9 BPC TO EPOC Cable U1.35-P2-X4/Q9# BPC TO Primary I/O Book Connector U1.18-P1-H2/Q8 U1.35-P2-X4/Q10 BPC B ?Y? Cable To Primary I/O Book Connector U1.18-P1-H2/Q8 U1.35-P2-X4/Q10# BPC TO BPF Connector U1.35-P2-X4/Q11 BPC TO BPF Cable U1.35-P2-X4/Q11# UEPO TO BPC A Connector U1-V1/Q1 UEPO TO BPC A Cable U1-V1/Q1# FRU Name Location Code AIX Location Code Physical Connection Logical Connection Processor Subsystem Locations Processor Subsystem (MT/M Serial #) U1.18 System Board (backplane) U1.18-P1 Memory Book - Slot 0 U1.18-P1-M2 00-00 Memory Book - Slot 1 U1.18-P1-M3 00-00 Memory Book - Slot 3 U1.18-P1-M6 00-00 Memory Book - Slot 2 U1.18-P1-M7 00-00 Memory Book - Slot 7 U1.18-P1-M1 00-00 Memory Book - Slot 4 U1.18-P1-M4 00-00 Memory Book - Slot 6 U1.18-P1-M5 00-00 Memory Book - Slot 5 U1.18-P1-M8 00-00 Primary I/O book - Slot 0 U1.18-P1-H2 System VPD Module (Backup) U1.18-P1-H2-N1 Diskette Drive Controller U1.18-P1-H2/D1 01-D1 Serial Port 1 Connector U1.18-P1-H2/S1 01-S1 Serial Port 1 Cable U1.18-P1-H2/S1# Serial Port 2 Connector U1.18-P1-H2/S2 01-S2 Serial Port 2 Cable U1.18-P1-H2/S2# HMC Serial Port 1 Connector U1.18-P1-H2/S3 01-S3 HMC Serial Port 1 Cable U1.18-P1-H2/S3# HMC Serial Port 2 Connector U1.18-P1-H2/S4 01-S4 HMC Serial Port 2 Cable U1.18-P1-H2/S4# Debug Port Only U1.18-P1-H2/R1 01-R1 78 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection Operator Panel Connector U1.18-P1-H2/Q7 Diskette Drive Connector U1.18-P1-H2/Q10 Diskette Drive Cable U1.18-P1-H2/Q10# BPC A & B Connector U1.18-P1-H2/Q8 BPC A Connector U1.18-P1-H2/Q8 BPC A Cable U1.18-P1-H2/Q8# BPC B Connector U1.18-P1-H2/Q9 BPC B Cable U1.18-P1-H2/Q9# SPCN Port 0 Connector U1.18-P1-H2/Q5 SPCN Port 0 Cable U1.18-P1-H2/Q5# SPCN Port 1 Connector U1.18-P1-H2/Q6 SPCN Port 1 Cable U1.18-P1-H2/Q6# JTAG U1.18-P1-H2 NVRAM U1.18-P1-H2 System Firmware U1.18-P1-H2/Y1 Primary I/O book-0 Port 0 Connector 0 (1-2) (A0), U1.18-P1-H2/Q1 Primary I/O book-0 Port 0 (1-2) Cable 1, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H2/Q1# Primary I/O book-0 Port 0 Connector 1 (1-3) (A1) U1.18-P1-H2/Q2 Primary I/O book-0 Port 0 Connector 1 (1-3) Cable 2, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H2/Q2# Primary I/O book-1 Port 1 Connector 2 (2-0) (B0) U1.18-P1-H2/Q3 Primary I/O book-1 Port 1 Connector 2 (2-0) Cable 3, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H2/Q3# Primary I/O book-1 Port 1 Connector 3 (2-1) (B1), U1.18-P1-H2/Q4 Primary I/O book-1 Port 1 Connector 3 (2-1) Cable 4 see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H2/Q4# Chapter 1. Reference Information 79 FRU Name Location Code AIX Location Code Physical Connection Logical Connection ISA Bridge U1.18-P1-H2 Real Time Clock (RTC) U1.18-P1-H2 Battery (TOD) U1.18-P1-H2-V1 Error logging VPD Card (MCM -0) U1.18-P1-H2.3 Error logging VPD Card (MCM -1) U1.18-P1-H2.5 Error logging VPD Card (MCM -2) U1.18-P1-H2.4 Error logging VPD Card (MCM -3) U1.18-P1-H2.6 Error logging VPD Card (L3) U1.18-P1-H2.1 COD Security VPD Card U1.18-P1-H2.2 SNI Adapter for High Performance Switch Book 1, Slot 1 Note: There are two types of SNI adapter books. One type has two ports and the other has four ports. Either type can be plugged in U1.18-P1-H1. U1.18-P1-H1 SNI adapter book 1, connector 1 U1.18-P1-H1/Q1 SNI adapter book 1, cable attached to connector 1 U1.18-P1-H1/Q1# SNI adapter book 1, connector 2 U1.18-P1-H1/Q2 SNI adapter book 1, cable attached to connector 2 U1.18-P1-H1/Q2# SNI adapter book 1, connector 3 U1.18-P1-H1/Q3 SNI adapter book 1, cable attached to connector 3 U1.18-P1-H1/Q3# SNI adapter book 1, connector 4 U1.18-P1-H1/Q4 SNI adapter book 1, cable attached to connector 4 U1.18-P1-H1/Q4# I/O Book 2, Slot 2 U1.18-P1-H3 I/O Book 2, Port 11 Connector 3 (1-1) (B1) U1.18-P1-H3/Q1 I/O Book 2, Port 11 Connector 3 (1-1) Cable 1, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections. U1.18-P1-H3/Q1# I/O Book 2, Port 11 Connector 2 (1-0) (B0) U1.18-P1-H3/Q2 I/O Book 2, Port 11 Connector 2 (1-0) Cable 2, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections. U1.18-P1-H3/Q2# I/O Book 2, Port 10 Connector 1 (2-1) (A1) U1.18-P1-H3/Q3 80 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Book 2, Port 10 Connector 1 (2-1) Cable 3, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections. U1.18-P1-H3/Q3# I/O Book 2, Port 10 Connector 0 (2-0) (A0) U1.18-P1-H3/Q4 I/O Book 2, Port 10 Connector 0 (2-0) Cable 4, see “I/O Subsystems (5 and 6) RIO Cabling to the Secondary I/O Book, Two I/O Planar Loop Configuration, 3 or 4 MCMs Only” on page 43 for cable connections. U1.18-P1-H3/Q4# I/O Book 2, Port 5 Connector 3 (1-1) (D1) U1.18-P1-H3/Q5 I/O Book 2, Port 5 Connector 3 (1-1) Cable 5, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H3/Q5# I/O Book 2, Port 5 Connector 2 (1-0) (D0) U1.18-P1-H3/Q6 I/O Book 2, Port 5 Connector 2 (1-0) Cable 6, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H3/Q6# I/O Book 2, Port 4 Connector 1 (2-1) (C1) U1.18-P1-H3/Q7 I/O Book 2, Port 4 Connector 1 (2-1) Cable 7, see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H3/Q7# I/O Book 2, Port 4 Connector 0 (2-0) (C0) U1.18-P1-H3/Q8 I/O Book 2, Port 4 Connector 1 (2-0) Cable 8 , see “I/O Subsystems (1 Through 4 without IBFs) RIO Cabling to I/O Books, Two I/O Planar Loop Configuration, 2, 3, or 4 MCMs Only” on page 42 for cable connections. U1.18-P1-H3/Q8# I/O Book 3 or SNI Adapter Book- Slot 3 Note: This slot may have either a secondary I/O adapter book or an SNI adapter book for the High Performance Switch. If there is an SNI adapter in U1.18-P1-H4, go to ?SNI Adapter for High Performance Switch? on page 82 U1.18-P1-H4 I/O Book 3, Port 12 Connector 2 (1-0) (C0) U1.18-P1-H4/Q1 Chapter 1. Reference Information 81 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Book 3, Port 12 Connector 2 (1-0) Cable 2 , see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections. U1.18-P1-H4/Q1# I/O Book 3, Port 12 Connector 3 (1-1) (C1) U1.18-P1-H4/Q2 I/O Book 3, Port 12 Connector 3 (1-1) Cable 1, see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections. U1.18-P1-H4/Q2# I/O Book 3, Port 13 Connector 0 (2-0) (D0) U1.18-P1-H4/Q3 I/O Book 3, Port 13 Connector 0 (2-0) Cable 4, see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections. U1.18-P1-H4/Q3# I/O Book 3, Port 13 Connector 1 (2-1) (D1) U1.18-P1-H4/Q4 I/O Book 3, Port 13 Connector 1 (2-1) Cable 3, see “I/O Subsystems (7 and 8) RIO Cabling to the Third I/O Book, Two I/O Planar Loop Configuration, 4 MCMs Only” on page 44 for cable connections. U1.18-P1-H4/Q4# I/O Book 3, Port 6 Connector 3 (1-1) (A0) U1.18-P1-H4/Q5 I/O Book 3, Port 6 Connector 3 (1-1) Cable 5 U1.18-P1-H4/Q5# I/O Book 3, Port 6 Connector 2 (1-0) (A1) U1.18-P1-H4/Q6 I/O Book 3, Port 6 Connector 2 (1-0) Cable 6 U1.18-P1-H4/Q6# I/O Book 3, Port 7 Connector 1 (2-1) (B0) U1.18-P1-H4/Q7 I/O Book 3, Port 7 Connector 1 (2-1) Cable 7 U1.18-P1-H4/Q7# I/O Book 3, Port 7 Connector 0 (2-0) (B1) U1.18-P1-H4/Q8 I/O Book 3, Port 7 Connector 1 (2-0) Cable 8 U1.18-P1-H4/Q8# SNI Adapter for High Performance Switch Book 1, Slot 1 Note: There are two types of SNI adapter books. One type has two ports and the other four ports. Either type can be plugged into U1.18-P1-H4. U1.18-P1-H4 SNI adapter book 3, connector 1 U1.18-P1-H4/Q1 SNI adapter book 3, cable attached to connector 1 U1.18-P1-H4/Q1# SNI adapter book 3, connector 2 U1.18-P1-H4/Q2 82 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SNI adapter book 3, cable attached to connector 2 U1.18-P1-H4/Q2# SNI adapter book 3, connector 3 U1.18-P1-H4/Q3 SNI adapter book 3, cable attached to connector 3 U1.18-P1-H4/Q3# SNI adapter book 3, connector 4 U1.18-P1-H4/Q4 SNI adapter book 3, cable attached to connector 4 U1.18-P1-H4/Q4# DCA 1-RH U1.18-P1-V1 DCA 1- BPA A - BPC - 5 Connector U1.18-P1-V1/Q1 DCA 1- BPA A - BPC -5 Cable U1.18-P1-V1/Q1# DCA 1- BPA B - BPC - 5 Connector U1.18-P1-V1/Q2 DCA 1- BPA B - BPC -5 Cable U1.18-P1-V1/Q2# DCA 2-RH U1.18-P1-V2 DCA 2- BPA A - BPC - 6 Connector U1.18-P1-V2/Q1 DCA 2- BPA A - BPC -6 Cable U1.18-P1-V2/Q1# DCA 2- BPA B - BPC - 6 Connector U1.18-P1-V2/Q2 DCA 2- BPA B - BPC -6 Cable U1.18-P1-V2/Q2# DCA 3-RH U1.18-P1-V4 DCA 3- BPA A - BPC - 7 Connector U1.18-P1-V4/Q1 DCA 3- BPA A - BPC - 7 Cable U1.18-P1-V4/Q1# DCA 3- BPA B - BPC - 7 Connector U1.18-P1-V4/Q2 DCA 3- BPA B - BPC - 7 Cable U1.18-P1-V4/Q2# DCA 4-RH U1.18-P1-V5 DCA 4- BPA A - BPD1 - 9 Connector U1.18-P1-V5/Q1 DCA 4- BPA A - BPD1 - 9 Cable U1.18-P1-V5/Q1# DCA 4- BPA B - BPD1 - 9 Connector U1.18-P1-V5/Q2 DCA 4- BPA B - BPD1 - 9 Cable U1.18-P1-V5/Q2# DCA 5-RH U1.18-P1-V6 DCA 5- BPA A - BPD1 - 10 Connector U1.18-P1-V6/Q1 DCA 5- BPA A - BPD1 - 10 Cable U1.18-P1-V6/Q1# DCA 5- BPA B - BPD1 - 10 Connector U1.18-P1-V6/Q2 DCA 5- BPA B - BPD1 - 10 Cable U1.18-P1-V6/Q2# DCA 6-RH U1.18-P1-V8 DCA 6- BPA A - BPD2 - 10 Connector U1.18-P1-V8/Q1 DCA 6- BPA A - BPD2 - 10 Cable U1.18-P1-V8/Q1# DCA 6- BPA B - BPD2 - 10 Connector U1.18-P1-V8/Q2 DCA 6- BPA B - BPD2 - 10 Cable U1.18-P1-V8/Q2# RH CAP Card U1.18-P1-V3 RH CAP Card U1.18-P1-V7 Chapter 1. Reference Information 83 FRU Name Location Code AIX Location Code Physical Connection Logical Connection Hourglass Card Assembly U1.18-P1-X5 MCM Module 0 4 & 8 way U1.18-P1-C1 Processor 0 (W/ L1 & L2 Cache) U1.18-P1-C1 00-00 Processor 1 (W/ L1 & L2 Cache) U1.18-P1-C1 00-01 Processor 2 (W/ L1 & L2 Cache) U1.18-P1-C1 00-02 Processor 3 (W/ L1 & L2 Cache) U1.18-P1-C1 00-03 Processor 4 (W/ L1 & L2 Cache) U1.18-P1-C1 00-04 Processor 5 (W/ L1 & L2 Cache) U1.18-P1-C1 00-05 Processor 6 (W/ L1 & L2 Cache) U1.18-P1-C1 00-06 Processor 7 (W/ L1 & L2 Cache) U1.18-P1-C1 00-07 MCM Module 3 U1.18-P1-C2 Processor 24 (W/ L1 & L2 Cache) U1.18-P1-C2 00-24 Processor 25 (W/ L1 & L2 Cache) U1.18-P1-C2 00-25 Processor 26 (W/ L1 & L2 Cache) U1.18-P1-C2 00-26 Processor 27 (W/ L1 & L2 Cache) U1.18-P1-C2 00-27 Processor 28 (W/ L1 & L2 Cache) U1.18-P1-C2 00-28 Processor 29 (W/ L1 & L2 Cache) U1.18-P1-C2 00-29 Processor 30 (W/ L1 & L2 Cache) U1.18-P1-C2 00-30 Processor 31 (W/ L1 & L2 Cache) U1.18-P1-C2 00-31 MCM Module 1 U1.18-P1-C3 Processor 08 (W/ L1 & L2 Cache) U1.18-P1-C3 00-08 Processor 09 (W/ L1 & L2 Cache) U1.18-P1-C3 00-09 Processor 10 (W/ L1 & L2 Cache) U1.18-P1-C3 00-10 Processor 11 (W/ L1 & L2 Cache U1.18-P1-C3 00-11 Processor 12 (W/ L1 & L2 Cache U1.18-P1-C3 00-12 Processor 13 (W/ L1 & L2 Cache U1.18-P1-C3 00-13 Processor 14 (W/ L1 & L2 Cache U1.18-P1-C3 00-14 Processor 15 (W/ L1 & L2 Cache U1.18-P1-C3 00-15 MCM Module 2 U1.18-P1-C4 Processor 16 (W/ L1 & L2 Cache U1.18-P1-C4 00-16 Processor 17 (W/ L1 & L2 Cache U1.18-P1-C4 00-17 Processor 18 (W/ L1 & L2 Cache U1.18-P1-C4 00-18 Processor 19 (W/ L1 & L2 Cache U1.18-P1-C4 00-19 Processor 20 (W/ L1 & L2 Cache U1.18-P1-C4 00-20 Processor 21 (W/ L1 & L2 Cache) U1.18-P1-C4 00-21 Processor 22 (W/ L1 & L2 Cache) U1.18-P1-C4 00-22 Processor 21 (W/ L1 & L2 Cache) U1.18-P1-C4 00-23 L3 Module (Proc 3D) U1.18-P1-C5 00-00 L3 Module (Proc 0A) U1.18-P1-C6 00-00 84 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection L3 Module (Proc 0B) U1.18-P1-C7 00-00 L3 Module (Proc 1C) U1.18-P1-C8 00-00 L3 Module (Proc 3B) U1.18-P1-C9 00-00 L3 Module (Proc 0C) U1.18-P1-C10 00-00 L3 Module (Proc 0D) U1.18-P1-C11 00-00 L3 Module (Proc 1A) U1.18-P1-C12 00-00 L3 Module (Proc 3A) U1.18-P1-C13 00-00 L3 Module (Proc 2D) U1.18-P1-C14 00-00 L3 Module (Proc 2C) U1.18-P1-C15 00-00 L3 Module (Proc 1B) U1.18-P1-C16 00-00 L3 Module (Proc 3C) U1.18-P1-C17 00-00 L3 Module (Proc 2B) U1.18-P1-C18 00-00 L3 Module (Proc 2A) U1.18-P1-C19 00-00 L3 Module (Proc 1D) U1.18-P1-C20 00-00 Passthru Card Module 1 U1.18-P1-C3 00-00 Passthru Card Module 3 U1.18-P1-C2 00-00 Fan Controller Assembly 1 U1.18-X1 Fan Controller 1 TO BPC A Connector U1.18-X1/Q1 Fan Controller 1 TO BPC A Cable U1.18-X1/Q1# Fan Controller 1 TO BPC B Connector U1.18-X1/Q2 Fan Controller 1 TO BPC B Cable U1.18-X1/Q2# Fan Controller Assembly 2 U1.18-X2 Fan Controller 2 TO BPC A Connector U1.18-X2/Q1 Fan Controller 2 TO BPC A Connector U1.18-X2/Q1# Fan Controller 2 TO BPC B Connector U1.18-X2/Q2# Fan Controller 2 TO BPC B Connector U1.18-X2/Q2# Fan Controller Assembly 3 U1.18-X3 Fan Controller 3 TO BPC A Connector U1.18-X3/Q1 Fan Controller 3 TO BPC A Cable U1.18-X3/Q1# Fan Controller 3 TO BPC B Connector U1.18-X3/Q2 Fan Controller 3 TO BPC B Cable U1.18-X3/Q2# Fan Controller Assembly 4 U1.18-X4 Fan Controller 4 TO BPC A Connector U1.18-X4/Q1 Fan Controller 4 TO BPC A Cable U1.18-X4/Q1# Fan Controller 4 TO BPC B Connector U1.18-X4/Q2 Fan Controller 4 TO BPC B Cable U1.18-X4/Q2# Fan Assembly 1 U1.18-F1 Fan Assembly 2 U1.18-F2 Fan Assembly 3 U1.18-F3 Chapter 1. Reference Information 85 FRU Name Location Code AIX Location Code Physical Connection Logical Connection Fan Assembly 4 U1.18-F4 FRU Name Location Code AIX Location Code Physical Connection Logical Connection Media Subsystem Locations Media Drawer chassis U1.17 Media Drawer - Operator Panel, U1.17-L1 Operator Panel Connector U1.17-L1/Q1 Operator Panel Cable U1.17-L1/Q1# System VPD Module U1.17-L1-N1 Media Drawer - Diskette U1.17-D1 Diskette Drive Connector U1.17-D1/Q2 Diskette Drive Cable U1.17-D1/Q2# Media Drawer - DVD-RAM/CD-ROM U1.x-Px-Ix/Z1-A1 Media Drawer - Tape Drive U1.x-Px-Ix/Z1-A0 Media Drawer connector U1.17-A1/Q3 Media Drawer cable U1.17-A1/Q3# Media Drawer Power Connector U1.17-A1/Q4 Media Drawer Power Cable U1.17-A1/Q4# FRU Name Location Code AIX Location Code Physical Connection Logical Connection Integrated Battery Feature Locations (If Installed) IBF (Battery) 1A to BPA A U1.13-P1-V1 If Installed IBF 1A Connector U1.13-P1-V1/Q1 If Installed IBF 1A Cable to BPA A - BPR 1 U1.13-P1-V1/Q1# If Installed IBF (Battery) 2A to BPA A U1.15-P1-V2 If Installed IBF 2A Connector U1.15-P1-V2/Q1 If Installed IBF 2A Cable to BPA A - BPR 2 U1.15-P1-V2/Q1# If Installed IBF (Battery) 1B to BPA B U1.13-P2-V3 If Installed IBF 1B Connector U1.13-P2-V3/Q1 If Installed IBF 1B Cable to BPA B -BPR 1 U1.13-P2-V3/Q1# If Installed 86 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection IBF (Battery) 2B to BPA B U1.15-P2-V4 If Installed IBF 2B Connector U1.15-P2-V4/Q1 If Installed IBF 2B Cable to BPA B -BPR 2 U1.15-P2-V4/Q1# If Installed IBF (Battery) 3A to BPA A U2.17-P1-V5 If Installed IBF 3A Connector U2.17-P1-V5/Q1 If Installed IBF 3A Cable to BPA A - BPR 3 U2.17-P1-V5/Q1# If Installed IBF (Battery) 3B to BPA B U2.17-P2-V6 If Installed IBF 3B Connector U2.17-P2-V6/Q1 If Installed IBF 3B Cable to BPA B - BPR 3 U2.17-P2-V6/Q1# If Installed FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 1 Locations I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U1.9 I/O Subsystem Left IO backplane assembly U1.9-P1 I/O Subsystem Left IO backplane assembly VPD U1.9-P1-N1 EADS 1 - PCI Controller U1.9-P1 2U-58, 2U-5A, 2U-5C, 2U-5E PCI Slot 1 Content U1.9-P1-I1 2V-08 to 2V-0For 2W-xx or 2X-xx PCI Slot 2 Content U1.9-P1-I2 2Y-08 to 2Y-0For 2Z-xx or 2a-xx PCI Slot 3 Content U1.9-P1-I3 2b-08 to 2b-0For 2c-xx or 2d-xx PCI Slot 4 Content U1.9-P1-I4 2e-08 to 2e-0For 2f-xx or 2g-xx EADS 2 - PCI Controller U1.9-P1 2j-58, 2j-5A, 2j-5E PCI Slot 5 Content U1.9-P1-I5 2k-08 to 2k-0For 2m-xx or 2n-xx PCI Slot 6 Content U1.9-P1-I6 2p-08 to 2p-0For 2q-xx or 2r-xx PCI Slot 7 Content U1.9-P1-I7 2v-08 to 2v-0For 2w-xx or 2x-xx EADS 3 - PCI Controller U1.9-P1 30-58, 30-5A, 30-5E PCI Slot 8 Content U1.9-P1-I8 31-08 to 31-0For 32-xx or 32-xx PCI Slot 9 Content U1.9-P1-I9 34-08 to 34-0For 35-xx or 36-xx Chapter 1. Reference Information 87 FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 10 Content U1.9-P1-I10 3A-08 to 3A-0For 3B-xx or 3C-xx I/O Riser Card U1.9-P1 I/O Port Connector U1.9-P1/Q1 I/O Port Cable From GX1A-port 1 U1.9-P1/Q1# I/O Port Connector U1.9-P1/Q2 I/O Port Cable to port 0 on next Riser card U1.9-P2.1/Q1 U1.9-P1/Q2# I/O Subsystem Right IO backplane assembly U1.9-P2 I/O Subsystem Right IO backplane assembly VPD U1.9-P2-N1 EADS 1 - PCI Controller U1.9-P2 3E-58, 3E-5A, 3E-5C, 3E-5E PCI Slot 1 Content U1.9-P2-I1 3F-08 to 3F-0For 3G-xx or 3H-xx PCI Slot 2 Content U1.9-P2-I2 3J-08 to 3J-0For 3K-xx or 3L-xx PCI Slot 3 Content U1.9-P2-I3 3M-08 to 3M-0For 3N-xx or 3P-xx PCI Slot 4 Content U1.9-P2-I4 3Q-08 to 3Q-0For 3R-xx or 3S-xx EADS 2 - PCI Controller U1.9-P2 3U-58, 3U-5A, 3U-5E PCI Slot 5 Content U1.9-P2-I5 3V-08 to 3V-0For 3W-xx or 3X-xx PCI Slot 6 Content U1.9-P2-I6 3Y-08 to 3Y-0For 3Z-xx or 3a-xx PCI Slot 7 Content U1.9-P2-I7 3e-08 to 3e-0For 3f-xx or 3a-xx EADS 3 - PCI Controller U1.9-P2 3j-58, 3j-5A, 3j-5E PCI Slot 8 Content U1.9-P2-I8 3k-08 to 3k-0For 3m-xx or 3n-xx PCI Slot 9 Content U1.9-P2-I9 3p-08 to 3p-0For 3q-xx or 3r-xx PCI Slot 10 Content U1.9-P2-I10 3v-08 to 3v-0For 3w-xx or 3x-xx I/O Riser Card U1.9-P2 I/O Port Connector U1.9-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U1.9-P2/Q1# I/O Port Connector U1.9-P2/Q2 88 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Port Cable to GX1A-port 2 U1.9-P2/Q2# I/O Subsystem DCA 1 U1.9-V1 DCA 1 to (BPA B) BPD1- 1A Connector P00 U1.9-V1/Q1 DCA 1 to (BPA B) BPD1- 1A Cable U1.9-V1/Q1# DCA 1 to (BPA A) BPD1- 1A Connector P00 U1.9-V1/Q2 DCA 1 to (BPA A) BPD1- 1A Cable U1.9-V1/Q2# Thermal Sensor U1.9-V1 I/O Subsystem DCA 2 U1.9-V2 DCA 2 to (BPA B) BPD1- 1B Connector P01 U1.9-V2/Q1 DCA 2 to (BPA B) BPD1- 1B Cable U1.9-V2/Q1# DCA 2 to (BPA A) BPD1- 1B Connector P01 U1.9-V2/Q2 DCA 2 to (BPA A) BPD1- 1B Cable U1.9-V2/Q2# Thermal Sensor U1.9-V2 I/O Subsystem SCSI controller 1 on P1 U1.9-P1/Z1 2s-08 I/O Subsystem SCSI controller 2 on P1 U1.9-P1/Z2 37-08 I/O Subsystem SCSI controller 1 on P2 U1.9-P2/Z1 3b-08 I/O Subsystem SCSI controller 2 on P2 U1.9-P2/Z2 3s-08 DASD 4-pack Cage and card (1) U1.9-P3 DASD 4-pack Cage and card (1) VPD U1.9-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U1.9-P2/Z2-A8 3s-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U1.9-P2/Z2-A9 3s-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U1.9-P2/Z2-Aa 3s-08-00-10,0 SCSI DASD 4 hdisk at ID A connected to controller 2 on P2 U1.9-P2/Z2-Ab 3s-08-00-11,0 Chapter 1. Reference Information 89 FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI Enclosure Services SES connected to controller 2 on P2 U1.9-P2/Z2-Bf 3s-08-00-15,0 DASD 4-pack Cage and card (2) U1.9-P4 DASD 4-pack Cage and card (2) VPD U1.9-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U1.9-P2/Z1-A8 3b-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U1.9-P2/Z1-A9 3b-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U1.9-P2/Z1-Aa 3b-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U1.9-P2/Z1-Ab 3b-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U1.9-P2/Z1-Af 3b-08-00-15,0 DASD 4-pack Cage and card (3) U1.9-P5 DASD 4-pack Cage and card (3) VPD U1.9-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U1.9-P1/Z2-A8 37-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U1.9-P1/Z2-A9 37-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U1.9-P1/Z2-Aa 37-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U1.9-P1/Z2-Ab 37-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U1.9-P1/Z2-Af 37-08-00-15,0 DASD 4-pack Cage and card (4) U1.9-P6 DASD 4-pack Cage and card (4) VPD U1.9-P6-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U1.9-P1/Z1-A8 2s-08-00-8,0 90 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U1.9-P1/Z1-A9 2s-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U1.9-P1/Z1-Aa 2s-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U1.9-P1/Z1-Ab 2s-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U1.9-P1/Z1-Af 2s-08-00-15,0 I/O Subsystem Fan (MSA) U1.9-F1 I/O Subsystem Fan (MSA) U1.9-F2 I/O Subsystem Fan (MSA) U1.9-F3 I/O Subsystem Fan (MSA) U1.9-F4 Media Drawer Power Connector U1.9-P1-V1/Q3 Media Drawer Power Cable U1.9-P1-V1/Q3# Media Drawer Power Connector U1.9-P2-V2/Q3 Media Drawer Power Cable U1.9-P2-V2/Q3# FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 2 Locations I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U1.5 I/O Subsystem Left IO backplane assembly U1.5-P1 I/O Subsystem Left IO backplane assembly VPD U1.5-P1-N1 EADS 1 - PCI Controller U1.5-P1 40-58, 40-5A, 40-5C, 40-5E PCI Slot 1 Content U1.5-P1-I1 41-08 to 41-0For 42-xx or 43-xx PCI Slot 2 Content U1.5-P1-I2 44-08 to 44-0For 45-xx or 47-xx PCI Slot 3 Content U1.5-P1-I3 47-08 to 47-0For 48-xx or 49-xx Chapter 1. Reference Information 91 FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 4 Content U1.5-P1-I4 4A-08 to 4A-0For 4B-xx or 4C-xx EADS 2 - PCI Controller U1.5-P1 4E-58, 4E-5A, 4E-5E PCI Slot 5 Content U1.5-P1-I5 4F-08 to 4F-0For 4G-xx or 4H-xx PCI Slot 6 Content U1.5-P1-I6 4J-08 to 4J-0For 4K-xx or 4L-xx PCI Slot 7 Content U1.5-P1-I7 4Q-08 to 4Q-0For 4R-xx or 4S-xx EADS 3 - PCI Controller U1.5-P1 4U-58, 4U-5A, 4U-5E PCI Slot 8 Content U1.5-P1-I8 4V-08 to 4V-0For 4W-xx or 4X-xx PCI Slot 9 Content U1.5-P1-I9 4Y-08 to 4Y-0For 4Z-xx or 4a-xx PCI Slot 10 Content U1.5-P1-I10 4e-08 to 4e-0For 4f-xx or 4g-xx I/O Riser Card U1.5-P1 I/O Port Connector U1.5-P1/Q1 I/O Port Cable From GX1A-port 1 U1.5-P1/Q1# I/O Port Connector U1.5-P1/Q2 I/O Port Cable to port 0 on next Riser card U1.5-P2.1/Q1 U1.5-P1/Q2# I/O Subsystem Right IO backplane assembly U1.5-P2 I/O Subsystem Right IO backplane assembly VPD U1.5-P2-N1 EADS 1 - PCI Controller U1.5-P2 4j-58, 4j-5A, 4j-5C, 4j-5E PCI Slot 1 Content U1.5-P2-I1 4k-08 to 4k-0For 4m-xx or 4n-xx PCI Slot 2 Content U1.5-P2-I2 4p-08 to 4p-0For 4q-xx or 4r-xx PCI Slot 3 Content U1.5-P2-I3 4s-08 to 4s-0For 4t-xx or 4t-xx PCI Slot 4 Content U1.5-P2-I4 4v-08 to 4v-0For 4t-xx or 4u-xx EADS 2 - PCI Controller U1.5-P2 50-58, 50-5A, 50-5E PCI Slot 5 Content U1.5-P2-I5 51-08 to 51-0For 52-xx or 53-xx PCI Slot 6 Content U1.5-P2-I6 54-08 to 54-0For 55-xx or 56-xx 92 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 7 Content U1.5-P2-I7 5A-08 to 5A-0For 5B-xx or 5C-xx EADS 3 - PCI Controller U1.5-P2 5E-58, 5E-5A, 5E-5E PCI Slot 8 Content U1.5-P2-I8 5F-08 to 5F-0For 5G-xx or 5H-xx PCI Slot 9 Content U1.5-P2-I9 5J-08 to 5J-0For 5K-xx or 5L-xx PCI Slot 10 Content U1.5-P2-I10 5Q-08 to 5Q-0For 5R-xx or 5S-xx I/O Riser Card U1.5-P2.1 I/O Port Connector U1.5-P2.1/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U1.5-P2.1/Q1# I/O Port Connector U1.5-P2.1/Q2 I/O Port Cable to GX1A-port 2 U1.5-P2.1/Q2# I/O Subsystem DCA 1 U1.5-V1 DCA 1 to (BPA B) BPD1- 1A Connector P02 U1.5-V1/Q1 DCA 1 to (BPA B) BPD1- 1A Cable U1.5-V1/Q1# DCA 1 to (BPA A) BPD1- 1A Connector P02 U1.5-V1/Q2 DCA 1 to (BPA A) BPD1- 1A Cable U1.5-V1/Q2# Thermal Sensor U1.5-V1 I/O Subsystem DCA 2 U1.5-V2 DCA 2 to (BPA B) BPD1- 1B Connector P03 U1.5-V2/Q1 DCA 2 to (BPA B) BPD1- 1B Cable U1.5-V2/Q1# DCA 2 to (BPA A) BPD1- 1B Connector P03 U1.5-V2/Q2 DCA 2 to (BPA A) BPD1- 1B Cable U1.5-V2/Q2# Thermal Sensor U1.5-V2 I/O Subsystem SCSI controller 1 on P1 U1.5-P1/Z1 4M-08 I/O Subsystem SCSI controller 2 on P1 U1.5-P1/Z2 4b-08 I/O Subsystem SCSI controller 1 on P2 U1.5-P2/Z1 57-08 I/O Subsystem SCSI controller 2 on P2 U1.5-P2/Z2 5M-08 Chapter 1. Reference Information 93 FRU Name Location Code AIX Location Code Physical Connection Logical Connection DASD 4-pack Cage and card (1) U1.5-P3 DASD 4-pack Cage and card (1) VPD U1.5-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U1.5-P2/Z2-A8 5M-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U1.5-P2/Z2-A9 5M-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U1.5-P2/Z2-Aa 5M-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U1.5-P2/Z2-Ab 5M-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P2 U1.5-P2/Z2-Af 5M-08-00-15,0 DASD 4-pack Cage and card (2) U1.5-P4 DASD 4-pack Cage and card (2) VPD U1.5-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U1.5-P2/Z1-A8 57-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U1.5-P2/Z1-A9 57-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U1.5-P2/Z1-Aa 57-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U1.5-P2/Z1-Ab 57-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U1.5-P2/Z1-Af 57-08-00-15,0 DASD 4-pack Cage and card (3) U1.5-P5 DASD 4-pack Cage and card (3) VPD U1.5-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U1.5-P1/Z2-A8 4b-80-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U1.5-P1/Z2-A9 4b-80-00-9,0 94 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U1.5-P1/Z2-Aa 4b-80-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U1.5-P1/Z2-Ab 4b-80-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U1.5-P1/Z2-Af 4b-80-00-15,0 DASD 4-pack Cage and card (4) U1.5-P6 DASD 4-pack Cage and card (4) VPD U1.5-P6-N1 SCSI DASD 1 hdisk at ID8 connected to controller 1 on P1 U1.5-P1/Z1-A8 4M-08-00-8,0 SCSI DASD 2 hdisk at ID9 connected to controller 1 on P1 U1.5-P1/Z1-A9 4M-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U1.5-P1/Z1-Aa 4M-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U1.5-P1/Z1-Ab 4M-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U1.5-P1/Z1-Af 4M-08-00-15,0 I/O Subsystem Fan (MSA) U1.5-F1 I/O Subsystem Fan (MSA) U1.5-F2 I/O Subsystem Fan (MSA) U1.5-F3 I/O Subsystem Fan (MSA) U1.5-F4 Media Drawer Power Connector U1.5-P1-X1/Q3 Media Drawer Power Cable U1.5-P1-X1/Q3# Media Drawer Power Connector U1.5-P2-X2/Q3 Media Drawer Power Cable U1.5-P2-X2/Q3# Chapter 1. Reference Information 95 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 3 Locations I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U1.1 I/O Subsystem Left IO backplane assembly U1.1-P1 I/O Subsystem Left IO backplane assembly VPD U1.1-P1-N1 EADS 1 - PCI Controller U1.1-P1 5U-58, 5U-5A, 5U-5C, 5U-5E PCI Slot 1 Content U1.1-P1-I1 5V-08 to 5V-0For 5W-xx or 5X-xx PCI Slot 2 Content U1.1-P1-I2 5Y-08 to 5Y-0For 5Z-xx or 5a-xx PCI Slot 3 Content U1.1-P1-I3 5b-08 to 5b-0For 5c-xx or 5d-xx PCI Slot 4 Content U1.1-P1-I4 5e-08 to 5e-0For 5f-xx or 5g-xx EADS 2 - PCI Controller U1.1-P1 5j-58, 5j-5A, 5j-5E PCI Slot 5 Content U1.1-P1-I5 5k-08 to 5k-0For 5m-xx or 5n-xx PCI Slot 6 Content U1.1-P1-I6 5p-08 to 5p-0For 5q-xx or 5r-xx PCI Slot 7 Content U1.1-P1-I7 5v-08 to 5v-0For 5w-xx or 5w-xx EADS 3 - PCI Controller U1.1-P1 60-58, 60-5A, 60-5E PCI Slot 8 Content U1.1-P1-I8 61-08 to 61-0For 62-xx or 63-xx PCI Slot 9 Content U1.1-P1-I9 64-08 to 64-0For 65-xx or 66-xx PCI Slot 10 Content U1.1-P1-I10 6A-08 to 6A-0For 6B-xx or 6C-xx I/O Riser Card U1.1-P1 I/O Port Connector U1.1-P1/Q1 I/O Port Cable From GX1A-port 1 U1.1-P1/Q1# I/O Port Connector U1.1-P1/Q2 I/O Port Cable to port 0 on next Riser card U1.1-P2.1/Q1 U1.1-P1/Q2# I/O Subsystem Right IO backplane assembly U1.1-P2 I/O Subsystem Right IO backplane assembly VPD U1.1-P2-N1 EADS 1 - PCI Controller U1.1-P2 6E-58, 6E-5A, 6E-5C, 6E-5E 96 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 1 Content U1.1-P2-I1 6F-08 to 6F-0For 6G-xx or 6H-xx PCI Slot 2 Content U1.1-P2-I2 6J-08 to 6J-0For 6K-xx or 6L-xx PCI Slot 3 Content U1.1-P2-I3 6M-08 to 6M-0For 6N-xx or 6P-xx PCI Slot 4 Content U1.1-P2-I4 6Q-08 to 6Q-0For 6R-xx or 6S-xx EADS 2 - PCI Controller U1.1-P2 6U-58, 6U-5A, 6U-5E PCI Slot 5 Content U1.1-P2-I5 6V-08 to 6V1-0For 6W-xx or 6F-xx PCI Slot 6 Content U1.1-P2-I6 6Y-08 to 6Y-0For 6Z-xx or 6a-xx PCI Slot 7 Content U1.1-P2-I7 6e-08 to 6Y-0For 6f-xx or 6g-xx EADS 3 - PCI Controller U1.1-P2 6j-58, 6j-5A, 6j-5E PCI Slot 8 Content U1.1-P2-I8 6k-08 to 6Y-0For 6m-xx or 6n-xx PCI Slot 9 Content U1.1-P2-I9 6p-08 to 6p-0For 6q-xx or 6r-xx PCI Slot 10 Content U1.1-P2-I10 6v-08 to 6v-0For 6w-xx or 6x-xx I/O Riser Card U1.1-P2 I/O Port Connector U1.1-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U1.1-P2/Q1# I/O Port Connector U1.1-P2/Q2 I/O Port Cable to GX1A-port 2 U1.1-P2/Q2# I/O Subsystem DCA 1 U1.1-V1 DCA 1 to (BPA B) BPD1- 1A Connector P04 U1.1-V1/Q1 DCA 1 to (BPA B) BPD1- 1A Cable U1.1-V1/Q1# DCA 1 to (BPA A) BPD1- 1A Connector P04 U1.1-V1/Q2 DCA 1 to (BPA A) BPD1- 1A Cable U1.1-V1/Q2# Thermal Sensor U1.1-V1 I/O Subsystem DCA 2 U1.1-V2 DCA 2 to (BPA B) BPD1- 1B Connector P05 U1.1-V2/Q1 DCA 2 to (BPA B) BPD1- 1B Cable U1.1-V2/Q1# Chapter 1. Reference Information 97 FRU Name Location Code AIX Location Code Physical Connection Logical Connection DCA 2 to (BPA A) BPD1- 1B Connector P05 U1.1-V2/Q2 DCA 2 to (BPA A) BPD1- 1B Cable U1.1-V2/Q2# Thermal Sensor U1.1-V2 I/O Subsystem SCSI controller 1 on P1 U1.1-P1/Z1 5s-08 I/O Subsystem SCSI controller 2 on P1 U1.1-P1/Z2 67-08 I/O Subsystem SCSI controller 1 on P2 U1.1-P2/Z1 6b-08 I/O Subsystem SCSI controller 2 on P2 U1.1-P2/Z2 6s-08 DASD 4-pack Cage and card (1) U1.1-P3 DASD 4-pack Cage and card (1) VPD U1.1-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U1.1-P2/Z2-A8 6s-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U1.1-P2/Z2-A9 6s-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U1.1-P2/Z2-Aa 6s-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U1.1-P2/Z2-Ab 6s-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P2 U1.1-P2/Z2-Bf 6s-08-00-15,0 DASD 4-pack Cage and card (2) U1.1-P4 DASD 4-pack Cage and card (2) VPD U1.1-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U1.1-P2/Z1-A8 6b-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U1.1-P2/Z1-A9 6b-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U1.1-P2/Z1-Aa 6b-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U1.1-P2/Z1-Ab 6b-08-00-11,0 98 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI Enclosure Services SES connected to controller 1 on P2 U1.1-P2/Z1-Af 6b-08-00-15,0 DASD 4-pack Cage and card (3) U1.1-P5 DASD 4-pack Cage and card (3) VPD U1.1-P5/Z1-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U1.1-P1/Z2-A8 67-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U1.1-P1/Z2-A9 67-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U1.1-P1/Z2-Aa 67-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U1.1-P1/Z2-Ab 67-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U1.1-P1/Z2-Af 67-08-00-15,0 DASD 4-pack Cage and card (4) U1.1-P6 DASD 4-pack Cage and card (4) VPD U1.1-P6/Z2-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U1.1-P1/Z1-A8 5s-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U1.1-P1/Z1-A9 5s-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U1.1-P1/Z1-Aa 5s-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U1.1-P1/Z1-Ab 5s-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U1.1-P1/Z1-Af 5s-08-00-15,0 I/O Subsystem Fan (MSA) U1.1-F1 I/O Subsystem Fan (MSA) U1.1-F2 I/O Subsystem Fan (MSA) U1.1-F3 I/O Subsystem Fan (MSA) U1.1-F4 Chapter 1. Reference Information 99 FRU Name Location Code AIX Location Code Physical Connection Logical Connection Media Drawer Power Connector U1.1-P1-X1/Q3 Media Drawer Power Cable U1.1-P1-X1/Q3# Media Drawer Power Connector U1.1-P2-X2/Q3 Media Drawer Power Cable U1.1-P2-X2/Q3# FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 4 Locations (If No IBF Installed) Note: If the Integrated Battery Feature (IBF) is installed, I/O subsystem 4 is in I/O subsystem 7 position. See page 105. I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U1.13 I/O Subsystem Left IO backplane assembly U1.13-P1 I/O Subsystem Left IO backplane assembly VPD U1.13-P1-N1 EADS 1 - PCI Controller U1.13-P1 70-58, 70-5A, 70-5C, 70-5E PCI Slot 1 Content U1.13-P1-I1 71-08 to 71-0For 72-xx or 73-xx PCI Slot 2 Content U1.13-P1-I2 74-08 to 74-0For 75-xx or 76-xx PCI Slot 3 Content U1.13-P1-I3 77-08 to 77-0For 78-xx or 79-xx PCI Slot 4 Content U1.13-P1-I4 7A-08 to 7F-0For 7B-xx or 7C-xx EADS 2 - PCI Controller U1.13-P1 7E-58, 7E-5A, 7E-5E PCI Slot 5 Content U1.13-P1-I5 7F-08 to 7F-0For 7G-xx or 7H-xx PCI Slot 6 Content U1.13-P1-I6 7J-08 to 7J-0For 7K-xx or 7L-xx PCI Slot 7 Content U1.13-P1-I7 7Q-08 to 7Q-0For 7R-xx or 7S-xx EADS 3 - PCI Controller U1.13-P1 7U-58, 7U-5A, 7U-5E PCI Slot 8 Content U1.13-P1-I8 7V-08 to 7V-0For 7W-xx or 7X-xx PCI Slot 9 Content U1.13-P1-I9 7Y-08 to 7Y-0For 7Z-xx or 7a-xx PCI Slot 10 Content U1.13-P1-I10 7e-08 to 7e-0For 7f-xx or 7g-xx 100 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Riser Card U1.13-P1 I/O Port Connector U1.13-P1/Q1 I/O Port Cable From GX1A-port 1 U1.13-P1/Q1# I/O Port Connector U1.13-P1/Q2 I/O Port Cable to port 0 on next Riser card U1.13-P2.1/Q1 U1.13-P1/Q2# I/O Subsystem Right IO backplane assembly U1.13-P2 I/O Subsystem Right IO backplane assembly VPD U1.13-P2-N1 EADS 1 - PCI Controller U1.13-P2 7j-58, 7j-5A, 7j-5C, 7j-5E PCI Slot 1 U1.13-P2-I1 7k-08 to 7k-0For 7m-xx or 7n-xx PCI Slot 2 U1.13-P2-I2 7p-08 to 7p-0For 7q-xx or 7r-xx PCI Slot 3 U1.13-P2-I3 7s-08 to 7s-0For 7t-xx or 7u-xx PCI Slot 4 U1.13-P2-I4 7v-08 to 7v-0For 7w-xx or 7x-xx EADS 2 - PCI Controller U1.13-P2 80-58, 80-5A, 80-5E PCI Slot 5 U1.13-P2-I5 81-08 to 81-0For 82-xx or 83-xx PCI Slot 6 U1.13-P2-I6 84-08 to 84-0For 85-xx or 86-xx PCI Slot 7 U1.13-P2-I7 8A-08 to 8A-0For 8B-xx or 8C-xx EADS 3 - PCI Controller U1.13-P2 8E-58, 8E-5A, 8E-5E PCI Slot 8 U1.13-P2-I8 8F-08 to 8F-0For 8G-xx or 8H-xx PCI Slot 9 U1.13-P2-I9 8J-08 to 8J-0For 8K-xx or 8L-xx PCI Slot 10 U1.13-P2-I10 8Q-08 to 8Q-0For 8R-xx or 8S-xx I/O Riser Card U1.13-P2 I/O Port Connector U1.13-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U1.13-P2/Q1# I/O Port Connector U1.13-P2/Q2 I/O Port Cable to GX1A-port 2 U1.13-P2/Q2# I/O Subsystem DCA 1 U1.13-V1 Chapter 1. Reference Information 101 FRU Name Location Code AIX Location Code Physical Connection Logical Connection DCA 1 to (BPA B) BPD1- 1A Connector P06 U1.13-V1/Q1 DCA 1 to (BPA B) BPD1- 1A Cable U1.13-V1/Q1# DCA 1 to (BPA A) BPD1- 1A Connector P06 U1.13-V1/Q2 DCA 1 to (BPA A) BPD1- 1A Cable U1.13-V1/Q2# Thermal Sensor U1.13-V1 I/O Subsystem DCA 2 U1.13-V2 DCA 2 to (BPA B) BPD1- 1B Connector P07 U1.13-V2/Q1 DCA 2 to (BPA B) BPD1- 1B Cable U1.13-V2/Q1# DCA 2 to (BPA A) BPD1- 1B Connector P07 U1.13-V2/Q2 DCA 2 to (BPA A) BPD1- 1B Cable U1.13-V2/Q2# Thermal Sensor U1.13-V2 I/O Subsystem SCSI controller 1 on P1 U1.13-P1/Z1 7M-08 I/O Subsystem SCSI controller 2 on P1 U1.13-P1/Z2 7b-08 I/O Subsystem SCSI controller 1 on P2 U1.13-P2/Z1 87-08 I/O Subsystem SCSI controller 2 on P2 U1.13-P2/Z2 8M-08 DASD 4-pack Cage and card (1) U1.13-P3 DASD 4-pack Cage and card (1) VPD U1.13-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U1.13-P2/Z2-A8 8M-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U1.13-P2/Z2-A9 8M-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U1.13-P2/Z2-Aa 8M-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U1.13-P2/Z2-Ab 8M-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P2 U1.13-P2/Z2-Af 8M-08-00-15,0 102 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection DASD 4-pack Cage and card (2) U1.13-P4 DASD 4-pack Cage and card (2) VPD U1.13-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U1.13-P2/Z1-A8 87-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U1.13-P2/Z1-A9 87-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U1.13-P2/Z1-Aa 87-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U1.13-P2/Z1-Ab 87-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U1.13-P2/Z1-Af 87-08-00-15,0 DASD 4-pack Cage and card (3) U1.13-P5 DASD 4-pack Cage and card (3) VPD U1.13-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U1.13-P1/Z2-A8 7b-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U1.13-P1/Z2-A9 7b-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U1.13-P1/Z2-Aa 7b-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U1.13-P1/Z2-Ab 7b-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U1.13-P1/Z2-Af 7b-08-00-15,0 DASD 4-pack Cage and card (4) U1.13-P6 DASD 4-pack Cage and card (4) VPD U1.13-P6-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U1.13-P1/Z1-A8 7M-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U1.13-P1/Z1-A9 7M-08-00-9,0 Chapter 1. Reference Information 103 FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U1.13-P1/Z1-Aa 7M-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U1.13-P1/Z1-Ab 7M-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U1.13-P1/Z1-Af 7M-08-00-15,0 I/O Subsystem Fan (MSA) U1.13-F1 I/O Subsystem Fan (MSA) U1.13-F2 I/O Subsystem Fan (MSA) U1.13-F3 I/O Subsystem Blower (MSA) U1.13-F4 Media Drawer Power Connector U1.13-P1-X1/Q3 Media Drawer Power Cable U1.13-P1-X1/Q3# Media Drawer Power Connector U1.13-P2-X2/Q3 Media Drawer Power Cable U1.13-P2-X2/Q3# 104 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 4 Locations (If IBF Installed) Note: If the Integrated Battery Feature (IBF) is not installed, I/O Subsystem 4 is in I/O Subsystem 4 position. See page 100. I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U2.9 I/O Subsystem Left IO backplane assembly U2.9-P1 I/O Subsystem Left IO backplane assembly VPD U2.9-P1-N1 EADS 1 - PCI Controller U2.9-P1 70-58, 70-5A, 70-5C, 70-5E PCI Slot 1 Content U2.9-P1-I1 71-08 to 71-0For 72-xx or 73-xx PCI Slot 2 Content U2.9-P1-I2 74-08 to 74-0For 75-xx or 76-xx PCI Slot 3 Content U2.9-P1-I3 77-08 to 77-0For 78-xx or 79-xx PCI Slot 4 Content U2.9-P1-I4 7A-08 to 7A-0For 7B-xx or 7C-xx EADS 2 - PCI Controller U2.9-P1 7E-58, 7E-5A, 7E-5E PCI Slot 5 Content U2.9-P1-I5 7F-08 to 7F-0For 7G-xx or 7H-xx PCI Slot 6 Content U2.9-P1-I6 7J-08 to 7J-0For 7K-xx or 7L-xx PCI Slot 7 Content U2.9-P1-I7 7Q-08 to 7Q-0For 7R-xx or 7S-xx EADS 3 - PCI Controller U2.9-P1 7U-58, 7U-5A, 7U-5E PCI Slot 8 Content U2.9-P1-I8 7V-08 to 7V-0For 7W-xx or 7X-xx PCI Slot 9 Content U2.9-P1-I9 7Y-08 to 7Y-0For 7Z-xx or 7a-xx PCI Slot 10 Content U2.9-P1-I10 7e-08 to 7e-0For 7f-xx or 7g-xx I/O Riser Card U2.9-P1 I/O Port Connector U2.9-P1/Q1 I/O Port Cable From GX1A-port 3 U2.9-P1/Q1# I/O Port Connector U2.9-P1/Q2 I/O Port Cable to port 0 on next Riser card U2.9-P2/Q1 U2.9-P1/Q2# I/O Subsystem Right IO backplane assembly U2.9-P2 Chapter 1. Reference Information 105 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem Right IO backplane assembly VPD U2.9-P2-N1 EADS 1 - PCI Controller U2.9-P2 7j-58, 7j-5A, 7j-5C, 7j-5E PCI Slot 1 Content U2.9-P2-I1 7k-08 to 7k-0For 7m-xx or 7n-xx PCI Slot 2 Content U2.9-P2-I2 7p-08 to 7p-0For 7q-xx or 7r-xx PCI Slot 3 Content U2.9-P2-I3 7s-08 to 7s-0For 7t-xx or 7u-xx PCI Slot 4 Content U2.9-P2-I4 7v-08 to 7v-0For 7w-xx or 7x-xx EADS 2 - PCI Controller U2.9-P2 80-58, 80-5A, 80-5E PCI Slot 5 Content U2.9-P2-I5 81-08 to 81-0For 82-xx or 83-xx PCI Slot 6 Content U2.9-P2-I6 84-08 to 84-0For 85-xx or 86-xx PCI Slot 7 Content U2.9-P2-I7 8A-08 to 8A-0For 8B-xx or 8C-xx EADS 3 - PCI Controller U2.9-P2 8E-58, 8E-5A, 8E-5E PCI Slot 8 Content U2.9-P2-I8 8F-08 to 8F-0For 8G-xx or 8H-xx PCI Slot 9 Content U2.9-P2-I9 8J-08 to 8J-0For 8K-xx or 8L-xx PCI Slot 10 Content U2.9-P2-I10 8Q-08 to 8Q-0For 8R-xx or 8S-xx I/O Riser Card U2.9-P2 I/O Port Connector U2.9-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U2.9-P2/Q1# I/O Port Connector U2.9-P2/Q2 I/O Port Cable to GX1A-port 2 U2.9-P2/Q2# I/O Subsystem DCA 1 U2.9-V1 DCA 1 to (BPA B) BPD2- 4A Connector P04 U2.9-V1/Q1 DCA 1 to (BPA B) BPD2- 4A Cable U2.9-V1/Q1# DCA 1 to (BPA A) BPD2- 4A Connector P04 U2.9-V1/Q2 DCA 1 to (BPA A) BPD2- 4A Cable U2.9-V1/Q2# Thermal Sensor U2.9-V1 I/O Subsystem DCA 2 U2.9-V2 106 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection DCA 2 to (BPA B) BPD2- 4B Connector P05 U2.9-V2/Q1 DCA 2 to (BPA B) BPD2- 4B Cable U2.9-V2/Q1# DCA 2 to (BPA A) BPD2- 4B Connector P05 U2.9-V2/Q2 DCA 2 to (BPA A) BPD2- 4B Cable U2.9-V2/Q2# Thermal Sensor U2.9-V2 I/O Subsystem SCSI controller 1 on P1 U2.9-P1/Z1 7M-08 I/O Subsystem SCSI controller 2 on P1 U2.9-P1/Z2 7b-08 I/O Subsystem SCSI controller 1 on P2 U2.9-P2/Z1 87-08 I/O Subsystem SCSI controller 2 on P2 U2.9-P2/Z2 8M-08 DASD 4-pack Cage and card (1) U2.9-P3 DASD 4-pack Cage and card (1) VPD U2.9-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U2.9-P2/Z2-A8 8M-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U2.9-P2/Z2-A9 8M-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U2.9-P2/Z2-Aa 8M-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U2.9-P2/Z2-Ab 8M-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P2 U2.9-P2/Z2-Af 8M-08-00-15,0 DASD 4-pack Cage and card (2) U2.9-P4 DASD 4-pack Cage and card (2) VPD U2.9-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U2.9-P2/Z1-A8 87-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U2.9-P2/Z1-A9 87-08-00-9,0 Chapter 1. Reference Information 107 FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U2.9-P2/Z1-Aa 87-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U2.9-P2/Z1-Ab 87-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U2.9-P2/Z1-Af 87-08-00-15,0 DASD 4-pack Cage and card (3) U2.9-P5 DASD 4-pack Cage and card (3) VPD U2.9-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U2.9-P1/Z2-A8 7b-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U2.9-P1/Z2-A9 7b-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U2.9-P1/Z2-Aa 7b-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U2.9-P1/Z2-Ab 7b-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U2.9-P1/Z2-Af 7b-08-00-15,0 DASD 4-pack Cage and card (4) U2.9-P6 DASD 4-pack Cage and card (4) VPD U2.9-P6-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U2.9-P1/Z1-A8 7M-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U2.9-P1/Z1-A9 7M-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U2.9-P1/Z1-Aa 7M-08-0010,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U2.9-P1/Z1-Ab 7M-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U2.9-P1/Z1-Af 7M-08-00-15,0 I/O Subsystem Fan (MSA) U2.9-F1 108 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem Fan (MSA) U2.9-F2 I/O Subsystem Fan (MSA) U2.9-F3 I/O Subsystem Fan (MSA) U2.9-F4 Media Drawer Power Connector U2.9-P1-X1/Q3 Media Drawer Power Cable U2.9-P1-X1/Q3# Media Drawer Power Connector U2.9-P2-X2/Q3 Media Drawer Power Cable U2.9-P2-X2/Q3# FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 5 Locations I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U2.1 I/O Subsystem Left IO backplane assembly U2.1-P1 I/O Subsystem Left IO backplane assembly VPD U2.1-P1-N1 EADS 1 - PCI Controller U2.1-P1 8U-58, 8U-5A, 8U-5C, 8U-5E PCI Slot 1 Content U2.1-P1-I1 8V-08 to 8V-0For 8W-xx or 8X-xx PCI Slot 2 Content U2.1-P1-I2 8Y-08 to 8Y-0For 8Z-xx or 8a-xx PCI Slot 3 Content U2.1-P1-I3 8b-08 to 8b-0For 8c-xx or 8d-xx PCI Slot 4 Content U2.1-P1-I4 8e-08 to 8e-0For 8f-xx or 8g-xx EADS 2 - PCI Controller U2.1-P1 8j-58, 8j-5A, 8j-5E PCI Slot 5 Content U2.1-P1-I5 8k-08 to 8k-0For 8m-xx or 8n-xx PCI Slot 6 Content U2.1-P1-I6 8p-08 to 8p-0For 8q-xx or 8r-xx PCI Slot 7 Content U2.1-P1-I7 8v-08 to 8v-0For 8w-xx or 8x-xx EADS 3 - PCI Controller U2.1-P1 90-58, 90-5A, 90-5E PCI Slot 8 Content U2.1-P1-I8 91-08 to 91-0For 92-xx or 93-xx Chapter 1. Reference Information 109 FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 9 Content U2.1-P1-I9 94-08 to 94-0For 95-xx or 96-xx PCI Slot 10 Content U2.1-P1-I10 9A-08 to 9A-0For 9B-xx or 9C-xx I/O Riser Card U2.1-P1 I/O Port Connector U2.1-P1/Q1 I/O Port Cable From GX1A-port 1 U2.1-P1/Q1# I/O Port Connector U2.1-P1/Q2 I/O Port Cable to port 0 on next Riser card U2.1-P2.1/Q1 U2.1-P1/Q2# I/O Subsystem Right IO backplane assembly U2.1-P2 I/O Subsystem Right IO backplane assembly VPD U2.1-P2-N1 EADS 1 - PCI Controller U2.1-P2 9E-58, 9E-5A, 9E-5C, 9E-5E PCI Slot 1 Content U2.1-P2-I1 9F-08 to 9F-0For 9G-xx or 9H-xx PCI Slot 2 Content U2.1-P2-I2 9J-08 to 9J-0For 9K-xx or 9L-xx PCI Slot 3 Content U2.1-P2-I3 9M-08 to 9M-0For 9N-xx or 9P-xx PCI Slot 4 Content U2.1-P2-I4 9Q-08 to 9Q-0For 9R-xx or 9S-xx EADS 2 - PCI Controller U2.1-P2 9U-58, 9U-5A, 9U-5E PCI Slot 5 Content U2.1-P2-I5 9V-08 to 9V-0For 9W-xx or 9X-xx PCI Slot 6 Content U2.1-P2-I6 9Y-08 to 9Y-0For 9Z-xx or 9a-xx PCI Slot 7 Content U2.1-P2-I7 9e-08 to 91-0For 9Z-xx or 9a-xx EADS 3 - PCI Controller U2.1-P2 9j-58, 9j-5A, 9j-5E PCI Slot 8 Content U2.1-P2-I8 9k-08 to 9k-0For 9m-xx or 9n-xx PCI Slot 9 Content U2.1-P2-I9 9p-08 to 9p-0For 9q-xx or 9r-xx PCI Slot 10 Content U2.1-P2-I10 9v-08 to 9v-0For 9w-xx or 9x-xx I/O Riser Card U2.1-P2 I/O Port Connector U2.1-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U2.1-P2/Q1# 110 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Port Connector U2.1-P2/Q2 I/O Port Cable to GX1A-port 2 U2.1-P2/Q2# I/O Subsystem DCA 1 U2.1-V1 DCA 1 to (BPA B) BPD1- 5A Connector P00 U2.1-V1/Q1 DCA 1 to (BPA B) BPD1- 5A Cable U2.1-V1/Q1# DCA 1 to (BPA A) BPD1- 5A Connector P00 U2.1-V1/Q2 DCA 1 to (BPA A) BPD1- 5A Cable U2.1-V1/Q2# Thermal Sensor U2.1-V1 I/O Subsystem DCA 2 U2.1-V2 DCA 2 to (BPA B) BPD1- 5B Connector P01 U2.1-V2/Q1 DCA 2 to (BPA B) BPD1- 5B Cable U2.1-V2/Q1# DCA 2 to (BPA A) BPD1- 5B Connector P01 U2.1-V2/Q2 DCA 2 to (BPA A) BPD1- 5B Cable U2.1-V2/Q2# Thermal Sensor U2.1-V2 I/O Subsystem SCSI controller 1 on P1 U2.1-P1/Z1 8s-08 I/O Subsystem SCSI controller 2 on P1 U2.1-P1/Z2 97-08 I/O Subsystem SCSI controller 1 on P2 U2.1-P2/Z1 9b-08 I/O Subsystem SCSI controller 2 on P2 U2.1-P2/Z2 9s-08 DASD 4-pack Cage and card (1) U2.1-P3 DASD 4-pack Cage and card (1) VPD U2.1-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U2.1-P2/Z2-A8 9s-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U2.1-P2/Z2-A9 9s-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U2.1-P2/Z2-Aa 9s-08-00-10,0 Chapter 1. Reference Information 111 FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U2.1-P2/Z2-Ab 9s-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P2 U2.1-P2/Z2-Af 9s-08-00-15,0 DASD 4-pack Cage and card (2) U2.1-P DASD 4-pack Cage and card (2) VPD U2.1-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U2.1-P2/Z1-A8 9b-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U2.1-P2/Z1-A9 9b-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U2.1-P2/Z1-Aa 9b-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U2.1-P2/Z1-Ab 9b-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U2.1-P2/Z1-Af 9b-08-00-15,0 DASD 4-pack Cage and card (3) U2.1-P5 DASD 4-pack Cage and card (3) VPD U2.1-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U2.1-P1/Z2-A8 97-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U2.1-P1/Z2-A9 97-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U2.1-P1/Z2-Aa 97-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U2.1-P1/Z2-Ab 97-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U2.1-P1/Z2-Af 97-08-00-15,0 DASD 4-pack Cage and card (4) U2.1-P6 DASD 4-pack Cage and card (4) VPD U2.1-P6-N1 112 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U2.1-P1/Z1-A8 8s-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U2.1-P1/Z1-A9 8s-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U2.1-P1/Z1-Aa 8s-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U2.1-P1/Z1-Ab 8s-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U2.1-P1/Z1-Af 8s-08-00-15,0 I/O Subsystem Fan (MSA) U2.1-F1 I/O Subsystem Fan (MSA) U2.1-F2 I/O Subsystem Fan (MSA) U2.1-F3 I/O Subsystem Fan (MSA) U2.1-F4 Media Drawer Power Connector U2.1-P1-X1/Q3 Media Drawer Power Cable U2.1-P1-X1/Q3# Media Drawer Power Connector U2.1-P2-X2/Q3 Media Drawer Power Cable U2.1-P2-X2/Q3# Chapter 1. Reference Information 113 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 6 Locations I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U2.5 I/O Subsystem Left IO backplane assembly U2.5-P1 I/O Subsystem Left IO backplane assembly VPD U2.5-P1-N1 EADS 1 - PCI Controller U2.5-P1 A0-58, A0-5A, A0-5C, A0-5E PCI Slot 1 Content U2.5-P1-I1 A1-08 to A1-0For A2-xx or A3-xx PCI Slot 2 Content U2.5-P1-I2 A4-08 to A4-0For A5-xx or A6-xx PCI Slot 3 Content U2.5-P1-I3 A7-08 to A7-0For A8-xx or A9-xx PCI Slot 4 Content U2.5-P1-I4 AA-08 to AA-0For AB-xx or AC-xx EADS 2 - PCI Controller U2.5-P1 AE-58, AE-5A, AE-5E PCI Slot 5 Content U2.5-P1-I5 AF-08 to AF-0For AG-xx or AH-xx PCI Slot 6 Content U2.5-P1-I6 AJ-08 to AJ-0For AK-xx or AL-xx PCI Slot 7 Content U2.5-P1-I7 AQ-08 to AQ-0For AR-xx or AS-xx EADS 3 - PCI Controller U2.5-P1 AU-58, AU-5A, AU-5E PCI Slot 8 Content U2.5-P1-I8 AV-08 to AV-0For AW-xx or AX-xx PCI Slot 9 Content U2.5-P1-I9 AY-08 to AY-0For AZ-xx or Aa-xx PCI Slot 10 Content U2.5-P1-I10 Ae-08 to Ae-0For Af-xx or Ag-xx I/O Riser Card U2.5-P1 I/O Port Connector U2.5-P1/Q1 I/O Port Cable From GX1A-port 1 U2.5-P1/Q1# I/O Port Connector U2.5-P1/Q2 I/O Port Cable to port 0 on next Riser card U2.5-P2.1/Q1 U2.5-P1/Q2# I/O Subsystem Right IO backplane assembly U2.5-P2 I/O Subsystem Right IO backplane assembly VPD U2.5-P2-N1 114 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection EADS 1 - PCI Controller U2.5-P2 Aj-58, Aj-5A, Aj-5C, Aj-5E PCI Slot 1 Content U2.5-P2-I1 Ak-08 to Ak-0For Am-xx or An-xx PCI Slot 2 Content U2.5-P2-I2 Ap-08 to Ap-0For Aq-xx or Ar-xx PCI Slot 3 Content U2.5-P2-I3 As-08 to As-0For At-xx or Au-xx PCI Slot 4 Content U2.5-P2-I4 Av-08 to Av-0For Aw-xx or Ax-xx EADS 2 - PCI Controller U2.5-P2 B0-58, B0-5A, B0-5E PCI Slot 5 Content U2.5-P2-I5 B1-08 to B1-0For B2-xx or B3-xx PCI Slot 6 Content U2.5-P2-I6 B4-08 to B4-0For B5-xx or B6-xx PCI Slot 7 Content U2.5-P2-I7 BA-08 to BA-0For BB-xx or BC-xx EADS 3 - PCI Controller U2.5-P2 BE-58, BE-5A, BE-5E PCI Slot 8 Content U2.5-P2-I8 BF-08 to BF-0For BG-xx or BH-xx PCI Slot 9 Content U2.5-P2-I9 BJ-08 to BJ-0For BK-xx or BL-xx PCI Slot 10 Content U2.5-P2-I10 BQ-08 to BQ-0For BR-xx or BS-xx I/O Riser Card U2.5-P2 I/O Port Connector U2.5-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U2.5-P2/Q1# I/O Port Connector U2.5-P2/Q2 I/O Port Cable to GX1A-port 2 U2.5-P2/Q2# I/O Subsystem DCA 1 U2.5-V1 DCA 1 to (BPA B) BPD2- 6A Connector P02 U2.5-V1/Q1 DCA 1 to (BPA B) BPD2- 6A Cable U2.5-V1/Q1# DCA 1 to (BPA A) BPD2- 6A Connector P02 U2.5-V1/Q2 DCA 1 to (BPA A) BPD2- 6A Cable U2.5-V1/Q2# Thermal Sensor U2.5-V1 I/O Subsystem DCA 2 U2.5-V2 Chapter 1. Reference Information 115 FRU Name Location Code AIX Location Code Physical Connection Logical Connection DCA 2 to (BPA B) BPD2- 6B Connector P03 U2.5-V2/Q1 DCA 2 to (BPA B) BPD2- 6B Cable U2.5-V2/Q1# DCA 2 to (BPA A) BPD2- 6B Connector P03 U2.5-V2/Q2 DCA 2 to (BPA A) BPD2- 6B Cable U2.5-V2/Q2# Thermal Sensor U2.5-V2 I/O Subsystem SCSI controller 1 on P1 U2.5-P1/Z1 AM-08 I/O Subsystem SCSI controller 2 on P1 U2.5-P1/Z2 Ab-08 I/O Subsystem SCSI controller 1 on P2 U2.5-P2/Z1 B7-08 I/O Subsystem SCSI controller 2 on P2 U2.5-P2/Z2 BM-08 DASD 4-pack Cage and card (1) U2.5-P3 DASD 4-pack Cage and card (1) VPD U2.5-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U2.5-P2/Z2-A8 BM-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U2.5-P2/Z2-A9 BM-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U2.5-P2/Z2-Aa BM-08-00-A,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U2.5-P2/Z2-Ab BM-08-00-B,0 SCSI Enclosure Services SES connected to controller 2 on P2 U2.5-P2/Z2-Af BM-08-00-F,0 DASD 4-pack Cage and card (2) U2.5-P4 DASD 4-pack Cage and card (2) VPD U2.5-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U2.5-P2/Z1-A8 B7-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U2.5-P2/Z1-A9 B7-08-00-9,0 116 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U2.5-P2/Z1-Aa B7-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U2.5-P2/Z1-Ab B7-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U2.5-P2/Z1-Af B7-08-00-15,0 DASD 4-pack Cage and card (3) U2.5-P5 DASD 4-pack Cage and card (3) VPD U2.5-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U2.5-P1/Z2-A8 Ab-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U2.5-P1/Z2-A9 Ab-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U2.5-P1/Z2-Aa Ab-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U2.5-P1/Z2-Ab Ab-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U2.5-P1/Z2-Af Ab-08-00-15,0 DASD 4-pack Cage and card (4) U2.5-P6 DASD 4-pack Cage and card (4) VPD U2.5-P6-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U2.5-P1/Z1-A8 AM-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U2.5-P1/Z1-A9 AM-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U2.5-P1/Z1-Aa AM-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U2.5-P1/Z1-Ab AM-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U2.5-P1/Z1-Af AM-08-00-15,0 I/O Subsystem Fan (MSA) U2.5-F1 Chapter 1. Reference Information 117 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem Fan (MSA) U2.5-F2 I/O Subsystem Fan (MSA) U2.5-F3 I/O Subsystem Fan (MSA) U2.5-F4 Media Drawer Power Connector U2.5-P1-X1/Q3 Media Drawer Power Cable U2.5-P1-X1/Q3# Media Drawer Power Connector U2.5-P2-X2/Q3 Media Drawer Power Cable U2.5-P2-X2/Q3# FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 7 Locations I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U2.13 I/O Subsystem Left IO backplane assembly U2.13-P1 I/O Subsystem Left IO backplane assembly VPD U2.13-P1-N1 EADS 1 - PCI Controller U2.13-P1 BU-58, BU-5A, BU-5C, BU-5E PCI Slot 1 Content U2.13-P1-I1 BV-08 to BV-0For BW-xx or BX-xx PCI Slot 2 Content U2.13-P1-I2 BY-08 to BY-0F or BZ-xx or Ba-xx PCI Slot 3 Content U2.13-P1-I3 Bb-08 to Bb-0F or Bc-xx or Bd-xx PCI Slot 4 Content U2.13-P1-I4 Be-08 to Be-0F or Bf-xx or Bg-xx EADS 2 - PCI Controller U2.13-P1 Bj-58, Bj-5A, Bj-5E PCI Slot 5 Content U2.13-P1-I5 Bk-08 to Bk-0F or Bm-xx or Bn-xx PCI Slot 6 Content U2.13-P1-I6 Bp-08 to Bp-0F or Bq-xx or Br-xx PCI Slot 7 Content U2.13-P1-I7 Bv-08 to Bv-0F or Bw-xx or Bx-xx EADS 3 - PCI Controller U2.13-P1 C0-58, C0-5A, C0-5E PCI Slot 8 Content U2.13-P1-I8 C1-08 to C1-0F or C2-xx or C3-xx 118 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 9 Content U2.13-P1-I9 C4-08 to C4-0F or C5-xx or C6-xx PCI Slot 10 Content U2.13-P1-I10 CA-08 to CA-0F or CB-xx or CC-xx I/O Riser Card U2.13-P1 I/O Port Connector U2.13-P1/Q1 I/O Port Cable From GX1A-port 1 U2.13-P1/Q1# I/O Port Connector U2.13-P1/Q2 I/O Port Cable to port 0 on next Riser card U2.13-P2.1/Q1 U2.13-P1/Q2# I/O Subsystem Right IO backplane assembly U2.13-P2 I/O Subsystem Right IO backplane assembly VPD U2.13-P2-N1 EADS 1 - PCI Controller U2.13-P2 CE-58, CE-5A, CE-5C, CE-5E PCI Slot 1 Content U2.13-P2-I1 CF-08 to CF-0F or CG-xx or CH-xx PCI Slot 2 Content U2.13-P2-I2 CJ-08 to CJ-0F or CK-xx or CL-xx PCI Slot 3 Content U2.13-P2-I3 CM-08 to CM-0F or CN-xx or CP-xx PCI Slot 4 Content U2.13-P2-I4 CQ-08 to CQ-0F or CR-xx or CS-xx EADS 2 - PCI Controller U2.13-P2 CU-58, CU-5A, CU-5E PCI Slot 5 Content U2.13-P2-I5 CV-08 to CV-0F or CW-xx or CX-xx PCI Slot 6 Content U2.13-P2-I6 CY-08 to CY-0F or CZ-xx or Ca-xx PCI Slot 7 Content U2.13-P2-I7 Ce-08 to Ce-0F or Cf-xx or Cg-xx EADS 3 - PCI Controller U2.13-P2 Cj-58, Cj-5A, Cj-5E PCI Slot 8 Content U2.13-P2-I8 Ck-08 to Ck-0F or Cm-xx or Cn-xx PCI Slot 9 Content U2.13-P2-I9 Cp-08 to Cp-0F or Cq-xx or Cr-xx PCI Slot 10 Content U2.13-P2-I10 Cv-08 to Cv-0F or Cw-xx or Cx-xx I/O Riser Card U2.13-P2 I/O Port Connector U2.13-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U2.13-P2/Q1# Chapter 1. Reference Information 119 FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Port Connector U2.13-P2/Q2 I/O Port Cable to GX1A-port 2 U2.13-P2/Q2# I/O Subsystem DCA 1 U2.13-V1 DCA 1 to (BPA B) BPD2- 7A Connector P02 U2.13-V1/Q1 DCA 1 to (BPA B) BPD2- 7A Cable U2.13-V1/Q1# DCA 1 to (BPA A) BPD2- 7A Connector P02 U2.13-V1/Q2 DCA 1 to (BPA A) BPD2- 7A Cable U2.13-V1/Q2# Thermal Sensor U2.13-V1 I/O Subsystem DCA 2 U2.13-V2 DCA 2 to (BPA B) BPD2- 7B Connector P03 U2.13-V2/Q1 DCA 2 to (BPA B) BPD2- 7B Cable U2.13-V2/Q1# DCA 2 to (BPA A) BPD2- 7B Connector P03 U2.13-V2/Q2 DCA 2 to (BPA A) BPD2- 7B Cable U2.13-V2/Q2# Thermal Sensor U2.13-V2 I/O Subsystem SCSI controller 1 on P1 U2.13-P1/Z1 Bs-08 I/O Subsystem SCSI controller 2 on P1 U2.13-P1/Z2 C7-08 I/O Subsystem SCSI controller 1 on P2 U2.13-P2/Z1 Cb-08 I/O Subsystem SCSI controller 2 on P2 U2.13-P2/Z2 Cs-08 DASD 4-pack Cage and card (1) U2.13-P3 DASD 4-pack Cage and card (1) VPD U2.13-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U2.13-P2/Z2-A8 Cs-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U2.13-P2/Z2-A9 Cs-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U2.13-P2/Z2-Aa Cs-08-00-10,0 120 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U2.13-P2/Z2-Ab Cs-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P2 U2.13-P2/Z2-Af Cs-08-00-15,0 DASD 4-pack Cage and card (2) U2.13-P4 DASD 4-pack Cage and card (2) VPD U2.13-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U2.13-P2/Z1-A8 Cb-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U2.13-P2/Z1-A9 Cb-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U2.13-P2/Z1-Aa Cb-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U2.13-P2/Z1-Ab Cb-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U2.13-P2/Z1-Af Cb-08-00-15,0 DASD 4-pack Cage and card (3) U2.13-P5 DASD 4-pack Cage and card (3) VPD U2.13-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U2.13-P1/Z2-A8 C7-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U2.13-P1/Z2-A9 C7-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U2.13-P1/Z2-Aa C7-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U2.13-P1/Z2-Ab C7-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U2.13-P1/Z2-Af C7-08-00-15,0 DASD 4-pack Cage and card (4) U2.13-P6 DASD 4-pack Cage and card (4) VPD U2.13-P6-N1 Chapter 1. Reference Information 121 FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U2.13-P1/Z1-A8 Bs-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U2.13-P1/Z1-A9 Bs-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U2.13-P1/Z1-Aa Bs-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U2.13-P1/Z1-Ab Bs-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U2.13-P1/Z1-Af Bs-08-00-15,0 I/O Subsystem Fan (MSA) U2.13-F1 I/O Subsystem Fan (MSA) U2.13-F2 I/O Subsystem Fan (MSA) U2.13-F3 I/O Subsystem Fan (MSA) U2.13-F4 Media Drawer Power Connector U2.13-P1-X1/Q3 Media Drawer Power Cable U2.13-P1-X1/Q3# Media Drawer Power Connector U2.13-P2-X2/Q3 Media Drawer Power Cable U2.13-P2-X2/Q3# FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem 8 Locations I/O Subsystem 1 Chassis & Midplane Card (MT/M Serial #) U2.19 I/O Subsystem Left IO backplane assembly U2.19-P1 I/O Subsystem Left IO backplane assembly VPD U2.19-P1-N1 EADS 1 - PCI Controller U2.19-P1 D0-58, D0-5A, D0-5C, D0-5E PCI Slot 1 Content U2.19-P1-I1 D1-08 to D1-0F or D2-xx or D3-xx 122 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 2 Content U2.19-P1-I2 D4-08 to D4-0F or D5-xx or D6-xx PCI Slot 3 Content U2.19-P1-I3 D7-08 to D7-0F or D8-xx or D9-xx PCI Slot 4 Content U2.19-P1-I4 DA-08 to DA-0F or DB-xx or DC-xx EADS 2 - PCI Controller U2.19-P1 DE-58, DE-5A, DE-5E PCI Slot 5 Content U2.19-P1-I5 DF-08 to DF-0F or DG-xx or DH-xx PCI Slot 6 Content U2.19-P1-I6 DJ-08 to DJ-0F or DK-xx or DL-xx PCI Slot 7 Content U2.19-P1-I7 DQ-08 to DQ-0F or DR-xx or DS-xx EADS 3 - PCI Controller U2.19-P1 DU-58, DU-5A, DU-5E PCI Slot 8 Content U2.19-P1-I8 DV-08 to DV-0F or DW-xx or DX-xx PCI Slot 9 Content U2.19-P1-I9 DY-08 to DY-0F or DZ-xx or Da-xx PCI Slot 10 Content U2.19-P1-I10 De-08 to De-0F or Df-xx or Dg-xx I/O Riser Card U2.19-P1 I/O Port Connector U2.19-P1/Q1 I/O Port Cable From GX1A-port 1 U2.19-P1/Q1# I/O Port Connector U2.19-P1/Q2 I/O Port Cable to port 0 on next Riser card U2.19-P2.1/Q1 U2.19-P1/Q2# I/O Subsystem Right IO backplane assembly U2.19-P2 I/O Subsystem Right IO backplane assembly VPD U2.19-P2-N1 EADS 1 - PCI Controller U2.19-P2 Dj-58, Dj-5A, Dj-5C, Dj-5E PCI Slot 1 Content U2.19-P2-I1 Dk-08 to Dk-0F or Dm-xx or Dn-xx PCI Slot 2 Content U2.19-P2-I2 Dp-08 to Dp-0F or Dq-xx or Dr-xx PCI Slot 3 Content U2.19-P2-I3 Ds-08 to Ds-0F or Dt-xx or Du-xx PCI Slot 4 Content U2.19-P2-I4 Dv-08 to Dv-0F or Dw-xx or Dx-xx EADS 2 - PCI Controller U2.19-P2 E0-58, E0-5A, E0-5E Chapter 1. Reference Information 123 FRU Name Location Code AIX Location Code Physical Connection Logical Connection PCI Slot 5 Content U2.19-P2-I5 E1-08 to E1-0F or E2-xx or E3-xx PCI Slot 6 Content U2.19-P2-I6 E4-08 to E4-0F or E5-xx or E6-xx PCI Slot 7 Content U2.19-P2-I7 EA-08 to EA-0F or EB-xx or EC-xx EADS 3 - PCI Controller U2.19-P2 EE-58, EE-5A, EE-5E PCI Slot 8 Content U2.19-P2-I8 EF-08 to EF-0F or EG-xx or EH-xx PCI Slot 9 Content U2.19-P2-I9 EJ-08 to EJ-0F or EK-xx or EL-xx PCI Slot 10 Content U2.19-P2-I10 EQ-08 to EQ-0F or ER-xx or ES-xx I/O Riser Card U2.19-P2 I/O Port Connector U2.19-P2/Q1 I/O Port Cable From Riser port 1 P1.1/Q1 U2.19-P2/Q1# I/O Port Connector U2.19-P2/Q2 I/O Port Cable to GX1A-port 2 U2.19-P2/Q2# I/O Subsystem DCA 1 U2.19-V1 DCA 1 to (BPA B) BPD2- 8A Connector P02 U2.19-V1/Q1 DCA 1 to (BPA B) BPD2- 8A Cable U2.19-V1/Q1# DCA 1 to (BPA C) BPD2- 8A Connector P02 U2.19-V1/Q2 DCA 1 to (BPA C) BPD2- 8A Cable U2.19-V1/Q2# Thermal Sensor U2.19-V1 I/O Subsystem DCA 2 U2.19-V2 DCA 2 to (BPA B) BPD2- 8B Connector P03 U2.19-V2/Q1 DCA 2 to (BPA B) BPD2- 8B Cable U2.19-V2/Q1# DCA 2 to (BPA A) BPD2- 8B Connector P03 U2.19-V2/Q2 DCA 2 to (BPA A) BPD2- 8B Cable U2.19-V2/Q2# Thermal Sensor U2.19-V2 I/O Subsystem SCSI controller 1 on P1 U2.19-P1/Z1 DM-08 I/O Subsystem SCSI controller 2 on P1 U2.19-P1/Z2 Db-08 124 406/90 Service Guide FRU Name Location Code AIX Location Code Physical Connection Logical Connection I/O Subsystem SCSI controller 1 on P2 U2.19-P2/Z1 E7-08 I/O Subsystem SCSI controller 2 on P2 U2.19-P2/Z2 EM-08 DASD 4-pack Cage and card (1) U2.19-P3 DASD 4-pack Cage and card (1) VPD U2.19-P3-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P2 U2.19-P2/Z2-A8 EM-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P2 U2.19-P2/Z2-A9 EM-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P2 U2.19-P2/Z2-Aa EM-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P2 U2.19-P2/Z2-Ab EM-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P2 U2.19-P2/Z2-Af EM-08-00-15,0 DASD 4-pack Cage and card (2) U2.19-P4 DASD 4-pack Cage and card (2) VPD U2.19-P4-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P2 U2.19-P2/Z1-A8 E7-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P2 U2.19-P2/Z1-A9 E7-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P2 U2.19-P2/Z1-Aa E7-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P2 U2.19-P2/Z1-Ab E7-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P2 U2.19-P2/Z1-Af E7-08-00-15,0 DASD 4-pack Cage and card (3) U2.19-P5 DASD 4-pack Cage and card (3) VPD U2.19-P5-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 2 on P1 U2.19-P1/Z2-A8 Db-08-00-8,0 Chapter 1. Reference Information 125 FRU Name Location Code AIX Location Code Physical Connection Logical Connection SCSI DASD 2 hdisk at ID 9 connected to controller 2 on P1 U2.19-P1/Z2-A9 Db-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 2 on P1 U2.19-P1/Z2-Aa Db-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 2 on P1 U2.19-P1/Z2-Ab Db-08-00-11,0 SCSI Enclosure Services SES connected to controller 2 on P1 U2.19-P1/Z2-Af Db-08-00-15,0 DASD 4-pack Cage and card (4) U2.19-P6 DASD 4-pack Cage and card (4) VPD U2.19-P6-N1 SCSI DASD 1 hdisk at ID 8 connected to controller 1 on P1 U2.19-P1/Z1-A8 DM-08-00-8,0 SCSI DASD 2 hdisk at ID 9 connected to controller 1 on P1 U2.19-P1/Z1-A9 DM-08-00-9,0 SCSI DASD 3 hdisk at ID A connected to controller 1 on P1 U2.19-P1/Z1-Aa DM-08-00-10,0 SCSI DASD 4 hdisk at ID B connected to controller 1 on P1 U2.19-P1/Z1-Ab DM-08-00-11,0 SCSI Enclosure Services SES connected to controller 1 on P1 U2.19-P1/Z1-Af DM-08-00-15,0 I/O Subsystem Fan (MSA) U2.19-F1 I/O Subsystem Fan (MSA) U2.19-F2 I/O Subsystem Fan (MSA) U2.19-F3 I/O Subsystem Fan (MSA) U2.19-F4 Media Drawer Power Connector U2.19-P1-X1/Q3 Media Drawer Power Cable U2.19-P1-X1/Q3# Media Drawer Power Connector U2.19-P2-X2/Q3 Media Drawer Power Cable U2.19-P2-X2/Q3# 126 406/90 Service Guide System Requirements for Adding Processor MCMs Processor MCM Total BUS Pass-Thru Modules (FC5257) Required, Locations Total DCA, Power Converter (FC 6170 and 6189) Required, Locations Total Capacitor Books (FC 6198) Required Locations L3 Cache Plug Positions VPD Locations Memory Book Plug Locations Add 1st MCM Position C1 Note: 1.7 GHz MCMs require 1x FC 6189 at location U1.18-P1-V4. All other MCM speeds do not require FC 6189. Qty = 0 FC 6170 (2X) U1.18-P1-V1 U1.18-P1-V2 Qty = 1 U1.18-P1-V3 C6, C7, C10, C11 U1.18-P1-H2.3 U1.18-P1-M2 U1.18-P1-M3 Add 2nd MCM Position C4 Note: 1.7 GHz MCMs require additional FC 6189 (2x total) at location U1.18-P1-V5. All other MCM speeds require 1x FC 6189 as shown. Qty = 2 U1.18-P1-C3 U1.18-P1-C2 FC 6170 (2X) U1.18-P1-V1 U1.18-P1-V2 FC 6189 (1X) U1.18-P1-V4 Qty = 1 U1.18-P1-V3 C14, C15, C18, C19 U1.18-P1-H2.4 U1.18-P1-M7 U1.18-P1-M6 Add 3rd MCM Position C3 Note: 1.7 GHz MCMs require additional FC 6189 (3x total) at location U1.18-P1-V6. All other MCM speeds require 2x FC 6189 as shown. Qty = 1 U1.18-P1-C2 FC 6170 (2X) U1.18-P1-V1 U1.18-P1-V2 FC 6189 (2X) U1.18-P1-V4 U1.18-P1-V5 Qty = 2 U1.18-P1-V3 U1.18-P1-V7 C8, C12, C16, C20 U1.18-P1-H2.5 U1.18-P1-M4 U1.18-P1-M8 Chapter 1. Reference Information 127 Add 4th MCM Position C2 Note: 1.7 GHz MCMs require additional FC 6189 (4x total) at location U1.18-P1-V8. All other MCM speeds require (3x) FC 6189 as shown. Qty = 0 FC 6170 (2X) U1.18-P1-V1 U1.18-P1-V2 FC 6189 (3X) U1.18-P1-V4 U1.18-P1-V5 U1.18-P1-V6 Qty = 2 U1.18-P1-V3 U1.18-P1-V7 C5, C9, C13, C17 U1.18-P1-H2.6 U1.18-P1-M5 U1.18-P1-M1 Number of Bulk Power Regulators (BPR) FC 6186 Required as a Function of MCMs and I/O Drawers Number of I/O Drawers MCM Sizes and Quantities 1 MCM(1.1 or 1.3 GHz) 1 MCM(1.5, 1.7, 1.9 GHz) 2 MCMs(1.1 or 1.3 GHz) 2 MCMs(1.5, 1.7, 1.9 GHz) 0 1 1 1 1 1 1 1 1 1* 2 1 1 1 2 3 2 2 4 2 2 * = MDA-G would require an additional BPR Number of I/O Drawers MCM Sizes and Quantities 3 MCM(1.1 or 1.3 GHz) 3 MCM(1.5, 1.7, 1.9 GHz) 4 MCM(1.1 or 1.3 GHz) 4 MCM(1.5, 1.7, 1.9 GHz) 0 1 2 2 2 1 2 2 2 2 2 2 2 2 2 3 2 2 2 2* 4 2 2 2 3 5 2 2 3 3 6 3 2* 3 3 7 3 3 3 3 8 3 3 3 * = MDA-G would require an additional BPR 128 406/90 Service Guide Specifications This section contains specifications for the 406/90 system. 406/90 Physical Specifications and Loads The following tables illustrate the physical, electrical and thermal, acoustical, and environmental characteristics of various 406/90 system configurations. Power and Electrical Requirements Redundant power line cords are standard on the 406/90. The system uses dual ac power cords. For maximum availability, each of the power cords should be fed from independent power grids. The following table illustrates electrical and thermal characteristics for the 406/90. Electrical/Thermal Characteristic Rated Voltage (V ac, 3 phase) 200 to 240 380 to 415 480 Rated Current (A, per phase) 45 25 20 Frequency (Hertz) 50 to 60 50 to 60 50 to 60 Power (Maximum in kVA) 15.7 15.7 15.7 Typical, full load power factor (pf) 0.99 0.97 0.93 Inrush current (Amps) 162 maximum (see note below) Thermal output (Maximum kBtu/hr) 53.3 53.3 53.3 Notes: 1. Inrush currents occur only at initial application of power (very short duration for charging capacitors). No inrush currents occur during the normal power off-on cycle. 2. System will function normally with a nominal input voltage in the range of 200-480 V, AC, three phase. The following table illustrates the line cord options for the 406/90 with their geographic, breaker rating, and cord information. 3-Phase Supply Voltage (50/60 Hz) 200-240 V 380-415 V 480 V Geography United States, Canada, Japan Europe, Middle East, Africa, Asia Pacific United States, Canada Customer Circuit Breaker Rating (see Note 1 below) 60 A 30 A 30 A Cord Information 6 and 14 foot, 6 AWG line cord 14 foot, 6 or 8 AWG line cord, (electrician installed) 6 and 14 foot, 10 AWG line cord Recommended Receptacle IEC309, 60 A, type 460R9W (not provided) Not specified, electrician installed IEC309, 30 A, type 430R7W (not provided) Notes: 1. The exact circuit breaker ratings may not be available in all countries. Where the specified circuit breaker ratings are not acceptable, use the nearest available rating. Use of a time delayed circuit breaker is recommended. 2. In two-frame systems, frame B receives its power from frame A. The power to frame B is 350 V dc fed from the BPD through UPIC cables. Phase Imbalance and BPR Configuration: Depending on the number of Bulk Power Regulators (BPRs) in your system, phase imbalance can occur in line currents. All systems are provided with two bulk power assemblies (BPAs), with separate line cords. Phase currents will be divided between two line cords in Chapter 1. Reference Information 129 normal operation. The following table illustrates phase imbalance as a function of BPR configuration. Number of BPRs per BPA Phase A Line Current Phase B Line Current Phase C Line Current 1 Power / Vline Power / Vline 0 2 0.5 Power / Vline 0.866 Power / Vline 0.5 Power / Vline 3 0.577 Power / Vline 0.577 Power / Vline 0.577 Power / Vline Note: Power is calculated from “Total System Power Consumption” on page 139. Vline is line-to-line nominal input voltage. Balancing Power Panel Loads: The system requires three-phase power. Depending on the system configuration, the phase currents can be fully balanced or unbalanced. System configurations with three BPRs per BPA have balanced power panel loads, while configurations with only one or two BPRs per BPA have unbalanced loads: v With two BPRs per BPA, two of the three phases will draw an equal amount of current, and will be, nominally, 57.8% of the current on the third phase. v With one BPR per BPA, two of three phases will carry an equal amount of current, with no current drawn on the third phase. The following figure is an example of feeding several loads of this type from two power panels in a way that balances the load among the three phases. The method illustrated in the preceding figure requires that the connection from the three poles of each breaker to the three phase pins of a connector be varied. Some electricians may prefer to maintain a consistent wiring sequence from the breakers to the connectors. The following figure shows a way to balance the load without changing the wiring on the output of any breakers. The three-pole breakers are alternated with single-pole breakers, so that the three-pole breakers do not all begin on Phase A. 130 406/90 Service Guide The following figure shows another way of distributing the unbalanced load evenly. Using this method, the three-pole breakers are alternated with two-pole breakers. Chapter 1. Reference Information 131 Power Cord Configuration: The power cords exit the system from different points of the frame as indicated in the following illustration.AAllSingle-FrameSystems(TopDownView)ABAllDouble-FrameSystems(TopDownView)FrontFrontRearRear 132 406/90 Service Guide Checking the Facility Outlets and Power Source: CAUTION: Do not touch the receptacle or the receptacle faceplate with anything other than your test probes before you have met the requirements in “Checking the Facility Outlets and Power Source” below. Performing the following will ensure that appropriate power will be used by the 406/90. The following checklist is for reference purposes, and will likely be performed by a service engineer prior to installation. __ 1. The 406/90 is equipped to use 200-240 V / 380-415 V / 480 V ac, three-phase. Check that the correct power source is available. __ 2. Before system installation, locate and turn off the branch circuit CB (circuit breaker). Attach tag S229-0237, which reads “Do Not Operate.” Note: All measurements are made with the receptacle faceplate in the normally installed position. __ 3. Some receptacles are enclosed in metal housings. On receptacles of this type, perform the following steps: a. Check for less than 1 volt from the receptacle case to any grounded metal structure in the building, such as a raised-floor metal structure, water pipe, building steel, or similar structure. b. Check for less than 1 volt from receptacle ground pin to a grounded point in the building.Note: If the receptacle case or faceplate is painted, be sure the probe tip penetrates the paint and makes good electrical contact with the metal. __ 4. Check the resistance from the ground pin of the receptacle to the receptacle case. Check resistance from the ground pin to building ground. The reading should be less than 1.0 ohm, which indicates the presence of a continuous grounding conductor. __ 5. If any of the checks made in substeps 2 and 3 are not correct, remove the power from the branch circuit and make the wiring corrections; then check the receptacle again. Note: Do not use the digital multimeter to measure grounding resistance. __ 6. Check for infinite resistance between the phase pins. This is a check for a wiring short. CAUTION: If the reading is other than infinity, do not proceed! You must make the necessary wiring corrections to satisfy the above criteria before continuing. Do not turn on the branch circuit CB until all the above steps are satisfactorily completed. __ 7. Remove tag S229-0237, which reads “Do Not Operate.” __ 8. Turn on the branch circuit CB. Measure for appropriate voltages between phases. If no voltage is present on the receptacle case or grounded pin, the receptacle is safe to touch. __ 9. With an appropriate meter, verify that the voltage at the outlet is correct. __ 10. Verify that the grounding impedance is correct by using the ECOS 1020, 1023, B7106, or an appropriately approved ground impedance tester. __ 11. Turn off the branch circuit CB. __ 12. Attach tag S229-0237, which reads “Do Not Operate.” __ 13. You are now ready to install and connect the power cables to the 406/90.Additional Installation Considerations: In the United States, installation must be made in accordance with Article 645 of the National Electric Code (NEC). In Canada, installation must be in accordance with Article 12-020 of the Canadian Electrical Code (CEC). Dimensions and Weight Physical Characteristic Slimline Doors Acoustical Doors 1 Frame 2 Frames 1 Frame 2 Frames Chapter 1. Reference Information 133 Height 2025 mm (79.72 in.) 2025 mm (79.72 in.) 2025 mm (79.72 in.) 2025 mm (79.72 in.) Width 785 mm (30.91 in.) 1575 mm (62.00 in.) 785 mm (30.91 in.) 1575 mm (62.00 in.) Depth 1342 mm (52.83 in.) 1342 mm (52.83 in.) 1494 mm (58.83 in.) 1494 mm (58.83 in.) Weight (maximum configuration) 1170 kg (2580 lbs.) 1973 kg (4349 lbs.) 1184 kg (2610 lbs.) 2000 kg (4409 lbs.) Notes: 1. Doors are not installed during product shipment to the customer. A maximum configured system with batteries may exceed 1134 kg (2500 lbs.). 2. When moving or relocating certain configurations of the system, the Bulk Power Regulators (BPRs) must be removed from the top of the rack (front and rear) to ensure product stability. Specifically, removal of BPRs from frame A and B in the front and rear is required in systems that have a single I/O drawer, with more than 2 BPRs installed per BPA in the primary rack. System Weights by Configuration Total System Weight (Pounds) Number of I/O Subsystems 1 2 3 4 5 6 7 8 Slimline Doors With IBF 2250 2415 2580 3633 3854 4019 4184 4349 Slimline Doors Without IBF 1865 2030 2195 2418 3266 3431 3596 3761 Acoustical Doors With IBF 2280 2445 2610 3693 3914 4079 4244 4409 Acoustical Doors Without IBF 1923 2088 2253 2506 3326 3491 3656 3821 No Doors With IBF 2192 2357 2522 3517 3738 3903 4068 4233 No Doors Without IBF 1807 1972 2137 2302 3150 3315 3480 3645 Note: Italicized numbers indicate single-frame systems. Total System Weight (Kilograms) Number of I/O Subsystems 1 2 3 4 5 6 7 8 Slimline Doors With IBF 1021 1095 1170 1648 1748 1823 1898 1973 Slimline Doors Without IBF 846 921 996 1097 1481 1556 1631 1706 Acoustical Doors With IBF 1034 1109 1184 1675 1775 1850 1925 2000 Acoustical Doors Without IBF 872 947 1022 1137 1509 1583 1658 1733 No Doors With IBF 994 1069 1144 1595 1696 1770 1845 1920 No Doors Without IBF 820 894 969 1044 1429 1504 1579 1653 Note: Italicized numbers indicate single-frame systems. 134 406/90 Service Guide Acoustical Noise Emissions Acoustical Characteristic Product Configuration Declared A-Weighted Sound Power Level, LWAd (B) Declared A-Weighted Sound Pressure Level, LpAm (dB) Operating Idle Operating Idle 7040 Acoustical Doors 7.5 7.5 57 57 7040 Slimline Doors 7.9 7.9 62 62 Notes: 1. Noise levels cited are for a typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media drawer, and two I/O drawers). 2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4 reduction. That is, the noise level of a single A-Frame with thin covers is about the same as the noise level of four A-Frames with acoustical covers. 3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the 1-meter bystander positions; 1 B = 10 dB. 4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296. Environmental Specifications Environmental Specification Operating Non-Operating Storage Shipping Temperature 10 to 32°C(50 to 90°F) Max. of 24 ° C (75.2 ° F) with 4mm tape or DVD RAM in rear positions of the Media Subsystem 10 to 43°C(50 to 109°F) 1 to 60°C(34 to 140°F) –40 to 60°C(–40 to 140°F) Relative Humidity (Noncondensing) 8 to 80 % 8 to 80 % 5 to 80 % 5 to 100 % Maximum Wet Bulb 23°C (73°F) 27°C (73°F) 29°C (84°F) 29°C (84°F) Notes: 1. Storage and shipping specifications are valid for a maximum duration of two weeks each. 2. The upper limit of the dry bulb temperature must be derated 1 degree C per 189 m (619 ft.) above 1295 m (4250 ft.). Maximum altitude for 1.1 GHz modules is 3048 m (10,000 ft.) and for 1.3 GHz modules is 2134 m (7000 ft) Weight Distribution The following table shows dimensions and weights used to calculate floor loading for the 406/90. All floor-loading calculations are intended for a raised-floor environment. 1 Frame with Slimming Covers 2 Frames with Slimming Covers 1 Frame with Acoustical Covers 2 Frames with Acoustical Covers Weight 1170 kg (2580 lbs.) 1973 kg (4349 lbs.) 1184 kg (2610 lbs.) 2000 kg (4409 lbs.) Width 750 mm (29.5 in.) 1539 mm (60.6 in.) 750 mm (29.5 in.) 1539 mm (60.6 in.) Depth 1173 mm (46.2 in.) 1173 mm (46.2 in.) 1173 mm (46.2 in.) 1173 mm (46.2 in.) Chapter 1. Reference Information 135 Notes: 1. For 2 frame systems, widths of Frame A and Frame B. were added (the depth remains 1069 mm (42.1 in.), not including frame extenders). 2. For 2 frame systems, weights are based on maximum configuration (less than addition of maximum weights for each frame). 3. All floor-loading calculations are intended for a raised-floor environment. The following table shows floor-loading specifications for systems with slimline covers. The values contained in the Condition column are described following the table. Condition a (sides)mm (in.) b (front)mm (in.) c (backmm (in.) 1 Frame kg/m2 (lb./ft.2) 2 Frameskg/m2 (lb./ft.2) 1 25 (1.0) 135 (5.3) 135 (5.3) 1080.1 (221.2) 924.4 (189.3) 2 25 (1.0) 554 (21.8) 655 (25.8) 702.2 (143.8) 607.9 (124.5) 3 25 (1.0) 762 (30.0) 762 (30.0) 634.5 (129.9) 551.2 (112.9) 4 254 (10.0) 554 (21.8) 655 (25.8) 491.2 (100.6) 499.4 (102.3) 5 254 (10.0) 762 (30.0) 762 (30.0) 448.1 (91.8) 455.3 (93.3) 6 508 (20.0) 554 (21.8) 655 (25.8) 385.0 (78.9) 424.4 (86.9) 7 508 (20.0) 762 (30.0) 762 (30.0) 354.3 (72.6) 389.1 (79.7) 8 554 (21.8) 554 (21.8) 655 (25.8) 372.1 (76.2) 413.9 (84.8) 9 559 (22) 762 (30.0) 762 (30.0) 341.7 (70.0) 378.9 (77.6) 10 762 (30.0) 521 (20.5) 521 (20.5) 341.7 (70.0) 393.3 (80.6) 11 762 (30.0) 762 (30.0) 762 (30.0) 302.4 (61.9) 344.8 (70.6) Definition of Conditions: v Condition 1 indicates maximum floor loading when systems are stored cover-to-cover on all four sides with covers installed. v Conditions 2 and 3 indicate floor loading when the system has no side clearance (beyond side covers) on both sides while front/back distances varied. v Conditions 4 through 8 indicate floor loading at various points below the maximum weight-distribution distance of 762 mm (30.0 in.) from each edge of the frame. v Conditions 9 through 10 indicate floor-loading options when the installation is limited to 342.0 kg/m2 (70.0 lb/ft2). v Condition 11 is the minimum floor loading required, based on the maximum weight-distribution area (30.0 in. from each side of the base frame).Notes: 1. Service clearance is independent from weight distribution distance and must be at least 45 in. for the front of the frame and 36 in. for the rear of the frame (measured from the base frame). 2. Weight-distribution areas should not be overlapped. 3. Floor-loading weight distribution distances should not exceed 762 mm (30 in.) in any direction when measured from the base frame. 136 406/90 Service Guide The following table shows floor-loading specifications for systems with acoustical covers. The values contained in the Condition column are described following the table. Condition a (sides)mm (in.) b (front)mm (in.) c (backmm (in.) 1 Framekg/m2(lb./ft. 2) 2 Frameskg/m.2(lb./ft 2) 1 25 (1.0) 135 (5.3) 135 (5.3) 1091.9 (223.6) 936.2 (191.7) 2 25 (1.0) 554 (21.8) 757 (29.8) 685.3 (140.4) 594.9 (121.8) 3 25 (1.0) 762 (30.0) 762 (30.0) 640.8 (131.2) 557.5 (114.2) 4 254 (10.0) 554 (21.8) 757 (29.8) 480.5 (98.4) 489.3 (100.2) 5 254 (10.0) 762 (30.0) 762 (30.0) 452.2 (92.6) 460.3 (94.3) 6 508 (20.0) 554 (21.8) 757 (29.8) 377.4 (77.3) 416.3 (85.3) 7 508 (20.0) 762 (30.0) 762 (30.0) 357.2 (73.2) 393.0 (80.5) 8 569 (22.4) 762 (30.0) 762 (30.0) 342.0 (70.0) 380.7 (78.0) 9 762 (30.0) 554 (21.8) 757 (29.8) 320.3 (65.6) 367.5 (75.3) 10 762 (30.0) 533 (21.0) 533 (21.0) 342.0 (70.0) 394.2 (80.7) 11 762 (30.0) 762 (30.0) 762 (30.0) 304.6 (62.4) 348.1 (71.3) Definition of Conditions: v Condition 1 indicates maximum floor loading when systems are stored cover-to-cover on all four sides with covers installed. v Conditions 2 and 3 indicate floor loading when the system has no side clearance (beyond side covers) on both sides while front/back distances varied. v Conditions 4 through 8 indicate floor loading at various points below the maximum weight-distribution distance of 762 mm (30.0 in.) from each edge of the frame. v Conditions 9 through 10 indicate floor-loading options when the installation is limited to 342.0 kg/m2 (70.0 lb/ft2). v Condition 11 is the minimum floor loading required, based on the maximum weight-distribution area (30.0 in. from each side of the base frame).Notes: 1. Service clearance is independent from weight-distribution distance and must be at least 45 in. at the front of the frame and 36 in. at the rear of the frame (measured from the base frame). 2. Weight-distribution areas should not be overlapped. 3. Floor-loading weight distribution distances should not exceed 762 mm (30 in.) in any direction when measured from the base frame. Chapter 1. Reference Information 137 Plan Views The following illustration shows dimensional planning information for single-frame systems and double-frame systems.Double-FrameSystemsAB1575mm(62.0in.)A785mm(30.9)Single-FrameSystemsFRAMEENTRY/EXITDIMENSION(mm)(in.)FRONT117by4034.6by15.9REAR117by4034.6by15.9A750mm(29.5in.)1174mm(46.2in.)1342mm(52.8in.)SlimlineCoversor1494mm(58.8in.)AcousticalCovers109mm(4.3in.)UniversalFrontCoverFRONT18mm(0.7in.)58mm(2.3in.)SlimlineRearCoveror211mm(8.3in.)AcousticalRearCover 138 406/90 Service Guide Total System Power Consumption The following tables contain minimum and maximum power consumption for the 1.1 and 1.3 GHz 406/90. Minimum power consumption is based on a configuration consisting of a single 4 GB memory card, 1 PCI card per I/O subsystem, and 1 DASD device per I/O subsystem. Maximum power consumption is based on a configuration consisting of two 32 GB memory cards per MCM module, maximum PCI cards (20 per I/O drawer), and maximum DASD (16 per I/O drawer). Power consumption calculations are estimates. Actual values may vary. Calculate heat load (Btu per hour) by multiplying the power (in watts) for the configuration by a factor of 3.4. Note: Certain system configurations only support a maximum number of I/O drawers. If a system does not support a particular count of I/O drawers, it is indicated with ?N/A?. Number of I/O Drawers (7040-61D) 1.1 GHz 8-way Modules (minimum power consumption, in watts) 1.1 GHz 8-way Modules (maximum power consumption, in watts) 8-way 16-way 24-way 32-way 8-way 16-way 24-way 32-way 1 1911 2867 3823 4779 3042 4586 6130 7674 2 2279 3235 4191 5147 4090 5634 7178 8722 3 N/A 3603 4559 5515 N/A 6682 8226 9770 4 N/A 3971 4927 5883 N/A 7730 9274 10818 5 N/A N/A 5295 6251 N/A N/A 10322 11866 6 N/A N/A 5663 6619 N/A N/A 11370 12914 7 N/A N/A N/A 6987 N/A N/A N/A 13962 8 N/A N/A N/A 7355 N/A N/A N/A 15010 Number of I/O Drawers (7040-61D) 1.3 GHz 4-way Modules (minimum power consumption, in watts) 1.3 GHz 4-way Modules (maximum power consumption, in watts) 8-way 16-way 8-way 16-way 1 3213 5471 4932 8366 2 3581 5839 5980 9414 3 3949 6207 7028 10462 4 4317 6575 8076 11510 5 N/A 6943 N/A 12558 6 N/A 7311 N/A 13606 7 N/A 7679 N/A 14654 8 N/A 8047 N/A 15702 Chapter 1. Reference Information 139 Number of I/O Drawers (7040-61D) 1.3 GHz 8-way Modules (minimum power consumption, in watts) 1.3 GHz 8-way Modules (maximum power consumption, in watts) 8-way 16-way 24-way 32-way 8-way 16-way 24-way 32-way 1 2084 3213 4342 5471 3215 4932 6649 8366 2 2452 3581 4710 5839 4263 5980 7697 9414 3 N/A 3949 5078 6207 N/A 7028 8745 10462 4 N/A 4317 5446 6575 N/A 8076 9793 11510 5 N/A N/A 5814 6943 N/A N/A 10841 12558 6 N/A N/A 6182 7311 N/A N/A 11889 13606 7 N/A N/A N/A 7679 N/A N/A N/A 14654 8 N/A N/A N/A 8047 N/A N/A N/A 15702 Number of I/O Drawers (7040-61D) 1.5 GHz 4-way Modules (minimum power consumption, in watts) 1.5 GHz 4-way Modules (maximum power consumption, in watts) 4-way 8-way 12-way 16-way 4-way 8-way 12-way 16-way 1 1714 2473 3232 4946 2931 4364 5797 7230 2 2082 2841 3600 5314 3979 5412 6845 8278 3 N/A 3209 3968 5682 N/A 6460 7893 9326 4 N/A 3577 4336 6050 N/A 7508 8941 10374 5 N/A N/A 4704 6418 N/A N/A 9989 11422 6 N/A N/A 5072 6786 N/A N/A 11037 12470 7 N/A N/A N/A 7154 N/A N/A N/A 13518 8 N/A N/A N/A 7522 N/A N/A N/A 14566 Number of I/O Drawers (7040-61D) 1.5 GHz 8-way Modules (minimum power consumption, in watts) 1.5 GHz 8-way Modules (maximum power consumption, in watts) 8-way 16-way 24-way 32-way 8-way 16-way 24-way 32-way 1 1839 2723 3607 4491 3056 4614 6172 7730 2 2207 3091 3975 4859 4104 5662 7220 8778 3 N/A 3459 4343 5227 N/A 6710 8268 9826 4 N/A 3827 4711 5595 N/A 7758 9316 10874 5 N/A N/A 5079 5963 N/A N/A 10364 11922 6 N/A N/A 5447 6331 N/A N/A 11412 12970 7 N/A N/A N/A 6699 N/A N/A N/A 14018 8 N/A N/A N/A 7067 N/A N/A N/A 15066 140 406/90 Service Guide Number of I/O Drawers (7040-61D) 1.7 GHz 8-way Modules (minimum power consumption, in watts) 1.7 GHz 8-way Modules (maximum power consumption, in watts) 8-way 16-way 24-way 32-way 8-way 16-way 24-way 32-way 1 2017 3079 4141 5203 3234 4970 6706 8442 2 2385 3447 4509 5571 4282 6018 7754 9490 3 N/A 3815 4877 5939 N/A 7066 8802 10538 4 N/A 4183 5245 6307 N/A 8114 9850 11586 5 N/A N/A 5613 6675 N/A N/A 10898 12634 6 N/A N/A 5981 7043 N/A N/A 11946 13682 7 N/A N/A N/A 7411 N/A N/A N/A 14730 8 N/A N/A N/A 7779 N/A N/A N/A 15778 W attage Addition/Subtraction for Minimum and Maximum Configurations Minimum configurations are based on a single 4 GB memory card and a single DASD/PCI card in each I/O subsystem. Maximum configurations are based on two 64 GB memory cards per MCM module, sixteen DASD per I/O subsystem and twenty PCI cards per I/O subsystem. To determine the typical power consumption for a specific configuration, use the following typical power values: v 4 GB memory book - 137 Watts v 8 GB memory book - 151 Watts v 16 GB memory book - 235 Watts v 32 GB memory book - 294 Watts v 64 GB memory book - 320 Watts v Each PCI card - 20 Watts v DASD - 20 Watts Chapter 1. Reference Information 141 Service Inspection Guide Perform a service inspection on the system when: v The system is inspected for a maintenance agreement. v Service is requested and service has not recently been performed. v An alterations and attachments review is performed. v Changes have been made to the equipment that may affect its safe operation. v External devices with their own power cables have those cables attached.If the inspection indicates an unacceptable safety condition, the condition must be corrected before anyone can service the machine. Note: The owner of the system is responsible to correct any unsafe conditions. Perform the following checks: 1. Check the covers for sharp edges and for damage or alterations that expose the internal parts of the system. 2. Check the covers for proper fit to the rack. They should be in place and secure. 3. Gently rock the rack from side to side to verify the rack is stable. 4. Turn off the UEPO switch, turn off the circuit breakers on all optional IBF subsystems (if provided), then disconnect both power cords from the customer’s receptacle. 5. Remove the covers. 6. Check for alterations or attachments. If there are any, check for obvious safety hazards, such as broken wires, sharp edges, or broken insulation. 7. Check the internal cables for damage. 8. Check for dirt, water, and any other contamination within the system. 9. Check the air filters in the processor subsystem to ensure they are clean. Replace if necessary. 10. Check the voltage label on the back of the system to ensure that it matches the voltage at the outlet. 11. Check the external power cable for damage. 12. With the external power cable connected to the system, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame. 13. Perform the following checks on each device that has its own power cables: a. Check for damage to the power cord. b. Check for the correct grounded power cable. c. With the external power cable connected to the device, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame of the device.14. Install the covers.Preventive Maintenance Perform preventive maintenance on filters as indicated in the following section. Processor Subsystem Air Filters Attention: To maintain proper airflow, the air filters (part number 11P3781) must be inspected every six months and replaced as necessary. To replace the air filters, it is not necessary to remove power from the system or blower assemblies. 142 406/90 Service Guide Removal To remove the air filters from the processor subsystem, grasp the finger tabs located on the air filter and pull the air filter straight out from the processor subsystem. 1 Top Filter Pull Tab 5 Bottom Filter 2 Top Filter Key 6 Bottom Filter Key 3 Processor Subsystem 7 Bottom Filter Pull Tab 4 Top Filter Replacement To replace an air filter in the processor subsystem, slide the air filter into the slot. The filter is keyed to ensure correct orientation. Chapter 1. Reference Information 143 144 406/90 Service Guide Chapter 2. Diagnostics Overview 406/90 servers use an integrated set of software diagnostic procedures to facilitate system maintenance. This book and the Diagnostic Information for Multiple Bus Systems are the basis of the diagnostic procedures for 406/90 systems. In particular, chapters 3, 4, 5, and 10 in this book are important for the trained service representative to understand and use when isolating a failure on the system. The manufacturer recommends that systems configured with 4 GB of memory or greater have access to a 4-mm or 8-mm tape drive for submission of system dump information if it is required. This function can be accomplished through locally attached or network-attached devices, as appropriate. Maintenance Analysis Procedures (MAPs) Maintenance Analysis Procedures (MAPs) guide trained service representatives through the complex multirack 406/90 system. These MAPs are the entry point for all isolation and error-recovery procedures. 406/90 systems use a set of integrated procedures, to which the MAPS are the primary entry point. The MAPS are as follows: v Entry MAP v Quick Entry MAP v Service Focal Point Procedures v Power MAP v Problem Isolation MAPThe Entry Map is the starting point for problem determination. The purpose of this MAP is to quickly point to the appropriate MAP or service reference information either in this book, or in the common book set, which includes the Diagnostic Information for Multiple Bus Systems and PCI Adapter Placement Reference. The Quick Entry MAP is a subset of the Entry MAP and helps to save time for some types of problems. The Power MAP deals with isolation of components and cables to diagnose a power problem. The nature of power problems can be related to powering the system on and off, or power failures that can occur after power is turned on. The Problem Isolation MAP is used to locate defective components not found by normal diagnostics or error-isolation methods. This MAP provides a systematic method of isolation to the failing item or items. Checkpoints 406/90 systems use various types of checkpoints, error codes, and SRNs that are referred to throughout this book (primarily in Chapters 3, 4 and 5). On an HMC-managed system, the checkpoints and error codes displayed on the physical operator panel are also displayed on the HMC virtual terminal associated with the managed system object. On a system operating as a full system partition, checkpoints and error codes will appear on the physical operator panel from the beginning of the power-on process through the completion of booting AIX. On a partitioned system, checkpoints and error codes for a partition are displayed on the HMC virtual operator panel associated with that logical partition. Checkpoints and error codes generated by logical partitions are not displayed on the managed system’s physical operator panel. 145 During boot up of a partitioned system, there is a global initialization phase where one of the system processors is in control and initializing all of the system resources. During this phase, Exxx codes will be displayed on both the physical operator panel and the HMC managed-system object. After the global initialization phase completes, ?LPAR...? is displayed on the physical operator panel and the HMC managed-system object. Each logical partition also has a partition initialization phase where one of the system processors assigned to that partition is initializing all the resource of that partition. Exxx codes will be displayed on the virtual terminal of the HMC logical partition object during this partition initialization phase. Codes that may appear in the service processor boot progress log, the operating system error log, the virtual terminal, and the operator panel display are as follows: Checkpoints Checkpoints display in the operator panel from the time ac power is connected to the system until the operating system login prompt is displayed after a successful operating system boot. These checkpoints have the following forms: 8xxx 8xxx checkpoints are displayed from the time ac power is connected to the system until the OK prompt is displayed by the service processor on the operator panel display. (Several 9xxx checkpoints are displayed just before the OK prompt displays.) 9xxx 9xxx checkpoints are displayed by the service processor after the power-on sequence is initiated. A system processor takes control when 91FF displays on the operator panel display.Note: Certain checkpoints may remain in the display for long periods of time. A spinning cursor is visible in the upper-right corner of the display during these periods to indicate that system activity is continuing.Exxx Exxx checkpoints indicate that a system processor is in control and is initializing the system resources. Control is being passed to the operating system when E105 displays on the operator panel display. Location code information may also display on the operator panel during this time. 0xxxand2xxx 0xxx codes are AIX progress codes and configuration codes. Location codes may also display on the operator panel display during this time.Note: AIX progress codes will not display when the Linux operating system is booting.Error Codes If a fault is detected, an 8-digit error code is displayed in the operator panel display. A location code may be displayed at the same time on the second line. SRNs Service request numbers, in the form xxx-xxx or xxx-xxxx, may also be displayed on the operator panel display and be noted in the AIX error log.Note: If the operating system is Linux, SRNs will not be logged in the Linux system error log.Checkpoints can become error codes if the system fails to advance past the point at which the code is presented. For a list of the checkpoints, see Chapter 4, “Checkpoints,” on page 369. Each entry provides a description of the event and the recommended action if the system fails to advance. SRNs are listed in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. 146 406/90 Service Guide FRU Isolation For a list of error codes and recommended actions for each code, see Chapter 5, “Error Code to FRU Index,” on page 413. These actions can refer to Chapter 10, “Parts Information,” on page 1009, Chapter 3, “Maintenance Analysis Procedures (MAPs),” on page 149, or provide informational message and directions. If a replacement part is indicated, direct reference is made to the part name. The respective AIX and physical location codes are listed for each occurrence as required. For a list of location codes, see “AIX and Physical Location Code Reference Tables” on page 68. Note: AIX location codes are not available when the running operating system is Linux. To look up part numbers and view component diagrams, see Chapter 10, “Parts Information,” on page 1009. The beginning of the chapter provides a parts index with the predominant field replaceable units (FRUs) listed by name. The remainder of the chapter provides illustrations of the various assemblies and components that compose 406/90 servers. The illustrations are ordered with processor subsystem components listed first, then I/O subsystem, and then accessories. FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to: v Rack identify LED v Processor subsystem drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – Processor subsystem FRUs – I/O subsystem FRUsv I/O adapter identify LED v DASD identify LEDThe identify LEDs are arranged hierarchically with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding processor subsystem or I/O drawer identify LED, and the corresponding rack-identify LED to locate the failing FRU more easily. Any identify LED in the system may be flashed (when the processor subsystem is in the failed state and power is on) by using the service processor LED Control Menu contained in the System Information Menu of the privileged user menus. To use operate the LED Control Menu, see page 788. Any identify LED in the system can also be flashed by using the Identify and Attention Indicators task in diagnostics. The procedure to use the Identify and Attention Indicators task in diagnostics is outlined in Diagnostic Information for Multiple Bus Systems. If the service processor menus and AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of the following procedures: v If the system is configured as a full partition, the system may be booted to the open firmware prompt and the command FRU-LED-MENU entered. A menu displays that allows you to enable the desired FRU identify LED. See “System Power Control Menu” on page 778 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. v If the system is logically partitioned, the HMC must be attached. You can use the HMC to enable any FRU identify LED to be flashed. See the Hardware Management Console Operations Guide for instructions on activating and deactivating a FRU identify LED. Chapter 2. Diagnostics Overview 147 Running Diagnostics on the Native Serial Port on a Full Machine Partition with an HMC Attached For a full system partition, the output of the S1 serial port is redirected, or wrapped to the virtual console window. When the S1 serial port is wrapped, the output of any command is directed from the S1 serial port to the virtual console terminal. If you close the virtual console window on the managed system, normal function is restored to the S1 serial port. The implication of this is that the serial port wrap diagnostics may fail with a false SRN if the virtual console window on the managed system is not closed. Another method to prevent the false SRN is to unplug the HMC from its port before running the wrap tests. After running diagnostics on the serial port, make sure that the S1 serial port is enabled for login; use the AIX command chdev -l ’ttyx’ -a login=’enable’ where x is the tty number corresponding to the S1 serial port. 148 406/90 Service Guide Chapter 3. Maintenance Analysis Procedures (MAPs) This chapter contains Maintenance Analysis Procedures (MAPs) for the 406/90. Entry MAP When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. This ensures that the error state of the system that was captured in NVRAM is available for your use in fixing the problem. The AIX error log and SMIT are only available when diagnostics are run from the hard drive. Always check Service Focal Point for any open service events in Service Action Event log. Note: Online AIX diagnostics from the hard drive are not available when the running operating system is Linux. If this system has a Hardware Management Console (HMC) with Service Focal Point (SFP) and you have not previously been directed to MAP 1321, go to “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170. Otherwise, use the following table to help determine your next step.Notes: 1. Licensed programs frequently rely on system information stored in the VPD module on the media subsystem operator panel. If the MAPs indicate that the media subsystem operator panel should be replaced, swap the VPD module from the old media subsystem operator panel to the new one (see “Operator Panel” on page 1000). If the old VPD module has to be replaced, call technical support for recovery instructions. If recovery is not possible, notify the system owner that new keys for licensed programs may be required. 2. If a network adapter is replaced, the network administrator must be notified so that the client IP addresses used by the system can be changed. In addition, the operating system configuration of the network controller may need to be changed in order to enable system startup. Also check to ensure that any client or system that addresses this system is updated.Note: The various codes that might display on the HMC are all listed as error codes by Service Focal Point (SFP). To assist you in identifying the types of error data in this guide, use the following table. SFP Name Number of Digits in Error Code Error code Service Guide Name Error Code Any Contains # Menu Goal Error Code Any Contains - (hyphen) SRN Error Code 5 Does not contain # or - SRN Error Code 6 Does not contain # or - Error Code Error Code 8 Does not contain # or - Error Code, SRC, or Ref Code 149 Symptom Starting Point You have a problem that does not prevent the system from booting and the media subsystem operator panel is functional. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. You do not have a symptom. Go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems. You have an SRN, from a partition running Linux, that is one of the following: 651-88x, 651-89x, 652-88x, 652-89x, Axx-xxx, or Axx-xxxx. Go to Chapter 11, “SRNs for Linux,” on page 1035 and perform the actions listed there. You have an SRN, from a partition running Linux, that is not one of the following: 651-88x, 651-89x, 652-88x, 652-89x, Axx-xxx, or Axx-xxxx. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. You have an SRN from a partition running AIX or AIX standalone diagnostics Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. The system stops and a 3-digit number is displayed in the operator panel display. Record SRN 101-xxx, where xxx is the 3-digit number displayed in the operator panel display, then go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. The system stops and a 4-digit number beginning with 0 or 2 is displayed in the operator panel display. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. All other symptoms. Go to “Quick Entry MAP” on page 151. 150 406/90 Service Guide Quick Entry MAP The Quick Entry MAP is provided to help you quickly resolve a problem. Use the Quick Entry MAP when you know or have been provided with a symptom. Quick Entry MAP Table of Contents Problem Description Page No. Service actions 152 The system attention LED on the media subsystem operator panel is on. 152 OK does not appear in the media subsystem operator panel display before pressing the power-on button. Other symptoms appear in the media subsystem operator panel display or LEDs before the power-on button is pressed. 152 8-Digit error codes 153 System stops with an 8-digit number displayed 153 System stops with a 4-digit number displayed that does not begin with 0 or 2. 153 System stops with a 3-digit number displayed (see below for 888 sequence). 153 Hardware Management Console (HMC) Problem 153 There appears to be a display problem (distortion, blurring, etc.) 154 Power and cooling problems 155 888 sequence in operator panel display 155 Other symptoms or problems 155 Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise, call for support if the actions for an error code do not resolve the problem. If you replace FRUs or perform an action, and the problem is corrected, go to ?MAP 410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Note: If the only service actions that you perform are on the HMC, do not use MAP 410 to do the repair checkout. Chapter 3. Maintenance Analysis Procedures 151 Symptom Action Service Actions You have an open service event in the service action event log. Go to “Service Focal Point” on page 4. You have parts to exchange or a corrective action to perform. 1. Go to Chapter 9, “Removal and Replacement Procedures,” on page 835. 2. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. You need to verify that a part exchange or corrective action corrected the problem. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. You need to verify correct system operation. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. The system attention LED on the media subsystem operator panel is on. Go to “Disturbance or System Attention LED” on page 21 and perform any actions indicated. OK does not appear in the media subsystem operator panel display before pressing the power-on button. Other symptoms appear in the media subsystem operator panel display or LEDs before the power on button is pressed. A bouncing or scrolling ball remains on the media subsystem operator panel display or the media subsystem operator panel display is filled with dashes 1. Verify the operator panel cable from the primary I/O book to the media subsystem is connected and correctly seated at both ends. 2. If an ASCII terminal is available, connect it to the primary I/O book through serial port 1. a. If the service processor menu is displayed, then: 1) Replace the operator panel assembly, location: U1.17-L1. Refer to “Operator Panel” on page 1000. 2) Replace the primary I/O book, location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854.b. If the service processor menu is not displayed, then: 1) Replace the primary I/O book, location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854. 2) Replace the operator panel assembly, location: U1.17-L1. Refer to “Operator Panel” on page 1000.3. If an ASCII terminal is not available, replace the following, one at a time, in the sequence listed: a. Operator panel assembly, location: U1.17-L1. Refer to “Operator Panel” on page 1000. b. Primary I/O book, location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854. You have a blank display on the operator panel. Other LEDs on the operator panel appear to behave normally. 1. Check the processor subsystem to media subsystem cable. 2. Replace the operator panel assembly, Location: U1.17-L1. Refer to “Operator Panel” on page 1000. 3. Replace the primary I/O book, Location: U1.18-P1-H2. Refer to “I/O Books and I/O Blanks” on page 854. You have a blank display on the operator panel. Other LEDs on the operator panel are off. Go to “MAP 1520: Power” on page 192. 152 406/90 Service Guide Symptom Action 8-Digit Error Codes You have an 8-digit error code displayed 1. Look up the error code in the table in “Checkpoints and Error Codes Index” on page 416. 2. Look up the service action event (SAE) log. If the FRU has been changed other than the one in step 1, look at the most recent one. System stops with an 8-digit number displayed The system stops with an 8-digit error code displayed when booting. 1. Look up the error code in the table in “Checkpoints and Error Codes Index” on page 416. 2. Look up the SAE log. If the FRU has been changed other than the one in step 1, look at the most recent one. System stops with a 4-digit number displayed that does not begin with 0 or 2. The system stops with a 4-digit checkpoint. Go to “Checkpoints and Error Codes Index” on page 416. If you do not find the error code there, go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. System stops with a 3-digit number displayed (see below for 888 sequence) The system stops with a 3-digit error code. Add 101- to the left of the three digits to create an SRN. Go to Diagnostic Information for Multiple Bus Systems. If there is a location code displayed under the 3-digit error code, then look at location to see if it matches failing component that the SRN pointed to. If they do not match, perform the action from Diagnostic Information for Multiple Bus Systems first. If the problem still exists, then replace the failing component from that location code. Hardware Management Console (HMC) Problem Chapter 3. Maintenance Analysis Procedures 153 Symptom Action Hardware Management Console (HMC) cannot be used to manage a managed system, or the connection to the managed system is failing. If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the managed system might be damaged or incorrectly cabled. Do the following: 1. Check the connections (serial cable) between the HMC and the managed system. Correct any cabling errors if found. If another serial cable is available, connect it in place of the existing cable and refresh the HMC graphical user interface. You may have to wait up to 30 seconds for the managed system to reconnect. 2. Verify that any connected HMC is connected to the managed system by checking the Management Environment of the HMC. Note: The managed system must have power connected and the system running, or waiting for a power-on instruction (the OK prompt is in the operator panel.) If the managed system does not appear in the Navigation area of the HMC Management Environment, then the HMC or the connection to the managed system might be failing. 3. Go to the Entry MAP in the Hardware Management Console Maintenance Guide, order number SA23-1273. 4. If you cannot fix the problem using the HMC tests in the Hardware Management Console Maintenance Guide, there might be a problem with the primary I/O book; replace the primary I/O book. Hardware management console (HMC) cannot call out using the attached modem and the customer’s telephone line. If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the modem and telephone line may have a problem. Do the following: 1. Check the connections between the HMC and the modem and telephone line. Correct any cabling errors if found. 2. Go to the Entry MAP in the Hardware Management Console Maintenance Guide, order number SA23-1273. There is a Display Problem (Distortion, Blurring, Etc.) 154 406/90 Service Guide Symptom Action All display problems. 1. If using the Hardware Management Console, go to the Hardware Management Console Maintenance Guide, order number SA23-1273. 2. If using a graphics display: a. Go to the problem determination procedures for the display. b. If you do not find a problem, replace the display adapter.3. If using an ASCII terminal: a. Make sure that the ASCII terminal is connected to S1. b. If problems persist, go to the problem determination procedures for the terminal. c. If you do not find a problem, replace the primary I/O book. Power and Cooling Problems The system will not power on and no error codes are available. Go to “MAP 1521: The System Will Not Power On And No Error Codes Are Available” on page 196. The power light does not come on or stay on. (both the operator panel power-on LED and the I/O subsystem indicator panel power LEDs do not come on or stay on). 1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192. A rack or a rack-mounted unit will not power on. 1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192. You have a power problem. 1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192. The system attention LED on the media subsystem operator panel is on and there is no error code displayed. 1. Check service processor error log. 2. Go to “MAP 1520: Power” on page 192. 888 Sequence in Operator Panel Display An 888 sequence is displayed in the media subsystem operator panel display. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. Other Symptoms or Problems One or more partitions do not boot. You cannot activate one or more partitions. Go to “MAP 1020: Problem Determination” on page 158. The operator panel displays OK and the fans and blowers are off. The service processor is ready. The system is waiting for power-on. All of the system POST indicators are displayed on the system console, the system pauses and then restarts. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the system console during the power-on self-test (POST). Go to “Boot Problems” on page 409. The system stops and all of the POST indicators are displayed on the system console. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the system console during the power-on self-test (POST). Go to “Boot Problems” on page 409. The system stops and the message STARTING SOFTWARE PLEASE WAIT... is displayed on the firmware console. Go to Chapter 4, “Checkpoints,” on page 369. Chapter 3. Maintenance Analysis Procedures 155 Symptom Action The system does not respond to the password being entered or the system login prompt is displayed when booting in service mode. 1. If the password is being entered from the Hardware Management Console (HMC), go to the Hardware Management Console Maintenance Guide. 2. If the password is being entered from a keyboard attached to the system, then the keyboard or its controller may be faulty. In this case, replace these parts in the following order: a. Keyboard cable b. Keyboard c. USB card the keyboard is attached to.3. If the password is being entered from an ASCII terminal, then use the problem determination procedures for the ASCII terminal. Make sure the ASCII terminal is connected to S1. If the problem persists, replace the primary I/O book. No codes are displayed on the operator panel within a few seconds of turning on the system. The operator panel is blank before the system is powered on. Reseat the operator panel cable. If the problem is not resolved, replace in the following order: 1. Operator panel assembly. Swap the VPD module from the old operator panel to the new operator panel. 2. Primary I/O book. (See notes on page 149.) The SMS configuration list or boot sequence selection menu shows more SCSI devices attached to a controller/adapter than are actually attached. A device may be set to use the same SCSI bus ID as the control adapter. Note the ID being used by the controller/adapter (this can be checked and/or changed through an SMS utility), and verify that no device attached to the controller is set to use that ID. If settings do not appear to be in conflict: 1. Go to “MAP 1542: I/O Problem Isolation” on page 318 2. Replace the SCSI cable. 3. Replace the device. 4. Replace the SCSI adapterNote: In a ?twin-tailed? configuration where there is more than one initiator device (normally another system) attached to the SCSI bus, it may be necessary to use SMS utilities to change the ID of the SCSI controller or adapter. The System Management Services menu is displayed. The device or media you are attempting to boot from may be faulty. 1. Check the SMS error log for any errors. To check the error log: v Choose error log v If an error is logged, check the time stamp. v If the error was logged during the current boot attempt, record it. v Look up the error in Chapter 5, “Error Code to FRU Index” and do the listed action. v If no recent error is logged in the error log, continue to the next step below.2. Try to boot from an alternate boot device connected to the same controller as the original boot device. If the boot succeeds, replace the original boot device (for removable media devices, try the media first). 3. Go to “Boot Problems” on page 409. 156 406/90 Service Guide Symptom Action You suspect a cable problem. Go to Adapters, Devices, and Cable Information for Multiple Bus Systems. All other problems. Go to “MAP 1020: Problem Determination” on page 158. Chapter 3. Maintenance Analysis Procedures 157 MAP 1020: Problem Determination Purpose of This MAP Use this MAP to get an error code if you were not provided one by the customer or you are unable to load diagnostics. If you are able to load the diagnostics, go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems. The service processor may have recorded one or more symptoms in its error log. Examine this error log before proceeding (see “System Information Menu” on page 781). The server may have been set up by using the HMC. Check the Service Action Event (SAE) log in the Service Focal Point. The SAE log may have recorded one or more symptoms in the Service Focal Point. To avoid unnecessary replacement of the same FRU for the same problem, it is necessary to check the SAE log for evidence of prior service activity on the same subsystem. If you find any error codes in the service processor error log or the Service Focal Point, record all error codes and any location codes in the order you find them and process each error code and location code beginning at the Entry MAP. Otherwise, continue with the directions for MAP 1020. The service processor may have been set by the user to monitor system operations and to attempt recoveries. You can disable these actions while you diagnose and service the system. If the system was set up according to the recommendations of the 406/90, all the settings of the service processor (except language) were saved by using the SAVE/RESTORE HARDWARE MAINTENANCE POLICIES service aid. You can use the same service aid to restore the settings at the conclusion of your service action. In case the service processor settings were not saved by the user, if you disable them, make notes of their current settings for restoration before you leave. In addition to the parameters in the following table, you can disconnect the modem to prevent incoming signals that could cause the system to power on. Following are the service processor settings. The service processor menus are described in Chapter 7, “Using the Service Processor,” on page 769. Surveillance From the service processor Setup Menu, go to the Surveillance Setup Menu and disable surveillance. (Surveillance is automatically disabled in LPAR mode.) Unattended Start From the service processor System Power Control Menu, disable unattended start mode. Reboot Policy From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: 1. Number of reboot attempts to 0 (zero) 2. Use OS-Defined restart policy to No 3. Enable supplemental restart policy to No. Call Out From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. (Call-out is automatically disabled when the system is booted to operate with multiple partitions (also known as a partitioned system). Be prepared to record code numbers and use those numbers in the course of analyzing a problem. Go to “Step 1020-1” on page 159. 158 406/90 Service Guide Step 1020-1 The following steps analyze a failure to load the diagnostic programs. Note: Be prepared to answer questions regarding the operator panel display and to perform certain actions based on displayed POST indicators. 1. Run diagnostics on any partition. Find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, continue to the next step. 2. Run diagnostics on the failing partition. Find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, continue to the next step. 3. Power off the system. Refer to “Powering Off and Powering On the System” on page 838. 4. Load the standalone diagnostics in service mode to test the full system partition. Refer to Chapter 6, “Using the Online and Standalone Diagnostics,” on page 759. 5. Wait until the diagnostics are loaded or the system appears to stop. If you receive an error code or if the system stops before diagnostics are loaded, find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, continue to the next step. 6. Run the standalone diagnostics on the entire system. Find your symptom in the following table; then follow the instructions given in the Action column. If no fault is identified, call service support for assistance. Symptom Action One or more partitions do not boot. 1. Check service processor error log. If an error is indicated, go to the “Entry MAP” on page 149. 2. Check the Serviceable Action Event (SAE) log, refer to “Working With Serviceable Events” on page 7. If there is a Service Action Event, perform the recommended actions or go to the “Entry MAP” on page 149. The system stopped and a code is displayed on the operator panel. Go to the “Entry MAP” on page 149. The system stops with a prompt to enter a password. Enter the password. You cannot continue until a correct password has been entered. When you have entered a valid password, go to the beginning of this table and wait for one of the other conditions to occur. The diagnostic operating instructions are displayed. Go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems. The power good LED does not come on or does not stay on, or you have a power problem. Go to “MAP 1520: Power” on page 192. The system login prompt is displayed. You may not have pressed the correct key or you may not have pressed the key soon enough when you were to trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step. Note: Perform the system shutdown procedure before turning off the system. If you are sure you pressed the correct key in a timely manner, go to “Step 1020-2” on page 161. The system does not respond when the password is entered. Go to “Step 1020-2” on page 161. Chapter 3. Maintenance Analysis Procedures 159 Symptom Action The system stopped. A POST indicator is displayed on the system console and an eight-digit error code is not displayed. If the POST indicator represents: 1. Memory, do a slow boot. If an error code points to memory, replace that memory book and run slow boot again. 2. Keyboard a. Replace the keyboard cable. b. Replace the keyboard. c. Replace the USB card that the keyboard is attached to. d. Go to “MAP 1542: I/O Problem Isolation” on page 318.3. Network, go to “MAP 1542: I/O Problem Isolation” on page 318. 4. SCSI, go to “MAP 1542: I/O Problem Isolation” on page 318. 5. Speaker a. Replace the operator panel. b. Replace the primary I/O book, location: U1.18-P1-H2. c. Go to “MAP 1542: I/O Problem Isolation” on page 318. The System Management Services menu is displayed. Go to “Step 1020-4” on page 162. All other symptoms. If you were directed here from the Entry MAP, go to “MAP 1542: I/O Problem Isolation” on page 318. Otherwise, find the symptom in the “Entry MAP” on page 149. 160 406/90 Service Guide Step 1020-2 There is a problem with the keyboard. Find the type of keyboard you are using in the following table; then follow the instructions given in the Action column. Keyboard Type Action Type 101 keyboard (U.S.). Identified by the size of the Enter key. The Enter key is in only one horizontal row of keys. Record error code M0KB D001; then go to “Step 1020-3.” Type 102 keyboard (W.T.). Identified by the size of the Enter key. The Enter key extends into two horizontal rows. Record error code M0KB D002; then go to “Step 1020-3.” Type 106 keyboard. (Identified by the Japanese characters.) Record error code M0KB D003; then go to “Step 1020-3.” ASCII terminal keyboard Go to the documentation for this type of ASCII terminal and continue with problem determination. Step 1020-3 Take the following actions: 1. Find the eight-digit error code in Chapter 5, “Error Code to FRU Index,” on page 413. Note: If the eight-digit error code is not listed in Chapter 5, “Error Code to FRU Index,” look for it in the following: v Any supplemental service manuals for attached devices v The diagnostic problem report screen for additional information v The Service Hints service aid v The CEREADME file (by using the Service Hints service aid)Note: Service aids can be found in Diagnostic Information for Multiple Bus Systems. 2. Perform the action listed. Chapter 3. Maintenance Analysis Procedures 161 Step 1020-4 1. Turn off, then turn on the system unit. 2. When the keyboard indicator (the word keyboard) displays, press the 1 key on a directly attached keyboard or the 1 key on an ASCII terminal or 1 key on the HMC. 3. When the System Management Services menus appear, check the error log for any errors. a. Choose Error Log b. If an error is logged, check the time stamp. c. If the error was logged during the current boot attempt, record it. d. Look up the error in the Chapter 5, “Error Code to FRU Index,” on page 413 and do the listed action. e. If no recent error is logged in the error log, go to “MAP 1542: I/O Problem Isolation” on page 318. 162 406/90 Service Guide MAP 1230: Linux Platform Error Problem Resolution Purpose of This MAP Use this MAP to resolve problems reported by SRNs A00-000 to A1F-FFF from a partition running Linux. Step 1230-1 1. The last character of the SRN is bit-encoded as follows: 8 4 2 1 | | | | | | | Replace all FRUs listed | | Hot-swap is supported | Software or Firmware could be the cause Reserved 2. Refer to the last character in the SRN. A 4, 5, 6, or 7 indicates a possible software or firmware problem. Does the last character indicate a possible software or firmware problem? NO Go to “Step 1230-4” on page 164. YES Go to “Step 1230-2.”Step 1230-2 Ask the customer if any software or firmware has been installed recently. Has any software or firmware been installed recently? NO Go to “Step 1230-4” on page 164. YES Go to “Step 1230-3.”Step 1230-3 Suspect the new software or firmware. Check with your support center for any known problems with the new software or firmware. Are there any known problems with the software or firmware? NO Go to “Step 1230-4” on page 164. YES Obtain and follow the procedure to correct the software problem. This completes the repair. Chapter 3. Maintenance Analysis Procedures 163 Step 1230-4 Were any FRUs or location code reported with the SRN? NO Go to “Step 1230-5.” YES Go to “Step 1230-9.”Step 1230-5 Run the diagnostics in problem determination mode on sysplanar0. Were there any FRUs reported with the SRN? NO Go to “Step 1230-6.” YES Go to “Step 1230-9.”Step 1230-6 Did the system display: ?Previous Diagnostic Results - Do you want to review the previously displayed error?? NO Go to “Step 1230-7.” YES You have a pending item in the error log. To see this error, select YES at the prompt. Information from the error log displays in order of last event first. Record the error code, the FRU names and the location code of the FRUs. Go to “Step 1230-7.”Step 1230-7 Were there any other SRNs that begin with an A00 to A1F reported? NO Go to “Step 1230-8.” YES Go to “Step 1230-1” on page 163 and use the new SRN.Step 1230-8 Attempt to boot the system on slow boot mode. If the system boots, run the diagnostics in problem determination mode on sysplanar0 Were any new error codes or SRNs reported? NO Call your support center. YES Follow the procedure for the new error code or SRN.Step 1230-9 1. Record the physical location codes and FRU numbers. 2. Refer to the last character in the SRN. A 2, 3, 6, or 7 indicates that hot-swap is possible. Does the last character indicate that hot-swap is possible? NO Go to “Step 1230-10.” YES Go to “Step 1230-14” on page 166.Step 1230-10 Note: If necessary, refer to “Powering Off and Powering On the System” on page 838 for information on system shut down and powering the system on and off. 1. If the operating system is running, perform the operating system’s shutdown procedure. 2. Turn off power to the system. 164 406/90 Service Guide 3. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen need to be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed.Note: Use the appropriate procedure in Chapter 9, “Removal and Replacement Procedures,” on page 835 to replace any FRUs indicated. 4. Turn on power to the system. 5. Load standalone AIX diagnostics in service mode.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the Task Selection option, check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding 11. If the resource associated with your repair action is not displayed on the resource list, select sysplanar0. 12. Run diagnostics on all resources. Was a problem reported? NO The repair is complete. Go to MAP 0410: Repair Checkout, located in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. YES Go to “Step 1230-11.”Step 1230-11 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. YES Go to “Step 1230-12.”Step 1230-12 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to “Step 1230-13” on page 166. YES The SRN did not identify the failing FRU. Call your support person for assistance. Chapter 3. Maintenance Analysis Procedures 165 Step 1230-13 1. After performing a shutdown of the operating system, turn off power to the system. 2. Remove the new FRU and install the original FRU. 3. Exchange the next FRU in list. 4. Turn on power to the system. 5. Load standalone diagnostics in service mode.Note: If the Diagnostics Operating Instructions do not display or you are unable to select the Task Selection option, check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. 6. Wait until the Diagnostics Operating Instructions are displayed or the system appears to stop. 7. Press Enter. 8. Select Diagnostic Routines at the function selection menu. 9. Select System Verification. 10. If a missing options exist, particularly if it is related to the device that was replaced, resolve the missing options before proceeding 11. If the resource associated with your action does not appear on the Resource List, select sysplanar0. 12. Run diagnostics on all resources. Was a problem reported? NO The repair is complete. Go to MAP 0410: Repair Checkout, located in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. YES Go to “Step 1230-11” on page 165.Step 1230-14 The FRUs can be hot-swapped. If you do not want to use the hot-swap capabilities of an identified hot-swap FRU, go to “Step 1230-10” on page 164.Note: See the “Hot-Pluggable FRUs” on page 842 for a list of hot-swap devices used in your system. For a procedure to replace a hot-plug (swap) PCI adapter go to “PCI Adapters” on page 914. 1. Refer to the last character in the SRN. A 1, 3, 5, or 7 indicates that all FRUs listed on the Problem Report Screen must be replaced. For SRNs ending with any other character, exchange one FRU at a time, in the order listed.Note: Although some components are physically hot-swappable, there are no procedures available to determine if a repair has been successful while the Linux operating system is running. If you desire to verify the repair with diagnostics, you must shutdown the system and run the AIX standalone diagnostics. 2. After replacing a hot-swappable FRU, examine the HMC or the platform error log to determine if the error has reappeared. You may need to wait 30 minutes or longer before the error is written back to the HMC or platform error log. Was a problem reported? NO The repair is completed. Return the system to the customer. YES Go to “Step 1230-15” on page 167. 166 406/90 Service Guide Step 1230-15 Is the problem the same as the original problem? NO The symptom has changed. Check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1020: Problem Determination” on page 158 and get a new SRN. YES Go to “Step 1230-16.”Step 1230-16 Look at the physical location codes and FRU part numbers you recorded. Have you exchanged all the FRUs that were listed? NO Go to “Step 1230-17.” YES The SRN did not identify the failing FRU. Call your support person for assistance.Step 1230-17 1. Remove the new FRU and install the original FRU. 2. Exchange the next hot-swappable FRU in the list. If there are more FRUs in the list that are not hot-swappable, go to “Step 1230-10” on page 164 to replace the rest of the FRUs, if required.Note: Although some components are physically hot-swappable, there are no procedures available to determine if a repair has been successful while the Linux operating system is running. If you desire to verify the repair with diagnostics, you must shutdown the system and run the AIX standalone diagnostics. 3. After replacing a hot-swappable FRU, examine the HMC or the platform error log to determine if the error has reappeared. You may need to wait 30 minutes or longer before the error is written back to the HMC or platform error log. Was a problem reported? NO The repair is complete. Return the system to the customer. YES Go to “Step 1230-15.” Chapter 3. Maintenance Analysis Procedures 167 MAP 1235: System Array Self-Repair Problem Resolution for Linux Purpose of This MAP Use this MAP to resolve problems reported by SRNS A11-560 to A11-580. Note: The following steps may require that the system be rebooted to invoke Array bit steering, so you may wish to schedule deferred maintenance with the system administrator to arrange a convenient time to reboot their system. Step 1235-1 Was the SRN A11-560? NO Go to “Step 1235-3.” YES Go to “Step 1235-2.”Step 1235-2 Were there any other errors on the resource reporting the array bit steering problem? NO Go to “Step 1235-4.” YES Resolve those errors before proceeding.Step 1235-3 Were there any other errors on procx? NO Go to “Step 1235-4.” YES Resolve those errors before proceeding.Step 1235-4 Schedule deferred Maintenance with the customer. When it is possible, reboot the system to invoke Array Bit steering Go to “Step 1235-5.” Step 1235-5 Reboot the system. To determine if the original error code or SRN does not reproduce, examine the problem Linux partition with the HMC, or by listing errors in the platform error log. 1. If there is no problem on the resource that originally reported the problem, then array bit steering was able to correct the problem. Go to MAP 0410: Repair Checkout, located in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. 2. If a problem is reported use the SRN to resolve the problem. Go to “Entry MAP” on page 149 with the new symptom. 168 406/90 Service Guide MAP 1320: Service Focal Point These procedures define the steps to take when servicing a machine that is equipped with Service Focal Point (SFP). You might also be directed to these procedures if the system is running Linux as its only operating system. Note: If the system is attached to an HMC and Service Focal Point is enabled, the various codes that might display on the HMC are all listed as error codes by Service Focal Point (SFP). To assist you in identifying the types of error data in this guide, use the following table. Number of Digits in Error Code Error code Name Used in this Service Guide Any Contains # (pound sign) Menu Goal Any Contains - (hyphen) SRN 5 Does not contain # or - SRN 6 Does not contain # or - Error Code 8 Does not contain # or - Error Code, Service Reference Code (SRC) For steps to be taken when beginning service on a machine with SFP, see the “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170. For steps to be taken after service is complete on a machine with SFP, see the “MAP 1322: End of Call MAP for Systems with Service Focal Point” on page 183. Chapter 3. Maintenance Analysis Procedures 169 MAP 1321: Quick Entry MAP for Systems with Service Focal Point Notes: 1. If you are working on a system that is managed by an HMC, the term service action event is synonymous with serviceable event in this MAP. 2. If you are working on a system that is managed by an HMC, some steps in this MAP may ask you to open the details of the service action event and obtain error codes and FRU location codes. Depending on the error code(s), this operation may involve looking at service processor error code details. Be sure you select all listed error codes one at a time. Select details to display all error codes and location codes for the serviceable event that you are working with.Step 1321-1 For future reference, record the error code and the location code (that brought you to this MAP) of the FRU to be replaced. Record any other symptoms if an error code is not available. Does the system have an HMC attached? No Go to “Step 1321-23” on page 173. Yes Go to “Step 1321-2.”Step 1321-2 At the HMC console, start Service Focal Point (SFP) and examine the service action event log for any open service action events. For SFP operating instructions, see “Service Focal Point” on page 4. Does the HMC appear to operate correctly? No Go to “Quick Entry MAP” on page 151, look up the HMC symptom, and follow the directions there. Yes Go to “Step 1321-3.”Step 1321-3 Are there any service action events that are open? No Go to “Step 1321-40” on page 181. Yes Go to “Step 1321-4.”Step 1321-4 Record the list of open service action events. Step 1321-5 From the list of serviceable events recorded in “Step 1321-4,” open the details of each service action event and examine the FRU list and error code(s) for each. Step 1321-6 Are there any serviceable actions events with error codes of the form #xxxxxx in the list of error codes found in “Step 1321-5”? No Go to “Step 1321-7” on page 171. Yes Go to “Step 1321-18” on page 172. 170 406/90 Service Guide Step 1321-7 Are there any serviceable events with FRU location codes reported in “Step 1321-5” on page 170? No Go to “Step 1321-14” on page 172. Yes Go to “Step 1321-8.”Step 1321-8 From the list of open service action events with FRUs reported, examine the details of each service action event and record the error code. Step 1321-9 Examine the list of error codes you obtained from “Step 1321-8” and look for any of the form 4xxB xxxx, A0D-34x, or A1D-34x. Step 1321-10 Are there any error codes from “Step 1321-8” of the form 4xxB xxxx, A0D-34x, or A1D-34x? No Go to “Step 1321-13.” Yes Go to “Step 1321-11.”Step 1321-11 Do any of the error codes from “Step 1321-8” of the form 4xxB xxxx, A0D-34x, or A1D-34x have the same first two characters on the left as the first two characters on the left of the error code recorded in “Step 1321-1” on page 170 that sent you to this MAP? No Go to “Step 1321-13.” Yes Record the error codes from “Step 1321-8” of the form 4xxB xxxx, A0D-34x, or A1D-34x that have the same first two characters on the left as the first two characters on the left of the error code recorded in “Step 1321-1” on page 170 that sent you to this MAP. Also, record the service action event associated with each error code. Go to “Step 1321-12.”Step 1321-12 From the list of open service action events with error codes of the form 4xxB xxxx, A0D-34x, or A1D-34x recorded in “Step 1321-11,” choose the service action event with the earliest time stamp (the one that occurred first), record its error code and FRU location(s), and proceed to “Step 1321-14” on page 172 with this new code. When future steps refer to the error code recorded in “Step 1321-1” on page 170, use this new error code and its location code(s) instead of the original error code that sent you to this MAP. Step 1321-13 From the list of open service action events with FRUs reported, choose the service action event with the earliest time stamp (the one that occurred first), record its error code and FRU location(s), and proceed to “Step 1321-14” on page 172 with this new error code. When future steps refer to the error code recorded in “Step 1321-1” on page 170, use this new error code and its location code(s) instead of the original error code that sent you to this MAP. Chapter 3. Maintenance Analysis Procedures 171 Step 1321-14 At the HMC console, open Service Focal Point (SFP) and examine the service action event log for any closed service action events. For SFP operating instructions, refer to “Service Focal Point” on page 4. Step 1321-15 Are there any service action events that are closed? No Go to the table in “Entry MAP” on page 149. Yes Go to “Step 1321-16.”Step 1321-16 Examine the details of the ten most-recently closed service action events. Are there any error codes or symptoms in the ten latest closed service action events that are the same as the error code or symptom recorded in “Step 1321-1” on page 170? No Go to the table in “Entry MAP” on page 149. Yes Go to “Step 1321-17.”Step 1321-17 Examine the FRUs of the same error codes or symptoms. Are there any FRUs in the same error codes or symptoms of the closed service action events left to be replaced? No Go to the table in “Entry MAP” on page 149. Yes Replace the next FRU of same error codes or symptoms of the closed service action events. Go to ?MAP 0410: Repair Checkout? in Diagnostic Information for Multiple Bus Systems.Step 1321-18 From the list of serviceable events with error code(s) of the form #xxxxxx, examine the details of each serviceable event to display the description field of the error code(s) of the form #xxxxxx. Step 1321-19 Double-click on the description field to view the complete text. Step 1321-20 Perform any actions that may be listed in the descriptions of all error codes of the form #xxxxxx before proceeding. Step 1321-21 Did you find a problem? No Make a note of the eight character error code number in the bottom of the description field. Use this error code when you are directed to the “Entry MAP” on page 149 later in this MAP. Go to “Step 1321-7” on page 171. Yes Go to ?MAP 0410: Repair Checkout? in Diagnostic Information for Multiple Bus Systems. 172 406/90 Service Guide Step 1321-22 Look for and record any additional error information or software messages on the operator panel or the console (serial TTY, Vterm, or HMC). Choose a running Linux partition (preferably the partition with the problem). Step 1321-23 Is Linux usable in any partition with Linux installed? No Go to “Step 1321-30” on page 178.Yes Go to “Step 1321-24.” Attention: “Step 1321-24” through “Step 1321-36” on page 180 must be performed on a partition running the Linux operating system. Step 1321-24 1. If your system is functional, determine your level of Linux by logging in to the system as the root user and entering the following command: ls -l /var/log/platform If the /var/log/platform file exists, go to substep 3. 2. Examine the Linux system log by entering the following command: cat /var/log/messages |grep RTAS |more Linux run-time RTAS error messages are logged in the messages file under /var/log. The following is an example of the Linux system RTAS error log messages. Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan begin -------- Aug 27 18:13:41 rasler kernel: RTAS: Location Code: U0.1-P1-C1 Aug 27 18:13:41 rasler kernel: RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL Aug 27 18:13:41 rasler kernel: RTAS: initiator: UNKNOWN target: UNKNOWN Aug 27 18:13:41 rasler kernel: RTAS: Status: predictive new Aug 27 18:13:41 rasler kernel: RTAS: Date/Time: 20020827 18134000 Aug 27 18:13:41 rasler kernel: RTAS: CPU Failure Aug 27 18:13:41 rasler kernel: RTAS: CPU id: 0 Aug 27 18:13:41 rasler kernel: RTAS: Failing element: 0x0000 Aug 27 18:13:41 rasler kernel: RTAS: A reboot of the system may correct the problem Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan end ---------- Go to “Step 1321-25” on page 176. Chapter 3. Maintenance Analysis Procedures 173 3. Record that, for this Linux partition, you performed substep 3 of “Step 1321-24” on page 173 for later steps. Use the following command to list diagela messages recorded in the Linux system log:cat /var/log/platform |grep diagela |moreLinux run-time diagela error messages are logged in the platform file under /var/log.The following is an example of the Linux system error log diagela messages.Aug 13 09:38:45 larry diagela: 08/13/2003 09:38:44Aug 13 09:38:45 larry diagela: Automatic Error Log Analysis has detected a problem. Aug 13 09:38:45 larry diagela: Aug 13 09:38:45 larry diagela: The Service Request Number(s)/Probable Cause(s) Aug 13 09:38:45 larry diagela: (causes are listed in descending order of probability): Aug 13 09:38:45 larry diagela: Aug 13 09:38:45 larry diagela: 651-880: The CEC or SPCN reported an error. Report the SRN and the following reference and physical location codes to your service provider. Aug 13 09:38:45 larry diagela: Location: n/a FRU: n/a Ref-Code: B1004699 Aug 13 09:38:45 larry diagela: Aug 13 09:38:45 larry diagela: Analysis of Error log sequence number: 3 Aug 29 07:13:04 larry diagela: 08/29/2003 07:13:04 Aug 29 07:13:04 larry diagela: Automatic Error Log Analysis has detected a problem. Aug 29 07:13:04 larry diagela: Aug 29 07:13:04 larry diagela: The Service Request Number(s)/Probable Cause(s) Aug 29 07:13:04 larry diagela: (causes are listed in descending order of probability): Aug 29 07:13:04 larry diagela: Aug 29 07:13:04 larry diagela: 651-880: The CEC or SPCN reported an error. Report the SRN and the following reference and physical location codes to your service provider. Aug 29 07:13:04 larry diagela: Location: U0.1-F4 FRU: 09P5866 Ref-Code: 10117661 Aug 29 07:13:04 larry diagela: Aug 29 07:13:04 larry diagela: Analysis of /var/log/platform sequence number: 24 Sep 4 06:00:55 larry diagela: 09/04/2003 06:00:55 Sep 4 06:00:55 larry diagela: Automatic Error Log Analysis reports the following: Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: 651204 ANALYZING SYSTEM ERROR LOG Sep 4 06:00:55 larry diagela: A loss of redundancy on input power was detected. Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: Check for the following: Sep 4 06:00:55 larry diagela: 1. Loose or disconnected power source connections. Sep 4 06:00:55 larry diagela: 2. Loss of the power source. Sep 4 06:00:55 larry diagela: 3. For multiple enclosure systems, loose or Sep 4 06:00:55 larry diagela: disconnected power and/or signal connections Sep 4 06:00:55 larry diagela: between enclosures. Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: Supporting data: Sep 4 06:00:55 larry diagela: Ref. Code: 10111520 Sep 4 06:00:55 larry diagela: Location Codes: P1 P2 Sep 4 06:00:55 larry diagela: Sep 4 06:00:55 larry diagela: Analysis of /var/log/platform sequence number: 13 174 406/90 Service Guide 4. Also use the following command to list RTAS messages recorded in the Linux system log: cat /var/log/platform |grep RTAS |more Linux RTAS error messages are logged in the platform file under /var/log. The following is an example of the Linux system error log RTAS messages. Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event begin -------- Aug 27 12:16:33 larry kernel: RTAS 0: 04440040 000003f8 96008508 19155800 Aug 27 12:16:33 larry kernel: RTAS 1: 20030827 00000001 20000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 2: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 3: 49424d00 55302e31 2d463400 00503034 Aug 27 12:16:33 larry kernel: RTAS 4: 10117661 04a0005d 10110000 00000000 Aug 27 12:16:33 larry kernel: RTAS 5: 00007701 000000e0 00000003 000000e3 Aug 27 12:16:33 larry kernel: RTAS 6: 00000000 01000000 00000000 31303131 Aug 27 12:16:33 larry kernel: RTAS 7: 37363631 20202020 20202020 55302e31 Aug 27 12:16:33 larry kernel: RTAS 8: 2d463420 20202020 20202020 03705a39 Aug 27 12:16:33 larry kernel: RTAS 9: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 10: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 11: 00000000 00000000 00000000 00000000Aug 27 12:16:33 larry kernel: RTAS 12: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 13: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 14: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 15: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 16: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 17: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 18: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 19: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 20: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 21: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 22: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 23: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 24: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 25: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 26: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 27: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 28: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 29: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 30: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 31: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 32: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 33: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 34: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 35: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 36: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 37: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 38: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 39: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 40: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 41: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 42: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 43: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 44: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 45: 00000000 00000000 00000000 00000000 Chapter 3. Maintenance Analysis Procedures 175 Aug 27 12:16:33 larry kernel: RTAS 46: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 47: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 48: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 49: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 50: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 51: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 52: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 53: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 54: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 55: 00000000 00000000 00000000 00000000Aug 27 12:16:33 larry kernel: RTAS 56: 00000000 00000000 00000000 00000000Aug 27 12:16:33 larry kernel: RTAS 57: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 58: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 59: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 60: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 61: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 62: 00000000 00000000 00000000 00000000 Aug 27 12:16:33 larry kernel: RTAS 63: 00000000 00000000 00000000 00020000 Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event end ---------- Error codes and location codes may appear as RTAS messages. The extended data is also provided in the form of an RTAS message. The extended data contains other error code words that help in isolating the correct FRUs. The start of the extended data is marked, for example, by the line Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event begin --------. The number after the colon is a sequence number that correlates this data any diagela data with the same sequence number. The end of the extended data is marked by the line Aug 27 12:16:33 larry kernel: RTAS: 15 -------- RTAS event end ---------- with the same sequence number. Word 13 and word 19 are found in the RTAS messages. For example, to find word 13, first find the error code in the left column of words of the extended data, 10117661. In this example, we find the error code to the right of ?RTAS 4:?. This is also word 11. To get word 13, 10110000, simply count the words left-to-right, beginning at word 11. Step 1321-25 If you performed substep 2 of “Step 1321-24” on page 173, then record any RTAS messages found in the Linux system log in “Step 1321-24” on page 173. If you performed substep 3 of “Step 1321-24” on page 173, then record any RTAS and diagela messages found in the Linux system log in “Step 1321-24” on page 173, and also record any extended data found in the RTAS messages, especially word 13 and word 19. Ignore all other messages in the Linux system log. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-24” on page 173 and “Step 1321-25”for all logical partitions that have Linux installed. 176 406/90 Service Guide Step 1321-26 If you performed substep 3 of “Step 1321-24” on page 173 for the current Linux partition, go to “Step 1321-27,” and when asked in “Step 1321-27,” do not record any RTAS messages from “Step 1321-26” for the current Linux partition. Examine the Linux boot (IPL) log by logging in to the system as the root user and entering the following command: cat /var/log/boot.msg /var/log/dmesg 2>/dev/null | grep RTAS |more Linux boot (IPL) error messages are logged into the boot.msg file or the dmesg file under /var/log. The following is an example of the Linux boot error log. RTAS daemon started RTAS: -------- event-scan begin -------- RTAS: Location Code: U0.1-F3 RTAS: WARNING: (FULLY RECOVERED) type: SENSOR RTAS: initiator: UNKNOWN target: UNKNOWN RTAS: Status: bypassed new RTAS: Date/Time: 20020830 14404000 RTAS: Environment and Power Warning RTAS: EPOW Sensor Value: 0x00000001 RTAS: EPOW caused by fan failure RTAS: -------- event-scan end ---------- Step 1321-27 Record any RTAS messages found in the Linux boot (IPL) log in Step 1321-26. Ignore all other messages in the Linux boot (IPL) log. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-26” and “Step 1321-27” for all logical partitions that have Linux installed. Chapter 3. Maintenance Analysis Procedures 177 Step 1321-28 If you performed substep 3 of “Step 1321-24” on page 1731321-24 for the current Linux partition, go to “Step 1321-29,” and when asked in “Step 1321-29,” do not record any additional extended data from “Step 1321-26” on page 177 for the current Linux partition. Examine the extended data in both logs. The following is an example of the Linux extended data. <3>RTAS daemon started <3>RTAS: -------- event-scan begin -------- <3>RTAS: Location Code: U0.1-P1-C2 <4>RTAS: Log Debug: 04 4b2726fb04a00011702c0014000000000000000000000000f1800001001801d3ffffffff01000000000000004234313820202020383030343236464238454134303030303 030303030303030 <4>RTAS: Log Debug: D25046413405020d0a000001000271400100000033434d502044415441000001000000000000010000f1800001535953204441544100000000000000002002162715010509200216271501050920020637150105095352432044415441702c001400000000000000020018820201d3820000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000280048ea400000000000000000000000000000000000000004350542044415441702cff08000000001c000000702cf0080000000080000000702cf100702cf200702c000400000800702c01040bf2002e702c02040c1fffbf702c0300702c1000702c11040bf2002e702c12040c1fffbf702c1300702ca000702ca108000000000000a03c702ca208000000000000effc702cb000702cb108000000000000a03c702cb208000000000000effc702cc000702cc108000000000000a03c702cc208000000000000effc702c3000702c31080000000000000003702c3208000000000000007b702c8000702c81080000000020e27a39702c820800000000fffeffff702cd000702cd1080000000010004010702cd208000000007777f3ffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffff<3>RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL <3>RTAS: initiator: UNKNOWN target: UNKNOWN <3>RTAS: Status: unrecoverable new <3>RTAS: Date/Time: 20020905 15372200 <3>RTAS: CPU Failure <3>RTAS: Internal error (not cache) <3>RTAS: CPU id: 0 <3>RTAS: Failing element: 0x0000 <3>RTAS: -------- event-scan end --------- Step 1321-29 Record any extended data found in the Linux system log in Step 1321-24 or the Linux boot (IPL) log in Step 1321-26. Be sure to record word 13. Note: The line(s) in the Linux extended data that begin with ?<4>RTAS: Log Debug: 04? contain the error code listed in the next 8 hex characters. In the previous example, ?4b27 26fb? is an error code. The error code is also known as word 11. Each 4 bytes after the error code in the Linux extended data is another word (for example, ?04a0 0011? is word 12, and ?702c 0014? is word 13, and so on). If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-28” and “Step 1321-29” for all logical partitions that have Linux installed. Step 1321-30 Were any error codes or checkpoints recorded in Steps 1321-1, 1321-22, 1321-25, 1321-27, or 1321-29? No Go to “Step 1321-31” on page 179. Yes Go to the “Quick Entry MAP” on page 151 with each recorded error code or symptom. Perform the indicated actions one at a time for each error code until the problem has been corrected. If all recorded error codes have been processed and the problem has not been corrected, go to “Step 1321-31” on page 179. 178 406/90 Service Guide Step 1321-31 If no additional error information is available and the problem has not been corrected, shut down the system. If an HMC is not attached, you need a laptop computer (and cable, part number 62H4857) capable of connecting to serial port 1 on the processor subsystem. Note: The Linux login prompt cannot be seen on a laptop computer connected to serial port 1. 1. If there is no HMC, attach the laptop computer and cable to serial port 1 on the processor subsystem. 2. When OK displays in the operator panel, press a key on the virtual terminal on the laptop computer. The service processor menus are available on the attached laptop computer or on the HMC. 3. Examine the service processor (SP) error log. Record all error codes and messages written to the SP error log. Go to “Step 1321-32.” 4. If the service processor menus are not available, perform the following steps: a. Examine and correct all connections to the service processor. b. Replace the service processor.Note: The service processor might be contained on a separate card or book. Or in some systems, the service processor is built into the service processor/PCI backplane, or into the system backplane. c. If you have not already replaced the system backplane in an earlier step, replace the system backplane. Note: Call service support before replacing the system backplane.Step 1321-32 Boot the system to the system management services (SMS) menus. Record all error codes and messages written to the SMS error log. Step 1321-33 Were any error codes recorded in steps 1321-31 or 1321-32? No Go to “Step 1321-41” on page 181. Yes Go to the “Quick Entry MAP” on page 151 with each error code or symptom you have recorded. Perform the indicated actions, one at a time, until the problem has been corrected. If all recorded error codes have been processed and the problem has not been corrected, go to “Step 1321-41” on page 181.Step 1321-34 Reboot the system and and bring all partitions to the login prompt. If Linux is not usable in all partitions, go to “Step 1321-38” on page 181. Step 1321-35 Use the lscfg command to list all resources assigned to all partitions. Record the adapter and the partition for each resource. Chapter 3. Maintenance Analysis Procedures 179 Step 1321-36 To determine if any device(s) or adapter(s) are missing, compare the list of found resources and partition assignments to the customer’s known configuration. Record the location of any missing devices. Also record any differences in the descriptions or the locations of devices. You may also compare this list of found resources to a prior version of the device tree as follows: Note: Every time that the partition is booted, the update-device-tree command is run and the device tree is stored in the /var/lib/lsvpd/ directory in a file with the file name device-tree-YYYY-MM-DD-HH:MM:SS, where YYYY is the year, MM is the month, DD is the day, and HH, MM, and SS are the hour, minute and second, respectively, of the date of creation. v At the command line, type the following: cd /var/lib/lsvpd/ v At the command line, type the following: lscfg -vpd device-tree-2003-03-31-12:26:31. This displays the device tree created on 03/31/2003 at 12:26:31. The diff command offers a way to compare the output from a current lscfg command to the output from an older lscfg command. If the files names for the current and old device trees are current.out and old.out, respectively, type: diff old.out current.out. Any lines that exist in the old, but not in the current will be listed and preceded by a less-than symbol (<). Any lines that exist in the current, but not in the old will be listed and preceded by a greater-than symbol (>). Lines that are the same in both files are not listed; for example, files that are identical will produce no output from the diff command. If the location or description changes, lines preceded by both < and > will be output. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-35” on page 179 and “Step 1321-36” for all logical partitions that have Linux installed. Step 1321-37 Was the location of one and only one device recorded in “Step 1321-36”? No If you previously answered Yes to “Step 1321-37,” return the system to its original configuration. This ends the procedure. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. If you did not previously answer Yes to “Step 1321-37,” go to “Step 1321-38” on page 181. Yes Perform the following steps one at a time. Power off the system before performing each step. After performing each step, power on the system and go to “Step 1321-34” on page 179. 1. Check all connections from the system to the device. 2. Replace the device (for example, tape or DASD) 3. If applicable, replace the device backplane. 4. Replace the device cable. 5. Replace the adapter. 6. If the adapter resides in an I/O drawer, replace the I/O backplane. 7. If the device adapter resides in the CEC, replace the I/O riser card, or the CEC backplane where the device adapter is located. 8. Call service support. Do not go to “Step 1321-34” on page 179. 180 406/90 Service Guide Step 1321-38 Does the system appear to stop or hang before reaching the login prompt or did you record any problems with resources in “Step 1321-36” on page 180?Note: If the system console or VTERM window is always blank, choose NO. If you are sure the console or VTERM is operational and connected correctly, answer the question for this step. No Go to “Step 1321-39.”Yes There may be a problem with an I/O device. Go to “MAP 1542: I/O Problem Isolation” on page 318. In MAP 1542, when instructed to boot the system, boot a full system partition.Step 1321-39 Boot the AIX standalone diagnostics, and run diagnostics in problem determination mode on all resources. Be sure to boot a full system partition. Ensure that AIX diagnostics was run on all known resources. You may need to select each resource individually and run diagnostics on each resource one at a time. Did AIX standalone diagnostics find a problem? No Go to “Step 1321-43” on page 182.Yes Go to the table in “Entry MAP” on page 149 with each error code or SRN you have recorded. Until the problem has been corrected, perform the indicated actions, one at a time, for each SRN or error code not already processed in “Step 1321-37” on page 180. If all recorded SRNs or error codes have been processed and the problem has not been corrected, go to “Step 1321-43” on page 182.Step 1321-40 Does the system have Linux installed on one or more partitions? No Go to “Step 1321-14” on page 172.Yes Go to “Step 1321-22” on page 173.Step 1321-41 Were any location codes recorded in Steps 1321-1, 1321-22, 1321-25, 1321-27, 1321-29, 1321-31, or 1321-32? No Go to “Step 1321-34” on page 179.Yes Replace, one at a time, all parts whose location code was recorded in Steps 1321-1, 1321-22, 1321-25, 1321-27, 1321-29, 1321-31, and 1321-32 that have not been replaced. Power off the system before replacing a part. After replacing the part, power on the system to check if the problem has been corrected. Go to “Step 1321-42” on page 182 when the problem has been corrected, or all parts in the location codes list have been replaced. Chapter 3. Maintenance Analysis Procedures 181 Step 1321-42 Was the problem corrected in Step 1321-41? No Go to “Step 1321-34” on page 179.Yes Return the system to its original configuration. This ends the procedure. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1321-43 Were any other symptoms recorded in step 1321-1? No Call support.Yes Go to the “Quick Entry MAP” on page 151 with each symptom you have recorded. Perform the indicated actions for all recorded symptoms, one at a time, until the problem has been corrected. If all recorded symptoms have been processed and the problem has not been corrected, call support. 182 406/90 Service Guide MAP 1322: End of Call MAP for Systems with Service Focal Point Note: If you are working on a system that not managed by an HMC go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems Step 1322-1 For future reference, record the SRC or symptom and the location code of the FRU you replaced. Step 1322-2 For Service Focal Point operating instructions, refer to “Service Focal Point” on page 4. At the HMC, open Service Focal Point and examine the service action event log for any open service action events. Step 1322-3 Are there any service action events that are open? No If the system attention LED is still on, use the HMC to turn off the LED as described in “Resetting the System Attention LED” on page 22. This completes the repair, return the system to the customer. Yes Go to “Step 1322-4”.Step 1322-4 Record the list of open service action events. Step 1322-5 From the list of serviceable events recorded in “Step 1322-4,” perform “Step 1322-6” through “Step 1322-33” on page 186 for each open service action event. Step 1322-6 Determine the error class of the serviceable event. Record for future use. Step 1322-7 Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in ?Step 1322-1?? No Go to “Step 1322-8.” Yes Go to “Step 1322-11” on page 184.Step 1322-8 Examine the FRU list of the service action event. Are there any FRUs listed for the service action event? No Go to “Step 1321-11” on page 171. Yes Go to “Step 1321-9” on page 171. Chapter 3. Maintenance Analysis Procedures 183 Step 1322-9 Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in “Step 1322-1” on page 183? No Go to “Step 1322-10.” Yes Go to “Step 1322-11.”Step 1322-10 The FRU list is different. Is the FRU you replaced and recorded in “Step 1322-1” on page 183 in the list of FRUs for this service action event? No Go to “Step 1322-33” on page 186.Note: There are service action events that will remain open when you leave this MAP. Further service actions may be required to complete the repair.Yes Go to “Step 1322-11.”Step 1322-11 Step 1322-12 Is the error code associated with this service action event of the form A11-xxx or A01-xxx? No Go to “Step 1322-17” on page 185. Yes Go to “Step 1322-13.”Step 1322-13 Have you begun a list of “Axx” partitions from prior service action events that you processed in this MAP? No Go to “Step 1322-14.” Yes Go to “Step 1322-15.”Step 1322-14 Step 1322-15 Add the partition list obtained in “Step 1322-11” to the existing list of “Axx” partitions obtained from processing previous service action events in this MAP. 184 406/90 Service Guide Step 1322-16 Remove all entries in the list of all partition(s) you recorded in “Step 1322-11” on page 184. If you are referred to the list of partition(s) obtained in “Step 1322-11” on page 184 in future steps, the list is empty. Go to “Step 1322-17.” Step 1322-17 Select and highlight the service action event from the “Error Associated With This Serviceable Event” window. Step 1322-18 Click Close Event. Step 1322-19 Add comments for the serviceable event. Include any unique additional information. Click OK. Step 1322-20 The following steps will add or update FRU information. Step 1322-21 Did you replace, add, or modify a FRU of the open service action event? No Go to “Step 1322-23.” Yes Go to “Step 1322-22.”Step 1322-22 From the FRU list, select a FRU that you need to update. Double-click on the FRU, and update the FRU information. Go to “Step 1322-24.” Step 1322-23 Select the No FRU Replaced for this Serviceable Event option. Step 1322-24 Click OK to close the service action event. Step 1322-25 Is the list of all partition(s) you recorded in “Step 1322-11” on page 184 empty? No Go to “Step 1322-26.” Yes Go to “Step 1322-33” on page 186.Step 1322-26 Does the list of all partition(s) you recorded in “Step 1322-11” on page 184 contain more than one entry? No Go to “Step 1322-33” on page 186. Yes Go to “Step 1322-27” on page 186. Chapter 3. Maintenance Analysis Procedures 185 Step 1322-27 Is the error class recorded in “Step 1322-6” on page 183 AIX? No Go to “Step 1322-33.” Yes Go to “Step 1322-28.”Step 1322-28 Perform the following steps for each entry in the list of all partition(s) you recorded in “Step 1322-11” on page 184, except the partition you were using to debug the original problem. Step 1322-29 From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. Step 1322-30 1. When the diagnostic operating instructions are displayed, press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click commit after you have made your selection.Step 1322-31 Exit from diagnostics in this partition and return to the AIX prompt. Step 1322-32 Have all the partitions in the list of all partition(s) you recorded in “Step 1322-11” on page 184 been processed? No Go to “Step 1322-28” to process the next partition in the list you recorded in “Step 1322-11” on page 184. Yes Go to “Step 1322-33”.Step 1322-33 Have all the serviceable events recorded in “Step 1322-4” on page 183 been processed? No Go to “Step 1322-5” on page 183 and process the next service action event in the list of serviceable events recorded in “Step 1322-4” on page 183. Yes Go to “Step 1322-34” on page 187. 186 406/90 Service Guide Step 1322-34 While processing all service action events, were you directed to “Step 1322-14” on page 184? No If the system attention LED is still on, use the HMC to turn off the LED as described in “Resetting the LED when a System is Attached To an HMC” on page 22. This completes the repair. Return the system to the customer.Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair.Yes Go to “Step 1322-35.”Step 1322-35 Perform the following steps for each entry in the list of “Axx” partitions you began recording in “Step 1322-14” on page 184, except the partition you were using to debug the original problem. Step 1322-36 From the HMC virtual terminal window of a partition in the list of “Axx” partitions, type diag at the AIX command prompt. Step 1322-37 1. When the diagnostic operating instructions are displayed, press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click commit after you have made your selection.Step 1322-38 Exit from diagnostics in this partition and return to the AIX prompt. Step 1322-39 Have all the partitions in the list of “Axx” partitions(s) you began recording in “Step 1322-14” on page 184 been processed? No Go to “Step 1322-35” to process the next partition in the list you recorded in “Step 1322-14” on page 184. Yes If the system attention LED is still on, use the HMC to turn off the LED as described in “Resetting the LED when a System is Attached To an HMC” on page 22. This completes the repair. Return the system to the customer.Note: If during the processing of the list of open service action events, some service action events remained open, further service actions may be required to complete the repair. Chapter 3. Maintenance Analysis Procedures 187 MAP 1420: Recovery Procedures for Hot-Pluggable PCI Adapters This procedure is used to locate problems that might occur when performing a PCI adapter hot-plug operation. Find the symptom in the following table; then follow the instructions given in the Action column. Recovery Procedures for Hot-Pluggable PCI Adapters Symptom/Reference Code Action Partition will not boot rendering PCI hot-plug manager useless. Go to “MAP 1421: Partition Will Not Boot Due to Faulty Adapter.” Partition boots, but PCI adapter is not recognized and slot status is empty. Go to “MAP 1422: Slot is Empty Even When Populated” on page 189. MAP 1421: Partition Will Not Boot Due to Faulty Adapter Step 1421-1 The entire system must be shut down and powered off to repair. Does the customer wish to restore the partition by isolating the faulty adapter without powering off the entire system? NO Go to “Step 1421-12” on page 189. YES Go to “Step 1421-2.”Step 1421-2 At the HMC, view the resources assigned to the partition that will not boot. Step 1421-3 Remove the faulty resource from the partition that will not boot. Record the slot location and the adapter type. Step 1421-4 Attempt to boot the partition. Does the partition boot? NO Go to “Step 1421-12” on page 189. YES Go to “Step 1421-5.”Step 1421-5 Shut down the partition. 188 406/90 Service Guide Step 1421-6 Is the PCI slot power LED of the location recorded in “Step 1421-3” on page 188 on? NO Go to “Step 1421-7.” YES Go to “Step 1421-12.”Step 1421-7 Are there any empty PCI slots attached to the system that are not assigned to any partitions? NO Go to “Step 1421-12.” YES Go to “Step 1421-8.”Step 1421-8 Assign the available empty and unassigned slot to the partition that will not boot. Step 1421-9 Insert a new adapter of the same type recorded in “Step 1421-3” on page 188 into the slot assigned in “Step 1421-8.” Step 1421-10 Move any external cabling from the adapter in the slot recorded in “Step 1421-3” on page 188 to the adapter in the slot assigned in “Step 1421-8.” Step 1421-11 Attempt to boot the partition. Does the partition boot? NO Go to “Step 1421-12.” YES Partition and all resources are operational. Schedule repair for the slot recorded in “Step 1421-3” on page 188 with the customer. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure.Step 1421-12 Cannot isolate the adapter without powering off the entire system. Schedule repair for the slot recorded in “Step 1421-3” on page 188 with the customer. This ends the procedure. MAP 1422: Slot is Empty Even When Populated Step 1422-1 Record the I/O drawer and slot location of the slot that is populated with an adapter but shows empty in the PCI hot-plug manager. Also record the adapter type. Chapter 3. Maintenance Analysis Procedures 189 Step 1422-2 Is the PCI slot power LED of the location recorded in “Step 1422-1” on page 189 on? NO Go to “Step 1422-3.” YES Go to “Step 1422-7.”Step 1422-3 Disconnect any external cables from the adapter identified in “Step 1422-1” on page 189. Step 1422-4 Remove the adapter identified in “Step 1422-1” on page 189. Step 1422-5 Follow the procedure “Installing a New Hot-Pluggable PCI Adapter” on page 923 to add a new adapter at the location recorded in “Step 1422-1” on page 189. The new adapter should be the same type as that removed in “Step 1422-1” on page 189. Return to “Step 1422-6” after the new adapter is added. Step 1422-6 Did the PCI hot-plug procedure successfully add a new adapter at the location recorded in “Step 1422-1” on page 189? NO Go to “Step 1422-7.” YES Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure.Step 1422-7 Shut down the partition that the faulty slot is assigned to. Step 1422-8 Is the PCI slot power LED of the location recorded in “Step 1422-1” on page 189 on? NO Go to “Step 1422-9.” YES Go to “Step 1422-14” on page 191Step 1422-9 Are there any empty PCI slots attached to the system that are not assigned to any partitions? NO Go to “Step 1422-14” on page 191. YES Go to “Step 1422-10.”Step 1422-10 Assign the available empty and unassigned slot to the partition that the faulty slot is assigned to. 190 406/90 Service Guide Step 1422-11 Insert a new adapter of the same type recorded in “Step 1422-1” on page 189 into the slot assigned in “Step 1422-10” on page 190. Step 1422-12 Move any external cabling from the adapter in the slot recorded in “Step 1422-1” on page 189 to the adapter in the slot assigned in “Step 1422-10” on page 190. Step 1422-13 Attempt to boot the partition. Does the partition boot?. NO Go to “Step 1422-14.” YES Partition and all resources are operational. Schedule repair with the customer for the slot recorded in “Step 1422-1” on page 189. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure.Step 1422-14 You cannot repair the adapter without powering off the entire system. Schedule repair with the customer for the slot recorded in “Step 1422-1” on page 189. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 191 MAP 1520: Power This procedure is used to locate power problems in the processor subsystem, I/O subsystem(s) or rack If a problem is detected, this procedure helps you isolate the problem to a failing unit. Observe the following safety notices during service procedures. DANGERAn electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines.D06 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock.C01 CAUTION: Energy hazard, remove power before servicing.C22 Attention: To identify a FRU, these MAPs refer to FRU Identify LEDs. Information about the FRU Identify LEDs can be found in “FRU Identify LEDs” on page 65. 192 406/90 Service Guide Find the symptom in the following table; then follow the instructions given in the Action column. Power Problems Symptom/Reference Code Action The system will not power on and no error codes are available Go to “MAP 1521: The System Will Not Power On And No Error Codes Are Available” on page 196. 1011 8700, 1011 8701, 101A 0106, 101B 0106 Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 1014 6014, 1014 6814, 1014 B014, 1014 B814, 1015 6014, 1015 6814, 1015 B014, 1015 B814, 1016 6014, 1016 6814, 1016 B014, 1016 B814,1017 6014, 1017 6814, 1017 B014, 1017 B814, 1021 6014, 1021 6814, 1021 B014, 1021 B814, 1022 6014, 1022 6814, 1022 B014, 1022 B814, 1023 6014, 1023 6814, 1023 B014, 1023 B814, 1024 6014, 1024 6814, 1024 B014, 1024 B814,1025 6014, 1025 6814, 1025 B014, 1025 B814 Go to “MAP 152h: 3.3V Current/Voltage Problem in I/O Subsystem” on page 259. 1014 6024, 1014 6824, 1014 B024, 1014 B824, 1015 6024, 1015 6824, 1015 B024, 1015 B824, 1016 6024, 1016 6824, 1016 B024, 1016 B824,1017 6024, 1017 6824, 1017 B024, 1017 B824, 1021 6024, 1021 6824, 1021 B024, 1021 B824, 1022 6024, 1022 6824, 1022 B024, 1022 B824, 1023 6024, 1023 6824, 1023 B024, 1023 B824, 1024 6024, 1024 6824, 1024 B024, 1024 B824,1025 6024, 1025 6824, 1025 B024, 1025 B824 Go to “MAP 152i: 5.0 V Current/Voltage Problem in I/O Subsystem” on page 267. 1014 6034, 1014 6834, 1014 B034, 1014 B834, 1015 6034, 1015 6834, 1015 B034, 1015 B834, 1016 6034, 1016 6834, 1016 B034, 1016 B834,1017 6034, 1017 6834, 1017 B034, 1017 B834, 1021 6034, 1021 6834, 1021 B034, 1021 B834, 1022 6034, 1022 6834, 1022 B034, 1022 B834, 1023 6034, 1023 6834, 1023 B034, 1023 B834, 1024 6034, 1024 6834, 1024 B034, 1024 B834,1025 6034, 1025 6834, 1025 B034, 1025 B834 Go to “MAP 152j: 2.5 V Current/Voltage Problem in I/O Subsystem” on page 276. 1014 6044, 1014 6844, 1014 B044, 1014 B844, 1015 6044, 1015 6844, 1015 B044, 1015 B844, 1016 6044, 1016 6844, 1016 B044, 1016 B844,1017 6044, 1017 6844, 1017 B044, 1017 B844, 1021 6044, 1021 6844, 1021 B044, 1021 B844, 1022 6044, 1022 6844, 1022 B044, 1022 B844, 1023 6044, 1023 6844, 1023 B044, 1023 B844, 1024 6044, 1024 6844, 1024 B044, 1024 B844,1025 6044, 1025 6844, 1025 B044, 1025 B844 Go to “MAP 152k: 12.0 V Current/Voltage Problem in I/O Subsystem” on page 284. 1014 6054, 1014 6854, 1014 B054, 1014 B854, 1015 6054, 1015 6854, 1015 B054, 1015 B854, 1016 6054, 1016 6854, 1016 B054, 1016 B854,1017 6054, 1017 6854, 1017 B054, 1017 B854, 1021 6054, 1021 6854, 1021 B054, 1021 B854, 1022 6054, 1022 6854, 1022 B054, 1022 B854, 1023 6054, 1023 6854, 1023 B054, 1023 B854, 1024 6054, 1024 6854, 1024 B054, 1024 B854,1025 6054, 1025 6854, 1025 B054, 1025 B854 Go to “MAP 152l: -12.0V Current/Voltage Problem in I/O Subsystem” on page 293. Chapter 3. Maintenance Analysis Procedures 193 Power Problems Symptom/Reference Code Action 1014 CD96, 1014 CDA6, 1014 CDB6, 1014 CDC6,1015 CD96, 1015 CDA6, 1015 CDB6, 1015 CDC6,1016 CD96, 1016 CDA6, 1016 CDB6, 1016 CDC6,1017 CD96, 1017 CDA6, 1017 CDB6, 1017 CDC6,1021 CD96, 1021 CDA6, 1021 CDB6, 1021 CDC6,1022 CD96, 1022 CDA6, 1022 CDB6, 1022 CDC6,1023 CD96, 1023 CDA6, 1023 CDB6, 1023 CDC6,1024 CD96, 1024 CDA6, 1024 CDB6, 1024 CDC6,1025 CD96, 1025 CDA6, 1025 CDB6, 1025 CDC6 Go to “MAP 152n: DASD Subsystem Power Problem” on page 304. 101A FE16 for BPC A or101B FE16 for BPC B Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199 101A 0856 or 101A FE56for BPR 1A 101A 0866 or 101A FE66 for BPR 2A 101A 0876 or 101A FE76 for BPR 3A 101B 0856 or 101B FE56 for BPR 1B 101B 0866 or 101B FE66 for BPR 2B 101B 0876 or 101B FE76 for BPR 3B Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101A FDB6 for BPI A or 101B FDB6 for BPI B Go to “MAP 1524: An Open Room EPO Switch Has Been Detected From One Side” on page 201. 101A 0D06 for BPA A or 101B 0D06 for BPA B Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203. 101A 7A86 or 101A 7B86 for IBF 1A101A 7A96 or 101A 7B96 for IBF 2A101A 7AA6 or 101A 7BA6 for IBF 3A101B 7A86 or 101B 7B86 for IBF 1B101B 7A96 or 101B 7B96 for IBF 2B101B 7AA6 or 101B 7BA6 for IBF 3B Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 1C06, 101B 1C06, 101C 1C06, 1014 1C06, 1015 1C06, 1016 1C06, 1017 1C06, 1021 1C06, 1022 1C06, 1023 1C06 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 101C E006 or 101C E106 Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211. 101A D216 for BPA A or 101B D216 for BPA B Go to “MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure” on page 212. 101A F806 or 101A F906 for BPC Aor101B F806 or 101B F906 for BPC B Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 194 406/90 Service Guide Power Problems Symptom/Reference Code Action 1014 xx15, 1014 xx25, 1015 xx15, 1015 xx25, 1016 xx15, 1016 xx25, 1017 xx15, 1017 xx25, 101C xx15, 101C xx25, 101C xx35, 101C xx45, 101C xx55, 101C xx75, 101C xx85, 101C xx95, 101C xxA5, 101A x115, 101A x125, 101A x135, 101A x215,101A x225, 101A x235, 101A x315, 101A x325,101A x335, 101A x415, 101A x425, 101A x435,101A x515, 101A x525, 101A x535, 101A x615,101A x625, 101A x635, 101A x715, 101A x725,101A x735, 101A x825, 101A x835, 101A x925,101A x935, 101A xA25, 101A xA35,101B x115, 101B x125, 101B x135, 101B x215,101B x225, 101B x235, 101B x315, 101B x325,101B x335, 101B x415, 101B x425, 101B x435,101B x515, 101B x525, 101B x535, 101B x615,101B x625, 101B x635, 101B x715, 101B x725,101B x735, 101B x825, 101B x835, 101B x925,101B x935, 101B xA25, 101B xA35,1021 xx15, 1021 xx25, 1022 xx15, 1022 xx25,1023 xx15, 1023 xx25, 1024 xx15, 1024 xx25,1025 xx15, 1025 xx25 Go to “MAP 152m: Cable Problem in Power Subsystem” on page 300. 101C 6014, 101C 6814, 101C B014, 101C B814 Go to “MAP 152b: 2.5 V Current/Voltage Problem in Processor Subsystem” on page 216. 101C 6114, 101C 6914, 101C B114, 101C B914 Go to “MAP 152c: 1.8 V Current/Voltage Problem in Processor Subsystem” on page 223. 101C 6124, 101C 6924, 101C B124, 101C B924 Go to “MAP 152e: 1.5 V CPU Current/Voltage Problem in Processor Subsystem” on page 240. 101C 6134, 101C 6934, 101C B134, 101C B934 Go to “MAP 152d: 1.5 V Auxiliary Current/Voltage Problem in Processor Subsystem” on page 231. 101C 6144, 101C 6944, 101C B144, 101C B944 Go to “MAP 152f: 3.3 V Current/Voltage Problem in Processor Subsystem” on page 245. 101C 6514, 101C 6D14, 101C B514, 101C BD14 Go to “MAP 152g: 5.0 V Current/Voltage Problem in Processor Subsystem” on page 253. Chapter 3. Maintenance Analysis Procedures 195 MAP 1521: The System Will Not Power On And No Error Codes Are Available Step 1521-1 Ensure that the following are true: v Both power cords are properly connected v The UEPO panel switch is in the ON position v The UEPO BYPASS switches on both BPCs are in the Normal position v The on/off switches located on all the BPRs are in the on (left) positionNote: A switch set to the off position cannot cause this problem, however, any switch found in the off position must be reset to the on position (wait 10 seconds) before proceeding. v The cable from UEPO connector J00 to BPC-A connector J05 and the cable from UEPO connector J01 to BPC-B connector J05 are properly connected without visible sign of damage. v The room temperature is not in excess of the maximum allowed (40 degrees Celsius or 104 degrees Fahrenheit)Note: If the room temperature exceeds the specified maximum, the machine may continually cycle on and off.Are all the above conditions true? NO Correct any problems and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1521-2.”Step 1521-2 Check the LEDs on both sides of the bulk power assembly (BPA). Are all LEDs off on both sides of BPA? NO Go to “Step 1521-3.” YES Go to “Step 1521-9” on page 198Step 1521-3 Check the following LED states on both Bulk Power Controllers (BPCs): v UEPO PWR LED turned on v BPC GOOD LED turned on v All other LEDs are offAre all of the above true? NO Go to “Step 1521-4” on page 197. YES Go to “Step 1521-5” on page 197. 196 406/90 Service Guide Step 1521-4 Independent faults are indicated on both sides of the BPA. Each side must be isolated separately. Call for support. This ends the procedure. Step 1521-5 Check the External (Room) EPO Connection if used. If a room emergency power off (EPO) circuit is used, the external room EPO circuit is connected to the J02 connector on the lower edge of the UEPO Switch. Is a cable connected into the connector J02 on the UEPO Switch? NO Go to “Step 1521-6.” YES Go to “Step 1521-7.”Step 1521-6 Is the internal toggle switch on the EPO panel card set to the ’RM EPO BYPASS’ position? NO Set the internal toggle switch to the ’RM EPO BYPASS’ position, and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Exchange the UEPO card assembly. Follow all the steps in the removal and replacement procedure for the “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. Go to “Step 1521-8” on page 198.Step 1521-7 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. Disconnect the cable from UEPO connector J02, and set the toggle switch to the ’RM EPO BYPASS’ position. Does the UEPO CMPLT LED on at least one BPC become lit? NO Exchange the UEPO card assembly. Follow all the steps in the removal and replacement procedure for the “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. Go to “Step 1521-8” on page 198. YES Notify the customer that the room EPO circuit is defective at this connection and requires service. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 197 Step 1521-8 Is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Call for support. This ends the procedure.Step 1521-9 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimeter to measure the ac voltage at the test points on the frame labeled Phase A and Phase B. Is the ac voltage greater than 180 V ac? NO Inform the customer that the line voltage into the BPA is missing or too low. This ends the procedure. YES Independent faults are indicated on both sides of the BPA. Each side must be isolated separately. Call for support. This ends the procedure. 198 406/90 Service Guide MAP 1522: UEPO Switch On The BPC Is In The Bypass Position Step 1522-1 Check the UEPO Switch on the BPC on the BPA that is exhibiting the failure. Is the switch in the NORMAL position? NO Set UEPO switch on the BPC to the NORMAL position and go to “Step 1522-2.” YES Go to “Step 1522-2.”Step 1522-2 Press the Service Complete button. Is error code 101A FE16 or 101B FE16 displayed? NO The problem has already been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the BPC on the side that is exhibiting the failure. Follow all the steps in the “Bulk Power Controller (BPC)” on page 845. Go to “Step 1522-3.”Step 1522-3 If not done already, press the service complete button. Is error code 101A FE16 or 101B FE16 displayed? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 199 MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault Step 1523-1 Check the position of the on/off switch on the BPR referenced by the error code that sent you to this MAP. Is the switch in the ON position (to the left)? NO Set the switch to the ON position (to the left), and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1523-2.”Step 1523-2 Exchange the following FRUs one at a time: v BPR referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. v BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. v BPE on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849.After each FRU exchanged, is the error code that sent you to this MAP still reported? NO Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Replace the next FRU on the list. 200 406/90 Service Guide MAP 1524: An Open Room EPO Switch Has Been Detected From One Side Step 1524-1 Check the unit emergency power off (UEPO) panel. Is a customer room EPO cable plugged into the UEPO panel? NO Go to “Step 1524-3” on page 202. YES Go to “Step 1524-2” on page 202. Chapter 3. Maintenance Analysis Procedures 201 Step 1524-2 1. Set UEPO switch on both BPCs to the BYPASS position. 2. Unplug the customer room EPO cable. 3. Set switch (J4) on the UEPO panel to EPO BYPASS position. 4. Set UEPO switch on both BPCs back to the NORMAL position. 5. Press the service complete button. Is 101A FDB6 or 101B FDB6 generated? NO Go to “Step 1524-4.” YES Go to “Step 1524-3.”Step 1524-3 Exchange the following FRUs one at a time: 1. UEPO card assembly. Follow all steps in the removal and replacement procedures for “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. 2. Cable from UEPO to BPC on the side referenced by the error code that sent you to this MAP. 3. BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. After each FRU exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support.Step 1524-4 Perform the following actions one at a time: 1. Replace the UEPO card assembly. Follow all steps in the removal and replacement procedures for “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. 2. Check with the customer’s site services to have them test the room EPO switch for correct operation. After each action is performed, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Perform the next action on the list. If all actions on the list have been performed, call for support. 202 406/90 Service Guide MAP 1525: There Is a 350 Volt Bulk Failure Step 1525-1 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimieter to measure the ac voltage at the following test points on the faceplate of the BOA that is referecned by the error code that sent you to this MAP: v Phase A and phase B v Phase B and phase C v Phase C and phase A Are all the readings greater than 180 V ac? NO Go to “Step 1525-7” on page 205. YES Go to “Step 1525-2.”Step 1525-2 Notes: 1. This problem can be caused by any BPR plugged into the failing BPA. 2. The system can have one or two BPRs, depending on the system configuration. 3. Each BPR on the failing BPA must be checked independently. The following steps check all the BPRs on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. Are there any remaining BPRs to be checked? NO Go to “Step 1525-3” on page 204. YES Go to “Step 1525-4” on page 204. Chapter 3. Maintenance Analysis Procedures 203 Step 1525-3 Notes: 1. This problem can be caused by any BPD plugged into the failing BPA. 2. The system can have up to three BPDs, depending on the system configuration. 3. Each BPD on the failing BPA must be checked independently. The following steps check all the BPDs on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. Are there any BPDs remaining to be checked? NO Exchange the following FRUs one at a time: v BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. v Replace the BPE on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support.YES Go to “Step 1525-5” on page 205.Step 1525-4 Note: Be sure you are working on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. FRUs on the operating side of the BPA (with BPC GOOD LED on) must not be disturbed during this procedure. 1. Turn the BPR on/off switch to the off position (to the right). 2. Unplug the BPR to be checked and carefully pull it approximately 25 mm (1 inch) away from the backplane without unplugging any cables. If there is more than one BPR left to be checked, start with the uppermost unchecked BPR and make a note of which BPR you unplugged. Does the BPC GOOD LED on the BPC come on and stay on? NO Replug the BPR and set all the BPR on/off switches on this BPA to the on position (to the left), and continue with “Step 1525-1” on page 203. YES Replace the BPR that is currently unplugged. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. Go to “Step 1525-6” on page 205. 204 406/90 Service Guide Step 1525-5 Note: Be sure you are working on the side of the BPA (A or B) referenced by the error code that sent you to this MAP. FRUs on the operating side of the BPA (with BPC GOOD LED on) must not be disturbed during this procedure. 1. Turn all the BPR on/off switches on this BPA to the off position (to the right). 2. Unplug the BPD to be checked and carefully pull it approximately 25 mm (1 inch) away from the backplane without unplugging any cables. If there is more than one BPD left to be checked, start with the uppermost unchecked BPD and make a note of which BPD you unplugged. 3. Set all the BPR on/off switches on this BPA to the on position (to the left). Does the BPC GOOD LED on the BPC come on and stay on? NO Replug the BPD and continue with “Step 1525-3” on page 204. YES Replace the BPD that is currently unplugged. Follow all steps in the removal and replacement procedures for “Bulk Power Distributor (BPD)” on page 847. Go to “Step 1525-6.”Step 1525-6 Is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES If the error code that directed you to this map was 1011 8710 or 1011 8711, then return to the “SPCN Error Codes” on page 420 table and take the next action defined in the table for these error codes. If the error code that directed you to this map was neither 1011 8710 nor 1011 8711, then call support.Step 1525-7 Inform the customer that the line voltage into the BPA is missing or is too low and must be checked. Can the line voltage be checked now? NO Defer this repair until the customer’s utility can check the line voltage. YES Go to “Step 1525-8.”Step 1525-8 Does the customer utility voltage checkout OK? NO The customer must restore utility voltage. YES Exchange the power cord and go to “Step 1525-9.”Step 1525-9 Press the service complete button. Is the error code that sent you to this MAP reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Call for support. Chapter 3. Maintenance Analysis Procedures 205 MAP 1526: There Is An Integrated Battery Feature (IBF) Failure Step 1526-1 Is the IBF feature installed? NO Replace the BPR that is connected to the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures in “Bulk Power Regulator (BPR)” on page 844. Go to “Step 1526-7” on page 208. YES Go to “Step 1526-2.”Step 1526-2 Examine the following table to determine the BPR/IBF you are working with, then proceed to Step 1526-3. The remaining steps in this MAP refer to the indicated BPR/IBF: Table 1. Error Code BPR and IBF 101A 7A56 or 101A 7B56 BPR1/IBF1, side A 101A 7A66 or 101A 7B66 BPR2/IBF2, side A 101A 7A76 or 101A 7B76 BPR3/IBF3, side A 101A 7A86 or 101A 7B86 BPR1/IBF1, side A 101A 7A96 or 101A 7B96 BPR2/IBF2, side A 101A 7AA6 or 101A 7BA6 BPR3/IBF3, side A 101B 7A56 or 101B 7B56 BPR1/IBF1, side B 101B 7A66 or 101B 7B66 BPR2/IBF2, side B 101B 7A76 or 101B 7B76 BPR3/IBF3, side B 101B 7A86 or 101B 7B86 BPR1/IBF1, side B 101B 7A96 or 101B 7B96 BPR2/IBF2, side B 101B 7AA6 or 101B 7BA6 BPR3/IBF3, side B Step 1526-3 Check the cable between the BPR and the IBF. The IBF LED on the IBF referenced in the error code that brought you to this MAP will be off. Is the cable plugged on both sides? NO Plug in the cable and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1526-4” on page 207. 206 406/90 Service Guide Step 1526-4 Check the circuit breaker on the IBF referenced in the error code that brought you to this MAP. The IBF LED on the IBF referenced in the error code that brought you to this MAP will be off. Is the circuit breaker set on? NO Replace the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures in “Integrated Battery Feature (IBF)” on page 851. Go to “Step 1526-7” on page 208. YES Exchange the following FRUs one at a time: v The BPR that is connected to the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. v The IBF that is referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Integrated Battery Feature (IBF)” on page 851. v The cable between the BPR and the IBF referenced in the error code that brought you to this MAP. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems YES Replace the next FRU on the list. If all FRUs in the list have been replaced, call for support.Step 1526-5 Check the circuit breaker on the IBF referenced in the error code that brought you to this MAP. The IBF LED on the IBF referenced in the error code that brought you to this MAP will be off. Is the circuit breaker set on? NO Go to “Step 1526-6” on page 208 YES Replace the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures in “Integrated Battery Feature (IBF)” on page 851. Go to “Step 1526-7” on page 208. Chapter 3. Maintenance Analysis Procedures 207 Step 1526-6 Set the breaker to the on (down) position. Wait 10 seconds after the LED on the IBF referenced in the error code that brought you to this MAP comes on. Does the circuit breaker remain on? NO Exchange the following FRUs one at a time: v The BPR that is connected to the IBF referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. v The IBF that is referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Integrated Battery Feature (IBF)” on page 851. v The cable between the BPR and the IBF referenced in the error code that brought you to this MAP. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs in the list have been replaced, call for support.YES Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1526-7 Is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Call for support. 208 406/90 Service Guide MAP 1527: An Airflow Loss Has Been Detected Step 1527-1 Check the following air inlets and outlets for blockage or obstruction: v Front and rear frame door openings (except for acoustical foam) v Bulk power assembly (BPA) cage – Inlet opening in left side of front BPA, BPA A – Outlet opening in right side of rear BPA, BPB B – Bulk power fan (BPF) cover is present on front BPA and absent on rear BPAv Processor subsystem cage – Inlet opening at top front of processor subassembly – Outlet opening at top rear of processor subassemblyv I/O subsystem – Inlet opening at front of the I/O subassembly – Outlet opening at rear of I/O subassemblyAre all of these openings clear and unobstructed? NO Remove any obstructions and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1527-2.”Step 1527-2 Do all book positions in the processor subsystem contain books or fillers? NO Fill any open positions with books or fillers, and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1527-3.”Step 1527-3 Are the air filters clean and free of any obstructions? NO Clean and replace the air filters, then go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1527-4.”Step 1527-4 Remove the blowers or fans from the affected subsystem. Attention: Do not put your hand into the enclosure. Push the spring-loaded door open and hold it open with one hand. Using a suitably long instrument (a 12-inch ruler or long-handled screwdriver), push gently against the recirculation flaps at the back of each blower or fan enclosure. Are all the recirculation flaps in all enclosures free to swing up? NO The enclosure is not a replaceable FRU. Call your next level of support. YES Reinstall the blower or fans removed earlier and go to “Step 1527-5” on page 210 ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 209 Step 1527-5 Call your next level of support. 210 406/90 Service Guide MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault Step 1528-1 Is the room ambient temperature in normal range (less than 35 degrees C/ 95 degrees F)? NO Notify the customer. The customer must bring the room temperature within normal range. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1528-2.”Step 1528-2 Are the system front and rear doors free of obstructions? Are the system air filters clean and free from obstructions? NO Notify the customer. The system must be free of obstructions for proper airflow. Clean or replace system air filters as required. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Exchange the following FRUs one at a time: 1. The processor MCM module 0, at location U1.18-P1-C1. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. 2. The processor MCM module 3, at location U1.18-P1-C2, if present. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. 3. The processor MCM module 1, at location U1.18-P1-C3. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. 4. The processor MCM module 2, at location U1.18-P1-C4. Follow all steps in the removal and replacement procedures for “MCM Module (Processor) and Pass Through Modules” on page 863. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support. Chapter 3. Maintenance Analysis Procedures 211 MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure Step 1529-1 Check the position of the on/off switch on the BPRs. Is the switch in the ON position (to the left)? NO Set the switch to the ON position (to the left), and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1529-2.”Step 1529-2 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimeter to measure the ac voltage at the following test points on the face plate of the BPA that is not referenced by the error code that sent you to this MAP: v Phase A and phase B v Phase B and phase C v Phase C and phase A Are all the readings greater than 180 V ac? NO Go to “Step 1529-3” on page 213. YES Exchange the following FRUs one at a time: 1. BPR 1, that is connected to the IBF not referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 2. BPR 2, that is connected to the IBF not referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 3. BPR 3, that is connected to the IBF not referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 4. BPC on the other side not referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. 5. BPE on the other side not referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849. After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support. 212 406/90 Service Guide Step 1529-3 Inform the customer that the line voltage into the BPA is missing or too low, and needs to be checked. Can the line voltage be checked now? NO Defer this repair until the customer’s utility can check the line voltage. This ends the procedure. YES Go to “Step 1529-4.”Step 1529-4 Does the customer utility voltage check out OK? NO The customer must restore utility voltage. This ends the procedure. YES Exchange the power cord, and go to “Step 1529-5.”Step 1529-5 Press the service complete button. Is error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 213 MAP 152a: Loss of ac Power or Phase Missing Step 152a-1 Press the Service complete button. Is the error code that sent you to this MAP still reported? NO This was a transient condition. The condition no longer exists. No service action is required. YES Go to “Step 152a-2.”Step 152a-2 Check all LEDs on the BPA that is not referenced by the error code that sent you to this MAP. Are they OK? NO Defer this repair until the failure on the other BPA can be repaired. This ends the procedure. YES Go to “Step 152a-3.”Step 152a-3 Note: Read the Danger and Caution notices under “Safety Notices” on page vii before continuing with this procedure. 1. Prepare a multimeter to measure up to 600 V ac. 2. Use the multimeter to measure the ac voltage at the following test points on the faceplate of the BPA that is referenced by the error code that sent you to this MAP: v Phase A and phase B v Phase B and phase C v Phase C and phase A Are all the readings greater than 180 V ac? NO Go to “Step 152a-4” on page 215. YES Exchange the following FRUs one at a time: 1. BPR 1, that is connected to the BPA side referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 2. BPR 2, that is connected to the BPA side referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 3. BPR 3, that is connected to the BPA side referenced in the error code that brought you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Regulator (BPR)” on page 844. 4. BPC on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Controller (BPC)” on page 845. 5. BPE on the side referenced by the error code that sent you to this MAP. Follow all steps in the removal and replacement procedures for “Bulk Power Enclosure (BPE)” on page 849. 214 406/90 Service Guide After each FRU is exchanged, is the error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES Replace the next FRU on the list. If all FRUs on the list have been replaced, call for support.Step 152a-4 Inform the customer that the line voltage into the BPA is missing or too low, and must be checked. Can the line voltage be checked now? NO Defer this repair until the customer’s utility can check the line voltage. This ends the procedure. YES Go to “Step 152a-5.”Step 152a-5 Does the customer utility voltage check out OK? NO The customer must restore utility voltage. This ends the procedure. YES Exchange the power cord, and go to “Step 152a-6.”Step 152a-6 Press the service complete button. Is error code that sent you to this MAP still reported? NO The problem has been corrected. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. This ends the procedure. YES Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 215 MAP 152b: 2.5 V Current/Voltage Problem in Processor Subsystem Step 152b-1 Record the error code and location code(s) that sent you to this MAP. Step 152b-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-4.” No Call for support.Step 152b-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152b-5 Turn on the power. Step 152b-6 Did the system stop with the same error code as recorded in “Step 152b-1”? Yes Go to “Step 152b-12” on page 217. No Go to “Step 152b-7.”Step 152b-7 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off.Step 152b-8 Reinsert one of the books that is pulled out and listed in step 152b-4. Step 152b-9 Turn on the power. 216 406/90 Service Guide Step 152b-10 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-11.”Step 152b-11 Have all the books listed in step 152b-4 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-8” on page 216.Step 152b-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-14.” No Call for support.Step 152b-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152b-15 Turn on the power. Step 152b-16 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-22” on page 219. No Go to “Step 152b-17.”Step 152b-17 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Chapter 3. Maintenance Analysis Procedures 217 Step 152b-18 Reinsert one of the books that is pulled out and listed in step “Step 152b-14” on page 217. 218 406/90 Service Guide Step 152b-19 Turn on the power. Step 152b-20 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-21.”Step 152b-21 Have all the books listed in “Step 152b-14” on page 217 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-18” on page 218.Step 152b-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-24.” No Call for support.Step 152b-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1 v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7 Chapter 3. Maintenance Analysis Procedures 219 Step 152b-25 Turn on the power. Step 152b-26 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-32.” No Go to “Step 152b-27.”Step 152b-27 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off.Step 152b-28 Reinsert one of the books that is pulled out and listed in “Step 152b-24” on page 219. Step 152b-29 Turn on the power. Step 152b-30 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-31.”Step 152b-31 Have all the books listed in “Step 152b-24” on page 219 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152b-28.”Step 152b-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-34” on page 221. No Call for support. 220 406/90 Service Guide Step 152b-34 Replace the books in the following list, if present, one at a time and in the order listed: v Primary I/O Book, at location U1.18-P1-H2 v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152b-35 Turn on the power. Step 152b-36 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-37.” No The book just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152b-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152b-38 Remove the new book that was just installed in “Step 152b-34” and reinstall the original book. Step 152b-39 Have all the books listed in “Step 152b-34” been replaced with new or original books? Yes Go to “Step 152b-40.” No Go to “Step 152b-34.”Step 152b-40 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152b-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152b-42” on page 222. No Call for support. Chapter 3. Maintenance Analysis Procedures 221 Step 152b-42 Attention: Before replacing the system backplane, call for support. Replace the system backplane at location U1.18-P1. Step 152b-43 Turn on the power. Step 152b-44 Did the system stop with the same error code as recorded in “Step 152b-1” on page 216? Yes Go to “Step 152b-45.” No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152b-45 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152b-46 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152b-42.” 222 406/90 Service Guide MAP 152c: 1.8 V Current/Voltage Problem in Processor Subsystem Step 152c-1 For future reference, record the error code in the operator panel display. Step 152c-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-4.” No Call for support.Step 152c-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152c-5 Turn on the power. Step 152c-6 Did the system stop with the same error code as recorded in “Step 152c-1”? Yes Go to “Step 152c-12” on page 224. No Go to “Step 152c-7.”Step 152c-7 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off.Step 152c-8 Reinsert one of the books that is pulled out and listed in “Step 152c-4.” Step 152c-9 Turn on the power. Chapter 3. Maintenance Analysis Procedures 223 Step 152c-10 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-11.”Step 152c-11 Have all the books listed in “Step 152c-4” on page 223 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-8” on page 223.Step 152c-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-14.” No Call for support.Step 152c-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152c-15 Turn on the power. Step 152c-16 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-22” on page 225. No Go to “Step 152c-17.”Step 152c-17 One of the books just removed is defective. 1. Turn off the power. 2. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. 224 406/90 Service Guide Step 152c-18 Reinsert one of the books that is pulled out and listed in “Step 152c-14” on page 224. Step 152c-19 Turn on the power. Step 152c-20 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-21.”Step 152c-21 Have all the books listed in “Step 152c-14” on page 224 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-18.”Step 152c-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-24.” No Call for support.Step 152c-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1 v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7 Chapter 3. Maintenance Analysis Procedures 225 Step 152c-25 Turn on the power. Step 152c-26 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-32.” No Go to “Step 152c-27.”Step 152c-27 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-28 Reinsert one of the books that is pulled out and listed in “Step 152c-24” on page 225. Step 152c-29 Turn on the power. Step 152c-30 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-31.”Step 152c-31 Have all the books listed in “Step 152c-24” on page 225 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-28.”Step 152c-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. 226 406/90 Service Guide Step 152c-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-34.” No Call for support.Step 152c-34 Replace the books in the following list, if present, one at a time and in the order listed: v Primary I/O book, at location U1.18-P1-H2 v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152c-35 Turn on the power. Step 152c-36 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152c-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-38 Remove the new book that was just installed in “Step 152c-34” and reinstall the original book. Step 152c-39 Have all the books listed in “Step 152c-34” been replaced with new or original books? Yes Go to “Step 152c-40.” No Go to “Step 152c-34.”Step 152c-40 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter 3. Maintenance Analysis Procedures 227 Step 152c-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-42.” No Call for support.Step 152c-42 Record the location of each installed L3 module. Attention: L3 modules have a limit of three plug cycles. Before removing any of the L3 modules in the following list, call for support. Remove all of the following modules, if present: v L3 module, at location U1.18-P1-C5 v L3 module, at location U1.18-P1-C6 v L3 module, at location U1.18-P1-C7 v L3 module, at location U1.18-P1-C8 v L3 module, at location U1.18-P1-C9 v L3 module, at location U1.18-P1-C10 v L3 module, at location U1.18-P1-C11 v L3 module, at location U1.18-P1-C12 v L3 module, at location U1.18-P1-C13 v L3 module, at location U1.18-P1-C14 v L3 module, at location U1.18-P1-C15 v L3 module, at location U1.18-P1-C16 v L3 module, at location U1.18-P1-C17 v L3 module, at location U1.18-P1-C18 v L3 module, at location U1.18-P1-C19 v L3 module, at location U1.18-P1-C20 v System clock, at location U1.18-P1-X5Step 152c-43 Turn on the power. Step 152c-44 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-50” on page 229. No Go to “Step 152c-45.”Step 152c-45 One of the modules just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-46 Reinstall one of the modules that is removed and listed in “Step 152c-42.” 228 406/90 Service Guide Step 152c-47 Turn on the power. Step 152c-48 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes The module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-49.”Step 152c-49 Have all the modules listed in “Step 152c-42” on page 228 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152c-45” on page 228.Step 152c-50 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152c-51 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152c-52.” No Call for support.Step 152c-52 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152c-53 Turn on the power. Step 152c-54 Did the system stop with the same error code as recorded in “Step 152c-1” on page 223? Yes Go to “Step 152c-55” on page 230. No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 229 Step 152c-55 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152c-56 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152c-52” on page 229. 230 406/90 Service Guide MAP 152d: 1.5 V Auxiliary Current/Voltage Problem in Processor Subsystem Step 152d-1 Record the error code and the location code(s) that sent you to this MAP. Step 152d-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-4.” No Call for support.Step 152d-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152d-5 Turn on the power. Step 152d-6 Did the system stop with the same error code as recorded in “Step 152d-1”? Yes Go to “Step 152d-12” on page 232. No Go to “Step 152d-7.”Step 152d-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Chapter 3. Maintenance Analysis Procedures 231 Step 152d-8 Reinsert one of the books that is pulled out and listed in “Step 152d-4” on page 231. Step 152d-9 Turn on the power. Step 152d-10 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-11.”Step 152d-11 Have all the books listed in “Step 152d-4” on page 231 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-8.”Step 152d-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-14.” No Call for support.Step 152d-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1 232 406/90 Service Guide Step 152d-15 Turn on the power. Step 152d-16 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-22” on page 234. No Go to “Step 152d-17.”Step 152d-17 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-18 Reinsert one of the books that is pulled out and listed in “Step 152d-14” on page 232. Step 152d-19 Turn on the power. Step 152d-20 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-21.”Step 152d-21 Have all the books listed in “Step 152d-14” on page 232 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-18.” Chapter 3. Maintenance Analysis Procedures 233 Step 152d-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-24.” No Call for support.Step 152d-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1 v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7Step 152d-25 Turn on the power. Step 152d-26 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-32” on page 235. No Go to “Step 152d-27.”Step 152d-27 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-28 Reinsert one of the books that is pulled out and listed in “Step 152d-24.” Step 152d-29 Turn on the power. 234 406/90 Service Guide Step 152d-30 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-31.”Step 152d-31 Have all the books listed in “Step 152d-24” on page 234 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-28” on page 234.Step 152d-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-34.” No Call for support.Step 152d-34 Replace the books in the following list, if present, one at a time and in the order listed: v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152d-35 Turn on the power. Chapter 3. Maintenance Analysis Procedures 235 Step 152d-36 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152d-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-38 Remove the new book that was just installed in “Step 152d-34” on page 235 and reinstall the original book. Step 152d-39 Have all the books listed in “Step 152d-34” on page 235 been replaced with new or original books? Yes Go to “Step 152d-40.” No Go to “Step 152d-34” on page 235.Step 152d-40 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-42” on page 237. No Call for support. 236 406/90 Service Guide Step 152d-42 Record the location of each installed L3 module. Attention: L3 modules have a limit of three plug cycles. Before removing any of the L3 modules in the following list, call for support. Remove all of the following modules if present: v L3 module, at location U1.18-P1-C5 v L3 module, at location U1.18-P1-C6 v L3 module, at location U1.18-P1-C7 v L3 module, at location U1.18-P1-C8 v L3 module, at location U1.18-P1-C9 v L3 module, at location U1.18-P1-C10 v L3 module, at location U1.18-P1-C11 v L3 module, at location U1.18-P1-C12 v L3 module, at location U1.18-P1-C13 v L3 module, at location U1.18-P1-C14 v L3 module, at location U1.18-P1-C15 v L3 module, at location U1.18-P1-C16 v L3 module, at location U1.18-P1-C17 v L3 module, at location U1.18-P1-C18 v L3 module, at location U1.18-P1-C19 v L3 module, at location U1.18-P1-C20Step 152d-43 Turn on the power. Step 152d-44 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-50” on page 238. No Go to “Step 152d-45.”Step 152d-45 One of the modules just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-46 Reinstall one of the L3 modules that is removed and listed in “Step 152d-42.” Step 152d-47 Turn on the power. Chapter 3. Maintenance Analysis Procedures 237 Step 152d-48 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes The module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-49.”Step 152d-49 Have all the modules listed in “Step 152d-42” on page 237 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152d-45” on page 237.Step 152d-50 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152d-51 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152d-52.” No Call for support.Step 152d-52 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152d-53 Turn on the power. Step 152d-54 Did the system stop with the same error code as recorded in “Step 152d-1” on page 231? Yes Go to “Step 152d-55” on page 239. No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 238 406/90 Service Guide Step 152d-55 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152d-56 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152d-52” on page 238. Chapter 3. Maintenance Analysis Procedures 239 MAP 152e: 1.5 V CPU Current/Voltage Problem in Processor Subsystem Step 152e-1 Record the error code and the location code(s) that sent you to this MAP. Step 152e-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-4.” No Call for support.Step 152e-4 Pull out the following books about 25 mm (1 inch), if present: v Capacitor book, at location U1.18-P1-V3 v Capacitor book, at location U1.18-P1-V7Step 152e-5 Turn on the power. Step 152e-6 Did the system stop with the same error code as recorded in “Step 152e-1”? Yes Go to “Step 152e-12” on page 241. No Go to “Step 152e-7.”Step 152e-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-8 Reinsert one of the books that is pulled out and listed in “Step 152e-4.” Step 152e-9 Turn on the power. 240 406/90 Service Guide Step 152e-10 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-11.”Step 152e-11 Have all the books listed in “Step 152e-4” on page 240 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-8” on page 240.Step 152e-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-14.” No Call for support.Step 152e-14 Replace the books in the following list, if present, one at a time and in the order listed: v DCA 1, at location U1.18-P1-V1 v DCA 2, at location U1.18-P1-V2 v DCA 3, at location U1.18-P1-V4 v DCA 4, at location U1.18-P1-V5 v DCA 5, at location U1.18-P1-V6 v DCA 6, at location U1.18-P1-V8Step 152e-15 Turn on the power. Chapter 3. Maintenance Analysis Procedures 241 Step 152e-16 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes Go to “Step 152e-17.” No The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152e-17 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-18 Remove the new book that was just installed in “Step 152e-14” on page 241 and reinstall the original book. Step 152e-19 Have all the books listed in “Step 152e-14” on page 241 been replaced with new or original books? Yes Go to “Step 152e-20.” No Go to “Step 152e-14” on page 241.Step 152e-20 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-21 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-22.” No Call for support.Step 152e-22 Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. Record the location of each installed MCM module. Attention: MCM modules have a limit of three plug cycles. Before replacing any of the MCMs in the following list, call for support. Record the original location on each MCM. Remove the following MCM modules, if present: v MCM module 0, at location U1.18-P1-C1 v MCM module 3, at location U1.18-P1-C2 v MCM module 1, at location U1.18-P1-C3 v MCM module 2, at location U1.18-P1-C4 242 406/90 Service Guide Step 152e-23 Turn on the power. Step 152e-24 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes Go to “Step 152e-30” on page 244. No Go to “Step 152e-25.”Step 152e-25 One of the modules just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-26 Attention: Before you remove or replace any MCM or L3 module, stop, read, and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. Reinstall one of the MCM modules that is removed and listed in “Step 152e-22” on page 242. Step 152e-27 Turn on the power. Step 152e-28 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes The module just reinserted was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-29.”Step 152e-29 Have all the modules listed in “Step 152e-22” on page 242 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152e-25.” Chapter 3. Maintenance Analysis Procedures 243 Step 152e-30 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152e-31 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152e-32.” No Call for support.Step 152e-32 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152e-33 Turn on the power. Step 152e-34 Did the system stop with the same error code as recorded in “Step 152e-1” on page 240? Yes Go to “Step 152e-35.” No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152e-35 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152e-36 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152e-32.” 244 406/90 Service Guide MAP 152f: 3.3 V Current/Voltage Problem in Processor Subsystem Step 152f-1 Record the error code and the location code(s) that sent you to this MAP. Step 152f-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-4.” No Call for support.Step 152f-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152f-5 Turn on the power. Step 152f-6 Did the system stop with the same error code as recorded in “Step 152f-1”? Yes Go to “Step 152f-12” on page 246. No Go to “Step 152f-7.”Step 152f-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-8 Reinsert one of the books that is pulled out and listed in “Step 152f-4.” Chapter 3. Maintenance Analysis Procedures 245 Step 152f-9 Turn on the power. Step 152f-10 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-11.”Step 152f-11 Have all the books listed in “Step 152f-4” on page 245 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-8” on page 245.Step 152f-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-14.” No Call for support.Step 152f-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152f-15 Turn on the power. 246 406/90 Service Guide Step 152f-16 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-22.” No Go to “Step 152f-17.”Step 152f-17 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-18 Reinsert one of the books that is pulled out and listed in “Step 152f-14” on page 246. Step 152f-19 Turn on the power. Step 152f-20 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-21.”Step 152f-21 Have all the books listed in “Step 152f-14” on page 246 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-18.”Step 152f-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter 3. Maintenance Analysis Procedures 247 Step 152f-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-24.” No Call for support.Step 152f-24 Pull out the following books about 25 mm (1 inch), if present: v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Fourth I/O book, at location U1.18-P1-H1Step 152f-25 Turn on the power. Step 152f-26 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-32” on page 249. No Go to “Step 152f-27.”Step 152f-27 The book just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-28 Reinsert the book that is pulled out and listed in “Step 152f-24.” Step 152f-29 Turn on the power. Step 152f-30 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-31” on page 249. 248 406/90 Service Guide Step 152f-31 Have all the books listed in “Step 152f-24” on page 248 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-28” on page 248.Step 152f-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-34.” No Call for support.Step 152f-34 Replace the books in the following list, if present, one at a time and in the order listed: v Primary I/O book, at location U1.18-P1-H2 v DCA 1-RH, at location U1.18-P1-V1 v DCA 2-RH, at location U1.18-P1-V2 v DCA 3-RH, at location U1.18-P1-V4 v DCA 4-RH, at location U1.18-P1-V5 v DCA 5-RH, at location U1.18-P1-V6 v DCA 6-RH, at location U1.18-P1-V8Step 152f-35 Turn on the power. Step 152f-36 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152f-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Chapter 3. Maintenance Analysis Procedures 249 Step 152f-38 Remove the new book that was just installed in “Step 152f-34” on page 249 and reinstall the original book. Step 152f-39 Have all the books listed in “Step 152f-34” on page 249 been replaced with new or original books? Yes Go to “Step 152f-40.” No Go to “Step 152f-34” on page 249.Step 152f-40 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-42.” No Call for support.Step 152f-42 Remove the system clock card, at location U1.18-P1-X5. Step 152f-43 Turn on the power. Step 152f-44 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-50” on page 251. No Go to “Step 152f-45.”Step 152f-45 The system clock card is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-46 Reinstall the system clock card that you removed in “Step 152f-42.” 250 406/90 Service Guide Step 152f-47 Turn on the power. Step 152f-48 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes The system clock card just reinserted is defective, replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-49.”Step 152f-49 Have all the modules listed in “Step 152f-42” on page 250 been reinstalled? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152f-45” on page 250.Step 152f-50 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152f-51 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152f-52.” No Call for support.Step 152f-52 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152f-53 Turn on the power. Step 152f-54 Did the system stop with the same error code as recorded in “Step 152f-1” on page 245? Yes Go to “Step 152f-55” on page 252. No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 251 Step 152f-55 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152f-56 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152f-52” on page 251. 252 406/90 Service Guide MAP 152g: 5.0 V Current/Voltage Problem in Processor Subsystem Step 152g-1 For future reference, record the error code and the location code(s) that sent you to this MAP. Step 152g-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-4.” No Call for support.Step 152g-4 Pull out the following books about 25 mm (1 inch), if present: v Memory book 0, at location U1.18-P1-M2 v Memory book 1, at location U1.18-P1-M3 v Memory book 2, at location U1.18-P1-M7 v Memory book 3, at location U1.18-P1-M6Step 152g-5 Turn on the power. Step 152g-6 Did the system stop with the same error code as recorded in “Step 152g-1”? Yes Go to “Step 152g-12” on page 254. No Go to “Step 152g-7.”Step 152g-7 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-8 Reinsert one of the books that is pulled out and listed in “Step 152g-4.” Step 152g-9 Turn on the power. Chapter 3. Maintenance Analysis Procedures 253 Step 152g-10 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-11.”Step 152g-11 Have all the books listed in “Step 152g-4” on page 253 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-8” on page 253.Step 152g-12 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-13 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-14.” No Call for support.Step 152g-14 Pull out the following books about 25 mm (1 inch), if present: v Memory book 4, at location U1.18-P1-M4 v Memory book 5, at location U1.18-P1-M8 v Memory book 6, at location U1.18-P1-M5 v Memory book 7, at location U1.18-P1-M1Step 152g-15 Turn on the power. 254 406/90 Service Guide Step 152g-16 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-22.” No Go to “Step 152g-17.”Step 152g-17 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-18 Reinsert one of the books that is pulled out and listed in “Step 152g-14” on page 254. Step 152g-19 Turn on the power. Step 152g-20 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-21.”Step 152g-21 Have all the books listed in “Step 152g-14” on page 254 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-18.”Step 152g-22 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter 3. Maintenance Analysis Procedures 255 Step 152g-23 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-24.” No Call for support.Step 152g-24 Pull out the following books about 25 mm (1 inch), if present: v Fourth I/O book, at location U1.18-P1-H1 v Secondary I/O book, at location U1.18-P1-H3 v Third I/O book, at location U1.18-P1-H4 v Cap card at U1.18-P1-V3 v Cap card at U1.18-P1-V7Step 152g-25 Turn on the power. Step 152g-26 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-32” on page 257. No Go to “Step 152g-27.”Step 152g-27 One of the books just removed is defective. Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-28 Reinsert one of the books that is pulled out and listed in “Step 152g-24.” Step 152g-29 Turn on the power. Step 152g-30 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes The book just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-31” on page 257. 256 406/90 Service Guide Step 152g-31 Have all the books listed in “Step 152g-24” on page 256 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152g-28” on page 256.Step 152g-32 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-33 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-34.” No Call for support.Step 152g-34 Replace the book in the following list, if present, one at a time and in the order listed: v Primary I/O book, at location U1.18-P1-H2 v DCA 1-RH at U1.18-P1-V1 v DCA 2-RH at U1.18-P1-V2Step 152g-35 Turn on the power. Step 152g-36 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-37.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152g-37 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Chapter 3. Maintenance Analysis Procedures 257 Step 152g-38 Remove the new book that was just installed in “Step 152g-34” on page 257 and reinstall the original book. Step 152g-39 Have all the books listed in “Step 152g-34” on page 257 been replaced with new or original books? Yes Go to “Step 152g-40.” No Go to “Step 152g-34” on page 257.Step 152g-40 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 152g-41 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 152g-42.” No Call for support.Step 152g-42 Attention: Before replacing the system backplane, call for support. Replace the system backplane, at location U1.18-P1. Step 152g-43 Turn on the power. Step 152g-44 Did the system stop with the same error code as recorded in “Step 152g-1” on page 253? Yes Go to “Step 152g-45.” No The system backplane was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152g-45 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 152g-46 Has the system backplane been replaced? Yes Call for support. No Go to “Step 152g-42.” 258 406/90 Service Guide MAP 152h: 3.3V Current/Voltage Problem in I/O Subsystem Step 152h-1 Record the error code and the location code(s) that sent you to this MAP. Step 152h-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152h-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-4 Are all green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-5.” No Call for support.Step 152h-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152h-6 Turn on the power. Step 152h-7 Did the system stop with the same error code as recorded in “Step 152h-1”? Yes Go to “Step 152h-13” on page 260. No Go to “Step 152h-8” on page 260. Chapter 3. Maintenance Analysis Procedures 259 Step 152h-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-9 Reinsert one of the cards that is removed and listed in “Step 152h-5” on page 259. Step 152h-10 Turn on the power. Step 152h-11 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-12.”Step 152h-12 Have all the cards listed in “Step 152h-5” on page 259 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-9.”Step 152h-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-15” on page 261. No Call for support. 260 406/90 Service Guide Step 152h-15 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152h-16 Turn on the power. Step 152h-17 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-23” on page 262. No Go to “Step 152h-18.”Step 152h-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-19 Reinsert one of the cards that is removed and listed in “Step 152h-15.” Step 152h-20 Turn on the power. Step 152h-21 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-22” on page 262. Chapter 3. Maintenance Analysis Procedures 261 Step 152h-22 Have all the cards listed in “Step 152h-15” on page 261 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-19” on page 261.Step 152h-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-25.” No Call for support.Step 152h-25 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152h-26 Turn on the power. Step 152h-27 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-33” on page 263. No Go to “Step 152h-28.”Step 152h-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-29 Reinsert one of the cards that is removed and listed in “Step 152h-25.” 262 406/90 Service Guide Step 152h-30 Turn on the power. Step 152h-31 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-32.”Step 152h-32 Have all the cards listed in “Step 152h-25” on page 262 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-29” on page 262.Step 152h-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152h-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-35.” No Call for support.Step 152h-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10 Chapter 3. Maintenance Analysis Procedures 263 Step 152h-36 Turn on the power. Step 152h-37 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-43.” No Go to “Step 152h-38.”Step 152h-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-39 Reinsert one of the cards that is removed and listed in “Step 152h-35” on page 263. Step 152h-40 Turn on the power. Step 152h-41 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-42.”Step 152h-42 Have all the cards listed in “Step 152h-35” on page 263 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152h-39.”Step 152h-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 264 406/90 Service Guide Step 152h-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-45.” No Call for support.Step 152h-45 Replace the DCAs in the following list, one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152h-46 Turn on the power. Step 152h-47 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-48.” No The DCA just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152h-48 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-49 Remove the new DCA that was just installed in “Step 152h-45” and reinstall the original DCA. Step 152h-50 Have all the DCAs listed in “Step 152h-45” been replaced with new or original cards? Yes Go to “Step 152h-51” No Go to “Step 152h-45.”Step 152h-51 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Chapter 3. Maintenance Analysis Procedures 265 Step 152h-52 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152h-53.” No Call for support.Step 152h-53 Replace the parts in the following list, one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152h-54 Turn on the power. Step 152h-55 Did the system stop with the same error code as recorded in “Step 152h-1” on page 259? Yes Go to “Step 152h-56.” No The part just replaced is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152h-56 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152h-57 Have all the cards listed in “Step 152h-53” been replaced with new or original cards? Yes Call for support. No Go to “Step 152h-53.” 266 406/90 Service Guide MAP 152i: 5.0 V Current/Voltage Problem in I/O Subsystem Step 152i-1 Record the error code and location code(s) that sent you to this MAP. Step 152i-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152i-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-5.” No Call for support.Step 152i-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152i-6 Turn on the power. Step 152i-7 Did the system stop with the same error code as recorded in “Step 152i-1”? Yes Go to “Step 152i-13” on page 268. No Go to “Step 152i-8” on page 268. Chapter 3. Maintenance Analysis Procedures 267 Step 152i-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-9 Reinsert one of the cards that is removed and listed in “Step 152i-5” on page 267. Step 152i-10 Turn on the power. Step 152i-11 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-12.”Step 152i-12 Have all the cards listed in “Step 152i-5” on page 267 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-9.”Step 152i-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-15” on page 269. No Call for support. 268 406/90 Service Guide Step 152i-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152i-16 Turn on the power. Step 152i-17 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-23” on page 270. No Go to “Step 152i-18.”Step 152i-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-19 Reinsert one of the cards that is removed and listed in “Step 152i-15.” Step 152i-20 Turn on the power. Step 152i-21 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-22” on page 270. Chapter 3. Maintenance Analysis Procedures 269 Step 152i-22 Have all the cards listed in “Step 152i-15” on page 269 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-19” on page 269.Step 152i-23 Turn off the power. Examine all the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-25.” No Call for support.Step 152i-25 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152i-26 Turn on the power. Step 152i-27 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-33” on page 271. No Go to “Step 152i-28” on page 271. 270 406/90 Service Guide Step 152i-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-29 Reinsert one of the cards that is removed and listed in “Step 152i-25” on page 270. Step 152i-30 Turn on the power. Step 152i-31 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-32.”Step 152i-32 Have all the cards listed in “Step 152i-25” on page 270 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-29.”Step 152i-33 Turn off the power. Examine all the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-35.” No Call for support.Step 152i-35 Remove the following adapter cards, if present, from the I/O subsystem. record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10 Chapter 3. Maintenance Analysis Procedures 271 Step 152i-36 Turn on the power. Step 152i-37 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-43.” No Go to “Step 152i-38.”Step 152i-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-39 Reinsert one of the cards that is removed and listed in “Step 152i-35” on page 271. Step 152i-40 Turn on the power. Step 152i-41 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-42.”Step 152i-42 Have all the cards listed in “Step 152i-35” on page 271 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-39.”Step 152i-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 272 406/90 Service Guide Step 152i-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-45.” No Call for support.Step 152i-45 Remove the following DASD 4-packs, if present, from the I/O subsystem. Record the DASD locations: v DASD 4-pack at P3 v DASD 4-pack at P4 v DASD 4-pack at P5 v DASD 4-pack at P6Step 152i-46 Turn on the power. Step 152i-47 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-53” on page 274. No Go to “Step 152i-48.”Step 152i-48 One of the DASDs just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-49 Reinsert one of the DASD 4-packs that is removed and listed in “Step 152i-45.” Step 152i-50 Turn on the power. Step 152i-51 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes One of the DASD in the DASD 4-pack just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-52” on page 274. Chapter 3. Maintenance Analysis Procedures 273 Step 152i-52 Have all the cards listed in “Step 152i-45” on page 273 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152i-49” on page 273.Step 152i-53 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-54 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-55.” No Call for support.Step 152i-55 Replace the books in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152i-56 Turn on the power. Step 152i-57 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-58.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152i-58 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-59 Remove the new card that was just installed in “Step 152i-55” and reinstall the original card. 274 406/90 Service Guide Step 152i-60 Have all the cards listed in “Step 152i-55” on page 274 been replaced with new or original cards? Yes Go to “Step 152i-61.” No Go to “Step 152i-55” on page 274.Step 152i-61 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152i-62 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152i-63.” No Call for support.Step 152i-63 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152i-64 Turn on the power. Step 152i-65 Did the system stop with the same error code as recorded in “Step 152i-1” on page 267? Yes Go to “Step 152i-66.” No The card just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152i-66 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152i-67 Have all the cards listed in “Step 152i-63” been replaced? Yes Call for support. No Go to “Step 152i-63.” Chapter 3. Maintenance Analysis Procedures 275 MAP 152j: 2.5 V Current/Voltage Problem in I/O Subsystem Step 152j-1 Record the error code and location code(s) that sent you to this MAP. Step 152j-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152j-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-5.” No Call for support.Step 152j-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152j-6 Turn on the power. Step 152j-7 Did the system stop with the same error code as recorded in “Step 152j-1”? Yes Go to “Step 152j-13” on page 277. No Go to “Step 152j-8” on page 277. 276 406/90 Service Guide Step 152j-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-9 Reinsert one of the cards that is removed and listed in “Step 152j-5” on page 276. Step 152j-10 Turn on the power. Step 152j-11 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-12.”Step 152j-12 Have all the cards listed in “Step 152j-5” on page 276 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-9.”Step 152j-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-15” on page 278. No Call for support. Chapter 3. Maintenance Analysis Procedures 277 Step 152j-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152j-16 Turn on the power. Step 152j-17 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-23” on page 279. No Go to “Step 152j-18.”Step 152j-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-19 Reinsert one of the cards that is removed and listed in “Step 152j-15.” Step 152j-20 Turn on the power. Step 152j-21 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it.This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-22” on page 279. 278 406/90 Service Guide Step 152j-22 Have all the cards listed in “Step 152j-15” on page 278 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-19” on page 278.Step 152j-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-25.” No Call for support.Step 152j-25 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152j-26 Turn on the power. Step 152j-27 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-33” on page 280. No Go to “Step 152j-28.”Step 152j-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-29 Reinsert one of the cards that is removed and listed in “Step 152j-25.” Chapter 3. Maintenance Analysis Procedures 279 Step 152j-30 Turn on the power. Step 152j-31 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-32.”Step 152j-32 Have all the cards listed in “Step 152j-25” on page 279 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-29” on page 279.Step 152j-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152j-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-35.” No Call for support.Step 152j-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10Step 152j-36 Turn on the power. 280 406/90 Service Guide Step 152j-37 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-43.” No Go to “Step 152j-38.”Step 152j-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-39 Reinsert one of the cards that is removed and listed in “Step 152j-35” on page 280. Step 152j-40 Turn on the power. Step 152j-41 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-42.”Step 152j-42 Have all the cards listed in “Step 152j-35” on page 280 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152j-39.”Step 152j-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Chapter 3. Maintenance Analysis Procedures 281 Step 152j-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-45.” No Call for support.Step 152j-45 Replace the cards in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152j-46 Turn on the power. Step 152j-47 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-48.” No The card just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152j-48 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-49 Remove the new card that was just installed in “Step 152j-45” and reinstall the original card. Step 152j-50 Have all the cards listed in “Step 152j-45” been replaced with new or original cards? Yes Go to “Step 152j-51” No Go to “Step 152j-45.”Step 152j-51 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 282 406/90 Service Guide Step 152j-52 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152j-53.” No Call for support.Step 152j-53 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152j-54 Turn on the power. Step 152j-55 Did the system stop with the same error code as recorded in “Step 152j-1” on page 276? Yes Go to “Step 152j-56.” No The part just replaced is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152j-56 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152j-57 Have all the cards listed in “Step 152j-53” been replaced with new or original cards? Yes Call for support. No Go to “Step 152j-53.” Chapter 3. Maintenance Analysis Procedures 283 MAP 152k: 12.0 V Current/Voltage Problem in I/O Subsystem Step 152k-1 Record the error code and location code(s) that sent you to this MAP. Step 152k-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152k-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-5.” No Call for support.Step 152k-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152k-6 Turn on the power. Step 152k-7 Did the system stop with the same error code as recorded in “Step 152k-1”? Yes Go to “Step 152k-13” on page 285. No Go to “Step 152k-8” on page 285. 284 406/90 Service Guide Step 152k-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-9 Reinsert one of the cards that is removed and listed in “Step 152k-5” on page 284. Step 152k-10 Turn on the power. Step 152k-11 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-12.”Step 152k-12 Have all the cards listed in “Step 152k-5” on page 284 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-9.”Step 152k-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-15” on page 286. No Call for support. Chapter 3. Maintenance Analysis Procedures 285 Step 152k-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10Step 152k-16 Turn on the power. Step 152k-17 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-23” on page 287. No Go to “Step 152k-18.”Step 152k-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-19 Reinsert one of the cards that is removed and listed in “Step 152k-15.” Step 152k-20 Turn on the power. Step 152k-21 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-22” on page 287. 286 406/90 Service Guide Step 152k-22 Have all the cards listed in “Step 152k-15” on page 286 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-19” on page 286.Step 152k-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-25.” No Call for support.Step 152k-25 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152k-26 Turn on the power. Step 152k-27 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-33” on page 288. No Go to “Step 152k-28.”Step 152k-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-29 Reinsert one of the cards that is removed and listed in “Step 152k-25.” Step 152k-30 Turn on the power. Chapter 3. Maintenance Analysis Procedures 287 Step 152k-31 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-32.”Step 152k-32 Have all the cards listed in “Step 152k-25” on page 287 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-29” on page 287.Step 152k-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-35.” No Call for support.Step 152k-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10Step 152k-36 Turn on the power. Step 152k-37 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-43” on page 289. No Go to “Step 152k-38” on page 289. 288 406/90 Service Guide Step 152k-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-39 Reinsert one of the cards that is removed and listed in “Step 152k-35” on page 288. Step 152k-40 Turn on the power. Step 152k-41 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-42.”Step 152k-42 Have all the cards listed in “Step 152k-35” on page 288 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-39.”Step 152k-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-45.” No Call for support.Step 152k-45 Remove the following DASD 4-packs, if present, from the I/O subsystem. Record the DASD locations: v DASD 4-pack at P3 v DASD 4-pack at P4 v DASD 4-pack at P5 v DASD 4-pack at P6 Chapter 3. Maintenance Analysis Procedures 289 Step 152k-46 Turn on the power. Step 152k-47 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-53.” No Go to “Step 152k-48.”Step 152k-48 One of the DASDs just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-49 Reinsert one of the DASD 4-packs that is removed and listed in “Step 152k-45” on page 289. Step 152k-50 Turn on the power. Step 152k-51 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes One of the DASD in the DASD 4-pack just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-52.”Step 152k-52 Have all the cards listed in “Step 152k-45” on page 289 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152k-49.”Step 152k-53 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. 290 406/90 Service Guide Step 152k-54 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-55.” No Call for support.Step 152k-55 Replace the books in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152k-56 Turn on the power. Step 152k-57 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-58.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152k-58 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-59 Remove the new card that was just installed in “Step 152k-55” and reinstall the original card. Step 152k-60 Have all the cards listed in “Step 152k-55” been replaced with new or original cards? Yes Go to “Step 152k-61.” No Go to “Step 152k-55.”Step 152k-61 Examine the the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152k-62 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152k-63” on page 292. No Call for support. Chapter 3. Maintenance Analysis Procedures 291 Step 152k-63 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152k-64 Turn on the power. Step 152k-65 Did the system stop with the same error code as recorded in “Step 152k-1” on page 284? Yes Go to “Step 152k-66.” No The card just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152k-66 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152k-67 Have all the cards listed in “Step 152k-63” been replaced? Yes Call for support. No Go to “Step 152k-63.” 292 406/90 Service Guide MAP 152l: -12.0V Current/Voltage Problem in I/O Subsystem Step 152l-1 Record the error code and location code(s) that sent you to this MAP. Step 152l-2 Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152l-3 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-4 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-5.” No Call for support.Step 152l-5 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I1 v Adapter card at P1-I2 v Adapter card at P1-I3 v Adapter card at P1-I4 v Adapter card at P1-I5Step 152l-6 Turn on the power. Step 152l-7 Did the system stop with the same error code as recorded in “Step 152l-1”? Yes Go to “Step 152l-13” on page 294. No Go to “Step 152l-8” on page 294. Chapter 3. Maintenance Analysis Procedures 293 Step 152l-8 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-9 Reinsert one of the cards that is removed and listed in “Step 152l-5” on page 293. Step 152l-10 Turn on the power. Step 152l-11 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-12.”Step 152l-12 Have all the cards listed in “Step 152l-5” on page 293 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-9.”Step 152l-13 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-14 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-15.” No Call for support.Step 152l-15 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P1-I6 v Adapter card at P1-I7 v Adapter card at P1-I8 v Adapter card at P1-I9 v Adapter card at P1-I10 294 406/90 Service Guide Step 152l-16 Turn on the power. Step 152l-17 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-23.” No Go to “Step 152l-18.”Step 152l-18 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-19 Reinsert one of the cards that is removed and listed in “Step 152l-15” on page 294. Step 152l-20 Turn on the power. Step 152l-21 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-22.”Step 152l-22 Have all the cards listed in “Step 152l-15” on page 294 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-19.”Step 152l-23 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Chapter 3. Maintenance Analysis Procedures 295 Step 152l-24 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-25.” No Call for support.Step 152l-25 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I1 v Adapter card at P2-I2 v Adapter card at P2-I3 v Adapter card at P2-I4 v Adapter card at P2-I5Step 152l-26 Turn on the power. Step 152l-27 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-33” on page 297. No Go to “Step 152l-28.”Step 152l-28 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-29 Reinsert one of the cards that is removed and listed in “Step 152l-25.” Step 152l-30 Turn on the power. Step 152l-31 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-32” on page 297. 296 406/90 Service Guide Step 152l-32 Have all the cards listed in “Step 152l-25” on page 296 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-29” on page 296.Step 152l-33 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-34 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-35.” No Call for support.Step 152l-35 Remove the following adapter cards, if present, from the I/O subsystem. Record the adapter locations: v Adapter card at P2-I6 v Adapter card at P2-I7 v Adapter card at P2-I8 v Adapter card at P2-I9 v Adapter card at P2-I10Step 152l-36 Turn on the power. Step 152l-37 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-43” on page 298. No Go to “Step 152l-38.”Step 152l-38 One of the cards just removed is defective. Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-39 Reinsert one of the cards that is removed and listed in “Step 152l-35.” Step 152l-40 Turn on the power. Chapter 3. Maintenance Analysis Procedures 297 Step 152l-41 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes The card just reinserted is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-42.”Step 152l-42 Have all the cards listed in “Step 152l-35” on page 297 been reinserted? Yes The symptom has changed. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152l-39” on page 297.Step 152l-43 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-44 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-45.” No Call for support.Step 152l-45 Replace the DCAs in the following list one at a time and in the order listed: v DCA 1 at V1 v DCA 2 at V2Step 152l-46 Turn on the power. Step 152l-47 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-48” on page 299. No The DCA just replaced is defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 298 406/90 Service Guide Step 152l-48 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-49 Remove the new DCA that was just installed in “Step 152l-45” on page 298 and reinstall the original DCA. Step 152l-50 Have all the DCAs listed in “Step 152l-45” on page 298 been replaced with new or original cards? Yes Go to “Step 152l-51” No Go to “Step 152l-45” on page 298.Step 152l-51 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152l-52 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152l-53.” No Call for support.Step 152l-53 Replace the parts in the following list one at a time and in the order listed: v I/O subsystem backplane at P1 v I/O subsystem backplane at P2Step 152l-54 Turn on the power. Step 152l-55 Did the system stop with the same error code as recorded in “Step 152l-1” on page 293? Yes Go to “Step 152l-56.” No The part just replaced is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152l-56 Turn off the power. Make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152l-57 Have all the cards listed in “Step 152l-53” been replaced with new or original cards? Yes Call for support. No Go to “Step 152l-53.” Chapter 3. Maintenance Analysis Procedures 299 MAP 152m: Cable Problem in Power Subsystem Step 152m-1 Record the error code and location code(s) that sent you to this MAP. You should be supplied with one or two pairs of location codes from the action for the error code in the service guide. These location codes are the ends of the cable or cables in question. The fourth nibble of the error code indicates the side of the power subsystem that detected the failure. For example, 101A indicates side A (front), and 101B indicates side B (rear). The following steps apply to the power subsystem side indicated by the error code recorded. For each location code pair: 1. Generate the first FRU location code (from U1.35) to be replaced by taking the location code given for the action of the error code and truncating the connector ID (/Qx). 2. Generate the second FRU location code to be replaced by taking the location code (not from U1.35) given for the action of the error code and truncating the connector ID (/Qx). 3. Generate the third FRU location code to be replaced (the cable) by taking the location code (from U1.35) given for the action of the error code and appending a # (number sign). For example, the service guide for error code 101A x125 lists the location codes U1.35-P1-X3/Q1 and U1.9-V1/Q2. From the error code, you are working on the A (front) side. From the location codes, the FRUs, in order of replacement, (if required) are: a. U1.35-P1-X3 - BPD1 b. U1.9-V1 - DCA1 c. U1.35-P1-X3/Q1# - cable4. If two pairs of location codes (in other words, two cables) were reported by the eror code that sent you to this map, repeat steps 1, 2, and 3 for the other pair of location codes. Assemble the FRU list for both cables as follows: a. Cable from first pair of location codes. b. Cable from second pair of location codes. Refer to “AIX and Physical Location Code Reference Tables” on page 68 and “Power Distribution Cabling” on page 30. Step 152m-2 Press the green start service button on the panel with the UEPO switch. Step 152m-3 From the action column in the ?Error Code to FRU? table entry for the error code recorded in “Step 152m-1,” find the location codes of both ends of the suspect power cable or cables. Step 152m-4 Examine both ends of the power cable or cables, and make sure each end is properly connected and fully seated. Step 152m-5 Is the power cable or cables properly connected and seated fully at both ends? Yes Go to “Step 152m-30” on page 303. No Go to “Step 152m-29” on page 303. 300 406/90 Service Guide Step 152m-6 Have you replaced the first FRU in the list of FRUs recorded in “Step 152m-1” on page 300 in a previous service action? Yes Go to “Step 152m-18” on page 302. No Go to “Step 152m-7.”Step 152m-7 Is the power-on LED of the first FRU generated from the location codes recorded in “Step 152m-1” on page 300 off? Yes Go to “Step 152m-8.” No Call for support.Step 152m-8 Replace the first FRU in the list of FRUs recorded in “Step 152m-1” on page 300 with power-on LED off. Press the white service complete button on the panel with the UEPO switch after replacement. Step 152m-9 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Return the system to its original configuration. Go to “Step 152m-11.” No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-10 Press the white service complete button on the panel with the UEPO switch. Step 152m-11 Schedule non-concurrent service with the customer to replace the second FRU in the list of FRUs recorded in “Step 152m-1” on page 300. This ends the procedure. Step 152m-12 Is the power-on LED of the first FRU generated from the location codes recorded in “Step 152m-1” on page 300 off? Yes Go to “Step 152m-13.” No Call for support.Step 152m-13 Reseat and properly connect the power cable ends, with location codes recorded in “Step 152m-1” on page 300. Step 152m-14 Set the power-on switches of all BPRs on the same side with the power cable that was not connected properly to the off (right) position. Chapter 3. Maintenance Analysis Procedures 301 Step 152m-15 Wait 30 seconds and then turn the power-on switches of the BPRs found in “Step 152m-14” on page 301 to the on (left) position. Step 152m-16 Press the white service complete button on the panel with the UEPO switch after replacement. Step 152m-17 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Go to “Step 152m-2” on page 300. No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-18 Note: The following steps will require the system power to be removed. The operating system must be shut down in all partitions before turning off the power.Turn off the power. Turn off the UEPO switch. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Also examine the green ?power good out? LEDs of all installed I/O subsystem DCAs. Step 152m-19 Are all amber logic-power LEDs of all installed processor subsystem DCAs and all green ?logic power out? LEDs of all installed I/O subsystem DCAs off? Yes Go to “Step 152m-20.” No Call for support.Step 152m-20 Replace the second FRU in the list of FRUs recorded in step “Step 152m-1” on page 300. Step 152m-21 Turn on the UEPO switch. Turn on the power. Step 152m-22 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Go to “Step 152m-23.” No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-23 Go to “Step 152m-24.” Step 152m-24 Turn off the power. Turn off the UEPO switch. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Also examine the green ?power good out? LEDs of all installed I/O subsystem DCAs. 302 406/90 Service Guide Step 152m-25 Are all amber logic-power LEDs of all installed processor subsystem DCAs and all green ?logic power out? LEDs of all installed I/O subsystem DCAs off? Yes Go to “Step 152m-26.” No Call for support.Step 152m-26 Replace the third FRU in the list of FRUs recorded in step 152m-1. Step 152m-27 Turn on the UEPO switch. Turn on the power. Step 152m-28 Does the error code recorded in “Step 152m-1” on page 300 reappear? Yes Call for support. No Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152m-29 Is the error code recorded in “Step 152m-1” on page 300 of the form 101A xxxx or 101B xxxx? Yes Go to “Step 152m-12” on page 301. No All system partitions must be shut down and the system must be powered off to correct the power cabling. Schedule nonconcurrent service with the customer to correct the power cabling. This ends this procedure.Step 152m-30 Is the error code recorded in “Step 152m-1” on page 300 of the form 101A xxxx or 101B xxxx? Yes Go to “Step 152m-6” on page 301. No Verify all power subsystem UPIC connections. Go to “Step 152m-31.”Step 152m-31 Are all power subsystem UPIC connections properly seated and correctly connected? Yes Call for support. No All system partitions must be shut down and the system must be powered off to correct the power cabling. Schedule nonconcurrent service with the customer to correct the power cabling. This ends this procedure. Chapter 3. Maintenance Analysis Procedures 303 MAP 152n: DASD Subsystem Power Problem Step 152n-1 Record the error code and location code(s) that sent you to this MAP. Step 152n-2 Press the green start service button on the panel with the UEPO switch. Determine the I/O subsystem number from the error code (10yy xxxx) and the following list. The remaining steps of this map refer to this I/O subsystem: v If yy=14, I/O subsystem number is 1, location is U1.9 v If yy=15, I/O subsystem number is 2, location is U1.5 v If yy=16, I/O subsystem number is 3, location is U1.1 v If yy=17, I/O subsystem number is 4, location is U1.13 v If yy=21, I/O subsystem number is 5, location is U2.1 v If yy=22, I/O subsystem number is 6, location is U2.5 v If yy=23, I/O subsystem number is 4, location is U2.9 v If yy=24, I/O subsystem number is 7, location is U2.13 v If yy=25, I/O subsystem number is 8, location is U2.19Step 152n-3 Examine the last 2 characters of the error code recorded in “Step 152n-1,” and use the following list to determine which DASD backplane to service. v If the last 2 characters of the error code is 96, then the following steps refer to DASD backplane 1 (P3) of I/O subsystem number found in “Step 152n-2.” v If the last 2 characters of the error code is A6, then the following steps refer to DASD backplane 2 (P4) of I/O subsystem number found in “Step 152n-2.” v If the last 2 characters of the error code is B6, then the following steps refer to DASD backplane 3 (P5) of I/O subsystem number found in “Step 152n-2.” v If the last 2 characters of the error code is C6, then the following steps refer to DASD backplane 4 (P6) of I/O subsystem number found in “Step 152n-2.”Step 152n-4 Turn off the power. Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152n-5 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to “Step 152n-6” on page 305. No Call for support. 304 406/90 Service Guide Step 152n-6 Are there any DASD in positions A8, A9, Aa, or Ab of the DASD backplane found in “Step 152n-2” on page 304 and “Step 152n-3” on page 304? Yes Go to “Step 152n-7.” No Go to “Step 152n-15” on page 306.Step 152n-7 Record the DASD positions in slots A8, A9, Aa, or Ab, then remove any DASD in slots A8, A9, Aa, or Ab. Note: You will be instructed in a later step to reinstall the DASD one at a time. Turn on the power. Step 152n-8 Did the error code recorded in “Step 152n-1” on page 304 reappear? Yes Go to “Step 152n-14.” No One of the DASD just removed is defective. Go to “Step 152n-9.”Step 152n-9 Turn off the power. Make sure all green ?power good out? LEDs of all installed I/O subsystem DCAs are off. Step 152n-10 Reinstall one of the DASD removed in “Step 152n-6.” Step 152n-11 Turn on the power. Step 152n-12 Did the error code recorded in “Step 152n-1” on page 304 reappear? Yes The DASD just reinstalled is defective. Replace it. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. No Go to “Step 152n-13.”Step 152n-13 Have all DASD that were removed in “Step 152n-6” been reinstalled? Yes The problem has changed. This ends the procedure. No Go to “Step 152n-9” to reinstall the next DASD.Step 152n-14 Turn off the power. Make sure all green ?power good out? LEDs of all installed I/O subsystem DCAs are off. Chapter 3. Maintenance Analysis Procedures 305 Step 152n-15 Replace the books in the following list, one at a time and in the listed order: v DCA 1 at V1 v DCA 2 at V2Step 152n-16 Turn on the power. Step 152n-17 Did the system stop with the same error code as recorded in “Step 152n-1” on page 304? Yes Go to “Step 152n-18.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 152n-18 Turn off the power then make sure all the green ?power good out? LEDs of both I/O subsystem DCAs are off. Step 152n-19 Remove the new card that was just installed in “Step 152n-15” and reinstall the original card. Step 152n-20 Have all the cards listed in “Step 152n-15” been replaced with new or original cards?? Yes Go to “Step 152n-21.” No Go to “Step 152n-15.”Step 152n-21 Examine the green ?power good out? LEDs of both I/O subsystem DCAs. Step 152n-22 Are all the green ?power good out? LEDs of both I/O subsystem DCAs off? Yes Go to Step “Step 152n-23.” No Call for support.Step 152n-23 Replace the DASD backplane. Step 152n-24 Turn on the power. 306 406/90 Service Guide Step 152n-25 Did the system stop with the same error code as recorded in “Step 152m-1” on page 300? Yes Call for support. No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 307 MAP 1540: Problem Isolation Procedures These procedures are used to locate problems in the processor subsystem, I/O subsystems, or rack. If a problem is detected, these procedures help you isolate the problem to a failing unit. Find the symptom in the following table; then follow the instructions given in the Action column. Attention: To identify a FRU, these MAPs refer to FRU Identify LEDs. For more information about the FRU Identify LEDs see “FRU Identify LEDs” on page 65. Problem Isolation Procedures Symptom/Reference Code/Checkpoint Action You were sent here by the Diagnostic Information for Multiple Bus Systems book. Go to “MAP 1542: I/O Problem Isolation” on page 318. 406x 00B7 Go to “MAP 154B: Insufficient Hardware Resources Problem Isolation” on page 365. Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. 406x 0EB1, 406x 0EB2, 406x 0EB3, 406x 0EB4, 406x 0EB5, 406x 0EB6, 406x 0EB7, 406x 0EB8, 406x 0EB9, 406x 0EBA, 406x 0EBB, 406x 0EBC Go to “MAP 1549: Attention Problem Isolation” on page 353.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. 406x 0EB0 Go to “MAP 1541: JTAG Problem Isolation” on page 310. 406x 0C03, 4B2x 258A, 4B2x 25C7, 4B2x 25C8, 4B2x 25CF, 4B2x 25D3, 4B2x 25D5, 4B2x 25D7, 4B2x 25D8, 4B2x 25E5, 4B2x 25E94B2x 526B, 4B2x 526C, 4B2x 526D, 4B2x 526E Go to “MAP 1543: MCM Module Problem Isolation” on page 337.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. 450x 302A, 450x 302C, 450x305A, 450x 305C, 450x 305E, 450x 307A, 450x 307B, 450x 307C, 450x 30BD, 450x 30C3 Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. 450x 302B, 450x 302D, 450x 305B 450x 305D, 450x 305F, 450x 308A, 450x 308B, 450x 308C, 450x 30C6, 450x 30CC Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. 308 406/90 Service Guide Problem Isolation Procedures Symptom/Reference Code/Checkpoint Action 406x 0C09 Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 406x 0C0A Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 406x 0001, 406x 0002, 406x 000A, 406x 000B,406x 000C, 406x 000D, B1xx 4643, B1xx 4645, B1xx 4648 Go to “MAP 154A: I2C Bus Problem Isolation” on page 361.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. E101, E102, E10A, E10B, E111, E120, E121, E122, E130, E131, E132, E133, E134, E135, E138, E139, E13A, E149, E14C, E191, E19A, E19B, E19D, E1A0, E1A1, E1A2, E1A3, E1A4, E1A5, E1A6, E1A7, E1A8, E1A9, E1AA, E1AB, E1AC, E1AD, E1AE, E1AF, E1B1, E1C4, E1C5, E1C6, E1D0, E1D3, E1D4, E1DB, E1F0, E900, E901 Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350.Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. Chapter 3. Maintenance Analysis Procedures 309 MAP 1541: JTAG Problem Isolation Step 1541-1 Record the error code and location code(s) that sent you to this MAP. Step 1541-2 Check the following list to find the location code(s) recorded in “Step 1541-1”: v L3 module at U1.18-P1-C5 v L3 module at U1.18-P1-C11 v L3 module at U1.18-P1-C14 v L3 module at U1.18-P1-C20 v L3 module at U1.18-P1-C9 v L3 module at U1.18-P1-C10 v L3 module at U1.18-P1-C6 v L3 module at U1.18-P1-C12 v L3 module at U1.18-P1-C13 v L3 module at U1.18-P1-C7 v L3 module at U1.18-P1-C15 v L3 module at U1.18-P1-C16 v L3 module at U1.18-P1-C17 v L3 module at U1.18-P1-C18 v L3 module at U1.18-P1-C19 v L3 module at U1.18-P1-C8 v MCM module 0 at U1.18-P1-C1 v MCM module 3 at U1.18-P1-C2 v MCM module 1 at U1.18-P1-C3 v MCM module 2 at U1.18-P1-C4 310 406/90 Service Guide Step 1541-3 Is (are) the location code(s) recorded in “Step 1541-1” on page 310 also listed in “Step 1541-2” on page 310? Yes Go to “Step 1541-19” on page 313. No Go to “Step 1541-4.”Step 1541-4 Check the following list to find the location code(s) recorded in “Step 1541-1” on page 310: v Fourth I/O book, at location U1.18-P1-H1 v Primary I/O Book at U1.18-P1-H2 v Secondary I/O Book at U1.18-P1-H3 v Third I/O Book at U1.18-P1-H4 v Memory Book 6 at U1.18-P1-M5 v Memory Book 7 at U1.18-P1-M1Step 1541-5 Is (are) the location code(s) recorded in “Step 1541-1” on page 310 also listed in “Step 1541-4”? Yes Go to “Step 1541-25” on page 314. No Go to “Step 1541-6.”Step 1541-6 Check the following list to find the location code(s) recorded in “Step 1541-1” on page 310: v Memory Book 0 at U1.18-P1-M2 v Memory Book 1 at U1.18-P1-M3 v Memory Book 2 at U1.18-P1-M7 v Memory Book 3 at U1.18-P1-M6 v Memory Book 4 at U1.18-P1-M4 v Memory Book 5 at U1.18-P1-M8Step 1541-7 Is (are) the location code(s) recorded in “Step 1541-1” on page 310 listed in “Step 1541-6”? Yes Go to “Step 1541-31” on page 315. No Go to “Step 1541-8” on page 312. Chapter 3. Maintenance Analysis Procedures 311 Step 1541-8 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-9 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-10.” No Call for support.Step 1541-10 Replace the item recorded in “Step 1541-1” on page 310. Step 1541-11 Turn on the power. Step 1541-12 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-13.” No This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-13 Turn off the power. Step 1541-14 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-15 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-16.” No Call for support.Step 1541-16 Replace the primary I/O book at U1.18-P1-H2. Step 1541-17 Turn on the power. 312 406/90 Service Guide Step 1541-18 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-37” on page 316. No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-19 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-20 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-21.” No Call for support.Step 1541-21 Attention: Before you remove or replace any MCM or L3 module, stop, read and understand the following procedures: “MCM Module (Processor) and Pass Through Modules” on page 863 and “L3 Cache Modules” on page 882. Attention: MCM and L3 modules have a limit of three plug cycles. Before removing any of the modules in the following list, call for support. Replace the following modules or books, if present, one at a time and in the order listed: v First location code item recorded, if any, “Step 1541-1” on page 310. v Second location code item recorded, if any, in “Step 1541-1” on page 310. v Third location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth location code item recorded, if any, in “Step 1541-1” on page 310. v Primary I/O book at U1.18-P1-H2. v L3 module, at location U1.18-P1-C5 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C11 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C14 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C20 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C9 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C10 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C6 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C12 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C13 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C7 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C15 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C16 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C17 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C18 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C19 if not recorded in “Step 1541-1” on page 310. v L3 module, at location U1.18-P1-C8 if not recorded in “Step 1541-1” on page 310. v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1541-1” on page 310. v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1541-1” on page 310. v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1541-1” on page 310. v MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1541-1” on page 310. Chapter 3. Maintenance Analysis Procedures 313 Step 1541-22 Turn on the power. Step 1541-23 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-24.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-24 Have all the MCMs and books listed in “Step 1541-21” on page 313 been replaced? Yes Go to “Step 1541-37” on page 316. No Go to “Step 1541-19” on page 313.Step 1541-25 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-26 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-27.” No Call for support.Step 1541-27 Replace the following books, if present, one at a time: v First location code item recorded, if any in “Step 1541-1” on page 310. v Second location code item recorded, if any, in “Step 1541-1” on page 310. v Third location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth I/O book, at location U1.18-P1-H1 if not recorded in “Step 1541-1” on page 310. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1541-1” on page 310. v Secondary I/O Book at U1.18-P1-H3 if not recorded in “Step 1541-1” on page 310. v Third I/O Book at U1.18-P1-H4 if not recorded in “Step 1541-1” on page 310. v Memory Book 6 at U1.18-P1-M5 if not recorded in “Step 1541-1” on page 310. v Memory Book 7 at U1.18-P1-M1 if not recorded in “Step 1541-1” on page 310.Step 1541-28 Turn on the power. 314 406/90 Service Guide Step 1541-29 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-30.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-30 Have all the books listed in “Step 1541-27” on page 314 been replaced? Yes Go to “Step 1541-37” on page 316. No Go to “Step 1541-25” on page 314.Step 1541-31 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-32 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-33.” No Call for support.Step 1541-33 Replace the following books, if present, one at a time: v First location code item recorded in “Step 1541-1” on page 310. v Second location code item recorded, if any, in “Step 1541-1” on page 310. v Third location code item recorded, if any, in “Step 1541-1” on page 310. v Fourth location code item recorded, if any, in “Step 1541-1” on page 310. v Primary I/O book at U1.18-P1-H2. v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1541-1” on page 310. v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1541-1” on page 310. v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1541-1” on page 310. v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1541-1” on page 310. v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1541-1” on page 310. v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1541-1” on page 310Step 1541-34 Turn on the power. Chapter 3. Maintenance Analysis Procedures 315 Step 1541-35 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-36.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1541-36 Have all the books listed in “Step 1541-33” on page 315 been replaced? Yes Go to “Step 1541-37.” No Go to “Step 1541-31” on page 315.Step 1541-37 Turn off the power. Step 1541-38 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1541-39 Are all the amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1541-40.” No Call for support.Step 1541-40 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1541-41 Turn on the power. Step 1541-42 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1541-43” on page 317. No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 316 406/90 Service Guide Step 1541-43 Turn off the power. Make sure the amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1541-44 Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 317 MAP 1542: I/O Problem Isolation This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. This I/O problem-determination MAP isolates I/O card and I/O subsystem failures. When I/O problem isolation is complete, all cables and cards exhibiting a failure will have been replaced or reseated.Notes: 1. This MAP assumes that a CD-ROM drive is installed in the media subsystem and connected to a SCSI adapter in the first I/O subsystem, and that a Diagnostics CD-ROM is available. 2. If a general-access password or privileged-access password is installed, you are prompted to enter the password before the diagnostic CD-ROM can load. 3. The term POST indicators refers to the device mnemonics (words Memory, Keyboard, Network, SCSI, and Speaker) that appear on the system console during the power-on self-test (POST). 4. The service processor may have recorded one or more symptoms in its error log. It is a good idea to examine that error log before proceeding (see “System Information Menu” on page 781). 5. The service processor may have been set by the user to monitor system operations and to attempt recoveries. You can disable these actions while you diagnose and service the system. If you disable them, make notes of their current settings so that you can restore them easily. The following table lists the settings: Surveillance From the service processor Setup Menu, go to the Surveillance Setup Menu and disable surveillance. (Operating System Surveillance is disabled in partition mode.) Unattended Start From the service processor System Power Control Menu, disable unattended start mode. Reboot Policy From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: 1. Number of reboot attempts to 0 (zero) 2. Use OS-Defined restart policy to No 3. Enable supplemental restart policy to No. Call Out From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. The steps in this MAP will attempt to slow boot the system into service mode diagnostics. The following procedure describes booting a nonpartitioned system with an attached 3151 console. The procedure for booting a partitioned system with an attached HMC is slightly different and is described in “Performing Slow Boot” on page 413. “Performing Slow Boot” describes the procedure for booting online diagnostics (6 key between the Keyboard and Speaker indicators) and the following procedure requires booting standalone diagnostics (5 key between the Keyboard and Speaker indicators). Therefore, on a partitioned system, make sure the partition you selected to boot in the following procedure has the CD-ROM allocated to it, as described in Chapter 6, “Using the Online and Standalone Diagnostics,” on page 759. 318 406/90 Service Guide Step 1542-1 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 3. Select slow boot mode (select Disable Fast Boot) on the System Power Control menu from the service processor main menu. 4. Turn on the power. 5. Insert the diagnostic CD-ROM into the CD-ROM drive. Does the CD-ROM appear to operate correctly? NO Go to “Boot Problems” on page 409, and perform steps 1-8 for an internally attached boot device. YES Go to “Step 1542-2.”Step 1542-2 1. When the keyboard indicator is displayed (the word Keyboard), press the 5 key on the firmware console. 2. Enter the appropriate password when you are prompted to do so. Is the Please define system console screen displayed? NO Go to “Step 1542-3.” YES Go to “Step 1542-4” on page 320.Step 1542-3 The system is unable to boot standalone diagnostics. Check the service processor error log and the media subsystem operator panel for additional error codes resulting from the slow boot in “Step 1542-1.” Did the slow boot generate a different error code from the one that originally sent you to MAP 1542? NO It appears you have a processor subsystem problem. Call for support. This ends the procedure. YES Restore fast-boot mode (select Enable Fast Boot) on the System Power Control menu from the service processor main menu. Go to “Checkpoints and Error Codes Index” on page 416, and follow the actions for the new error code. Chapter 3. Maintenance Analysis Procedures 319 Step 1542-4 The system stopped with the Please define system console prompt appearing on the system console. Standalone diagnostics can be booted. Perform the following: 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. If the terminal type has not been defined, you must use the Initialize Terminal option of the FUNCTION SELECTION menu to initialize the AIX operating system environment before you can continue with the diagnostics. This is a separate and different operation from selecting the firmware console display. 4. Select Advanced Diagnostic Routines. 5. When the DIAGNOSTIC MODE SELECTION menu displays, select System Verification to run diagnostics on all resources. Did running diagnostics produce a different symptom? NO Go to step number 6, below. YES Go to “Entry MAP” on page 149. Use the new symptom.6. Record any devices missing from the list of all adapters and devices. Continue with this MAP. When you have fixed the problem, use this record to verify that all devices appear when you run System Verification.Are there any devices missing from the list of all adapters and devices? NO Reinstall all remaining adapters, if any, and reconnect all devices. Return the system to its original configuration. Be sure to select fast-boot mode (select Enable Fast Boot) on the System Power Control menu from the service processor main menu. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. YES The boot attempts that follow will attempt to isolate any remaining I/O subsystem problems with missing devices. Ignore any codes that may display on the operator panel unless stated otherwise. Go to “Step 1542-5” on page 321. 320 406/90 Service Guide Step 1542-5 There are missing devices associated with one or more I/O subsystems.Note: Before continuing, check the cabling from the processor subsystem to I/O subsystem(s) to ensure that the system is cabled correctly. Refer to “Subsystem Positioning and Cabling” on page 36 for valid configurations. Record the current cabling configuration and then continue with the following steps: 1. Turn off the power. 2. At the primary I/O book, disconnect the cable connection at RIO port A0 (U1.18-P1-H2/Q1). 3. Disconnect the cable connection at I/O subsystem 1 left I/O port connector 0 (U1.9-P1/Q2). The RIO cable that was connected to RIO port A0 should now be loose and can be removed. 4. At the primary I/O book, disconnect the cable connection at RIO port A1 (U1.18-P1-H2/Q2) and reconnect it to RIO port A0 (U1.18-P1-H2/Q1). 5. Disconnect the cable connection at I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1). The RIO cable that was connected to RIO port A0 on the primary I/O book should now be loose and can be removed. 6. Verify that processor subsystem RIO port A0 on primary I/O book (U1.18-P1-H2/Q1) is connected to I/O subsystem 1 left I/O port connector 0 (U1.9-P1/Q2). 7. Verify that I/O subsystem 1 left I/O port connector 1 (U1.9-P1/Q1) is connected to I/O subsystem 1 right I/O port connector 0 (U1.9-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-6” on page 322. YES The RIO cable that was removed in step 5 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1) to processor subsystem RIO port A1 on primary I/O book (U1.18-P1-H2/Q2). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 321 Step 1542-6 The 406/90 server must have a minimum of one I/O subsystem and one media subsystem attached to the processor subsystem. The following steps reduce the system I/O to the required minimum of one I/O subsystem: 1. Turn off the system power. 2. Verify that processor subsystem RIO port A0 on primary I/O book (U1.18-P1-H2/Q1) is connected to I/O subsystem 1 left I/O port connector 0 (U1.9-P1/Q2). 3. Verify that I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1) is connected to processor subsystem RIO port A1 on primary I/O book (U1.18-P1-H2/Q2). 4. Verify that I/O subsystem 1 left I/O port connector 1 (U1.9-P1/Q1) is connected to I/O subsystem 1 right I/O port connector 0 (U1.9-P2/Q2). 5. Disconnect cable at processor subsystem RIO port B0 on primary I/O book (U1.18-P1-H2/Q3), if cable attached. 6. Disconnect cable at processor subsystem RIO port B1 on primary I/O book (U1.18-P1-H2/Q4), if cable attached. 7. Unplug the secondary I/O book (U1.18-P1-H3), if present, and carefully pull it approximately 25 mm (1 inch) away from the backplane without disconnecting any cables 8. Unplug the third I/O book (U1.18-P1-H4), if present, and carefully pull it approximately 25 mm (1 inch) away from the backplane without disconnecting any cables. 9. Connect the previously removed RIO cable from I/O subsystem 1 right I/O port connector 1 (U1.9-P2/Q1) to processor subsystem RIO port A1 on primary I/O book (U1.18-P1-H2/Q2). 10. Verify that the power cable is connected between I/O subsystem 1 right media subsystem power connector (U1.9-P2-V1/Q3) and the media subsystem. 11. Disconnect power cables from remaining I/O subsystems. All I/O subsystems except the primary I/O subsystem should now be physically disconnected from the processor subsystem. Go to “Step 1542-7” on page 323. 322 406/90 Service Guide Step 1542-7 I/O subsystem 1 (U1.9) is the only I/O subsystem connected to the processor subsystem. 1. Turn on the power to boot standalone diagnostics from CD. 2. If the Please define the System Console screen is displayed, follow directions to select the system console. 3. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 4. Check that all attached devices and adapters are listed.If the Please define the System Console prompt did not display or all attached devices and adapters are not listed, the problem is in the media subsystem or I/O subsystem 1 (U1.9). Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-8.” YES Go to “Step 1542-21” on page 329.Step 1542-8 Perform the following to deconfigure the media subsystem: 1. Turn off the power. 2. If you have not already done so, set the service processor settings with the instructions in step 5 on page 318. Then return here and continue. 3. Exit the service processor menus, and disconnect the power cable between the I/O subsystem 1 right media subsystem power connector (U1.9-P2-V1/Q3) and the media subsystem. 4. If a display adapter with keyboard and mouse connected to a Keyboard/Mouse Attachment (USB) card in I/O subsystem 1 (U1.9) is being used as the console, locate a standalone serial terminal (for example a 3151), and connect to the S1 serial port or use the HMC. 5. Remove the keyboard and mouse, if attached to the I/O subsystem 1 (U1.9). 6. Remove (unplug) the tape drive (if installed) from the media subsystem. 7. Disconnect the diskette drive cable from the diskette drive connector on the media subsystem. 8. Connect the power cable between the I/O subsystem 1 right media subsystem power connector (U1.9-P2-V1/Q3) and the media subsystem. The media subsystem operator panel momentarily displays an OK prompt. 9. Set the current drawer location to x.y=1.9. Use this value in future steps until changed in a later step.Go to “Step 1542-9” on page 324. Chapter 3. Maintenance Analysis Procedures 323 Step 1542-9 Are there any adapters in slots 1, 2, 3, 4 or 5 (location codes Ux.y-P1-I1 through I5) on the left side of the I/O subsystem? NO Go to “Step 1542-10.” YES Go to “Step 1542-13.”Step 1542-10 Are there any adapters in slots 6, 7, 8, 9 or 10 (location codes Ux.y-P1-I6 through I10) on the left side of the I/O subsystem? NO Go to “Step 1542-11.” YES Go to “Step 1542-14” on page 325.Step 1542-11 Are there any adapters in slots 1, 2, 3, 4 or 5 (location codes Ux.y-P2-I1 through I5) on the right side of the I/O subsystem? NO Go to “Step 1542-12.” YES Go to “Step 1542-15” on page 325.Step 1542-12 Are there any adapters in slots 6, 7, 8, 9 or 10 (location codes Ux.y-P2-I7 through I10) on the right side of the I/O subsystem? NO Go to “Step 1542-19” on page 328. YES Go to “Step 1542-16” on page 326.Step 1542-13 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 1, 2, 3, 4 and 5 (location codes Ux.y-P1-I1 through I5) from the left side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-10.” YES Go to “Step 1542-17” on page 326. 324 406/90 Service Guide Step 1542-14 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 6, 7, 8, 9 and 10 (location codes Ux.y-P1-I6 through I10) from the left side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-11” on page 324. YES Go to “Step 1542-17” on page 326.Step 1542-15 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 1, 2, 3, 4 and 5 (location codes Ux.y-P2-I1 through I5) from the right side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-12” on page 324. YES Go to “Step 1542-17” on page 326. Chapter 3. Maintenance Analysis Procedures 325 Step 1542-16 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 6, 7, 8, 9 and 10 (location codes Ux.y-P2-I6 through I10) from the right side of the I/O subsystem that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-19” on page 328. YES Go to “Step 1542-17.”Step 1542-17 If the Please define the System Console screen displays and all attached devices and adapters are listed, the problem is with one of the adapter cards or devices that was removed or disconnected from the I/O subsystem. 1. Turn off the power. 2. Reinstall one adapter or device that was removed. Use the original adapter cards in their original slots when reinstalling adapters. 3. Turn on the power to boot standalone diagnostics from CD. 4. If the Please define the System Console screen is displayed, follow directions to select the system console. 5. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 6. Check that all attached devices and adapters are listed.Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-18” on page 327. YES Reinstall the next adapter or device and return to the beginning of this step. Continue repeating this process until an adapter or device causes the Please define the System Console screen to not display or all attached devices and adapters to not be listed. After installing all of the adapters and the Please define the System Console screen does display and all attached devices and adapters are listed, return the system to its original configuration. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 326 406/90 Service Guide Step 1542-18 Replace the adapter you just installed with a new adapter and try to boot standalone diagnostics from CD again. 1. If the Please define the System Console screen is displayed, follow directions to select the system console. 2. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 3. Check that all attached devices and adapters are listed.Did the Please define the System Console screen display and are all attached devices and adapters listed? NO The I/O subsystem backplane is defective. Replace the I/O subsystem backplane on the side of the problem adapter (Ux.y-P1 or Ux.y-P2). Go to “Step 1542-20” on page 329. YES The adapter was defective. Go to “Step 1542-20” on page 329. Chapter 3. Maintenance Analysis Procedures 327 Step 1542-19 1. Turn off the power. 2. Disconnect the I/O subsystem power cables. 3. Replace the following parts, one at a time, in the sequence listed: a. Boot device SCSI cable b. Boot device SCSI adapter c. Boot device SCSI backplane d. Boot device e. I/O subsystem backplane where boot device is located f. Other I/O subsystem backplane g. Both I/O subsystem DCAs4. Reconnect the I/O subsystem power cables. 5. Turn on the power. 6. Boot standalone diagnostics from CD. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Replace the next part in the list and return to the beginning of this step. Continue repeating this process until a part causes the Please define the System Console screen to be displayed and all attached devices and adapters to be listed. If you have replaced all the items listed above and the Please define the System Console screen does not display or all attached devices and adapters are not listed, check any external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to “Step 1542-20” on page 329. 328 406/90 Service Guide Step 1542-20 1. Turn off the power. 2. The part just replaced fixed the problem. 3. If a display adapter with keyboard, and mouse were used, reinstall the display adapter card, keyboard and mouse. 4. Reconnect the diskette drive cable to the diskette drive connector on the media subsystem. 5. Reconnect the tape drive (if previously installed) to the internal SCSI bus cable. 6. Plug in all adapters that were previously removed but not reinstalled. 7. Reconnect the I/O subsystem power cables that were previously disconnected. Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Step 1542-21 If the boot is successful, the problem is with a remaining I/O subsystem. Turn off the power. Are there any I/O subsystems left to connect to the processor subsystem? NO Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1542-22.”Step 1542-22 Reconnect the next I/O subsystem in the configuration that you recorded earlier in this MAP. Use the following table to determine the next I/O subsystem to reconnect. I/O Subsystem Current x.y Value New x.y Value Go to: 1 1.9 1.5 “Step 1542-23” on page 330 2 1.5 1.1 “Step 1542-24” on page 331 3 1.1 1.13 (no IBF), 2.9 (with IBF) “Step 1542-25” on page 332 4 1.13 2.1 “Step 1542-26” on page 333 4 2.9 2.1 “Step 1542-26” on page 333 5 2.1 2.5 “Step 1542-27” on page 334 6 2.5 2.13 “Step 1542-28” on page 335 7 2.13 2.19 “Step 1542-29” on page 336 8 2.19 2.19 “Step 1542-30” on page 336 Note: The I/O subsystems should be cabled as shown in “Subsystem Positioning and Cabling” on page 36. Chapter 3. Maintenance Analysis Procedures 329 Step 1542-23 1. Turn off the power. 2. At the primary I/O book, disconnect the cable connection at RIO port B0 (U1.18-P1-H2/Q3). 3. Disconnect the cable connection at I/O subsystem 2 left I/O port connector 0 (U1.5-P1/Q2). The RIO cable that was connected to RIO port B0 should now be loose and can be removed. 4. At the primary I/O book, disconnect the cable connection at RIO port B1 (U1.18-P1-H2/Q4) and reconnect it to RIO port B0 (U1.18-P1-H2/Q3). 5. Disconnect the cable connection at I/O subsystem 2 right I/O port connector 1 (U1.5-P2/Q1) and reconnect it to I/O subsystem 2 left I/O port connector 0 (U1.5-P1/Q2). 6. Verify that processor subsystem RIO port B0 on primary I/O book (U1.18-P1-H2/Q3) is connected to I/O subsystem 2 left I/O port connector 0 (U1.5-P1/Q2). 7. Verify that I/O subsystem 2 left I/O port connector 1 (U1.5-P1/Q1) is connected to I/O subsystem 2 right I/O port connector 0 (U1.5-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 2 right I/O port connector 1 (U1.5-P2/Q1) to processor subsystem RIO port B1 on primary I/O book (U1.18-P1-H2/Q4). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step number 3 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 2 right I/O port connector 1 (U1.5-P2/Q1) to processor subsystem RIO port B1 on primary I/O book (U1.18-P1-H2/Q4). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 330 406/90 Service Guide Step 1542-24 1. Turn off the power. 2. Slide the secondary I/O book (U1.18-P1-H3) all the way into the processor backplane until it is firmly seated and latched. 3. At the secondary I/O book, disconnect the cable connection at RIO port A0 (U1.18-P1-H3/Q4). 4. At the secondary I/O book, disconnect the cable connection at RIO port A1 (U1.18-P1-H3/Q3). 5. At the secondary I/O book, disconnect the cable connection at RIO port B0 (U1.18-P1-H3/Q2). 6. At the secondary I/O book, disconnect the cable connection at RIO port B1 (U1.18-P1-H3/Q1). 7. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H3/Q6). 8. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H3/Q5). 9. At the secondary I/O book, disconnect the cable connection at RIO port C0 (U1.18-P1-H3/Q8). 10. Disconnect the cable connection at I/O subsystem 3 left I/O port connector 0 (U1.1-P1/Q2). The RIO cable that was connected to RIO port C0 should now be loose and can be removed. 11. At the secondary I/O book, disconnect the cable connection at RIO port C1 (U1.18-P1-H3/Q7) and reconnect it to RIO port C0 (U1.18-P1-H3/Q8). 12. Disconnect the cable connection at I/O subsystem 3 right I/O port connector 1 (U1.1-P2/Q1) and reconnect it to I/O subsystem 3 left I/O port connector 0 (U1.1-P1/Q2). 13. Verify that processor subsystem RIO port C0 on secondary I/O book (U1.18-P1-H3/Q8) is connected to I/O subsystem 3 left I/O port connector 0 (U1.1-P1/Q2). 14. Verify that I/O subsystem 3 left I/O port connector 1 (U1.1-P1/Q1) is connected to I/O subsystem 3 right I/O port connector 0 (U1.1-P2/Q2). 15. Turn on the power to boot standalone diagnostics from CD. 16. If the Please define the System Console screen is displayed, follow directions to select the system console. 17. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 18. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 3 right I/O port connector 1 (U1.1-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H3/Q7). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 10 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 3 right I/O port connector 1 (U1.1-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H3/Q7). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 331 Step 1542-25 1. Turn off the power. 2. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H3/Q6). 3. Disconnect the cable connection at I/O subsystem 4 left I/O port connector 0 (Ux.y-P1/Q2). The RIO cable that was connected to RIO port D0 should now be loose and can be removed. 4. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H3/Q5) and reconnect it to RIO port D0 (U1.18-P1-H3/Q6). 5. Disconnect the cable connection at I/O subsystem 4 right I/O port connector 1 (Ux.y-P2/Q1) and reconnect it to I/O subsystem 4 left I/O port connector 0 (Ux.y-P1/Q2). 6. Verify that processor subsystem RIO port D0 on secondary I/O book (U1.18-P1-H3/Q6) is connected to I/O subsystem 4 left I/O port connector 0 (Ux.y-P1/Q2). 7. Verify that I/O subsystem 4 left I/O port connector 1 (Ux.y-P1/Q1) is connected to I/O subsystem 4 right I/O port connector 0 (Ux.y-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 4 right I/O port connector 1 (Ux.y-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H3/Q5). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 above is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 4 right I/O port connector 1 (Ux.y-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H3/Q5). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 332 406/90 Service Guide Step 1542-26 1. Turn off the power. 2. At the secondary I/O book disconnect the cable connection at RIO port A0 (U1.18-P1-H3/Q4). 3. Disconnect the cable connection at I/O subsystem 5 left I/O port connector 0 (U2.1-P1/Q2).The RIO cable that was connected to RIO port A0 should now be loose and can be removed. 4. At the secondary I/O book, disconnect the cable connection at RIO port A1 (U1.18-P1-H3/Q3) and reconnect it to RIO port A0 (U1.18-P1-H3/Q4). 5. Disconnect the cable connection at I/O subsystem 5 right I/O port connector 1 (U2.1-P2/Q1) and reconnect it to I/O subsystem 5 left I/O port connector 0 (U2.1-P1/Q2). 6. Verify that processor subsystem RIO port A0 on secondary I/O book (U1.18-P1-H3/Q4) is connected to I/O subsystem 5 left I/O port connector 0 (U2.1-P1/Q2). 7. Verify that I/O subsystem 5 left I/O port connector 1 (U2.1-P1/Q1) is connected to I/O subsystem 5 right I/O port connector 0 (U2.1-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 5 right I/O port connector 1 (U2.1-P2/Q1) to processor subsystem RIO port A1 on secondary I/O book (U1.18-P1-H3/Q3). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 5 right I/O port connector 1 (U2.1-P2/Q1) to processor subsystem RIO port A1 on secondary I/O book (U1.18-P1-H3/Q3). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 333 Step 1542-27 1. Turn off the power. 2. At the secondary I/O book, disconnect the cable connection at RIO port B0 (U1.18-P1-H3/Q2). 3. Disconnect the cable connection at I/O subsystem 6 left I/O port connector 0 (U2.5-P1/Q2). The RIO cable that was connected to RIO port B0 should now be loose and can be removed. 4. At the secondary I/O book, disconnect the cable connection at RIO port B1 (U1.18-P1-H3/Q1) and reconnect it to RIO port B0 (U1.18-P1-H3/Q2). 5. Disconnect the cable connection at I/O subsystem 6 right I/O port connector 1 (U2.5-P2/Q1) and reconnect it to I/O subsystem 6 left I/O port connector 0 (U2.5-P1/Q2). 6. Verify that processor subsystem RIO port B0 on secondary I/O book (U1.18-P1-H3/Q2) is connected to I/O subsystem 6 left I/O port connector 0 (U2.5-P1/Q2). 7. Verify that I/O subsystem 6 left I/O port connector 1 (U2.5-P1/Q1) is connected to I/O subsystem 6 right I/O port connector 0 (U2.5-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 6 right I/O port connector 1 (U2.5-P2/Q1) to processor subsystem RIO port B1 on secondary I/O book (U1.18-P1-H3/Q1). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 6 right I/O port connector 1 (U2.5-P2/Q1) to processor subsystem RIO port B1 on secondary I/O book (U1.18-P1-H3/Q1). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 334 406/90 Service Guide Step 1542-28 1. Turn off the power. 2. Slide the secondary I/O book (U1.18-P1-H4) all the way into the processor backplane until it is firmly seated and latched. 3. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H4/Q6). 4. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H4/Q5). 5. At the secondary I/O book, disconnect the cable connection at RIO port C0 (U1.18-P1-H4/Q8). 6. Disconnect the cable connection at I/O subsystem 7 left I/O port connector 0 (U2.13-P1/Q2). The RIO cable that was connected to RIO port C0 should now be loose and can be removed. 7. At the secondary I/O book, disconnect the cable connection at RIO port C1 (U1.18-P1-H4/Q7) and reconnect it to RIO port C0 (U1.18-P1-H4/Q8). 8. Disconnect the cable connection at I/O subsystem 7 right I/O port connector 1 (U2.13-P2/Q1) and reconnect it to I/O subsystem 7 left I/O port connector 0 (U2.13-P1/Q2). 9. Verify that processor subsystem RIO port C0 on secondary I/O book (U1.18-P1-H4/Q8) is connected to I/O subsystem 7 left I/O port connector 0 (U2.13-P1/Q2). 10. Verify that I/O subsystem 7 left I/O port connector 1 (U2.13-P1/Q1) is connected to I/O subsystem 7 right I/O port connector 0 (U2.13-P2/Q2). 11. Turn on the power to boot standalone diagnostics from CD. 12. If the Please define the System Console screen is displayed, follow directions to select the system console. 13. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 14. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 7 right I/O port connector 1 (U2.13-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H4/Q7). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 6 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 7 right I/O port connector 1 (U2.13-P2/Q1) to processor subsystem RIO port C1 on secondary I/O book (U1.18-P1-H4/Q7). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures 335 Step 1542-29 1. Turn off the power. 2. At the secondary I/O book, disconnect the cable connection at RIO port D0 (U1.18-P1-H4/Q6). 3. Disconnect the cable connection at I/O subsystem 8 left I/O port connector 0 (U2.19-P1/Q2). The RIO cable that was connected to RIO port D0 should now be loose and can be removed 4. At the secondary I/O book, disconnect the cable connection at RIO port D1 (U1.18-P1-H4/Q5) and reconnect it to RIO port D0 (U1.18-P1-H4/Q6). 5. Disconnect the cable connection at I/O subsystem 8 right I/O port connector 1 (U2.19-P2/Q1) and reconnect it to I/O subsystem 8 left I/O port connector 0 (U2.19-P1/Q2). 6. Verify that processor subsystem RIO port D0 on secondary I/O book (U1.18-P1-H4/Q6) is connected to I/O subsystem 8 left I/O port connector 0 (U2.19-P1/Q2). 7. Verify that I/O subsystem 8 left I/O port connector 1 (U2.19-P1/Q1) is connected to I/O subsystem 8 right I/O port connector 0 (U2.19-P2/Q2). 8. Turn on the power to boot standalone diagnostics from CD. 9. If the Please define the System Console screen is displayed, follow directions to select the system console. 10. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 11. Check that all attached devices and adapters are listed Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Connect the previously removed RIO cable from I/O subsystem 8 right I/O port connector 1 (U2.19-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H4/Q5). Go to “Step 1542-9” on page 324. YES The RIO cable that was removed in step 3 is defective. Replace the RIO cable. Connect the new RIO cable from I/O subsystem 8 right I/O port connector 1 (U2.19-P2/Q1) to processor subsystem RIO port D1 on secondary I/O book (U1.18-P1-H4/Q5). Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems.Step 1542-30 This ends the procedure. Reconfigure the system to its original condition. Go to “MAP 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 336 406/90 Service Guide MAP 1543: MCM Module Problem Isolation Step 1543-1 Record the error code and the location code(s) that sent you to this MAP. Step 1543-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1543-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1543-4.” No Call for support.Step 1543-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. You may be asked to replace one or more of the following cards and modules: v Passthru module at U1.18-P1-C2 v Passthru module at U1.18-P1-C3 v MCM module 0 at U1.18-P1-C1 v MCM module 2 at U1.18-P1-C4 v MCM module 1 at U1.18-P1-C3 v MCM module 3 at U1.18-P1-C2 v System clock card, at location U1.18-P1-X5Replace the following books and modules, if present, one at a time: 1. Passthru module at U1.18-P1-C2 if not recorded in “Step 1543-1.” 2. Passthru module at U1.18-P1-C3 if not recorded in “Step 1543-1.” 3. System clock card, at location U1.18-P1-X5 if not recorded in “Step 1543-1.” 4. First location code item recorded, if any, in “Step 1543-1.” 5. Second location code item recorded, if any, in “Step 1543-1.” 6. Third location code item recorded, if any, in “Step 1543-1.” 7. Fourth location code item recorded, if any, in “Step 1543-1.” 8. MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1543-1.” 9. MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1543-1.” 10. MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1543-1.” 11. MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1543-1.”Step 1543-5 Turn on the power. Chapter 3. Maintenance Analysis Procedures 337 Step 1543-6 Did the system stop with the same error code as recorded in “Step 1543-1” on page 337? Yes Go to “Step 1543-7.” No The module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1543-7 Have all the modules listed in “Step 1543-4” on page 337 been replaced? Yes Go to “Step 1543-8.” No Go to “Step 1543-2” on page 337.Step 1543-8 Turn off the power. Step 1543-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1543-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1543-11.” No Call for support.Step 1543-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1543-12 Turn on the power. Step 1543-13 Did the system stop with the same error code as recorded in “Step 1541-1” on page 310? Yes Go to “Step 1543-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1543-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1543-15 Call for support. 338 406/90 Service Guide This ends the procedure. Chapter 3. Maintenance Analysis Procedures 339 MAP 1544: Inner L3 Module Problem Isolation Step 1544-1 Record the error code and the location code(s) that sent you to this MAP. Step 1544-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1544-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1544-4.” No Call for support.Step 1544-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. You may be asked to replace one or more of the following books and modules: v Memory book 0 at U1.18-P1-M2 v Memory book 1 at U1.18-P1-M3 v Memory book 2 at U1.18-P1-M7 v Memory book 3 at U1.18-P1-M6 v L3 module at U1.18-P1-C6 v L3 module at U1.18-P1-C7 v L3 module at U1.18-P1-C10 v L3 module at U1.18-P1-C11 v L3 module at U1.18-P1-C14 v L3 module at U1.18-P1-C15 v L3 module at U1.18-P1-C18 v L3 module at U1.18-P1-C19Replace the following books and modules, if present, one at a time: 1. First location code item recorded, if any, in “Step 1544-1.” 2. Second location code item recorded, if any, in “Step 1544-1.” 3. Third location code item recorded, if any, in “Step 1544-1.”Step 1544-5 Turn on the power. Step 1544-6 Did the system stop with the same error code as recorded in “Step 1544-1”? Yes Go to “Step 1544-7” on page 341. No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. 340 406/90 Service Guide Step 1544-7 Have all the books or modules listed in “Step 1544-4” on page 340 been replaced? Yes Go to “Step 1544-8.” No Go to “Step 1544-2” on page 340.Step 1544-8 Turn off the power. Step 1544-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1544-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1544-11.” No Call for support.Step 1544-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1544-12 Turn on the power. Step 1544-13 Did the system stop with the same error code as recorded in “Step 1544-1” on page 340? Yes Go to “Step 1544-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1544-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1544-15 Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 341 MAP 1545: Outer L3 Module Problem Isolation Step 1545-1 Record the error code and the location code(s) that sent you to this MAP. Step 1545-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1545-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1545-4.” No Call for support.Step 1545-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. You may be asked to replace one or more of the following books and modules: v Memory book 4 at U1.18-P1-M4 v Memory book 5 at U1.18-P1-M8 v Memory book 6 at U1.18-P1-M5 v Memory book 7 at U1.18-P1-M1 v L3 module at U1.18-P1-C5 v L3 module at U1.18-P1-C8 v L3 module at U1.18-P1-C9 v L3 module at U1.18-P1-C12 v L3 module at U1.18-P1-C13 v L3 module at U1.18-P1-C16 v L3 module at U1.18-P1-C17 v L3 module at U1.18-P1-C20Replace the following books and modules, if present, one at a time: 1. First location code item recorded, if any, in “Step 1545-1.” 2. Second location code item recorded, if any, in “Step 1545-1.” 3. Third location code item recorded, if any, in “Step 1545-1.”Step 1545-5 Turn on the power. 342 406/90 Service Guide Step 1545-6 Did the system stop with the same error code as recorded in “Step 1545-1” on page 342? Yes Go to “Step 1545-7.” No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1545-7 Have all the books or modules listed in “Step 1545-4” on page 342 been replaced? Yes Go to “Step 1545-8.” No Go to “Step 1545-2” on page 342.Step 1545-8 Turn off the power. Step 1545-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1545-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1545-11.” No Call for support.Step 1545-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1545-12 Turn on the power. Step 1545-13 Did the system stop with the same error code as recorded in “Step 1545-1” on page 342? Yes Go to “Step 1545-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1545-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1545-15 Call for support. Chapter 3. Maintenance Analysis Procedures 343 This ends the procedure. 344 406/90 Service Guide MAP 1546: Memory Book Problem Isolation Step 1546-1 Record the error code and the location code(s) that sent you to this MAP. Step 1546-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1546-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1546-4.” No Call for support.Step 1546-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1546-1.” v Second location code item recorded, if any, in “Step 1546-1.” v Third location code item recorded, if any, in “Step 1546-1.” v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1546-1.” v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1546-1.” v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1546-1.” v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1546-1.” v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1546-1.” v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1546-1.” v Memory book 6 at U1.18-P1-M5 if not recorded in “Step 1546-1.” v Memory book 7 at U1.18-P1-M1 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C5 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C6 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C7 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C8 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C9 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C10 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C11 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C12 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C13 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C14 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C15 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C16 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C17 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C18 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C19 if not recorded in “Step 1546-1.” v L3 module at U1.18-P1-C20 if not recorded in “Step 1546-1.”Step 1546-5 Turn on the power. Chapter 3. Maintenance Analysis Procedures 345 Step 1546-6 Did the system stop with the same error code as recorded in “Step 1546-1” on page 345? Yes Go to “Step 1546-7.” No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1546-7 Have all the books or modules listed in “Step 1546-4” on page 345 been replaced? Yes Go to “Step 1546-8.” No Go to “Step 1546-2” on page 345.Step 1546-8 Turn off the power. Step 1546-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1546-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1546-11.” No Call for support.Step 1546-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1546-12 Turn on the power. Step 1546-13 Did the system stop with the same error code as recorded in “Step 1546-1” on page 345? Yes Go to “Step 1546-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1546-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1546-15 Call for support. 346 406/90 Service Guide This ends the procedure. Chapter 3. Maintenance Analysis Procedures 347 MAP 1547: I/O Book Problem Isolation Step 1547-1 Record the error code and the location code(s) that sent you to this MAP. Step 1547-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1547-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1547-4.” No Call for support.Step 1547-4 Replace the following books, if present, one at a time: v Primary I/O Book at U1.18-P1-H2 v Secondary I/O Book at U1.18-P1-H3 v Third I/O Book at U1.18-P1–H4 v Fourth I/O Book at U1.18-P1-H1Step 1547-5 Turn on the power. Step 1547-6 Did the system stop with the same error code as recorded in “Step 1547-1”? Yes Go to “Step 1547-7.” No The book just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1547-7 Have all the books or modules listed in “Step 1547-4” been replaced? Yes Go to “Step 1547-8.” No Go to “Step 1547-2.”Step 1547-8 Turn off the power. Step 1547-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1547-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1547-11” on page 349. 348 406/90 Service Guide No Call for support.Step 1547-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1547-12 Turn on the power. Step 1547-13 Did the system stop with the same error code as recorded in “Step 1547-1” on page 348? Yes Go to “Step 1547-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1547-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1547-15 Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 349 MAP 1548: Memory and Processor Problem Isolation Step 1548-1 Record the error code and the location code(s) that sent you to this MAP. Step 1548-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1548-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1548-4.” No Call for support.Step 1548-4 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1548-1.” v Second location code item recorded, if any, in “Step 1548-1.” v Third location code item recorded, if any, in “Step 1548-1.” v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1548-1.” v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C6 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C7 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C10 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C11 if not recorded in “Step 1548-1.” v Passthru module at U1.18-P1-C2 if not recorded in “Step 1548-1.” v Passthru module at U1.18-P1-C3 if not recorded in “Step 1548-1.” v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1548-1.” v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1548-1.” v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1548-1.” v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1548-1.” v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1548-1.” v Memory book 6 at U1.18-P1-M5 if not recorded in “Step 1548-1.” v Memory book 7 at U1.18-P1-M1 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C5 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C8 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C9 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C12 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C13 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C14 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C15 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C16 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C17 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C18 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C19 if not recorded in “Step 1548-1.” v L3 module at U1.18-P1-C20 if not recorded in “Step 1548-1.” v MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1548-1.” v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1548-1.” 350 406/90 Service Guide v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1548-1” on page 350.Step 1548-5 Turn on the power. Step 1548-6 Did the system stop with the same error code as recorded in “Step 1548-1” on page 350? Yes Go to “Step 1548-7.” No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1548-7 Have all the books or modules listed in “Step 1548-4” on page 350 been replaced? Yes Go to “Step 1548-8.” No Go to “Step 1548-2” on page 350.Step 1548-8 Turn off the power. Step 1548-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1548-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? Yes Go to “Step 1548-11.” No Call for support.Step 1548-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Chapter 3. Maintenance Analysis Procedures 351 Step 1548-12 Turn on the power. Step 1548-13 Did the system stop with the same error code as recorded in “Step 1548-1” on page 350? Yes Go to “Step 1548-14.” No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.Step 1548-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1548-15 Call for support. This ends the procedure. 352 406/90 Service Guide MAP 1549: Attention Problem Isolation Step 1549-1 Record the error code and the location code(s) that sent you to this MAP. Step 1549-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 1549-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 1549-4.”Step 1549-4 Examine the following table and locate the error code that sent you to this MAP to determine your next step. Error Code Action 406x 0EB1 Go to “Step 1549-5” on page 354. 406x 0EB2 Go to “Step 1549-9” on page 355. 406x 0EB3 Go to “Step 1549-13” on page 356. 406x 0EB4 Go to “Step 1549-33” on page 359. 406x 0EB5 Go to “Step 1549-29” on page 358. 406x 0EB6 Go to “Step 1549-33” on page 359. 406x 0EB7 Go to “Step 1549-33” on page 359. 406x 0EB8 Go to “Step 1549-33” on page 359. 406x 0EB9 Go to “Step 1549-33” on page 359. 406x 0EBA Go to “Step 1549-17” on page 356. 406x 0EBB Go to “Step 1549-21” on page 357. 406x 0EBC Go to “Step 1549-25” on page 358. Chapter 3. Maintenance Analysis Procedures 353 Step 1549-5 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353. v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 1549-1” on page 353. v MCM module 2 at U1.18-P1-C4 if not recorded in “Step 1549-1” on page 353. v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 1549-1” on page 353. v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 1549-1” on page 353.Step 1549-6 Turn on the power. Step 1549-7 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-8.”Step 1549-8 Have all the books or modules listed in “Step 1549-5” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359. 354 406/90 Service Guide Step 1549-9 Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C6 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C7 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C10 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C11 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C5 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C8 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C9 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C12 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C13 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C14 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C15 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C16 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C17 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C18 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C19 if not recorded in “Step 1549-1” on page 353. v L3 module at U1.18-P1-C20 if not recorded in “Step 1549-1” on page 353.Step 1549-10 Turn on the power. Step 1549-11 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-12.”Step 1549-12 Have all the books listed in “Step 1549-9” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-13” on page 356. Chapter 3. Maintenance Analysis Procedures 355 Step 1549-13 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 1549-1” on page 353. v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-14 Turn on the power. Step 1549-15 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-16.”Step 1549-16 Have all the books or modules listed in “Step 1549-13” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359.Step 1549-17 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 1549-1” on page 353. v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-18 Turn on the power. 356 406/90 Service Guide Step 1549-19 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-20.”Step 1549-20 Have all the books or modules listed in “Step 1549-17” on page 356 been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359.Step 1549-21 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 4 at U1.18-P1-M4 if not recorded in “Step 1549-1” on page 353. v Memory book 5 at U1.18-P1-M8 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-22 Turn on the power. Step 1549-23 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-24.”Step 1549-24 Have all the books or modules listed in “Step 1549-21” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359. Chapter 3. Maintenance Analysis Procedures 357 Step 1549-25 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Memory book 6 at U1.18-P1-M5 if not recorded in “Step 1549-1” on page 353. v Memory book 7 at U1.18-P1-M1 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-26 Turn on the power. Step 1549-27 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-28.”Step 1549-28 Have all the books or modules listed in “Step 1549-25” been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37” on page 359.Step 1549-29 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353. v Secondary I/O Book at U1.18-P1-H3 if not recorded in “Step 1549-1” on page 353. v Third I/O Book at U1.18-P1-H4 if not recorded in “Step 1549-1” on page 353. v Fourth I/O Book at U1.18-P1-H1 if not recorded in “Step 1549-1” on page 353. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353.Step 1549-30 Turn on the power. 358 406/90 Service Guide Step 1549-31 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-32.”Step 1549-32 Have all the books or modules listed in “Step 1549-29” on page 358 been replaced? No Go to “Step 1549-2” on page 353. Yes Go to “Step 1549-37.”Step 1549-33 Replace the following books and modules, if present, one at a time: v Replace the Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 1549-1” on page 353. v First location code item recorded, if any, in “Step 1549-1” on page 353. v Second location code item recorded, if any, in “Step 1549-1” on page 353. v Third location code item recorded, if any, in “Step 1549-1” on page 353.Step 1549-34 Turn on the power. Step 1549-35 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-36.”Step 1549-36 Have all the books or modules listed in “Step 1549-33” been replaced? No Go to “Step 1549-2” on page 353. Yes Call for support. This ends the procedure.Step 1549-37 Turn off the power. Step 1549-38 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Chapter 3. Maintenance Analysis Procedures 359 Step 1549-39 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 1549-40.”Step 1549-40 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 1549-41 Turn on the power. Step 1549-42 Did the system stop with the same error code as recorded in “Step 1549-1” on page 353? No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-43.”Step 1549-43 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 1549-44 Call for support. This ends the procedure. 360 406/90 Service Guide MAP 154A: I2C Bus Problem Isolation Step 154A-1 Record the error code, location code(s), word 13, and word 16 that sent you to this MAP. Step 154A-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154A-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154A-4.”Step 154A-4 Examine the following table and locate the error code that sent you to this MAP to determine your next step. Error Code Action B1xx 4643 Go to “Step 154A-22” on page 364. B1xx 4645 Go to “Step 154A-9” on page 362. B1xx 4648, 406x 0001, 406x 0002, 406x 000A, 406x 000B, 406x 000C, 406x 000D Go to “Step 154A-5.” Step 154A-5 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 154A-1.” v Second location code item recorded, if any, in “Step 154A-1.” v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 154A-1.” v Secondary I/O Book at U1.18-P1-H3 if not recorded in “Step 154A-1.” v Third I/O Book at U1.18-P1-H4 if not recorded in “Step 154A-1.” v Fourth I/O Book at U1.18-P1-H1 if not recorded in “Step 1549-1” on page 353. v System clock card, at U1.18-P1-X5 if not recorded in “Step 154A-1.”Step 154A-6 Turn on the power. Chapter 3. Maintenance Analysis Procedures 361 Step 154A-7 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154A-8.”Step 154A-8 Have all the books listed in “Step 154A-5” on page 361 been replaced? No Go to “Step 154A-2” on page 361. Yes Go to “Step 154A-14.”Step 154A-9 Is the value recorded in “Step 154A-1” on page 361 for word 16 of this error code equal to 00000008 or 00000009? No Go to “Step 154A-5” on page 361. Yes Go to “Step 154A-10.”Step 154A-10 Replace the following books and modules, if present, one at a time: v First location code item recorded, if any, in “Step 154A-1” on page 361. v Second location code item recorded, if any, in “Step 154A-1” on page 361. v Primary I/O Book at U1.18-P1-H2 if not recorded, in “Step 154A-1” on page 361. v Capacitor book, at location U1.18-P1-V3 if not recorded in “Step 154A-1” on page 361. v Capacitor book, at location U1.18-P1-V7 if not recorded in “Step 154A-1” on page 361.Step 154A-11 Turn on the power. Step 154A-12 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1549-12” on page 355.Step 154A-13 Have all the books or modules listed in “Step 154A-10” been replaced? No Go to “Step 154A-2” on page 361. Yes Go to “Step 154A-14.”Step 154A-14 Turn off the power. 362 406/90 Service Guide Step 154A-15 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154A-16 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154A-17.”Step 154A-17 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18-P1. Step 154A-18 Turn on the power. Step 154A-19 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154A-20.”Step 154A-20 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. Step 154A-21 Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 363 Step 154A-22 Is the value recorded in “Step 154A-1” on page 361 for word 13 of this error code equal to 0000A218 or 0000A21B? No Go to “Step 154A-5” on page 361. Yes Go to “Step 154A-23.”Step 154A-23 Replace the following cards and modules, if present, one at a time: v First location code item recorded, if any, in “Step 154A-1” on page 361. v Second location code item recorded, if any, in “Step 154A-1” on page 361. v Primary I/O Book at U1.18-P1-H2 if not recorded in “Step 154A-1” on page 361. v Component on which you are trying to turn on the identify LED at the location you selected.Step 154A-24 Turn on the power. Step 154A-25 Did the system stop with the same error code as recorded in “Step 154A-1” on page 361? No The card or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154A-26.”Step 154A-26 Have all the cards or modules listed in “Step 154A-23” been replaced? No Go to “Step 154A-2” on page 361. Yes Go to “Step 154A-14” on page 362. 364 406/90 Service Guide MAP 154B: Insufficient Hardware Resources Problem Isolation Step 154B-1 Record the error code, location code(s), and word 13 that sent you to this MAP. Step 154B-2 Turn off the power. Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154B-3 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154B-4.”Step 154B-4 Attention: Before replacing any parts, examine the processor subsystem drawer to ensure that: v At least one MCM (processor) module is installed. v At least one memory card is installed. v At least one L3 cache module is installed. v The primary I/O book is installed.Refer to the error code recorded in “Step 154B-1.” When using the listings below, note the error code condition before you replace parts that are called out. Replace the following FRUs, if present, one at a time: Attention: Some of the parts in the following list are modules that have a limit of three plug cycles. Before replacing any of the modules in the following list, call for support. v First location code item recorded, if any, in “Step 154B-1.” v Second location code item recorded, if any, in “Step 154B-1.” v Third location code item recorded, if any, in “Step 154B-1.”Note: Replace the following FRUs only if they are associated with the word 13 that you recorded earlier. If the value of word 13, recorded in “Step 154B-1,” is 4110 0000 or 4060 0000, replace the following: v Primary I/O book at U1.18-P1-H2 if not recorded in “Step 154B-1.” If the value of word 13, recorded in “Step 154B-1,” is 403F 0000 or 40FF 0000, replace the following: v Memory book 0 at U1.18-P1-M2 if not recorded in “Step 154B-1.” v Memory book 1 at U1.18-P1-M3 if not recorded in “Step 154B-1” v Memory book 2 at U1.18-P1-M7 if not recorded in “Step 154B-1” v Memory book 3 at U1.18-P1-M6 if not recorded in “Step 154B-1.” If the value of word 13, recorded in “Step 154B-1,” is 402F 0000, replace the following: v MCM module 0 at U1.18-P1-C1 if not recorded in “Step 154B-1.” v MCM module 2 at U1.18-P1-C4 if not recorded in“Step 154B-1.” v MCM module 1 at U1.18-P1-C3 if not recorded in “Step 154B-1.” v MCM module 3 at U1.18-P1-C2 if not recorded in “Step 154B-1.” Chapter 3. Maintenance Analysis Procedures 365 Step 154B-5 Turn on the power. Step 154B-6 Did the system stop with the same error code as recorded in “Step 154B-1” on page 365? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154B-7.”Step 154B-7 Have all the books or modules listed in “Step 154B-4” on page 365 been replaced? No Go to “Step 154B-2” on page 365. Yes Go to “Step 154B-8.”Step 154B-8 Turn off the power. Step 154B-9 Examine the amber logic-power LEDs of all installed processor subsystem DCAs. Step 154B-10 Are all amber logic-power LEDs of all installed processor subsystem DCAs off? No Call for support. Yes Go to “Step 154A-11” on page 362.Step 154B-11 Attention: Before replacing the system backplane, call for support. Replace the system backplane at U1.18–P1. Step 154B-12 Turn on the power. Step 154B-13 Did the system stop with the same error code as recorded in “Step 154B-1” on page 365? No The book or module just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 154B-14.”Step 154B-14 Turn off the power. Make sure all amber logic-power LEDs of all installed processor subsystem DCAs are off. 366 406/90 Service Guide Step 154B-15 Call for support. This ends the procedure. Chapter 3. Maintenance Analysis Procedures 367 368 406/90 Service Guide Chapter 4. Checkpoints Checkpoints display on the media subsystem operator panel and the virtual terminal while the system is powering on and going through the initial program load (IPL). This chapter explains the IPL flow of the system and provides a table that lists checkpoints that you might see in the media subsystem operator panel and in a virtual terminal window on the HMC. IPL Flow The IPL process starts when ac power is connected to the system. The IPL process has the following phases: v Phase 1: Service Processor Initialization Phase 1 starts when ac power is connected to the system and ends when OK is displayed in the media subsystem operator panel. 8xxx checkpoints are displayed during this phase. Several 9xxx codes may also be displayed. Service processor menus are available at the end of this phase by striking any key on the console keyboard. v Phase 2: Hardware Initialization by the Service Processor Phase 2 starts when system power-on is initiated by pressing the power on button on the media subsystem operator panel. 9xxx checkpoints are displayed during this time. 91FF, the last checkpoint in this phase, indicates the transition to phase 3 is taking place. v Phase 3: System Firmware initialization On a full system partition, at phase 3, a system processor takes over control and continues initializing partition resources. During this phase, checkpoints in the form Exxx are displayed. E105, the last checkpoint in this phase, indicates that control is being passed to the operating system boot program. On a partitioned system, there is a global systemwide initialization phase 3, during which a system processor continues the initialization process. Checkpoints in this phase are of the form Exxx. This global phase 3 ends with a "LPAR..." on the operator panel. As a logical partition begins a partition-initialization phase 3, one of the system processors assigned to that partition continues initialization of resource assigned to that partition. Checkpoints in this phase are also of the form Exxx. This partition phase 3 ends with an E105 displayed on the partition’s virtual operator panel on the HMC, indicating control has been passed to that logical partition’s operating system boot program. For both the global and partition phase 3, location codes may also be displayed on the physical operator panel and the partition’s virtual terminal, respectively. v Phase 4: Operating System Boot When the operating system starts to boot, checkpoints in the form 0xxx and 2xxx are displayed. This phase ends when the operating system login prompt displays on the operating system console. 369 The following table outlines the IPL phases from power-on to the operating system login prompt, matching the phases to corresponding operator-panel checkpoints for a typical partition boot. Also listed are the approximate boot times for each phase, along with their dependencies for each phase. Note: The following table lists the major checkpoints only. Phase # Checkpoints on Operator Panel or HMC Time in Phase (minutes) Major Contributors to Time in Phase Phase 1 8xxx => OK Approx. 1 Phase 2 9xxx => 91FF 3 to 6 Number of I/O drawers Phase 3 Exxx => E105 1 to 3 Number of bootable adapters Phase 4 0xxx or 2xxx => blank 2 to 30+ Number of SSA drivesNumber of SCSI drivesNumber of async sessionsNumber of processorsNumber of adaptersAmount of memory 370 406/90 Service Guide Checkpoints enable users and service personnel to know what the server is doing, with some detail, as it initializes. These checkpoints are not intended to be error indicators, but in some cases a server could hang at one of the checkpoints without displaying an 8-character error code. It is for these hang conditions, only, that any action should be taken with respect to checkpoints. The most appropriate action is included with each checkpoint. Before taking actions listed with a checkpoint, check for additional symptoms in the service processor error log. See the “System Information Menu” on page 781 for information on how to access the service processor error log. Note: Go to “MAP 1540: Problem Isolation Procedures” on page 308 for any of the following checkpoint hang conditions: v A four-digit code in the range of 8xxx through EFFF not listed in the checkpoint tables. v A four-digit code is in the checkpoint tables, but does not contain a repair action or FRU listing. v All of the FRUs listed in the repair action have been replaced and the problem has not been corrected.Checkpoints listed in the following tables are in the format 8xxx, 9xxx, Axxx, Bxxx, or Exxx, where x is any hexadecimal digit from 1-9 or A-F. If your system hangs with a checkpoint displayed that begins with anything other than 8, 9, A, B, or E, go to the Diagnostic Information for Multiple Bus Systems. Use the following table to determine where to find the checkpoint. Symptom Action You have a code that is of the form 8xxx, 9xxx, Axxx, or Bxxx. Go to “Service Processor Checkpoints” on page 372. You have a code of the form Exxx. Go to “Firmware Checkpoints” on page 379. You have a four-character code that is not of the form 8xxx, 9xxx, Axxx, Bxxx, or Exxx. See the Diagnostic Information for Multiple Bus Systems. Chapter 4. Checkpoints 371 Service Processor Checkpoints Service processor checkpoints are in the range 8xxx to Bxxx. The message OK indicates successful service processor testing and initialization. Firmware checkpoints are listed in “Firmware Checkpoints” on page 379. Note: A spinning (or twirling) slash displays in the upper-right corner of the operator panel display while some checkpoints are being displayed. If the slash stops spinning, a hang condition is indicated. Checkpoint Description Action/ Possible Failing FRU 8000 Test of the service processor DRAM See note 1 on page 377. 8004 Verify base code checksum See note 1 on page 377. 8008 Verify base code mirror checksum See note 1 on page 377. 8009 Set NVRAM configuration in TITAN See note 1 on page 377. 800F Start base code See note 1 on page 377. 8010 Start supervisor in base code See note 1 on page 377. 8012 Initialize hardware interrupts See note 1 on page 377. 8014 Allocate and initialize area in DRAM See note 1 on page 377. 8016 Initialize debugger and build VPD See note 1 on page 377. 8018 Initialize service processor interfaces See note 1 on page 377. 801A Initialize external FLASH load See note 1 on page 377. 801B Initialize and test ISA hub chip See note 1 on page 377. 801C Initialize and test SUPER I/O chip See note 1 on page 377. 801F Initialize diskette, verify VDASD and start operational loader See note 1 on page 377. 809x Related to recovery mode system firmware update from diskette See note 3 on page 377. 8092 Diskette initialization during recovery mode system firmware update See note 3 on page 377. 8098 Firmware image being read during recovery mode system firmware update See note 3 on page 377. 809C Firmware image being written during recovery mode system firmware update See note 3 on page 377. 809F Diskette error during recovery mode system firmware update See note 3 on page 377. 8100 Start operational loader See note 1 on page 377. 81xx Load code into DRAM from VDASD (where xx = number of loads) See note 1 on page 377. 81FE Load service processor operating system See note 1 on page 377. 81FF Start service processor operating system See note 1 on page 377. 9000 Primary service processor task initializing See note 1 on page 377. 9009 System processor IPL has started See note 1 on page 377. 9010 Primary service processor thread waiting for response See note 1 on page 377. 9020 Service processor state set to pre-standby See note 1 on page 377. 9021 Service processor state set to standby See note 1 on page 377. 372 406/90 Service Guide Checkpoint Description Action/ Possible Failing FRU 9022 Service processor state set to IPL See note 1 on page 377. 9023 Service processor state set to runtime See note 1 on page 377. 9024 Service processor state set to terminate See note 1 on page 377. 9025 Service processor state set to DPO See note 1 on page 377. 9030 Waiting for secondary thread acknowledgement See note 1 on page 377. 9033 I/O base initialization See note 1 on page 377. 9034 I2C initialization See note 1 on page 377. 9035 ISA initialization See note 1 on page 377. 9036 Mailbox initialization See note1 on page 377. 9037 SLLA initialization (VPD) See note 1 on page 377. 9038 NVRAM initialization See note 1 on page 377. 9039 Serial port initialization See note 1 on page 377. 903A Serial I/O initialization See note 1 on page 377. 903B SPCN initialization See note 1 on page 377. 9051 End of reset/reload operation: service processor dump has completed See note 1 on page 377. 9055 Start menus thread See note 1 on page 377. 9060 Checking/configuring modem on serial port 0 See note 4 on page 377. 9061 Checking/configuring modem on serial port 1 See note 4 on page 377. 906A Create mailbox processing thread See note 1 on page 377. 906B Create menu interface thread See note 1 on page 377. 906C Create general thread See note 1 on page 377. 906D Create SPCN thread See note 1 on page 377. 9070 Create thread to control LEDs on I/O subsystems See note 1 on page 377. 9071 Initialize thread to control LEDs on I/O subsystems See note 1 on page 377. 9080 Start VPD collection program See note 1 on page 377. 9081 Start SC VPD data collection See note 1 on page 377. 9082 I2C/SPCN VPD data collection See note 1 on page 377. 9083 End SC VPD data collection See note 1 on page 377. 9084 End I2C/SPCN VPD data collection See note 1 on page 377. 90FD Waiting for IBIST to complete See note 1 on page 377. 9101 Setup load source See note 1 on page 377. 9102 Open load source See note 1 on page 377. 9103 Verify open status of load source See note 1 on page 377. 9104 Waiting for SPCN VPD collection to be completed See note 1 on page 377. 9105 Load IPL LIDs See note 1 on page 377. Chapter 4. Checkpoints 373 Checkpoint Description Action/ Possible Failing FRU 9106 Initialize MOPs See note 1 on page 377. 9107 System flush See note 1 on page 377. 9108 Processor scan interface BATs and LBIST See note 1 on page 377. 9109 ABIST See note 1 on page 377. 910A Non-processor scan interface BATs and LBIST See note 1 on page 377. 910B Wire test See note 1 on page 377. 910C Initialize main storage (system memory) See note 1 on page 377. 910D Configure processors See note 1 on page 377. 910E Configure and initialize I/O See note 1 on page 377. 910F Test and initialize main storage (system memory) See note 1 on page 377. 9110 Free IPL LIDs See note 1 on page 377. 9111 Load run-time LIDS See note 1 on page 377. 9112 Load dump LIDs See note 1 on page 377. 9113 Enable attention See note 1 on page 377. 9114 Enable function 22 See note 1 on page 377. 9115 Load system IPL program See note 1 on page 377. 9116 Load system data areas (NACA/PACA) See note 1 on page 377. 9117 Start system IPL program See note 1 on page 377. 9119 Waiting for VPD collection to complete See note 1 on page 377. 911E Message passing waiting See note 1 on page 377. 911F Message passing starting See note 1 on page 377. 9184 Firmware image being loaded into service processor’s control storage during firmware update. 1. Remove, then reapply power to the system. If the service processor enters recovery mode (indicated by A1FD 0000 in the display), attempt to update the firmware again in recovery mode. If the service processor comes up to the OK prompt, try again to update the firmware using the service processor menus. 2. Call service support. 91C4 Flash update pending, either a firmware update or hypervisor dump. This checkpoint may be displayed during the firmware update process. The checkpoint may be displayed for a long period of time during the process. If process does not complete, the system may be hung. See note 1 on page 377. 91FF Control being handed to system processor from service processor See note 1 on page 377. 9200 Scan interface BATs See note 1 on page 377. 9300 Logic BIST See note 1 on page 377. 9301 Scanned ABIST See note 1 on page 377. 374 406/90 Service Guide Checkpoint Description Action/ Possible Failing FRU 9302 Dedicated ABIST See note 1 on page 377. 9303 Dedicated ABIST array fuse repair calculation See note 1 on page 377. 9380 Built-in-self-test (BIST) See note 2 on page 377. 9400 Service processor is requesting system flush See note 1 on page 377. 9410 Service processor is issuing request to start instruction execution See note 1 on page 377. 9411 Service processor is issuing request to stop instruction execution See note 1 on page 377. 9420 Service processor is issuing request to start system clocks See note 1 on page 377. 9421 Service processor is issuing request to stop system clocks See note 1 on page 377. 94B0 Wire test faulty driver status refcode See note 2 on page 377. 94B1 Wire test shorted net status refcode See note 2 on page 377. 94B2 Wire test elastic interface test See note 2 on page 377. 94BB Wire test starting See note 2 on page 377. 9501 IPL diagnostic initialization See note 2 on page 377. 9502 IPL diagnostic L3 cache march test See note 2 on page 377. 9503 IPL diagnostic L3 connections test See note 2 on page 377. 9504 IPL diagnostic L2 cache march test See note 2 on page 377. 9505 IPL diagnostic RIO wrap test See note 2 on page 377. 9506 IPL diagnostic cleanup See note 2 on page 377. 9507 IPL diagnostic test See note 2 on page 377. 9508 IPL diagnostic main storage march test See note 2 on page 377. 9509 IPL diagnostic main storage connections test See note 2 on page 377. 950A IPL diagnostic elastic interface slack test See note 2 on page 377. 950B IPL diagnostic fast initialization to zeros See note 2 on page 377. 96C2 IPL MOPs processor configuration See note 2 on page 377. 96C3 IPL MOPs main storage size See note 2 on page 377. 96C4 IPL MOPs main storage configuration See note 2 on page 377. 96C6 IPL MOPs I/O configuration See note 2 on page 377. 96C7 IPL MOPs: enable chip machine checks See note 2 on page 377. 96E1 Initialize run-time PRD objects and memory See note 2 on page 377. 96E2 Run-time initialization: enable attention handling See note 2 on page 377. 96EE Firmware update module corrupted See note 3 on page 377. Chapter 4. Checkpoints 375 Checkpoint Description Action/ Possible Failing FRU 99FD Service processor receiving firmware update module See note 3 on page 377.Attention: This checkpoint may stay in the operator panel for some period of time during a firmware update or hypervisor dump with no indication of activity. Do not assume that the service processor is hung unless this checkpoint has been in the operator panel for at least 15 minutes. with no indication of activity.Attention: Do not remove power from the system, or reset the service processor, during a firmware update or hypervisor dump. 99FF Service processor writing firmware update module See note 3 on page 377.Attention: This checkpoint may stay in the operator panel for some period of time during a firmware update or hypervisor dump with no indication of activity. Do not assume that the service processor is hung unless this checkpoint has been in the operator panel for at least 15 minutes. with no indication of activity.Attention: Do not remove power from the system, or reset the service processor, during a firmware update or hypervisor dump. A800 Start Service Processor dump process See note 1 on page 377. A801 Start dump to NVRAM See note 1 on page 377. A802 Start dump to debug port See note 1 on page 377. A803 NVRAM not usable See note 1 on page 377. A804 NVRAM dump done See note 1 on page 377. A805 Start dump to flash See note 1 on page 377. A806 Flash dump area done See note 1 on page 377. A807 Flash area not usable See note 1 on page 377. A808 Flash error log to debug port See note 1 on page 377. A809 Flash erase start See note 1 on page 377. A80A Flash erase end See note 1 on page 377. A80B Reserved; not used See note 1 on page 377. A80C Reserved; not used See note 1 on page 377. A80D Reserved; not used See note 1 on page 377. A80E Reserved; not used See note 1 on page 377. A80F Service processor dump done See note 1 on page 377. 376 406/90 Service Guide Checkpoint Description Action/ Possible Failing FRU A810-A8FF Scan log dump in progress Informational message. (See “Scan Log Dump Policy” on page 776.) The last two characters of the checkpoints will change as the scan log dump progresses. If the last two characters do not change after five minutes, the service processor is hung and must be reset. Follow the procedure outlined in Note 1 following this table. When the scan log dump is complete, depending on how the reboot policy is set, the system will either: v Go to the standby state (and the service processor menus will be available), indicated by OK or STBY in the operator panel, ORv Attempt to reboot. B0F5 Delayed power-off sequence has begun See note 1. B0FA Delayed power-off acknowledged See note 1. B0FF Power-off requested program executing See note 1. Notes: 1. If the system fails to progress after this checkpoint is put in the display, do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel. b. If the hang repeats, check with service support to see if there is a firmware update that fixes the problem. c. If not, or the update does not fix the problem, replace the primary I/O book, location: U1.18-P1-H2. d. If this procedure does not fix the problem, call service support.2. If the system fails to progress after this checkpoint is put in the operator display, do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel. b. Reboot the system in slow mode. (Slow/fast IPL is set using a service processor menu.) If the hang repeats, check with service support to see if there is a system firmware update that fixes this problem. c. If not, or the update does not fix the problem, call service support.3. Recovery mode flash update has failed. Do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel and try the recovery mode flash update again. b. If the system hangs again, make a new set of firmware update diskettes, using new, blank diskettes. Retry the flash update with the new diskettes. c. If the same hang occurs with the new diskettes, replace the following FRUs one at a time in this order: 1) Diskette drive 2) Primary I/O book, location: U1.18-P1-H2.d. If this procedure does not fix the problem, call service support.4. The service processor was unable to configure the modem(s) attached to the serial port(s) on the primary I/O book (a problem with the modem attached to the HMC is not indicated). Do the following: a. Check the modem, modem cable, and serial port for loose or unconnected cables. Reset the service processor using the pinhole reset switch on the system’s operator panel. Chapter 4. Checkpoints 377 b. If the hang repeats, check with service support to see if there is a firmware update that fixes the problem. c. If there is not, or the update does not fix the problem, replace the following FRUs, one at a time, in the following order: 1) Modem cable 2) Modemd. If this procedure does not fix the problem, call service support. 378 406/90 Service Guide Firmware Checkpoints Firmware uses checkpoints (progress codes) in the range of Exxx to EFFF. These checkpoints occur during system startup and can be useful in diagnosing certain problems. Service processor checkpoints are listed in “Service Processor Checkpoints” on page 372. If you have a checkpoint with no location code associated with it, see “Determining Location Code” on page 415. If a checkpoint has a location code associated with a FRU replacement, see “AIX and Physical Location Code Reference Tables” on page 68 for physical location information. If you replace FRUs and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless otherwise indicated in the tables. Note: If you receive a four-digit code in the range of E1xx to EFFF that is not listed in the following table, go to “MAP 1542: I/O Problem Isolation” on page 318. Table 2. Firmware Checkpoints Checkpoint Description Action/ Possible Failing FRU E101 Create RTAS node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E102 Load/Init RTAS Go to “MAP 1540: Problem Isolation Procedures” on page 308. E105 Transfer control to Operating System (normal boot). See “Boot Problems” on page 409. E10A Load RTAS device tree Go to “MAP 1540: Problem Isolation Procedures” on page 308. E10B Set RTAS device properties Go to “MAP 1540: Problem Isolation Procedures” on page 308. E111 GOOD CRC - jump to composite image Go to “MAP 1540: Problem Isolation Procedures” on page 308. E120 Initialize I/O and early memory block Go to “MAP 1540: Problem Isolation Procedures” on page 308. E121 Initialize service processor Go to “MAP 1540: Problem Isolation Procedures” on page 308. E122 RTAS sensor setup complete Go to “MAP 1540: Problem Isolation Procedures” on page 308. E130 Build device tree Go to “MAP 1540: Problem Isolation Procedures” on page 308. E131 Build device tree Go to “MAP 1540: Problem Isolation Procedures” on page 308. E132 Build device tree, just before setting up root node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E133 Build device tree, just before setting up CPUs Go to “MAP 1540: Problem Isolation Procedures” on page 308. E134 Create memory node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E135 Configuring memory nodes Go to “MAP 1540: Problem Isolation Procedures” on page 308. E138 Create options node Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 4. Checkpoints 379 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E139 Create node aliases and system aliases Go to “MAP 1540: Problem Isolation Procedures” on page 308. E13A Create packages node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E140 Operating system load See “Boot Problems” on page 409. E145 Early debug code activated; firmware is halted Press the g key on the serial interface. E149 Create boot mgr node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E14C Create terminal-emulator node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E14D Loading boot image See “Boot Problems” on page 409. E150 Create host (primary) PCI controller node If a location code is specified, replace the adapter at that location code. If no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318. E151 Probing primary PCI bus If a location code is specified, replace the adapter at that location code. If replacing the adapter does not resolve the problem, replace the I/O backplane in which the adapter is plugged. If no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318. E152 Probing for adapter FCODE, evaluate if present Follow the repair action listed for checkpoint E151. E153 End adapter FCODE, probe/evaluate Follow the repair action listed for checkpoint E151. E154 Create PCI bridge node Follow the repair action listed for checkpoint E151. E155 Probing PCI bridge secondary bus Follow the repair action listed for checkpoint E151. E156 Create PCI Ethernet node If a location code is specified, replace the adapter at that location code. If no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318. E158 System firmware waiting for virtual terminal to be opened on HMC for SMS menu to be displayed Open a virtual terminal window on the HMC. E15B Transferring control to operating system (service mode boot) See “Boot Problems” on page 409. E15F Adapter VPD probe Follow the repair action listed for checkpoint E151. E170 Start of PCI Bus Probe Follow the repair action listed for checkpoint E151. E172 First pass of PCI device probe Follow the repair action listed for checkpoint E151. 380 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E174 Establishing host connection If the system is not connected to an active network, or if the target server is inaccessible (which can result from incorrect IP parameters being specified), the system still attempts to boot and because time-out durations are necessarily long to accommodate retries, the system may appear to hang. Restart the system and get to the SMS utilities menu. In the multi-boot menu, verify: v Is the intended boot device correctly specified in the boot list? v Are the IP parameters correct? v Look at the server configuration for this client. v Attempt to ping the target server using the ping utility in SMS. E175 BootP request Follow the repair actions for checkpoint E174. E176 TFTP file transfer Follow the repair actions for checkpoint E174. E177 Transfer failure due to TFTP error condition See “Boot Problems” on page 409. E17B Processor clock speed measurement Call service support. E183 Service Processor POST 1. Replace the service processor, location: U1.18-P1-H2. 2. Go to “MAP 1541: JTAG Problem Isolation” on page 310. E18D A system reset request has been sent to the service processor 1. Check for system firmware updates. 2. Call service support. E191 Create ISA reserved node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E199 Evaluating the RIO interface speed set by the user in the service processor menu 1. Check for system firmware updates. Apply if available. 2. Call service support. E19A NVRAM auto-boot? variable not found - assume FALSE Go to “MAP 1540: Problem Isolation Procedures” on page 308. E19B NVRAM menu? variable not found - assume FALSE Go to “MAP 1540: Problem Isolation Procedures” on page 308. E19D Create NVRAM node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E19E Real-time clock (RTC) initialization Refer to error code 28030xxx in “Firmware/POST Error Codes” on page 488. E1A0 User requested boot to SMS menus via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 4. Checkpoints 381 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E1A1 User requested boot to open firmware prompt via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A2 User requested boot using default service mode boot list via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A3 User requested boot using customized service mode boot list via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A4 User requested boot to SMS menus via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A5 User requested boot to open firmware prompt via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A6 User requested boot using default service mode boot list via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A7 User requested boot using customized service mode boot list via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A8 System booting to open firmware prompt Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1A9 System booting to SMS menus Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1AA System NVRAM settings during boot Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1AB System booting using default service mode boot list Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1AC System booting using customized service mode boot list Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1AD System booting to operating system Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1AE System booted to multiboot menu via NVRAM settings 1. This checkpoint is posted on the operator panel when system firmware has entered the boot devices menu in SMS because the multiboot flag was turned on. The firmware is waiting for user input. If the firmware console is not open, the boot devices menu will not be seen, and the system will appear to be hung even though it is not. System firmware will only progress past this point when the required input is provided. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1AF System booted to utilities menu via NVRAM settings Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1B1 Create serial node Go to “MAP 1540: Problem Isolation Procedures” on page 308. 382 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E1B2 XOFF received - waiting for XON 1. The system firmware is waiting for a virtual terminal to be opened on the HMC. Open a virtual terminal. 2. If a virtual terminal is open, the user might have entered a CTRL-S key sequence to stop the scrolling of data off of the screen. If this is the case, enter a CTRL-Q key sequence to resume scrolling. 3. Check the serial connection between the HMC and the managed system. 4. Reboot the HMC. 5. There may be a hardware problem with the HMC. Refer to the Hardware Management Console Installation and Operations Guide. 6. There may be a hardware problem with the service processor in the managed system. Check the service action event log in Service Focal Point for error codes that indicate a problem with the serial ports on the service processor. The service processor is in the primary I/O book at the U1.18-P1-H2 location. Take the appropriate actions based on the error codes that are found. E1B3 XON received This checkpoint will flash by too quickly to be seen on the operator panel. It may be recorded in the progress indicators log, which is accessed using the service processor menus. If a partition hangs on this checkpoint, complete the following: 1. Check for platform firmware updates. Apply if available. 2. Call service support. E1B6 Probe for (ISA) keyboard Replace the primary I/O book, Location: U1.18-P1-H2. E1BD Probe for (ISA) mouse Replace the primary I/O book, Location: U1.18-P1-H2. E1C4 Build ISA timer chip node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1C5 Create ISA interrupt controller (pic) node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1C6 Create DMA node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1D0 Create PCI SCSI node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1D3 Create SCSI block device node (SD) Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 4. Checkpoints 383 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E1D4 Create SCSI byte device node (ST) Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1DB Create floppy controller (FDC) node Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1DC Dynamic console selection. 1. If a console is attached, but nothing is displayed on it, go to ?All display problems? in the Entry MAP. 2. If selection screens can be seen on the terminals and the appropriate key on the input device is pressed within 60 seconds but there is no response to the keystroke: a. If selecting the console with a keyboard attached to a USB card, replace the keyboard, then replace the USB card. b. If selecting the console with an ASCII terminal, suspect the terminal. Use the problem determination procedures for the terminal.Note: Terminal setting should be: v 9600 baud v no parity v 8 data bits v 1 stop bit E1F0 Start O.B.E. Go to “MAP 1540: Problem Isolation Procedures” on page 308. E1F1 Begin selftest sequence on boot devices. Go to “MAP 1542: I/O Problem Isolation” on page 318. E1F2 Power-on password prompt The prompt should be visible on the firmware console (usually a VTERM on the HMC). If a console is attached but nothing is displayed on it, go to the “Entry MAP” on page 149 with the symptom ?All display problems.? E1F3 Privileged-access password prompt The prompt should be visible on the firmware console (usually a VTERM on the HMC). If a console is attached but nothing is displayed on it, go to the “Entry MAP” on page 149 with the symptom ?All display problems.? E1F4 End self-test sequence on boot devices; begin SMS. Go to “MAP 1542: I/O Problem Isolation” on page 318. E1F5 Build boot device list Go to “MAP 1542: I/O Problem Isolation” on page 318. 384 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E1F6 Determine boot device sequence Go to “MAP 1542: I/O Problem Isolation” on page 318. E1F7 No boot image located Go to “Boot Problems” on page 409. E1F8 Partition (system firmware) is waiting for a virtual terminal to be opened on HMC. Open a virtual terminal for the partition on the HMC. If the partition hangs at this checkpoint after a virtual terminal is opened on the HMC: 1. Replace the service processor/RIO book, location: U1.18-P1-H2 2. Call service support. E1FA Building boot device list for SCSI adapters. (The device ID and the device LUN being scanned code are also displayed.) 1. If a location code is available, follow the actions listed for error code 21A0 00xx. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E1FB Scan SCSI bus for attached devices 1. If a location code is available, follow the Repair actions listed for error code 21A0 00XX. 2. If no location code is available, go to “MAP 1542: I/O Problem Isolation” on page 318. E1FC Building boot device list for SSA adapters. The location code of the SSA adapter being scanned is also displayed. Go to “MAP 1542: I/O Problem Isolation” on page 318. E1FD The operator panel alternates between the code E1FD and another Exxx code, where Exxx is the point at which the error occurred. 1. If a location code is available, follow the actions listed for error code 21A0 00xx. 2. If no location code is available, go to “MAP 1542: I/O Problem Isolation” on page 318. E1FE Building boot device list for fibre-channel adapters. (The WWPN of the SAN adapter being scanned is also displayed.) Go to “MAP 1542: I/O Problem Isolation” on page 318. E1FF Building boot device list for fibre-channel adapters. (The WWPN of the SAN adapter being scanned is also displayed.) Go to “MAP 1542: I/O Problem Isolation” on page 318. E20F System firmware has exited to open firmware prompt. 1. Replace the primary I/O book, Location: U1.18-P1-H2. 2. Call service support. E440 Validate NVRAM, initialize partitions as needed 1. Check for system firmware updates. 2. Replace the primary I/O book, Location: U1.18-P1-H2. 3. Call service support. E441 Generate /options node NVRAM configuration variable properties 1. Check for system firmware updates. 2. Replace the primary I/O book, Location: U1.18-P1-H2. 3. Call service support. E442 Validate NVRAM partitions 1. Check for system firmware updates. 2. Replace the primary I/O book, Location: U1.18-P1-H2. 3. Call service support. Chapter 4. Checkpoints 385 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E443 Generate NVRAM configuration variable dictionary words Suspect a system firmware problem if problem persists. If a system firmware update is available, install it. E500 Begin I/O drawer configuration 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E501 Initialize I/O data structures 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E502 Set up I/O data structure pointers 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E503 Initialize location code data structures 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E504 Initialize memory map data structures 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E505 Enable the time base for all RIO hubs 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E506 Reset time base of processors 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E507 I/O hub assignment 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E508 Begin initializing I/O hubs 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E509 Begin configuring EADS chips in I/O subsystems 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E50A Check for missing I/O interface chips 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E50B Save I/O hub information 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 386 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E50C Copy all of the memory ranges 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E50D Copy all of the I/O data 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E50E Copy all of the I/O hub information 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E50F Clean up the error registers 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E51F End of I/O configuration 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E520 Set up the I/O architecture registers 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E521 Set up interrupts for the current I/O hub 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E522 Set up miscellaneous registers for the current I/O hub 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E523 Initialize data structures and enable all I/O hub ports 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E524 Determine I/O configuration of current I/O hub 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter 4. Checkpoints 387 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E525 Set up the ETE of the current I/O hub 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E526 Reassign the I/O subsystems to the nearest I/O hub 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. E527 Enable the memory map and I/O map for the current I/O hub port 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. E528 Begin search for the next available I/O drawer from a current RIO port 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. 388 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E529 Enable the control port and driver of the current RIO hub 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. E52A Search for the next available I/O drawer from a current RIO port 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. Chapter 4. Checkpoints 389 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E52B The RIO cable connected to the RIO port has been detected 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O book specifed by the location code, or the I./O book that the RIO cables in step 1 are attached to. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. E52F End of configuration of the RIO hub 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E530 Checking I/O hub ports on current I/O hub 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E531 Get I/O hub node ID 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. E532 I/O cable detected; enable the I/O hub port 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. 390 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E533 Set up the I/O hub routing table 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. E534 Get the node ID of the node on the other end of the I/O cable 1. This error code may indicate that the I/O (RIO) cables are not properly connected to the I/O subsystems. Check the RIO cabling; refer to “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. 2. Replace the RIO cable specified by the location code. 3. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 5. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 6. Go to “MAP 1542: I/O Problem Isolation” on page 318. E535 Valid node ID detected at other end of the I/O cable 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. E536 I/O subsystem detected at other end of the I/O cable 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter 4. Checkpoints 391 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E537 Configuring the I/O subsystem 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. E538 Detected the I/O hub at other end of the I/O cable 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E539 Enabling the I/O hub ports 1. Check for system firmware updates. 2. Replace the I/O book specified by the location code. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. E53A Set up the RIO port speed of the exit port of the I/O subsystem connected to the RIO hub port 1. Replace the RIO cable specified by the location code. 2. Replace the I/O book that the failing drawer is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the RIO cable in step 1 is attached to. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. E53B End of looking for next node port 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E53C End of looking for next RIO hub port 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E53D Disable the anynode mode 1. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 2. Replace the I/O subsytem backplane that the RIO cable specified by the location code is attached to. 392 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E540 Begin initialization of I/O subsystem 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E541 Initialize the routing table 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 4. Go to “MAP 1542: I/O Problem Isolation” on page 318. E544 Enable the I/O ports 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E545 Set up the device routing address for this new node ID 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E546 Set up the I/O interface chip new node ID 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E547 Set up the I/O interface chip routing table 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter 4. Checkpoints 393 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E548 Disable any node mode 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E549 Adjust I/O hub table 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E54A Configure the I/O drawer 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. E54B Update the I/O drawer routing table of RIO ports 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 394 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E550 Begin to initialize I/O interface chip 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E551 Set up the I/O speed 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E552 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E553 Set up the I/O interface chip base address 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E554 Set up more I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E555 Set up the I/O interface chip I/O link 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter 4. Checkpoints 395 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E556 Set up and initialize I/O interface chips under current I/O interface chip 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E55F End of I/O drawer initialization 1. Replace the I/O cable specified by the location code. 2. Replace the I/O book that the failing I/O subsystem is attached to, U1.18-P1-Hx. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E560 Begin initializing the I/O subsystem 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E561 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E562 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E563 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 396 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E564 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E565 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E566 Enable device arbitration on the I/O interface chip 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E567 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E568 Set up the external interrupt vector register 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E569 Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter 4. Checkpoints 397 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E56A Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E56B Set up I/O interface chip registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E56C Set up I/O interface chip PCI bus 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E56D Identify devices on current bus 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E56E Begin initializing the service processor control chip 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E56F End initializing the service processor control chip 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 398 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E570 Initialize the PCI-ISA bridge chip 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E571 Initialize the integrated Ethernet (if present) 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E572 Initialize the integrated SCSI 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E57F End of I/O interface chip initialization 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E580 Begin initializing the EADS module 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E581 Read the PCI bus configuration header 1. This error code may indicate that the I/O (RIO) cables are not properly connected to the I/O subsystems. Check the RIO cabling; refer to “I/O Book Connector Locations” on page 19and “Subsystem Positioning and Cabling” on page 36. 2. Replace the RIO cable specified by the location code. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 4. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. Chapter 4. Checkpoints 399 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E582 EADS has been detected 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E583 Running BIST on the current EADS 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E584 Checking function of current EADS 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E585 Valid function of current EADS detected 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E586 Set up EADS function 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E587 Set up EADS BIST 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 400 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E588 Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E589 Set up EADS function 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E58A Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E58B Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E58C Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E58D Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. Chapter 4. Checkpoints 401 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E58E Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E58F Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E590 Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E591 Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E592 Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E593 Set up EADS function registers 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. 402 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E594 Begin probing slots 1. Replace the I/O cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem backplane that the I/O cable in step 1 is attached to. E595 Detected valid adapter in slot 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. E596 Read the EADS revision ID and class code 1. Replace the RIO cable specified by the location code. 2. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. 3. Replace the I/O subsystem planar to which the RIO cable in Step 1 is attached. E59E End of walking bus of current EADS 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. E59F End of EADS initialization 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. If the location code associated with this checkpoint specifies a PCI adapter card, replace the adapter card. E5E0 Initialize system processor array 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5E1 Reserved E5E2 Initialization of the hypervisor, phase 1 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5E3 Not in quick-restart: load and set up open firmware code in memory 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter 4. Checkpoints 403 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E5E4 Activate slave processors 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5E5 Initialize the hypervisor, phase 2 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5E6 Set L2 and processor status 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5E7 Save I/O hub information for RTAS and open firmware 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5E8 Set up registers 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5E9 Reset the quick restart path 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5EE Hang state in main control code 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. E5EF End of low-level firmware 1. Replace the I/O drawer subsystem backplane specified by the location code. 2. Power down, then power up the system using the UEPO switch. 3. Call service support. E5FE Get the system’s licensed memory quantity from the service processor 1. Disconnect and then reconnect AC power to the system using the UEPO switch. 2. Verify that the CUoD capacity card is installed and properly seated. 3. Call service support. E600 SSA PCI adapter open firmware has run successfully. Replace the adapter. E601 SSA PCI adapter BIST has started but failed to complete after 4 seconds. Replace the adapter. E602 SSA PCI open firmware has started. Replace the adapter. E603 SSA PCI adapter BIST has completed with an error. Replace the adapter. E604 SSA PCI adapter BIST and subsequent POSTs have completed successfully. Replace the adapter. E605 SSA PCI adapter BIST has completed successfully but subsequent POSTS have failed. Replace the adapter. 404 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E60E SSA PCI open firmware about to exit (no stack corruption). Replace the adapter. E60F SSA PCI open firmware has run unsuccessfully. Replace the adapter. E6FF SSA PCI open firmware about to exit (with stack corruption). Replace the adapter. E700 Create system firmware VPD System firmware may be corrupted. Reinstall system firmware. E701 Create memory VPD. 1. Check for system firmware updates. 2. Call for support. E702 Create processor card VPD. 1. Check for system firmware updates. 2. Call for support. E703 Create operator panel VPD. 1. Check for system firmware updates. 2. Call for support. E704 Create processor subsystem VPD. 1. Check for system firmware updates. 2. Call for support. E705 Create clock card VPD. 1. Check for system firmware updates. 2. Call for support. E706 Create RIO extender card VPD. 1. Check for system firmware updates. 2. Call for support. E707 Create system VPD. 1. Check for system firmware updates. 2. Call for support. E708 Create RIO to PCI adapter VPD. Go to “MAP 1542: I/O Problem Isolation” on page 318. E709 Create VPD via SPCN. 1. Check for system firmware updates. 2. Call service support. E70A Create service processor VPD. 1. Check for system firmware updates. 2. Call for support. E70B Create L3 module VPD 1. Check for system firmware updates. 2. Call service support. E70C Create I/O backplane VPD (I2C problem) Go to “MAP 1542: I/O Problem Isolation” on page 318. E70D Create capacitor book VPD 1. Check for system firmware updates. 2. Call service support. E70F Create CEC VPD 1. Check for system firmware updates. 2. Call service support. E7EE Delaying while waiting for power subsystem VPD collection to be completed; informational only 1. Check for system firmware updates. 2. Call service support. Chapter 4. Checkpoints 405 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E7FF Successful end of VPD creation Informational message. If the system hangs here, go to “MAP 1541: JTAG Problem Isolation” on page 310. E800 RTAS initialization problem 1. Check for system firmware updates. 2. Call service support. E841 Initializing RTAS/open firmware device tree interface 1. Check for system firmware updates. 2. Call service support. E842 Initializing RTAS/service processor interface 1. Check for system firmware updates. 2. Replace the primary I/O bookLocation: U1.18-P1-H2. 3. Call service support. E843 Initializing RTAS/sensor interface 1. Check for system firmware updates. 2. Call service support. E844 Initializing RTAS/time-of-day clock interface 1. Check for system firmware updates. 2. Replace the primary I/O bookLocation: U1.18-P1-H2. 3. Call service support. E845 Initializing interface/sensor access 1. Check for system firmware updates. 2. Call service support. E846 Initializing interface/sensors/led_table access 1. Check for system firmware updates. 2. Call service support. E84F Initializing RTAS/operator panel interface 1. Check for system firmware updates. 2. Call service support. E850 Initializing RTAS/cache interface 1. Check for system firmware updates. 2. Call service support. E851 Initializing RTAS/hardware access 1. Check for system firmware updates. 2. Call service support. E852 Initializing RTAS/PCI bus interface 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. Call service support. E853 Start cable ID request process 1. Check the SPCN and RIO cabling to the I/O subsystems 2. Disconnect then reconnect power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. Call service support. 406 406/90 Service Guide Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E854 Start collecting cable IDs 1. Check the SPCN and RIO cabling to the I/O subsystems 2. Disconnect then reconnect power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. Call service support. E855 Start setting RIO link speed 1. Check the SPCN and RIO cabling to the I/O subsystems 2. Disconnect then reconnect power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 318. 4. Call service support. E860 Initializing RTAS/power subsystem interface 1. Check for system firmware updates. 2. Call service support. E861 Initializing RTAS/PCI bus interface 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318 3. Call service support. E870 Initializing RTAS/logs interface 1. Check for system firmware updates. 2. Call service support. E871 Initializing RTAS/EEH handling interface 1. Check for system firmware updates. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 3. Call service support. E872 Initializing RTAS/error inject interface 1. Check for system firmware updates. 2. Call service support. E873 Initializing RTAS/error handling interface 1. Check for system firmware updates. 2. Call service support. E880 Initializing RTAS/debug interface 1. Check for system firmware updates. 2. Call service support. E881 Initializing RTAS/utility interface 1. Check for system firmware updates. 2. Call service support. E882 Initializing RTAS/softpatch register interface 1. Check for system firmware updates. 2. Call service support. E890 Initializing RTAS/hot-plug interface 1. Check for system firmware updates. 2. Call service support. E8C1 Initializing interface/client memory access 1. Check for system firmware updates. 2. Call service support. Chapter 4. Checkpoints 407 Table 2. Firmware Checkpoints (continued) Checkpoint Description Action/ Possible Failing FRU E8D1 Initializing interface/special memory access 1. Check for system firmware updates. 2. Call service support. E900 The system firmware is in the recoverable non-maskable interrupt code Go to “MAP 1540: Problem Isolation Procedures” on page 308. E901 The system firmware is in the recoverable non-maskable interrupt code Go to “MAP 1540: Problem Isolation Procedures” on page 308. EAA1 Probe PCI-PCI bridge bus Follow the repair actions for checkpoint E151. 408 406/90 Service Guide Boot Problems Attention: If the system is running partitions from partition standby (LPAR), the following procedure addresses the problem in which one partition will not boot while other partitions boot successfully and run the operating system successfully. This procedure also addresses the problem in which a system does not boot to a full system partition. All of the system’s resources are available to the diagnostics; the same general procedure applies, with the exception of moving resources between partitions. Attention: It is the customer’s responsibility to move devices between partitions. If a device must be moved to another partition to run standalone diagnostics, contact the customer or system administrator. (If the CD-ROM must be moved to another partition, all SCSI devices connected to that SCSI adapter must be moved because moves are done at the slot level, not at the device level.) Both partitions must be rebooted if a device is moved between partitions. Depending on the boot device, a checkpoint may be displayed on the operator panel for an extended period of time while the boot image is retrieved from the device. This is particularly true for tape and network boot attempts. If the checkpoint/code is displayed for an extended time, there may be a problem loading the boot image from the device. If booting from CD-ROM or tape, watch for activity on the drive’s LED indicator. A blinking LED indicates that the loading of either the boot image, or additional information required by the operating system being booted, is still being loaded. For network boot attempts, refer to checkpoint E174. If the checkpoint is displayed for an extended time, there may be a problem with the integrity of the boot image. This procedure assumes that a CD-ROM drive is connected to the system, or that the diagnostics can be run from a NIM (Network Installation Management) server. 1. Look at the service action event-error log in the Service Focal Point on the HMC (see “Service Focal Point” on page 4 for instructions on accessing the service action event log). If there are open entries that affect devices in the boot path of the partition, perform the actions to correct those errors. If there are open entries that indicate problems with I/O cabling, perform the actions to correct those problems. Try to reboot the partition. If the partition hangs with a 4-character checkpoint in the display, the partition must be deactivated, then reactivated before attempting to reboot. If a 20EE xxxx error code is reported, the partition is already deactivated and in the error state. Reboot by activating the partition. If the system is running in full system partition, the power must be cycled before the system can be rebooted. If the reboot is still not successful, continue to step 2. 2. Boot to the SMS main menu as follows: v If you are rebooting a partition from partition standby (LPAR), go to the properties of the partition and select Boot to SMS, then activate the partition. v If you are rebooting the system in full system partition, start the service processor menus, select option 2, System Power Control Menu, then select option 7, Boot Mode Menu, then select option 1, Boot to SMS menu. Enter 98 to return to the system power control menu, then select option 4, Power-on System. Enter y to confirm.At the SMS main menu, select Multiboot. a. Check to see if the intended boot device is correctly specified in the boot list. If it is in the boot list: 1) Remove all removable media from devices in the boot list from which you do not want to boot. 2) If attempting to boot from a network, go to step 3 on page 410. If attempting to boot from a disk drive or CD-ROM, go to step 4 on page 410. Chapter 4. Checkpoints 409 b. If the intended boot device is not correctly listed in the boot sequence, add it to the boot sequence using the SMS menus. If the device can be added to the boot sequence, reboot the partition. If the intended boot device cannot be added to the boot list, ask the customer or system administrator to verify that the device you are trying to boot from is assigned to the correct partition. If it is, go to step 4.3. If attempting to boot from the network: a. Verify that the IP parameters are correct. b. Attempt to ?ping? the target server using the SMS ping utility. If the ping is not successful, have the network administrator verify the server configuration for this client. c. Check with the network administrator to ensure that the network is up. Also ask the network administrator to verify the settings on the server from which you are trying to boot. d. Check the network cabling to the adapter. e. Restart the partition and retry the boot operation.4. Try to boot and run standalone diagnostics against the devices in the partition, particularly against the intended boot device. Standalone diagnostics can be run from CD-ROM or a NIM server. To boot standalone diagnostics, follow the detailed procedures in “Loading the Standalone Diagnostics from CD-ROM Using an HMC” on page 764 or “Running Standalone Diagnostics from a Network Installation Management (NIM) Server” on page 766. Attention: Standalone diagnostics may be run on systems configured for either full system partition or partition standby. When attempting to boot diagnostics on a partition from partition standby, the device from which you are booting standalone diagnostics must be made available to the partition that is not able to boot, if it is not already in that partition. Contact the customer or system administrator if a device must be moved between partitions in order to boot standalone diagnostics. If devices are moved between partitions, both partitions must be rebooted. For more information, see “Standalone Diagnostic Operation” on page 764. If standalone diagnostics boot successfully: a. Go to the Task Selection menu, and select Display Configuration and Resource List. If the intended boot device is not listed, ask the customer or system administrator to verify that it is assigned to the correct partition. If it is, and does not appear in the NEW RESOURCE list, go to ?MAP 0290: Missing Resource Problem Resolution? in the Diagnostic Information for Multiple Bus Systems. b. Run diagnostics on the device you want to boot from. If an SRN, not an 8-digit error code, is reported, go to the Diagnostic Information for Multiple Bus Systems. c. If the diagnostics are successful, it may be necessary to perform an operating system-specific recovery process, or reinstall the operating system.If standalone diagnostics do not boot successfully from CD-ROM: a. Verify that the CD-ROM drive is assigned to the correct partition. b. Check the SCSI cabling to the CD-ROM drive. Check the power connector on the CD-ROM. c. It is possible that another SCSI device on the same bus is causing the problem. Do the following: 1) Disconnect the power and signal cables from all other SCSI devices that are attached to the SCSI adapter to which the CD-ROM is attached. 2) Try to boot standalone diagnostics again. 3) If you are still unable to load standalone diagnostics, go to step 5 on page 411. 4) If standalone diagnostics load, one of the disconnected devices is causing the problem. Reconnect the devices one at a time and retry the boot operation until the problem recurs, and replace the device that caused the problem. Go to ?MAP 410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems.If standalone diagnostics do not boot successfully from a NIM server: a. Verify that the network adapter is assigned to the correct partition. 410 406/90 Service Guide b. Check the network settings and flag settings, as detailed in “Running Standalone Diagnostics from a Network Installation Management (NIM) Server” on page 766. c. Follow the procedure outlined in step 3 on page 410.5. Go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter 4. Checkpoints 411 412 406/90 Service Guide Chapter 5. Error Code to FRU Index The Error Code to FRU Index lists fault symptoms and possible causes. The most likely cause is listed first. Use this index to help you decide which FRUs to replace when servicing the system. Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise, call service support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Performing Slow Boot On this system, the error code displayed in the operator panel or in the service action event (SAE) log may not indicate the root cause of the problem. To fully analyze all of the available information, perform the following steps before doing a hardware repair or replacement: 1. Record the 8-character error code (and location code if present) in operator panel. 2. Do a slow-mode boot into the SMS main menu. (A fast-mode boot skips much of the built-in diagnostic testing.) A slow-mode boot may yield a new 8-character error code on the operator panel or new errors in the service processor error log. If a new error code is reported, record it for use in subsequent steps. Slow mode boot is selected through the service processor menus. The procedure is different for systems without an HMC attached versus systems with an HMC attached. v Non-partitioned systems: a. Go to the system power control menu from the service processor main menu. b. Select option 7, the boot mode menu. c. On the boot mode menu, select option 1. This automatically enables Boot to SMS menus. d. Enter 98 to go back to the system power control menu. e. Look at the state of the fast boot enable, option 6. If it is enabled, select option 6 to change it to disabled (disabling fast system boot automatically enables slow mode boot). If it is already disabled, it may have been disabled by the service processor because the previous boot failed. In either case, boot the system by continuing to substep 2f. f. Select option 4 to power on the system. Enter y when prompted. The system will power up and begin the boot process. g. When the system stops at the SMS main menu, use the white power button on the operator panel to power down the system. At this point, the service processor menus should be available to examine the service processor error logs. v Partitioned systems: a. From the HMC, select Partition Manager. b. Highlight the processor subsystem name. c. Right-click the mouse and select Open Terminal Window. d. From the Service Processor Menu on the VTERM select Option 2 System Power Control Menu. 413 e. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot automatically enables slow boot. f. Select Option 98 to exit the system power control menu. g. Use the HMC to power up the system to the SMS menus in a full system partition.3. When the system has stopped at the SMS menus, use the HMC to power off the system. At this point, the service processor menus should be available to examine the service processor error log.Confirming Initial Error Code The service processor may have recorded one or more symptoms in its error log. If the system has attempted to boot since an error terminated normal operation, there may be specific fault information in the service processor error log. Examine this error log before proceeding (see “System Information Menu” on page 781). In the service processor error log, the most recent error is at the top of the list and the oldest error is at the bottom. It is also important to look at the time stamp associated with each error. (The time stamps in the service processor error log reflect coordinated universal time (UTC), which is also referred to as Greenwich mean time.) On this system, the error code displayed in the operator panel may not be indicative of the root cause of the problem. This is also true of the most recent error in the service processor error log. Attempt to find the error code that originally sent you to the “Checkpoints and Error Codes Index” on page 416 table. 1. If you cannot find the error code in the service processor error log, start with the error code at the bottom of the list. Proceed to step 3. 2. If you can find the error code in the service processor error log, observe the time stamps and look for a group of failures that happened within minutes prior to this error code. In this group, start at the error code with the earliest time stamp. Proceed to step 3. 3. Examine the list of error codes you obtained from steps 1 or 2, and look for any of the form 4xxB xxxx. Can you find any error codes of the form 4xxB xxxx? No Proceed to step 6. Yes Proceed to step 4.4. Do any of the error codes of the form 4xxB xxxx have the same first two characters on the left as the first two characters on the left of the error code that sent you to the MAP? No Proceed to step 6. Yes Proceed to step 5.5. Adjust the order of the list of error codes you obtained from steps 1 or 2 by placing the error codes of the form 4xxB xxxx with the same first two characters as the error code that sent you to this MAP on the bottom of the list. Proceed to step 6. starting with the codes at the bottom of the adjusted list of error codes. 6. To get a list of possible FRUs, select the detail screen for each error code. Record the listed location codes and their order. Note: If the error code that results from this procedure is 406x 0EA0, 406x 0EA1, 406x 0EA2, or 406x 0EA4, do not go back to the “Checkpoints and Error Codes Index” on page 416 with this error code. Look for a hardware error in the service processor error log of the form 450x yyyy, 460x yyyy, or 4B2x yyyy that occurred just prior to the 406x 0EAy error. Use this hardware error code instead when returning to the “Checkpoints and Error Codes Index” on page 416.Return to the step in the “Checkpoints and Error Codes Index” on page 416 table immediately after the step that sent you to this ?Confirming Initial Error Code? procedure to perform the listed repair actions. 414 406/90 Service Guide If the repair actions for an error code are ineffective, continue to work upward through this group of error codes (which is forward in time from the earliest time stamp to the latest) until either the problem is fixed, you reach the error code that was first reported, or you are at the top of the list.Four-Character Checkpoints Four-character checkpoints in the form 8xxx, 9xxx, Axxx, Bxxx, and Exxx are listed in “Firmware Checkpoints” on page 379. If you have a four-character code that begins with any character other than 8, 9, A, B, or E, refer to the Diagnostic Information for Multiple Bus Systems. Replacing the Media Subsystem Operator Panel Licensed programs frequently rely on system information that is stored in the VPD module on the media subsystem’s operator panel. If the MAPs indicate that the operator panel should be replaced, the pluggable VPD module must be swapped. If the old module is defective, or cannot be recovered, contact service support. Replacing the Network Adapter If a network adapter is replaced, the network administrator must be notified so that the client IP addresses used by the server can be updated. In addition, the operating system configuration of the network controller may need to be changed in order to enable system startup. Also check to ensure that any client or server that accesses this system is updated. Determining Location Code If you have an error code in the form 2xxx xxxx or Exxx, and no location code associated with it, the location code may possibly be found in the following places, depending on the state of the system: v For systems with AIX installed, if the system will boot into AIX, the location code will be reported when error log analysis is run under AIX diagnostics. v If the system will boot into the system management services (SMS) menus, the SMS error log will contain the location code associated with the error code. v If the system halted when the error occurred, the error code and location can be found in the LCD progress indicator log, which can be accessed using the service processor menus. See “System Information Menu” on page 781, then select Read Progress Indicator From Last System Boot. Find the error code that directed you to this note in the list, and the location code is part of the entry in the log. Attention: To identify a FRU, these MAPs refer to FRU identify LEDs. For more Information about the FRU identify LEDs see “FRU Identify LEDs” on page 65. Chapter 5. Error Code to FRU Index 415 Checkpoints and Error Codes Index The following table is an index to the checkpoints or error codes that you may receive to help diagnose a failure. Locate the code you received, and follow the instructions to determine your next step. 8-Digit Error Codeor4-Digit Checkpoint What You Should Do 0000 xxxx Operator Panel Error Codes Go to “Operator Panel Error Codes” on page 419, and follow the instructions in the operator panel error code table. 1xxx xxxx System Power Control Network (SPCN) Error Codes 1. There may be additional error codes and information in the service processor error log. The system must be powered off to examine the contents of the service processor error log. a. Is the system still powered on? NO Go to step2 below. YES Ask the customer if the system can be powered off, then go to step 1b.b. Can the system be powered off? NO Go to step3 below. YES Power off the system, then Go to step2 below.2. Confirm this is the first error code that caused the failure, as described in “Confirming Initial Error Code” on page 414. 3. Did you come to this step from step 2? NO Go to “SPCN Error Codes” on page 420 with the error codes you already have and follow the instructions in the SPCN error code table. YES Go to “SPCN Error Codes” on page 420 with the error code(s) from step 2 and follow the instructions in the SPCN error code table. 2xxx xxxx Firmware Error Codes Go to “Firmware/POST Error Codes” on page 488, and follow the instructions in the firmware error code table. 416 406/90 Service Guide 8-Digit Error Codeor4-Digit Checkpoint What You Should Do 4xxx xxxx Service Processor Error Codes 1. Is the system powered up? NO Go to step 2 below. YES Go to step 4 below.2. Find the error code in the service processor error log and check the value of word 13. Go to step 3 below. 3. Is the value of word 13 C0xx xxxx? NO Go to step 7 below. YES Go to “MAP 1541: JTAG Problem Isolation” on page 310.4. Is the system running Linux? NO Go to step 6 below. YES Go to step 5 below.5. If the system is not functional, go to step 7 below. If the system is functional, do the following: Note: If you have already recorded the value of word 13 in a previous step, continue to answer the question about the word 13 in this susbstep below. Find the value of word 13 in the Linux system log entry. For detailed instructions on finding word 13 in a Linux system log entry, refer to the actions under error code B1xx 4699 in this service guide. Is the value of word 13 C0xx xxxx? NO Go to step 7 below. YES Go to “MAP 1541: JTAG Problem Isolation” on page 310.6. If the system is not functional, go to step 7. If the system is functional, do the following: Examine the AIX error log entry. Find the value of word 13. For detailed instructions on finding word 13 in a AIX error log entry, refer to the actions under error code B1xx 4699 in this service guide. Is the value of word 13 C0xx xxxx? NO Go to step 7 below. YES Go to “MAP 1541: JTAG Problem Isolation” on page 310. 4xxx xxxx (Cont.) Service Processor Error Codes 7. There might be additional error codes and information in the service processor error log. The system must be powered off to examine the contents of the service processor error log. a. Is the system still powered on? NO Go to step 8 below. YES Ask the customer if the system can be powered off, then go to step 7b.b. Can the system be powered off? NO Go to step 10 below. YES Power off the system, then go to step 8. 8. Perform slow boot procedure as described in “Performing Slow Boot” on page 413. 9. Confirm this is the first error code that caused the failure as described in “Confirming Initial Error Code” on page 414. 10. Did you come to this step from step 9? NO Go to “Service Processor Error Codes” on page 516 with the error codes you already have and follow the instructions in the service processor error codes table. YES Go to “Service Processor Error Codes” on page 516 with the error codes from step 9 above and follow the instructions in the service processor error codes table. Chapter 5. Error Code to FRU Index 417 8-Digit Error Codeor4-Digit Checkpoint What You Should Do 8xxx 9xxx Axxx Bxxx Service Processor Checkpoints Go to “Service Processor Checkpoints” on page 372 for more information on these checkpoints. A1FD 000x The system has entered recovery mode. Go to “Recovery Mode” on page 808. B006 xxxx B1xx xxxx Common Firmware Error Codes 1. There may be additional error codes and information in the service processor error log. The system must be powered off to examine the contents of the service processor error log. a. Is the system still powered on? NO Go to step 2 below. YES Ask the customer if the system can be powered off, then go to step 1b below.b. Can the system be powered off? NO Go to step 3 below. YES Power off the system, then go to step 2 below.2. Confirm this is the first error code that caused the failure as described in “Confirming Initial Error Code” on page 414. 3. Did you come to this step from step 2. NO Go to “Common Firmware Error Codes” on page 737 with the error codes you already have and follow the instructions in the Common Firmware error code table. YES Go to “Common Firmware Error Codes” on page 737 with the error code(s) from step 2 and follow the instructions in the Common Firmware error code table. BBxx xxxx BCxx xxxx pSeries High Performance Switch Error Codes. Go to 45H/W4 Switch Planning, Installation, and Service Guide, order number GA22-7954. Exxx Firmware Checkpoints Go to “Firmware Checkpoints” on page 379 for more information on these checkpoints. Mxxx xxxx Problem Determination-Generated Error Codes Go to “Problem Determination-Generated Error Codes” on page 757, and follow the instructions in the error code table. 418 406/90 Service Guide Operator Panel Error Codes Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Error Code Description Action/ Possible Failing FRU 0000 1111 At power on, failure of power-good signal from power supply 1. Primary I/O backplane Location: U1.18-P1-H2 2. DCAs providing power to the primary I/O book Locations: U1.18-P1-V1, U1.18-P1-V2 0000 1112 At power off, failure of power-good signal from power supply 1. Primary I/O backplane Location: U1.18-P1-H2 2. DCAs providing power to the primary I/O book Locations: U1.18-P1-V1, U1.18-P1-V2 0000 1113 While powered on, failure of power-good signal from power supply 1. Primary I/O backplane Location: U1.18-P1-H2 2. DCAs providing power to the primary I/O book Locations: U1.18-P1-V1, U1.18-P1-V2 0000 1114 Operator panel and service processor have been reset after main power disturbance Informational message 0000 2222 Operator panel not communicating with service processor 1. If system cannot be powered off, reset the service processor (via the pinhole reset or by disconnecting then reconnecting power). 2. Primary I/O book,Location: U1.18-P1-H2 0000 3333 SPCN communications error Primary I/O book,Location: U1.18-P1-H2 0000 3334 I2C repeater error Primary I/O book,Location: U1.18-P1-H2 0000 BBBB Time-of-day battery malfunction Time-of-day battery Location: U1.18-P1-H2-V1 Chapter 5. Error Code to FRU Index 419 SPCN Error Codes Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise, call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the operating system error log, be aware that an error may require an additional 15 minutes to be propagated to the operating system error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Error Code Description Action/ Possible Failing FRU 1010 xxxx An undefined error has occurred in the power subsystem. Call service support. 1011 00AC Last power-down was the result of loss of AC. Informational message 1011 00CA Thermal calibration in progress Has this code been displayed longer than 15 minutes? YES Reboot the system. The code will be cleared when the system successfully reboots. NO Wait until this code has been displayed for 15 minutes to allow thermal calibration to complete. The system will not IPL while thermal calibration is in progress. 1011 3100 I2C Controller Primary I/O bookLocation: U1.18-P1-H2 1011 3101 1. I2C bus 1 fault 2. Operator panel 3. Operator panel cable 1. Primary I/O bookLocation: U1.18-P1-H2 2. Operator panelLocation: U1.17-L1 3. Operator panel cableLocation: U1.17-L1/Q1# 1011 3102 I2C bus 2 fault Primary I/O bookLocation: U1.18-P1-H2 1011 3104 I2C bus 4 fault Primary I/O bookLocation: U1.18-P1-H2 1011 3106 I2C bus 6 fault Primary I/O bookLocation: U1.18-P1-H2 1011 3113 RIO Bus Adapter Primary I/O bookLocation: U1.18-P1-H2 420 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1011 3114 PCI Backplane Was the primary I/O book at location U1.18-P1-H2 changed prior to the most recent reboot of the system? YES This error code may be due to down-level system power control network firmware on the new primary I/O book. The system will automatically update the system power control network firmware to the correct level. This error code should be cleared after the next reboot. NO Replace the primary I/O book, Location: U1.18-P1-H2 1011 3115 Operator Panel 1. Operator panel,location: U1.17-L1 2. Primary I/O book, Location: U1.18-P1-H2 1011 3118 SPCN Primary I/O bookLocation: U1.18-P1-H2 1011 34A2 SPCN Primary I/O bookLocation: U1.18-P1-H2 1011 8400 VPD Configuration Primary I/O bookLocation: U1.18-P1-H2 1011 8401 VPD access problem 1. System VPD module Location: U1.17-L1-N1 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8402 VPD access problem Primary I/O bookLocation: U1.18-P1-H2 1011 8403 VPD level mismatch 1. System VPD module Location: U1.17-L1-N1 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8404 MCM 2 VPD mismatch 1. MCM 2 and VPD card,Locations: U1.18-P1-C4 and U1.18-P1-H2.4 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8405 MCM 1 VPD mismatch 1. MCM 1 and VPD card,Locations: U1.18-P1-C3 and U1.18-P1-H2.5 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8406 MCM 3 VPD mismatch 1. MCM 3 and VPD card,Locations: U1.18-P1-C2 and U1.18-P1-H2.6 2. Primary I/O bookLocation: U1.18-P1-H2 Chapter 5. Error Code to FRU Index 421 Error Code Description Action/ Possible Failing FRU 1011 8408 MCM 0 VPD mismatch 1. MCM 0 and VPD card,locations: U1.18-P1-C1 and U1.18-P1-H2.3 2. Primary I/O book, location: U1.18-P1-H2 1011 8409 No valid processor VPD cards found on primary I/O book 1. Verify that the processor VPD cards are properly installed and in the correct slots on the primary I/O book for the MCMs that are installed. 2. Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 1011 840A VPD +5 stuck high Primary I/O bookLocation: U1.18-P1-H2 1011 840B VPD 5 V power-on failure Primary I/O bookLocation: U1.18-P1-H2 1011 840D SPCN Configuration mismatch 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 840E SPCN default configuration loaded 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 840F SPCN Configuration mismatch 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8413 Invalid MCM 0 VPD card or MCM 1. Location: U1.18-P1-H2.3 and U1.18-P1-C1See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8414 Invalid MCM 2 VPD card or MCM 1. Location: U1.18-P1-H2.4 and U1.18-P1-C4See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8415 Invalid MCM 1 VPD card or MCM 1. Location: U1.18-P1-H2.5 and U1.18-P1-C3See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8416 Invalid MCM 3 VPD card or MCM 1. Location: U1.18-P1-H2.6 and U1.18-P1-C2See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8423 No MCM 0 VPD card or MCM 1. Location: U1.18-P1-H2.3 and U1.18-P1-C1See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 422 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1011 8424 No MCM 2 VPD card or MCM 1. Location: U1.18-P1-H2.4 and U1.18-P1-C4See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8425 No MCM 1 VPD card or MCM 1. Location: U1.18-P1-H2.5 and U1.18-P1-C3See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8426 No MCM 3 VPD card or MCM 1. Location: U1.18-P1-H2.6 and U1.18-P1-C2See note on page 516. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8430 SPCN configuration mismatch 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8431 SPCN configuration mismatch 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8440 SPCN configuration mismatch 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8441 SPCN configuration mismatch 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1011 8468 Capacitor card fault 1. If fewer than 3 DCAs are installed: a. If no capacitor card is installed in U1.18-P1-V3 or U1.18-P1-V7, install a card in U1.18-P1-V3. b. If a capacitor card is present in either U1.18-P1-V3 or U1.18-P1-V7, reseat it. If the problem is not fixed, replace it.2. If 3 or more DCAs are installed: a. If no capacitor card is installed in U1.18-P1-V3, one is required. b. If a capacitor card is installed in U1.18-P1-V3, reseat it. If the problem is not fixed, replace it.3. If the same error code reoccurs, replace the primary I/O book, location: U1.18-P1-H2. Chapter 5. Error Code to FRU Index 423 Error Code Description Action/ Possible Failing FRU 1011 8469 Capacitor card fault 1. If no capacitor card is installed in U1.18-P1-V7, install one. The capacitor card is required. 2. If a capacitor card is present in U1.18-P1-V7, reseat it. If the problem is not fixed, replace the capacitor card. 3. If the same error code reoccurs, replace the primary I/O book, location: U1.18-P1-H2. 1011 8700 BPA A ac loss Go to “MAP 1520: Power” on page 192. 1011 8701 BPA B ac loss Go to “MAP 1520: Power” on page 192. 1011 8710 BPC A communication failure 1. Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203. 2. BPC A cable to primary I/O bookLocation: U1.18-P1-H2/Q8# 3. BPC A controllerLocation: U1.35-P1-X4 4. Primary I/O bookLocation: U1.18-P1-H2 1011 8711 BPC B communication failure 1. Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203. 2. BPC B cable to primary I/O bookLocation: U1.18-P1-H2/Q9# 3. BPC B controllerLocation: U1.35-P2-X4 4. Primary I/O bookLocation: U1.18-P1-H2 1011 8722 BPC A and B are not at standby. 1. BPC A controllerLocation: U1.35-P1-X4 2. BPC B controllerLocation: U1.35-P2-X4 1011 8731 BPC failure 1. Ensure that the red switch on the front of the EPO panel is in the ON position. If there is no external EPO cable, also ensure that the internal toggle switch to the ROOM EPO BYPASS on the bottom of the EPO panel is set to the left before proceeding. 2. BPC A controllerLocation: U1.35-P1-X4and BPC-B controller,Location: U1.35-P2-X4 3. Primary I/O bookLocation: U1.18-P1-H2 1011 8750 BPC download failure on BPA A 1. BPC A controller,Location: U1.35-P1-X4 2. Primary I/O book, Location: U1.18-P1-H2 424 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1011 8751 BPC download failure on BPA B 1. BPC-B controller,Location: U1.35-P2-X4 2. Primary I/O book, Location: U1.18-P1-H2 1011 875A Informational message. No power subsystem FRU code update is required. 1011 875B Informational message. The power subsystem FRU code update has begun. 1011 875C Informational message. The power subsystem FRU code update has ended. 1011 8800 BPC A cable missing 1. BPC A cable to primary I/O bookLocation: U1.18-P1-H2/Q8# 2. BPC A controllerLocation: U1.35-P1-X4 3. Primary I/O bookLocation: U1.18-P1-H2 1011 8801 BPC B cable missing 1. BPC B cable to primary I/O bookLocation: U1.18-P1-H2/Q9# 2. BPC B controllerLocation: U1.35-P2-X4 3. Primary I/O bookLocation: U1.18-P1-H2 1011 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1011 xxxx SPCN configuration error 1. Set I/O type to correct value. See “Change I/O Type” on page 785. 2. Primary I/O bookLocation: U1.18-P1-H2 1014 1B17 Problem with fan 1 in first I/O subsystem in primary rack 1. DCA-1Location: U1.9-V1 2. Fan 1Location: U1.9-F1 1014 1B27 Problem with fan 2 in first I/O subsystem in primary rack 1. DCA-2Location: U1.9-V2 2. Fan 2Location: U1.9-F2 1014 1C06 Loss of airflow in first I/O subsystemlocation: U1.9 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1014 6014 Overcurrent or overvoltage problem with 3.3V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6024 Overcurrent or overvoltage problem with 5.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6034 Overcurrent or overvoltage problem with 2.5V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6044 Overcurrent or overvoltage problem with 12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 425 Error Code Description Action/ Possible Failing FRU 1014 6054 Overcurrent or overvoltage problem with -12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6814 Overcurrent or overvoltage problem with 3.3V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6824 Overcurrent or overvoltage problem with 5.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6834 Overcurrent or overvoltage problem with 2.5V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6844 Overcurrent or overvoltage problem with 12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 6854 Overcurrent or overvoltage problem with -12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 7017 Problem with fan 3 in first I/O subsystem in primary rack 1. DCA-1Location: U1.9-V1 2. Fan 3Location: U1.9-F3 1014 7027 Problem with fan 4 in first I/O subsystem in primary rack 1. DCA-2Location: U1.9-V2 2. Fan 4Location: U1.9-F4 1014 7117 Problem with fan 1 and fan 3 in first I/O subsystem in primary rack 1. DCA-1Location: U1.9-V1 2. Fans 1 and 3Location: U1.9-F1 and U1.9-F3 1014 7127 Problem with fan 2 and fan 4 in first I/O subsystem in primary rack 1. DCA-2Location: U1.9-V2 2. Fans 2 and 4Location: U1.9-F2 and U1.9-F4 1014 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1014 B014 Overcurrent or overvoltage problem with 3.3V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B024 Overcurrent or overvoltage problem with 5.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B034 Overcurrent or overvoltage problem with 2.5V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B044 Overcurrent or overvoltage problem with 12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B054 Overcurrent or overvoltage problem with -12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B814 Overcurrent or overvoltage problem with 3.3V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B824 Overcurrent or overvoltage problem with 5.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B834 Overcurrent or overvoltage problem with 2.5V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 426 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1014 B844 Overcurrent or overvoltage problem with 12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 B854 Overcurrent or overvoltage problem with -12.0V in first I/O subsystem Go to “MAP 1520: Power” on page 192. 1014 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 1014 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.9Action: This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.9-P1. 1014 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.9Actions: This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.9-P2. 1014 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1014 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 427 Error Code Description Action/ Possible Failing FRU 1014 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1014 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1014 F4x6 Hardware configuration does not support N-mode power. Call Service Support 1014 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem, location: U1.9. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1014 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem, location: U1.9. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1014 xx11 First I/O subsystem in primary rack, DCA-1 U1.9-V1 1014 xx13 Seeprom fault on DCA-1 in first I/O subsystem in primary rack U1.9-V1 1014 xx15 First I/O subsystem in primary rack, DCA-1, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.9-V1/Q1 and U1.35-P2-X3/Q1 2. U1.9-V1/Q2 and U1.35-P1-X3/Q1 1014 xx17 Problem with fan 1 and fan 3 in first I/O subsystem in primary rack 1. DCA-1Location U1.9–V1 2. Fan 1,Location U1.9–F1 3. Fan 3,Location U1.9–F3 1014 xx21 First I/O subsystem in primary rack, DCA-2 U1.9-V2 1014 xx23 Seeprom fault on DCA-2 in first I/O subsystem in primary rack U1.9-V2 1014 xx25 First I/O subsystem in primary rack, DCA-2, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.9-V2/Q1 and U1.35-P2-X3/Q2 2. U1.9-V2/Q2 and U1.35-P1-X3/Q2 1014 xx27 Problem with fan 2 and fan 4 in first I/O subsystem in primary rack 1. DCA-2Location U1.9–V2 2. Fan 2,Location U1.9–F2 3. Fan 4,Location U1.9–F4 428 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1014 xx31 First I/O subsystem in primary rack, I/O subsystem backplane 1 fault U1.9-P1 1014 xx33 Seeprom fault on I/O subsystem backplane 1 in first I/O subsystem in primary rack U1.9-P1 1014 xx41 First I/O subsystem in primary rack, I/O subsystem backplane 2 fault U1.9-P2 1014 xx43 Seeprom fault on I/O subsystem backplane 2 in first I/O subsystem in primary rack U1.9-P2 1014 xx51 First I/O subsystem in primary rack, DASD backplane 1 fault U1.9-P3 1014 xx53 Seeprom fault on DASD backplane 1 in first I/O subsystem in primary rack U1.9-P3 1014 xx61 First I/O subsystem in primary rack, DASD backplane 2 fault U1.9-P4 1014 xx63 Seeprom fault on DASD backplane 2 in first I/O subsystem in primary rack U1.9-P4 1014 xx71 First I/O subsystem in primary rack, DASD backplane 3 fault U1.9-P5 1014 xx73 Seeprom fault on DASD backplane 3 in first I/O subsystem in primary rack U1.9-P5 1014 xx81 First I/O subsystem in primary rack, DASD backplane 4 fault U1.9-P6 1014 xx83 Seeprom fault on DASD backplane 4 in first I/O subsystem in primary rack U1.9-P6 1015 1B17 Problem with fan 1 in second I/O subsystem in primary rack. 1. DCA-1Location: U1.5-V1 2. Fan 1Location: U1.5-F1 1015 1B27 Problem with fan 2 in second I/O subsystem in primary rack. 1. DCA-1Location: U1.5-V2 2. Fan 1Location: U1.5-F2 1015 1C06 Loss of airflow in second I/O subsystem, location: U1.5 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1015 6814 Overcurrent or overvoltage problem with 3.3V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 6824 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 6834 Overcurrent or overvoltage problem with 2.5V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 6844 Overcurrent or overvoltage problem with 12.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 6854 Overcurrent or overvoltage problem with -12.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 429 Error Code Description Action/ Possible Failing FRU 1015 7017 Problem with fan 3 in second I/O subsystem in primary rack 1. DCA-1Location: U1.5-V1 2. Fan 3Location: U1.5-F3 1015 7027 Problem with fan 4 in second I/O subsystem in primary rack 1. DCA-2Location: U1.5-V2 2. Fan 4Location: U1.5-F4 1015 7117 Problem with fan 1 and fan 3 in second I/O subsystem in primary rack 1. DCA-1Location: U1.5-V1 2. Fans 1 and 3Location: U1.5-F1 and U1.5-F3 1015 7127 Problem with fan 2 and fan 4 in second I/O subsystem in primary rack 1. DCA-2Location: U1.5-V2 2. Fans 2 and 4Location: U1.5-F2 and U1.5-F4 1015 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1015 B014 Overcurrent or overvoltage problem with 3.3V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B024 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B034 Overcurrent or overvoltage problem with 2.5V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B044 Overcurrent or overvoltage problem with 12.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B054 Overcurrent or overvoltage problem with -12.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B814 Overcurrent or overvoltage problem with 3.3V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B824 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B834 Overcurrent or overvoltage problem with 2.5V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B844 Overcurrent or overvoltage problem with 12.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 B854 Overcurrent or overvoltage problem with -12.0V in second I/O subsystem Go to “MAP 1520: Power” on page 192. 1015 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 430 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1015 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.5 Action: This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.5-P1. 1015 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.5Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.5-P2. 1015 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1015 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1015 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1015 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1015 F4x6 Hardware configuration does not support N-mode power. Call Service Support Chapter 5. Error Code to FRU Index 431 Error Code Description Action/ Possible Failing FRU 1015 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in the second I/O subsystem, location: U1.5. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1015 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the second I/O subsystem, location: U1.5. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1015 xx11 Second I/O subsystem in primary rack, DCA-1 U1.5-V1 1015 xx13 Seeprom fault on DCA-1 in second I/O subsystem in primary rack U1.5-V1 1015 xx15 Second I/O subsystem in primary rack, DCA-1, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.5-V1/Q1 and U1.35-P2-X3/Q3 2. U1.5-V1/Q2 and U1.35-P1-X3/Q3 1015 xx17 Problem with fan 1 and fan 3 in second I/O subsystem in primary rack 1. DCA-1Location U1.5–V1 2. Fan 1,Location U1.5–F1 3. Fan 3,Location U1.5–F3 1015 xx21 Second I/O subsystem in primary rack, DCA-2 U1.5-V2 1015 xx23 Seeprom fault on DCA-2 in second I/O subsystem in primary rack U1.5-V2 1015 xx25 Second I/O subsystem in primary rack, DCA-2, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.5-V2/Q1 and U1.35-P2-X3/Q4 2. U1.5-V2/Q2 and U1.35-P1-X3/Q4 1015 xx27 Problem with fan 2 and fan 4 in second I/O subsystem in primary rack 1. DCA-2Location U1.5–V2 2. Fan 2,Location U1.5–F2 3. Fan 4,Location U1.5–F4 1015 xx31 Second I/O subsystem in primary rack, I/O subsystem backplane 1 fault U1.5-P1 1015 xx33 Seeprom fault on I/O subsystem backplane 1 in second I/O subsystem in primary rack U1.5-P1 1015 xx41 Second I/O subsystem in primary rack, I/O subsystem backplane 2 fault U1.5-P2 432 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1015 xx43 Seeprom fault on I/O subsystem backplane 2 in second I/O subsystem in primary rack U1.5-P2 1015 xx51 Second I/O subsystem in primary rack, DASD backplane 1 fault U1.5-P3 1015 xx53 Seeprom fault on DASD backplane 1 in second I/O subsystem in primary rack U1.5-P3 1015 xx61 Second I/O subsystem in primary rack, DASD backplane 2 fault U1.5-P4 1015 xx63 Seeprom fault on DASD backplane 2 in second I/O subsystem in primary rack U1.5-P4 1015 xx71 Second I/O subsystem in primary rack, DASD backplane 3 fault U1.5-P5 1015 xx73 Seeprom fault on DASD backplane 3 in second I/O subsystem in primary rack U1.5-P5 1015 xx81 Second I/O subsystem in primary rack, DASD backplane 4 fault U1.5-P6 1015 xx83 Seeprom fault on DASD backplane 4 in second I/O subsystem in primary rack U1.5-P6 1016 1B17 Problem with fan 1 in third I/O subsystem in primary rack 1. DCA-1Location: U1.1-V1 2. Fan 1Location: U1.1-F1 1016 1B27 Problem with fan 2 in third I/O subsystem in primary rack 1. DCA-2Location: U1.1-V2 2. Fan 2Location: U1.1-F2 1016 1C06 Loss of airflow in third I/O subsystem, location: U1.1 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1016 6014 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6024 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6034 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6044 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6054 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6814 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6824 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6834 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 6844 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 433 Error Code Description Action/ Possible Failing FRU 1016 6854 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 7017 Problem with fan 3 in third I/O subsystem in primary rack 1. DCA-1Location: U1.1-V1 2. Fan 3Location: U1.1-F3 1016 7027 Problem with fan 4 in third I/O subsystem in primary rack 1. DCA-2Location: U1.1-V2 2. Fan 4Location: U1.1-F4 1016 7117 Problem with fan 1 and fan 3 in third I/O subsystem in primary rack 1. DCA-1Location: U1.1-V1 2. Fans 1 and 3Location: U1.1-F1 and U1.1-F3 1016 7127 Problem with fan 2 and fan 4 in third I/O subsystem in primary rack 1. DCA-2Location: U1.1-V2 2. Fans 2 and 4Location: U1.1-F2 and U1.1-F4 1016 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1016 B014 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B024 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B034 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B044 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B054 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B814 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B824 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B834 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B844 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 1016 B854 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem Go to “MAP 1520: Power” on page 192. 434 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1016 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 1016 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.1-P1. 1016 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.1-P2. 1016 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1016 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1016 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1016 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 435 Error Code Description Action/ Possible Failing FRU 1016 F4x6 Hardware configuration does not support N-mode power. Call Service Support 1016 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in the third I/O subsystem, location: U1.1. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1016 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the third I/O subsystem, location: U1.1. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1016 xx11 third I/O subsystem in primary rack, DCA-1 U1.1-V1 1016 xx13 Seeprom fault on DCA-1 in third I/O subsystem in primary rack U1.1-V1 1016 xx15 Third I/O subsystem in primary rack, DCA-1, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.1-V1/Q1 and U1.35-P2-X3/Q5 2. U1.1-V1/Q2 and U1.35-P1-X3/Q5 1016 xx17 Problem with fan 1 and fan 3 in third I/O subsystem in primary rack 1. DCA-1Location U1.1–V1 2. Fan 1,Location U1.1–F1 3. Fan 3,Location U1.1–F3 1016 xx21 Third I/O subsystem in primary rack, DCA-2 U1.1-V2 1016 xx23 Seeprom fault on DCA-2 in third I/O subsystem in primary rack U1.1-V2 1016 xx25 Third I/O subsystem in primary rack, DCA-2, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.1-V2/Q1 and U1.35-P2-X3/Q6 2. U1.1-V2/Q2 and U1.35-P1-X3/Q6 1016 xx27 Problem with fan 2 and fan 4 in third I/O subsystem in primary rack 1. DCA-2Location U1.1–V2 2. Fan 2,Location U1.1F2 3. Fan 4,Location U1.1–F4 1016 xx31 Third I/O subsystem in primary rack, I/O subsystem backplane 1 fault U1.1-P1 1016 xx33 Seeprom fault on I/O subsystem backplane 1 in third I/O subsystem in primary rack U1.1-P1 436 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1016 xx41 Third I/O subsystem in primary rack, I/O subsystem backplane 2 fault U1.1-P2 1016 xx43 Seeprom fault on I/O subsystem backplane 2 in third I/O subsystem in primary rack U1.1-P2 1016 xx51 Third I/O subsystem in primary rack, DASD backplane 1 fault U1.1-P3 1016 xx53 Seeprom fault on DASD backplane 1 in third I/O subsystem in primary rack U1.1-P3 1016 xx61 Third I/O subsystem in primary rack, DASD backplane 2 fault U1.1-P4 1016 xx63 Seeprom fault on DASD backplane 2 in third I/O subsystem in primary rack U1.1-P4 1016 xx71 Third I/O subsystem in primary rack, DASD backplane 3 fault U1.1-P5 1016 xx73 Seeprom fault on DASD backplane 3 in third I/O subsystem in primary rack U1.1-P5 1016 xx81 Third I/O subsystem in primary rack, DASD backplane 4 fault U1.1-P6 1016 xx83 Seeprom fault on DASD backplane 4 in third I/O subsystem in primary rack U1.1-P6 1017 1B17 Problem with fan 1 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-1Location: U1.13-V1 2. Fan 1Location: U1.13-F1 1017 1B27 Problem with fan 2 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-2Location: U1.13-V2 2. Fan 2Location: U1.13-F2 1017 1C06 Loss of airflow in fourth I/O subsystem, no IBFlocation: U1.13 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1017 6014 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6024 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6034 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6044 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6054 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6814 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6824 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6834 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 437 Error Code Description Action/ Possible Failing FRU 1017 6844 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 6854 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 7017 Problem with fan 3 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-1Location: U1.13-V1 2. Fan 3Location: U1.13-F3 1017 7027 Problem with fan 4 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-2Location: U1.13-V2 2. Fan 4Location: U1.13-F4 1017 7117 Problem with fan 1 and fan 3 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-1Location: U1.13-V1 2. Fans 1 and 3Location: U1.13-F1 and U1.13-F3 1017 7127 Problem with fan 2 and fan 4 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-2Location: U1.13-V2 2. Fans 2 and 4Location: U1.13-F2 and U1.13-F4 1017 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1017 B014 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B024 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B034 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B044 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B054 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B814 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B824 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B834 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B844 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 1017 B854 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem, no IBF Go to “MAP 1520: Power” on page 192. 438 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1017 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 1017 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.13-P1. 1017 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U1.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U1.13-P2.Go to “MAP 1520: Power” on page 192. 1017 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1017 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1017 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 439 Error Code Description Action/ Possible Failing FRU 1017 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1017 F4x6 Hardware configuration does not support N-mode power. Call Service Support 1017 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth I/O subsystem, location: U1.13. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1017 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth I/O subsystem, location: U1.13. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1017 xx11 Fourth I/O subsystem in primary rack (no IBF), DCA-1 U1.13-V1 1017 xx13 Seeprom fault on DCA-1 in fourth I/O subsystem (no IBF) in primary rack U1.13-V1 1017 xx15 Fourth I/O subsystem in primary rack (no IBF) If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.13-V1/Q1 and U1.35-P2-X3/Q7 2. U1.13-V1/Q2 and U1.35-P1-X3/Q7If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.13-V1/Q1 and U1.35-P2-X2/Q9 2. U1.13-V1/Q2 and U1.35-P1-X2/Q9 1017 xx17 Problem with fan 1 and fan 3 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-1Location U1.13-V1 2. Fan 1,Location U1.13-F1 3. Fan 3,Location U1.13-F3 1017 xx21 Fourth I/O subsystem in primary rack (no IBF), DCA-2 U1.13-V2 1017 xx23 Seeprom fault on DCA-2 in fourth I/O subsystem (no IBF) in primary rack U1.13-V2 440 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1017 xx25 Fourth I/O subsystem in primary rack no IBFs If there is no DCA in U1.18-P1-V8, primary rack, no IBFs, go to “MAP 1520: Power” on page 192with these pairs of location codes: 1. U1.13-V2/Q1 and U1.35-P2-X3/Q8 2. U1.13-V2/Q2 and U1.35-P1-X3/Q8If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.13-V2/Q1 and U1.35-P2-X2/Q10 2. U1.13-V2/Q2 and U1.35-P1-X2/Q10 1017 xx27 Problem with fan 2 and fan 4 in fourth I/O subsystem (no IBF installed) in primary rack 1. DCA-2Location U1.13-V2 2. Fan 2,Location U1.13-F2 3. Fan 4,Location U1.13-F4 1017 xx31 Fourth I/O subsystem in primary rack, I/O subsystem backplane 1 fault U1.13-P1 1017 xx33 Seeprom fault on I/O subsystem backplane 1 in fourth I/O subsystem (no IBF) in primary rack U1.13-P1 1017 xx41 Fourth I/O subsystem in primary rack, I/O subsystem backplane 2 fault U1.13-P2 1017 xx43 Seeprom fault on I/O subsystem backplane 2 in fourth I/O subsystem (no IBF) in primary rack U1.13-P2 1017 xx51 Fourth I/O subsystem in primary rack, DASD backplane 1 fault U1.13-P3 1017 xx53 Seeprom fault on DASD backplane 1 in fourth I/O subsystem (no IBF) in primary rack U1.13-P3 1017 xx61 Fourth I/O subsystem in primary rack, DASD backplane 2 fault U1.13-P4 1017 xx63 Seeprom fault on DASD backplane 2 in fourth I/O subsystem (no IBF) in primary rack U1.13-P4 1017 xx71 Fourth I/O subsystem in primary rack, DASD backplane 3 fault U1.13-P5 1017 xx73 Seeprom fault on DASD backplane 3 in fourth I/O subsystem (no IBF) in primary rack U1.13-P5 1017 xx81 Fourth I/O subsystem in primary rack, DASD backplane 4 fault U1.13-P6 1017 xx83 Seeprom fault on DASD backplane 4 in fourth I/O subsystem (no IBF) in primary rack U1.13-P6 101A 0106 BPA A ac loss in a frame that contains only High Performance Switches (no CECs). Go to “MAP 1520: Power” on page 192. 101A 0856 BPR 1A communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101A 0866 BPR 2A communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. Chapter 5. Error Code to FRU Index 441 Error Code Description Action/ Possible Failing FRU 101A 0876 BPR 3A communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101A 0D06 350 volt bulk failure, BPA-A Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203. 101A 1C06 Cage airflow missing (temperature problem) Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 101A 3F06 Invalid I/O subsystem configurations 1. Verify that the I/O subsystems are properly cabled. Note: If the service processor is hung, this error code causes the service processor to hang. Refer to “Resetting the Service Processor” on page 819. 2. Call service support. 101A 7A56 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7A66 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7A76 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7A86 IBF 1A failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7A96 IBF 2A failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7AA6 IBF 3A failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7B56 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7B66 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7B76 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7B86 IBF 1A failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7B96 IBF 2A failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A 7BA6 IBF 3A failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101A CE16 BPC-BPC communication failure 1. Check the cable between the two BPCs; reseat and look for bent pins. 2. Replace BPC-A, location: U1.35-P1-X4 3. Replace BPC-B, location: U1.35-P2-X4. 101A D216 BPA-BPA communication failure Go to “MAP 1520: Power” on page 192 for isolation. 442 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101A D616 Problem with external UPS: utility failure 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A D716 Problem with external UPS: battery low 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A D756 Informational message about the IBF. Informational message. 101A D766 Informational message about the IBF. Informational message. 101A D776 Informational message about the IBF. Informational message. 101A D816 External UPS: bypass active (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A D916 External UPS: power restored (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A DA16 External UPS: installed (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A DB16 Problem with external UPS: UPS not available 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A DC16 External UPS: battery not low (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A DC56 Informational message about the IBF. Informational message. Chapter 5. Error Code to FRU Index 443 Error Code Description Action/ Possible Failing FRU 101A DC66 Informational message about the IBF. Informational message. 101A DC76 Informational message about the IBF. Informational message. 101A DD16 External UPS: bypass ended (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A DF16 Problem with external UPS: UPS failure 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101A E106 Critical logic over temperature Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211. 101A E206 Ambient room temperature too high 1. The ambient room temperature has reached a warning level (30 C) or a critical level (36 C). Ask the customer to correct the room temperature problem. 2. Call service support. 101A F4x6 Hardware configuration does not support N-mode power Call service support. 101A F806 BPC A: loss of ac power or phase missing Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101A F906 BPC A: loss of ac power or phase missing Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101A F806 BPC A: loss of ac power or phase missing Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101A F906 BPC A: loss of ac power or phase missing Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101A FA06 Problem with EPO switch detected by BPC-A Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101A FDB6 BPC A: open room EPO switch has been detected Go to “MAP 1524: An Open Room EPO Switch Has Been Detected From One Side” on page 201. 101A FE06 UEPO switch on BPC A is in the bypass position Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199. 101A FE16 UEPO switch on BPC A is in the bypass position Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199. 101A FE56 BPR 1A communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101A FE66 BPR 2A communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 444 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101A FE76 BPR 3A communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101A FFx6 Cabling error on the BPC or a BPD that is not isolated There is a cabling problem on a BPC or BPD that cannot be isolated.The x is decoded as follows:If x=1, the problem is on the BPC, location: U1.35-P1-X4If x=2, the problem is on BPD-1, location: U1.35-P1-X3 If x=3, the problem is on BPD-2, location: U1.35-P1-X2If x=4, the problem is on BPD-3, location: U1.35-P1-X1 1. Check the UPIC cabling on the BPC or BPD specified by decoding the x as shown above. Correct any cabling problems that are found. 2. If no problems are found on the A side of the BPE, check the UPIC cabling on the BPC or BPD in the same position on the B side of the BPE for cabling errors. Correct and problems that are found. 3. If both sides of the BPE are cabled correctly, call service support. 101A x115 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q1# to U1.18-X1/Q1# 101A x125 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q1# to U1.9-V1/Q2# 101A x135 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q1# to U2.1-V1/Q1# 101A x215 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q2# to U1.18-X2/Q1# 101A x225 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q2# to U1.9-V2/Q2# 101A x235 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q2# to U2.1-V2/Q1# 101A x315 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q3# to U1.18-X3/Q1# 101A x325 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q3# to U1.5-V1/Q2# 101A x335 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q3# to U2.5-V1/Q1# Chapter 5. Error Code to FRU Index 445 Error Code Description Action/ Possible Failing FRU 101A x415 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q4# to U1.18-X4/Q1# 101A x425 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q4# to U1.5-V2/Q2# 101A x435 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q4# to U2.5-V2/Q1# 101A x515 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q5# to U1.18-P1-V1/Q1# 101A x525 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q5# to U1.1-V1/Q2# 101A x535 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q5# to U2.9-V1/Q1# 101A x615 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q6# to U1.18-P1-V2/Q1# 101A x625 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q6# to U1.1-V2/Q2# 101A x635 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q6# to U2.9-V2/Q1# 101A x715 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X4/Q7# to U1.18-P1-V4/Q1# 101A x725 Power cable fault If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P1-X3/Q7# to U1.13-V1/Q2#If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q7# to U1.18-P1-V8/Q1# 101A x735 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q7# to U2.13-V1/Q1# 101A x825 Power cable fault If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q8# to U1.13-V2/Q2#If there is a 6th DCA in U1.18-P1-V8, this error code will not be reported. 101A x835 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q8# to U2.13-V2/Q1# 446 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101A x925 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q9# to U1.18-P1-V5/Q1# 101A x935 Power cable fault If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q9# to U2.19-V1/Q1# If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q9# to U1.13-V1/Q1# 101A x945 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P2-X1/Q9# to U2.19-V1/Q1# 101A xA25 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X3/Q10# to U1.18-P1-V6/Q1# 101A xA35 Power cable fault If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q10# to U2.19-V2/Q1#If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q10# to U1.13-V2/Q1# 101A xA45 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X1/Q10# to U2.19-V2/Q1# 101A xx0x BPA A-FRU not isolated U1.35 101A xx11 BPA-A / BPC U1.35-P1-X4 101A xx13 Seeprom (VPD) fault on BPC-A U1.35-P1-X4 101A xx17 Fan fault on BPC-A (BPA-A fan) 1. BPE-A side fan Location: U1.35-P1-F1 2. BPC-A Location: U1.35-P1-X4 101A xx21 BPA-A / BPD-1 U1.35-P1-X3 101A xx22 BPC-A to BPD-1 communications fault 1. BPD-1 Location: U1.35-P1-X3 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1 101A xx23 Seeprom (VPD) fault on BPD-1A U1.35-P1-X3 101A xx31 BPA-A / BPD-2 U1.35-P1-X2 Chapter 5. Error Code to FRU Index 447 Error Code Description Action/ Possible Failing FRU 101A xx32 BPC-A to BPD-2 communications fault 1. BPD-2 Location: U1.35-P1-X2 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1 101A xx33 Seeprom (VPD) fault on BPD-2A U1.35-P1-X2 101A xx41 BPA-A / BPD-3 U1.35-P1-X1 101A xx42 BPC-A to BPD-3 communications fault 1. BPD-3 Location: U1.35-P1-X1 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1 101A xx43 Seeprom (VPD) fault on BPD-3A U1.35-P1-X1 101A xx51 BPA-A / BPR-1 U1.35-P1-V1 101A xx52 BPC-A to BPR-1 communications fault 1. BPR-1 Location: U1.35-P1-V1 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1 101A xx53 Seeprom (VPD) fault on BPR-1A U1.35-P1-V1 101A xx61 BPA-A / BPR-2 U1.35-P1-V2 101A xx62 BPC-A to BPR-2 communications fault 1. BPR-2 Location: U1.35-P1-V2 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1 101A xx63 Seeprom (VPD) fault on BPR-2A U1.35-P1-V2 101A xx71 BPA-A / BPR-3 U1.35-P1-V3 101A xx72 BPC-A to BPR-3 communications fault 1. BPR-3 Location: U1.35-P1-V3 2. BPC-A Location: U1.35-P1-X4 3. BPE backplane A Location: U1.35-P1 101A xx73 Seeprom (VPD) fault on BPR-3A U1.35-P1-V3 101A xx81 BPA-A / IBF 1 U1.13-P1-V1 101A xx83 Seeprom (VPD) fault on IBF-1A U1.13-P1-V1 101A xx91 BPA-A / IBF 2 U1.15-P1-V2 101A xx93 Seeprom (VPD) fault on IBF-2A U1.15-P1-V2 101A xxA1 BPA-A / IBF 3 U2.17-P1-V5 101A xxA3 Seeprom (VPD) fault on IBF-3A U2.17-P1-V5 448 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101A xxB1 BPA-A / EPO_SW 1. Replace the UEPO switch,Location: U1-V1 2. Check the cable from BPC-A to the UEPO switch, Location U1-V1/Q1# 101B 0106 BPA B ac loss in a frame that contains only High Performance Switches (no CECs). Go to “MAP 1520: Power” on page 192. 101B 0856 BPR 1B communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101B 0866 BPR 2B communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101B 0876 BPR 3B communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101B 0D06 350 volt bulk failure, BPA-B Go to “MAP 1525: There Is a 350 Volt Bulk Failure” on page 203. 101B 1C06 Cage airflow missing (temperature problem) Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 101B 3F06 Invalid I/O subsystem configuration 1. Verify that the I/O subsystems are properly cabled. Note: If the service processor is hung, this error code causes the service processor to hang. Refer to “Resetting the Service Processor” on page 819. 2. Call service support. 101B 7A56 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7A66 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7A76 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7A86 IBF 1B failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7A96 IBF 2B failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7AA6 IBF 3B failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7B56 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7B66 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7B76 IBF failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7B86 IBF 1B failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B 7B96 IBF 2B failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. Chapter 5. Error Code to FRU Index 449 Error Code Description Action/ Possible Failing FRU 101B 7BA6 IBF 3B failure Go to “MAP 1526: There Is An Integrated Battery Feature (IBF) Failure” on page 206. 101B CE16 BPC-BPC communication failure 1. Check the cable between the two BPCs; reseat and look for bent pins. 2. Replace BPC-A, location: U1.35-P1-X4 3. Replace BPC-B, location: U1.35-P2-X4. 101B D216 BPA-BPA communication failure Go to “MAP 1520: Power” on page 192 for isolation. 101B D616 Problem with external UPS: utility failure 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B D716 Problem with external UPS: battery low 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B D716 Informational message about the IBF. Informational message. 101B D766 Informational message about the IBF. Informational message. 101B D776 Informational message about the IBF. Informational message. 101B D816 External UPS: bypass active (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B D916 External UPS: power restored (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B DA16 External UPS: installed (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B DB16 Problem with external UPS: UPS not available 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 450 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101B DC16 External UPS: battery not low (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B DC56 Informational message about the IBF. Informational message. 101B DC66 Informational message about the IBF. Informational message. 101B DC76 Informational message about the IBF. Informational message. 101B DD16 External UPS: bypass ended (informational message) Informational message. If a problem is suspected: 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B DF16 Problem with external UPS: UPS failure 1. Verify the cabling between the UPS and the BPCs. 2. Refer to the manufacturer’s documentation for the UPS. 3. Call service support. 101B E106 Critical logic over temperature Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211. 101B E206 Ambient room temperature too high 1. The ambient room temperature has reached a warning level or a critical level. Ask the customer to correct the room temperature problem. 2. Call service support. 101B F4x6 Hardware configuration does not support N-mode power Call service support. 101B F806 BPC B: loss of ac power or phase missing Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101B F906 BPC B: loss of ac power or phase missing Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101B FA06 BPC B: loss of ac power or phase missing Go to “MAP 152a: Loss of ac Power or Phase Missing” on page 214. 101B FDB6 BPC B: open room EPO switch has been detected Go to “MAP 1524: An Open Room EPO Switch Has Been Detected From One Side” on page 201. 101B FE06 UEPO switch on BPC B is in the bypass position Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199. 101B FE16 UEPO switch on BPC B is in the bypass position Go to “MAP 1522: UEPO Switch On The BPC Is In The Bypass Position” on page 199. 101B FE56 BPR 1B communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. Chapter 5. Error Code to FRU Index 451 Error Code Description Action/ Possible Failing FRU 101B FE66 BPR 2B communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101B FE76 BPR 3B communications fault Go to “MAP 1523: There Is a Bulk Power Regulator (BPR) Communications Fault” on page 200. 101B FFx6 Cabling error on the BPC or a BPD that is not isolated There is a cabling problem on a BPC or BPD that cannot be isolated.The x is decoded as follows:If x=1, the problem is on the BPC, location: U1.35-P2-X4If x=2, the problem is on BPD-1, location: U1.35-P2-X3 If x=3, the problem is on BPD-2, location: U1.35-P2-X2If x=4, the problem is on BPD-3, location: U1.35-P2-X1 1. Check the UPIC cabling on the BPC or BPD specified by decoding the x as shown above. Correct any cabling problems that are found. 2. If no problems are found on the A side of the BPE, check the UPIC cabling on the BPC or BPD in the same position on the B side of the BPE for cabling errors. Correct and problems that are found. 3. If both sides of the BPE are cabled correctly, call service support. 101B x115 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q1# to U1.18-X1/Q2# 101B x125 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q1# to U1.9-V1/Q1# 101B x135 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q1# to U2.1-V1/Q2# 101B x215 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q2# to U1.18-X2/Q2# 101B x225 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q2# to U1.9-V2/Q1# 101B x235 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q2# to U2.1-V2/Q2# 101B x315 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q3# to U1.18-X3/Q2# 101B x325 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q3# to U1.5-V1/Q1# 452 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101B x335 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q3# to U2.5-V1/Q2# 101B x415 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q4# to U1.18-X4/Q2# 101B x425 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q4# to U1.5-V2/Q1# 101B x435 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q4# to U2.5-V2/Q2# 101B x515 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q5# to U1.18-P1-V1/Q2# 101B x525 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q5# to U1.1-V1/Q1# 101B x535 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q5# to U2.9-V1/Q2# 101B x615 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q6# to U1.18-P1-V2/Q2# 101B x625 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q6# to U1.1-V2/Q1# 101B x635 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q6# to U2.9-V2/Q2# 101B x715 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X4/Q7# to U1.18-P1-V4/Q2# 101B x725 Power cable fault If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q7# to U1.13-V1/Q1#If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q7# to U1.18-P1-V8/Q2# 101B x735 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q7# to U2.13-V1/Q2# 101B x825 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q8# to U1.13-V2/Q1# If there is a 6th DCA in U1.18-P1-V8, this error code will not be reported. Chapter 5. Error Code to FRU Index 453 Error Code Description Action/ Possible Failing FRU 101B x835 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q8# to U2.13-V2/Q2# 101B x925 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q9# to U1.18-P1-V5/Q2# 101B x935 Power cable fault If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X2/Q9# to U2.19-V1/Q2# If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q9# to U1.13-V1/Q2# 101B x945 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P1-X1/Q9# to U2.19-V1/Q2# 101B xA25 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X3/Q10# to U1.18-P1-V6/Q2# 101B xA35 Power cable fault If there is no DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these location codes: U1.35-P1-X2/Q10# to U2.19-V2/Q2# If there is a 6th DCA in U1.18-P1-V8, go to“MAP 1520: Power” on page 192 with these location codes:U1.35-P2-X2/Q10# to U1.13-V2/Q2# 101B xA45 Power cable fault Go to “MAP 1520: Power” on page 192 with these location codes:U1.35-P1-X1/Q10# to U2.19-V2/Q2# 101B xx0x BPA B-FRU not isolated U1.35 101B xx11 BPA-B / BPC U1.35-P2-X4 101B xx13 Seeprom (VPD) fault on BPC-B U1.35-P2-X4 101B xx17 Fan fault on BPC-B (BPA-B fan) 1. BPE-B side fan Location: U1.35-P2-F1 2. BPC-B Location: U1.35-P2-X4 101B xx21 BPA-B / BPD-1 U1.35-P2-X3 101B xx22 BPC-B to BPD-1 communications fault 1. BPD-1 Location: U1.35-P2-X3 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2 101B xx23 Seeprom (VPD) fault on BPD-1B U1.35-P2-X3 101B xx31 BPA-B / BPD-2 U1.35-P2-X2 454 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101B xx32 BPC-B to BPD-2 communications fault 1. BPD-2 Location: U1.35-P2-X2 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2 101B xx33 Seeprom (VPD) fault on BPD-2B U1.35-P2-X2 101B xx41 BPA-B / BPD-3 U1.35-P2-X1 101B xx42 BPC-B to BPD-3 communications fault 1. BPD-3 Location: U1.35-P2-X1 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2 101B xx43 Seeprom (VPD) fault on BPD-3B U1.35-P2-X1 101B xx51 BPA-B / BPR-1 U1.35-P2-V1 101B xx52 BPC-B to BPR-1 communications fault 1. BPR-1 Location: U1.35-P2-V1 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2 101B xx53 Seeprom (VPD) fault on BPR-1B U1.35-P2-V1 101B xx61 BPA-B / BPR-2 U1.35-P2-V2 101B xx62 BPC-B to BPR-2 communications fault 1. BPR-2 Location: U1.35-P2-V2 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2 101B xx63 Seeprom (VPD) fault on BPR-2B U1.35-P2-V2 101B xx71 BPA-B / BPR-3 U1.35-P2-V3 101B xx72 BPC-B to BPR-3 communications fault 1. BPR-3 Location: U1.35-P2-V3 2. BPC-B Location: U1.35-P2-X4 3. BPE backplane B Location: U1.35-P2 101B xx73 Seeprom (VPD) fault on BPR-3B U1.35-P2-V3 101B xx81 BPA-B / IBF 1 U1.13-P2-V3 101B xx83 Seeprom (VPD) fault on IBF-1B U1.13-P2-V3 101B xx91 BPA-B / IBF 2 U1.15-P2-V4 101B xx93 Seeprom (VPD) fault on IBF-2B U1.15-P2-V4 101B xxA1 BPA-B / IBF 3 U2.17-P2-V6 101B xxA3 Seeprom (VPD) fault on IBF-3B U2.17-P2-V6 Chapter 5. Error Code to FRU Index 455 Error Code Description Action/ Possible Failing FRU 101B xxB1 BPA-B / EPO_SW 1. Replace the UEPO switch,Location: U1-V1 2. Check the cable from BPC-B to the UEPO switch, Location U1-V1/Q2# 101C 0FB6 MCM 0 running over temperature 1. Verify that the MCM at U1.18-P1-C1 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C1 or its heatsink. 3. Verify that the MCM at U1.18-P1-C1 is installed correctly onto the processor backplane. 4. Call service support. 101C 0FC6 MCM 1 running over temperature 1. Verify that the MCM at U1.18-P1-C3 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C3 or its heatsink. 3. Verify that the MCM at U1.18-P1-C3 is installed correctly onto the processor backplane. 4. Call service support. 101C 0FD6 MCM 2 running over temperature 1. Verify that the MCM at U1.18-P1-C4 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C4 or its heatsink. 3. Verify that the MCM at U1.18-P1-C4 is installed correctly onto the processor backplane. 4. Call service support. 101C 0FE6 MCM 3 running over temperature 1. Verify that the MCM at U1.18-P1-C2 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C2 or its heatsink. 3. Verify that the MCM at U1.18-P1-C2 is installed correctly onto the processor backplane. 4. Call service support. 101C 1C06 Loss of airflow in processor subsystem resulting in over-temperature Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209 with location U1.18. 456 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101C 5Ax6 A DCA FRU whose firmware is too down-level to function with the power subsystem has been installed in the CEC. For the DCA to function, the power subsystem controller must download code to it. (The x indicates which DCA has the down-level firmware.) If the system is powered up: 1. Wait approximately 30 minutes, then press the green start-of-service button followed by the white end-of-service button. 2. Look at the service action event log in Service Focal Point. Verify that the 101C 5Ax6 error code did not reoccur; if it did not, the update was successful. If the 101C 5Ax6 error code reoccurs, call service support. If the system is powered down: a. Wait approximately 30 minutes. b. Look at the service action event log in Service Focal Point. Verify that the 101C 5Ax6 error code did not reoccur after 30 minutes. If it did not, the update was successful. If the 101C 5Ax6 error code reoccurs, call service support. 101C 6014 Overcurrent or overvoltage problem with 2.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6114 Overcurrent or overvoltage problem with 1.8V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6124 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6134 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6144 Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator) Go to “MAP 1520: Power” on page 192. 101C 6514 Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby) Go to “MAP 1520: Power” on page 192. 101C 6814 Overcurrent or overvoltage problem with 2.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6914 Overcurrent or overvoltage problem with 1.8V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6924 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6934 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C 6944 Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator) Go to “MAP 1520: Power” on page 192. 101C 6D14 Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby) Go to “MAP 1520: Power” on page 192. 101C 7E06 No MCMs detected OR the actual number of MCMs does not match the expected number of MCMs 1. Verify that the processor MCMs are plugged in a valid configuration. 2. Verify that the processor MCMs are seated properly. 3. Verify the processor VPD cards in the primary I/O book are of the proper number and are positioned and seated correctly. 4. Call service support. Chapter 5. Error Code to FRU Index 457 Error Code Description Action/ Possible Failing FRU 101C 7EB6 MCM 0 running over temperature 1. Verify that the MCM at U1.18-P1-C1 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C1 or its heatsink. 3. Verify that the MCM at U1.18-P1-C1 is installed correctly onto the processor backplane. 4. Call service support. 101C 7EC6 MCM 1 running over temperature 1. Verify that the MCM at U1.18-P1-C3 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C3 or its heatsink. 3. Verify that the MCM at U1.18-P1-C3 is installed correctly onto the processor backplane. 4. Call service support. 101C 7ED6 MCM 2 running over temperature 1. Verify that the MCM at U1.18-P1-C4 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C4 or its heatsink. 3. Verify that the MCM at U1.18-P1-C4 is installed correctly onto the processor backplane. 4. Call service support. 101C 7EE6 MCM 3 running over temperature 1. Verify that the MCM at U1.18-P1-C2 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C2 or its heatsink. 3. Verify that the MCM at U1.18-P1-C2 is installed correctly onto the processor backplane. 4. Call service support. 101C 7F06 No MCMs detected OR the actual number of MCMs does not match the expected number of MCMs Note: If the service processor is hung, this error code causes the service processor to hang. Refer to “Resetting the Service Processor” on page 819. 1. Verify that the processor MCMs are plugged in a valid configuration. 2. Verify that the processor MCMs are seated properly. 3. Verify the processor VPD cards in the primary I/O book are of the proper number and are positioned and seated correctly. 4. Call service support. 101C B014 Overcurrent or overvoltage problem with 2.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C B114 Overcurrent or overvoltage problem with 1.8V in processor subsystem Go to “MAP 1520: Power” on page 192. 458 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101C B124 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C B134 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C B144 Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator) Go to “MAP 1520: Power” on page 192. 101C B544 Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby) Go to “MAP 1520: Power” on page 192. 101C B814 Overcurrent or overvoltage problem with 2.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C B914 Overcurrent or overvoltage problem with 1.8V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C B924 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C B934 Overcurrent or overvoltage problem with 1.5V in processor subsystem Go to “MAP 1520: Power” on page 192. 101C B944 Overcurrent or overvoltage problem with 3.3V in processor subsystem (oscillator) Go to “MAP 1520: Power” on page 192. 101C BD44 Overcurrent or overvoltage problem with 5.0V in processor subsystem (autostart, standby) Go to “MAP 1520: Power” on page 192. 101C E006 Processor over-temperature detected Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211. 101C E106 Processor over-temperature detected Go to “MAP 1528: There Is A Processor (Critical/Warning) Overtemperature Fault” on page 211. 101C E1B6 MCM 0 running over temperature 1. Verify that the MCM at U1.18-P1-C1 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C1 or its heatsink. 3. Verify that the MCM at U1.18-P1-C1 is installed correctly onto the processor backplane. 4. Call service support. 101C E1C6 MCM 1 running over temperature 1. Verify that the MCM at U1.18-P1-C3 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C3 or its heatsink. 3. Verify that the MCM at U1.18-P1-C3 is installed correctly onto the processor backplane. 4. Call service support. Chapter 5. Error Code to FRU Index 459 Error Code Description Action/ Possible Failing FRU 101C E1D6 MCM 2 running over temperature 1. Verify that the MCM at U1.18-P1-C4 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C4 or its heatsink. 3. Verify that the MCM at U1.18-P1-C4 is installed correctly onto the processor backplane. 4. Call service support. 101C E1E6 MCM 3 running over temperature 1. Verify that the MCM at U1.18-P1-C2 is installed. 2. Verify that there are no airflow problems with the MCM at U1.18-P1-C2 or its heatsink. 3. Verify that the MCM at U1.18-P1-C2 is installed correctly onto the processor backplane. 4. Call service support. 101C F206 Error not isolated; VPD on BPR may be damaged Call service support. 101C F306 The power subsystem does not recognize the processor subsystem type or processor subsystem configuration Call service support. 101C F406 Hardware configuration cannot support N-mode power Call service support. 101C F606 The power subsystem does not recognize the processor subsystem type or processor subsystem configuration Call service support. 101C F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 460 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101C F536 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C F546 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C F556 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C F566 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C F576 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C F586 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. Chapter 5. Error Code to FRU Index 461 Error Code Description Action/ Possible Failing FRU 101C F596 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C F5A6 An N+1 boundary (DCA or fan) has dropped to N-mode status in the processor subsystem, location: U1.18 This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 101C xx11 Processor subsystem / DCA-1 U1.18-P1-V1 101C xx13 Seeprom (VPD) fault on processor subsystem DCA-1 U1.18-P1-V1 101C xx15 Processor subsystem / DCA-1 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V1/Q1 and U1.35-P1-X4/Q5 2. U1.18-P1-V1/Q2 and U1.35-P2-X4/Q5 101C xx21 Processor subsystem / DCA-2 U1.18-P1-V2 101C xx23 Seeprom (VPD) fault on processor subsystem DCA-2 U1.18-P1-V2 101C xx25 Processor subsystem / DCA-2 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V2/Q1 and U1.35-P1-X4/Q6 2. U1.18-P1-V2/Q2 and U1.35-P2-X4/Q6 101C xx31 Processor subsystem / DCA-3 U1.18-P1-V4 101C xx33 Seeprom (VPD) fault on processor subsystem DCA -3 U1.18-P1-V4 101C xx35 Processor subsystem / DCA-3 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V4/Q1 and U1.35-P1-X4/Q7 2. U1.18-P1-V4/Q2 and U1.35-P2-X4/Q7 101C xx41 Processor subsystem / DCA-4 U1.18-P1-V5 101C xx43 Seeprom (VPD) fault on processor subsystem DCA-4 U1.18-P1-V5 101C xx45 Processor subsystem / DCA-4 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V5/Q1 and U1.35-P1-X3/Q9 2. U1.18-P1-V5/Q2 and U1.35-P2-X3/Q9 101C xx51 Processor subsystem / DCA-5 U1.18-P1-V6 101C xx53 Seeprom (VPD) fault on processor subsystem DCA-5 U1.18-P1-V6 462 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 101C xx55 Processor subsystem / DCA-5 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V6/Q1 and U1.35-P1-X3/Q10 2. U1.18-P1-V6/Q2 and U1.35-P2-X3/Q10 101C xx61 Processor subsystem / DCA-6 (if installed) U1.18-P1-V8 101C xx63 Seeprom (VPD) fault on processor subsystem DCA-6 U1.18-P1-V8 101C xx65 Processor subsystem/DCA-6 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-P1-V8/Q1 and U1.35-P1-X3/Q7 2. U1.18-P1-V8/Q2 and U1.35-P2-X3/Q7 101C xx71 Processor subsystem / MDA-1 U1.18-X1 101C xx73 Seeprom (VPD) fault on processor subsystem MDA -1 U1.18-X1 101C xx75 Processor subsystem / MDA-1 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X1/Q1 and U1.35-P1-X4/Q1 2. U1.18-X1/Q2 and U1.35-P2-X4/Q1 101C xx77 Fan fault on processor subsystem fan 1 1. Fan 1 controllerLocation: U1.18-X1 2. Fan 1 Location: U1.18-F1 101C xx81 Processor subsystem / MDA-2 U1.18-X2 101C xx83 Seeprom (VPD) fault on processor subsystem MDA-2 U1.18-X2 101C xx85 Processor subsystem / MDA-2 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X2/Q1 and U1.35-P1-X4/Q2 2. U1.18-X2/Q2 and U1.35-P2-X4/Q2 101C xx87 Fan fault on processor subsystem fan 2 1. Fan 2 controllerLocation: U1.18-X2 2. Fan 2 Location: U1.18-F2 101C xx91 Processor subsystem / MDA-3 U1.18-X3 101C xx93 Seeprom (VPD) fault on processor subsystem MDA-3 U1.18-X3 101C xx95 Processor subsystem / MDA-3 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X3/Q1 and U1.35-P1-X4/Q3 2. U1.18-X3/Q2 and U1.35-P2-X4/Q3 101C xx97 Fan fault on processor subsystem fan 3 1. Fan 3 controllerLocation: U1.18-X3 2. Fan 3 Location: U1.18-F3 101C xxA1 Processor subsystem / MDA-4 U1.18-X4 Chapter 5. Error Code to FRU Index 463 Error Code Description Action/ Possible Failing FRU 101C xxA3 Seeprom (VPD) fault on processor subsystem MDA-4 U1.18-X4 101C xxA5 Processor subsystem / MDA-4 cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U1.18-X4/Q1 and U1.35-P1-X4/Q4 2. U1.18-X4/Q2 and U1.35-P2-X4/Q4 101C xxA7 Fan fault on processor subsystem fan 4 1. Fan 4 controllerLocation: U1.18-X4 2. Fan 4 Location: U1.18-F4 101F xxxx Power fault on high performance switch Go to the 45H/W4 Switch Planning, Installation, and Service Guide. 1021 1B17 Problem with fan 1 in first I/O subsystem in second rack (I/O subsystem 5) 1. DCA-1Location: U2.1-V1 2. Fan 1Location: U2.1-F1 1021 1B27 Problem with fan 2 in first I/O subsystem in second rack (I/O subsystem 5) 1. DCA-2Location: U2.1-V2 2. Fan 2Location: U2.1-F2 1021 1C06 Loss of airflow in first I/O subsystem in second rack, location: U2.1 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1021 6014 Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6024 Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6034 Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6044 Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6054 Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6814 Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6824 Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6834 Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6844 Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 6854 Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 7017 Problem with fan 3 in first I/O subsystem in second rack (I/O subsystem 5) 1. DCA-1Location: U2.1-V1 2. Fan 3Location: U2.1-F3 464 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1021 7027 Problem with fan 4 in first I/O subsystem in second rack (I/O subsystem 5) 1. DCA-2Location: U2.1-V2 2. Fan 4Location: U2.1-F4 1021 7117 Problem with fan 1 and fan 3 in first I/O subsystem in second rack (I/O subsystem 5) 1. DCA-1Location: U2.1-V1 2. Fans 1 and 3Location: U2.1-F1 and U2.1-F3 1021 7127 Problem with fan 2 and fan 4 in first I/O subsystem in second rack (I/O subsystem 5) 1. DCA-2Location: U2.1-V2 2. Fans 2 and 4Location: U2.1-F2 and U2.1-F4 1021 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1021 B014 Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B024 Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B034 Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B044 Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B054 Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B814 Overcurrent or overvoltage problem with 3.3V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B824 Overcurrent or overvoltage problem with 5.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B834 Overcurrent or overvoltage problem with 2.5V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B844 Overcurrent or overvoltage problem with 12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 B854 Overcurrent or overvoltage problem with -12.0V in 5th I/O subsystem Go to “MAP 1520: Power” on page 192. 1021 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. Chapter 5. Error Code to FRU Index 465 Error Code Description Action/ Possible Failing FRU 1021 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.1-P1. 1021 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.1Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.1-P2.Go to “MAP 1520: Power” on page 192. 1021 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1021 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1021 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1021 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1021 F4x6 Hardware configuration does not support N-mode power. Call Service Support 466 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1021 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem in the secondary rack, location: U2.1. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1021 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the first I/O subsystem in the secondary rack, location: U2.1. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1021 xx11 First I/O subsystem in second rack, DCA-1 U2.1-V1 1021 xx13 Seeprom fault on DCA-1 in first I/O subsystem secondary rack U2.1-V1 1021 xx15 First I/O subsystem in second rack, DCA-1, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.1-V1/Q1 and U1.35-P2-X2/Q1 2. U2.1-V1/Q2 and U1.35-P1-X2/Q1 1021 xx17 Problem with fan 1 and fan 3 in first I/O subsystem in second rack 1. DCA-1Location U2.1–V1 2. Fan 1,Location U2.1–F1 3. Fan 3,Location U2.1–F3 1021 xx21 First I/O subsystem in second rack, DCA-2 U2.1-V2 1021 xx23 Seeprom fault on DCA-2 in first I/O subsystem secondary rack U2.1-V2 1021 xx25 First I/O subsystem in second rack, DCA-2, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.1-V2/Q1 and U1.35-P2-X2/Q2 2. U2.1-V2/Q2 and U1.35-P1-X2/Q2 1021 xx27 Problem with fan 2 and fan 4 in first I/O subsystem in second rack 1. DCA-2Location U2.1–V2 2. Fan 2,Location U2.1–F2 3. Fan 4,Location U2.1–F4 1021 xx31 First I/O subsystem in secondary rack I/O subsystem backplane 1 fault U2.1-P1 1021 xx33 Seeprom fault on I/O subsystem backplane 1 in first I/O subsystem in secondary rack U2.1-P1 1021 xx41 First I/O subsystem in secondary rack I/O subsystem backplane 2 fault U2.1-P2 Chapter 5. Error Code to FRU Index 467 Error Code Description Action/ Possible Failing FRU 1021 xx43 Seeprom fault on I/O subsystem backplane 2 in first I/O subsystem in secondary rack U2.1-P2 1021 xx51 First I/O subsystem in secondary rack DASD backplane 1 fault U2.1-P3 1021 xx53 Seeprom fault on DASD backplane 1 in first I/O subsystem in secondary rack U2.1-P3 1021 xx61 First I/O subsystem in secondary rack DASD backplane 2 fault U2.1-P4 1021 xx63 Seeprom fault on DASD backplane 2 in first I/O subsystem in secondary rack U2.1-P4 1021 xx71 First I/O subsystem in secondary rack DASD backplane 3 fault U2.1-P5 1021 xx73 Seeprom fault on DASD backplane 3 in first I/O subsystem in secondary rack U2.1-P5 1021 xx81 First I/O subsystem in secondary rack DASD backplane 4 fault U2.1-P6 1021 xx83 Seeprom fault on DASD backplane 4 in first I/O subsystem in secondary rack U2.1-P6 1022 1B17 Problem with fan 1 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-1Location: U2.5-V1 2. Fan 1Location: U2.5-F1 1022 1B27 Problem with fan 2 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-2Location: U2.5-V2 2. Fan 2Location: U2.5-F2 1022 1C06 Loss of airflow in second I/O subsystem in second rack (I/O subsystem 6), location: U2.5 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1022 6014 Overcurrent or overvoltage problem with 3.3V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6024 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6034 Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6044 Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6054 Overcurrent or overvoltage problem with -12.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6814 Overcurrent or overvoltage problem with 3.3V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 468 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1022 6824 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6834 Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6844 Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 6854 Overcurrent or overvoltage problem with -12.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 7017 Problem with fan 3 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-1Location: U2.5-V1 2. Fan 3Location: U2.5-F3 1022 7027 Problem with fan 4 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-2Location: U2.5-V2 2. Fan 4Location: U2.5-F4 1022 7117 Problem with fan 1 and fan 3 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-1Location: U2.5-V1 2. Fans 1 and 3Location: U2.5-F1 and U2.5-F3 1022 7127 Problem with fan 2 and fan 4 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-2Location: U2.5-V2 2. Fans 2 and 4Location: U2.5-F2 and U2.5-F4 1022 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1022 B014 Overcurrent or overvoltage problem with 3.3V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 B024 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 B034 Overcurrent or overvoltage problem with 2.5V in 6th I/O subsystem Go to “MAP 1520: Power” on page 192. 1022 B044 Overcurrent or overvoltage problem with 12.0V in6th I/O subsystem Go to “MAP 1520: Power” on page 192. 1022 B054 Overcurrent or overvoltage problem with -12.0V in 6th I/O subsystem Go to “MAP 1520: Power” on page 192. 1022 B814 Overcurrent or overvoltage problem with 3.3V in 6th I/O subsystem Go to “MAP 1520: Power” on page 192. 1022 B824 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 469 Error Code Description Action/ Possible Failing FRU 1022 B834 Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 B844 Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 B854 Overcurrent or overvoltage problem with -12.0V in second I/O subsystem in second rack (I/O subsystem 6) Go to “MAP 1520: Power” on page 192. 1022 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 1022 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.5Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.5-P1. 1022 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.5Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.5-P2. 470 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1022 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1022 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1022 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1022 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1022 F4x6 Hardware configuration does not support N-mode power. Call Service Support 1022 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in second I/O subsystem in second rack (I/O subsystem 6), location: U2.5. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1022 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the second I/O subsystem in the secondary rack, location: U2.5. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1022 xx11 Second I/O subsystem in second rack (I/O subsystem 6), DCA-1 U2.5-V1 1022 xx13 Seeprom fault on DCA-1 in second I/O subsystem in second rack (I/O subsystem 6) U2.5-V1 1022 xx15 Second I/O subsystem in second rack (I/O subsystem 6), DCA-1, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.5-V1/Q1 and U1.35-P2-X2/Q3 2. U2.5-V1/Q2 and U1.35-P1-X2/Q3 1022 xx17 Problem with fan 1 and fan 3 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-1Location U2.5–V1 2. Fan 1,Location U2.5–F1 3. Fan 3,Location U2.5–F3 1022 xx21 Second I/O subsystem in second rack (I/O subsystem 6), DCA-2 U2.5-V2 1022 xx23 Seeprom fault on DCA-2 in second I/O subsystem in secondary rack (I/O subsystem 6) U2.5-V2 1022 xx25 Second I/O subsystem in second rack (I/O subsystem 6), DCA-2, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.5-V2/Q1 and U1.35-P2-X2/Q4 2. U2.5-V2/Q2 and U1.35-P1-X2/Q4 Chapter 5. Error Code to FRU Index 471 Error Code Description Action/ Possible Failing FRU 1022 xx27 Problem with fan 2 and fan 4 in second I/O subsystem in second rack (I/O subsystem 6) 1. DCA-2Location U2.5–V2 2. Fan 2,Location U2.5–F2 3. Fan 4,Location U2.5–F4 1022 xx31 Second I/O subsystem in secondary rack (I/O subsystem 6) I/O subsystem backplane 1 fault U2.5-P1 1022 xx33 Seeprom fault on I/O subsystem backplane 1 in second I/O subsystem in secondary rack (I/O subsystem 6)Broken 12C Bus U2.5-P1 1022 xx41 Second I/O subsystem in secondary rack (I/O subsystem 6) I/O subsystem backplane 2 fault U2.5-P2 1022 xx43 Seeprom fault on I/O subsystem backplane 2 in second I/O subsystem in secondary rack (I/O subsystem 6) U2.5-P2 1022 xx51 Second I/O subsystem in secondary rack (I/O subsystem 6) DASD backplane 1 fault U2.5-P3 1022 xx53 Seeprom fault on DASD backplane 1 in second I/O subsystem in secondary rack (I/O subsystem 6) U2.5-P3 1022 xx61 Second I/O subsystem in secondary rack (I/O subsystem 6), DASD backplane 2 fault U2.5-P4 1022 xx63 Seeprom fault on DASD backplane 2 in second I/O subsystem in secondary rack (I/O subsystem 6) U2.5-P4 1022 xx71 Second I/O subsystem in secondary rack (I/O subsystem 6), DASD backplane 3 fault U2.5-P5 1022 xx73 Seeprom fault on DASD backplane 3 in second I/O subsystem in secondary rack (I/O subsystem 6) U2.5-P5 1022 xx81 Second I/O subsystem in secondary rack (I/O subsystem 6) DASD backplane 4 fault U2.5-P6 1022 xx83 Seeprom fault on DASD backplane 4 in second I/O subsystem in secondary rack (I/O subsystem 6) U2.5-P6 1023 1B17 Problem with fan 1 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-1Location: U2.9-V1 2. Fan 1Location: U2.9-F1 1023 1B27 Problem with fan 2 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-2Location: U2.9-V2 2. Fan 2Location: U2.9-F2 1023 1C06 Loss of airflow in fourth I/O subsystem, with IBFlocation: U2.9 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 472 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1023 6014 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6024 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6034 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6044 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6054 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6814 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6824 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6834 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6844 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 6854 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 7017 Problem with fan 3 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-1Location: U2.9-V1 2. Fan 3Location: U2.9-F3 1023 7027 Problem with fan 4 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-2Location: U2.9-V2 2. Fan 4Location: U2.9-F4 1023 7117 Problem with fan 1 and fan 3 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-1Location: U2.9-V1 2. Fans 1 and 3Location: U2.9-F1 and U2.9-F3 1023 7127 Problem with fan 2 and fan 4 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-2Location: U2.9-V2 2. Fans 2 and 4Location: U2.9-F2 and U2.9-F4 1023 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. Chapter 5. Error Code to FRU Index 473 Error Code Description Action/ Possible Failing FRU 1023 B014 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B024 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B034 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B044 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B054 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B814 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B824 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B834 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B844 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 B854 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem (with IBF, moves to second rack) Go to “MAP 1520: Power” on page 192. 1023 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 474 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1023 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.9Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.9-P1. 1023 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.9Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.9-P2. 1023 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1023 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1023 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1023 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1023 F4x6 Hardware configuration does not support N-mode power. Call Service Support Chapter 5. Error Code to FRU Index 475 Error Code Description Action/ Possible Failing FRU 1023 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth subsystem (with IBF, location: U2.9. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1023 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in the fourth I/O subsystem (with IBF), location: U2.9. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1023 xx11 Fourth I/O subsystem moves to second rack (with IBF), DCA-1 U2.9-V1 1023 xx13 Seeprom fault on DCA-1 in fourth I/O subsystem (with IBF) in secondary rack U2.9-V1 1023 xx15 Fourth I/O subsystem moves to second rack (with IBF), DCA-1, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.9-V1/Q1 and U1.35-P2-X2/Q5 2. U2.9-V1/Q2 and U1.35-P1-X2/Q5 1023 xx17 Problem with fan 1 and fan 3 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-1Location U2.9–V1 2. Fan 1,Location U2.9–F1 3. Fan 3,Location U2.9–F3 1023 xx21 Fourth I/O subsystem moves to second rack (with IBF), DCA-2 U2.9-V2 1023 xx23 Seeprom fault on DCA-2 in fourth I/O subsystem (with IBF) in secondary rack U2.9-V2 1023 xx25 Fourth I/O subsystem moves to second rack (with IBF), DCA-2, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.9-V2/Q1 and U1.35-P2-X2/Q6 2. U2.9-V2/Q2 and U1.35-P1-X2/Q6 1023 xx27 Problem with fan 2 and fan 4 in fourth I/O subsystem (with IBF, moves to second rack) 1. DCA-2Location U2.9–V2 2. Fan 2,Location U2.9–F2 3. Fan 4,Location U2.9–F4 1023 xx31 Fourth I/O subsystem (with IBF) in secondary rack I/O subsystem backplane 1 fault U2.9-P1 476 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1023 xx33 Seeprom fault on I/O subsystem backplane 1 in fourth I/O subsystem (with IBF) in secondary rack U2.9-P1 1023 xx41 Fourth I/O subsystem (with IBF) in secondary rack I/O subsystem backplane 2 fault U2.9-P2 1023 xx43 Seeprom fault on I/O subsystem backplane 2 in fourth I/O subsystem (with IBF) in secondary rack U2.9-P2 1023 xx51 Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 1 fault U2.9-P3 1023 xx53 Seeprom fault on DASD backplane 1 in fourth I/O subsystem (with IBF) in secondary rack U2.9-P3 1023 xx61 Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 2 fault U2.9-P4 1023 xx63 Seeprom fault on DASD backplane 2 in fourth I/O subsystem (with IBF) in secondary rack U2.9-P4 1023 xx71 Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 3 fault U2.9-P5 1023 xx73 Seeprom fault on DASD backplane 3 in fourth I/O subsystem (with IBF) in secondary rack U2.9-P5 1023 xx81 Fourth I/O subsystem (with IBF) in secondary rack DASD backplane 4 fault U2.9-P6 1023 xx83 Seeprom fault on DASD backplane 4 in fourth I/O subsystem (with IBF) in secondary rack U2.9-P6 1024 1B17 Problem with fan 1 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-1Location: U2.13-V1 2. Fan 1Location: U2.13-F1 1024 1B27 Problem with fan 2 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-2Location: U2.13-V2 2. Fan 2Location: U2.13-F2 1024 1C06 Loss of airflow in third I/O subsystem in second rack (I/O subsystem 7), location: U2.13 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1024 6014 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6024 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6034 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 477 Error Code Description Action/ Possible Failing FRU 1024 6044 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6054 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6814 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6824 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6834 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6844 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 6854 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 7017 Problem with fan 3 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-1Location: U2.13-V1 2. Fan 3Location: U2.13-F3 1024 7027 Problem with fan 4 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-2Location: U2.13-V2 2. Fan 4Location: U2.13-F4 1024 7117 Problem with fan 1 and fan 3 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-1Location: U2.13-V1 2. Fans 1 and 3Location: U2.13-F1 and U2.13-F3 1024 7127 Problem with fan 2 and fan 4 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-2Location: U2.13-V2 2. Fans 2 and 4Location: U2.13-F2 and U2.13-F4 1024 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1024 B014 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 478 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1024 B024 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B034 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B044 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B054 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B814 Overcurrent or overvoltage problem with 3.3V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B824 Overcurrent or overvoltage problem with 5.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B834 Overcurrent or overvoltage problem with 2.5V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B844 Overcurrent or overvoltage problem with 12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 B854 Overcurrent or overvoltage problem with -12.0V in third I/O subsystem in second rack (I/O subsystem 7). Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. Chapter 5. Error Code to FRU Index 479 Error Code Description Action/ Possible Failing FRU 1024 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.13-P1. 1024 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.13Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.13-P2. 1024 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Location:U2.13 Go to “MAP 1520: Power” on page 192. 1024 F4x6 Hardware configuration does not support N-mode power. Call Service Support 480 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1024 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in third I/O subsystem in second rack (I/O subsystem 7), location: U2.13. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1024 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in third I/O subsystem in second rack (I/O subsystem 7), location: U2.13. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1024 xx11 Third I/O subsystem in second rack (I/O subsystem 7), DCA-1 U2.13-V1 1024 xx13 Seeprom fault on DCA-1 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-V1 1024 xx15 Third I/O subsystem in second rack (I/O subsystem 7), DCA-1, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.13-V1/Q1 and U1.35-P2-X2/Q7 2. U2.13-V1/Q2 and U1.35-P1-X2/Q7 1024 xx17 Problem with fan 1 and fan 3 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-1Location U2.13–V1 2. Fan 1,Location U2.13–F1 3. Fan 3,Location U2.13–F3 1024 xx21 Third I/O subsystem in second rack (I/O subsystem 7), DCA-2 U2.13-V2 1024 xx23 Seeprom fault on DCA-2 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-V2 1024 xx25 Third I/O subsystem in second rack (I/O subsystem 7), DCA-2, cable fault Go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.13-V2/Q1 and U1.35-P2-X2/Q8 2. U2.13-V2/Q2 and U1.35-P1-X2/Q8 1024 xx27 Problem with fan 2 and fan 4 in third I/O subsystem in second rack (I/O subsystem 7) 1. DCA-2Location U2.13–V2 2. Fan 2,Location U2.13–F2 3. Fan 4,Location U2.13–F4 1024 xx31 Third I/O subsystem in second rack (I/O subsystem 7) I/O subsystem backplane 1 fault U2.13-P1 Chapter 5. Error Code to FRU Index 481 Error Code Description Action/ Possible Failing FRU 1024 xx33 Seeprom fault on I/O subsystem backplane 1 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-P1 1024 xx41 Third I/O subsystem in second rack (I/O subsystem 7) I/O subsystem backplane 2 fault U2.13-P2 1024 xx43 Seeprom fault on I/O subsystem backplane 2 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-P2 1024 xx51 Third I/O subsystem in second rack (I/O subsystem 7) DASD backplane 1 fault U2.13-P3 1024 xx53 Seeprom fault on DASD backplane 1 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-P3 1024 xx61 Third I/O subsystem in second rack (I/O subsystem 7), DASD backplane 2 fault U2.13-P4 1024 xx63 Seeprom fault on DASD backplane 2 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-P4 1024 xx71 Third I/O subsystem in second rack (I/O subsystem 7), DASD backplane 3 fault U2.13-P5 1024 xx73 Seeprom fault on DASD backplane 3 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-P5 1024 xx81 Third I/O subsystem in second rack (I/O subsystem 7) DASD backplane 4 fault U2.13-P6 1024 xx83 Seeprom fault on DASD backplane 4 in third I/O subsystem in second rack (I/O subsystem 7) U2.13-P6 1025 1B17 Problem with fan 1 in fourth I/O subsystem in second rack (I/O subsystem 8) 1. DCA-1Location: U2.19-V1 2. Fan 1Location: U2.19-F1 1025 1B27 Problem with fan 2 in fourth I/O subsystem in second rack (I/O subsystem 8) 1. DCA-2Location: U2.19-V2 2. Fan 2Location: U2.19-F2 1025 1C06 Loss of airflow in fourth I/O subsystem in second rack (I/O subsystem 8), location: U2.19 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. 1025 6014 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6024 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6034 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6044 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 482 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1025 6054 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6814 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6824 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6834 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6844 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 6854 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 7017 Problem with fan 3 in fourth I/O subsystem in second I/O subsystem in second rack (I/O subsystem 8) 1. DCA-1Location: U2.19-V1 2. Fan 3Location: U2.19-F3 1025 7027 Problem with fan 4 in fourth I/O subsystem in second rack (I/O subsystem 8) 1. DCA-2Location: U2.19-V2 2. Fan 4Location: U2.19-F4 1025 7117 Problem with fan 1 and fan 3 in fourth I/O subsystem in second rack (I/O subsystem 8) 1. DCA-1Location: U2.19-V1 2. Fans 1 and 3Location: U2.19-F1 and U2.19-F3 1025 7127 Problem with fan 2 and fan 4 in fourth I/O subsystem in second rack (I/O subsystem 8) 1. DCA-2Location: U2.19-V2 2. Fans 2 and 4Location: U2.19-F2 and U2.19-F4 1025 8A00 The SPCN firmware has detected a change in the drawer configuration Informational message. 1025 B014 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B024 Overcurrent or overvoltage problem with 5.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B034 Overcurrent or overvoltage problem with 2.5V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B044 Overcurrent or overvoltage problem with 12.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. Chapter 5. Error Code to FRU Index 483 Error Code Description Action/ Possible Failing FRU 1025 B054 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B814 Overcurrent or overvoltage problem with 3.3V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B824 Overcurrent or overvoltage problem with 5.0V in second I/O subsystem in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B834 Overcurrent or overvoltage problem with 2.5V in second I/O subsystem in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B844 Overcurrent or overvoltage problem with 12.0V in second I/O subsystem in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 B854 Overcurrent or overvoltage problem with -12.0V in fourth I/O subsystem in second rack (I/O subsystem 8) Go to “MAP 1520: Power” on page 192. 1025 BF06 Unable to resolve the chassis VPD in an I/O subsystem The chassis VPD in the I/O subsystem drawer identified by the location code could not be resolved. This may be an indication that more than one FRU in the drawer was replaced during a service action. If this is the case, reinstall the original FRUs in the drawer, then replace the FRUs one at a time. If only one FRU was replaced and this error code occurred, call service support. 1025 CC36 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.19Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.19-P1. 484 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1025 CC46 Description: Loss of pgood on an I/O planar in the I/O drawer at location U2.19Action:This error code can be caused by hot-plugging RIO cables. To see if this is a recoverable error, do the following: 1. Power down the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. 2. After the system is shut down, 350V dc must be removed from the DCAs in the I/O drawer that reported the error: v If a High Performance Switch (7045-SW4) is installed in the CEC frame, follow the instructions in “Powering Off and Powering On the System” on page 838 for deactivating a system’s I/O drawer after the system is shut down. v If a High Performance Switch (7045-SW4) is not present in the CEC frame, use the UEPO switch to remove 350V dc from all of the drawers in the frame.3. Then boot the system by following the appropriate procedure in “Powering Off and Powering On the System” on page 838. At any point after the service processor comes up to standby, indicated by OK in the operator panel in the media drawer, look at the service action event log in Service Focal Point on the HMC. If this error code does not reoccur, no action is required. If this error code reoccurs, replace the I/O planar, location: U2.19-P2. 1025 CD96 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1025 CDA6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1025 CDB6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1025 CDC6 A power short has been detected in the DASD subsystem in an I/O drawer. Go to “MAP 1520: Power” on page 192. 1025 F4x6 Hardware configuration does not support N-mode power. Call Service Support 1025 F516 An N+1 boundary (DCA or fan) has dropped to N-mode status in fourth I/O subsystem in second rack (I/O subsystem 8), location: U2.19. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1025 F526 An N+1 boundary (DCA or fan) has dropped to N-mode status in fourth I/O subsystem in second rack (I/O subsystem 8), location: U2.19. This is normally an informational message. It should be accompanied by other error codes that more precisely indicate where the failure is. Examine the service processor error log. If there are no other power-related error codes (in the time frame in which this code was generated) in the service processor error log, the FRU indicated with this error code, if any, should be replaced. 1025 xx11 Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-1 U2.19-V1 1025 xx13 Seeprom fault on DCA-1 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-V1 Chapter 5. Error Code to FRU Index 485 Error Code Description Action/ Possible Failing FRU 1025 xx15 Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-1, cable fault If there is no DCA in U1.18-P1-V8, second rack (I/O go to “MAP 1520: Power” on page 192 subsystem 8) with these pairs of location codes: 1. U2.19-V1/Q1 and U1.35-P2-X2/Q9 2. U2.19-V1/Q2 and U1.35-P1-X2/Q9If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.19-V1/Q1 and U1.35-P2-X1/Q9 2. U2.19-V1/Q2 and U1.35-P1-X1/Q9 1025 xx17 Problem with fan 1 and fan 3 in fourth I/O subsystem in second rack (I/O subsystem 8) 1. DCA-1Location U2.19–V1 2. Fan 1,Location U2.19–F1 3. Fan 3,Location U2.19–F3 1025 xx21 Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-2 U2.19-V2 1025 xx23 Seeprom fault on DCA-2 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-V2 1025 xx25 Fourth I/O subsystem in second rack (I/O subsystem 8), DCA-2, cable fault If there is no DCA in U1.18-P1-V8, second rack (I/O go to “MAP 1520: Power” on page 192 subsystem 8), DCA-2, with these pairs of location codes: 1. U2.19-V2/Q1 and U1.35-P2-X2/Q10 2. U2.19-V2/Q2 and U1.35-P1-X2/Q10If there is a 6th DCA in U1.18-P1-V8, go to “MAP 1520: Power” on page 192 with these pairs of location codes: 1. U2.19-V2/Q1 and U1.35-P2-X1/Q10 2. U2.19-V2/Q2 and U1.35-P1-X1/Q10 1025 xx27 Problem with fan 2 and fan 4 in fourth I/O subsystem in second rack (I/O subsystem 8) 1. DCA-2Location U2.19–V2 2. Fan 2,Location U2.19–F2 3. Fan 4,Location U2.19–F4 1025 xx31 Fourth I/O subsystem in second rack (I/O subsystem 8) I/O subsystem backplane 1 fault U2.19-P1 1025 xx33 Seeprom fault on I/O subsystem backplane 1 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-P1 1025 xx41 Fourth I/O subsystem in second rack (I/O subsystem 8) I/O subsystem backplane 2 fault U2.19-P2 1025 xx43 Seeprom fault on I/O subsystem backplane 2 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-P2 486 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 1025 xx51 Fourth I/O subsystem in second rack (I/O subsystem 8) DASD backplane 1 fault U2.19-P3 1025 xx53 Seeprom fault on DASD backplane 1 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-P3 1025 xx61 Fourth I/O subsystem in second rack (I/O subsystem 8), DASD backplane 2 fault U2.19-P4 1025 xx63 Seeprom fault on DASD backplane 2 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-P4 1025 xx71 Fourth I/O subsystem in second rack (I/O subsystem 8), DASD backplane 3 fault U2.19-P5 1025 xx73 Seeprom fault on DASD backplane 3 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-P5 1025 xx81 Fourth I/O subsystem in second rack (I/O subsystem 8) DASD backplane 4 fault U2.19-P6 1025 xx83 Seeprom fault on DASD backplane 4 in fourth I/O subsystem in second rack (I/O subsystem 8) U2.19-P6 1RRU 1C06 Go to “MAP 1527: An Airflow Loss Has Been Detected” on page 209. SPCNtoA Go to “MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure” on page 212. SPCNtoB Go to “MAP 1529: There Is A Bulk Power Assembly (BPA) Communication Failure” on page 212. Chapter 5. Error Code to FRU Index 487 Firmware/POST Error Codes Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call service support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the operating system error log, remember that an error may require an additional 15 minutes to be propagated to the operating system error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Error Code Description Action/ Possible Failing FRU 2011 00xy I/O subsystem power supply failure 1. Replace power supply “y” in I/O subsystem “x”. If y=_, then the power supply is: y = power supply 3 = Both power supplies (I/O subsystem) 2 = Reserved 1 = Reserved See “AIX and Physical Location Code Tables” in Chapter 1, for location codes. 2. Check ac power source to I/O subsystem “x”. See “AIX and Physical Location Code Tables” in Chapter 1, for location codes. 2022 0000 The I/O bus is running slowly 1. Check cabling. 2. Call second level of support. 488 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 203w 0xyz Remote I/O (RIO) configuration warning. This code indicates that a RIO configuration was detected that does not result in complete loops. The system continues to boot, however performance or recovery capability may be reduced. In some cases, the problem may result in missing I/O. Remote I/O subsystems are connected in loops. Use the letters w, y, and z in the error code to isolate the location of the open loop. Check RIO cables for loose connections, and verify power on the I/O subsystems by the presence of fan motion. The following numbers may be used to help isolate the failing component. v w = loop number, this is either 0 or 1 in a system with 2 RIO loops. v x = Always 0. v y is either 1 or 2 on a system with 2 loops. y indicates the nearest associated RIO port number on the processor subsystem according to the following table: y = Nearest RIO Port 1 = 0 2 = 1 3 = 2 4 = 3 v z = B, indicates a missing return line from the I/O subsystem to the processor subsystem. v z = C, indicates a missing link between two I/O subsystems. v z = E, indicates an I/O subsystem was found connected to RIO port 3 with no return to the processor subsystem, and no I/O subsystem was found connected to RIO port 2. In this case, the I/O connected to RIO port 3 is removed from the configuration since the cause of the error and the proper location of the I/O cannot be determined. Go to “MAP 1542: I/O Problem Isolation” on page 318. 20A8 0000 Insufficient information to boot. Verify the IP address in the SMS menu. 20A8 0001 Client IP address is already in use by other network device Verify the IP address in the SMS menu. 20A8 0002 Cannot get gateway IP address Refer to Table 2 on page 379 using code E174. 20A8 0003 Cannot get server hardware address Refer to Table 2 on page 379 using code E174. 20A8 0004 BootP failed Refer to Table 2 on page 379 using code E175. 20A8 0005 File transmission (TFTP) failed. Check network connection, try again. 20A8 0006 Boot image too large Contact your next level of support for assistance. Chapter 5. Error Code to FRU Index 489 Error Code Description Action/ Possible Failing FRU 20D0 0001 I/O subsystem has changed locations This is an informational message. The I/O subsystem at the location code associated with this messages was found at another location code during a prior boot of the system. If an I/O subsystem has not been moved, complete the following steps: 1. Boot the system and match the I/O subsystem location codes reported by the operating system to the subsystems attached to the CEC. Check the SPCN and RIO cabling to the I/O subsystems. If a cabling problem is found, correct it. 2. If no cabling problem is found, call your next level of support. 20D0 000F Self-test failed on device, no error and/or location code information available Refer to “Determining Location Code” on page 415 to see if a location code is available for the device the error was reported against. 1. Replace device indicated by the location code (if available). 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. 20D0 0010 Selftest failed on device, cannot locate package Contact your next level of support for assistance. 20D0 0011 Firmware RTAS attempt to allocate memory failed. Contact your next level of support for assistance. 20D0 0800 Firmware update failure: partition does not have service authority Using the HMC, grant service authority to that partition. Reboot the partition. Retry the flash update. 20D0 0801 System firmware update failure: more than one partition running Shut down all other partitions, then retry the firmware update. 20D0 0997 Operating system terminated with error string Call service support. A hypervisor dump may be available; see “Hypervisor Dump Retrieval Procedure” on page 757. 20D0 0A01 Error initializing RTAS NVRAM 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A02 Error initializing RTAS debugging token 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A03 Error initializing interface/service processor access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 490 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 20D0 0A04 Error initializing interface/ operator panel access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A05 Error initializing hardware access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A06 Error initializing interface/PCI access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A07 Error initializing interface/operating system access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A08 Error initializing interface/ client_interface access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A09 Error initializing interface/NUMA access 1. Check for system firmware updates. Apply update if available. 2. Check for system firmware updates. Apply update if available. 20D0 0A0A Error initializing interface/interrupt access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0B Error initializing interface/OFDT access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0C Error initializing interface/sensor access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0D Error initializing interface/sensors/led_table access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0E Error initializing interface/timebase access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. Chapter 5. Error Code to FRU Index 491 Error Code Description Action/ Possible Failing FRU 20D0 0A0F Error initializing interface/time-of-day access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A10 Error initializing interface/cache access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing interface/power access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing interface/hot-plug access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing interface/log access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing interface/EEH access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing interface/error injection access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing interface/handling access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing interface/utility access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A1 Error initializing register softpatch access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0998 Operating system terminated with error string Call service support. A hypervisor dump may be available; see “Hypervisor Dump Retrieval Procedure” on page 757. 492 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 20D0 0999 Operating system terminated with error string Call service support. A hypervisor dump may be available; see “Hypervisor Dump Retrieval Procedure” on page 757. 20D0 9001 Subsystem information not found in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. 20D0 9002 Subsystem machine type and model information not found in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. 20D0 9003 Subsystem serial number information not found in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. 20D0 9004 Subsystem machine type and model field is empty in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. Chapter 5. Error Code to FRU Index 493 Error Code Description Action/ Possible Failing FRU 20D0 9005 Subsystem serial number field is empty in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. 20D0 9006 Duplicate subsystem serial number found in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. 20D0 9007 Invalid subsystem machine type and model information in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. 20D0 9008 Invalid subsystem serial number found in VPD 1. If the location code starts with U1.17 or U1.18, the problem is in the system VPD module on the media subsystem operator panel. Replace the operator panel; do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the VPD information into the new VPD module. 2. If the location code indicates an I/O subsystem, call service support. 20D1 0001 Failed to send state-change message to service processor 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0002 Failed to disable heartbeat alarm 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 494 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 20D1 0003 Failed to send boot status in LPAR partition 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0004 Failed to locate service processor device tree node 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0005 Failed to reboot system in LPAR mode 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0006 Failed to reboot a partition in LPAR mode 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0007 Failed to set PCI read/write permissions array 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0008 Failed to set PCI read/write permissions array for slot 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0009 Missing or invalid subsystem serial number 1. Check for system firmware updates. Apply update if available. 2. If a location code is shown with this error code, replace the part at that location. 3. Contact your next level of support for assistance.Note: The location code identifies the failing subsystem. 20D1 000A Failed to send boot failed message to service processor 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 000B Failed to reset PCI read/write permissions array 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 000C Failed to send LMB-TABLE-INFO mailbox repair 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. Chapter 5. Error Code to FRU Index 495 Error Code Description Action/ Possible Failing FRU 20D1 000D Hypervisor function to get time-of-day failed 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 000F Hypervisor call to reset PHB or to check for drawer existence failed 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0010 Stack underflow from fiber channel adapter 1. Check for microcode updates for the fiber channel adapter. Apply update if available. 2. Check for system firmware updates. Apply update if available. 3. Contact your next level of support for assistance. 20D1 0011 A recovery operation on an internal I/O control chip failed 1. If the system is partitioned, perform a hard reset on the partition that owns the drawer specified by the location code reported with the error. Reboot the partition. If the error reoccurs, replace the I/O planar specified by the location code. If the system is running in full system partition mode, or an HMC is not attached to the system, reboot the system. If the error reoccurs, replace the I/O planar specified by the location code. 2. Call service support. 20D1 0012 Attention: One of the I/O subsystems has been disconnected from the system This indicates that an I/O subsystem that was previously in the configuration was not found during the current boot. If an I/O subsystem has been removed, this code is an informational message. If this message occurs during a system boot when the system’s drawer configuration has not changed, complete the following steps: 1. If not already booted, boot the system and match the I/O subsystem location codes reported by the operating system to the subsystems attached to the CEC. Check the SPCN and RIO cabling to the I/O subsystems. If a cabling problem is found, correct it. 2. Check for system firmware updates. Apply update if available. 3. Contact your next level of support for assistance. 20E0 0001 Privileged-access password entry error. The password has been entered incorrectly. Try installing the password again. 496 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 20E0 0004 Battery drained or needs replacement 1. Replace the battery,Location: U1.18-P1-H2-V1 2. If problem persists, replace the primary I/O book, Location: U1.18-P1-H2 20E0 0009 Invalid password entered - system locked The password has been entered incorrectly 3 times. Turn off, then turn on the system unit, then enter the password correctly. 20EE 0003 IP parameter requires 3 dots ?.? Enter valid IP parameter. Example: 000.000.000.000 20EE 0004 Invalid IP parameter Enter valid IP parameter. Example: 000.000.000.000 20EE 0005 Invalid IP parameter (>255) Enter valid IP parameter. Example: 255.192.002.000 20EE 0007 Keyboard not found 1. Plug in keyboard. 2. Replace the USB card the keyboard is plugged into. 20EE 0008 No configurable adapters found in the system by the remote IPL menu in the SMS utilities This error occurs when the remote IPL menu in the SMS utilities cannot locate any LAN adapters that are supported by the remote IPL function. If a supported device is installed: 1. Replace the adapter. (See notes on page 413.) 2. Replace the I/O subsystem backplane in the subsystem(s) containing the device/adapter. 20EE 0009 Unable to communicate with the service processor Replace the primary I/O book, Location: U1.18-P1-H2 20EE 000A The image size on the boot or installation device exceeds the system firmware memory limitations. 1. Check for system firmware updates. Apply if available. 2. Call service support. 20EE 000B The system was not able to find an operating system on the devices in the boot list. Go to “Boot Problems” on page 409. Chapter 5. Error Code to FRU Index 497 Error Code Description Action/ Possible Failing FRU 20EE 000C Pointer to the operating system found in non-volatile storage. Values normally found in non-volatile storage that point to the location of an operating system were not found. This can happen for two reasons: v Either your operating system doesn’t support storing the values orv An event occurred that caused the system to lose non-volatile storage information (drainage or replacement of the battery).If you are running AIX, reconstruct this information by running the bootlist command and specifying the device that the operating system is installed on. See the AIX documentation for the syntax and usage of the bootlist command. To boot the operating system so that the above-mentioned values can be reconstructed, power the system down and power it back on again to cause the system to look for the operating system in the devices contained in the custom boot list or in the default boot list, depending on the condition of the system. If this is not successful, use the SMS menus to modify the boot sequence (also known as the boot list) to include devices that are known to contain a copy of the operating system. For example, select a hard disk known to have a copy of the operating system as the first and only device in the boot sequence and attempt to boot again. If the problem persists, replace the primary I/O book, Location: U1.18-P1-H2 20EE 000E Unable to execute the lpar-mgr open firmware method 1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 000F Unable to find the lpar-mgr package node 1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0010 Unable to execute the root vpd method 1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 498 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 20EE 0011 Unable to execute the root-lpar-vpd vpd method 1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0012 Unable to execute the mem-dimm-vpd method 1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0013 Unable to allocate memory for root-lpar-vpd vpd 1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0014 Unable to allocate memory for root-lpar-vpd location codes Verify that the system is in LPAR mode, and try again. Otherwise, the system firmware may be damaged. 20EE 0020 The boot-device environment variable exceeds the allowed character limit If this error occurs during the installation of AIX via a process called NIM push, the set_bootlist attribute may not have been set correctly on the NIM master. See the AIX 5.2 Installation Guide and Reference, order number SC23-4389, for more information. If this error occurs at any other time, complete the following steps: 1. Check for platform firmware updates. Apply if available. 2. Call service support. 20EE 0021 The boot-device environment variable contained more than five entries If this error occurs during the installation of AIX via a process called NIM push, the set_bootlist attribute may not have been set correctly on the NIM master. See the AIX 5.2 Installation Guide and Reference, order number SC23-4389, for more information. If this error occurs at any other time, complete the following steps: 1. Check for platform firmware updates. Apply if available. 2. Call service support. Chapter 5. Error Code to FRU Index 499 Error Code Description Action/ Possible Failing FRU 20EE 0022 The boot-device environment variable contained an entry that exceeded 255 characters in length If this error occurs during the installation of AIX via a process called NIM push, the set_bootlist attribute may not have been set correctly on the NIM master. See the AIX 5.2 Installation Guide and Reference, order number SC23-4389, for more information. If this error occurs at any other time, complete the following steps: 1. Check for platform firmware updates. Apply if available. 2. Call service support. 20EE 0100 Fcode resident on an I/O adapter is not supported on this system Informational message. The adapter’s Fcode driver is not supported on this system. IBM may produce a compatible driver in the future, but does not guarantee one. Additional information may be available from service support. This error code only affects boot-time operation, not the adapter’s functionality under the operating system. 20FB 0010 Failed to find keyword offset of SPCN configuration table in NVRAM. Call service support. 20FB 0020 Invalid SPCN configuration table flag before SPCN table update. Call service support. 20FC 0020 Failed to retrieve power, package, and/or cooling VPD via SPCN for the corresponding subsystem. Service processor mailbox failed to get VPD 1. Check RIO cable connection. 2. Check for system firmware updates. 3. Replace the part specified by the location code. 20FC 0021 Failed to retrieve power, package, and/or cooling VPD via SPCN for the subsystem. Service processor mailbox failed to get power, packaging, and cooling device VPD 1. Check RIO cable connection. 2. Check for system firmware updates. 3. Replace the part specified by the location code. 20FC 0030 Damaged power, packaging, and/or cooling device VPD format in SPCN configuration table 1. Check for system firmware updates. 2. Replace the part specified by the location code. 20FF 0001 Problem with VPD 1. Replace the part indicated by the location code. 2. If no location code is shown with the error code, call service support. 20FF0010 Invalid VPD (defect ROM structure for PCI) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0020 Invalid VPD (checksum mismatch) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 500 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 20FF0021 Invalid VPD (ID tag length is more than 256 bytes) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0022 Invalid VPD (84/90 tag length is more than 2KB - large resource tag) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0023 Invalid VPD (buffer length is more than 2KB 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0024 Invalid VPD (84/90 tag length is more than 2KB - vendor-defined tag) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0026 Invalid VPD (tag is not recognized) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0030 Invalid VPD (VPD data bad) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0060 Invalid VPD (VPD length is zero) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0070 Invalid VPD (cannot find the beginning of the keyword) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0072 Invalid VPD (cannot find the beginning of the keyword) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0073 Invalid VPD (cannot find the beginning of the keyword) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF0090 Invalid VPD (empty VPD data from the service processor mailbox command) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. Chapter 5. Error Code to FRU Index 501 Error Code Description Action/ Possible Failing FRU 20FF00A0 Invalid VPD (getvpd mailbox command failed) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF00B0 Invalid VPD (the FG keyword is not found for a fabric card) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF00B1 Invalid VPD (cannot find RM keyword) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF00C0 Invalid VPD (missing base address of the RIO-PCI bridge) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF00D0 Invalid VPD (machine type field is invalid) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 20FF00F0 Invalid VPD (the rack number and the unit number are not found in the SCF partition in NVRAM) 1. Replace the part specified by the location code. 2. If no location code is shown with the error code, call service support. 2100 0xxx SCSI controller fault Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the SCSI device. 21A0 0001 SCSI DASD: test unit ready failed; hardware error Before replacing any system components: 1. Ensure that the controller and each device on the SCSI bus is assigned a unique SCSI ID. 2. Ensure that the SCSI bus is properly terminated. 3. Ensure that the SCSI signal and power cables are securely connected and not damaged.The location code information is required to identify the ID of SCSI device failures as well as to indicate the location of the controller to which the device is attached. Check the system error logs to determine the location code information associated with the error code. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 502 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 21A0 0002 SCSI DASD: test unit ready failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the media (if a removable media device). 2. Replace the SCSI device. 21A0 0003 SCSI DASD: send diagnostic failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21A0 0004 SCSI DASD: send diagnostic failed - devofl command Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21E0 0001 SCSI tape: test unit ready failed; hardware error Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21E0 0002 SCSI tape: test unit ready failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the media (if a removable media device). 2. Replace the SCSI device. 21E0 0003 SCSI tape: send diagnostic failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21E0 0004 SCSI tape: send diagnostic failed - devofl command Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21ED 0001 SCSI C: test unit ready failed; hardware error Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21ED 0002 SCSI changer: test unit ready failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media. Chapter 5. Error Code to FRU Index 503 Error Code Description Action/ Possible Failing FRU 21ED 0003 SCSI changer: send diagnostic failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21ED 0004 SCSI changer: send diagnostic failed - devofl command Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21EE 0001 Undetermined SCSI device: test unit ready failed; hardware error Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. If the missing SCSI devices are connected to the same backplane, replace the SCSI backplane. 4. Replace the SCSI controller. 21EE 0002 Undetermined SCSI device: test unit ready failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media. 21EE 0003 Undetermined SCSI device: send diagnostic failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21EE 0004 Undetermined SCSI device: send diagnostic failed - devofl command Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21F0 0001 SCSI CD-ROM: test unit ready failed; hardware error Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21F0 0002 SCSI CD-ROM: test unit ready failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media. 21F0 0003 SCSI CD-ROM: send diagnostic failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 504 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 21F0 0004 SCSI CD-ROM: send diagnostic failed - devofl command Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI device. 21F2 0001 SCSI read/write optical device: test unit ready failed; hardware error Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21F2 0002 SCSI read/write optical device: test unit ready failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the media. 21F2 0003 SCSI read/write optical drive: send diagnostic failed; sense data available Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI drive. 21F2 0004 SCSI read/write optical drive: send diagnostic failed - devofl command Before replacing any system components, refer to the Action under error code 21A0 0001. Replace the SCSI drive. 2200 0001 PCI Ethernet BNC/RJ-45 or PCI Ethernet AUI/RJ-45 Adapter Internal Wrap Test failure Replace the Adapter. See “Determining Location Code” on page 415 for location code information related to this error. 2200 1001 10/100 Mbps Ethernet PCI Adapter Internal Wrap Test failure Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error. 2200 1002 10/100 Mbps Ethernet PCI Adapter failure Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error. 2201 0001 PCI Auto LANstreamer Token Ring Adapter Adapter failed to complete hardware initialization. Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error. 2201 1001 PCI Token Ring Adapter Adapter failed to complete hardware initialization. Replace the adapter. See “Determining Location Code” on page 415 for location code information related to this error. 20EE 0013 Unable to allocate memory for root-lpar-vpd vpd 1. Verify that the system is in LPAR mode, and reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. Chapter 5. Error Code to FRU Index 505 Error Code Description Action/ Possible Failing FRU 25A0 0001 L2 Cache controller failure Replace the MCM module specified by the location code. 25A1 0001 L2 Cache SRAM failure Replace the MCM module specified by the location code. 25A8 0xxx NVRAM problems Errors reported against NVRAM can be caused by low battery voltage and (more rarely) power outages that occur during normal system usage. With the exception of the 25A8 0000 error, these errors are warnings that the NVRAM data content had to be reestablished and do not require any FRU replacement unless the error is persistent. When one of these errors occurs, any system customization (for example, boot device list) information has been lost, the system may need to be reconfigured. If the error is persistent, replace the primary I/O book, Location: U1.18-P1-H2 25A8 0000 Initialization failed, device test failed Refer to Action under error code 25A8 0xxx 25A8 0100 NVRAM data validation check failed. Turn off, turn on system unit and retry the operation before replacing any system component. Refer to Action under error code 25A8 0xxx 25A8 0201 Unable to expand target partition - Saving configuration variable. Refer to Action under error code 25A8 0xxx 25A8 0202 Unable to expand target partition - Writing error log entry. Refer to Action under error code 25A8 0xxx 25A8 0203 Unable to expand target partition - Writing VPD data. Refer to Action under error code 25A8 0xxx 25A8 0210 Setenv/$Setenv parameter error - name contains a null character. Refer to Action under error code 25A8 0xxx 25A8 0211 Setenv/$Setenv parameter error - value contains a null character. Refer to Action under error code 25A8 0xxx 25A8 0215 NVRAM RIO-USR-select variable not found Complete the following: 1. Check for platform firmware updates. Apply if available. 2. Call service support. 506 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 25A8 0998 NVRAMRC script evaluation error - command line execution error Action: Execution of a command line within the nvram configuration variable “nvramrc” (script) resulted in a “throw” being executed. This script can be modified by the system firmware SMS utilities, the operating system, PCI adapter rom code or utility, or an operator (via the open firmware script editing command nvedit). It may not be possible to resolve the problem without a detailed analysis of the nvram script, the current system configuration, and device tree contents. 1. The problem can be caused by a SCSI adapter, whose SCSI bus ID has been changed from the default setting, no longer appearing in the system. This can be caused either by removing a SCSI adapter, or by a problem with a SCSI adapter. a. Select option 5, Change SCSI Settings, on the SMS main menu. Then on the SCSI utilities menu, select option 2, Change SCSI Id. 1) Verify the list of SCSI controllers/adapters. If the list is not correct, suspect a problem with the adapter(s) installed but not listed. 2) Select one of the adapters listed and enter the SCSI id that is currently displayed for that adapter, then exit from this menu. This causes the information in NVRAM to be updated. 3) Restart the system.b. If the problem persists, boot the operating system and verify the SCSI bus IDs of any installed/available SCSI controllers (change as necessary), and restart the system.2. Contact your support representative for further assistance. 25A8 0999 NVRAMRC script evaluation error - stack unbalanced on completion This is a firmware debug environment error. There is no user action or FRU replacement for this error. If this error persists, call service support. 25A8 0A00 Unable to retrieve NVRAM LPAR table information Refer to the actions for error code 25A8 0xxx. 25B0 0001 Memory book failure Replace the memory book specified by the location code. 25B0 0004 Memory book is not supported Replace the unsupported memory book(s) specified by the location code. 25Cy y001 Memory book is not supported Replace unsupported memory book(s) specified by the location code. 25Cy y002 Memory book failure Replace the memory book(s) specified by the location code. 2601 0000 Boot failed because the RIO cables, or the power subsystem cables, are not routed correctly to the I/O subsystems 1. Check both ends of the RIO cable specified in the location code that appears with this error code. 2. Determine which I/O subsystem is at the end of the RIO cable specified by the location code shown with this error code. Check both ends of the power cables going to both DCAs in that I/O subsystem.For information about I/O cabling and connector locations, see “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. Note: The port location code identifies the I/O book number. Chapter 5. Error Code to FRU Index 507 Error Code Description Action/ Possible Failing FRU 2601 0001 Boot failed because the I/O subsystem has not been programmed correctly. For information about I/O book cabling and connector locations, see “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. 1. Ensure that power cables are correctly connected to drawers. 2. Firmware, including power code, may have to be flashed on both sides of the I/O subsystem. Note: The location code identifies the I/O subsystem that has the problem 3. Contact the next level of support. 2601 0002 RIO cable error or DCA power cable error detected on an I/O subsystem 1. Check both ends of the RIO cable specified in the location code that appears with this error code. 2. Determine which I/O subsystem is at the end of the RIO cable specified by the location code shown with this error code. Check the power cables going to both DCAs in that I/O subsystem.For information about I/O cabling and connector locations, see “I/O Book Connector Locations” on page 19 and “Subsystem Positioning and Cabling” on page 36. Note: The port location code identifies the I/O book number. 2602 0001 Invalid PCI adapter vendor ID 1. Move adapter to another slot (behind a different PCI bus). 2. Check for available firmware updates for adapter. Apply update if available. 3. Run AIX diagnostics on adapter. Replace if indicated. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane (See notes on Chapter 5, “Error Code to FRU Index,” on page 413.) 2602 0002 Invalid PCI adapter device ID 1. Move adapter to another slot (behind a different PCI bus). 2. Check for available firmware updates for adapter. Apply update if available. 3. Run AIX diagnostics on adapter. Replace if indicated. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane. (See notes on Chapter 5, “Error Code to FRU Index,” on page 413.) 508 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 2602 0007 PCI device I/O or memory space request could not be satisfied 1. If the location code identifies a slot: a. Reseat the adapter and reboot the system. b. Replace the adapter. c. Check for system firmware updates. Apply if available. d. Replace the I/O backplane in which the adapter is installed.2. If the location code identifies an I/O planar: a. Check for system firmware updates. Apply if available. b. Replace the I/O backplane.3. Call service support. 2602 0008 PCI device Fcode evaluation error 1. If the location code identifies a slot: a. Check for adapter firmware updates. Apply if available. b. Replace the adapter. c. Check for system firmware updates. Apply if available.2. If the location code identifies an I/O planar: a. Check for system firmware updates. Apply if available. b. Replace the I/O planar.3. Call service support. 2602 0010 PCI probe error, bridge in freeze state If the location code identifies a slot or PCI adapter: 1. Check for system firmware updates. Apply if update available. 2. Check the cabling to the adapter (in particular, the adapters that have serial ports). Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. 3. Use the hot plug service aid to reseat the adapter specified by the location code. See “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 921. If reseating the adapter fixes the problem, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems manual. If the problem is not resolved, continue with 4. 4. Use the hot plug task to move the adapter to another slot (behind a different PCI bridge). The “I/O Subsystem PCI and PCI-X PHB and PCI and PCI-X Slot Locations” on page 27 identifies the PCI bridges and the associated slots. See “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 921 when removing the adapter, and “Installing a New Hot-Pluggable PCI Adapter” on page 923 when installing the adapter in the new slot. If the adapter is successfully reconfigured in the new slot (behind a different PCI bridge), the slot in which the adapter was originally plugged is bad. Replace the I/O backplane assembly that contains the slot in which the adapter was originally installed, location: Ux.y-Pz. If the adapter does not reconfigure successfully in the new slot, replace the adapter. 5. Go to “MAP 1542: I/O Problem Isolation” on page 318. If the location code does not identify a slot or PCI adapter or no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 318. Chapter 5. Error Code to FRU Index 509 Error Code Description Action/ Possible Failing FRU 2602 0011 PCI probe error, bridge is not usable If the location code identifies a slot: 1. Check the cabling to the adapter (in particular, the adapters that have serial ports). Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. 2. Move adapter to another slot (behind a different PCI bus). 3. Check for available firmware updates for adapter. Apply update if available. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane (See notes on Chapter 5, “Error Code to FRU Index,” on page 413.)If the location code identifies an I/O subsystem backplane: 1. Check for system firmware updates. Apply if update is available. 2. Replace I/O subsystem backplane. 2602 0012 PCI device run-time error, bridge in freeze state If the location code identifies a slot: 1. Check the cabling to the adapter (in particular, the adapters that have serial ports). Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. 2. Move adapter to another slot (behind a different PCI bus). 3. Check for available firmware updates for adapter. Apply update if available. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. Replace I/O subsystem backplane (See notes on Chapter 5, “Error Code to FRU Index,” on page 413).If the location code identifies an I/O subsystem backplane: 1. Check for system firmware updates. Apply if update is available. 2. Replace I/O subsystem backplane. 510 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 2602 0014 A PCI adapter is not properly seated, or an invalid type of PCI adapter is installed in the system. If the location code identifies a slot: 1. Check the PCI adapter specified by the location code for proper seating. 2. Verify that the PCI adapter specified by the location code is supported on this system. 3. Replace the adapter.If the location code identifies an I/O planar: 1. Check for system firmware updates. Apply the update if one is available. 2. Replace the I/O planar. 2602 0Fxy PCI bridge chip failure Replace the I/O planar specified by the location code. The I/IO planar may be in the CEC or in an external I/O drawer. 2680 0Axy MX-PCI Bridge Bist Failure The problem is associated with bridge ?y? in I/O subsystem ?x?. Replace the I/O board associated with x x = I/O subsystem 1=0 2=1 3=2 4=3 5=4 6=5 See “Determining Location Code” on page 415 or location code information related to this error. 2680 0Cxx Machine Check occurred 1. Replace the part identified by the location code associated with this error code. 2. If the location code associated with this error code identifies an adapter, and replacing the adapter does not correct the problem, replace the I/O subsystem backplane.See “Determining Location Code” on page 415 for location code information related to this error. 2680 0Dxx Machine check occurred; unable to isolate to a single device Go to “MAP 1542: I/O Problem Isolation” on page 318. 27A0 0001 Error obtaining external RIO cable IDs 1. Check the SPCN and RIO cabling going to the I/O subsystem specified by the location code. 2. Check for system firmware updates. 3. Replace the I/O subsystem planar specified by the location code. 2803 0xxx Real-time-clock (RTC) errors Chapter 5. Error Code to FRU Index 511 Error Code Description Action/ Possible Failing FRU 2803 0001 RTC not updating - RTC initialization required Errors reported against the real-time clock can be caused by low battery voltage and (more rarely) power outages that occur during normal system usage. These errors are warnings that the RTC data content must be re-established and do not require any FRU replacement unless the error is persistent. When one of these errors occurs, the time and date information has been lost. To set/restore the time and date, use the operating system facility. If the error is persistent, replace the primary I/O book, Location: U1.18-P1-H2 2803 0002 Bad time/date values 1. Set time and date. 2. Refer to Action under error code 2803 0001. 2803 0003 RTC not updating - not correctable Replace the primary I/O book, Location: U1.18-P1-H2 2803 0004 RTC operating mode parameters (for example, data mode) changed 1. Set time and date. 2. Refer to Action under error code 2803 0001. 2803 0005 RTC battery error Replace the primary I/O book, Location: U1.18-P1-H2 Note: Passwords, time, and date must be reset. 2803 0006 Processor frequency measurement error Replace the clock card, location: U1.18-P1-X5. 2803 0010 The real-time-clock cannot be protected in the LPAR environment. 1. Check for system firmware updates. Apply if available. 2. Contact your next level of support. 2803 F003 Real-time-clock not updating - not correctable (detected by hypervisor) Replace the primary I/O book, location: U1.18-P1-H2. 2900 0002 ISA keyboard/mouse controller failed self-test Replace the primary I/O book, Location: U1.18-P1-H2 29A0 0003 ISA keyboard not present/detected Replace the primary I/O book, Location: U1.18-P1-H2 29A0 0004 ISA keyboard stuck key detected Replace the primary I/O book, Location: U1.18-P1-H2 29B0 0004 ISA mouse not present/detected Replace the primary I/O book, Location: U1.18-P1-H2 2B20 0011 Processor is manually disabled. Use the service processor menus to re-enable the processor and reboot the system. 2B20 0012 This is an informational code that is generated when the cpu_deallocate command is used to take a processor off-line. There is no action required. To bring the processor back on-line, or back into the system or partition configuration, reboot the operating system. 512 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 2B20 0022 Processor is deconfigured by the system. The processor has previously failed BIST or POST. Replace the processor MCM indicated by the location code associated with the error code in the error log. 2B20 0031 Processor is failing POST Replace the processor MCM indicated by the location code associated with the error code in the error log. 2B20 0042 Unknown processor card Remove the unknown/unsupported processor MCM. 2B20 8880 Unrecoverable processor subsystem error If Linux is the only operating system on the failing system, call service support. If AIX is available, and the system is powered off, replace the FRU indicated in the service processor error log entry. If AIX is available and the system has been rebooted, run online AIX diagnostics and look for an AIX error log entry in which the description says cache or cache controller error. Replace the FRUs listed in this entry. 2B20 8881 Software error Call service support. 2B20 8882 Software error Call service support. 2B20 8883 Software error Call service support. 2B20 8884 Unrecoverable processor subsystem error If Linux is the only operating system on the failing system, call service support. If AIX is available, and the system is powered off, replace the FRU indicated in the service processor error log entry. If AIX is available and the system has been rebooted, run online AIX diagnostics and look for an AIX error log entry in which the description says cache or cache controller error. Replace the FRUs listed in this entry. 2B20 8885 Unrecoverable internal hardware error If Linux is the only operating system on the failing system, call service support. If Aix is available, run online AIX diagnostics.Look for an AIX error log entry in which the description says I/O bus time out, access, or other error, or I/O bridge/device internal error. Replace the FRUs listed in this entry. 2B20 8886 Recoverable internal error while resetting a partition Informational message; no action required. If a partition is reset, either from the operating system or the HMC, and a machine check occurs once the partition has started but not completed the reset operation, the normal reset operation is aborted and the partition is terminated without an operating system dump. When the partition is brought back up, examine the operating system error log to determine the cause of the machine check. Chapter 5. Error Code to FRU Index 513 Error Code Description Action/ Possible Failing FRU 2B20 8888 Operating system termination request received Informational message. 2B2x 00EE An exception error has been reported on a global processor The type of interrupt that caused the exception is specified by the x as follows: 0: Unknown interrupt 1: System reset interrupt (SRI) 2. Machine check interrupt (MCI) 3. Data storage interrupt (DSI) 4. Instruction storage interrupt (ISI) 5. External interrupt (EXI) 6. Alignment interrupt (ALI) 7. Program interrupt (SRI) 8. Floating unavailable interrupt (FUI)1. Check for system firmware updates. Apply if available. 2. Call service support. 2BA0 0000 Service processor POST failure Replace the primary I/O book, Location: U1.18-P1-H2 2BA0 0012 Service processor reports self test failure Replace the primary I/O book, Location: U1.18-P1-H2 2BA0 0013 Service processor reports bad NVRAM CRC Refer to the actions for 25A8 0xxx. 2BA0 0017 Service processor reports bad or low battery. Refer to the actions for 25A8 0xxx. 2BA0 0021 Open firmware mailbox call to activate attention LED failed 1. Check for system firmware updates. Apply if available. 2. Call service support. 2BA0 0041 Service processor VPD is damaged. Replace the primary I/O book, Location: U1.18-P1-H2 2BA0 0071 VPD data is damaged for MCM 0 Replace the MCM and VPD card 2BA0 0073 VPD data is damaged for MCM 2 Replace the MCM and VPD card 2BA0 0080 Service processor hardware reset request failure 1. Check for system firmware updates. Apply if available. 2. Call service support. 2Cmm mhhh A processor’s timebase enable signal is stuck enabled 1. Replace the primary I/O book, location: U1.18-P1-H2. 2. hhh indicates the failing processor. Replace the MCM that contains the failing processor according to the following table:v hhh = 0 - 7: U1.18-P1-C1 v hhh = 8-15: U1.18-P1-C3 v hhh = 16-23: U1.18-P1-C4 v hhh = 24-31: U1.18-P1-C2 514 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 2Dmm mhhh A processor’s timebase enable signal is stuck disabled 1. Replace the primary I/O book, location: U1.18-P1-H2. 2. hhh indicates the failing processor. Replace the MCM that contains the failing processor according to the following table:v hhh = 0 - 7: U1.18-P1-C1 v hhh = 8-15: U1.18-P1-C3 v hhh = 16-23: U1.18-P1-C4 v hhh = 24-31: U1.18-P1-C2 Chapter 5. Error Code to FRU Index 515 Service Processor Error Codes Attention: Follow the procedure defined in “Checkpoints and Error Codes Index” on page 416. If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Note: The MCM and its associated VPD card are a single FRU, and must be replaced as a set. Error Code Description Action/ Possible Failing FRU 4064 0001 Processor subsystem tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0002 Processor subsystem tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0003 Processor subsystem tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4064 0004 Processor subsystem tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4064 0005 Processor subsystem tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4064 0006 Processor subsystem tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4064 0007 Processor subsystem tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 4064 000A Processor subsystem tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 000B Processor subsystem tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 000C Processor subsystem tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 000D Processor subsystem tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 4064 0010 Processor subsystem tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 516 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4064 0011 Processor subsystem tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 4064 0012 Processor subsystem tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 4064 0013 Processor subsystem tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 4064 0014 Processor subsystem tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 4064 0015 Processor subsystem tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 4064 0016 Processor subsystem tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 4064 0017 Processor subsystem tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 4064 00B0 Processor subsystem tests detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 4064 00B1 Processor subsystem tests detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 4064 00B3 Processor subsystem tests detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4064 00B4 Processor subsystem tests detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4064 00B5 Processor subsystem tests detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4064 00B6 Processor subsystem tests detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4064 00B7 Processor subsystem tests detected Insufficient hardware resources to continue IPL after processor subsystem testing and initialization completed Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 517 Error Code Description Action/ Possible Failing FRU 4064 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4064 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 518 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4064 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4064 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 519 Error Code Description Action/ Possible Failing FRU 4064 0B00 Processor subsystem tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4064 0B01 Processor subsystem tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4064 0B02 Processor subsystem tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4064 0B03 Processor subsystem tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4064 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4064 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4064 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 520 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4064 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4064 0B20 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4064 0B21 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4064 0B22 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4064 0B23 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4064 0B24 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 521 Error Code Description Action/ Possible Failing FRU 4064 0B25 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4064 0B26 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4064 0B27 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4064 0B30 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4064 0B31 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4064 0B32 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4064 0B33 Processor subsystem tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 522 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4064 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4064 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4064 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4064 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4064 0BF0 Processor subsystem tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 4064 0BF1 Processor subsystem tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 Chapter 5. Error Code to FRU Index 523 Error Code Description Action/ Possible Failing FRU 4064 0BF2 Processor subsystem tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 4064 0BF3 Processor subsystem tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 4064 0BF4 Processor subsystem tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 4064 0BF5 Processor subsystem tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 4064 0BF6 Processor subsystem tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 4064 0BF7 Processor subsystem tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 4064 0BF8 Processor subsystem tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 524 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4064 0BF9 Processor subsystem tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 4064 0BFA Processor subsystem tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 4064 0BFB Processor subsystem tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 4064 0BFC Processor subsystem tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 4064 0BFD Processor subsystem tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 4064 0C03 Processor subsystem tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 4064 0C09 Processor subsystem tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 4064 0C0A Processor subsystem tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 4064 0C10 Processor subsystem tests detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4064 0C11 Processor subsystem tests detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4064 0C12 Processor subsystem tests detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4064 0C13 Processor subsystem tests detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4064 0CA5 Processor subsystem tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. Chapter 5. Error Code to FRU Index 525 Error Code Description Action/ Possible Failing FRU 4064 0CA6 Processor subsystem tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 4064 0CA9 Processor subsystem tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 4064 0D00 Processor subsystem tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 4064 0D01 Processor subsystem tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 4064 0D02 Processor subsystem tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 4064 0D03 Processor subsystem tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 4064 0D04 Processor subsystem tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 4064 0D05 Processor subsystem tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 4064 0D06 Processor subsystem tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 4064 0D07 Processor subsystem tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 4064 0D08 Processor subsystem tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 4064 0D09 Processor subsystem tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 4064 0D0A Processor subsystem tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 4064 0D0B Processor subsystem tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 4064 0D0C Processor subsystem tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 4064 0D0D Processor subsystem tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 4064 0D0E Processor subsystem tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 4064 0D0F Processor subsystem tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 526 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4064 0EA0 Processor subsystem tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 4064 0EA1 Processor subsystem tests detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 4064 0EA2 Processor subsystem tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 4064 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 4064 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 4064 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 4064 0EAF Processor subsystem tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 4064 0EB0 Processor subsystem tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 527 Error Code Description Action/ Possible Failing FRU 4064 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4064 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 4064 0EE1 Processor subsystem tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 4064 0EE2 Processor subsystem tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4064 0EE5 Processor subsystem tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 4064 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4064 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4065 0001 Display alter tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0002 Display alter tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0003 Display alter tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4065 0004 Display alter tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4065 0005 Display alter tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 528 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4065 0006 Display alter tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4065 0007 Display alter tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective.?) 2. Primary I/O bookLocation: U1.18-P1-H2 4065 000A Display alter tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 000B Display alter tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 000C Display alter tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 000D Display alter tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 4065 0010 Display alter tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 4065 0011 Display alter tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 4065 0012 Display alter tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 4065 0013 Display alter tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 4065 0014 Display alter tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 4065 0015 Display alter tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 4065 0016 Display alter tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 4065 0017 Display alter tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 4065 00B0 Display alter tests detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 4065 00B1 Display alter tests detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter 5. Error Code to FRU Index 529 Error Code Description Action/ Possible Failing FRU 4065 00B3 Display alter tests detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: 2U1.18-P1-C1 and U1.18-P1-H2.3 4065 00B4 Display alter tests detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4065 00B5 Display alter tests detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4065 00B6 Display alter tests detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4065 00B7 Display alter tests detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 530 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4065 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4065 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 531 Error Code Description Action/ Possible Failing FRU 4065 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4065 0B00 Display alter tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4065 0B01 Display alter tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4065 0B02 Display alter tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4065 0B03 Display alter tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 532 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4065 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4065 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4065 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4065 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4065 0B20 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4065 0B21 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4065 0B22 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 533 Error Code Description Action/ Possible Failing FRU 4065 0B23 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4065 0B24 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4065 0B25 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4065 0B26 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4065 0B27 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4065 0B30 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4065 0B31 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 534 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4065 0B32 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4065 0B33 Display alter tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4065 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4065 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4065 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4065 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 535 Error Code Description Action/ Possible Failing FRU 4065 0BF0 Display alter tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 4065 0BF1 Display alter tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 4065 0BF2 Display alter tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 4065 0BF3 Display alter tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 4065 0BF4 Display alter tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 4065 0BF5 Display alter tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 4065 0BF6 Display alter tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 536 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4065 0BF7 Display alter tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 4065 0BF8 Display alter tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 4065 0BF9 Display alter tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 4065 0BFA Display alter tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 4065 0BFB Display alter tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 4065 0BFC Display alter tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 4065 0BFD Display alter tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 4065 0C03 Display alter tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 4065 0C09 Display alter tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter 5. Error Code to FRU Index 537 Error Code Description Action/ Possible Failing FRU 4065 0C0A Display alter tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 4065 0C10 Display alter tests detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4065 0C11 Display alter tests detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4065 0C12 Display alter tests detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4065 0C13 Display alter tests detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4065 0CA5 Display alter tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 4065 0CA6 Display alter tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 4065 0CA9 Display alter tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 4065 0D00 Display alter tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 4065 0D01 Display alter tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 4065 0D02 Display alter tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 4065 0D03 Display alter tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 4065 0D04 Display alter tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 4065 0D05 Display alter tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 4065 0D06 Display alter tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 4065 0D07 Display alter tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 4065 0D08 Display alter tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 4065 0D09 Display alter tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 4065 0D0A Display alter tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 4065 0D0B Display alter tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 4065 0D0C Display alter tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 4065 0D0D Display alter tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 4065 0D0E Display alter tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 538 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4065 0D0F Display alter tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 4065 0EA0 Display alter tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 4065 0EA1 Display alter tests detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 4065 0EA2 Display alter tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 4065 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 4065 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 4065 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 4065 0EAF Display alter tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 4065 0EB0 Display alter tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 539 Error Code Description Action/ Possible Failing FRU 4065 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4065 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 4065 0EE1 Display alter tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 4065 0EE2 Display alter tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4065 0EE5 Display alter tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 4065 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4065 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4066 0001 Scan interface basic assurance tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0002 Scan interface basic assurance tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0003 Scan interface basic assurance tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4066 0004 Scan interface basic assurance tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4066 0005 Scan interface basic assurance tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 540 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4066 0006 Scan interface basic assurance tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4066 0007 Scan interface basic assurance tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 4066 000A Scan interface basic assurance tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 000B Scan interface basic assurance tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 000C Scan interface basic assurance tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 000D Scan interface basic assurance tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 4066 0010 Scan interface basic assurance tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 4066 0011 Scan interface basic assurance tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 4066 0012 Scan interface basic assurance tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 4066 0013 Scan interface basic assurance tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 4066 0014 Scan interface basic assurance tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 4066 0015 Scan interface basic assurance tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 4066 0016 Scan interface basic assurance tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 4066 0017 Scan interface basic assurance tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 4066 00B0 Possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 4066 00B1 Possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter 5. Error Code to FRU Index 541 Error Code Description Action/ Possible Failing FRU 4066 00B3 MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4066 00B4 MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4066 00B5 MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4066 00B6 MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4066 00B7 Hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 542 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4066 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4066 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 543 Error Code Description Action/ Possible Failing FRU 4066 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4066 0B00 Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4066 0B01 Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4066 0B02 Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4066 0B03 Scan interface basic assurance tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 544 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4066 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4066 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4066 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4066 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4066 0B20 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4066 0B21 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4066 0B22 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 545 Error Code Description Action/ Possible Failing FRU 4066 0B23 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4066 0B24 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4066 0B25 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4066 0B26 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4066 0B27 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4066 0B30 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4066 0B31 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 546 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4066 0B32 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4066 0B33 Scan interface basic assurance tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4066 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4066 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4066 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4066 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 547 Error Code Description Action/ Possible Failing FRU 4066 0BF0 Scan interface basic assurance tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 4066 0BF1 Scan interface basic assurance tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 4066 0BF2 Scan interface basic assurance tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 4066 0BF3 Scan interface basic assurance tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 4066 0BF4 Scan interface basic assurance tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 4066 0BF5 Scan interface basic assurance tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 4066 0BF6 Scan interface basic assurance tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 548 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4066 0BF7 Scan interface basic assurance tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 4066 0BF8 Scan interface basic assurance tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 4066 0BF9 Scan interface basic assurance tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 4066 0BFA Scan interface basic assurance tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 4066 0BFB Scan interface basic assurance tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 4066 0BFC Scan interface basic assurance tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 4066 0BFD Scan interface basic assurance tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 4066 0C03 Scan interface basic assurance tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 4066 0C09 Scan interface basic assurance tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter 5. Error Code to FRU Index 549 Error Code Description Action/ Possible Failing FRU 4066 0C0A Scan interface basic assurance tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 4066 0C10 Mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4066 0C11 Mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4066 0C12 Mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4066 0C13 Mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4066 0CA5 Scan interface basic assurance tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 4066 0CA6 Scan interface basic assurance tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 4066 0CA9 Scan interface basic assurance tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 4066 0D00 Scan interface basic assurance tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 4066 0D01 Scan interface basic assurance tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 4066 0D02 Scan interface basic assurance tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 4066 0D03 Scan interface basic assurance tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 4066 0D04 Scan interface basic assurance tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 4066 0D05 Scan interface basic assurance tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 4066 0D06 Scan interface basic assurance tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 4066 0D07 Scan interface basic assurance tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 4066 0D08 Scan interface basic assurance tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 4066 0D09 Scan interface basic assurance tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 4066 0D0A Scan interface basic assurance tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 4066 0D0B Scan interface basic assurance tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 4066 0D0C Scan interface basic assurance tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 4066 0D0D Scan interface basic assurance tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 4066 0D0E Scan interface basic assurance tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 550 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4066 0D0F Scan interface basic assurance tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 4066 0EA0 Scan interface basic assurance tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 4066 0EA1 Scan interface basic assurance tests detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 4066 0EA2 Scan interface basic assurance tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 4066 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 4066 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 4066 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 4066 0EAF Scan interface basic assurance tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 4066 0EB0 Scan interface basic assurance tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 551 Error Code Description Action/ Possible Failing FRU 4066 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4066 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 4066 0EE1 Scan interface basic assurance tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 4066 0EE2 Scan interface basic assurance tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4066 0EE5 Scan interface basic assurance tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 4066 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4066 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4067 0001 Bus tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0002 Bus tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0003 Bus tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4067 0004 Bus tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 552 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4067 0005 Bus tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4067 0006 Bus tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4067 0007 Bus tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 4067 000A Bus tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 000B Bus tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 000C Bus tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 000D Bus tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 4067 0010 Bus tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 4067 0011 Bus tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 4067 0012 Bus tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 4067 0013 Bus tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 4067 0014 Bus tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 4067 0015 Bus tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 4067 0016 Bus tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 4067 0017 Bus tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 4067 00B0 Bus tests detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter 5. Error Code to FRU Index 553 Error Code Description Action/ Possible Failing FRU 4067 00B1 Bus tests detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 4067 00B3 Bus tests detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4067 00B4 Bus tests detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4067 00B5 Bus tests detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4067 00B6 Bus tests detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4067 00B7 Bus tests detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 554 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4067 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4067 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 555 Error Code Description Action/ Possible Failing FRU 4067 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4067 0B00 Bus tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4067 0B01 Bus tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4067 0B02 Bus tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4067 0B03 Bus tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 556 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4067 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4067 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4067 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4067 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4067 0B20 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4067 0B21 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4067 0B22 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 557 Error Code Description Action/ Possible Failing FRU 4067 0B23 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4067 0B24 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4067 0B25 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4067 0B26 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4067 0B27 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4067 0B30 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4067 0B31 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 558 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4067 0B32 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4067 0B33 Bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4067 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4067 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4067 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4067 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 559 Error Code Description Action/ Possible Failing FRU 4067 0BF0 Bus tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 4067 0BF1 Bus tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 4067 0BF2 Bus tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 4067 0BF3 Bus tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 4067 0BF4 Bus tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 4067 0BF5 Bus tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 4067 0BF6 Bus tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 560 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4067 0BF7 Bus tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 4067 0BF8 Bus tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 4067 0BF9 Bus tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 4067 0BFA Bus tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 4067 0BFB Bus tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 4067 0BFC Bus tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 4067 0BFD Bus tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 4067 0C03 Bus tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 4067 0C09 Bus tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter 5. Error Code to FRU Index 561 Error Code Description Action/ Possible Failing FRU 4067 0C0A Bus tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 4067 0C10 Bus tests detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4067 0C11 Bus tests detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4067 0C12 Bus tests detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4067 0C13 Bus tests detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4067 0CA5 Bus tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 4067 0CA6 Bus tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 4067 0CA9 Bus tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 4067 0D00 Bus tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 4067 0D01 Bus tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 4067 0D02 Bus tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 4067 0D03 Bus tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 4067 0D04 Bus tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 4067 0D05 Bus tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 4067 0D06 Bus tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 4067 0D07 Bus tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 4067 0D08 Bus tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 4067 0D09 Bus tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 4067 0D0A Bus tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 4067 0D0B Bus tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 4067 0D0C Bus tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 4067 0D0D Bus tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 4067 0D0E Bus tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 562 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4067 0D0F Bus tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 4067 0EA0 Bus tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 4067 0EA1 Bus tests detected fault in processor subsystem code on main processor 1. Check for operating system update. 2. Call service support. 4067 0EA2 Bus tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 4067 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 4067 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 4067 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 4067 0EAF Bus tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 4067 0EB0 Bus tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 563 Error Code Description Action/ Possible Failing FRU 4067 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4067 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 4067 0EE1 Bus tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 4067 0EE2 Bus tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4067 0EE5 Bus tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 4067 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4067 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4068 0001 Other bus tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0002 Other bus tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0003 Other bus tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4068 0004 Other bus tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4068 0005 Other bus tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 564 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4068 0006 Other bus tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4068 0007 Other bus tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 4068 000A Other bus tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 000B Other bus tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 000C Other bus tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 000D Other bus tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 4068 0010 Other bus tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 4068 0011 Other bus tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 4068 0012 Other bus tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 4068 0013 Other bus tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 4068 0014 Other bus tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 4068 0015 Other bus tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 4068 0016 Other bus tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 4068 0017 Other bus tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 4068 00B0 Other bus tests detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 4068 00B1 Other bus tests detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter 5. Error Code to FRU Index 565 Error Code Description Action/ Possible Failing FRU 4068 00B3 Other bus tests detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4068 00B4 Other bus tests detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4068 00B5 Other bus tests detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4068 00B6 Other bus tests detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4068 00B7 Other bus tests detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 566 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4068 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4068 0A05 Reboot message (run-time array gard failure) for Processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 567 Error Code Description Action/ Possible Failing FRU 4068 0A06 Reboot message (run-time array gard failure) for Processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4068 0B00 Other bus tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4068 0B01 Other bus tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4068 0B02 Other bus tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4068 0B03 Other bus tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 568 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4068 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4068 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4068 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4068 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4068 0B20 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4068 0B21 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4068 0B22 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 569 Error Code Description Action/ Possible Failing FRU 4068 0B23 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4068 0B24 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4068 0B25 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4068 0B26 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4068 0B27 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4068 0B30 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4068 0B31 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 570 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4068 0B32 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4068 0B33 Other bus tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4068 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4068 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4068 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4068 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 571 Error Code Description Action/ Possible Failing FRU 4068 0BF0 Other bus tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 4068 0BF1 Other bus tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 4068 0BF2 Other bus tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 4068 0BF3 Other bus tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 4068 0BF4 Other bus tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 4068 0BF5 Other bus tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 4068 0BF6 Other bus tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 572 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4068 0BF7 Other bus tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 4068 0BF8 Other bus tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 4068 0BF9 Other bus tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 4068 0BFA Other bus tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 4068 0BFB Other bus tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 4068 0BFC Other bus tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 4068 0BFD Other bus tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 4068 0C03 Other bus tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 4068 0C09 Other bus tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter 5. Error Code to FRU Index 573 Error Code Description Action/ Possible Failing FRU 4068 0C0A Other bus tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 4068 0C10 Other bus tests detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4068 0C11 Other bus tests detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4068 0C12 Other bus tests detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4068 0C13 Other bus tests detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4068 0CA5 Other bus tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 4068 0CA6 Other bus tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 4068 0CA9 Other bus tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 4068 0D00 Other bus tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 4068 0D01 Other bus tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 4068 0D02 Other bus tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 4068 0D03 Other bus tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 4068 0D04 Other bus tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 4068 0D05 Other bus tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 4068 0D06 Other bus tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 4068 0D07 Other bus tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 4068 0D08 Other bus tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 4068 0D09 Other bus tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 4068 0D0A Other bus tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 4068 0D0B Other bus tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 4068 0D0C Other bus tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 4068 0D0D Other bus tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 4068 0D0E Other bus tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 574 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4068 0D0F Other bus tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 4068 0EA0 Other bus tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 4068 0EA1 Other bus tests detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 4068 0EA2 Other bus tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 4068 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 4068 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 4068 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 4068 0EAF Other bus tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 4068 0EB0 Other bus tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 575 Error Code Description Action/ Possible Failing FRU 4068 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4068 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 4068 0EE1 Other bus tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 4068 0EE2 Other bus tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4068 0EE5 Other bus tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 4068 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4068 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4069 0001 Memory tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0002 Memory tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0003 Memory tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4069 0004 Memory tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4069 0005 Memory tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 576 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4069 0006 Memory tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4069 0007 Memory tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 4069 000A Memory tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 000B Memory tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 000C Memory tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 000D Memory tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 4069 0010 Memory tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 4069 0011 Memory tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 4069 0012 Memory tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 4069 0013 Memory tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 4069 0014 Memory tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 4069 0015 Memory tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 4069 0016 Memory tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 4069 0017 Memory tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 4069 00B0 Memory tests detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 4069 00B1 Memory tests detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter 5. Error Code to FRU Index 577 Error Code Description Action/ Possible Failing FRU 4069 00B3 Memory tests detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4069 00B4 Memory tests detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4069 00B5 Memory tests detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4069 00B6 Memory tests detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4069 00B7 Memory tests detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 578 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4069 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4069 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 579 Error Code Description Action/ Possible Failing FRU 4069 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 4069 0B00 Memory tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4069 0B01 Memory tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4069 0B02 Memory tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 4069 0B03 Memory tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 580 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4069 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4069 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4069 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4069 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 4069 0B20 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4069 0B21 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4069 0B22 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 581 Error Code Description Action/ Possible Failing FRU 4069 0B23 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4069 0B24 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4069 0B25 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4069 0B26 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4069 0B27 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 4069 0B30 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4069 0B31 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 582 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4069 0B32 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4069 0B33 Memory tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 4069 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4069 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4069 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 4069 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 583 Error Code Description Action/ Possible Failing FRU 4069 0BF0 Memory tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 4069 0BF1 Memory tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 4069 0BF2 Memory tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 4069 0BF3 Memory tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 4069 0BF4 Memory tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 4069 0BF5 Memory tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 4069 0BF6 Memory tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 584 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4069 0BF7 Memory tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 4069 0BF8 Memory tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 4069 0BF9 Memory tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 4069 0BFA Memory tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 4069 0BFB Memory tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 4069 0BFC Memory tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 4069 0BFD Memory tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 4069 0C03 Memory tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 4069 0C09 Memory tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter 5. Error Code to FRU Index 585 Error Code Description Action/ Possible Failing FRU 4069 0C0A Memory tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 4069 0C10 Memory tests detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 4069 0C11 Memory tests detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 4069 0C12 Memory tests detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 4069 0C13 Memory tests detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 4069 0CA5 Memory tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 4069 0CA6 Memory tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 4069 0CA9 Memory tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 4069 0D00 Memory tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 4069 0D01 Memory tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 4069 0D02 Memory tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 4069 0D03 Memory tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 4069 0D04 Memory tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 4069 0D05 Memory tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 4069 0D06 Memory tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 4069 0D07 Memory tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 4069 0D08 Memory tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 4069 0D09 Memory tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 4069 0D0A Memory tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 4069 0D0B Memory tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 4069 0D0C Memory tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 4069 0D0D Memory tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 4069 0D0E Memory tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 586 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4069 0D0F Memory tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 4069 0EA0 Memory tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 4069 0EA1 Memory tests detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 4069 0EA2 Memory tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. Chapter 5. Error Code to FRU Index 587 Error Code Description Action/ Possible Failing FRU 4069 0EA4 Description: The service processor firmware detected a hardware problem but was unable to isolate the source.Action: Examine word 13 and word 16 of the extended error log entry. (Instructions for finding word 13 and word 16 in the service processor error log entry can be found in the action column for error code B1xx 4699. To find word 16 in the AIX error log entry, go forward 20 bytes after ?40?. If word 13 is 9250xxxx and word 16 is A30A000, do the following: 1. Access the service processor menus. Select option 2, ?System Power Control Menu?, then select option 6, ?Enable/Disable Fast System Boot?. Note: If necessary, change the setting of option 6 to ?Currently Enabled?. 2. Enter ?98? to go back to the ?System Power Control Menu?. 3. Select option 3, ?Reboot/Restart Policy Setup Menu?. 4. Select option 1 in the ?Reboot/Restart Policy Setup Menu? to set the number of reboot attempts to 0. 5. Enter ?98? to exit the ?Reboot/Restart Policy Setup Menu?. 6. Power up the system using either the ?System Power Control Menu? or the white power button. Note: The system should fail to boot. 7. Look for entries in the service processor error log that were generated during the latest boot. In particular, look for an error code that has a format of 4xxx yyyy whose word 13 value is 7xxxxxxx. Replace the FRU indicated by this error code. 8. Go back to the ?System Power Control Menu? 9. Select option 6. This disables fast boot and automatically enables slow boot. 10. Power up the system using either the ?System Power Control Menu? or the white power button. 11. Look again for a 4069 0EA4 error. If one reoccurs, repeat steps 1 through 10 until the problem has been resolved or all FRUs have been replaced. 12. Call service support if all of the potential FRUs have been replaced and the problem has not been resolved. 4069 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 4069 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 4069 0EAF Memory tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 4069 0EB0 Memory tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 588 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4069 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 4069 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 4069 0EE1 Memory tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 4069 0EE2 Memory tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4069 0EE5 Memory tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 4069 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4069 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406A 0001 Self tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0002 Self tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0003 Self tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406A 0004 Self tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 Chapter 5. Error Code to FRU Index 589 Error Code Description Action/ Possible Failing FRU 406A 0005 Self tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406A 0006 Self tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406A 0007 Self tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 406A 000A Self tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 000B Self tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 000C Self tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 000D Self tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 406A 0010 Self tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 406A 0011 Self tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 406A 0012 Self tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 406A 0013 Self tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 406A 0014 Self tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 406A 0015 Self tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 406A 0016 Self tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 406A 0017 Self tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 406A 00B0 Self tests detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 590 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406A 00B1 Self tests detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 406A 00B3 Self tests detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406A 00B4 Self tests detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406A 00B5 Self tests detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406A 00B6 Self tests detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406A 00B7 Self tests detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 591 Error Code Description Action/ Possible Failing FRU 406A 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406A 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 592 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406A 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406A 0B00 Self-tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406A 0B01 Self-tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406A 0B02 Self-tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406A 0B03 Self-tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. Chapter 5. Error Code to FRU Index 593 Error Code Description Action/ Possible Failing FRU 406A 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406A 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406A 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406A 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406A 0B20 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406A 0B21 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406A 0B22 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 594 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406A 0B23 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406A 0B24 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406A 0B25 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406A 0B26 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406A 0B27 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406A 0B30 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406A 0B31 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 595 Error Code Description Action/ Possible Failing FRU 406A 0B32 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406A 0B33 Self-tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406A 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406A 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406A 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406A 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 596 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406A 0BF0 Self-tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 406A 0BF1 Self-tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 406A 0BF2 Self-tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 406A 0BF3 Self-tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 406A 0BF4 Self-tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 406A 0BF5 Self-tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 406A 0BF6 Self-tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 Chapter 5. Error Code to FRU Index 597 Error Code Description Action/ Possible Failing FRU 406A 0BF7 Self-tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 406A 0BF8 Self-tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 406A 0BF9 Self-tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 406A 0BFA Self-tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 406A 0BFB Self-tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 406A 0BFC Self-tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 406A 0BFD Self-tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 406A 0C03 Self-tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 406A 0C09 Self-tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 598 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406A 0C0A Self-tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 406A 0C10 Self-tests detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406A 0C11 Self-tests detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406A 0C12 Self-tests detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406A 0C13 Self-tests detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406A 0CA5 Self-tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 406A 0CA6 Self-tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 406A 0CA9 Self-tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 406A 0D00 Self-tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 406A 0D01 Self-tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 406A 0D02 Self-tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 406A 0D03 Self-tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 406A 0D04 Self-tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 406A 0D05 Self-tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 406A 0D06 Self-tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 406A 0D07 Self-tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 406A 0D08 Self-tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 406A 0D09 Self-tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 406A 0D0A Self-tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 406A 0D0B Self-tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 406A 0D0C Self-tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 406A 0D0D Self-tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 406A 0D0E Self-tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 Chapter 5. Error Code to FRU Index 599 Error Code Description Action/ Possible Failing FRU 406A 0D0F Self-tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 406A 0EA0 Self-tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 406A 0EA1 Self-tests detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 406A 0EA2 Self-tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 406A 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 406A 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 406A 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 406A 0EAF Self-tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 406A 0EB0 Self-tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 600 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406A 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406A 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 406A 0EE1 Self-tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 406A 0EE2 Self-tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406A 0EE5 Self-tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 406A 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406A 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406B 0001 Wire tests detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0002 Wire tests detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0003 Wire tests detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406B 0004 Wire tests detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406B 0005 Wire tests detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 Chapter 5. Error Code to FRU Index 601 Error Code Description Action/ Possible Failing FRU 406B 0006 Wire tests detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406B 0007 Wire tests detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 406B 000A Wire tests detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 000B Wire tests detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 000C Wire tests detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 000D Wire tests detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 406B 0010 Wire tests detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 406B 0011 Wire tests detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 406B 0012 Wire tests detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 406B 0013 Wire tests detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 406B 0014 Wire tests detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 406B 0015 Wire tests detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 406B 0016 Wire tests detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 406B 0017 Wire tests detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 406B 00B0 Wire tests detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 406B 00B1 Wire tests detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 602 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406B 00B3 Wire tests detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406B 00B4 Wire tests detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406B 00B5 Wire tests detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406B 00B6 Wire tests detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406B 00B7 Wire tests detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 603 Error Code Description Action/ Possible Failing FRU 406B 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406B 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 604 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406B 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406B 0B00 Wire tests detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406B 0B01 Wire tests detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406B 0B02 Wire tests detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406B 0B03 Wire tests detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. Chapter 5. Error Code to FRU Index 605 Error Code Description Action/ Possible Failing FRU 406B 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406B 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406B 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406B 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406B 0B20 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406B 0B21 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406B 0B22 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 606 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406B 0B23 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406B 0B24 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406B 0B25 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406B 0B26 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406B 0B27 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406B 0B30 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406B 0B31 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 607 Error Code Description Action/ Possible Failing FRU 406B 0B32 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406B 0B33 Wire tests detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406B 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406B 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406B 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406B 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 608 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406B 0BF0 Wire tests detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 406B 0BF1 Wire tests detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 406B 0BF2 Wire tests detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 406B 0BF3 Wire tests detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 406B 0BF4 Wire tests detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 406B 0BF5 Wire tests detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 406B 0BF6 Wire tests detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 Chapter 5. Error Code to FRU Index 609 Error Code Description Action/ Possible Failing FRU 406B 0BF7 Wire tests detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 406B 0BF8 Wire tests detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 406B 0BF9 Wire tests detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 406B 0BFA Wire tests detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 406B 0BFB Wire tests detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 406B 0BFC Wire tests detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 406B 0BFD Wire tests detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 406B 0C03 Wire tests detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 406B 0C09 Wire tests detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 610 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406B 0C0A Wire tests detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 406B 0C10 Wire tests detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406B 0C11 Wire tests detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406B 0C12 Wire tests detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406B 0C13 Wire tests detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406B 0CA5 Wire tests detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 406B 0CA6 Wire tests detected no usable main storage Reboot the system in slow mode to isolate the failure. 406B 0CA9 Wire tests detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 406B 0D00 Wire tests detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 406B 0D01 Wire tests detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 406B 0D02 Wire tests detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 406B 0D03 Wire tests detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 406B 0D04 Wire tests detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 406B 0D05 Wire tests detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 406B 0D06 Wire tests detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 406B 0D07 Wire tests detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 406B 0D08 Wire tests detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 406B 0D09 Wire tests detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 406B 0D0A Wire tests detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 406B 0D0B Wire tests detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 406B 0D0C Wire tests detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 406B 0D0D Wire tests detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 406B 0D0E Wire tests detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 Chapter 5. Error Code to FRU Index 611 Error Code Description Action/ Possible Failing FRU 406B 0D0F Wire tests detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 406B 0EA0 Wire tests detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 406B 0EA1 Wire tests detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 406B 0EA2 Wire tests detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 406B 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 406B 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 406B 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 406B 0EAF Wire tests detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 406B 0EB0 Wire tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 612 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406B 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406B 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 406B 0EE1 Wire tests detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 406B 0EE2 Wire tests detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406B 0EE5 Wire tests detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 406B 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406B 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406C 0001 IPL detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0002 IPL detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0003 IPL detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406C 0004 IPL detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406C 0005 IPL detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 Chapter 5. Error Code to FRU Index 613 Error Code Description Action/ Possible Failing FRU 406C 0006 IPL detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406C 0007 IPL detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 406C 000A IPL detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 000B IPL detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 000C IPL detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 000D IPL detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 406C 0010 IPL detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 406C 0011 IPL detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 406C 0012 IPL detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 406C 0013 IPL detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 406C 0014 IPL detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 406C 0015 IPL detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 406C 0016 IPL detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 406C 0017 IPL detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 406C 00B0 IPL detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 406C 00B1 IPL detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 614 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406C 00B3 IPL detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406C 00B4 IPL detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406C 00B5 IPL detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406C 00B6 IPL detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406C 00B7 IPL detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 615 Error Code Description Action/ Possible Failing FRU 406C 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406C 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 616 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406C 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406C 0B00 IPL detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406C 0B01 IPL detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406C 0B02 IPL detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406C 0B03 IPL detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. Chapter 5. Error Code to FRU Index 617 Error Code Description Action/ Possible Failing FRU 406C 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406C 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406C 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406C 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406C 0B20 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406C 0B21 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406C 0B22 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 618 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406C 0B23 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406C 0B24 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406C 0B25 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406C 0B26 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406C 0B27 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406C 0B30 IPL detected fault in I/O book I/O book (slot 3 RIO port 0) Location: U1.18-P1-H4/Q1 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 619 Error Code Description Action/ Possible Failing FRU 406C 0B31 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406C 0B32 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406C 0B33 IPL detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406C 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406C 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406C 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 620 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406C 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406C 0BF0 IPL detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 406C 0BF1 IPL detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 406C 0BF2 IPL detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 406C 0BF3 IPL detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 406C 0BF4 IPL detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 406C 0BF5 IPL detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 Chapter 5. Error Code to FRU Index 621 Error Code Description Action/ Possible Failing FRU 406C 0BF6 IPL detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 406C 0BF7 IPL detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 406C 0BF8 IPL detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 406C 0BF9 IPL detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 406C 0BFA IPL detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 406C 0BFB IPL detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 406C 0BFC IPL detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 622 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406C 0BFD IPL detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 406C 0C03 IPL detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 406C 0C09 IPL detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 406C 0C0A IPL detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 406C 0C10 IPL detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406C 0C11 IPL detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406C 0C12 IPL detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406C 0C13 IPL detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406C 0CA5 IPL detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 406C 0CA6 IPL detected no usable main storage Reboot the system in slow mode to isolate the failure. 406C 0CA9 IPL detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 406C 0D00 IPL detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 406C 0D01 IPL detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 406C 0D02 IPL detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 406C 0D03 IPL detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 406C 0D04 IPL detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 406C 0D05 IPL detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 406C 0D06 IPL detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 406C 0D07 IPL detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 406C 0D08 IPL detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 406C 0D09 IPL detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 Chapter 5. Error Code to FRU Index 623 Error Code Description Action/ Possible Failing FRU 406C 0D0A IPL detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 406C 0D0B IPL detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 406C 0D0C IPL detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 406C 0D0D IPL detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 406C 0D0E IPL detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 406C 0D0F IPL detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 406C 0EA0 IPL detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 406C 0EA1 IPL detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 406C 0EA2 IPL detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 406C 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 406C 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 406C 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 406C 0EAF IPL detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 406C 0EB0 IPL tests detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 624 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406C 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406C 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 406C 0EE1 IPL detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 406C 0EE2 IPL detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406C 0EE5 IPL detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 406C 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406C 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. Chapter 5. Error Code to FRU Index 625 Error Code Description Action/ Possible Failing FRU 406D 0001 Attention handler detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0002 Attention handler detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0003 Attention handler detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406D 0004 Attention handler detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406D 0005 Attention handler detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406D 0006 Attention handler detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406D 0007 Attention handler detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 406D 000A Attention handler detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 000B Attention handler detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 000C Attention handler detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 000D Attention handler detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 406D 0010 Attention handler detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 406D 0011 Attention handler detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 406D 0012 Attention handler detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 406D 0013 Attention handler detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 406D 0014 Attention handler detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 406D 0015 Attention handler detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 406D 0016 Attention handler detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 626 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406D 0017 Attention handler detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 406D 00B0 Attention handler detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 406D 00B1 Attention handler detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 406D 00B3 Attention handler detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406D 00B4 Attention handler detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406D 00B5 Attention handler detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406D 00B6 Attention handler detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406D 00B7 Attention handler detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 627 Error Code Description Action/ Possible Failing FRU 406D 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406D 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 628 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406D 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406D 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 629 Error Code Description Action/ Possible Failing FRU 406D 0B00 Attention handler detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406D 0B01 Attention handler detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406D 0B02 Attention handler detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406D 0B03 Attention handler detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406D 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406D 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406D 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 630 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406D 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406D 0B20 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406D 0B21 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406D 0B22 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406D 0B23 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406D 0B24 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 631 Error Code Description Action/ Possible Failing FRU 406D 0B25 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406D 0B26 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406D 0B27 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406D 0B30 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406D 0B31 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406D 0B32 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406D 0B33 Attention handler detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 632 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406D 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406D 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406D 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406D 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406D 0BF0 Attention handler detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 406D 0BF1 Attention handler detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 Chapter 5. Error Code to FRU Index 633 Error Code Description Action/ Possible Failing FRU 406D 0BF2 Attention handler detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 406D 0BF3 Attention handler detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 406D 0BF4 Attention handler detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 406D 0BF5 Attention handler detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 406D 0BF6 Attention handler detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 406D 0BF7 Attention handler detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 406D 0BF8 Attention handler detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 634 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406D 0BF9 Attention handler detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 406D 0BFA Attention handler detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 406D 0BFB Attention handler detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 406D 0BFC Attention handler detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 406D 0BFD Attention handler detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 406D 0C03 Attention handler detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 406D 0C09 Attention handler detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. 406D 0C0A Attention handler detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 406D 0C10 Attention handler detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406D 0C11 Attention handler detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406D 0C12 Attention handler detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406D 0C13 Attention handler detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406D 0CA5 Attention handler detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. Chapter 5. Error Code to FRU Index 635 Error Code Description Action/ Possible Failing FRU 406D 0CA6 Attention handler detected no usable main storage Reboot the system in slow mode to isolate the failure. 406D 0CA9 Attention handler detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 406D 0D00 Attention handler detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 406D 0D01 Attention handler detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 406D 0D02 Attention handler detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 406D 0D03 Attention handler detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 406D 0D04 Attention handler detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 406D 0D05 Attention handler detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 406D 0D06 Attention handler detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 406D 0D07 Attention handler detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 406D 0D08 Attention handler detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 406D 0D09 Attention handler detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 406D 0D0A Attention handler detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 406D 0D0B Attention handler detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 406D 0D0C Attention handler detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 406D 0D0D Attention handler detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 406D 0D0E Attention handler detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 406D 0D0F Attention handler detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 636 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406D 0EA0 Attention handler detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 406D 0EA1 Attention handler detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 406D 0EA2 Attention handler detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 406D 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 406D 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 406D 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 406D 0EAF Attention handler detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 406D 0EB0 Attention handler detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 637 Error Code Description Action/ Possible Failing FRU 406D 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406D 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 406D 0EE1 Attention handler detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 406D 0EE2 Attention handler detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406D 0EE5 Attention handler detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 406D 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406D 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406E 0001 Scan control routineIPL detecteds detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0002 Scan control routines detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0003 Scan control routines detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406E 0004 Scan control routines detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406E 0005 Scan control routines detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406E 0006 Scan control routines detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 638 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406E 0007 Scan control routines detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 406E 000A Scan control routines detected fault in primary I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 000B Scan control routines detected fault in secondary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 000C Scan control routines detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 000D Scan control routines detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 406E 0010 Scan control routines detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 406E 0011 Scan control routines detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 406E 0012 Scan control routines detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 406E 0013 Scan control routines detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 406E 0014 Scan control routines detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 406E 0015 Scan control routines detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 406E 0016 Scan control routines detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 406E 0017 Scan control routines detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 406E 00B0 Scan control routines detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 406E 00B1 Scan control routines detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter 5. Error Code to FRU Index 639 Error Code Description Action/ Possible Failing FRU 406E 00B3 Scan control routines detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406E 00B4 Scan control routines detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406E 00B5 Scan control routines detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406E 00B6 Scan control routines detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406E 00B7 Scan control routines detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 640 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406E 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406E 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 641 Error Code Description Action/ Possible Failing FRU 406E 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406E 0B00 Scan control routines detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406E 0B01 Scan control routines detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406E 0B02 Scan control routines detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406E 0B03 Scan control routines detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 642 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406E 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406E 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406E 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406E 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406E 0B20 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406E 0B21 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406E 0B22 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 643 Error Code Description Action/ Possible Failing FRU 406E 0B23 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406E 0B24 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406E 0B25 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406E 0B26 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406E 0B27 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406E 0B30 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406E 0B31 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 644 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406E 0B32 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406E 0B33 Scan control routines detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406E 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406E 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406E 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406E 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 645 Error Code Description Action/ Possible Failing FRU 406E 0BF0 Scan control routines detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 406E 0BF1 Scan control routines detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 406E 0BF2 Scan control routines detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 406E 0BF3 Scan control routines detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 406E 0BF4 Scan control routines detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 406E 0BF5 Scan control routines detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 406E 0BF6 Scan control routines detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 646 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406E 0BF7 Scan control routines detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 406E 0BF8 Scan control routines detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 406E 0BF9 Scan control routines detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 406E 0BFA Scan control routines detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 406E 0BFB Scan control routines detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 406E 0BFC Scan control routines detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 406E 0BFD Scan control routines detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 406E 0C03 Scan control routines detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 406E 0C09 Scan control routines detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter 5. Error Code to FRU Index 647 Error Code Description Action/ Possible Failing FRU 406E 0C0A Scan control routines detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 406E 0C10 Scan control routines detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406E 0C11 Scan control routines detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406E 0C12 Scan control routines detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406E 0C13 Scan control routines detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406E 0CA5 Scan control routines detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 406E 0CA6 Scan control routines detected no usable main storage Reboot the system in slow mode to isolate the failure. 406E 0CA9 Scan control routines detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 406E 0D00 Scan control routines detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 406E 0D01 Scan control routines detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 406E 0D02 Scan control routines detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 406E 0D03 Scan control routines detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 406E 0D04 Scan control routines detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 406E 0D05 Scan control routines detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 406E 0D06 Scan control routines detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 406E 0D07 Scan control routines detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 406E 0D08 Scan control routines detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 406E 0D09 Scan control routines detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 406E 0D0A Scan control routines detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 406E 0D0B Scan control routines detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 406E 0D0C Scan control routines detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 406E 0D0D Scan control routines detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 406E 0D0E Scan control routines detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 648 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406E 0D0F Scan control routines detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 406E 0EA0 Scan control routines detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 406E 0EA1 Scan control routines detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 406E 0EA2 Scan control routines detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 406E 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 406E 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 406E 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 406E 0EAF Scan control routines detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 406E 0EB0 Scan control routines detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 649 Error Code Description Action/ Possible Failing FRU 406E 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406E 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 406E 0EE1 Scan control routines detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 406E 0EE2 Scan control routines detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406E 0EE5 Scan control routines detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 406E 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406E 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406F 0001 Error code generation/logging detected clock card assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0002 Error code generation/logging detected backplane assembly failure Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0003 Error code generation/logging detected fault in MCM module 0 MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406F 0004 Error code generation/logging detected fault in MCM module 2 MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406F 0005 Error code generation/logging detected fault in MCM module 1 MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 650 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406F 0006 Error code generation/logging detected fault in MCM module 3 MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406F 0007 Error code generation/logging detected fault in L3 VPD card 1. L3 VPD cardLocation: U1.18-P1-H2.1 (Order the L3 cache module FRU to obtain a new L3 VPD card. Return the L3 cache module that was included in the FRU kit, and the failing L3 VPD card as ?new defective?.) 2. Primary I/O bookLocation: U1.18-P1-H2 406F 000A Error code generation/logging detected fault in I/O book 1 (slot 0) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 000B Error code generation/logging detected fault in primary I/O book (slot 1) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 000C Error code generation/logging detected fault in I/O book 3 (slot 2) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 000D Error code generation/logging detected fault in I/O book 4 (slot 3) Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 000E CUoD capacity card 1. Replace the CUoD capacity cardLocation: U1.18-P1-H2.2 2. Ask the customer to reenter the CUoD activation keys. 406F 0010 Error code generation/logging detected fault in Memory book (slot 0) Memory book (slot 0) Location: U1.18-P1-M2 406F 0011 Error code generation/logging detected fault in Memory book (slot 1) Memory book (slot 1) Location: U1.18-P1-M3 406F 0012 Error code generation/logging detected fault in Memory book (slot 2) Memory book (slot 2) Location: U1.18-P1-M7 406F 0013 Error code generation/logging detected fault in Memory book (slot 3) Memory book (slot 3)Location: U1.18-P1-M6 406F 0014 Error code generation/logging detected fault in Memory book (slot 4) Memory book (slot 4)Location: U1.18-P1-M4 406F 0015 Error code generation/logging detected fault in Memory book (slot 5) Memory book (slot 5)Location: U1.18-P1-M8 406F 0016 Error code generation/logging detected fault in Memory book (slot 6) Memory book (slot 6)Location: U1.18-P1-M5 406F 0017 Error code generation/logging detected fault in Memory book (slot 7) Memory book (slot 7)Location: U1.18-P1-M1 406F 00B0 Error code generation/logging detected possible problem with bus controller or CSP code 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. 406F 00B1 Error code generation/logging detected possible problem with bus controller 1. Check for system firmware updates. 2. Primary I/O book, Location: U1.18-P1-H2 3. Call service support. Chapter 5. Error Code to FRU Index 651 Error Code Description Action/ Possible Failing FRU 406F 00B3 Error code generation/logging detected MCM 0/VPD problem 1. Check for system firmware updates. 2. MCM module 0 and VPD cardLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406F 00B4 Error code generation/logging detected MCM 2/VPD problem 1. Check for system firmware updates. 2. MCM module 2 and VPD cardLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406F 00B5 Error code generation/logging detected MCM 1/VPD problem 1. Check for system firmware updates. 2. MCM module 1 and VPD cardLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406F 00B6 Error code generation/logging detected MCM 3/VPD problem 1. Check for system firmware updates. 2. MCM module 3 and VPD cardLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406F 00B7 Error code generation/logging detected hardware problem in the processor subsystem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0A03 Reboot message (run-time array gard failure) for processor 0 Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 652 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406F 0A04 Reboot message (run-time array gard failure) for processor 2 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406F 0A05 Reboot message (run-time array gard failure) for processor 1 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. Chapter 5. Error Code to FRU Index 653 Error Code Description Action/ Possible Failing FRU 406F 0A06 Reboot message (run-time array gard failure) for processor 3 Reboot the system in slow mode. Run diagnostics, then go to MAP 0235 in the .Array bit steering, which is a function in the firmware, may be able to correct this problem without replacing hardware. If AIX is available on the system, go to MAP 0235 in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If Linux is the only operating system on the system, schedule deferred maintenance with the customer. When it is possible to reboot the system, do the following: 1. Using the service processor’s system power control menu, disable fast mode boot. (Disabling fast mode automatically enables slow mode.) 2. Boot the AIX diagnostic CD-ROM. 3. Run diagnostics in problem determination mode. If no errors are recorded against any of the processors (devices with a resource name of procx), array bit steering was successful and the problem has been corrected.If errors are reported against the hardware, follow the actions for the error code or SRN. 406F 0B00 Error code generation/logging detected fault in I/O book (slot 0 RIO port 0) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406F 0B01 Error code generation/logging detected fault in I/O book (slot 0 RIO port 1) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406F 0B02 Error code generation/logging detected fault in I/O book (slot 0 RIO port 2) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 406F 0B03 Error code generation/logging detected fault in I/O book (slot 0 RIO port 3) 1. Check for system firmware updates. 2. Check the I/O cabling between the primary I/O book and the I/O subsystems. 3. Primary I/O book Location: U1.18-P1-H2 4. Call service support. 654 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406F 0B10 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406F 0B11 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406F 0B12 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406F 0B13 Processor subsystem tests detected fault in SNI adapter 1. Check for system firmware updates. 2. Check the cabling between the SNI adapter book and the High Performance Switch. 3. SNI adapter bookLocation: U1.18-P1-H1 4. Call service support. 406F 0B20 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406F 0B21 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406F 0B22 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. Chapter 5. Error Code to FRU Index 655 Error Code Description Action/ Possible Failing FRU 406F 0B23 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406F 0B24 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406F 0B25 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406F 0B26 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406F 0B27 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the secondary I/O book and the I/O subsystems. 3. Secondary I/O book Location: U1.18-P1-H3 4. Call service support. 406F 0B30 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406F 0B31 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 656 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406F 0B32 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406F 0B33 Error code generation/logging detected fault in I/O book 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems. 3. Third I/O book Location: U1.18-P1-H4 4. Call service support. 406F 0B34 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406F 0B35 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406F 0B36 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. 406F 0B37 Error code generation/logging detected fault in I/O book or SNI adapter 1. Check for system firmware updates. 2. Check the I/O cabling between the third I/O book and the I/O subsystems, or the SNI adapter and the High Performance Switch, depending on which adapter is installed in location U1.18-P1-H4. 3. Third I/O book or SNI adapterLocation: U1.18-P1-H4 4. Call service support. Chapter 5. Error Code to FRU Index 657 Error Code Description Action/ Possible Failing FRU 406F 0BF0 Error code generation/logging detected fault in I/O hub chip 0 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q1 b. U1.18-P1-H2/Q22. Replace the primary I/O book, Location: U1.18-P1-H2 406F 0BF1 Error code generation/logging detected fault in I/O hub chip 1 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H2/Q3 b. U1.18-P1-H2/Q42. Replace the primary I/O book, Location: U1.18-P1-H2 406F 0BF2 Error code generation/logging detected fault in I/O hub chip 2 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q3 b. U1.18-P1-H1/Q42. Replace the RIO book, Location: U1.18-P1-H1 406F 0BF3 Error code generation/logging detected fault in I/O hub chip 3 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q1 b. U1.18-P1-H1/Q22. Replace the RIO book, Location: U1.18-P1-H1 406F 0BF4 Error code generation/logging detected fault in I/O hub chip 4 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q7 b. U1.18-P1-H3/Q82. Replace the RIO book, Location: U1.18-P1-H3 406F 0BF5 Error code generation/logging detected fault in I/O hub chip 5 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q5 b. U1.18-P1-H3/Q62. Replace the RIO book, Location: U1.18-P1-H3 406F 0BF6 Error code generation/logging detected fault in I/O hub chip 6 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q5 b. U1.18-P1-H4/Q62. Replace the RIO book, Location: U1.18-P1-H4 658 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406F 0BF7 Error code generation/logging detected fault in I/O hub chip 7 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q7 b. U1.18-P1-H4/Q82. Replace the RIO book, Location: U1.18-P1-H4 406F 0BF8 Error code generation/logging detected fault in I/O hub chip 8 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q7 b. U1.18-P1-H1/Q82. Replace the RIO book, Location: U1.18-P1-H1 406F 0BF9 Error code generation/logging detected fault in I/O hub chip 9 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H1/Q5 b. U1.18-P1-H1/Q62. Replace the RIO book, Location: U1.18-P1-H1 406F 0BFA Error code generation/logging detected fault in I/O hub chip 10 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q3 b. U1.18-P1-H3/Q42. Replace the RIO book, Location: U1.18-P1-H3 406F 0BFB Error code generation/logging detected fault in I/O hub chip 11 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H3/Q1 b. U1.18-P1-H3/Q22. Replace the RIO book, Location: U1.18-P1-H3 406F 0BFC Error code generation/logging detected fault in I/O hub chip 12 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q1 b. U1.18-P1-H4/Q22. Replace the RIO book, Location: U1.18-P1-H4. 406F 0BFD Error code generation/logging detected fault in I/O hub chip 13 (port x) 1. Check the cabling on the following RIO ports: a. U1.18-P1-H4/Q3 b. U1.18-P1-H4/Q42. Replace the RIO book, Location: U1.18-P1-H4 406F 0C03 Error code generation/logging detected fault in all MCMs Go to “MAP 1543: MCM Module Problem Isolation” on page 337. 406F 0C09 Error code generation/logging detected fault in all memory books Go to “MAP 1546: Memory Book Problem Isolation” on page 345. Chapter 5. Error Code to FRU Index 659 Error Code Description Action/ Possible Failing FRU 406F 0C0A Error code generation/logging detected fault in all I/O books Go to “MAP 1547: I/O Book Problem Isolation” on page 348. 406F 0C10 Error code generation/logging detected mismatch between MCM 0 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C1 and U1.18-P1-H2.3 406F 0C11 Error code generation/logging detected mismatch between MCM 2 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C4 and U1.18-P1-H2.4 406F 0C12 Error code generation/logging detected mismatch between MCM 1 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C3 and U1.18-P1-H2.5 406F 0C13 Error code generation/logging detected mismatch between MCM 3 and VPD card Plugging error with FRU kitLocation: U1.18-P1-C2 and U1.18-P1-H2.6 406F 0CA5 Error code generation/logging detected invalid processor subsystem configuration Verify that MCMs, memory books, and L3 modules are correctly configured. 406F 0CA6 Error code generation/logging detected no usable main storage Reboot the system in slow mode to isolate the failure. 406F 0CA9 Error code generation/logging detected configuration error with processor memory Reboot the system in slow mode to isolate the failure. 406F 0D00 Error code generation/logging detected fault in L3 cache module 0 L3 cache module 0 and L3 VPD cardLocation: U1.18-P1-C6 and U1.18-P1-H2.1 406F 0D01 Error code generation/logging detected fault in L3 cache module 1 L3 cache module 1 and L3 VPD card Location: U1.18-P1-C7 and U1.18-P1-H2.1 406F 0D02 Error code generation/logging detected fault in L3 cache module 2 L3 cache module 2 and L3 VPD card Location: U1.18-P1-C11 and U1.18-P1-H2.1 406F 0D03 Error code generation/logging detected fault in L3 cache module 3 L3 cache module 3 and L3 VPD card Location: U1.18-P1-C10 and U1.18-P1-H2.1 406F 0D04 Error code generation/logging detected fault in L3 cache module 4 L3 cache module 4 and L3 VPD card Location: U1.18-P1-C8 and U1.18-P1-H2.1 406F 0D05 Error code generation/logging detected fault in L3 cache module 5 L3 cache module 5 and L3 VPD card Location: U1.18-P1-C20 and U1.18-P1-H2.1 406F 0D06 Error code generation/logging detected fault in L3 cache module 6 L3 cache module 6 and L3 VPD card Location: U1.18-P1-C16 and U1.18-P1-H2.1 406F 0D07 Error code generation/logging detected fault in L3 cache module 7 L3 cache module 7 and L3 VPD card Location: U1.18-P1-C12 and U1.18-P1-H2.1 406F 0D08 Error code generation/logging detected fault in L3 cache module 8 L3 cache module 8 and L3 VPD card Location: U1.18-P1-C19 and U1.18-P1-H2.1 406F 0D09 Error code generation/logging detected fault in L3 cache module 9 L3 cache module 9 and L3 VPD card Location: U1.18-P1-C18 and U1.18-P1-H2.1 406F 0D0A Error code generation/logging detected fault in L3 cache module 10 L3 cache module 10 and L3 VPD card Location: U1.18-P1-C14 and U1.18-P1-H2.1 406F 0D0B Error code generation/logging detected fault in L3 cache module 11 L3 cache module 11 and L3 VPD card Location: U1.18-P1-C15 and U1.18-P1-H2.1 406F 0D0C Error code generation/logging detected fault in L3 cache module 12 L3 cache module 12 and L3 VPD card Location: U1.18-P1-C17 and U1.18-P1-H2.1 406F 0D0D Error code generation/logging detected fault in L3 cache module 13 L3 cache module 13 and L3 VPD card Location: U1.18-P1-C5 and U1.18-P1-H2.1 406F 0D0E Error code generation/logging detected fault in L3 cache module 14 L3 cache module 14 and L3 VPD card Location: U1.18-P1-C9 and U1.18-P1-H2.1 660 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 406F 0D0F Error code generation/logging detected fault in L3 cache module 15 L3 cache module 15 and L3 VPD card Location: U1.18-P1-C13 and U1.18-P1-H2.1 406F 0EA0 Error code generation/logging detected fault in MOPS service processor code 1. Access the service processor error log. Find the error log entry for the 406x 0EA0 error. Look for a hardware error of the form 450x yyyy, 460x yyyy, or 4B2x yyyy with a timestamp within a few minutes of the 406x 0EA0 error. Follow the actions for this hardware error. 2. If there is no hardware error in the service processor error log with a timestamp that is close to the 406x 0EA0 error, or following the actions for the hardware error does not resolve the problem, check for platform firmware updates. If an update is available, apply it. 3. If a platform firmware update is not available, or the update does not correct the problem, call service support. 406F 0EA1 Error code generation/logging detected fault in processor subsystem code on main processor 1. Check for operating system updates. 2. Call service support. 406F 0EA2 Error code generation/logging detected fault in PRD code on service processor 1. Check for system firmware updates. 2. Call service support. 406F 0EA4 The service processor firmware detected a hardware problem but was unable to isolate the source. Call service support. 406F 0EA5 PRD detected a problem in the FNM code 1. Check for system firmware updates. 2. Call service support. 406F 0EAA PRD: FNM is handling this error 1. Check for system firmware updates. 2. Call service support. 406F 0EAF Error code generation/logging detected unknown service processor code and system type 1. Check for system firmware updates. 2. Call service support. 406F 0EB0 Error code generation/logging detected JTAG problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. Chapter 5. Error Code to FRU Index 661 Error Code Description Action/ Possible Failing FRU 406F 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. 406F 0EBF Memory error 1. Reboot the system in slow mode to the SMS menus. See “Performing Slow Boot” on page 413. 2. Examine the service processor error log. v If a CEC subsystem error (a 4xxx yyyy error) occurred during the slow boot that calls out the memory, processors, L3 cache modules, or the primary I/O book, replace the part indicated by the location code. v If a CEC subsystem error (a 4xxx yyyy error) did not occur during the slow boot, look for a 4xxx yyyy error immediately preceding the 406x 0EBF error. Replace the part indicated by the location code associated with that error. 406F 0EE1 Error code generation/logging detected fault in service processor code (invalid MRU to mapping) 1. Check for system firmware updates. 2. Call service support. 406F 0EE2 Error code generation/logging detected fault in service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406F 0EE5 Error code generation/logging detected firmware error; no callout available 1. Check for system firmware updates. 2. Call service support. 406F 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 406F 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4503 25D9 This error code indicates that the memory repeat gard function has deconfigured an L3 cache module. Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1 4503 25E7 This error code indicates that the memory repeat gard function has deconfigured an L3 cache module. Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1 662 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 25E8 This error code indicates that the memory repeat gard function has deconfigured an L3 cache module. Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1 4503 25F1 This error code indicates that the memory repeat gard function has deconfigured an L3 cache module. Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1 4503 25F9 This error code indicates that the memory repeat gard function has deconfigured an L3 cache module. Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1 4503 269A This error code indicates that the memory repeat gard function has deconfigured an L3 cache module. Replace the L3 cache module indicated by the location code reported with the error code, and the L3 VPD card, location: U1.18-P1-H2.1 4503 302A This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter 5. Error Code to FRU Index 663 Error Code Description Action/ Possible Failing FRU 4503 302B This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 302C This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 664 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 302D This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 305A This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter 5. Error Code to FRU Index 665 Error Code Description Action/ Possible Failing FRU 4503 305B This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 305C This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 666 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 305D This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 305E This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter 5. Error Code to FRU Index 667 Error Code Description Action/ Possible Failing FRU 4503 305F This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 307A This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 668 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 307B This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 307C This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter 5. Error Code to FRU Index 669 Error Code Description Action/ Possible Failing FRU 4503 307D This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 307E This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 670 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 308A This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 308B This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter 5. Error Code to FRU Index 671 Error Code Description Action/ Possible Failing FRU 4503 308C This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 308D This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 672 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 308E This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 30BD This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter 5. Error Code to FRU Index 673 Error Code Description Action/ Possible Failing FRU 4503 30C0 This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 30C1 This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 674 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 30C3 This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 30C6 This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. Chapter 5. Error Code to FRU Index 675 Error Code Description Action/ Possible Failing FRU 4503 30C9 This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4503 30CA This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 676 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4503 30CC This error code indicates that the memory repeat gard function has deconfigured or partially deconfigured a memory book. Do the following: 1. Go to the memory configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the books. If any of the books have a status of ?manually deconfigured?, reconfigure the books and boot the system in a full system partition to the SMS menus. Shut down the system and look again at the status of the books. If all of the books are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the books have a status of ?deconfigured by the system?, either before or after the boot to the SMS menus, go to step 2. 2. Replace the memory book specified by the location code. 4504 248B Memory subsystem and processor subsystem tests detected fault in operator panel Operator panel Location: U1.17-L1 4504 25C0 Memory subsystem and processor subsystem tests detected fault in system backplane System backplane Location: U1.18-P1 4504 25C1 Memory subsystem and processor subsystem tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4504 25D9 Memory subsystem and processor subsystem tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4504 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4504 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4504 25F1 Memory subsystem and processor subsystem tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4504 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4504 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4504 28B0 Memory subsystem and processor subsystem tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4504 28B1 Memory subsystem and processor subsystem tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4504 302A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340 to isolate the failing component. Chapter 5. Error Code to FRU Index 677 Error Code Description Action/ Possible Failing FRU 4504 302B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342 to isolate the failing component. 4504 302C Memory subsystem and processor subsystem tests detected fault in 32 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340 to isolate the failing component. 4504 302D Memory subsystem and processor subsystem tests detected fault in 32 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342 to isolate the failing component. 4504 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 305C Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 305D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 305E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 305F Memory subsystem and processor subsystem tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 678 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4504 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB Capacity Upgrade on Demand (CUoD) inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 30C0 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB Capacity Upgrade on Demand (CUoD) inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 30C1 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB Capacity Upgrade on Demand (CUoD) inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4504 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 30C9 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 30CA Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4504 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 248B Memory subsystem and display alter tests detected fault in operator panel Operator panel Location: U1.17-L1 4505 25C0 Memory subsystem and display alter tests detected fault in system backplane System backplane Location: U1.18-P1 4505 25C1 Memory subsystem and display alter tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4505 25D9 Memory subsystem and display alter tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4505 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4505 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4505 25F1 Memory subsystem and display alter tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4505 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4505 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4505 28B0 Memory subsystem and display alter tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code Chapter 5. Error Code to FRU Index 679 Error Code Description Action/ Possible Failing FRU 4505 28B1 Memory subsystem and display alter tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4505 302A Memory subsystem and display alter tests detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4505 302B Memory subsystem and display alter tests detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4505 302C Memory subsystem and display alter tests detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4505 302D Memory subsystem and display alter tests detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4505 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 305C Memory subsystem and display alter tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 305D Memory subsystem and display alter tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 305E Memory subsystem and display alter tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 305F Memory subsystem and display alter tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 680 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4505 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4505 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4505 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 248B Memory subsystem and scan interface basic assurance tests detected fault in operator panel Operator panel Location: U1.17-L1 4506 25C0 Memory subsystem and scan interface basic assurance tests detected fault in system backplane System backplane Location: U1.18-P1 4506 25C1 Memory subsystem and scan interface basic assurance tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4506 25D9 Memory subsystem and scan interface basic assurance tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4506 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4506 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4506 25F1 Memory subsystem and scan interface basic assurance tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4506 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4506 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4506 28B0 Memory subsystem and scan interface basic assurance tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4506 28B1 Memory subsystem and scan interface basic assurance tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4506 302A Memory subsystem and scan interface basic assurance tests detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4506 302B Memory subsystem and scan interface basic assurance tests detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter 5. Error Code to FRU Index 681 Error Code Description Action/ Possible Failing FRU 4506 302C Memory subsystem and scan interface basic assurance tests detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4506 302D Memory subsystem and scan interface basic assurance tests detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4506 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 305C Memory subsystem and scan interface basic assurance tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 305D Memory subsystem and scan interface basic assurance tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 305E Memory subsystem and scan interface basic assurance tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 305F Memory subsystem and scan interface basic assurance tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 45064 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 682 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4506 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4506 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4506 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 248B Memory subsystem and bus tests detected fault in operator panel Operator panel Location: U1.17-L1 4507 25C0 Memory subsystem and bus tests detected fault in system backplane System backplane Location: U1.18-P1 4507 25C1 Memory subsystem and bus tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4507 25D9 Memory subsystem and bus tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4507 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4507 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4507 25F1 Memory subsystem and bus tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4507 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4507 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4507 28B0 Memory subsystem and bus tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4507 28B1 Memory subsystem and bus tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4507 302A Memory subsystem and bus tests detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4507 302B Memory subsystem and bus tests detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4507 302C Memory subsystem and bus tests detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4507 302D Memory subsystem and bus tests detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4507 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. Chapter 5. Error Code to FRU Index 683 Error Code Description Action/ Possible Failing FRU 4507 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 305C Memory subsystem and bus tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 305D Memory subsystem and bus tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 305E Memory subsystem and bus tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 305F Memory subsystem and bus tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4507 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4507 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 248B Memory subsystem and other bus tests detected fault in operator panel Operator panel Location: U1.17-L1 684 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4508 25C0 Memory subsystem and other bus tests detected fault in system backplane System backplane Location: U1.18-P1 4508 25C1 Memory subsystem and other bus tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4508 25D9 Memory subsystem and other bus tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4508 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4508 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4508 25F1 Memory subsystem and other bus tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4508 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4508 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4508 28B0 Memory subsystem and other bus tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4508 28B1 Memory subsystem and other bus tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4508 302A Memory subsystem and other bus tests detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4508 302B Memory subsystem and other bus tests detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4508 302C Memory subsystem and other bus tests detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4508 302D Memory subsystem and other bus tests detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4508 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 305C Memory subsystem and other bus tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 305D Memory subsystem and other bus tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 305E Memory subsystem and other bus tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 305F Memory subsystem and other bus tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. Chapter 5. Error Code to FRU Index 685 Error Code Description Action/ Possible Failing FRU 4508 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 45084 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4508 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4508 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 248B Memory subsystem and memory tests detected fault in operator panel Operator panel Location: U1.17-L1 4509 25C0 Memory subsystem and memory tests detected fault in system backplane System backplane Location: U1.18-P1 4509 25C1 Memory subsystem and memory tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4509 25D9 Memory subsystem and memory tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4509 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4509 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 686 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4509 25F1 Memory subsystem and memory tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4509 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4509 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 4509 28B0 Memory subsystem and memory tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4509 28B1 Memory subsystem and memory tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4509 302A Memory subsystem and memory tests detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4509 302B Memory subsystem and memory tests detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4509 302C Memory subsystem and memory tests detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4509 302D Memory subsystem and memory tests detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4509 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 305C Memory subsystem and memory tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 305D Memory subsystem and memory tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 305E Memory subsystem and memory tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 305F Memory subsystem and memory tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. Chapter 5. Error Code to FRU Index 687 Error Code Description Action/ Possible Failing FRU 4509 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 4509 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4509 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 248B Memory subsystem and self-tests detected fault in operator panel Operator panel Location: U1.17-L1 450A 25C0 Memory subsystem and self-tests detected fault in system backplane System backplane Location: U1.18-P1 450A 25C1 Memory subsystem and self-tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 450A 25D9 Memory subsystem and self-tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450A 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450A 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450A 25F1 Memory subsystem and self-tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450A 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450A 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450A 28B0 Memory subsystem and self-tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 450A 28B1 Memory subsystem and self-tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 450A 302A Memory subsystem and self-tests detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 688 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 450A 302B Memory subsystem and self-tests detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450A 302C Memory subsystem and self-tests detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450A 302D Memory subsystem and self-tests detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450A 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 305C Memory subsystem and self-tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 305D Memory subsystem and self-tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 305E Memory subsystem and self-tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 305F Memory subsystem and self-tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. Chapter 5. Error Code to FRU Index 689 Error Code Description Action/ Possible Failing FRU 450A 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450A 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450A 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 248B Memory subsystem and wire tests detected fault in operator panel Operator panel Location: U1.17-L1 450B 25C0 Memory subsystem and wire tests detected fault in system backplane System backplane Location: U1.18-P1 450B 25C1 Memory subsystem and wire tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 450B 25D9 Memory subsystem and wire tests detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450B 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450B 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450B 25F1 Memory subsystem and wire tests detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450B 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450B 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450B 28B0 Memory subsystem and wire tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 450B 28B1 Memory subsystem and wire tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 450B 302A Memory subsystem and wire tests detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450B 302B Memory subsystem and wire tests detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450B 302C Memory subsystem and wire tests detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450B 302D Memory subsystem and wire tests detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450B 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 690 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 450B 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 305C Memory subsystem and wire tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 305D Memory subsystem and wire tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 305E Memory subsystem and wire tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 305F Memory subsystem and wire tests detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450B 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450B 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 248B Memory subsystem and IPL detected fault in operator panel Operator panel Location: U1.17-L1 Chapter 5. Error Code to FRU Index 691 Error Code Description Action/ Possible Failing FRU 450C 25C0 Memory subsystem and IPL detected fault in system backplane System backplane Location: U1.18-P1 450C 25C1 Memory subsystem and IPL detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 450C 25D9 Memory subsystem and IPL detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450C 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450C 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450C 25F1 Memory subsystem and IPL detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450C 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450C 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450C 28B0 Memory subsystem and IPL detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 450C 28B1 Memory subsystem and IPL detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 450C 302A Memory subsystem and IPL detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450C 302B Memory subsystem and IPL detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450C 302C Memory subsystem and IPL detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450C 302D Memory subsystem and IPL detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450C 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 305C Memory subsystem and IPL detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 305D Memory subsystem and IPL detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 305E Memory subsystem and IPL detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 305F Memory subsystem and IPL detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 692 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 450C 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450C 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450C 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 248B Memory subsystem and attention handler detected fault in operator panel Operator panel Location: U1.17-L1 450D 25C0 Memory subsystem and attention handler detected fault in system backplane System backplane Location: U1.18-P1 450D 25C1 Memory subsystem and attention handler detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 450D 25D9 Memory subsystem and attention handler detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450D 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450D 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450D 25F1 Memory subsystem and attention handler detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. Chapter 5. Error Code to FRU Index 693 Error Code Description Action/ Possible Failing FRU 450D 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450D 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450D 28B0 Memory subsystem and attention handler detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 450D 28B1 Memory subsystem and attention handler detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 450D 302A Memory subsystem and attention handler detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450D 302B Memory subsystem and attention handler detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450D 302C Memory subsystem and attention handler detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450D 302D Memory subsystem and attention handler detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450D 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 305C Memory subsystem and attention handler detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 305D Memory subsystem and attention handler detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 305E Memory subsystem and attention handler detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 305F Memory subsystem and attention handler detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 694 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 450D 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450D 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450D 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 248B Memory subsystem and scan control routines detected fault in operator panel Operator panel Location: U1.17-L1 450E 25C0 Memory subsystem and scan control routines detected fault in system backplane System backplane Location: U1.18-P1 450E 25C1 Memory subsystem and scan control routines detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 450E 25D9 Memory subsystem and scan control routines detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450E 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450E 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450E 25F1 Memory subsystem and scan control routines detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450E 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450E 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450E 28B0 Memory subsystem and scan control routines detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 450E 28B1 Memory subsystem and scan control routines detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code Chapter 5. Error Code to FRU Index 695 Error Code Description Action/ Possible Failing FRU 450E 302A Memory subsystem and scan control routines detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450E 302B Memory subsystem and scan control routines detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450E 302C Memory subsystem and scan control routines detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450E 302D Memory subsystem and scan control routines detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450E 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 305C Memory subsystem and scan control routines detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 305D Memory subsystem and scan control routines detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 305E Memory subsystem and scan control routines detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 305F Memory subsystem and scan control routines detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 696 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 450E 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450E 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450E 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 248B Memory subsystem and error code generation/logging detected fault in operator panel Operator panel Location: U1.17-L1 450F 25C0 Memory subsystem and error code generation/logging detected fault in system backplane System backplane Location: U1.18-P1 450F 25C1 Memory subsystem and error code generation/logging detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 450F 25D9 Memory subsystem and error code generation/logging detected fault in L3 VPD card used with 1.1 GHz MCM and 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450F 25E7 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450F 25E8 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450F 25F1 Memory subsystem and error code generation/logging detected fault in L3 VPD card used with 1.3 GHz MCM Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450F 25F9 Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450F 269A Fault in L3 VPD card used with 1.5 GHz and 1.7 GHz MCMs Go to “MAP 1548: Memory and Processor Problem Isolation” on page 350. 450F 28B0 Memory subsystem and error code generation/logging detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 450F 28B1 Memory subsystem and error code generation/logging detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 450F 302A Memory subsystem and error code generation/logging detected fault in 8 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450F 302B Memory subsystem and error code generation/logging detected fault in 8 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter 5. Error Code to FRU Index 697 Error Code Description Action/ Possible Failing FRU 450F 302C Memory subsystem and error code generation/logging detected fault in 32 GB memory book (inner) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450F 302D Memory subsystem and error code generation/logging detected fault in 32 GB memory book (outer) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 450F 305A Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (inner) Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 305B Memory subsystem and processor subsystem tests detected fault in 8 GB memory book (outer) Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 305C Memory subsystem and error code generation/logging detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 305D Memory subsystem and error code generation/logging detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 305E Memory subsystem and error code generation/logging detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 305F Memory subsystem and error code generation/logging detected fault in 16 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 307A Memory subsystem and processor subsystem tests detected fault in 4 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 307B Memory subsystem and processor subsystem tests detected fault in 8 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 307C Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 307D Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 307E Memory subsystem and processor subsystem tests detected fault in 16 GB inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 308A Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 308B Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 308C Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 308D Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 308E Memory subsystem and processor subsystem tests detected fault in 4 GB outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 30BD Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 698 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 450F 30C3 Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD inner memory book Go to “MAP 1544: Inner L3 Module Problem Isolation” on page 340. 450F 30C6 Memory subsystem and processor subsystem tests detected fault in 8 GB or 16 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 450F 30CC Memory subsystem and processor subsystem tests detected fault in 32 GB or 64 GB CUoD outer memory book Go to “MAP 1545: Outer L3 Module Problem Isolation” on page 342. 4604 248B I/O subsystem and processor subsystem tests detected fault in operator panel Operator panel Location: U1.17-L1 4604 25C0 I/O subsystem and processor subsystem tests detected fault in system backplane System backplane Location: U1.18-P1 4604 25C1 I/O subsystem and processor subsystem tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4604 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4604 2738 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4604 273C I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4604 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4604 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4604 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4604 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4604 28B0 I/O subsystem and processor subsystem tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4604 28B1 I/O subsystem and processor subsystem tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code Chapter 5. Error Code to FRU Index 699 Error Code Description Action/ Possible Failing FRU 4604 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4604 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4604 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4604 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4605 248B I/O subsystem and display alter tests detected fault in operator panel Operator panel Location: U1.17-L1 4605 25C0 I/O subsystem and display alter tests detected fault in system backplane System backplane Location: U1.18-P1 4605 25C1 I/O subsystem and display alter tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4605 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4605 2738 I/O subsystem and display alter tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4605 273C I/O subsystem and display alter tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4605 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4605 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 700 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4605 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4605 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4605 28B0 I/O subsystem and display alter tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4605 28B1 I/O subsystem and display alter tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4605 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4605 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4605 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4605 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4606 248B I/O subsystem and scan interface basic assurance tests detected fault in operator panel Operator panel Location: U1.17-L1 4606 25C0 I/O subsystem and scan interface basic assurance tests detected fault in system backplane System backplane Location: U1.18-P1 Chapter 5. Error Code to FRU Index 701 Error Code Description Action/ Possible Failing FRU 4606 25C1 I/O subsystem and scan interface basic assurance tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4606 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4606 2738 I/O subsystem and scan interface basic assurance tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4606 273C I/O subsystem and scan interface basic assurance tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4606 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4606 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4606 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4606 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4606 28B0 I/O subsystem and scan interface basic assurance tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4606 28B1 I/O subsystem and scan interface basic assurance tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4606 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4606 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 702 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4606 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4606 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4607 248B I/O subsystem and bus tests detected fault in operator panel Operator panel Location: U1.17-L1 4607 25C0 I/O subsystem and bus tests detected fault in system backplane System backplane Location: U1.18-P1 4607 25C1 I/O subsystem and bus tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4607 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4607 2738 I/O subsystem and bus tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4607 273C I/O subsystem and bus tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4607 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4607 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4607 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4607 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4607 28B0 I/O subsystem and bus tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code Chapter 5. Error Code to FRU Index 703 Error Code Description Action/ Possible Failing FRU 4607 28B1 I/O subsystem and bus tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4607 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4607 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4607 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4607 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4608 248B I/O subsystem and other bus tests detected fault in operator panel Operator panel Location: U1.17-L1 4608 25C0 I/O subsystem and other bus tests detected fault in system backplane System backplane Location: U1.18-P1 4608 25C1 I/O subsystem and other bus tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4608 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4608 2738 I/O subsystem and other bus tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4608 273C I/O subsystem and other bus tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4608 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4608 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 704 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4608 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4608 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4608 28B0 I/O subsystem and other bus tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4608 28B1 I/O subsystem and other bus tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4608 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4608 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4608 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4608 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4609 248B I/O subsystem and memory tests detected fault in operator panel Operator panel Location: U1.17-L1 4609 25C0 I/O subsystem and memory tests detected fault in system backplane System backplane Location: U1.18-P1 4609 25C1 I/O subsystem and memory tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 Chapter 5. Error Code to FRU Index 705 Error Code Description Action/ Possible Failing FRU 4609 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4609 2738 I/O subsystem and memory tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4609 273C I/O subsystem and memory tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4609 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 4609 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 4609 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4609 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 4609 28B0 I/O subsystem and memory tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4609 28B1 I/O subsystem and memory tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4609 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4609 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 706 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4609 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4609 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460A 248B I/O subsystem and self-tests detected fault in operator panel Operator panel Location: U1.17-L1 460A 25C0 I/O subsystem and self-tests detected fault in system backplane System backplane Location: U1.18-P1 460A 25C1 I/O subsystem and self-tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 460A 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460A 2738 I/O subsystem and self-tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460A 273C I/O subsystem and self-tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460A 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460A 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460A 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460A 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460A 28B0 I/O subsystem and self-tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code Chapter 5. Error Code to FRU Index 707 Error Code Description Action/ Possible Failing FRU 460A 28B1 I/O subsystem and self-tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 460A 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460A 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460A 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460A 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460B 248B I/O subsystem and wire tests detected fault in operator panel Operator panel Location: U1.17-L1 460B 25C0 I/O subsystem and wire tests detected fault in system backplane System backplane Location: U1.18-P1 460B 25C1 I/O subsystem and wire tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 460B 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460B 2738 I/O subsystem and wire tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460B 273C I/O subsystem and wire tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460B 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460B 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 708 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 460B 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460B 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460B 28B0 I/O subsystem and wire tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 460B 28B1 I/O subsystem and wire tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 460B 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460B 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460B 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460B 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460C 248B I/O subsystem and IPL detected fault in operator panel Operator panel Location: U1.17-L1 460C 25C0 I/O subsystem and IPL detected fault in system backplane System backplane Location: U1.18-P1 460C 25C1 I/O subsystem and IPL detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 Chapter 5. Error Code to FRU Index 709 Error Code Description Action/ Possible Failing FRU 460C 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460C 2738 I/O subsystem and IPL detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460C 273C I/O subsystem and IPL detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460C 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460C 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460C 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460C 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460C 28B0 I/O subsystem and IPL detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 460C 28B1 I/O subsystem and IPL detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 460C 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460C 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 710 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 460C 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460C 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460D 248B I/O subsystem and attention handler detected fault in operator panel Operator panel Location: U1.17-L1 460D 25C0 I/O subsystem and attention handler detected fault in system backplane System backplane Location: U1.18-P1 460D 25C1 I/O subsystem and attention handler detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 460D 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460D 2738 I/O subsystem and attention handler detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460D 273C I/O subsystem and attention handler detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460D 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460D 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460D 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460D 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460D 28B0 I/O subsystem and attention handler detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code Chapter 5. Error Code to FRU Index 711 Error Code Description Action/ Possible Failing FRU 460D 28B1 I/O subsystem and attention handler detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 460D 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460D 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460D 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460D 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460E 248B I/O subsystem and scan control routines detected fault in operator panel Operator panel Location: U1.17-L1 460E 25C0 I/O subsystem and scan control routines detected fault in system backplane System backplane Location: U1.18-P1 460E 25C1 I/O subsystem and scan control routines detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 460E 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460E 2738 I/O subsystem and scan control routines detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460E 273C I/O subsystem and scan control routines detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460E 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460E 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 712 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 460E 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460E 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460E 28B0 I/O subsystem and scan control routines detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 460E 28B1 I/O subsystem and scan control routines detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 460E 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460E 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460E 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460E 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460F 248B I/O subsystem and error code generation/logging detected fault in operator panel Operator panel Location: U1.17-L1 Chapter 5. Error Code to FRU Index 713 Error Code Description Action/ Possible Failing FRU 460F 25C0 I/O subsystem and error code generation/logging detected fault in system backplane System backplane Location: U1.18-P1 460F 25C1 I/O subsystem and error code generation/logging detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 460F 25E6 SMA 4-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460F 2738 I/O subsystem and error code generation/logging detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460F 273C I/O subsystem and error code generation/logging detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460F 273D I/O subsystem and processor subsystem tests detected fault in primary I/O book - Slot 0 Primary I/O book - Slot 0 Location: U1.18-P1-H2 460F 273F SMA 2-port switch adapter fault Replace the I/O book (switch adapter) specified by the location code. 460F 2776 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460F 2806 I/O subsystem and processor subsystem tests detected fault in I/O book 1, 3, or 4 1. If word 13 is CFF1 xxxx, a configuration problem exists. Ensure that the number of MCMs is consistent with the number of I/O books. 2. Replace I/O book 1, 3, or 4 at location specified in error code. 460F 28B0 I/O subsystem and error code generation/logging detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 460F 28B1 I/O subsystem and error code generation/logging detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 460F 525E CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 714 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 460F 525F CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460F 527A CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 460F 527C CUoD capacity card fault 1. Verify that the system firmware is RH021114 or later. CUoD is not supported by the earlier firmware levels. 2. Replace the CUoD capacity card, location: U1.18-P1-H2.2, then ask the customer to reenter the CUoD activation keys. 4B23 258A The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 2. Chapter 5. Error Code to FRU Index 715 Error Code Description Action/ Possible Failing FRU 4B23 25C7 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 4B23 25C8 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 716 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B23 25CF The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 4B23 25D3 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. Chapter 5. Error Code to FRU Index 717 Error Code Description Action/ Possible Failing FRU 4B23 25D5 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 4B23 25D7 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 718 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B23 25D8 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 4B23 25E5 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. Chapter 5. Error Code to FRU Index 719 Error Code Description Action/ Possible Failing FRU 4B23 25E9 The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 4B23 526B The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 720 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B23 526C The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 4B23 526D The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. Chapter 5. Error Code to FRU Index 721 Error Code Description Action/ Possible Failing FRU 4B23 526E The processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, which is under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of ?manually deconfigured,? reconfigure the processor(s) and boot the system to the SMS menus. Power down the system; look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of ?deconfigured by the system,? either before or after the boot to the SMS menus, replace the processor MCM (on which the deconfigured processor resides) specified by the location code. 4B24 248B Processor subsystem detected fault in operator panel Operator panel Location: U1.17-L1 4B24 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25C0 Processor subsystem detected fault in system backplane System backplane Location: U1.18-P1 4B24 25C1 Processor subsystem detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B24 25C7 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25C8 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25D3 Processor subsystem detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25D5 Processor subsystem detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 722 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B24 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 28B0 Processor subsystem detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B24 28B1 Processor subsystem detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B24 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B24 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 248B Processor subsystem and display alter tests detected fault in operator panel Operator panel Location: U1.17-L1 4B25 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25C0 Processor subsystem and display alter tests detected fault in system backplane System backplane Location: U1.18-P1 4B25 25C1 Processor subsystem and display alter tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B25 25C7 Processor subsystem and display alter tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25C8 Processor subsystem and display alter tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25D3 Processor subsystem and display alter tests detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25D5 Processor subsystem and display alter tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter 5. Error Code to FRU Index 723 Error Code Description Action/ Possible Failing FRU 4B25 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 28B0 Processor subsystem and display alter tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B25 28B1 Processor subsystem and display alter tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B25 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B25 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 248B Processor subsystem and scan interface basic assurance tests detected fault in operator panel Operator panel Location: U1.17-L1 4B26 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25C0 Processor subsystem and scan interface basic assurance tests detected fault in system backplane System backplane Location: U1.18-P1 4B26 25C1 Processor subsystem and scan interface basic assurance tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B26 25C7 Processor subsystem and scan interface basic assurance tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25C8 Processor subsystem and scan interface basic assurance tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25D3 Processor subsystem and scan interface basic assurance tests detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 724 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B26 25D5 Processor subsystem and scan interface basic assurance tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 25E9 Processor subsystem detected fault in 84-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 28B0 Processor subsystem and scan interface basic assurance tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B26 28B1 Processor subsystem and scan interface basic assurance tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B26 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B26 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 248B Processor subsystem and bus tests detected fault in operator panel Operator panel Location: U1.17-L1 4B27 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25C0 Processor subsystem and bus tests detected fault in system backplane System backplane Location: U1.18-P1 4B27 25C1 Processor subsystem and bus tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B27 25C7 Processor subsystem and bus tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25C8 Processor subsystem and bus tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter 5. Error Code to FRU Index 725 Error Code Description Action/ Possible Failing FRU 4B27 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25D3 Processor subsystem and bus tests detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25D5 Processor subsystem and bus tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 28B0 Processor subsystem and bus tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B27 28B1 Processor subsystem and bus tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B27 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B27 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 248B Processor subsystem and other bus tests detected fault in operator panel Operator panel Location: U1.17-L1 4B28 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25C0 Processor subsystem and other bus tests detected fault in system backplane System backplane Location: U1.18-P1 4B28 25C1 Processor subsystem and other bus tests detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B28 25C7 Processor subsystem and other bus tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 726 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B28 25C8 Processor subsystem and other bus tests detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25D3 Processor subsystem and other bus tests detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25D5 Processor subsystem and other bus tests detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 28B0 Processor subsystem and other bus tests detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B28 28B1 Processor subsystem and other bus tests detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B28 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B28 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 248B Processor subsystem and memory test detected fault in operator panel Operator panel Location: U1.17-L1 4B29 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25C0 Processor subsystem and memory test detected fault in system backplane System backplane Location: U1.18-P1 4B29 25C1 Processor subsystem and memory test detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 Chapter 5. Error Code to FRU Index 727 Error Code Description Action/ Possible Failing FRU 4B29 25C7 Processor subsystem and memory test detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25C8 Processor subsystem and memory test detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25D3 Processor subsystem and memory test detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25D5 Processor subsystem and memory test detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 28B0 Processor subsystem and memory test detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B29 28B1 Processor subsystem and memory test detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B29 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B29 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 248B Processor subsystem and self test detected fault in operator panel Operator panel Location: U1.17-L1 4B2A 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 728 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B2A 25C0 Processor subsystem and self test detected fault in system backplane System backplane Location: U1.18-P1 4B2A 25C1 Processor subsystem and self test detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B2A 25C7 Processor subsystem and self test detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25C8 Processor subsystem and self test detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25D3 Processor subsystem and self test detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25D5 Processor subsystem and self test detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 28B0 Processor subsystem and self test detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B2A 28B1 Processor subsystem and self test detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B2A 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2A 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 248B Processor subsystem and wire test detected fault in operator panel Operator panel Location: U1.17-L1 Chapter 5. Error Code to FRU Index 729 Error Code Description Action/ Possible Failing FRU 4B2B 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25C0 Processor subsystem and wire test detected fault in system backplane System backplane Location: U1.18-P1 4B2B 25C1 Processor subsystem and wire test detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B2B 25C7 Processor subsystem and wire test detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25C8 Processor subsystem and wire test detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25D3 Processor subsystem and wire test detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25D5 Processor subsystem and wire test detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 28B0 Processor subsystem and wire test detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B2B 28B1 Processor subsystem and wire test detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B2B 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2B 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 730 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B2C 248B Processor subsystem and IPL detected fault in operator panel Operator panel Location: U1.17-L1 4B2C 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25C0 Processor subsystem and IPL detected fault in system backplane System backplane Location: U1.18-P1 4B2C 25C1 Processor subsystem and IPL detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B2C 25C7 Processor subsystem and IPL detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25C8 Processor subsystem and IPL detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25D3 Processor subsystem and IPL detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25D5 Processor subsystem and IPL detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 28B0 Processor subsystem and IPL detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B2C 28B1 Processor subsystem and IPL detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B2C 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2C 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter 5. Error Code to FRU Index 731 Error Code Description Action/ Possible Failing FRU 4B2C 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 248B Processor subsystem and attention handler detected fault in operator panel Operator panel Location: U1.17-L1 4B2D 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25C0 Processor subsystem and attention handler detected fault in system backplane System backplane Location: U1.18-P1 4B2D 25C1 Processor subsystem and attention handler detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B2D 25C7 Processor subsystem and attention handler detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25C8 Processor subsystem and attention handler detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25D3 Processor subsystem and attention handler detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25D5 Processor subsystem and attention handler detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 28B0 Processor subsystem and attention handler detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B2D 28B1 Processor subsystem and attention handler detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B2D 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 732 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B2D 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2D 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 248B Processor subsystem and scan control routines detected fault in operator panel Operator panel Location: U1.17-L1 4B2E 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25C0 Processor subsystem and scan control routines detected fault in system backplane System backplane Location: U1.18-P1 4B2E 25C1 Processor subsystem and scan control routines detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B2E 25C7 Processor subsystem and scan control routines detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25C8 Processor subsystem and scan control routines detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25D3 Processor subsystem and scan control routines detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25D5 Processor subsystem and scan control routines detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 28B0 Processor subsystem and scan control routines detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B2E 28B1 Processor subsystem and scan control routines detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B2E 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter 5. Error Code to FRU Index 733 Error Code Description Action/ Possible Failing FRU 4B2E 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2E 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 248B Processor subsystem and error code generation/logging detected fault in operator panel Operator panel Location: U1.17-L1 4B2F 258A Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25C0 Processor subsystem and error code generation/logging detected fault in system backplane System backplane Location: U1.18-P1 4B2F 25C1 Processor subsystem and error code generation/logging detected fault in 1.0 GHz clock card Clock card Location: U1.18-P1-X5 4B2F 25C7 Processor subsystem and error code generation/logging detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25C8 Processor subsystem and error code generation/logging detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25CF Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25D3 Processor subsystem and error code generation/logging detected fault in 8-way processor MCM with VPD card Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25D5 Processor subsystem and error code generation/logging detected fault in 8-way processor MCM with VPD card (8-, 16-, or 24-way) Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25D7 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25D8 Fault in 4-way processor MCM detected Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25E5 Processor subsystem detected fault in 4-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 25E9 Processor subsystem detected fault in 8-way processor MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 734 406/90 Service Guide Error Code Description Action/ Possible Failing FRU 4B2F 28B0 Processor subsystem and error code generation/logging detected fault in I/O subsystem PCI backplane I/O subsystem PCI backplane at location specified in error code 4B2F 28B1 Processor subsystem and error code generation/logging detected fault in I/O subsystem DASD backplane I/O subsystem DASD backplane at location specified in error code 4B2F 526B Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 526C Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 526D Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. 4B2F 526E Fault detected in 4-way or 8-way CUoD MCM Go to “MAP 1540: Problem Isolation Procedures” on page 308 to isolate the failing component. Chapter 5. Error Code to FRU Index 735 System Firmware Update Messages Error Code Description Action/ Possible Failing FRU A1FD 0000 Firmware update recovery mode entered Informational message A1FD 0001 Service processor flash recovery Insert Diskette #1 A1FD 0002 Service processor flash recovery Insert Diskette #2 A1FD 0003 Service processor flash recovery Insert Diskette #3 A1FD 0004 Service processor flash recovery Insert Diskette #4 A1FD 000n Service processor flash recovery Insert Diskette #n Capacity Upgrade on Demand (CUoD) Messages Error Code Description Action/ Possible Failing FRU A100 C2AC Waiting for user to accept CUoD license agreement User must click to accept license agreement. If the user declines, the IPL will be halted. Scan Dump Messages Error Code Description Action/ Possible Failing FRU A100 3000 Successful completion of scan dump Informational message; may or may not be seen during a scan log dump. A1FF 3000 Scan dump has been disabled Informational message; may or may not be seen during a scan log dump. 736 406/90 Service Guide Common Firmware Error Codes Attention: Follow the procedure defined in “Checkpoints and Error Codes Index” on page 416. If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 318 unless you were already directed to any MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the AIX error log, remember that an error may require an additional 15 minutes to be propagated to the AIX error log. If you replace FRUs and the problem is corrected, go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Error Code Description Action/ Possible Failing FRU B006 1403 Missing service processor resource 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B006 1404 Service processor firmware fault 1. Check for system firmware updates. 2. Primary I/O book Location: U1.18-P1-H2 B006 1405 Service processor firmware fault 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B006 1406 Service processor program error 1. Check for system firmware updates. 2. Check for operating system updates 3. Primary I/O bookLocation: U1.18-P1-H2 B006 1407 Service processor resource allocation problem 1. Check for system firmware updates. 2. Check for operating system updates 3. Primary I/O bookLocation: U1.18-P1-H2 B006 1408 Service processor program error 1. Check for system firmware updates. 2. Check for operating system updates 3. Primary I/O bookLocation: U1.18-P1-H2 B006 1409 Service processor program error 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B10F 1000 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1002 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1005 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1007 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1008 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 100E Service processor boot flash damaged Primary I/O bookLocation: U1.18-P1-H2 B10F 100F Service processor base flash damaged Primary I/O bookLocation: U1.18-P1-H2 Chapter 5. Error Code to FRU Index 737 Error Code Description Action/ Possible Failing FRU B10F 1300 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1370 Service processor flash update failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1380 Service processor program failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1381 Service processor flash error Primary I/O bookLocation: U1.18-P1-H2 B10F 1384 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1387 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1400 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1401 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1670 Service processor flash update failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1672 Service processor firmware update invalid Call second level of support B10F 1675 Service processor memory failure Primary I/O bookLocation: U1.18-P1-H2 B10F 1681 Service processor firmware update invalid Call second level of support B10F 1682 Service processor firmware update invalid Call second level of support B10F 1683 Service processor firmware update invalid Call second level of support B1xx 0013 An attempt was made to flash (update) the firmware from AIX using the wrong image for this system. No flashing took place; the firmware level was not changed. Retry the firmware update from AIX using the correct flash image. B1xx 0577 Capacity upgrade on demand (CUoD) correctable surveillance error Informational message; no action required. B1xx 4600 Service processor failure Action: This code may be informational, or it may indicate a failure in the service processor. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B19E4600, and the word 13 value is 230Dxxxx, this is an informational message. No action is required by the customer or service representative. For all other values of word 11 and word 13, replace the primary I/O book, U1.18-P1-H2. 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4601 System processor failure Reboot the system in slow mode to isolate the failure. 738 406/90 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 4602 Hardware failure 1. Was this error code logged shortly after the system crashed? NO Go to 3. YES Go to 22. Examine the service processor error log. Look for a 4xxx yyyy error that calls out FRU(s) in the CEC whose timestamp just precedes the B1xx 4602 error. If a 4xxx yyyy error occurred, replace the FRU indicated by the location code. If there is no 4xxx yyyy error whose timestamp just precedes the B1xx 4602 error, continue to 3. 3. a. Reboot the system in slow mode to the SMS menus. Refer to “Performing Slow Boot” on page 413. b. Examine the service processor error logs. Look for a 4xxx yyyy error that occurred during the slow boot that calls out FRU(s) in the CEC. v If a 4xxx yyyy error occurred during the slow boot, replace the FRU(s) indicated by the location code(s). v If a 4xxx yyyy error did not occur during the slow boot, replace the primary I/O book, location: U1.18-P1-H2. B1xx 4603 Service processor firmware damaged 1. Reflash system firmware. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4604 Firmware failure Action: This code may be informational, or it may indicate a failure in the firmware. Before taking any action, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B1A14604, and the word 13 value is 2304xxxx, this is an informational message. No action is required by the customer or service representative. For all other values of word 11 and word 13: 1. Check for system firmware updates. Apply if available. 2. Call service support. B1xx 4606 Firmware failure 1. Check for system firmware updates. 2. Operator panelLocation: U1.17-L1 Chapter 5. Error Code to FRU Index 739 Error Code Description Action/ Possible Failing FRU B1xx 4608 Service processor early termination 1. Reset the system by removing, then reapplying power to the processor subsystem. 2. Reflash system firmware. 3. Primary I/O book Location: U1.18-P1-H2 B1xx 460A Time-of-day clock has been reset 1. Set time-of-day clock 2. Replace Primary I/O book Location: U1.18-P1-H2 B1xx 460B NVRAM/Time-of-day battery Replace battery Location: U1.18-P1-H2-V1 B1xx 4611 Service processor failure 1. Reboot the system in slow mode to isolate the problem in the processor subsystem. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4612 Service processor firmware failure 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4613 Service processor failure Primary I/O bookLocation: U1.18-P1-H2 B1xx 4614 Service processor failure related to processor subsystem Reboot the system in slow mode to isolate the problem in the processor subsystem. B1xx 4620 Modem configuration failure 1. Check modem configuration files. 2. Check for system firmware updates. 3. Modem and modem cable. 4. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4621 Modem disconnection failure 1. Check modem configuration files. 2. Check for system firmware updates. 3. Modem and modem cable. 4. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4622 Service processor failure Action: This code may be informational, or it may indicate a failure in the service processor. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B19F4622, and the word 13 value is 230Axxxx, this is an informational message. No action is required by the customer or service representative. For all other values of word 11 and word 13: 1. Operator panel, location: U1.17-L1 2. Check for system firmware updates. Apply if available. 3. Primary I/O book, location: U1.18-P1-H2. 740 406/90 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 4633 System power control network (SPCN) failure Action: This error code may be informational, or it may indicate a failure in the SPCN firmware update. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, see error code “Error Code B1xx 4699” on page 747. If the error code (word 11 value) is B1394633, and the word 13 value is 2309xxxx, this is an informational message. No action is required by the customer or service representative. All of the following values of word 13 (or bytes 68 and 69) indicate a failure of an SPCN firmware update: A704: The SPCN code update has been interrupted; the SPCN firmware update terminates. This is logged in the AIX error log as a firmware event. Action: Stop all partitions. Use the HMC to power down the system. Power on the system and restart all partitions; the SPCN code update will restart and run in the background. A70E: SPCN (system power control network) flash update was successful. Actions: This is an informational message; no action is required by the customer or service representative. It indicates that the SPCN code flash update was successful. A710: SPCN command ?RLI? failed; this indicates a loss of communication with the SPCN controller. This is logged in the AIX error log as a firmware event. Action: Stop all partitions. Use the HMC to power down the system. Remove the 350V of power from the system by putting the UEPO switch into the OFF position. Place the UEPO switch into the ON position. Wait five minutes. Use the HMC to power on the system . Restart at least one partition. The SPCN code update will run in the background. If this error code (word 13 value) reoccurs, contact service support. A711: SPCN command ?DDB? failed; this indicates a loss of communication with the SPCN controller. This is logged in the AIX error log as a firmware event. Actions: Stop all partitions. Use the HMC to power down the system. Remove the 350V of power from the system by placing the UEPO switch into the OFF position. Place the UEPO switch into the ON position. Wait five minutes. Use the HMC to power on the system. Restart at least one partition. The SPCN code update will run in the background. If this error code (word 13 value) reoccurs, contact service support. A712: SPCN command ?QFS? failed. This is logged in the AIX error log as a firmware event. Actions: Stop all partitions. Use the HMC to power down the system. Remove the 350V of power from the system by placing the UEPO switch into the OFF position. Place the UEPO switch into the ON position. Wait five minutes. Use the HMC to power on the system. Restart at least one partition. The SPCN code update will run in the background. If this error code (word 13 value) reoccurs, contact service support. A713: No power subsystem code LID was found in the firmware image; the FRU firmware update terminates. This is logged in the AIX error log as a sysplanar0 error. Actions: Contact service support. A715: A bad power subsystem code LID was found in the firmware image; the FRU firmware update terminates. This is logged in the AIX error log as a sysplanar0 error. Actions: Contact service support.Error code B1xx 4633 continues on to the next row. Chapter 5. Error Code to FRU Index 741 Error Code Description Action/ Possible Failing FRU B1xx 4633Cont’d Continued from previous row. A716: The power subsystem FRU code update was interrupted due to the system being shut down or turned off because of a critical power problem; the FRU firmware update terminates. This is logged into the AIX error log as a firmware event. Actions: Resolve the problem that caused the power shutdown. Power on and reboot at least one partition; the power subsystem code update will run in the background.A701 A702 A703 A705 A706 A707 For these values of word 13, do the following: 1. Power off the system. 2. Collect the service processor error log data for the B1xx 4633 error. From the main service processor menu, enter 3 to access the System Information menu. Select option 3, Service Processor Error Logs. 3. Collect the SPCN trace data. From the main service processor menu, enter 3 to access the System Information menu. Select option 9, the Power Control Network Utilities Menu. Then select option 4, Collect and Display the SPCN Trace Data. 4. Power on the system.Error code B1xx 4633 continues on to the next row. B1xx 4633Cont’d Continued from previous row. 1. Wait 10 minutes after the system has reached the AIX prompt. Examine the AIX error log for new B1xx 4633 errors with the word 13 values (in bytes 68 and 69 in the error log entry). If there are no B1xx 4633 errors with matching time stamps, the SPCN down load was successful. If the download failed again (there is another B1xx 4633 error in the AIX error log with the correct time stamp), do the following: a. Power down the system. b. Collect the service processor error log data for the B1xx 4633 error (see 2). c. Collect the SPCN trace data (see 3). d. Contact service support. For all other values of word 13, or bytes 68 and 69, do the following: 1) Check for system firmware updates. Apply if available. 2) Call service support. B1xx 4634 SPCN Loop Fault 1. Check SPCN cabling. 2. Go to “MAP 1542: I/O Problem Isolation” on page 318. B1xx 4635 AC input was removed, then restored. Informational message. Look at the service processor error log for additional error codes relating to loss of ac power to the system. B1xx 4636 System is running on battery back-up power Informational message. B1xx 4643 Problem when turning on the identify LED of a FRU 1. Use the location code that was provided with the original error code to isolate and replace the failing device. 2. If a problem persists, go to “MAP 154A: I2C Bus Problem Isolation” on page 361. 742 406/90 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 4644 I2C connection failure 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4645 I2C (non-processor) VPD system fault Go to “MAP 1540: Problem Isolation Procedures” on page 308. B1xx 4646 Vital product data (VPD) acquisition failure 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4647 Machine type and model fields missing or invalid in system VPD The machine type and model fields must be rewritten. Call service support. B1xx 4648 Processor subsystem backplane VPD or power source problem Go to “MAP 1540: Problem Isolation Procedures” on page 308. B1xx 4650 System processor failure (all processors reset). Reboot the system in slow mode to isolate the problem. B1xx 4651 CPU GARD VPD on MCM 0 U1.18-P1-C1 and U1.18-P1-H2.3 B1xx 4652 CPU GARD VPD on MCM 1 U1.18-P1-C3 and U1.18-P1-H2.5 B1xx 4653 CPU GARD VPD on MCM 2 U1.18-P1-C4 and U1.18-P1-H2.4 B1xx 4654 CPU GARD VPD on MCM 3 U1.18-P1-C2 and U1.18-P1-H2.6 B1xx 4660 Memory subsystem failure (cannot isolate) Reboot the system in slow mode to isolate the failure, then check the service processor error log for failing FRUs associated with this error code. B1xx 4661 Memory book VPD failure Reboot the system in slow mode to isolate the failure, then check the service processor error log for failing FRUs associated with this error code. B1xx 4662 Memory book VPD failure Reboot the system in slow mode to isolate the failure, then check the service processor error log for failing FRUs associated with this error code. B1xx 4681 JTAG scan system (cabling) 1. Check JTAG cabling. 2. Call service support B1xx 4682 Problem with VPD on operator panel 1. Replace the operator panel,Location: U1.17-L1Do not swap the old VPD module onto the new op panel. Call service support for instructions on programming the new VPD module. 2. Call service support. Chapter 5. Error Code to FRU Index 743 Error Code Description Action/ Possible Failing FRU B1xx 4683 Capacity Upgrade on Demand (CUoD) card fault Action: This error code may be informational, or it may indicate a failure of the CUoD capacity card. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, see error code “Error Code B1xx 4699” on page 747. All of the following values of word 13 (or bytes 68 and 69) indicate that a surveillance error occurred in the capacity upgrade on demand (CUoD) firmware or hardware that was corrected. No action is required by the customer or service representative.C157 C1A0 C1A1 C1A2 C1A3 C201 For all other values of word 13 (or bytes 68 and 69): 1. Replace the CUoD capacity card, location: U1.18-P1-H2.2. See also “Capacity Upgrade on Demand (CUoD) VPD Module” on page 902. If the value of the first two bytes of word 13 in the detailed service processor error log entry, or the value of bytes 68 and 69 in the AIX error log entry, is E196, this indicates the CUoD capacity card installed in the system was previously used in another system. CUoD capacity cards cannot be moved between systems; replace the CUoD capacity card with the one for this system. For detailed instructions on finding word 13 in the service processor error log entry, and bytes 68 and 69 in the AIX error log entry, refer to the action column for error code B1xx 4699. B1xx 4690 Operating system surveillance time out (AIX to service processor interface failure) Action: 1. This error code indicates that the operating system terminated early (which usually implies an operating system crash). This error code may appear in the service processor error log by itself. However, in the AIX error log, there should be another error which points to the cause of the operating system crash. Use the other error as the starting point for your service action. 2. The other possibility is that the operating system was not found during a prior boot attempt. To determine if this occurred, do the following: Look at the AIX error log entry containing B1xx4690. This will be a ?SCAN_ERROR_CHRP? error with an identifier of BFEC0425. In the detail data, find the string ?B1xx4690? (If present, it will be at byte 60 of the detail data.) Then go forward 8 bytes after the ?B1? to byte 68 and look at bytes 68 and 69. If the values of bytes 68 and 69 are A2B0, this indicates that the firmware was unable to find a bootable device in the boot list that is set in the SMS menus. If the system is up, the boot list problem has been corrected and the B1xx 4690 can be treated as an informational message with no actions required. 3. Call service support. 744 406/90 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 4691 Description: System firmware to service processor interface failure. (System firmware surveillance time out) Action: This code may be informational, or it may indicate a system firmware to service processor interface failure. Before changing any parts, examine word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry. For detailed instructions on finding word 13 in the service processor error log entry, or bytes 68 and 69 in the AIX error log entry, refer to error code B1xx 4699. If the error code (word 11 value) is B19F4691and the word 13 value is 230Axxxx, this is an informational message. No action is required by the customer or service representative. 1. Check for system firmware updates. 2. Go to the service processor main menu and select System Information Menu. Then select Read Progress Indicators From Last System Boot. Begin your repair action with the error code or checkpoint immediately preceding B1xx 4691. If a location code displays with the error code or checkpoint, replace the part at that location. If changing that part does not fix the problem, or no location code is specified, and you have an 8-character error code, go to the “Checkpoints and Error Codes Index” on page 416. If changing that part does not fix the problem, or no location code is specified, and you have a 4-character checkpoint, go to “Firmware Checkpoints” on page 379. 3. If the problem is not resolved, call the second level support. B1xx 4692 Operator panel task interface failure 1. Check for system firmware updates. 2. Call service support B1xx 4693 Service processor firmware failure 1. Check for system firmware updates. 2. Call service support B1xx 4694 System processor firmware failure 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4695 System power control network (SPCN) to service processor interface failure 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 4696 System firmware interface failure 1. Check for system firmware updates. 2. Call service support. Chapter 5. Error Code to FRU Index 745 Error Code Description Action/ Possible Failing FRU B1xx 4698 Service processor firmware error This is usually an indication that the interposer plug count has not been set after an MES upgrade or FRU replacement of an MCM or L3 cache module. It can also indicate that the plug count values were not revalidated after the primary I/O book was replaced. In both cases, it is only an informational message. If the managed server is down, go to the service processor error log and find the error log entry containing B1xx 4698. Examine the first two bytes of word 13 of the detailed entry information. If the managed server is running, look at the AIX error log entry containing B1xx 4698. This will be a ?SCAN_ERROR_CHRP? error with an identifier of BFE4C025. In the detail data, find the string ?B1xx 4698? (if present, it will be at byte 60 of the detail data). Go forward 8 bytes after the ?B1? to byte 68 and examine bytes 68 and 69. If the value is E10B or E10C, do the following: v If the primary I/O book was just replaced, go to the service processor menus and select ?System Information Menu,? then select ?MCM/L3 Interposer Plug Count Menu.? At the 0> prompt, enter 50 to commit the values and write them to the VPD. Fully exit the menu before powering on the system. v If an MCM or L3 cache module was just added or replaced, go to the service processor menu and select ?System Information Menu,? then select ?MCM/L3 Interposer Plug Count Menu.? Find the index number that corresponds to the module that was just added or replaced and enter the new plug count that was included in the FRU or MES kit. If the plug count is not included with the FRU, enter 7 as the new plug count (to account for the number of times the module was plugged during the manufacturing process). Fully exit the menu before powering on the system.For all other values of bytes 68 and 69, or the first two bytes of word 13, do the following: 1. Check for system firmware updates. 2. Reset the service processor by activating the pinhole reset switch on the operator panel. 3. Call service support. B1xx 4699 Service processor firmware error or informational message. See “Error Code B1xx 4699” on page 747. B1xx 469A System firmware interface 1. Check for system firmware updates. 2. Call service support. B1xx 469B Firmware to I/O interface 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 469C Firmware to I/O interface 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 469D Service processor firmware 1. Check for system firmware updates. 2. Primary I/O bookLocation: U1.18-P1-H2 B1xx 469E Service processor firmware failure 1. Check for system firmware updates. 2. Reboot the system in slow mode to isolate a problem in the processor subsystem. 3. Primary I/O bookLocation: U1.18-P1-H2 746 406/90 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 469F Service processor failure 1. Check for system firmware updates. 2. Reboot the system in slow mode to isolate a problem in the processor subsystem. 3. Primary I/O bookLocation: U1.18-P1-H2 B1xx 8FF0 Informational message See B1xx 8FF0 in “Error Code B1xx 8FF0” on page 752. B1xx A2D0 Write operation to the LPAR data area in NVRAM failed 1. Using the HMC, perform a soft reset on the partition that reported this error. 2. Using the HMC, perform a hard reset on the partition that reported this error. 3. If the attempts to reset the partition are not successful, shut down all partitions, then remove and reapply AC power to the system. 4. Reboot the system. B1FD 0013 A firmware update has failed because the update image is invalid. The update was stopped before any changes were made to the currently installed firmware Reset the service processor by activating the pinhole reset switch on the operator panel. Flash a valid firmware update image for the system type and model. B1FD 0015 Too many firmware update diskettes were inserted (or the update image was faulty, or not the right image for the system) 1. Essure that the firmware update image is correct for the system type and model. 2. Replace the update diskettes with a new set of firmware update diskettes. B1FD 0016 Firmware update diskette read error 1. The firmware diskette may be faulty. Try another diskette. 2. Diskette drive Location: U1.17- D1. 3. Primary I/O book Location: U1.18-P1-H2. B1FD 001A Service processor recovery mode update fault Reset the service processor by activating the pinhole reset switch on the operator panel. Attempt to flash a valid firmware update image for the system type and model again. B1FD 001E Service processor recovery mode update fault Wrong firmware update diskette. Insert the correct update diskette. B1FD 001F Firmware update diskette bad Replace the update diskette. Error Code B1xx 4699 The following error code was moved out of the Common Firmware Error Code table because of the size of the description column. Error Code Description: Action/ Possible Failing FRU: B1xx 4699 Service processor firmware error or informational message. For complete information regarding this error code, read through this entire section. This is usually an indication of a problem in the communication path between the HMC and the service processor. It may only be an informational message. Chapter 5. Error Code to FRU Index 747 If the managed server is down, go to the service processor error log and find the error log entry containing B1xx 4699. Examine the first two bytes of word 13 of the detailed entry information. If the managed server is running, examine the AIX error log entry containing B1xx 4699. This is a ?SCAN_ERROR_CHRP? error with an identifier of BFE4C025. In the detail data, find the string ?B1xx 4699?. (If present, it will be at byte 60 of the detail data.) Go forward 8 bytes after the ?B1? to byte 68 and examine bytes 68 and 69. If the system is running Linux, examine the Linux system log. The line(s) in the extended data that begin with <4>RTAS: Log Debug: 04? contain the error code in the next 8 hex characters. (This error code is also known as word 11). Each 4 bytes after the error code in the Linux extended data is another word. The 4 bytes after the error code are word 12 and the next four bytes are word 13. An example of the Linux extended data, and finding words 11, 12, and 13, is shown in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 170, “Step 1321-28” on page 178, and “Step 1321-29” on page 178. Perform the following actions based on the values of bytes 68 and 69 listed below from the AIX error log entry, or on the first two bytes of word 13 from the service processor error log entry: 2200: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2209: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2306: No MCM is detected in the first position (U1.18-P1-C1); an MCM is required in the first position for the system to boot. Actions: 1. If an MCM is not plugged into the first position (U1.18-P1-C1), plug an MCM into position one. 2. If an MCM is plugged into the first position (U1.18-P1-C1), follow the removal and replacement procedures and reseat the MCM. If reseating the MCM does not fix the problem, replace the MCM.2310: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2311: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2315: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.240A: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.240B: Informational message. Actions: This is an informational message. No action is required by the customer or service representative. 748 406/90 Service Guide 24FF: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2504: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2512: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2513: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2524: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.2526: Informational message. Actions: This is an informational message. No action is required by the customer or service representative.35EF: The scan dump code was not able to successfully complete the request to close the connection to the load source.Actions: This is an informational message. No action is required by the customer or service representative.9900: Memory allocation failure during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9901: State sequence error found during CRC query during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9902: Mailbox LID data failure during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9903: State sequence error found after a firmware update LID was received during firmware update. Chapter 5. Error Code to FRU Index 749 Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9904: Error while flashing an update LID during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9905: CRC mismatch after a firmware update LID was received during firmware update. Actions: 1. Reset the system using the pin-hole reset button on the operator panel. Retry the operation that caused this error code. 2. Check for a later level of firmware than the one that you were trying to install. Apply if available. 3. Call service support.9906: Software problem during firmware update from operating system. Actions: 1. Check for a system firmware update that is later than the one that caused this error. Apply if available. 2. Call service support.9920: Error occurred during model upgrade from a 406/70 (7040 model 671) to a 406/90 (7040 model 681). Actions: 1. Verify that the system being upgraded is a 406/70 (7040 model 671). This error will occur if the system has already been upgraded. 2. The special model upgrade diskette might be damaged. Obtain another model upgrade diskette. 3. The diskette drive in the media subsystem might be damaged. Replace it. 4. Check for system firmware updates. Apply if available. 5. Call service support.A205: Machine type and model fields are not valid in the VPD module. Obtain a new operator panel. Do not swap the old VPD module onto the new operator panel. Call service support for instructions on how to write the machine type and model into the new VPD module. A20B: Error requesting trace buffer for service processor. Actions: 1. Reset the service processor, if possible. 2. Check for system firmware updates. Apply the updates if they are available.A218: Unknown return code detected. Actions: Check for system firmware updates. Apply the updates if they are available. A21A: Error allocating an internal service processor. 750 406/90 Service Guide Actions: 1. Reset the service processor, if possible. 2. Check for system firmware updates. Apply the updates if they are available.A800: This indicates an HMC/service processor initialization failure. Actions: 1. Check for system firmware updates. 2. Replace the primary I/O book, location: U1.18-P1-H2.A801: This indicates an HMC wrap failure. Actions: Replace the primary I/O book, location: U1.18-P1-H2. A806: This indicates a loss of the surveillance heartbeat between the HMC and the service processor. Actions: 1. Make sure that the HMC is booted and operational. 2. Check the serial cables that go from the HMC to the primary I/O book, location: U1.18-P1-H2.If there are no other error codes or indications of a problem, the A806 (loss of surveillance heartbeat) was a temporary condition and has been resolved; the B1xx4699 code is then an informational message only. If the problem persists: 1. Run diagnostics on the serial port on the HMC. 2. Replace the primary I/O book, location U1.18-P1-H2. Values of A009 and A719 of bytes 68 and 69 in the AIX error log entry or the first two bytes of word 13 in the service processor error log entry are also informational entries: A009: The system received a power-off request at runtime from the HMC, or the white button on the operator panel was used to power down the system at runtime. A719: Primary power failed; the system switched to battery backup power. The following values of bytes 68 and 69, or the first half of word 13, indicate an internal error occurred in the capacity upgrade on demand (CUoD) firmware or hardware that was corrected. No action is required by the customer or service representative. C101, C103, C104, C109, C10B, C10C, C10D, C10E, C10F, C110, C112, C113, C114, C115, C116, C117, C118, C11A, C11F, C120, C121, C122, C123, C124, C125, C126, C127, C128, C129, C12A, C12B, C12C, C12D, C12E, C12F, C130, C131, C132, C133, C134, C135, C136, C138, C13B, C13C, C140, C141, C142, C143, C144, C145, C146, C147, C149, C14A, C14B, C14C, C14D, C14E, C14F, C150, C151, C152, C153, C154, C155, C156, C158, C159 C11B: This value indicates that the CUoD capacity card (location code U1.18-P1-H2.2) needs to be initialized or activated. If the customer is entitled to processor activations, go to step 9 in the “Capacity Upgrade on Demand (CUoD) VPD Module” on page 902. If the customer is not entitled to customer activations, do the following: 1. Power down the system. Wait until OK is in the operator panel in the media drawer indicating that the service processor is in standby. 2. From the service processor main menu, enter code 88484 to access the hidden menu. 3. Press enter again at ?Authorized Use Only? screen. 4. When the CUoD menu appears, the status of ?Test and Run? should be Available. 5. Select option 3 to enter an activation code. 6. Enter the null activation code of 34 zeros: 0000000000000000000000000000000000 7. A success return code should be returned and the Test and Run Status will now be Complete. 8. Now exit out back to the service processor main menu. The system is ready to be powered up.For all other values of bytes 68 and 69, or the first two bytes of word 13, do the following: 1. Check for system firmware updates. Apply if available. Chapter 5. Error Code to FRU Index 751 2. Reset the service processor by activating the pinhole reset switch on the operator panel. 3. Call service support.Error Code B1xx 8FF0 The following error code was moved out of the Common Firmware Error Code table because of the size of the description column. Error Code Description: Action/ Possible Failing FRU: B1xx 8FF0 Informational message No action required. (The data in these informational log entries may be used by support personnel to gather additional debug information.) For complete information regarding this error code, read through this entire section. If the managed server is down, go to the service processor error log and find the error log entry containing B1xx 8FF0. Examine the first two bytes of word 13 of the detailed entry information. If the managed server is running, do the following: v Examine the AIX error log entry containing B1xx 8FF0. This is a FIRMWARE_EVENT error with an identifier of A6D1BD62. v In the detail data, find the string B1xx 8FF0. (If present, it will be at byte 60 of the detail data.) v Move forward 8 bytes to bytes 68 and 69; this is the value of word 13 in the service processor error log entry.If the system is running Linux, examine the Linux system log. The line(s) in the extended data that begin with <4>RTAS: Log Debug: 04 contain the error code in the next 8 hex characters. (This error code is also known as word 11). Each 4 bytes after the error code in the Linux extended data are another word. The 4 bytes after the error code are word 12 and the next four bytes are word 13. An example of the Linux extended data, and finding words 11, 12, and 13, is shown in MAP 1321, step 1321-28, and step 1321-29, in this service guide. CoD Transactions If the value of the first two bytes of word 13 in the detailed service processor error log entry, or the value of bytes 68 and 69 in the AIX error log entry is C1A4, this indicates a capacity on demand (CoD) transaction has taken place. For more detail about CUoD and CoD transactions, see the Hardware Management Console Operations Guide, order number SA23-1251. If a CoD transaction has taken place, and more detail about the transaction is required, do the following: v Record the 36 bytes in the AIX error log entry starting with B1xx 8FF0, grouping 4 bytes at a time so that the resulting words match the service processor error log’s detail data. For example, B1xx 8FF0 is the value of word 11, the next 4 bytes are word 12, the next 4 are word 13, etc, until word 19 has a value.There are five parameters of interest in the CoD log entry: 1. Word 16 value (first 2 bytes): the type of transaction 2. Word 17 value (first 2 bytes): the change in number of processors due to this transaction 3. Word 17 value (second 2 bytes): the resulting number of processor due to this transaction 4. Word 18 value (first 2 bytes): the on/off CoD processor days remaining 5. Word 18 value (second 2 bytes): the hours remaining in the current on/off CoD resource periodTo find these values in the service processor error log entry, or in the AIX error log entry, do the following: 752 406/90 Service Guide Type of Transaction and Resource Type Examine the first two bytes of word 16 recorded above. The value of the first byte of word 16 denotes the type of CoD transaction; the value of the second byte of word 16 is always 01. The Change in Number of Processors Due to This transaction, and the Resulting Number of Processor Due to This transaction Examine the first two bytes of word 17 recorded above. The value in the first two bytes of word 17 is the change in the number of CoD processors due to this transaction. The value in the third and fourth bytes of word 17 is the number of CoD processors resulting from this transaction. The On/Off CoD Processor Days Remaining, and the Hours Remaining in the Current On/Off CoD Resource Period Examine the first two bytes of word 18 recorded above. The value in the first two bytes of word 18 is the change in the number of CoD processors days remaining. The value in the third and fourth bytes of word 18 is the number of hours remaining in the current on/off CoD resource period. Note: The hours remaining counter starts at 24 hours, 0 minutes. As soon as it decrements to 23 hours, 59 minutes, 59 seconds, it will display on the HMC as 23 hours. The same is true as the counter gets close to zero: as the counter rolls over from 1 to zero hours, zero is displayed on the HMC for the next hour.Permanent activation code entered An informational log entry is made each time a permanent activation code is entered. An informational log entry is made each time a permanent activation code is entered. v Word 16 value (first 2 bytes): 0100 v Word 17 value (first 2 bytes): The number of processors activated with this transaction. v Word 17 value (second 2 bytes): The total number of activated processors of this type. This includes those previously activated plus those activated with this transaction. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Trial CoD activation An informational log entry is made each time a trial CoD period is started. v Word 16 value (first 2 bytes): 0200 v Word 17 value (first 2 bytes): The number of processors activated with this transaction. v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Trial CoD cancellation An informational log entry is made each time a trial CoD period is cancelled. v Word 16 value (first 2 bytes): 0300 v Word 17 value (first 2 bytes): The number of processors originally requested. v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.On/off activation code entered An informational log entry is made each time an on/off CoD activation code is entered. v Word 16 value (first 2 bytes): 0400 v Word 17 value (first 2 bytes): The number of processors days purchased. Chapter 5. Error Code to FRU Index 753 v Word 17 value (second 2 bytes): The total number of activated processors of this type. This includes those previously activated plus those activated with this transaction. v Word 18 value (first 2 bytes): The current value of the processor day counter. It includes the processor days just added with this transaction. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Manage on/off CoD request An informational log entry is made each time the change that is requested in the number of on/off CoD resources to be activated is accepted. No processor day charges are associated with this entry. v Word 16 value (first 2 bytes): 0500 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC (the number itself, not the difference from the previous value). v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.On/off CoD number change, per user request or during IPL An informational log entry is made when: 1. On/off CoD resources are started during an IPL (logically partitioned mode or full system partition mode) 2. On/off CoD resources are started or stopped due to a customer request (logically partitioned mode)v Word 16 value (first 2 bytes): 0600 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC (the number itself, not the difference from the previous value). v Word 17 value (second 2 bytes): The resulting number of processors. This is the number of on/off processors that the firmware was able to start. It may be less than the number requested. v Word 18 value (first 2 bytes): The current value of the processor day counter after the charge for starting the 24 hour period. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. At boot time, and on an increase in the number of processor days, the counter is reset to 24 hours. If the number of processor days is decreased, the counter is not changed and will continue to show the current value.On/off CoD number change, system An informational log entry is made when the number of on/off CoD resources is automatically reduced by the firmware due to a failing processor being taken off-line and not replaced. v Word 16 value (first 2 bytes): 0700 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC. This is the number of on/off CoD resources that was previously requested. v Word 17 value (second 2 bytes): The number of on/off CoD resource currently active. It may be smaller than the number requested. v Word 18 value (first 2 bytes): The current value of the processor day counter after the charge for starting the 24 hour period. (This transaction never decrements the number of processor days ) v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. (This transaction never resets the hours remaining. 754 406/90 Service Guide On/off processor day charge An informational log entry is made when the system decrements the processor day counter at the start of each 24 hour period (including the first one). The ?account? is being charged for the next day’s use of the processor. v Word 16 value (first 2 bytes): 0800 v Word 17 value (first 2 bytes): The number entered on the processor on/off CoD management screen on the HMC. This is the number of on/off CoD resources that is being requested. v Word 17 value (second 2 bytes): The number of on/off CoD resource currently active. It may be smaller than the number requested due to failing resources. v Word 18 value (first 2 bytes): The current value of the processor day counter after the charge for starting the 24 hour period. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. (This will always be 24 hours.)Out of compliance due to the expiration of a trial CoD period An informational log entry is made when the system goes out of compliance due to either: 1. The expiration of a trial CoD period. 2. The cancellation of a trial CoD period.v Word 16 value (first 2 bytes): 0900 v Word 17 value (first 2 bytes): The number of trial on/off CoD resources out of compliance. v Word 17 value (second 2 bytes): The new number of trial on/off CoD resources. This value is always 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.Out of compliance due to the expiration of an on/off CoD period An informational log entry is made when the system goes out of compliance due to the expiration of an on/off CoD period (when the processor day counter is fully decremented to zero). v Word 16 value (first 2 bytes): 0A00 v Word 17 value (first 2 bytes): The number of on/off CoD resources out of compliance. v Word 17 value (second 2 bytes): The new number of on/off CoD resources. This value is always 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. This value is always 0. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter. This value is always 0.Trial CoD termination An informational log entry is made each time a trial CoD period times out. v Word 16 value (first 2 bytes): 0B00 v Word 17 value (first 2 bytes): The number of processors originally requested. v Word 17 value (second 2 bytes): This value is not used; it will be 0. v Word 18 value (first 2 bytes): The current value of the processor day counter. v Word 18 value (second 2 bytes): The current value of the hours remaining in the current period counter.SPCN Informational Codes The following values of word 13 indicate an SPCN informational message: C006, C007, C00C, C030, C0E0, C0E2, C0F0, C0F1. No action is required by the customer or service representative. Chapter 5. Error Code to FRU Index 755 VPD Informational Codes The following values of word 13 indicate a VPD informational message: E104, E10D, E112, E113, E114. No action is required by the customer or service representative. HMC Surveillance Error Codes The HMC surveillance error codes generated only by the HMC are listed in the following table. Error Code Description Action/ Possible Failing FRU B301 0001 HMC/partition surveillance fault Contact service support. B301 0002 HMC/partition surveillance fault Verify the network between the HMC and the partition by pinging in both directions. If problems are found, correct them. Reboot the HMC. B301 0012 HMC/partition surveillance fault Contact service support. B301 0013 HMC/partition surveillance fault Contact service support. B301 0019 HMC/partition surveillance fault Contact service support. B301 0020 HMC/partition surveillance fault Contact service support. B301 0023 HMC/partition surveillance fault Contact service support. B301 0025 HMC/partition surveillance fault Contact service support. B301 0026 HMC/partition surveillance fault Contact service support. B301 0027 HMC/partition surveillance fault 1. If short host names are being used for the HMC and the partitions, ensure that the short host names are set up correctly according to the instructions in ?Using Network Adapters to Communicate with Partitions? in the Hardware Management Console Operations Guide. Correct any problems that are found, then verify that you can ping in both directions between the HMC and partitions. 2. If short host names are not being used, or the previous step does not correct the problem, contact service support. B301 0029 HMC/partition surveillance fault Contact service support. B301 002B HMC/partition surveillance fault Contact service support. B301 002C HMC/partition surveillance fault Contact service support. B301 002D HMC/partition surveillance fault Contact service support. B302 0001 HMC/partition surveillance fault 1. If short host names are being used for the HMC and the partitions, ensure that the short host names are set up correctly according to the instructions in ?Using Network Adapters to Communicate with Partitions? in the Hardware Management Console Operations Guide. Correct any problems that are found, then verify that you can ping in both directions between the HMC and partitions. 2. If short host names are not being used, or the previous step does not correct the problem, contact service support. 756 406/90 Service Guide Problem Determination-Generated Error Codes Error Code Description Action/ Possible Failing FRU M0BT 0000 The system hung during speaker POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Go to “MAP 1542: I/O Problem Isolation” on page 318. M0BT 0001 The system hung during “Starting Software...”. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Go to “MAP 1542: I/O Problem Isolation” on page 318. M0KB D000 The system hung during keyboard POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. 1. Primary I/O bookLocation: U1.18-P1-H2 2. Keyboard M0KB D001 The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Type 101 keyboard. M0KB D002 The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Type 102 keyboard. M0KB D003 The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Kanji-type keyboard. M0ME M002 The system hung during memory POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Go to “MAP 1542: I/O Problem Isolation” on page 318. M0NE T000 The system hung during network POST. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Go to “MAP 1542: I/O Problem Isolation” on page 318. M0PS 0000 Power failure. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Go to “MAP 1520: Power” on page 192. M0SC SI00 Unable to load diagnostics. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Go to “MAP 1542: I/O Problem Isolation” on page 318. M0SC SI01 Unable to load diagnostics. This error code is generated by the customer performing “Problem Determination” as described in the 406/90. Go to “MAP 1542: I/O Problem Isolation” on page 318. Hypervisor Dump Retrieval Procedure This procedure defines the requirements and methods for retrieving a hypervisor dump file following a system crash. This procedure is applicable only if the managed system had been running as a partitioned system before it crashed. Chapter 5. Error Code to FRU Index 757 This procedure requires the managed system to be rebooted as a partitioned system. Retrieval must then take place from an AIX partition. Hypervisor Dump Indications If the system operator panel displays error code 20D00997, 20D00998, or 20D00999, a system crash within the run-time firmware has occurred. If error code 20D00997, 20D00998, or 20D00999 is in the service processor error log, a system crash within the run-time firmware has occurred. The following is an example of a detailed service processor error log entry with error code 20D00999: 05/29/2001 21:06:12 Operating System Terminated with Error String Error code: 20D00999 Detail: A2C1 SRC -------------------------------------------------------------- word11: B1194690 word12: 04A0005D word13: A2C1C0FA word14: 00000000 word15: 00007701 word16: 00000008 word17: 00000000 word18: 00000000 word19: 00000000 20D00999 After a system crash has occurred, verify if a hypervisor dump has been collected, as described in the following section. Retrieving the Hypervisor Dump file After it has been determined that the managed system has crashed and a hypervisor dump exists, or if the user wants to test for an existing dump file, the managed system must be rebooted as a partitioned system. A partition or partitions running AIX must then be activated. After rebooting, any partition running AIX may be used to retrieve the dump file. Do the following: 1. Install the devices.chrp_lpar.base.ras fileset. The devices.chrp_lpar.base.ras AIX fileset must be installed and committed. To determine if the package has been previously installed, enter the following command on the AIX command line: lslpp -l | grep devices.chrp_lpar.base.ras If the output on the AIX console is similar to the following, the devices.chrp_lpar.base.ras fileset is already installed and no further action for step 1 is necessary: devices.chrp_lpar.base.ras 5.X.X.X COMMITTED CHRP LPAR RAS Support If no output is returned, the devices.chrp_lpar.base LPP must be installed from the AIX installation media by the customer or system administrator. After the fileset is installed, the dump file is ready to be retrieved. 2. Enter fetchdbg -k at the command prompt. This produces the following output: Retrieving Dump File ............Dump read successful This action results in a file being written into the /tmp directory named dumpMMDDYY where MM is the current month, DD is the current day, and YY is the current year. Forward the /tmp/dumpMMDDYY file to service support. If the following text displays on the AIX console, note the error and report it to service support: Retrieving Dump File Error performing dump read 758 406/90 Service Guide Chapter 6. Using the Online and Standalone Diagnostics The AIX diagnostics consist of online diagnostics and standalone diagnostics. Attention: The AIX operating system must be installed in a partition in order to run online diagnostics on that partition. If the AIX operating system is not installed, use the standalone diagnostic procedures. Online diagnostics, when they are installed, reside with AIX in the file system. They can be booted: v In single user mode (referred to as service mode) v To run in maintenance mode (referred to as maintenance mode) v To run concurrently with other applications (referred to as concurrent mode)Standalone diagnostics must be booted before they can be run. If booted, the diagnostics have no access to the AIX error log or the AIX configuration data. Does the system have AIX diagnostics preinstalled? Yes If there is an HMC attached to the system, go to “Operating Considerations for Online and Standalone Diagnostics.” If an HMC is not attached to the system, go to “Running the Online Diagnostics with No HMC Attached” on page 762. No If there is an HMC attached to the system, go to “Loading the Standalone Diagnostics from CD-ROM Using an HMC” on page 764. If an HMC is not attached to the system, go to “Running the Standalone Diagnostics with CD-ROM with No HMC Attached” on page 765. If standalone diagnostics will be run from a NIM server, go to “Running Standalone Diagnostics from a Network Installation Management (NIM) Server” on page 766. If an HMC is not attached to the system, follow the directions for a full system partition.Operating Considerations for Online and Standalone Diagnostics Note: When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. Running online diagnostics in service mode ensures that the error state of the system that has been captured in NVRAM is available for your use in analyzing the problem. The AIX error log and certain SMIT functions are only available when diagnostics are run from the disk drive. Consider the following items before using the diagnostics: v Standalone diagnostics can run on systems configured for either a full (or single) system partition or a multiple partitioned system. When running in a partitioned system, the device from which you are booting standalone diagnostics must be made available to the partition dedicated to running standalone diagnostics. This may require moving the device from the partition that currently contains the boot device (for example, the CD-ROM or network adapter connected to the NIM server that has a standalone diagnostic image) to the partition used to run standalone diagnostics. For partitions running AIX levels earlier than AIX 5.2, if you move devices, reboot both source and destination partitions. For more information, see “Standalone Diagnostic Operation” on page 764. v When diagnostics are installed, the device support for some devices might not get installed. If this is the case, that device does not display in the diagnostic test list when running disk-based diagnostics. v When running diagnostics in a partitioned system, diagnostics will work only with the resources that were assigned to that partition. You must run diagnostics in the partition containing the resource that you want to test. 759 Identifying the Terminal Type to the Diagnostics When you run diagnostics, you must identify which type of terminal you are using. If the terminal type is not known when the FUNCTION SELECTION menu is displayed, the diagnostics do not allow you to continue until a terminal is selected from the DEFINE TERMINAL option menu. Choose the lft selection when running diagnostics from an HMC. Undefined Terminal Types If you specify an undefined terminal type from the DEFINE TERMINAL option menu, the menu prompts the user to enter a valid terminal type. The menu redisplays until either a valid type is entered or you exit the DEFINE TERMINAL option. For a system attached to an HMC use VT320 emulation. If the AIX operating system is being used, from the command line, type: export term=vt320 Resetting the Terminal If you enter a terminal type that is valid (according to the DEFINE TERMINAL option menu) but is not the correct type for the HMC virtual terminal window being used, you may be unable to read the screen, use the function keys, or use the Enter key. Bypass these difficulties by pressing Ctrl-C to reset the terminal. The screen display that results from this reset depends on the mode in which the system is being run: v Online Normal or Maintenance Mode - The command prompt displays. v Standalone Mode or Online Service Mode - The terminal type is reset to dumb, the Diagnostic Operating Instruction panel displays, and you are required to go through the DEFINE TERMINAL process again.Online Diagnostics Modes of Operation Notes: 1. When running online diagnostics on a partition in a partitioned system, diagnostics can be run only on resources that are allocated to that partition. 2. Online diagnostics can only be run in a partition with AIX installed. The online diagnostics can be run in the following modes: v Service Mode v Concurrent Mode v Maintenance ModeService Mode Service mode provides the most complete checkout of the system resources. This mode also requires that no other programs be running on the partition or on a full system partition. Except for the SCSI adapter and the disk drives used for paging, resources on all partitions or systems on a full system partition can be tested. However, note that the memory and processor are only tested during POST, and the results of the POST tests are reported by diagnostics. Error-log analysis is done in service mode when you select the Problem Determination option on the DIAGNOSTIC MODE SELECTION menu. Consider the following when you run the online diagnostics from a server or a disk: v The diagnostics cannot be loaded and run from a disk until the AIX operating system has been installed and configured. v The diagnostics cannot be loaded on a system (client) from a server if that system is not set up to boot from a server over a network. When the system is set up to boot from a server, the diagnostics are run in the same manner as they are from disk. 760 406/90 Service Guide v On a full system partition, if the diagnostics are loaded from disk or a server, you must shut down the AIX operating system before turning off the system unit to prevent possible damage to disk data. Do this in either of the following ways: – If the diagnostics were loaded in standalone mode, press the F3 key until DIAGNOSTIC OPERATING INSTRUCTIONS displays. Then press the F3 key once again to shut down the AIX operating system. – If the diagnostics were loaded in maintenance or concurrent mode, type the shutdown -F command.v Under some conditions, the system might stop, with instructions displayed on attached displays and terminals. Follow the instructions to select a console display.Running Online Diagnostics in Service Mode with an HMC Attached to the System To run the online diagnostics in service mode from the boot hard disk, do the following: 1. From the HMC, select Partition Manager. 2. Right-click on the mouse and select Open Terminal Window. 3. From the Service Processor Menu on the VTERM, select Option 2 System Power Control. 4. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot automatically enables slow boot. 5. Select Option 98 to exit the system power control menu. 6. Use the HMC to power on the managed system in a full system partition by selecting the managed system in the Contents area. 7. Right-click or select the desired system in the Contents area. Next, on the menu, choose Selected. 8. Select Power On. 9. Select the Power on Diagnostic Stored Boot list option (refer to “Full System Management Tasks” in the Hardware Management Console , order number SA23-1251, for more information on full system partitions). 10. Make sure that there is no media in the devices in the media drive. 11. Enter any passwords, if requested. Chapter 6. Using the Online and Standalone Diagnostics 761 Running the Online Diagnostics with No HMC Attached Online diagnostics can be loaded by booting from a hard file in the custom boot list by pressing the numeric 6 key on the keyboard. To boot from the devices listed in the service mode boot list, perform the following procedure: 1. Verify with the system administrator and users that all programs may be stopped, then do so. 2. Turn off the system. Note: If AIX is running, type the shutdown command. 3. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn the system on. 4. When the keyboard indicator appears during startup, press the numeric 6 key on the keyboard. 5. Enter any requested passwords.After any requested passwords have been entered, the system attempts to boot from the first device of each type found on the list. If no bootable image is found on the first device of each type on the list, the system does not search through the other devices of that type for a bootable image; instead, it polls the first device of the next type. If all types of devices in the boot list have been polled without finding a bootable image, the system restarts. This action gives the user the opportunity to start the System Management Services (by pressing the number 1 key on a directly attached keyboard or the number 1 on an ASCII terminal) before the system attempts to boot again. Concurrent Mode Running the online diagnostics in concurrent mode is started by entering diag at an AIX prompt. Use concurrent mode to run online diagnostics on some of the system resources while the system is running normal activity. Because the system is running in normal operation, the following resources cannot be tested in concurrent mode: v SCSI adapters connected to paging devices v Disk drive used for paging v Some display adapters and graphics related devices v Memory (tested during POST) v Processor (tested during POST)The following levels of testing exist in concurrent mode: v The share-test level tests a resource while the resource is being shared by programs running in the normal operation. This testing is mostly limited to normal commands that test for the presence of a device or adapter. v The sub-test level tests a portion of a resource while the remaining part of the resource is being used in normal operation. For example, this test could test one port of a multiport device while the other ports are being used in normal operation. v The full-test level requires the device not be assigned to or used by any other operation. This level of testing on a disk drive might require the use of the varyoff command. The diagnostics display menus to allow you to vary off the needed resource. 762 406/90 Service Guide Error-log analysis is done in concurrent mode when you select the Problem Determination option on the DIAGNOSTIC MODE SELECTION menu. To run the online diagnostics in concurrent mode, you must be logged in to the AIX operating system and have proper authority to issue the commands (if help is needed, see the system operator). The diag command loads the diagnostic controller and displays the online diagnostic menus. Running the Online Diagnostics in Concurrent Mode To run online diagnostics in concurrent mode, do the following: 1. Log in to the AIX operating system as root user, or use CE Login. 2. Enter the diag command. 3. When the DIAGNOSTIC OPERATING INSTRUCTIONS display, follow the instructions to check out the desired resources. 4. When testing is complete, use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS. Press the F3 key again to return to the AIX operating system prompt. Be sure to vary on any resource that you had varied to off. 5. Press Ctrl-D to log off from root user or CE Login.Maintenance Mode Maintenance mode runs the online diagnostics using the customer’s version of the AIX operating system. This mode requires that all activity on the partition running the AIX operating system be stopped so that the online diagnostics have most of the resources available to check. All of the system resources, except the SCSI adapters, memory, processor, and the disk drive used for paging, can be checked. Error log analysis is done in maintenance mode when you select the Problem Determination option on the DIAGNOSTIC MODE SELECTION menu. Use the shutdown -m command to stop all activity on the AIX operating system and put the AIX operating system into maintenance mode. The diag command is then used to invoke the diagnostic controller so you can run the diagnostics. After the diagnostic controller is loaded, follow the normal diagnostic instructions. Running the Online Diagnostics in Maintenance Mode To run the online diagnostics in maintenance mode, do the following: 1. Stop all programs running on the partition except the AIX operating system (if help is needed, see the system operator). 2. Log in to the AIX operating system as root user, or use CE Login. 3. Type the shutdown -m command. 4. When a message indicates the system is in maintenance mode, enter the diag command. Note: It might be necessary to set TERM type again. 5. When DIAGNOSTIC OPERATING INSTRUCTIONS screen displays, follow the displayed instructions to check out the desired resources. 6. When testing is complete, use the F3 key to return to DIAGNOSTIC OPERATING INSTRUCTIONS. Press the F3 key again to return to the AIX operating system prompt. 7. Press Ctrl-D to log off from root user or CE Login. Chapter 6. Using the Online and Standalone Diagnostics 763 Standalone Diagnostic Operation Use standalone diagnostics to test the system when the online diagnostics are not installed and as a method of testing the disk drives that cannot be tested by the online diagnostics. Note: No error log analysis is done by the standalone diagnostics. The CD-ROM drive and the SCSI controller that controls it cannot be tested by the standalone diagnostics. Standalone diagnostics: v Are resident on CD-ROM or a Network Installation Management (NIM) server v Provide a method to test the system when the online diagnostics are not installed or cannot be loaded from the disk drive v Allow testing of the disk drives and other resources that cannot be tested by the online diagnostics v Do not have access to the AIX configuration data v Do not have access to the AIX error log v Do not allow for running of error log analysisConsiderations for Running Standalone Diagnostics on a Partitioned System To run standalone diagnostics on a full system partition, you must reboot the entire system. However, for a partition in a partitioned system, you can boot standalone diagnostics either in a given partition or on the entire system (which is the same procedure as a full system partition). For a partitioned system, before running standalone diagnostics on a given partition, the user must move the device from the existing location where standalone diagnostics is booted (the CD-ROM drive or the network adapter connected to the NIM server, in the case of NIM boot of standalone diagnostics), to the partition that will run standalone diagnostics. Devices on a partitioned system are moved on an I/O-slot basis. If the CD-ROM drive is moved from one partition to another, all SCSI devices on the same SCSI adapter card which the CD-ROM drive is attached must be moved to the same partition. It is recommended that you attach few SCSI devices to the same SCSI controller card to prevent moving them, along with the CD-ROM drive, between partitions. For partitions running AIX levels earlier than AIX 5.2, a reboot is required on the partition containing this device and on the moved device, but the system itself is not powered off in order to boot standalone diagnostics in a logical partitioned system partition. Considerations for Running Standalone Diagnostics from CD-ROM Consider the following when you run standalone diagnostics: v The diagnostic CD must remain in the CD-ROM drive for the entire time that diagnostics are running. v The diagnostic CD cannot be ejected from the CD-ROM drive after the diagnostics have loaded. The CD can only be ejected after the system has been turned off and then turned on (standalone mode) or after the diagnostics program has terminated (online concurrent mode). v The CD-ROM drive from which diagnostics were loaded cannot be tested. v The SCSI adapter (or circuitry) controlling the CD-ROM drive from which diagnostics were loaded cannot be tested.Loading the Standalone Diagnostics from CD-ROM Using an HMC To run standalone diagnostics in service mode from CD-ROM, do the following: 1. Verify with the system administrator and users that all programs may be stopped, including the AIX operating system (get help if needed). Stop all programs. 2. Remove all tapes, diskettes, and CD-ROMs. 3. From the following list, select the type of partitioned system you are running the standalone diagnostics in, then follow the power or reboot instructions. 764 406/90 Service Guide v If you are running standalone diagnostics in a full machine partition, power off the system unit. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn on the system. v If you are running standalone diagnostics in an LPAR partition, reboot the partition that will be running the standalone diagnostics.Note: If AIX is running, type the shutdown command. If Linux is running, type the shutdown now -h command. 4. Immediately insert the Standalone Diagnostic CD into the CD-ROM drive, located on the media subsystem. Do Not use the HMC’s CD-ROM drive to do this operation. 5. When the keyboard POST indicator appears, press the F5 key on the directly attached keyboard, or the number 5 key on the TTY keyboard. 6. Enter any requested passwords. 7. Follow any instructions to select the console. 8. After the diagnostic controller loads, DIAGNOSTIC OPERATING INSTRUCTIONS appear on the console display. 9. Follow the displayed instructions to test the desired resources. 10. When testing is complete, use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS.Running the Standalone Diagnostics with CD-ROM with No HMC Attached To load standalone diagnostics from the default boot list, perform the following procedure: To run standalone diagnostics in service mode from CD-ROM, do the following: 1. Verify with the system administrator and users that all programs may be stopped, including the AIX operating system (get help if needed). Stop all programs. 2. Remove all tapes, diskettes, and CD-ROMs. 3. From the following list, select the type of partitioned system you are running the standalone diagnostics in, then follow the power or reboot instructions. v If you are running standalone diagnostics in a full machine partition, power off the system unit. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn on the system. v If you are running standalone diagnostics in an LPAR partition, reboot the partition that will be running the standalone diagnostics.Note: If AIX is running, type the shutdown command. If Linux is running, type the shutdown now -h command. 4. Immediately insert the Standalone Diagnostic CD into the CD-ROM drive, located on the media subsystem. 5. When the keyboard POST indicator appears, press the F5 key on the directly attached keyboard, or the number 5 key on the TTY keyboard. 6. Enter any requested passwords. 7. Follow any instructions to select the console. 8. After the diagnostic controller loads, DIAGNOSTIC OPERATING INSTRUCTIONS appear on the console display. 9. Follow the displayed instructions to test the desired resources. 10. When testing is complete, use the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS. Chapter 6. Using the Online and Standalone Diagnostics 765 Running Standalone Diagnostics from a Network Installation Management (NIM) Server A client system connected to a network with a Network Installation Management (NIM) server can boot standalone diagnostics from the NIM server if the client-specific settings on both the NIM server and client are correct. Notes: 1. All operations to configure the NIM server require root user authority. 2. If you replace the network adapter in the client, the network adapter hardware address settings for the client must be updated on the NIM server. 3. The Cstate for each standalone diagnostics client on the NIM server should be kept in the diagnostic boot has been enabled state. 4. On the client system, the NIM server network adapter should be put in the bootlist after the boot disk drive. This allows the system to boot in standalone diagnostics from the NIM server if there is a problem booting from the disk drive. Refer to the Multiboot section under “SMS” in the client system’s service guide for information on setting the bootlist.NIM Server Configuration Refer to the “Advanced NIM Configuration Tasks” chapter of the AIX Installation Guide and Reference, order number SC23-4389 for information on doing the following: v Registering a client on the NIM server v Enabling a client to run diagnostics from the NIM serverTo verify that the client system is registered on the NIM server and the diagnostic boot is enabled, run the command lsnim -a Cstate -Z ClientName from the command line on the NIM server. Refer to the following table for system responses. Note: The ClientName is the name of the system on which you want to run standalone diagnostics. System Response Client Status #name:Cstate:ClientName:diagnostic boot has beenenabled: The client system is registered on the NIM server and enabled to run diagnostics from the NIM server. #name:Cstate:ClientName:ready for a NIM operation: or #name:Cstate:ClientName:BOS installation has beenenabled: The client is registered on the NIM server but not enabled to run diagnostics from the NIM server. Note: If the client system is registered on the NIM server but Cstate has not been set, no data will be returned. 0042-053 lsnim: there is no NIM objectnamed "ClientName" The client is not registered on the NIM server. 766 406/90 Service Guide Booting Standalone Diagnostics from the NIM Server on a System with an HMC Attached To run standalone diagnostics on a client from the NIM server, do the following: 1. Remove any removable media (tape or CD-ROM). 2. Stop all programs including the AIX operating system (get help if needed). 3. If you are running standalone diagnostics in a full system partition, verify with the system administrator and system users that the system unit can shut down. Stop all programs, including the AIX operating system. Refer to the AIX operating system documentation for shutdown command information. Verify with the system administrator and system users using that partition that all applications on that partition must be stopped, and that the partition will be rebooted. Stop all programs on that partition, including the operating system. 4. If you are in a full system partition, power on the system unit to run standalone diagnostics. In a partitioned system, reboot the partition to run standalone diagnostics. 5. When the keyboard indicator is displayed (the word keyboard on a HMC virtual terminal window or the keyboard icon on a graphical display), press the number 1 key on the keyboard to display the SMS menu. 6. Enter any requested passwords. 7. Select Set Up Remote IPL (Initial Program Load). 8. Enter the client address, server address, gateway address (if applicable), and subnet mask. 9. If the NIM server is set up to allow pinging from the client system, use the ping utility in the RIPL utility to verify that the client system can ping the NIM server. Under the ping utility, choose the network adapter that provides the attachment to the NIM server to do the ping operation. If ping returns with an OK prompt, the client is prepared to boot from the NIM server. If ping returns with a FAILED prompt, the client cannot proceed with the NIM boot. Note: If the ping fails, refer to “Boot Problems” on page 409 and follow the steps for network boot problems.To do a one-time boot of the network adapter attached to the NIM server network, do the following: 1. Exit to the SMS Main screen. 2. Select Select Boot Options. 3. Select Install or Boot a Device. 4. On the Select Device Type screen. select Network. 5. Set the network parameters for the adapter from which you want to boot. 6. Exit completely from SMS. The system starts loading packets while doing a bootp from the network.Follow the instructions on the screen to select the system console. v If Diagnostics Operating Instructions Version x.x.x displays, standalone diagnostics have loaded successfully. v If the AIX login prompt displays, standalone diagnostics did not load. Check the following items: – The network parameters on the client may be incorrect. – Cstate on the NIM server may be incorrect. – Network problems might be preventing you from connecting to the NIM server. Verify the settings and the status of the network. If you continue to have problems, refer to “Boot Problems” on page 409 and follow the steps for network boot problems. Chapter 6. Using the Online and Standalone Diagnostics 767 768 406/90 Service Guide Chapter 7. Using the Service Processor Notes: 1. The information in this chapter regarding the configuring of serial ports, and modems attached to those serial ports, applies only to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). None of this information is applicable to the serial ports, or modems attached to those serial ports, on the Hardware Management Console (HMC). 2. On some of the system management services (or service processor) screens, you will see the term LPAR, which is equivalent to the term partitioned system. The service processor runs on its own power boundary and continually monitors hardware attributes and the environmental conditions within the system. The service processor is controlled by firmware and does not require the operating system to be operational to perform its tasks. The service processor menus allow you to configure service processor options, as well as enable and disable functions. Service processor menus are available using an HMC virtual terminal window when OK is displayed on the operator panel or when the service processor has detected a server problem (such as a surveillance failure). Service Processor Menus The service processor menus are divided into the following groups: v General user menu - the user must know the general-access password. v Privileged user menus - the user must know the privileged-access password.If the server is powered off, the service processor menus can be accessed locally or remotely on the following: v Serial port 1 (S1) v Serial port 2 (S2) v The HMCAccessing the Service Processor Menus Locally Service processor menus can be accessed by opening a virtual terminal window on the HMC, or by attaching an ASCII terminal to serial port 1 (S1) or serial port 2 (S2). After OK displays in the operator panel, press any key on the keyboard to signal the service. Note: The service processor menus cannot be accessed simultaneously on a virtual terminal window on the HMC and on an ASCII terminal. Accessing the menus on the HMC locks out access to the ASCII terminals and the other way around. When you gain access, the service processor prompts you for a password (if one is set), and when verified, displays the service processor menus. The service processor menu prompt, represented by 0> or 1>, indicates the serial port to which the terminal is connected. v An ASCII terminal can have the following prompts: – 0> indicates serial port 1 (S1) – 1> indicates serial port 2 (S2)v The HMC always indicates 0> 769 Accessing the Service Processor Menus Remotely If your system has a modem connected to serial port 1 or serial port 2 and is configured for call-in (see “Modem Configuration Menu” on page 796), access the service processor menus remotely as follows: 1. With the system powered off, call in from a remote terminal. 2. The service processor detects ring-indicate and prompts you for a password (if one is set). When verified, the service processor menus display remotely.Saving and Restoring Service Processor Settings All the settings that you make (except language) from the service processor menus can be backed up either for recovering from a fault that may corrupt these settings, or for replicating these settings to other servers that include a service processor. The service aid, Save or Restore Hardware Management Policies, can be used to save your settings after initial setup or whenever the settings must be changed for system operation purposes. It is strongly recommended that you use this service aid for backing up service processor settings to protect the usefulness of the service processor and the availability of the server. Refer to “Save or Restore Hardware Management Policies,” in the “Introduction to Tasks and Service Aids” section of the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. If this task cannot be run, or the service processor settings were not previously backed up, the settings should be recorded manually. To record the settings manually, do the following: 1. At the service processor main menu, select option 3, ?System Information Menu. 2. At the system information menu, select option 6, ?Read Service Processor Configuration?. 3. Manually record the settings.Menu Inactivity The service processor exits menu mode after ten minutes of inactivity and displays a message indicating that it has done so. Pressing any key on the virtual terminal window causes the main menu to display. 770 406/90 Service Guide General User Menu The menu options presented to the general user are a subset of the options available to the privileged user. The user must know the general-access password, if one is set, to access this menu. GENERAL USER MENU 1. Power-on System 2. Power-off System 3. Read VPD Image from Last System Boot 4. Read Progress Indicators from Last System Boot 5. Read Service Processor Error Logs 6. Read System POST Errors 99. Exit from Menus 0> v Power-on System Allows the user to start the system using the current virtual terminal window as the active console. v Power-off System This option is not available on this system. v Read VPD Image from Last System Boot Displays manufacturer vital product data, such as serial numbers, part numbers, and so on, that were stored from the system boot prior to the one in progress now, for the entire system. v Read Progress Indicators from Last System Boot Displays a number of the boot progress indicators, which may include service processor checkpoints, IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be useful in diagnosing system faults.Note: If you are running one or more logical partitions, enter the partition ID (0-15) to display progress indicators for that partition since the last system boot. If your system is running in full system partition mode, this option automatically displays details from partition 0. The progress indicator codes are listed from top (latest) to bottom (oldest). This information is not stored in nonvolatile storage. If the system is powered off using the power-on button on the operator panel, this information is retained. If the ac power is disconnected from the system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 818. v Read Service Processor Error Logs Displays the service processor error logs. For an example, refer to “Service Processor Error Logs” on page 817. v Read System POST Errors Displays additional error log information (this option is only for service personnel). v Exit from Menus Selecting this option will exit the service processor menus. You can reenter the menus by pressing any key on the console. Chapter 7. Using the Service Processor 771 Privileged User Menus The following menus are available to privileged users only. The user must know the privileged-access password, if one is set, to access these menus. Main Menu A listing at the top of the main menu contains the following: v Your system’s current firmware version v The firmware copyright notice v The system name given to your server during setupYou need the firmware version for reference when you either update or repair the functions of your service processor. The system name, an optional field, is the name that your server reports in problem messages. This name helps your support team (for example, your system administrator, network administrator, or service representative) to more quickly identify the location, configuration, and history of your server. Set the system name, from the main menu, using option 6. Note: The information under the Service Processor Firmware heading in the following Main Menu illustration is example information only. Service Processor Firmware VERSION: RH011007 Copyright 2001 IBM Corporation SYSTEM NAME MAIN MENU 1. Service Processor Setup Menu 2. System Power Control Menu 3. System Information Menu 4. Language Selection Menu 5. Call-In/Call-Out Setup Menu 6. Set System Name 99. Exit from Menus 0> v Service Processor Setup Menu See “Service Processor Setup Menu” on page 773 for more information. v System Power Control Menu See “System Power Control Menu” on page 778 for more information. v System Information Menu See “System Information Menu” on page 781 for more information. v Language Selection Menu See “Language Selection Menu” on page 794 for more information. v Call-In/Call-Out Setup Menu See “Call-In/Call-Out Setup Menu” on page 795 for more information. v Set System Name Allows setting of the system name. 772 406/90 Service Guide Service Processor Setup Menu The following Service Processor Setup Menu is accessed from the Main Menu: SERVICE PROCESSOR SETUP MENU 1. Change Privileged Access Password 2. Change General Access Password 3. Enable/Disable Console Mirroring: Currently Enabled 4. Start Talk Mode 5. OS Surveillance Setup Menu NOT supported in LPAR mode. 6. Reset Service Processor 7. Reprogram Flash EPROM Menu 8. Serial Port Snoop Setup Menu NOT supported in LPAR mode. 9. Scan Log Dump Policy: Currently Never 98. Return to Previous Menu 99. Exit from Menus 0> Note: Unless otherwise stated in menu responses, settings become effective when a menu is exited using option 98 or 99. Chapter 7. Using the Service Processor 773 Passwords Passwords can be any combination of up to eight alphanumeric characters. You can enter longer passwords, but the entries are truncated to include only the first eight characters. The privileged-access password can be set from service processor menus or from System Management Services (SMS) utilities (see Chapter 8, “Using System Management Services,” on page 823). The general-access password can be set only from service processor menus. For security purposes, the service processor counts the number of attempts to enter passwords. The results of not recognizing a password within this error threshold are different, depending on whether the attempts are being made locally (at the server) or remotely (through a modem). The error threshold is three attempts. If the error threshold is reached by someone entering passwords at the server, the service processor commands the server to resume the initial program load (IPL). This action is taken based on the assumption that the server is in an adequately secure location with only authorized users having access. Such users must still successfully enter a login password to access the operating system. If the error threshold is reached by someone entering passwords remotely, the service processor commands the server to power off to prevent potential security attacks on the server by unauthorized remote users. The following table lists what you can access with the privileged-access password and the general-access password. Privileged Access Password General Access Password Resulting Menu None None Service processor MAIN MENU displays. Set None Users with the password see the service processor MAIN MENU. Users without password cannot log in. Set Set Users see menus associated with the entered password. If you forget the password, you must remove the battery for at least 30 seconds to disable the password. v Change Privileged-Access Password Set or change the privileged-access password. It provides the user with the capability to access all service processor functions. This password is usually used by the system administrator or root user. v Change General-Access Password Set or change the general-access password. It provides limited access to service processor menus, and is usually available to all users who are allowed to power on the server, especially remotely.Note: The general-access password can only be set or changed after the privileged access password is set. v Enable/Disable Console Mirroring Note: Console mirroring is disabled in partitioned systems. Console mirroring is supported on serial port 1 (S1) and serial port 2 (S2). When console mirroring is enabled, the service processor sends information to all serial ports. The serial port from which console mirroring is enabled is referred to as the active port. The mirror port is determined when keyboard input is detected from one of the other ports. From this point on, the service processor sends information only to the active port and the mirror port. This capability can be enabled by local or remote users, providing local users with the capability to monitor remote sessions. Console mirroring can be enabled for the current session only. For more information, see “Console Mirroring” on page 816. 774 406/90 Service Guide v Start Talk Mode In a console-mirroring session, it is useful for those who are monitoring the session to be able to communicate with each other. Selecting Start Talk Mode activates the keyboards and displays for such communications while console mirroring is established. This is a full duplex link, so message interference is possible. Alternating messages between users works best. v OS Surveillance Setup Menu Note: This option is disabled in partitioned systems. This menu can be used to set up operating system (OS) surveillance. OS Surveillance Setup Menu 1. Surveillance: Currently Enabled 2. Surveillance Time Interval: 2 minutes 3. Surveillance Delay: 2 minutes 98. Return to Previous Menu 0> – Surveillance Can be set to Enabled or Disabled. – Surveillance Time Interval Can be set to any number from 2 through 255. – Surveillance Delay Can be set to any number from 0 through 255.Refer to “Service Processor System Monitoring - Surveillance” on page 813 for more information about surveillance. v Reset Service Processor If this option is selected, entering Y causes the service processor to reboot. v Reprogram Flash EPROM Menu This option updates the system EPROMs. After entering Y to indicate that you want to continue, you are prompted to enter the update diskettes. Follow the instructions on the screen. When the update is complete, the service processor reboots. All system EPROMs that can be reprogrammed are updated at the same time and are as follows: – System power control network programming – Service processor programming – System firmware programming – Run-Time Abstraction Services Chapter 7. Using the Service Processor 775 v Serial Port Snoop Setup Menu Note: This option is disabled in partitioned systems. This menu can be used to set up serial port snooping, in which the user can configure serial port 1 as a ?catch-all? reset device. From the service processor main menu, select option 1, service processor setup menu, then select option 8 (Serial Port Snoop Setup Menu). SERIAL PORT SNOOP SETUP MENU 1. System reset string: Currently Unassigned 2. Snoop Serial Port: Currently Unassigned 98. Return to Previous Menu 1> Use the Snoop Serial Port option to select the serial port to snoop. Note: Only serial port 1 is supported. Use the system reset string option to enter the system reset string, which resets the machine when it is detected on the main console on Serial Port 1. After serial port snooping is correctly configured, at any point after the system is booted to AIX, whenever the reset string is typed on the main console, the system uses the service processor reboot policy to restart. Because pressing Enter after the reset string is not required, make sure that the string is not common or trivial. A mixed-case string is recommended. v Scan Log Dump Policy A scan dump is the collection of chip data that the service processor gathers after a system malfunction, such as a checkstop or hang. The scan dump data may contain chip scan rings, chip trace arrays, and SCOM contents. The scan dump data are stored in the system control store. The size of the scan dump area is approximately 4 MB. During the scan log dump, A8xx (in the range A810 to A8FF) displays in the operator panel. The xx characters will change as the scan log dump progresses. If the xx characters do not change after several minutes, the service processor is hung and must be reset. When the scan log dump is complete, depending on how the reboot policy is set, the system will either: – Go to the standby state (and the service processor menus will be available), indicated by OK or STBY in the operator panel OR– Attempt to reboot. 776 406/90 Service Guide Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0>1 Select from the following options: (As Needed=2, Always=3) Enter New Option: 0>1 The scan log dump policy can be set to the following: 2=As Needed The processor run-time diagnostics record the dump data base on the error type. This is the default value. 3=Always Selecting this option allows the service processor to record a scan log dump for all error types.The scan log dump policy can also be set from the Tasks menu in the AIX service aids. Option 2 displays the following screen: Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0>2 Select from the following options: (As Requested=1, Optimum=2, Complete=3, Minimum=4)) Enter New Option: 0>0 The scan log dump content can be set to the following: 1 = As needed The processor run-time diagnostics will select the the contents of the dump file based on the type of error that occurs. This is the default value. 2 = Optimum The dump will include the smallest amount of information possible (a minimum number of hardware scan log rings). 3 = Complete The dump will include as much information as possible to allow the complete analysis of hardware and software errors. 4 = Minimum The dump will include the smallest amount of information possible (a minimum number of hardware scan log rings). Chapter 7. Using the Service Processor 777 The complete dump will take the longest time to finish; it may take as long as 1.5 hours on a fully configured system. Option 3, Immediate Dump, can only be used when the system is in the standby state with power on. It is used to dump the system data after a checkstop or machine check occurs when the system firmware is running, or when the operating system is booting or running. The scan log dump policy can also be set from the Tasks menu in the AIX service aids. If a valid dump file already exists, the dump control code will stop because the contents of the prior dump must be protected.System Power Control Menu This menu is used to set power control options. Other menus that control boot options are available from this menu. SYSTEM POWER CONTROL MENU 1. Enable/Disable Unattended Start Mode: Currently Enabled 2. Ring Indicate Power-On Menu 3. Reboot/Restart Policy Setup Menu 4. Power-On System 5. Power-Off System 6. Enable/Disable Fast System Boot Currently Enabled 7. Boot Mode Menu 98. Return to Previous Menu 99. Exit from Menus 0> v Enable/Disable Unattended Start Mode Use this option to instruct the service processor to restore the power state of the server after a temporary power failure. This option is intended to be used on servers that require automatic power-on after a power failure. For more information, see “System Power-On Methods” on page 802. v Ring Indicate Power-On Menu RING INDICATE POWER-ON MENU 1. Ring indicate power-on : Currently Enabled 2. Number of rings: Currently 3 30. Refresh Modem Settings 98. Return to Previous Menu Ring indicate power-on is enabled by default on both serial port 1 (S1) and serial port 2 (S2). When ring indicate power-on is enabled, call-in is disabled. If ring indicate power-on is enabled and call-in is already enabled, you will be asked to confirm your choice. Refer to the message displayed on your screen. 778 406/90 Service Guide If the ring indicate power-on setting is changed, you must select option 30, Refresh Modem Settings to update the modem settings. If Refresh Modem Settings is selected, and the modem(s) have not been configured, you will be asked to configure the modems first. See “Call-In/Call-Out Setup Menu” on page 795 for information on configuring modems. Option 2 is used to set the number of rings. v Reboot/Restart Policy Setup Menu The following menu controls the Reboot/Restart Policy: Reboot/Restart Policy Setup Menu 1. Number of reboot attempts: Currently 1 2. Use OS-Defined restart policy? Currently No 3. Enable supplemental restart policy? Currently Yes 4. Call-Out before restart: Currently Disabled 98. Return to Previous Menu 0> Reboot is the process of bringing up the system hardware; for example, from a system reset or power on. Restart is activating the operating system after the system hardware is reinitialized. Restart must follow a successful reboot. – Number of reboot attempts - If the server fails to successfully complete the boot process, it attempts to reboot the number of times specified. Entry values equal to or greater than 0 are valid. Only successive failed reboot/restart attempts are counted. – Use OS-Defined restart policy - In a full system partition, this allows the service processor to react in the same way that the operating system does to major system faults by reading the setting of the operating system parameter Automatically Restart/Reboot After a System Crash. This parameter might already be defined, depending on the operating system or its version (or level). If the operating system automatic restart setting is defined, it can be set to respond to a major fault by restarting or by not restarting. See your operating system documentation for details on setting up operating system automatic restarts. The default value is No. On a partitioned system, this setting is ignored. – Enable supplemental restart policy - The default setting is Yes. When set to Yes in a full system partition, the service processor restarts the system when the system loses control as detected by service processor surveillance, and either: - The Use OS-Defined restart policy is set to No. OR - The Use OS-Defined restart policy is set to Yes, and the operating system has no automatic restart policy. If set to Yes in a partitioned system, the service processor restarts the system when the system loses control and it is detected by service processor surveillance.– Call-Out before restart (Enabled/Disabled) - If a restart is necessary due to a system fault, and you are running a full system partition, you can enable the service processor to call out and report the event. This option can be valuable if the number of these events becomes excessive, which might signal a bigger problem. This setting is ignored on a partitioned system. Chapter 7. Using the Service Processor 779 v Power-On System Allows immediate power-on of the system. v Power-Off System This option is not available on this system. v Enable/Disable Fast System Boot Allows the user to select the IPL type, mode, and speed of the system boot. Attention: Selecting the fast IPL results in several diagnostic tests being skipped and a shorter memory test being run. v Boot Mode Menu Note: This option is disabled in partitioned systems. The Boot Mode Menu allows you to select a boot mode. Boot Mode Menu 1. Boot to SMS Menu: Currently Disabled 2. Service Mode Boot from Saved List: Currently Disabled 3. Service Mode Boot from Default List: Currently Disabled 4. Boot to Open Firmware Prompt: Currently Disabled 98. Return to Previous Menu 0> To select a boot mode, select a number and press Enter. The item corresponding to the selected number toggles between Disabled to Enabled. If a boot mode is Enabled, the boot mode selected is performed, and the Disabled/Enabled selection is reset to Disabled. The following describes each boot mode: – Boot to SMS Menu When this selection is enabled, the system boots to the System Management Services (SMS) Menu. – Service Mode Boot from Saved List This selection causes the system to perform a service mode boot using the service mode boot list saved in NVRAM. If the system boots AIX from the disk drive and AIX diagnostics are loaded on the disk drive, AIX boots to the diagnostics menu. Using this option to boot the system is the preferred way to run online diagnostics. – Service Mode Boot from Default List This selection is similar to Service Mode Boot from Saved List, except the system boots using the default boot list that is stored in the system firmware. This is normally used to try to boot customer diagnostics from the CD-ROM drive. Using this option to boot the system is the preferred way to run standalone diagnostics. – Boot to Open Firmware This option should only be used by service personnel to obtain additional debug information. When this selection is enabled, the system boots to the open firmware prompt. 780 406/90 Service Guide System Information Menu This menu provides access to system configuration information, error logs, system resources, and processor configuration. SYSTEM INFORMATION MENU 1. Read VPD Image from Last System Boot 2. Read Progress Indicators from Last System Boot 3. Read Service Processor Error Logs 4. Read System POST Errors 5. Read NVRAM 6. Read Service Processor Configuration 7. Processor Configuration/Deconfiguration Menu 8. Memory Configuration/Deconfiguration Menu 9. Power Control Network Utilities Menu 10. LED Control Menu 11. MCM/L3 Interposer Plug Count Menu 12. Performance Mode Setup Menu 13. L3 Mode Menu Not Applicable 14. Remote I/O (RIO) Link Speed Setup Menu 98. Return to Previous Menu 99. Exit from Menus 0> v Read VPD Image from Last System Boot Displays manufacturer’s vital product data (VPD), such as serial numbers, part numbers, and so on, that was stored from the system boot prior to the one in progress now. VPD from all devices in the system is displayed. v Read Progress Indicators from Last System Boot Displays a number of the boot progress indicators, which may include service processor checkpoints, IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be useful in diagnosing system faults.Note: If you are running from a partitioned system, enter the partition ID (0-15) to display progress indicators for that partition since the last system boot. In a full system partition, this option automatically displays details from partition 0. The progress indicator codes are listed from top (latest) to bottom (oldest). This information is not stored in nonvolatile storage. If the system is powered off using the power-on button on the operator panel, this information is retained. If the ac power is disconnected from the system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 818. v Read Service Processor Error Logs Displays error conditions detected by the service processor. Refer to “Service Processor Error Logs” on page 817 for an example of this error log. v Read System POST Errors Chapter 7. Using the Service Processor 781 This option should only be used by service personnel to obtain additional debug information. v Read NVRAM Displays Non Volatile Random Access Memory (NVRAM) content. v Read Service Processor Configuration Displays current service processor configuration. v Processor Configuration/Deconfiguration Menu Enable/Disable CPU Repeat Gard: CPU repeat gard automatically deconfigures a CPU during a system boot if a processor has failed BIST (built-in self test), caused a machine check or check stop, or has reached a threshold of recoverable errors. The processor will remain deconfigured until repeat gard is disabled or the processor is replaced. CPU repeat gard is enabled by default. For more information, see “Configuring and Deconfiguring Processors or Memory” on page 812. Enable/Disable Dynamic Processor Sparing: This option is part of the Capacity Upgrade on Demand (CUoD) function. Dynamic processor sparing is the capability of the system to deconfigure a failing (or potentially failing) processor and then configure a replacement processor from the unlicensed CUoD processor pool. – If the system is running logical partitions, the dynamic processor sparing operation takes place with the system running. – If the system is booted in a full system partition, the system must be rebooted for the dynamic processor sparing operation to take place.Note: The memory affinity of the failing processor is not taken into account when the replacement processor is assigned; the replacement processor is the next one that is available. The Processor Configuration/Deconfiguration menu allows the user to change the system processor configuration. If it is necessary to take one of the processors offline, use this menu to deconfigure the processor, and then reconfigure the processor at a later time. An example of this menu follows: PROCESSOR CONFIGURATION/DECONFIGURATION MENU 77. Enable/Disable CPU Repeat Gard: Currently Enabled 78. Enable/Disable Dynamic Processor Sparing (if available): Currently enabled 1. 0 3.0 (00) Configured by system 2. 1 3.1 (00) Deconfigured by system 3. 2 3.2 (00) Configured by system 4. 3 3.3 (00) Configured by system 5. 4 3.4 (00) Configured by system 6. 5 3.5 (00) Deconfigured by system 7. 6 3.6 (00) Configured by system 8. 7 3.7 (00) Configured by system 98. Return to Previous Menu 0> Note: This table is built from vital product data collected during the last boot sequence. The first time the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this table may be empty. The table is rebuilt during the next boot into AIX. The fields of the previous table represent the following: Column 1 (1.) Menu selection index. 782 406/90 Service Guide Column 2 (0) Logical processor device number assigned by AIX. You can display these logical device numbers by issuing the following command on the AIX command line: lsdev -C | grep proc Column 3 (3.0) Processor address list used by the service processor. Column 4 (00) Error status of the processors.The error status of each processor is indicated by AB, where B indicates the number of errors and A indicates the type of error according to the following: 1. Bring-up failure 2. Run-time non-recoverable failure 3. Run-time recoverable failure 4. Group integrity failure 5. Non-repeat-gardable error. The resource may be reconfigured on the next boot.A status of 00 indicates that the CPU has not had any errors logged against it by the service processor. To enable or disable CPU repeat gard, use menu option 77. The default is enabled. If CPU repeat gard is disabled, processors that are in the ?deconfigured by system? state will be reconfigured. These reconfigured processors are then tested during the boot process, and if they pass, they remain online. If they fail the boot testing, they are deconfigured even though CPU repeat gard is disabled. The failure history of each CPU is retained. If a processor with a history of failures is brought back online by disabling repeat gard, it remains online if it passes testing during the boot process. However, if CPU repeat gard is enabled, the processor is taken offline again because of its history of failures. Notes: 1. The processor numbering scheme used by the service processor is different from the numbering scheme used by AIX. To ensure the correct processor is selected, consult the AIX documentation before configuring or deconfiguring a processor. 2. The number of processors available to AIX can be determined by issuing the following command on the AIX command line: bindprocessor -q.v Memory Configuration/Deconfiguration Menu Enable/Disable Memory Repeat Gard: Memory repeat gard partially or fully deconfigures a memory book automatically during a system boot. If a memory book has failed BIST POST, caused a machine check or checkstop, or has reached a threshold of recoverable errors, the memory will remain deconfigured until repeat gard is disabled or the memory is replaced. Memory repeat gard is enabled by default. For more information, see “Configuring and Deconfiguring Processors or Memory” on page 812. Runtime Recoverable Error Repeat Gard: The runtime recoverable error repeat gard flag controls the deallocation of the memory if a recoverable error occurs during runtime. If a recoverable memory error occurs, and the runtime recoverable error repeat gard is disabled, the system will continue running with no change in the memory configuration. If a recoverable memory error occurs, and runtime recoverable error repeat gard is enabled, the memory half-book or book in which the error occurred will be taken offline. Runtime Recoverable Error Repeat Gard is disabled by default. These configuration/deconfiguration menus allow the user to change the system memory configuration. If it is necessary to take one of the memory books partially or completely offline, this menu allows you to deconfigure a book, and then reconfigure the book at a later time. This menu also allows you to see if the repeat gard function has partially or completely deconfigured a memory book. When this option is selected, a menu displays. The following is an example of this menu: Chapter 7. Using the Service Processor 783 MEMORY CONFIGURATION/DECONFIGURATION MENU 77. Enable/Disable Memory Repeat Gard: Currently Enabled 78. Runtime Recoverable Error Repeat Gard: Currently Enabled 1. Memory card 98. Return to Previous Menu After you select the memory card option by entering 1, a menu displays, allowing the selection of a memory book. The following is an example of this menu. MEMORY CONFIGURATION/DECONFIGURATION MENU 1: 16.16(00, -) Configured by system 2: 17.17(00, -) Configured by system 3: 18.18(00, -) Configured by system 4: 19.19(00, 1) Partially deconfigured by system 98. Return to Previous Menu Note: This table is built from vital product data collected during the last boot sequence. The first time the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this table may be empty. The table is rebuilt during the next boot into AIX. The fields in the previous table represent the following: Column 1 1. Menu selection index/book number Column 2 xx.xx : Book address used by service processor Column 3 (00, -) Error/deconfiguration statusThe error status of the each memory book is indicated by (AB,-1) where B indicates the number of errors and A indicates the type of error according to the following: 1. Bring-up failure 2. Run-time non-recoverable failure 3. Run-time recoverable failure 4. Group integrity failure 5. Non-repeat-gardable error. The resource may be reconfigured on the next boot. An error status of (00, -) (for example, 11.16(00, -)) indicates that the memory book has not had any errors logged against it by the service processor, and it is fully configured. The field after the error status will be a dash (-), 0, or 1. The dash indicates that the memory book is fully configured. A 0 or 1 indicates that memory repeat gard has deconfigured half of the memory book. If this occurs, the status of the book in the menu is shown as Partially deconfigured by system. To change the memory configuration, select the number of the memory book. The memory book state will change from configured to deconfigured or from deconfigured to configured. This menu only allows the deconfiguration of an entire book; it does not allow the manual deconfiguration of half a book. If half a book has been configured by the system (Partially deconfigured), it can be manually reconfigured using this menu. In the previous example menu, each line shows two books and indicates whether they are configured, deconfigured, or partially deconfigured. To enable or disable Memory Repeat Gard, use menu option 77 of the Memory Configuration/Deconfiguration Menu. The default is enabled. 784 406/90 Service Guide To enable or disable runtime recoverable error repeat gard, use option 78 of the Memory Configuration/Deconfiguration Menu. The default is enabled. The failure history of each book is retained. If a book with a history of failures is brought back online by disabling Repeat Gard, it remains online if it passes testing during the boot process. However, if Repeat Gard is enabled, the book is taken offline again because of its history of failures. The four inner memory books, if present, are listed first, followed by the four outer books, if present. The memory books are shown in the following order of physical location code: 1 U1.18-P1-M2 5 U1.18-P1-M4 2 U1.18-P1-M3 6 U1.18-P1-M8 3 U1.18-P1-M7 7 U1.18-P1-M5 4 U1.18-P1-M6 8 U1.18-P1-M1 v Power Control Network Utilities Menu POWER CONTROL NETWORK UTILITIES MENU 1. Lamp Test for all Operator Panels 2. Display I/O Type 3. Change I/O Type 4. Collect & display SPCN trace data 5. Start SPCN Flash Update 6. Display Power Subsystem FRU Code Levels 7. Power Subsystem Code Update via the Power Control Network Currently Enabled 8. Start Power Subsystem Code Update 98. Return to Previous Menu 0> – Lamp Test for All Operator Panels Selecting this option tests the media drawer operator panel’s indicators by causing them to blink on and off for approximately 30 seconds. – Display I/O Type This option, when selected, will display the SPCN address of the CEC drawer (D0), and the I/O type of the service processor (which is part of the primary I/O book assembly). – Change I/O Type Use this option to change the I/O type of the primary I/O book after a service action or configuration change if the I/O type is incorrect. If this option is chosen, you will be asked to make the following entries: - For the I/O drawer address, type 1. - For the I/O type, type: - If a High Performance Switch is installed in or attached to the system, type 9D. - If a High Performance Switch is not installed in or attached to the system, type 98.If either value is not valid, a failure message displays on the console. Press Enter to return to the Power Control Network Utilities Menu. – Collect & display SPCN trace data This option is used by service personnel to dump the SPCN trace data from the processor subsystem (CEC drawer) to gather additional debug information. Chapter 7. Using the Service Processor 785 To dump the SPCN trace data for the processor subsystem (CEC drawer), enter 0 when prompted for the SPCN drawer number and enter d0 when prompted for the unit address, as shown in the example screen: Enter the SPCN drawer number: 0 Enter the unit address: d0 Note: It may take up to 5 minutes for the trace data to be dumped to the screen. The SPCN trace data will be displayed on the screen. An example of this output follows: Please wait.... ***** Power Trace Data Start ***** 00000000ffffffffffff0006158800a000061574a00000060200353700060210 031300060242000000060241040b0006110a0040000611090200000611020804 0000000000061102080b0006041000000006d0e3850000061103000000061109 02000006040200000006040200010006041000010006d0e38501000604100002 000000000006d0e385020006041000030006d0e3850300061580100000061540 03180006101116000006101116010006158010010007158156a0000711000010 00000000000910120000000910120001 ***** Power Trace data End ***** (Press Return to Continue) v Start SPCN Flash Update On this system, selecting this option initiates updating (or ?flashing?) of the SPCN (system power control network) firmware. v Display Power Subsystem FRU Code Levels Selecting this option displays information about each power subsystem FRU. The physical location code, and the firmware level (which is shown as the ?Code level?) that is currently installed on the FRU, as well as other information, are displayed. A sample of the output of this option, which shows several of the types of power subystem FRUs in this system, follows. 786 406/90 Service Guide Please wait.... Fru type = 00f6 BPC-RH Cage id = ff BPA Port = A Fru id = 80 BPC Location Code = U1.35-P1-X4 Code level = 0000255e Status = 01 Successfully collected Fru type = 0048 DCA-RHP Cage id = 00 CEC Cage Fru id = 90 DCA 1 Location Code = U1.18-P1-V1 Code level = 00000027 Status = 01 Successfully collected Fru type = 0050 MDA-X Cage id = 00 CEC Cage Fru id = 98 MDA 1 Location Code = U1.18-X1 Code level = 00000007 Status = 01 Successfully collected Fru type = 0047 DCA-BC Cage id = 01 I/O Drawer 1 Fru id = 90 DCA 1 Location Code = U1.9-V1 Code level = 0000fb23 Status = 01 Successfully collected Fru type = 0054 MDA-BC Cage id = 01 I/O Drawer 1 Fru id = 98 MDA 1 Location Code = U1.9-V1 Code level = 00000005 Status = 01 Successfully collected v Power Subsystem Code Update via the Power Control Network If this option is enabled (the default), the power subsystem’s code update will be controlled by the system power control network code. If this option is disabled, the power subsystem’s code update will be controlled by the HMC. Enabling the update by the SPCN locks out the update from the HMC and vise-versa. This option should only be disabled if a High Performance Switch was installed at the factory, or is added to the frame or connected to the system as an MES. If the High Performance Switch is removed from the system, or it is detached from the system, this option should be reenabled. v Start Power Subsystem Code Update Selecting this option forces the service processor to initiate a firmware update for all of the power subsystem codes; this includes the SPCN code. As the various types of firmware are updated, the service processor will report the progress of the update on the screen (as long as the system remains in standby) until the update is complete. An example of this output follows. Chapter 7. Using the Service Processor 787 0>8 Checking for SPCN code level... Updating SPCN Microcode PLEASE WAIT... System Power Control Network (SPCN) FLASH Update Successful: Initializing for Power Subsys code updates... Please wait...., spcn_rc = 21 Checking Power Subsystem code levels... Updating BPC (a0e00e01)... Please wait ..... Completed... (a0e00e01) Please wait...., spcn_rc = 21 Updating BPC (a0e00e01)... Please wait ..... Completed... (a0e00e01) Please wait...., spcn_rc = 21 Updating BPR (a0e00e02)... Please wait ..... Completed... (a0e00e02) Updating BPR (a0e00e02)... Please wait ..... Completed... (a0e00e02) Updating BPR (a0e00e02)... Please wait ..... Completed... (a0e00e02) Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Updating DCA-RHP (a0e00e09)... Please wait ..... Completed... (a0e00e09) Please wait...., spcn_rc = 21 Checking Power Subsystem code levels... Power Subsystem code update completed (Press Return to Continue) v LED Control Menu This menu displays the state of the processor subsystem disturbance or system attention LED and the fault/identify LED on the I/O subsystem(s). Use this menu to toggle the attention/fault LEDs between identify (blinking) and off. Option 1 is available only when the system is in the error state (the CEC is powered on and the service processor menus are available). Option 1 is not available when the system is in standby. 788 406/90 Service Guide An example of this menu follows: LED Control Menu 1. Set/Reset Identify LED state 2. Clear System Attention Indicator 98. Return to Previous Menu Enter LED index 0 > The processor subsystem disturbance or system attention LED is located on the operator panel in the media drawer. The I/O drawer fault/identify LED is located on the front of each I/O subsystem. If option 1 is selected, a list of location codes of the I/O subsystems and the CEC drawer displays. The screen will be similar to the following: 1. U1.9-P1 2. U1.9-P2 3. U1.5-P1 4. U1.5-P2 5. U1.1-P1 6. U1.1-P2 7. U2.1-P1 8. U2.1-P2 9. U2.5-P1 10. U2.5-P2 Enter number corresponding to the location code, or press Return to continue, or ’x’ to return to the menu. 0>4 If one of the devices is selected using the index number, the present state of its LED will be displayed, and you are given the option to toggle it, as shown in the following screen. The final state of the LED will then be displayed, whether or not it was changed. U1.5-P2 is currently in the OFF state Select from the following (1=IDENTIFY ON, 2=IDENTIFY OFF) 0>2 Please wait... U1.5-P2 is currently in the OFF state (Press Return to continue) Option 2, Clear System Attention Indicator, will clear the attention indicator on the operator panel in the media drawer. v MCM/L3 Interposer Plug Count Menu Attention: Do not power on the system when in this menu. Fully exit this menu before powering on the system. This menu tracks the number of times that the MCMs and L3 cache modules have been plugged into the system backplane. Chapter 7. Using the Service Processor 789 If an MCM or L3 cache module is reseated or replaced, the plug count for the module must be incremented by 1. If the plug count exceeds the limit of 10 (reaches 11 or greater), a 450x yyyy or 4B2x yyyy error with a detail value of CFF0 that identifies an MCM or L3 cache module will be posted in the service processor error log. The FRU should be replaced during a deferred service call. If an MCM or L3 cache module is reseated or replaced or installed during an MES upgrade, the plug count must be set using the MCM/L3 Interposer Plug Count menu. If the plug count information is not included with the new or replacement module, enter the default value of 7. If the plug count is not entered, a B1xx 4698 error code, with a detail value of E10B or E10C, will be posted in the service processor error log. If the primary I/O book is replaced, the plug counts are retained. However the MCM/L3 Plug Count menu must be accessed and option 50, Commit the values and write to the VPD, must be executed so that the plug counts are revalidated. If the counts are not revalidated, a B1xx 4698 error code, with a detail value of E10B or E10C, will be posted in the service processor error log.Notes: 1. The plug count will be 0 for those positions in which L3 cache modules and MCMs have never been installed. 2. If MCM and L3 modules are removed from the system and not replaced, those plug counts are retained until they are manually changed. A screen similar to the following will be displayed. In this example, all four MCMs and all 16 L3 modules are shown. MCM/L3 Interposer Plug Count Menu 1. L3_13:7 2. L3_0:7 3. MCM_0:7 4. L3_1:7 5. L3_4:7 6. L3_14:7 7. L3_3:7 8. L3_2:7 9. L3_7:7 10. MCM_3:1 11. MCM_2:1 12. L3_15:7 13. L3_10:7 14. MCM_1:7 15. L3_11:7 16. L3_6:7 17. L3_12:7 18. L3_9:7 19. L3_8:7 20. L3_5:7 50. Commit the values and write to the VPD 98. Return to Previous Menu The format of the preceding menu entries is the menu index number followed by the plug count after the colon. The following table matches the index number with the physical location codes. Menu Index Number Physical Location Code 1. L3_13 U1.18-P1-C5 2. L3_0 U1.18-P1-C6 3. MCM_0 U1.18-P1-C1 4. L3_1 U1.18-P1-C7 5. L3_4 U1.18-P1-C8 6. L3_14 U1.18-P1-C9 7. L3_3 U1.18-P1-C10 8. L3_2 U1.18-P1-C11 9. L3_7 U1.18-P1-C12 10. MCM_3 U1.18-P1-C2 11. MCM_2 U1.18-P1-C3 790 406/90 Service Guide 12. L3_15 U1.18-P1-C13 13. L3_10 U1.18-P1-C14 14. MCM_1 U1.18-P1-C4 15. L3_11 U1.18-P1-C15 16. L3_6 U1.18-P1-C16 17. L3_12 U1.18-P1-C17 18. L3_9 U1.18-P1-C18 19. L3_8 U1.18-P1-C19 20. L3_5 U1.18-P1-C20 To change the plug count for a particular module, enter a menu index number. For example, to change the plug count of the L3 module that is physically in the upper-right corner (U1.18-P1-C8), type 5, then enter the new plug count. When all of the new plug counts have been entered, select 50, Commit the values and write to the VPD. This action stores the new values in NVRAM. Determining the number of MCMs in a system: To determine the number of MCMs installed in the system, do one of the following: – If the managed system is powered down, go to “System Information Menu” on page 781, then to the section titled “Processor Configuration/Deconfiguration Menu” and refer to the example menu. The VPD port is shown as x.y, where x is in the range 3-6 and y is in the range 0-7. The listing of the VPD ports on the screen is used to determine the number of MCMs in the managed system. 3.y=MCM0 (U1.18-P1-C1) 4.y=MCM1 (U1.18-P1-C4) 5.y=MCM2 (U1.18-P1-C3) 6.y=MCM3 (U1.18-P1-C2) – If AIX is booted on the managed system, issue the following command on the AIX command line, to display a line of information for each MCM installed in the system: lscfg -vp | grep MCM The following table shows the plugging order of each MCM and its associated L3 cache modules. MCM Physical Location CODE Interposer Plug Count Menu Index Number First MCM:L3 Modules U1.18-P1-C1U1.18-P1-C6U1.18-P1-C7U1.18-P1-C10U1.18-P1-C11 3. MCM_02. L3_0 4. L3_17. L3_38. L3_2 Second MCM:L3 Modules U1.18-P1-C4U1.18-P1-C14U1.18-P1-C15U1.18-P1-C18U1.18-P1-C19 14. MCM_113. L3_1015. L3_1118. L3_919. L3_8 Third MCM:L3 Modules U1.18-P1-C3U1.18-P1-C8U1.18-P1-C12U1.18-P1-C16U1.18-P1-C20 11. MCM_25. L3_49. L3_7 16. L3_620. L3_5 Chapter 7. Using the Service Processor 791 MCM Physical Location CODE Interposer Plug Count Menu Index Number Fourth MCM:L3 Modules U1.18-P1-C2U1.18-P1-C5U1.18-P1-C9U1.18-P1-C13U1.18-P1-C17 10. MCM_31. L3_136. L3_1412. L3_1517. L3_12 v Performance Mode Setup Menu If certain types of processor cards are installed in the system, this menu is not applicable. For other types of processor cards, this menu will be active after the first boot as noted below.Note: The first time the system is booted after NVRAM is cleared, Not Applicable displays under Performance Mode Setup Menu on the screen. This may also happen if the service processor is replaced, or the processor MCMs are upgraded. If option 12 is selected when Not Applicable is on the screen, the system responds with Not Applicable and redisplay the system information menu. The setup menu can be displayed after the performance mode is set, which happens the first time the system is booted. The default performance mode is set by the firmware during IPL. The default mode provides the optimum performance for the hardware configuration of the system. The performance mode is systemwide; it cannot be set on a per-partition basis. You can override the default setting by using the performance mode setup menu. The performance mode setup menu will be similar to the following: Default Performance Mode: Large Commercial System optimization 1. Current Performance Mode: Large Commercial System optimization 98. Return to Previous Menu 0>1 Selecting option 1 displays the following performance modes: Select from the following options: 1. Large Commercial System optimization 2. Standard Operation 3. Turbo Database Mode 0> Notes: 1. Some MCM configuration changes, such as going from an 8-way to a 24-way system, will cause the default performance mode to change. This new setting will not be reflected in the menu until after the system is rebooted with the new configuration. 2. If the NVRAM has been cleared, the default performance mode and the current performance mode will indicate uninitialized. To override the default setting, a brief description of each performance mode follows: – Large Commercial System Optimization is the setting for systems that do not fall into the other two categories, standard operation and turbo database mode. This setting provides the best performance for most applications. – Standard Operation optimizes the system for large memory bandwidth applications where minimal sharing of data occurs and the likelihood of significant hardware data-prefetching exists. 792 406/90 Service Guide – Turbo Database Mode optimizes system operation for environments where there is a large amount of data sharing among processes running concurrently on the system.v L3 Mode Menu This function is not applicable to this system. v Remote I/O (RIO) Link Speed Set Up Menu This option is used to change the speed of the RIO interface to external drawers. On this system, setting this speed will have no effect on the system. The hardware capability is detected and set by the system firmware during the boot process. If the hardware capability is unitialized (after NVRAM is cleared), it will be set during the first boot and remain set during subsequent boots. The maximum RIO speed setting will remain uninitialized until set by the user. The user can set the speed lower than or equal to the hardware capability, but not higher. If the hardware capability is 1000 Mbps, the user can set the link speed to 500 Mbps, and the RIO link will run at 500 Mbps. However, if the hardware capability is 500 Mbps and the user selects 1000 Mbps, the user’s selection will be ignored by the system. If this option is chosen, the menu similar to the following is displayed: Remote I/O (RIO) Link Speed Set Up Menu Hardware Capability (internal and external): 1000 Mbps 1. Maximum RIO speed setting (internal and external): Currently Uninitialized 98. Return to previous menu. 0>1 If option 1 is selected, the RIO interface speed can be set as follows: Enter new value for this option: (1 = 1000 MBPS, 2 = 500 Mbps) 0> This setting will remain in effect until it is changed or NVRAM is cleared. Chapter 7. Using the Service Processor 793 Language Selection Menu The service processor menus and messages are available in various languages. This menu allows selecting languages in which the service processor and system firmware menus and messages are displayed. LANGUAGE SELECTION MENU 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol 98. Return to Previous Menu 99. Exit from Menus 0> Note: Your virtual terminal window must support the ISO-8859 character set to correctly display languages other than English. 794 406/90 Service Guide Call-In/Call-Out Setup Menu Note: The information in this section regarding the configuring of serial ports, and modems attached to those serial ports, applies only to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). These serial ports are normally used for call-in and call-out. Call-out is disabled in partitioned systems. The call-out function is normally handled by the Service Focal Point application running on the HMC. None of these menus are applicable to the serial ports, or modems attached to those serial ports, on the hardware management console (HMC). CALL-IN/CALL-OUT SETUP MENU 1. Modem Configuration Menu 2. Serial Port Selection Menu 3. Serial Port Speed Setup Menu 4. Telephone Number Setup Menu 5. Call-Out Policy Setup Menu 6. Customer Account Setup Menu 7. Call-Out Test NOT supported in LPAR mode. 98. Return to Previous Menu 99. Exit from Menus 0> v Modem Configuration Menu, see “Modem Configuration Menu” on page 796. v Serial Port Selection Menu, see “Serial Port Selection Menu” on page 797. v Serial Port Speed Setup Menu, see “Serial Port Speed Setup Menu” on page 798. v Telephone Number Setup Menu, see “Telephone Number Setup Menu” on page 798. v Call-Out Policy Setup Menu, see “Call-Out Policy Setup Menu” on page 800. v Customer Account Setup Menu, see “Customer Account Setup Menu” on page 801. v Call-Out Test tests the configuration after the modem is installed and configured correctly.Note: If the system is running in partition mode, the call-out option is disabled. Chapter 7. Using the Service Processor 795 Modem Configuration Menu Note: This option applies only to a modem attached to serial port 1 (S1) or serial port 2 (S2) on the primary I/O book. The first two lines of the Modem Configuration Menu contain status of the current selections. Selections are made in the sections labeled Modem Ports and Modem Configuration File Name. Select the serial port that you want to activate and then select the modem configuration file for the modem on the port. If you want to set up all of the serial ports with modems, make your selections one port at a time. Modem Configuration Menu Port 1 Modem Configuration File Name: Port 2 Modem Configuration File Name: To make changes, First select the port and then the configuration file name Modem Ports: 1. Serial port 1 2. Serial port 2 Modem Configuration File Name: 5. none9. modem_z_sp 6. modem_f_sp 10. modem_m1_sp 7. modem_f0_sp 11. modem_m0_sp 8. modem_f1_sp 12. modem_m1_sp 30. Save configuration to NVRAM and Configure modem 98. Return to Previous Menu 0> For information on choosing a modem configuration file, see “Sample Modem Configuration Files” on page 1061 and “Transfer of a Modem Session” on page 1066. 796 406/90 Service Guide Serial Port Selection Menu This menu allows you to enable or disable the call-in and call-out functions of each serial port in any combination. Note: If the system is running in partition mode, call-out is disabled. Serial Port Selection Menu 1. Serial Port 1 Call-Out: Currently Disabled NOT supported in LPAR mode. 2. Serial Port 2 Call-Out: Currently Disabled NOT supported in LPAR mode. 3. Serial Port 1 Call-In: Currently Disabled 4. Serial Port 2 Call-In: Currently Disabled 98. Return to Previous Menu 0> Call-in and ring indicate power-on cannot be enabled at the same time. If ring-indicate power-on is already enabled and you try to enable call-in, a message prompts you for confirmation. Refer to the message displayed on the screen. Chapter 7. Using the Service Processor 797 Serial Port Speed Setup Menu This menu allows you to set serial port speed to enhance terminal performance or to accommodate modem capabilities. Serial Port Speed Setup Menu 1. Serial Port 1 Speed: Currently 9600 2. Serial Port 2 Speed: Currently 9600 98. Return to Previous Menu 0> A serial port speed of 9600 baud or higher is recommended. The following are valid serial port speeds: 50 600 4800 75 1200 7200 110 1800 9600 134 2000 19200 150 2400 38000 300 3600 57600 115200 Telephone Number Setup Menu Note: These menus are available, however, the function of calling out to report a system failure is normally handled by the Service Focal Point application running on the HMC. Use this menu to set or change the telephone numbers for reporting a system failure. Telephone Number Setup Menu 1. Service Center Telephone Number: Currently Unassigned 2. Customer Administration Center Telephone Number: Currently Unassigned 3. Digital Pager Telephone Number: Currently Unassigned 4. Customer Voice Telephone Number: Currently Unassigned 5. Customer System Telephone Number: Currently Unassigned 98. Return to Previous Menu 0> 798 406/90 Service Guide v Service Center Telephone Number is the number of the service center computer. The service center usually includes a computer that takes calls from servers with call-out capability. This computer is referred to as the ?catcher.? The catcher expects messages in a specific format to which the service processor conforms. For more information about the format and catcher computers, refer to the readme file in the AIX /usr/samples/syscatch directory. Contact your service provider for the correct service center telephone number to enter. Until you have that number, leave this field unassigned. v Customer Administration Center Telephone Number is the number of the System Administration Center computer (catcher) that receives problem calls from servers. Contact your system administrator for the correct telephone number to enter here. Until you have that number, leave this field unassigned. v Digital Pager Telephone Number is the number for a numeric pager carried by someone who responds to problem calls from your server. Contact your administration center representative for the correct telephone number to enter. For test purposes, use a test number, which you can change later (see the note on page 815).Note: If the system is running in partition mode, call-out is disabled. However, if the system is booted in full system partition mode, at least one of the preceding three telephone numbers must be assigned in order for the call-out test to complete successfully. v Customer Voice Telephone Number is the telephone number of a phone near the server or answered by someone responsible for the server. This is the telephone number left on the pager for callback. For test purposes, use a test number, which you can change later. v Customer System Telephone Number is the telephone number to which your server’s modem is connected. The service or administrative center representatives need this number to make direct contact with your server for problem investigation. This is also referred to as the call-in phone number. Chapter 7. Using the Service Processor 799 Call-Out Policy Setup Menu Note: Call-out is disabled in partitioned systems. Call out settings can be set using the following menu: CALL-OUT POLICY SETUP MENU 1. Call-Out policy (First/All): Currently First 2. Remote timeout, (in seconds): Currently 120 3. Remote latency, (in seconds): Currently 2 4. Number of retries: Currently 2 98. Return to Previous Menu 0> v Call-Out policy can be set to first or all. If call-out policy is set to first, the service processor stops at the first successful call-out to one of the following numbers in the order listed: 1. Service Center 2. Customer Administrative Center 3. PagerIf call-out policy is set to all, the service processor attempts a call-out to all of the following numbers in the order listed: 1. Service Center 2. Customer Administrative Center 3. Pagerv Remote timeout and remote latency are functions of your service provider’s catcher computer. Either use the defaults or contact your service provider for recommended settings. v Number of retries is the number of times you want the server to retry calls that failed to complete. 800 406/90 Service Guide Customer Account Setup Menu This menu allows users to enter information that is specific to their account. Customer Account Setup Menu 1. Customer Account Number: Currently Unassigned 2. Customer RETAIN Login userid: Currently Unassigned 3. Customer RETAIN login password: Currently Unassigned 98. Return to Previous Menu 0> v Customer Account Number is assigned by your service provider for record-keeping and billing. If you have an account number, enter it. Otherwise, leave this field unassigned. v Customer RETAIN Login User ID and Customer RETAIN Login Password apply to a service function to which your service provider might have access. Leave these fields unassigned if your service provider does not use RETAIN.Call-out Test Call-out test is disabled in partitioned systems. Service Processor Parameters in Service Mode (Full System Partition) When the system is in service mode, the following service processor functions are suspended: v Unattended Start Mode v Reboot/Restart Policy v Call-Out v SurveillanceWhen service mode is exited, the service processor functions are re-activated. Chapter 7. Using the Service Processor 801 System Power-On Methods This section discusses the following system power-on methods: v Hardware Management Console (HMC) Using the HMC is the preferred method of powering on a managed system. For details on using the HMC to power up a system, see the Hardware Management Console Operations Guide. v Power-on Switch v Service Processor Menus Privileged users can power on the system by selecting the System Control Power Menu option from the main menu and then selecting the Power-on System option from the System Power Control Menu. General users should select Power-on System on the General User Menu. v Remote Power-on via Ring-Indicate Signal The server automatically powers on when it detects a ?ring indicate? signal from a modem attached to serial port 1 (S1) or serial port 2 (S2). A remote user can call the server to activate ring detection by the modem. Listen for a few more rings than the threshold number for starting the system. The system powers on without answering the call. v Unattended start mode - refer to Enable/Disable Unattended Start Mode on page 778. The service processor can be enabled to recover from the loss of ac power (see Enable/Disable Unattended Power-On Mode in the SYSTEM POWER CONTROL MENU). When ac power is restored, the system returns to the power state at the time ac loss occurred. For example, if the system was powered on when ac loss occurred, it reboots/restarts when power is restored. If the system was powered off when ac loss occurred, it remains off when power is restored. v Timed power-on - refer to the shutdown -t command on servers using AIX. Working in conjunction with AIX, the Service Processor in your server can operate a timer, much like the wake-up timer on your clock radio. You can set the timer so that your server powers on at a certain time after shutting down. The timer is battery-operated, so power interruptions that occur while the server is off do not affect its accuracy. Refer to the AIX shutdown -t command for details on setting the timer. Note: If an ac power loss (exceeding the hold-up time of any optional IBFs) is in progress when the timed power-on attempt occurs, the server cannot power on when ac power is restored. v Follow-up to a Failed Boot Attempt The service processor initiates a power-on sequence if a failed boot attempt is detected (due to a hardware or software failure). v Fast or Slow Boot (IPL) Using the service processor menus, you can select the IPL type, mode, and speed of your system. Attention: Selecting fast IPL results in several diagnostic tests being skipped and a shorter memory test being run. 802 406/90 Service Guide Service Processor Reboot/Restart Recovery Reboot describes bringing the system hardware back up; for example, from a system reset or power-on. The boot process ends when control passes to the operating system process. Restart describes activating the operating system after the system hardware is reinitialized. Restart must follow a successful reboot. Boot (IPL) Speed When the server enters reboot recovery, slow IPL is automatically started, which gives the POST an opportunity to locate and report any problems that might otherwise be unreported. Failure During Boot Process During the boot process, either initially after system power-on or upon reboot after a system failure, the service processor monitors the boot progress. If progress stops, the service processor can reinitiate the boot process (reboot) if enabled to do so. The service processor can re-attempt this process according to the number of retries selected in the Reboot/Restart Policy Setup Menu. Failure During Normal System Operation When the boot process completes and control transfers to the operating system (OS), the service processor can monitor operating system activity (see the Set Surveillance Parameters option in the SERVICE PROCESSOR SETUP MENU). If OS activity stops due to a hardware- or software-induced failure, the service processor can initiate a reboot/restart process based on the settings in the Service Processor Reboot/Restart Policy Setup Menu and the OS automatic restart settings (see the operating system documentation). If you are using the AIX operating system, the menu item under SMIT for setting the restart policy is Automatically Reboot After Crash. The default is false. When the setting is true, and if the service processor parameter ?Use OS-Defined Restart Policy? is yes (the default), the service processor takes over for AIX to reboot/restart after a hardware or surveillance failure. Service Processor Reboot/Restart Policy Controls The operating system’s automatic restart policy (see operating system documentation) indicates the operating system response to a system crash. The service processor can be instructed to refer to that policy by the Use OS-Defined Restart Policy setup menu. If the operating system has no automatic restart policy, or if it is disabled, then the service processor-restart policy can be controlled from the service processor menus. Use the Enable Supplemental Restart Policy selection. Use OS-Defined restart policy - The default setting is no. If set to yes on a full system partition, this causes the service processor to refer to the OS Automatic Restart Policy setting and take action (the same action the operating system would take if it could have responded to the problem causing the restart). When this setting is no, or if the operating system did not set a policy, the service processor refers to enable supplemental restart policy for its action. This setting is ignored on a partitioned system. Chapter 7. Using the Service Processor 803 Enable supplemental restart policy - The default setting is yes. When set to yes on a full system partition, the service processor restarts the server when the operating system loses control and either: The Use OS-Defined restart policy is set to No. OR The Use OS-Defined restart policy is set to Yes and the operating system has no automatic restart policy. If set to Yes on a partitioned system, the service processor restarts the system when the system loses control and it is detected by service processor surveillance. Refer to “Service Processor Reboot/Restart Recovery” on page 803. The following table describes the relationship among the operating system and service processor restart controls in a full system partition. OS Automatic reboot/restart after crash setting Service processor to use OS-Defined restart policy? Service Processor Enable supplemental restart policy? System response None No1 No None No1 Yes1 Restarts None Yes No None Yes Yes1 Restarts False2 No1 No False2 No1 Yes1 Restarts False2 Yes No False2 Yes Yes1 True No1 No True No1 Yes1 Restarts True Yes No Restarts True Yes Yes1 Restarts 1 Service processor default 2 AIX default In a partitioned system, the service processor’s supplemental restart policy is the only setting that is used, as shown in the following table: Service Processor enable supplemental restart policy System Response No Yes (default) Restarts 804 406/90 Service Guide System Firmware Updates Attention: If the firmware on the system is RH021025 or later, the firmware can be updated to the latest level by the customer. If it is RH021024 or earlier, the firmware must be updated by a service representative. To determine the current level of firmware on the system, see “Determining the Level of Firmware on the System” on page 806. This section provides information and instructions for updating the system firmware. You may need to perform these steps if you are installing an option or if your support representative has requested that you update your firmware. If you cannot download from the Web, do the following: v If the service processor programming has been damaged, the service processor will automatically enter recovery mode when power is applied to the system. To check the level of firmware that is currently on the system, see “Determining the Level of Firmware on the System” on page 806. General Information on System Firmware Updates All the system firmware types that can be reprogrammed are updated at the same time. They are: v System power control network programming v Service processor programming v IPL programming v Run-time abstraction servicesRetain and store the latest firmware diskettes each time the firmware gets updated in the event that the firmware becomes damaged and must be reloaded. Chapter 7. Using the Service Processor 805 Determining the Level of Firmware on the System The firmware level can be denoted in either of the following forms: v RHyymmdd, where RH = the 406/90’s firmware designation, yy = year, mm = month, and dd = day of the release. v vHyymmdd, where v = version number, H= the 406/90’s firmware designation, yy = year, mm = month and dd = day of the release.If your system is running AIX, the platform firmware level can be determined by either of the following methods: v On the AIX command line, by typing: lscfg -vp|grep -p Platform A line that begins with ROM level (alterable).. displays the firmware level that is currently on the system. v Looking at the top of the service processor main menu.If the system is running Linux, the platform firmware level can be determined by any one of the following methods: v If the system was installed prior to May of 2003, and has not had a firmware update in May 2003 or later, type the following on the Linux command line: lscfg -vp | grep RH v If the system was installed in May of 2003 or later, or the system has firmware on it that was released in May 2003 or later, type the following on the Linux command line: lscfg -vp | grep 3H A line similar to Alterable ROM Level RH021114 or Alterable ROM Level 3H030509 displays the firmware level that is currently on the system. v Looking at the top of the service processor menu main menu. 806 406/90 Service Guide System Firmware Update in AIX Using a Locally Available Image To update the system firmware using a locally available image, perform the following steps: 1. Log in as root user. 2. If the /tmp/fwupdate directory does not exist, create it by issuing the following command: mkdir /tmp/fwupdate 3. The firmware update file must be downloaded or copied into the /tmp/fwupdate directory on the system. This can be done by using the ftp command to get the image from an ftp server or NFS-mounting the directory on the host system. The firmware updated file can also be transferred to the target system by backing up the image onto diskettes from another server and restoring it into the /tmp/fwupdate directory. After the firmware update file has been downloaded or copied into the /tmp/fwupdate directory, verify its existence by entering either of the following commands, depending on the name of the update image: ls /tmp/fwupdate/RH*.img or ls /tmp/fwupdate/3H*.img The update file will have either the RHyymmdd.img format or the 3Hyymmdd.img format. In both cases, the Hin the second position indicates that this is an update image for your system; yy is the year, mm is the month, and dd is the day of the update file. 4. After the update file has been written to the /tmp/fwupdate directory, enter the following commands: cd /usr/lpp/diagnostics/bin then either of these commands, depending on the name of the update file: ./update_flash -f /tmp/fwupdate/RHyymmdd.img or ./update_flash -f /tmp/fwupdate/3Hyymmdd.img Attention: Do not overlook the periods (.) in the commands shown above. AIX commands are case-sensitive. Type them exactly as shown.You will be asked for confirmation to proceed with the firmware update and the required reboot. If you confirm, the system will apply the new firmware, reboot, and return to the AIX prompt. This may take up to ten minutes depending on the configuration of the system. Attention: On some systems, the message Wait for rebooting before stopping may display on the system console. Do not turn off the system until the system has fully rebooted to the AIX login prompt. If a shutdown is necessary at that time, log in as root user and issue the shutdown command. While the update is in progress, you will see Rebooting... on the display for several minutes. The firmware update is complete. Chapter 7. Using the Service Processor 807 Recovery Mode Contact your marketing representative for information about obtaining and installing system firmware update packages. The eight-digit codes are as follows: Code Action A1FD 0000 System firmware has been damaged and must be reflashed. A1FD 0001 Insert update diskette 1. A1FD 0002 Insert update diskette 2. A1FD 0003 Insert update diskette 3. A1FD 000n Insert update diskette n. Notes: 1. If the wrong diskette is inserted at any time, or if the diskette is left in the drive after it has been read, B1FD 001F is displayed, indicating that the wrong diskette is in the drive. 2. If B1FD 001A is displayed at any time during the process, the service processor must be reset by activating the pinhole reset switch on the operator panel. After the update process is complete, the service processor resets itself and goes through a complete reboot. 808 406/90 Service Guide Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed When a 45H/W4 Switch is installed in or attached to the system, the frame firmware, which includes the power subsystem, is updated using a task on the HMC. Contact your marketing representative for code updates (also called corrective service). This task downloads a corrective service for the frame onto the HMC.Note: Because the HMC is a closed system, you cannot install additional applications on your HMC. All of the tasks you need to maintain the managed system, the underlying operating system, and the HMC application code are available by using the HMC’s management applications. To download corrective service on the HMC, do the following: 1. From the HMC interface, in the Navigation area (left side of the panel), open the Software Maintenance folder. 2. In the Contents area (right side of the panel), double-click the Frame icon. The Frame application opens in the Contents area. 3. In the Contents area, click Receive Corrective Service task. 4. Select one of the following by clicking on the circle to the left. v Upload corrective service from diskette. If you select this option, make sure the correct diskette is in the HMC diskette drive. v Download the corrective service file from a remote system. If you select this option, complete the Remote Site (host name), Patch file (file name or remote system), User ID (for remote system), and password (for remote system).5. Click OK to copy the corrective service to this HMC. Chapter 7. Using the Service Processor 809 Installing Corrective Service on the Frame This task allows you to update the level of firmware on the frame after you have downloaded a corrective service. This procedure should be run after any of the following components are replaced: v Bulk Power Regulator (BPR) v Bulk Power Controller (BPC) v Distributed Converter Assembly (DCA)Notes: 1. To install a corrective service on a frame, you must be a system administrator or a service representative. For information about system administrator or service representative roles, refer to the Hardware Management Console Operations Guide. 2. Do not power off any of the components in the frame at any time during this installation procedure. Interruptions can leave the power subsystem, or one of the other components in the frame, in an unrecoverable state.To install a corrective service, do the following: 1. If you have not installed the corrective fix from ftp or diskette onto this HMC, perform steps to receive frame corrective service on the HMC, as described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809. Then go to Step 4. 2. From the HMC interface, in the Navigation area (left side of the panel), open the Software Maintenance folder. 3. In the Contents area (right side of the panel), double-click the Frame icon. The Frame application opens in the Contents area. 4. In the Contents area, click Install Corrective Service. The Install Corrective Service window opens. 5. Select the Corrective Service Version, and select the frame where the service will be applied.Note: Unless you are directed otherwise, select the highest version number for the most recent fix. 6. Click Install. The installation may take up to an hour, depending on the number of parts in the frame that require an update. When installation is complete, a window opens to indicate installation status.Note: If you have replaced a single frame part and are using this procedure to update it, this process normally takes only two to five minutes. 7. If the installation fails, click Reason for Failure. a. If you are able to correct the problems(s), click Cancel and go to Step 6. b. If you are given a service request code (SRC), perform the steps to service the SRC, then return to this procedure. c. Otherwise, contact the next level of support.8. After the corrective service has been applied successfully, click Cancel to return. 810 406/90 Service Guide Reconfiguration Procedure for SNI Adapters After the system has booted to the AIX login prompt, log in and verify that the SNI adapters are configured properly. On the AIX command line, issue the following command: lscfg | grep sni The number of sni adapters identified in the output of the above command (represented by their logical device names) should match the number in the system (2-8). If the number of adapters does not equal the number of physical SNI ports in the system, issue the following commands at the AIX command line, where snix represents each SNI port. Run the following two commands for every sni port in the list generated by running the lscfg command above. ifconfig -d snix down detach rmdev -d -l snix After all sni ports have been deconfigured, run the following command: cfgmgr -v Then reboot the system and verify the presence of two SNI ports. Chapter 7. Using the Service Processor 811 Configuring and Deconfiguring Processors or Memory All failures that crash the system with a machine check or check stop, even if intermittent, are reported as a diagnostic callout for service repair. To prevent the recurrence of intermittent problems and improve the availability of the system until a scheduled maintenance window, processors and memory books with a failure history are marked ?bad? to prevent their being configured on subsequent boots. A processor or memory book is marked ?bad? under the following circumstances: v A processor or memory book fails built-in self-test (BIST) or power-on self-test (POST) testing during boot (as determined by the service processor). v A processor or memory book causes a machine check or check stop during runtime, and the failure can be isolated specifically to that processor or memory book (as determined by the processor runtime diagnostics in the service processor). v A processor or memory book reaches a threshold of recovered failures that results in a predictive callout (as determined by the processor run-time diagnostics in the service processor).During boot time, the service processor does not configure processors or memory books that are marked “bad.” If a processor or memory book is deconfigured, the processor or memory book remains offline for subsequent reboots until it is replaced or repeat gard is disabled. The repeat gard function also provides the user with the option of manually deconfiguring a processor or memory book, or re-enabling a previously deconfigured processor or memory book. For information on configuring or deconfiguring a processor, see the Processor Configuration/Deconfiguration Menu on page 782. For information on configuring or deconfiguring a memory book, see the Memory Configuration/Deconfiguration Menu on page 783. Both of these menus are submenus under the System Information Menu. You can enable or disable CPU Repeat Gard or Memory Repeat Gard using the Processor Configuration/Deconfiguration Menu. Run-Time CPU Deconfiguration (CPU Gard) L1 instruction cache recoverable errors, L1 data cache correctable errors, and L2 cache correctable errors are monitored by the processor runtime diagnostics (PRD) firmware running in the service processor. When a predefined error threshold is met, an error log with warning severity and threshold exceeded status is returned to AIX. At the same time, PRD marks the CPU for deconfiguration at the next boot. AIX will attempt to migrate all resources associated with that processor to another processor and then stop the defective processor. 812 406/90 Service Guide Service Processor System Monitoring - Surveillance Surveillance is a function in which the service processor monitors the system, and the system monitors the service processor. This monitoring is accomplished by periodic samplings called heartbeats. Surveillance is available during two phases: v System firmware bringup (automatic) v Operating system runtime (optional)Note: Operating system surveillance is disabled on partitioned systems. System Firmware Surveillance System firmware surveillance is automatically enabled during system power-on. It cannot be disabled by the user, and the surveillance interval and surveillance delay cannot be changed by the user. If the service processor detects no heartbeats during system IPL (for a set period of time), it cycles the system power to attempt a reboot. The maximum number of retries is set from the service processor menus. If the fail condition persists, the service processor leaves the machine powered on, logs an error, and displays menus to the user. If Call-out is enabled, the service processor calls to report the failure and displays the operating-system surveillance failure code on the operator panel. Chapter 7. Using the Service Processor 813 Operating System Surveillance Note: Operating system surveillance is disabled in partitioned systems. Operating system surveillance provides the service processor with a means to detect hang conditions, as well as hardware or software failures, while the operating system is running. It also provides the operating system with a means to detect a service processor failure caused by the lack of a return heartbeat. Operating system surveillance is not enabled by default, allowing you to run operating systems that do not support this service processor option. You can also use service processor menus and AIX service aids to enable or disable operating system surveillance. For operating system surveillance to work correctly, you must set these parameters: v Surveillance enable/disable v Surveillance interval The maximum time the service processor should wait for a heartbeat from the operating system before timeout. v Surveillance delay The length of time to wait from the time the operating system is started to when the first heartbeat is expected.Surveillance does not take effect until the next time the operating system is started after the parameters have been set. If desired, you can initiate surveillance mode immediately from service aids. In addition to the three options above, a fourth option allows you to select immediate surveillance, and rebooting of the system is not necessarily required. If operating system surveillance is enabled (and system firmware has passed control to the operating system), and the service processor does not detect any heartbeats from the operating system, the service processor assumes the system is hung and takes action according to the reboot/restart policy settings. See “Service Processor Reboot/Restart Recovery” on page 803. If surveillance is selected from the service processor menus that are only available at system boot, then surveillance is enabled by default as soon as the system boots. From service aids, the selection is optional. 814 406/90 Service Guide Call-Out (Call-Home) Note: Call-out is disabled on partitioned systems. The call-out function is handled by the Service Focal Point application on the Hardware Management Console (HMC). The service processor can call out (call-home) when it detects one of the following conditions: v System firmware surveillance failure v Operating system surveillance failure (if supported by operating system) v Restarts v Critical hardware failure v Abnormal operating system terminationTo enable the call-out feature, do the following: 1. Connect a modem to any serial port. 2. Set up the following using the service processor menus or diagnostic service aids: v Enable call-out for the serial port where the modem is connected. v Enter the modem configuration file name. v Set up site-specific parameters (phone numbers for call-out, call-out policy, number of call-out retries, and so on).3. To call out before restart, set Call-out before restart to ENABLED from the Reboot/Restart Policy Setup menu.Note: Some modems, are not designed for the paging function. Although they can be used for paging, they will return an error message when they do not get the expected response from another modem. Therefore, even though the paging was successful, the error message will cause the service processor to retry, continuing to place pager calls for the number of retries specified in the call-out policy setup menu. These retries result in redundant pages. Chapter 7. Using the Service Processor 815 Console Mirroring Console mirroring allows a user on one serial port to monitor the service processor activities on another serial port. This can be done on the locally attached HMC virtual terminal window or windows remotely attached through modems. Console mirroring is supported on any combination of locally or remotely attached monitors. The serial port from which console mirroring is enabled is called the primary port. The mirror port is determined when keyboard input is detected from one of the other two serial ports. From this point forward, the service processor sends information to the active port and the mirror port only. Console mirroring ends when the service processor releases control of the serial ports to the system firmware. Console mirroring is supported on serial port 1 (S1), serial port 2 (S2), and serial port 3 (S3). Remote attachment through a modem is supported on serial port 1 (S1) and serial port 2 (S2). System Configuration The following describes the configuration for console mirroring: v Service processor v Modem connected to one serial port and enabled for incoming calls v Local HMC virtual terminal window connected to the other serial port. This local terminal can be connected directly to your server or connected through another modem.Console mirroring can be started by either of the following methods: v Remote session first, then local session added: 1. Remote session is already in progress. 2. Remote user uses service processor menus to enable console mirroring, allowing both consoles to be active.v Local session first, then remote session added: 1. Local session is already in progress. 2. The service processor receives a call from the remote user. 3. The local user selects the option to enable console mirroring. The service processor immediately begins mirroring service processor menus. 816 406/90 Service Guide Service Processor Error Logs The service processor error logs, an example of which follows, contain error conditions detected by the service processor. Error Log 1. 11/30/99 19:41:56 Service Processor Firmware Failure B1004999 Enter error number for more details. Press Return to continue, or ’x’ to return to menu. Press "C" to clear error log, any other key to continue. > Note: The time stamp in this error log is coordinated universal time (UTC), which is also referred to as Greenwich mean time (GMT). Operating system error logs have additional information available and can time stamp with local time. Entering an error number provides nine words of system reference code (SRC) data; an example screen is shown below. Detail: 6005 SRC - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - word11:B1004999 word12:0110005D word13:00000000 word14:00000000 word15:00001111 word16:00000000 word17:B1004AAA word18:0114005D word19:A4F1E909 B1004999 Press Return to continue, or ’x’ to return to menu. If Return is pressed, the contents of NVRAM will be dumped 320 bytes at a time, starting at address 0000. Chapter 7. Using the Service Processor 817 LCD Progress Indicator Log The following is an example of the LCD progress indicator log. It shows the types of entries that may appear in the log, and is for example purposes only. The progress indicator codes are listed from top (latest) to bottom (oldest). LCD Progress Indicator Log B0FF 0539..17 0538..17 0539..17 0538..17 0539..17 0581 0538..17 0539..12 0538..12 0539.. 0821..01-K1-00 0539.. 0728..01-R1-00-00 0539.. 0664..40-60-00-1,0 0539.. 0777..U1.5-P1-I14/E1 0539.. 0742..U1.5-P1-I12/E1 0539.. 0776..U1.5-P1-I10/T1 E139 E1FB E139 E183 Press Return to continue, or ’x’ to return to menu. > EAA1..U1.9-P1-I14 E172..U1.9-P1 E172..U1.9-P1-I14 E172..U1.9-P1 94BB 9109 9380 9108 9107 9106 9105 9118 9104 9103 9102 90FD 818 406/90 Service Guide Resetting the Service Processor Some error conditions might cause the service processor to hang. The service processor must be reset to recover from a hung condition. If the system is powered on, resetting the service processor will cause the system to shut down. When the service processor is reset, it carries out its power-on sequence, including self-tests. Successful completion of the reset sequence is indicated by OK in the operator panel in the media drawer. Before resetting the service processor, if the managed system is powered on and running, shut down all partitions, or the full system partition. This action causes the system to shut down and the system power is turned off. To reset the service processor when the managed system is powered off, perform either of the following: v If the service processor is responding to input from the HMC (or ASCII terminal), select Service Processor Setup Menu from the main menu, then select Reset Service Processor. This can only be done by a privileged user. v Put the UEPO switch in the off position, then back to the on position.Note: Do not use the pinhole reset switch to reset the service processor. Chapter 7. Using the Service Processor 819 Service Processor Operational Phases This section provides a high-level flow of the phases of the service processor. SPPowerAppliedPre-StandbyPhaseStandbyPhaseSPMenusAvailableBring-UpPhaseSMSMenusAvailableRun-timePhaseOperatingSystemLoginPromptAvailable Pre-Standby Phase This phase is entered when the server is connected to a power source. The server may or may not be fully powered on. This phase is exited when the power-on self-tests (POSTs) and configurations tasks are completed. The pre-standby phase components are: v Service Processor Initialization - service processor performs any necessary hardware and software initialization. v Service Processor POST - service processor conducts power-on self-tests on its various work and code areas. v Service Processor Unattended Start Mode Checks - To assist fault recovery. If unattended start mode is set, the service processor automatically reboots the server. The service processor does not wait for user input or power-on command, but moves through the phase and into the bring-up phase. Access the SMS menus or the service processor menus to reset the unattended start mode. 820 406/90 Service Guide Standby Phase The standby phase can be reached in either of two ways: v With the server off and power connected (the normal path), recognized by OK in the LCD display. OR v With the server on after an operating system fault, recognized by an 8-digit code in the LCD display.In the standby phase, the service processor takes care of some automatic duties and is available for menus operation. The service processor remains in the standby phase until a power-on request is detected. The standby phase components are as follows: v Modem Configuration The service processor configures the modem (if installed) so that incoming calls can be received, or outgoing calls can be placed. v Dial In Monitor incoming phone line to answer calls, prompt for a password, verify the password, and remotely display the standby menu. The remote session can be mirrored on the local HMC virtual terminal window if the server is so equipped and if the user enables this function. v Menus The service processor menus are password-protected. Before you can access them, you need either the general user-password or privileged-user password. Service processor menus are available on ASCII terminals attached to the serial ports on the primary I/O book, and on terminal emulators. On HMC-managed systems, service processor menus are also available on the HMC graphical user interface.Bring-Up Phase On a system that is powered-on to full system partition, this phase is entered upon power-on, and exited upon loading of the operating system. On a system that is powered-on to partition standby, this phase is entered upon power-on, and exited when the partition manager and hypervisor have been loaded and become fully operational in system memory. The end of this phase is indicated when LPAR displays on the operator panel. At this point, the server can start multiple logical partitions which would be activated through the HMC graphical user interface. The bring-up phase components are as follows: v Retry Request Check The service processor checks to see if the previous boot attempt failed. If the specified number of failures are detected, the service processor displays an error code and places an outgoing call to notify an external party if the user has enabled this option. v Dial Out The service processor can dial a preprogrammed telephone number in the event of an IPL failure. The service processor issues an error report with the last reported IPL status indicated and any other available error information. v Update Operator Panel The service processor displays operator panel data on the HMC virtual terminal window if a remote connection is active. v Environmental Monitoring The service processor provides expanded error recording and reporting. v System Firmware Surveillance (Heartbeat Monitoring) Chapter 7. Using the Service Processor 821 The service processor monitors and times the interval between system firmware heartbeats. v Responding to System Processor Commands The service processor responds to any command issued by the system processor.Run-Time Phase This phase includes the tasks that the service processor performs during steady-state execution of the operating system. v Environmental Monitoring The service processor monitors voltages, temperatures, and fan speeds (on some servers). v Responding to System Processor Commands The service processor responds to any command issued by the system processor. v Run-Time Surveillance (not supported on partitioned systems) If the device driver is installed and surveillance enabled, the service processor monitors the system heartbeat. If the heartbeat times out, the service processor places an outgoing call. This is different from the bring-up phase scenario, where the specified number of reboot attempts are made before placing an outgoing call. v HMC surveillance On an HMC-managed system, the service processor monitors the communication link between the managed system and the HMC. If the service processor detects that this communication link has been broken, it will post an error to the operating system running on the managed system. 822 406/90 Service Guide Chapter 8. Using System Management Services Use the system management services menus to view information about your system or partition, and to perform tasks such as setting a password, changing the boot list, and setting the network parameters. Notes: 1. On some of the system management services (or service processor) screens, you will see the term LPAR. LPAR is equivalent to the term logically partitioned system or partitioned system. 2. In a partitioned system, only those devices that are assigned to the partition that is being booted display in the SMS menus. In a full system partition, all devices in the system display in the SMS menus. To start the system management services, do the following: 1. For a partitioned system, use the Hardware Management Console (HMC) to restart the partition. For a full system partition, restart the system. 2. For a partitioned system, watch the virtual terminal window on the HMC. For a full system partition, watch the firmware console. 3. Look for the POST indicators memory, keyboard, network, scsi, speaker, which appear across the bottom of the screen. Press the numeric 1 key after the word keyboard appears, and before the word speaker appears. For more information on the POST indicator words, refer to “POST Indicators” on page 12.After the system management services starts, the following screen displays: Main Menu 1 Select Language 2 Change Password Options NOT available in LPAR mode 3 View Error Log 4 Setup Remote IPL (Initial Program Load) 5 Change SCSI Settings 6 Select Console NOT available in LPAR mode 7 Select Boot Options -------------------------------------------------------------------------------------------------- Navigation keys: X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Note: The system management services can also be started using the Service Processor Boot Mode Menu. See page 780. 823 On all menus except the Main Menu, there are several navigation keys: M Return to the main menu. ESC Return to the previous menu. X Exit the system management services and start the operating system. If X is entered, you are asked to confirm your choice to exit the SMS menus and start the operating system. When there is more than one page of information to display, there are two additional navigation keys: N Display the next page of the list. P Display the previous page of the list.Note: The lowercase navigation key has the same effect as the uppercase key that is shown on the screen. For example, m or M returns you to the main menu. On each menu screen, you are given the option of choosing a menu item and pressing Enter (if applicable), or selecting a navigation key. Select Language Note: Your TTY must support the ISO-8859 character set to properly display languages other than English. This option allows you to change the language used by the text-based System Management Services menus. SELECT LANGUAGE 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ 824 406/90 Service Guide Change Password Options The Change Password Options menu enables you to select from password utilities. Note: This option is not available in partitioned systems. Password Utilities 1 Set Privileged-Access Password 2 Remove Privileged-Access Password -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The privileged-access password protects against the unauthorized starting of the system programs. Note: If the privileged-access password has been enabled, you are asked for the privileged-access password at startup every time you boot your system. If you previously had set a privileged-access password and want to remove it, select Remove Privileged-Access Password. View Error Log Use this option to view or clear your system’s error log. A menu similar to the following displays when you select this option. Error Log Date Time Error Code Location Entry 1. 01/04/96 12:13:22 25A80011 00-00 Entry 2. no error logged 1. Clear error log -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Note: The time stamp in his error log is coordinated universal time (UTC), which is also referred to as Greenwich mean time (GMT). The operating system error logs have more information available and can time stamp with your local time. Chapter 8. Using System Management Services 825 Setup Remote IPL (Initial Program Load) This option allows you to enable and set up the remote startup capability of your system or partition. A list of NIC (network interface card) adapters in the system displays first. An example of this screen follows: Attention: In a partitioned system, only those network adapters that have been assigned to the partition being booted display in the IP Parameters menu. In a full system partition, all network adapters in the system are listed in the adapter parameters menu.NIC Adapters Device Slot Hardware Address 1. 10/100 Mbps Ethernet PCI Adapt U1.9-P1-I4/E1 00096baeac10 2. IBM 10/100/1000 Base-TX PCI U1.5-P2-I3/T1 0009cbce0fde -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When an adapter is selected, the Network Parameters menu displays: Network Parameters 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. IP Parameters 2. Adapter Parameters 3. Ping Test -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Selecting the IP (Internet Protocol) parameters option displays the following menu: IP Parameters 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. Client IP Address [000.000.000.000] 2. Server IP Address [000.000.000.000] 3. Gateway IP Address [000.000.000.000] 4. Subnet Mask [255.255.255.000] -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ To change IP parameters, type the number of the parameters for which you want to change the value. Entering IP parameters on this screen will automatically update the parameters on the ping test screen. 826 406/90 Service Guide Attention: If the client system and the server are on the same subnet, set the gateway IP address to [0.0.0.0]. Selecting Adapter Configuration allows the setting of the network speed, enabling or disabling spanning tree, and setting the protocol as shown in the following example menu: Adapter Configuration 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. Speed, Duplex 2. Spanning Tree Enabled 3. Protocol -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Selecting the Speed, Duplex option allows you to set the interface speed at which the card will run and half versus full duplex. The current setting is indicated by ?<===?. Adapter Configuration 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. auto, auto 2. 10,half 3. 10,full 4. 100,half 5. 100,full -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Chapter 8. Using System Management Services 827 Selecting the Spanning Tree Enabled menu allows you to enable or disable the spanning tree flag. If this flag is enabled (because the network the system is being attached to supports spanning trees), the firmware will impose a waiting period of 60 seconds before the adapter is allowed to communicate with the network. If this flag is disabled, the network adapter will be able to access the network immediately after the system is connected. The Protocol option allows you to set the appropriate protocol for your network as shown below. Protocol 10/100 Mbps Ethernet PCI Adapter II: U1.9-P1-I4/E1 1. Standard <=== 2. IEEE802.3 -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select the ping test option from the network parameters menu to test an adapter’s network connection to a remote system. After the ping test option is selected, the same series of screens will take you through setting up the IP parameters and the adapter configuration before attempting the ping test. Notes: 1. Once the ping test is initiated, it may take 60 seconds or longer to return a result. 2. If the ping test passes or fails, the firmware will stop and wait for a key to be pressed before continuing.Change SCSI Settings This option allows you to view and change the addresses of the SCSI controllers attached to your system. SCSI Utilities 1. Hardware Spin Up Delay 2. Change SCSI Id -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select Console Note: This option is not available on partitioned systems. The Select Console utility allows the user to select which console to use to display the SMS menus. This selection is only for the SMS menus and does not affect the display used by the operating system. Follow the instructions that display on the screen. The firmware automatically returns to the SMS main menu. 828 406/90 Service Guide Select Boot Options Use this menu to view and set various options regarding the installation devices and boot devices. 1. Select Install or Boot a Device 2. Select Boot Devices 3. Multiboot Startup -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select Install or Boot a Device Allows you to select a device from which to boot or install the operating system. This option is for the current boot only. Select Boot Devices Allows you to set the boot list. Multiboot Startup Toggles the multiboot startup flag, which controls whether the multiboot menu is invoked automatically on startup. If Select Install or Boot a Device is selected, the following menu displays: Select Device Type 1. Diskette 2. Tape 3. CD/DVD 4. IDE 5. Hard Drive 6. Network 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If a device is selected that is not in the system, a menu with the following message displays: .---------------------------------------------------------. | THE SELECTED DEVICES WERE NOT DETECTED IN THE SYSTEM ! | | Press any key to continue. | `---------------------------------------------------------’ Chapter 8. Using System Management Services 829 If Hard Drive is selected, the following menu displays: Select Hard Drive Type 1. SCSI 2. SSA 3. SAN 4. None 5. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If List All Devices is selected, a menu similar to the following displays, depending on the devices that are installed in the system: Select Device Device Current Device Number Position Name 1. 1 SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 2. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A9,0) 3. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-Aa,0) 4. None 5. List all devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The appropriate device can then be selected for this boot or installation. When a device is selected for installing the operating system, or to boot from, the Select Task menu allows you to get more information about the device, or to boot from that device in normal mode or service mode. The following is an example of this menu. Select Task SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 1. Information 2. Normal Mode Boot 3. Service Mode Boot -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If either Normal Mode Boot or Service Mode Boot is selected, the next screen will ask, Are you sure?. If you answer yes, the device will be booted in the appropriate mode. If you answer no, the firmware will return to the Select Task menu. 830 406/90 Service Guide Select Boot Devices Attention: In a partitioned system, only those devices from which an operating system can be booted that are assigned to the partition that is being booted display on the select boot devices menu. In a full system partition, devices from which an operating system can be booted display on the Select Boot Devices menu.Note: To keep the search time for bootable devices down, these menus follow a hierarchy of: device type -> bus type -> adapter -> devices attached to the adapter To view all of the potentially bootable devices at one time rather than traversing down the hierarchy with the submenus, go to the ?Select Device Type? menu or the ?Select Media Type? menu and select the last option, ?List All Devices?. The ?List All Devices? function may take a long time on a large system with many I/O adapters and devices, such as large disk arrays. Select this option to view and change the customized boot list, which is the sequence of devices read at startup. Configure Boot Device Order 1. Select 1st Boot Device 2. Select 2nd Boot Device 3. Select 3rd Boot Device 4. Select 4th Boot Device 5. Select 5th Boot Device 6. Display Current Setting 7. Restore Default Setting -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When any of the options 1-5 is selected, the Select Device Type screen will be displayed, which is similar to the following: Select Device Type 1. Diskette 2. Tape 3. CD/DVD 4. IDE 5. Hard Drive 6. Network 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When a device type is selected, such as item 5, a Select Media Type menu is displayed. The following is an example of that menu. Chapter 8. Using System Management Services 831 Select Media Type 1. SCSI 2. SSA 3. SAN 4. IDE 5. ISA 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When the media type is selected, all adapters of that type are displayed on the Select Media Adapter menu. The following is an example of that menu for a SCSI media type. Select Media Adapter 1. U1.9-P1/Z1 2. U1.9-P1-l1/Z1 3. U1.9-P1-l1/Z2 4. None 5. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Each adapter must then be selected individually to see the devices tat are attached to it. An example of this menu for the first adapter in the previous example is as follows: Select Device Device Current Device Number Position Name 1. 1 SCSI 18 GB Harddisk (loc=U1.9-P1/Z1-A8,0) 2. - SCSI 18 GB Harddisk (loc=U0.9-P1/Z1-A9,0) 3. None -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If there are no devices of the type chosen earlier (on the Select Device Type menu) attached to the adapter that is specified, a message similar to the following displays: -------------------------------------------------------------------------------------------------- | THE SELECTED DEVICES WERE NOT DETECTED IN THE SYSTEM | | Press any key to continue. | -------------------------------------------------------------------------------------------------- When a device type is selected, the Select Task menu allows you to see detailed information about the device or set the device’s location in the boot list as shown below in the example menu. The following is an example of the menu for a hard disk. 832 406/90 Service Guide Select Task SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 1. Information 2. Set Boot Sequence: Configure as 1st Boot Device -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Selecting Information displays a menu similar to the following for a hard disk. Device Information /pci@3fffdf0a000/pci@2,4/scsi@1/sd@8,0 : (Integrated) (Bootable) DEVICE : SCSI 18200 MB Harddisk ( loc=U0.1-P2/Z1-A8,0) NAME : sd DEVICE-TYPE : block -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The Set Boot Sequence option allows you to set the location of the device in the boot list. Display Current Settings This option displays the current setting of the customized boot list. An example of this menu, with one device in the boot list, follows. Current Boot Sequence 1. SCSI 18200 MB Harddisk (loc=U0.1-P1/Z1-A8,0) 2. None 3. None 4. None 5. None -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Chapter 8. Using System Management Services 833 Restore Default Settings This option restores the boot list to the default boot list. The default boot list will vary depending on the devices that are installed in the system. The default boot list is as follows: 1. Primary diskette drive (if installed) 2. CD-ROM drive (if installed) 3. Tape drive (if installed) 4. Hard disk drive (if installed) 5. Network adapterMultiboot Startup Multiboot Startup toggles the multiboot startup flag, which controls whether the multiboot menu is invoked automatically on startup. Exiting System Management Services After you have finished using the system management services, type x (for exit) to boot your system or partition. 834 406/90 Service Guide Chapter 9. Removal and Replacement Procedures Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .837 Handling Static-Sensitive Devices . . . . . . . . . . . . . . . . . . . . . . . . .837 Powering Off and Powering On the System . . . . . . . . . . . . . . . . . . . . . .838 Powering Off the System With an HMC Attached and AIX Installed . . . . . . . . . . . . .838 Powering Off the System With an HMC Attached and Linux Installed . . . . . . . . . . . .840 Powering the System On . . . . . . . . . . . . . . . . . . . . . . . . . . . .841 Powering On the System Using the Service Processor . . . . . . . . . . . . . . . . .841 Hot-Pluggable FRUs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .842 Power Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Bulk Power Assembly (BPA) . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Bulk Power Regulator (BPR) . . . . . . . . . . . . . . . . . . . . . . . . . . .844 Bulk Power Controller (BPC) . . . . . . . . . . . . . . . . . . . . . . . . . . .845 Bulk Power Distributor (BPD) . . . . . . . . . . . . . . . . . . . . . . . . . . .847 Bulk Power Fan (BPF) . . . . . . . . . . . . . . . . . . . . . . . . . . . . .848 Bulk Power Enclosure (BPE) . . . . . . . . . . . . . . . . . . . . . . . . . . .849 Unit Emergency Power Off (UEPO) Card Assembly . . . . . . . . . . . . . . . . . .850 Integrated Battery Feature (IBF) . . . . . . . . . . . . . . . . . . . . . . . . .851 Processor Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .853 Installing the MCM/L3/Passthrough Plug Map . . . . . . . . . . . . . . . . . . . . .853 I/O Books and I/O Blanks . . . . . . . . . . . . . . . . . . . . . . . . . . . .854 Battery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .859 Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks . . . . . .861 Capacitor Books and Blanks . . . . . . . . . . . . . . . . . . . . . . . . . . .862 MCM Module (Processor) and Pass Through Modules . . . . . . . . . . . . . . . . .863 L3 Cache Modules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .882 Capacity Upgrade on Demand (CUoD) VPD Module . . . . . . . . . . . . . . . . . .902 VPD Card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .904 System Clock Card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .905 Aluminum Backplane Stiffener . . . . . . . . . . . . . . . . . . . . . . . . . .906 Processor Subsystem Chassis . . . . . . . . . . . . . . . . . . . . . . . . . .908 Memory Books and Memory Blanks . . . . . . . . . . . . . . . . . . . . . . . .910 Fan Controller . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .911 Processor Fan Assemblies . . . . . . . . . . . . . . . . . . . . . . . . . . .912 Fan Chassis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .912 Processor Subsystem Air Filters . . . . . . . . . . . . . . . . . . . . . . . . .912 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .913 I/O Subsystem PCI Slot LED Definitions . . . . . . . . . . . . . . . . . . . . . .913 PCI Adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .914 Replacing a Double-Wide Blind-Swap Adapter . . . . . . . . . . . . . . . . . . . .985 I/O Subsystem RIO Riser Card . . . . . . . . . . . . . . . . . . . . . . . . . .986 I/O Subsystem Distributed Converter Assembly (DCA) . . . . . . . . . . . . . . . . .987 I/O Subsystem I/O Backplane Assembly . . . . . . . . . . . . . . . . . . . . . . .989 I/O Subsystem DASD Hard Disk Drive Assembly . . . . . . . . . . . . . . . . . . .991 I/O Subsystem DASD 4-Pack . . . . . . . . . . . . . . . . . . . . . . . . . .993 I/O Subsystem Fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .994 I/O Subsystem Chassis . . . . . . . . . . . . . . . . . . . . . . . . . . . . .995 Media Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .998 CD-ROM, DVD-RAM, or Tape Drive . . . . . . . . . . . . . . . . . . . . . . . .998 Diskette Drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .999 Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1000 Media Subsystem Chassis . . . . . . . . . . . . . . . . . . . . . . . . . . .1001 Media Subsystem Internal Cables . . . . . . . . . . . . . . . . . . . . . . . .1002 835 p690 Cable Installation and Management . . . . . . . . . . . . . . . . . . . . . .1003 836 406/90 Service Guide Introduction Before performing any of the removal or replacement procedures in this chapter, read the following notices. DANGERAn electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines.D06 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock.C01 CAUTION: This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect two power supply cords before servicing.C21 Handling Static-Sensitive Devices Attention: Disk drives, tape drives, and CD-ROM drives are sensitive to static electricity discharge. These devices are shipped in antistatic containers to prevent damage caused by electrostatic discharge. Take the following precautions: v If you have an antistatic wrist strap available, use it while handling the device. v Do not remove the device from the antistatic container until you are ready to install the device in the system unit. v With the device still in its antistatic container, touch it to a metal frame of the system. v Grasp cards and boards by the edges. Hold drives by the frame. Avoid touching the solder joints or pins. v If you need to lay the device down while it is out of the antistatic container, lay it on the antistatic container. Before picking it up again, touch the antistatic container and the metal frame of the system unit at the same time. v To prevent permanent damage, handle the devices carefully. Chapter 9. Removal and Replacement Procedures 837 Powering Off and Powering On the System This section provides procedures for powering off and powering on the system. Powering Off the System With an HMC Attached and AIX Installed The HMC user interface provides a power-off function to turn off the power to the system. If the system is logically partitioned, AIX in each partition may be stopped individually to make repairs. If the system power must be turned off, all of the logical partitions must be stopped. AIX provides hot-swap procedures for adapters and devices that support hot-swap removal and installation. You do not have to shutdown a partition that controls an adapter or device that supports hot-swap. Individual power components and fans may be serviced with power on for power systems equipped with redundant power and cooling. Check removal and installation procedures before servicing the system. As the system goes off, the power LED on the operator panel starts blinking at a fast rate. B0FF appears in the primary operator panel display. When the power-off sequence is complete, the system goes into standby mode, as evidenced by the following: v OK displays in the operator panel display. v The operator-panel LED will start blinking at a slow rate. v The processor subsystem power LED will start blinking at a slow rate.To completely remove power from a subsystem in the frame, use one of the following procedures based on the presence or absence of the High Performance Switch (7045-SW4): v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power to a subsystem must be removed by turning off the UEPO switch and turning off the circuit breakers on all optional IBF subsystems (if provided). v If a High Performance Switch (7045-SW4) is installed in the processor subsystem frame, input power to a subsystem may be selectively removed from the processor subsystem, an I/O subsystem, or switch subsystem using the HMC as described below: The HMC user interface provides a power-off function to turn off the power to an individual subsystem. For information about shutting down and powering off an individual subsystem, see ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide. Attention: If a High Performance Switch is present in the frame, using the UEPO switch to remove power from a subsystem being serviced will remove power from ALL of the subsystems that are receiving power from the frame. This may result in unintended outages and extended down time. If you are servicing an individual I/O subsystem, refer to the information about deactivating a system’s I/O drawers, found in ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from both of the I/O subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are using the resources of the I/O subsystem being serviced. After deactivating the I/O subsystem, both the Power-In LED and the Power-Out LED on the front of the I/O subsystem DCAs should be off. If not, call service support. If you are servicing the processor subsystem, refer to the information about deactivating a managed system’s service processor found in ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from the processor subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are running on the processor subsystem being serviced. After deactivating the processor subsystem, both the Power-In LED and the Power-Out LED on the front of the processor subsystem DCA should be off. If not, do not continue. Call service support. v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power must be removed by turning off the UEPO switch, turning off the circuit breakers on all optional IBF subsystems (if provided), and then disconnecting both power cords from the customer’s receptacle. 838 406/90 Service Guide If the system is operating in a full system partition under AIX, typing the shutdown command causes the system to shut down and power off. The -F flag shuts down the system without warning system users. Check with the system administrator before using this command. If you cannot use this method, you can power off the system by pressing the power button on the operator panel or the appropriate command at the HMC. Attention: Using the power button on the operator panel or commands at the HMC to power off the system can cause unpredictable results in the data files. The next IPL will also take longer to complete if all applications in all partitions are not stopped before hand. Chapter 9. Removal and Replacement Procedures 839 Powering Off the System With an HMC Attached and Linux Installed The HMC user interface provides a power-off function to turn off the power to the system. If the system is logically partitioned, Linux may be stopped individually in each partition to make repairs. If the system power must be turned off, all of the logical partitions must be stopped. Linux does not provide hot-swap procedures for adapter and device removal and installation. You must shutdown all partitions and the system power must be turned off to service an adapter or device. Individual power components and fans may be serviced with power on for power systems equipped with redundant power and cooling. Check removal and installation procedures before servicing this system. As the system goes off, the power LED on the operator panel starts blinking at a fast rate. B0FF appears in the primary operator panel display. When the power-off sequence is complete, the system goes into standby mode, as evidenced by the following: v OK displays in the operator panel display. v The operator-panel LED will start blinking at a slow rate. v The processor subsystem power LED will start blinking at a slow rate.To completely remove power from a subsystem in the frame, use one of the following procedures based on the presence or absence of the High Performance Switch (7045-SW4): v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power to a subsystem must be removed by turning off the UEPO switch and turning off the circuit breakers on all optional IBF subsystems (if provided). v If a High Performance Switch (7045-SW4) is installed in the processor subsystem frame, input power to a subsystem may be selectively removed from the processor subsystem, an I/O subsystem, or switch subsystem using the HMC as described below: The HMC user interface provides a power-off function to turn off the power to an individual subsystem. For information about shutting down and powering off an individual subsystem, see ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide. Attention: If a High Performance Switch is present in the frame, using the UEPO switch to remove power from a subsystem being serviced will remove power from ALL of the subsystems that are receiving power from the frame. This may result in unintended outages and extended down time. If you are servicing an individual I/O subsystem, refer to the information about deactivating a system’s I/O drawers, found in ?Basic System Management Tasks, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from both of the I/O subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are using the resources of the I/O subsystem being serviced. After deactivating the I/O subsystem, both the Power-In LED and the Power-Out LED on the front of the I/O subsystem DCAs should be off. If not, call service support. If you are servicing the processor subsystem, refer to the information about deactivating a managed system’s service processor, found in ?Basic System Management Tasks?, located in the Hardware Management Console Installation and Operations Guide to remove 350 V power from the processor subsystem DCA UPIC cable connections. Before deactivating, ensure that you shut down the operating system on all partitions that are running on the processor subsystem being serviced. After deactivating the processor subsystem, both the Power-In LED and the Power-Out LED on the front of the processor subsystem DCA should be off. If not, do not continue. Call service support. v If a High Performance Switch (7045-SW4) is not installed in the processor subsystem frame, input power must be removed by turning off the UEPO switch, turning off the circuit breakers on all optional IBF subsystems (if provided), and then disconnecting both power cords from the customer’s receptacle. If the system is operating in a full system partition under Linux, typing the shutdown now -h command causes the system to shut down and power off. The -F flag shuts down the system without warning 840 406/90 Service Guide system users. Check with the system administrator before using this command. If you cannot use this method, you can power off the system by pressing the power button on the operator panel or the appropriate command at the HMC. Attention: Using the power button on the operator panel or commands at the HMC to power off the system can cause unpredictable results in the data files. The next IPL will also take longer to complete if all applications in all partitions are not stopped beforehand. Powering the System On The system can be powered on after all of the following steps have been completed: v All I/O drawer cables are connected v UPIC cables are connected v All PCI cables to supported subsystems are connected v The hardware management console (HMC) is connected v Power is connected to the system and the UEPO switch is set to OnAfter the required cables are installed, and the power cables are connected, the HMC graphical user interface provides a power-on function to turn on the power to the system. The power button on the media subsystem operator-panel can be pushed to initialize the system, but the preferred method is to use the HMC. Progress indicators, also referred to as checkpoints, are visible on the media subsystem operator panel display as the system power is turned on. The power LED on the media subsystem stops blinking and stays on, indicating the system power is on. The processor subsystem and I/O subsystems are powered on through the system power control network (SPCN). When power is applied, the power LEDs on the media subsystem go from blinking to on continuously, and the power LEDs on the processor subsystem and I/O subsystem(s) come on and stay on. This indicates that power levels are satisfactory in the subsystems. Powering On the System Using the Service Processor The system can be powered on using the System Power Control menu, which is a Service Processor menu that is available to the privileged user. Chapter 9. Removal and Replacement Procedures 841 Hot-Pluggable FRUs Notes: 1. The Linux operating system does not support some hot-pluggable procedures. 2. Linux does not support hot-plugging any hot-pluggable PCI adapters or devices. A system with Linux installed on one or more partitions must be shutdown and powered off before replacing any PCI adapter or device assigned to a Linux partition. Follow the non-hot-pluggable adapter or device procedures when replacing a PCI adapter or device in any partition with Linux installed. 3. You do not have to power off the system to remove a hot-pluggable FRU from the processor subsystem or from I/O subsystems. FRUs can be considered as not concurrently replaceable, concurrently replaceable with hot plug, or concurrently replaceable. The following table indicates which FRUs are hot-pluggable or non-hot-pluggable. Each FRU shown is identified as available for concurrent maintenance if applicable. Processor Subsystem FRUs Hot-Pluggable Concurrent Maintenance Notes Blowers Yes Processor subsystem and power blowers can be replaced with power on, follow the appropriate removal and replacement procedures. DCA Yes DCAs can be replaced with power on, follow the appropriate removal and replacement procedures. Bulk Power Enclosure Yes One Bulk Power Enclosure can be replaced with the system running, follow the appropriate removal and replacement procedures. Note: If you use the Start Service button and the ENBL (Good) LEDs on the enclosure components do not go off, do not proceed. Contact the customer and schedule a time for replacement of the enclosure with power off. Bulk Power Controller (BPC) Yes BPCs can be replaced with power on, follow the appropriate removal and replacement procedures. Bulk Power Regulator (BPR) Yes BPRs can be replaced with power on, follow the appropriate removal and replacement procedures. Bulk Power Distributor (BPD) Yes BPDs can be replaced with power on, follow the appropriate removal and replacement procedures. UEPO Switch panel Yes UEPO Switch panel can be replaced with power on, follow the appropriate removal and replacement procedures. BPF Yes BPF can be replaced with power on, follow the appropriate removal and replacement procedures. IBF Yes IBF can be replaced with power on, follow the appropriate removal and replacement procedures. Capacitor book No Processor subsystem chassis No MCM No Memory cards No L3 cache No Clock card No I/O books No 842 406/90 Service Guide I/O Subsystem FRUs Hot-Pluggable Concurrent Maintenance Notes - I/O backplane and riser card No - DASD backplane No - Disk drives Yes (AIX only) Might require additional consideration or preparation by the customer for replacement. - DCA (power supplies) Yes Only one DCA can be removed at a time. - I/O fan assemblies Yes Can be replaced only if the DASD backplane is not installed. - PCI adapters Yes (AIX only) Might require additional consideration or preparation by the customer for replacement. Operator Panel No Diskette drive Yes (AIX only) Must be removed from the configuration and the partition must be booted afterwards to ensure correct operation. Might require additional consideration or preparation by the customer for replacement. CD ROM Yes (AIX only) Must be removed from the configuration and the partition must be booted afterwards to ensure correct operation. Might require additional consideration or preparation by the customer for replacement. Optional media SCSI devices Yes (AIX only) Must be removed from the configuration and the partition must be booted afterwards to ensure correct operation. Might require additional consideration or preparation by the customer for replacement. Chapter 9. Removal and Replacement Procedures 843 Power Subsystem This section contains the removal and replacement procedures for the power subsystem. Notes: 1. All pluggable BPA FRUs are replaced concurrently with system operation when the removal and replacement instructions are followed. Exceptions to this occur when multiple failures exist in the BPA. 2. Removal and replacement of the entire BPA enclosure (BPE) is also concurrent with system operation, unless there are multiple BPE failures. Attention: Be sure to check for FRU redundancy to determine if the repair can be performed concurrently. If you cannot determine redundancy, delay the repair until the system can be shut down. See the “Hot-Pluggable FRUs” on page 842 for more specific details. Bulk Power Assembly (BPA) The bulk power assembly is on the top of the rack. This assembly provides power to the whole system. There are two sides to the BPA so that power is never completely removed from the system unless the system power is shut down, and the power cables are removed from the customer’s power source. Bulk Power Regulator (BPR) The BPRs are located in the bottom positions of the bulk power enclosure (BPE).Notes: 1. Before installing the BPR in the BPE (Bulk Power Enclosure) during a service or repair action, the Lock/Unlock switch located on the BPR, MUST be set to the Unlock position. Failure to do so may damage the BPR guide rails in the BPE resulting in extended outages and unnecessary, additional parts replacement. 2. The field stocks two versions of BPR. The older version is identified by FRU part number 44P1995. Order the older version of the BPR if you are not replacing all of the BPRs installed in the system and the firmware on the system is below firmware release 3H030916. Install the upgrade BPR, identified by FRU part number 44P2645, only if the firmware on the system is 3H030916 or later. Do not mix the two versions without the firmware level being 3H030916 or later. 3. Removal Attention: The BPR unit weighs approximately 12.47 kg (27.5 lbs). 1. Locate the BPR to be replaced. 2. Note the state of the BPR ENBL (Good) LED. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Locate the BPR to be replaced and manually slide its Lock/Unlock switch to unlock (power switch to the right). At this point, the ENBL (Good) LEDs turn off. 6. If the integrated battery feature (IBF) is installed, manually turn off the IBF circuit breaker attached to the BPR being repaired. Unplug the battery cable from the front face of the BPR. 7. Loosen the left and right fasteners using the torque tool (part number 6422789). Push up on the BPR release levers, and slide the unit out of the slot.Replacement 1. The field stocks two versions of BPR. The older version is identified by FRU part number 44P1995. Order the older version of the BPR if you are not replacing all of the BPRs installed in the system and the firmware on the system is below firmware release 3H030916. Install the upgrade BPR, identified by 844 406/90 Service Guide FRU part number 44P2645, only if the firmware on the system is 3H030916 or later. Do not mix the two versions without the firmware level being 3H030916 or later. 2. Make sure that the BPR release levers are in the UP position. Slide the unit into slot until one inch from BPE. Using both hands on the front of the BPR, firmly and quickly seat the BPR against the backplane. 3. Tighten the left fastener four turns with torque tool followed by four turns to the right fastener. Torque the left fastener to completion, followed by the right fastener. Return to the left fastener and ensure that it is properly torqued. 4. Move the BPR release levers to the DOWN position. 5. If the battery feature is installed, reconnect the battery cable to the connector on the front face of the BPR. 6. Manually turn on the IBF circuit breaker. 7. Move the BPR Lock/Unlock switch to lock (power switch to the left). 8. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure, if this repair was completed during a system outage, bring the system back up. 9. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPR contains the correct code.Bulk Power Controller (BPC) The BPC is located immediately above the top BPR. Attention: This service procedure may produce other power error code(s) (101x xxxx) between the time the Start Service button is pressed and the Service Complete button is pressed, during concurrent repair. If normal system operation continues, as described by this procedure, ignore these error codes. Removal Attention: The BPC unit weighs approximately 3.18 kg (7 lbs). 1. Locate the BPC to be replaced. 2. Note the state of the BPC LEDs. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Check the ENBL (Good) LED for all the BPRs (LED on right side of FRU) on same BPA side. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Attention: Do not switch off units where this LED is already off. 6. Verify that all cables are labeled for plug location and then unplug the cables from the BPC. 7. Loosen the left and right fasteners using the torque tool (part number 6422789). Slide the unit out of the slot. Chapter 9. Removal and Replacement Procedures 845 Replacement Attention: Before proceeding, make sure that all the BPR ENBL (Good) LEDs are OFF. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Do not switch off units where this LED is already off. Attention: Make sure that the UEPO bypass switch on the BPC is in the BYPASS position. 1. Slowly slide the BPC into the slot and carefully seat the BPC against the backplane. 2. Tighten the left and right fasteners with the torque tool. 3. Plug all cables to the BPC, ensuring that the plug location and the cable label agree. 4. Manually move any BPR power switches that are to the right (OFF) position, to the left (ON) position. 5. Move the UEPO bypass switch to the NORMAL position. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. The BPA automatically configures as it powers on. 7. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPC contains the correct code. 846 406/90 Service Guide Bulk Power Distributor (BPD) There can be two BPDs, located immediately above the BPC. If only one BPD is required, an airflow baffle must installed in the top BPD position. Attention: This service procedure may produce other power error code(s) (101x xxxx) between the time the Start Service button is pressed, and the Service Complete button is pressed, during concurrent repair. If normal system operation continues, as described by this procedure, ignore these error codes. Removal Attention: The BPD unit weighs approximately 2.27 kg (5 lbs). 1. Locate the BPD to be replaced. 2. Note the state of the BPD ENBL (Good) LED. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Check the ENBL (Good) LED for all the BPRs (LED on right side of FRU) on same BPA side. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Attention: Do not switch off units where this LED is already off. 6. Verify that all cables are labeled for plug location and then unplug the cables from the BPD. 7. Loosen the left and right fasteners using the torque tool (part number 6422789). Slide the unit out of the slot.Replacement Attention: Before proceeding, make sure that all the BPR ENBL (Good) LEDs are OFF. For each BPR that has this LED in the ON state, manually slide the power switches to the right. Do not switch off units where this LED is already off. Attention: Make sure that the UEPO bypass switch on the BPC is in the BYPASS position. 1. Slowly slide the BPD into the slot and carefully seat the BPD against the backplane. 2. Tighten the left and right fasteners with the torque tool. 3. Plug all cables to the BPD, ensuring that the plug location and the cable label agree. 4. Manually move any BPR power switches that are to the right (OFF) position, to the left (ON) position. 5. Move the UEPO bypass switch to the NORMAL position. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. The BPA automatically configures as it powers on. 7. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPD contains the correct code. Chapter 9. Removal and Replacement Procedures 847 Bulk Power Fan (BPF) The BPF is located immediately to the right of the BPC and positioned vertically in the BPA. Removal Attention: The BPF unit weighs approximately 3.63 kg (8 lbs). 1. Locate the BPF to be replaced. 2. Note the state of the BPC ENBL (Good) LED on the same BPA side as the BPF to be replaced. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Verify that the BPF cable is labeled for plug location and then unplug the cable from the BPC. 6. Remove the fan cover plate and set aside for the replacement procedure. 7. Loosen the upper and lower fasteners using the torque tool (part number 6422789)l. Move the BPR release, levers if necessary, and slide the unit out of the slot.Replacement 1. Ensure that the BPF fasteners are to the left side. 2. Slowly slide the BPF into the slot and carefully seat the BPF against the front of the BPE, moving the BPR release levers if necessary. 3. Tighten the upper and lower fasteners with the torque tool. 4. Return the BPR release levers to the down position, if necessary. 5. Plug the BPF cable to the BPC, ensuring that the plug location and the cable label agree. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. 7. If a high performance switch is installed in or attached to the system, perform the steps described in “Frame (Power Subsystem) Firmware Update with a 45H/W4 Switch Installed” on page 809 to insure that the new BPF contains the correct code. 848 406/90 Service Guide Bulk Power Enclosure (BPE) Attention: This service procedure may produce other power error code(s) (101x xxxx) between the time the Start Service button is pressed and the Service Complete button is pressed, during concurrent repair. If the system behaves normally, as described by this procedure, ignore these error codes. Removal Attention: The BPE unit weighs approximately 18.14 kg (40 lbs). 1. Locate the BPE to be replaced. 2. Note the state of all ENBL (Good) LEDs in the BPE. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if the ENBL (Good) LED goes off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Manually slide all Lock/Unlock switches (power switches to the right, unlock) on all BPRs attached to the BPA side being repaired. At this point, the ENBL (Good) LEDs turn off. 6. Manually turn off all IBF circuit breakers attached to the BPA side being repaired. 7. Verify that all cables are labeled for plug location and then unplug the cables from the BPA in the following order: a. BPC b. BPD c. BPR d. IBF (if installed) e. ac line power cord 8. Remove all pluggable FRUs from the BPE (BPRs, BPCs, BPDs, and BPF). 9. Remove the four mounting screws.Note: Do not remove the two screws at the bottom left, as these fasten the cage rail to the frame. 10. Slide the unit out of the frame.Replacement Attention: Make sure that the all BPR Lock/Unlock switches (power switches) are in the unlock (right, OFF) position on this BPA before proceeding. Attention: Make sure that the UEPO bypass switch on the BPC front panel is in the NORMAL position. 1. Slowly slide the BPE into the frame using the right side guide pins until the side flanges contact the frame. 2. Fasten the four mounting screws. 3. Replace all pluggable FRUs to the BPE (BPR, BPC, BPDs, BPF, and baffles).Note: Be sure to check the board connector for each FRU for any visible signs of damage (such as broken sockets.) before plugging into the backplane. Replace any FRU that is damaged. 4. Plug all cables to the BPA FRUs, ensuring that the plug location and the cable label agree. 5. Plug in the ac line power cords. 6. Manually turn on all IBF circuit breakers (if installed) attached to the BPA side being repaired. 7. Manually move all Lock/Unlock switches (power switches) to the lock position (the left, ON position) on all BPRs attached to the BPA. 8. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. The BPA automatically configures as it powers on. Chapter 9. Removal and Replacement Procedures 849 Unit Emergency Power Off (UEPO) Card Assembly 41235667891011 1 Simplified UEPO Front Panel 7 J02 Connector (White) 2 Simplified UEPO Side Panel 8 Power Switch 3 Simplified UEPO Rear Panel 9 System Fault LED 4 J00 Connector (Back) 10 Start Service Button (Green) 5 J01 Connector (Back) 11 Service Complete Button (White) 6 Room EPO Bypass Interlock Removal Note: If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. 1. Locate the system with the faulty UEPO switch. 2. Press the green Start Service button on the UEPO switch. 3. Switch the UEPO bypass switches on both BPCs to the ?Bypass? position (to the right). 4. Ensure that the red switch on the EPO panel is in the ON position. 5. Verify that all cables are labeled for plug location and then unplug the cables from the UEPO card assembly. 6. Loosen, but do not remove, the two 8-mm M5 mounting screws. Slide the assembly forward and away from the frame.Replacement Note: If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. 1. Place the assembly over the two 8-mm M5 screws, slide it downward, and tighten the screws. 2. Plug all cables to the UEPO card assembly, ensuring that the plug location and the cable label agree. 3. Install the external cable, if required, to connector J02. a. If the external EPO connection is installed, the internal toggle switch is mechanically forced into the ROOM EPO ACTV position (to the right). 850 406/90 Service Guide b. If there is no external EPO cable, manually set the internal toggle switch to the ROOM EPO BYPASS position. To set this switch, pull the knob down, to the ROOM EPO BYPASS position, and release the knob (to the left).4. Ensure that the red UEPO switch is on (up position). 5. Switch the UEPO BYPASS switches on both BPCs back to NORMAL mode. 6. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure.Integrated Battery Feature (IBF) Removal Attention: This process requires: v The use of a lift tool (part number 09P2481) and lift-plate assembly tool (part number 11P4369). v The lift tool wheels must be chocked to prevent it from moving during the operation. The frame must not be on its casters, and the leveling pads must be engaged to prevent the frame from moving during the operation.The IBFs (if installed) are mounted under the media subsystem. 1. Locate the IBF to be replaced. 2. Press the green Start Service button on the UEPO switch. 3. Turn off the IBF circuit breaker. 4. Unplug the battery cable from the front of the IBF, and remove the 7-mm M5 screw (closest to the connector) that is holding the cable to the frame. This action allows the cable to move so that the IBF can be removed. 5. Remove the two mounting bolts from the guide rails and retain for reuse during the replacement procedure. 6. Read all safety instructions before performing this operation. Unpack the lift tool and follow assembly instructions included with the tool. Inspect the tool for damage (do not perform the removal procedure with a damaged lift tool). Replace the blue lift plate with the orange lift plate (part number 11P4369). Orient the two round holes in the lift plate to the side of the lift tool that will be closest to the frame. Install the wheel chocks and engage the brake. 7. Raise the plate to a height such that the lift plate is just below the base of the IBF you are removing. Release the pins on the lift plate and fully extend the center portion of the plate toward the IBF and lock it in place. Release the pins on the extender arms and position them on both left and right side. The arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 8. One person should gently pull and guide the IBF onto the lift plate. The IBF must be removed far enough to clear the front face of the frame. Retract the extender arms and center portion of lift plate. Note: Ensure that the IBF is in the center of the lift plate. 9. Lower the IBF to the bottom position of the lift tool. 10. Remove the wheel chocks and release the brake. Flip the crank handle 180 degrees. Carefully move the lift tool with the IBF to the staging area. The brake and wheel chocks must be in place when transferring the IBF to or from the packaging material. Chapter 9. Removal and Replacement Procedures 851 Replacement Note: Install rails in the frame if not already installed. 1. Position the lift tool such that the new IBF can be moved onto the lift plate. 2. Install the wheel chocks and engage the brake. 3. Move the new IBF onto the tool with the rear side toward the two holes of the lift plate. The side with the handles is considered the front side.Note: Ensure the IBF is in the center of the lift plate.Remove the wheel chocks and disengage the brake. 4. Position the lift tool as close as possible to the rack. Install the wheel chocks and engage the brake. 5. Raise the plate to a height such that the lift plate is just above the rails of the slot you are installing it in. Release pins on the lift plate and fully extend the center portion of the plate toward the frame and lock it in place. Release the pins on the extender arms and position them on both left and right side. Arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 6. One person should gently push the IBF into the frame. Adjust the height of the lift plate, as needed. 7. Lower the lift plate and remove it from the lift tool, repackage the lift tool. 8. Fasten the IBF to the mounting rails, using two mounting bolts. 9. Ensure the circuit breaker is in the off (right) position if it is not already off. 10. Plug the battery cable into the IBF. Fasten the cable-retaining clamp (closest to the cable’s connector) to the frame, using a 7-mm M5 screw 11. Reinstall all components removed during the removal process. 12. Set the circuit breaker to the on (left) position. 13. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. 852 406/90 Service Guide Processor Subsystem This section contains the removal and replacement procedures for the processor subsystem. Handle all processor subsystem FRUs with extreme care. Installing the MCM/L3/Passthrough Plug Map If the MCM/L3/Passthrough Plug Map is not installed in the system, install it before continuing with any repair action for MCMs or L3 cache module. With the front cover open, locate the right frame extender (located on the opposite side of the UEPO switch). The 50 mm (approximately 2 inch) wide label should be installed vertically, along this extender, approximately in line with EIA frame positions 22 - 28. The trifold label has two hook-and-loop fastener strips attached to the back. Remove the adhesive backing from the hook-and-loop fastener and attach it to the frame extender. A small piece of hook-and-loop fastener is used to help keep it folded while attached to the frame. The label appears similar to the following: P/N11P4914 Using this Label Whenever an MCM/Passthrough module is added or replaced/reseated during a repair action, a sticker with the module’s serial number is provided for use with the plug map label according to its plugging location. If serial number label is not available, then record the serial number of the MCM/Passthrough module is added, reseated, or replaced along with the number of times plugged. This module is then allowed three plugs with each plug recorded on label. If the module should need to be removed and reinstalled a fourth time, it must be replaced. The process is similar for the L3 module, except a separate serial number label is not provided, instead the serial number is located on a sticker attached to the L3 heatsink. Record the serial number of the L3 modules added, replaced, and reseated along with the number of times plugged. If the module should need to be removed and reinstalled a fourth time, it must be replaced. Chapter 9. Removal and Replacement Procedures 853 I/O Books and I/O Blanks Attention: Before starting the installation of the primary I/O book FRU, the customer must back up the HMC partition profiles, save upgrade data, and back up critical console data. The following procedures are for removing, inspecting, and replacing primary and secondary I/O books. Before proceeding to the removal and replacement procedures, read and perform the procedures in “Before I/O Book Replacement.” Before I/O Book Replacement Read and then perform the following procedures before replaceing any primary or secondary I/O book. Save the Service Processor Settings: If the primary I/O book is being replaced, locate the backup copy of the service processor settings. If a backup copy is not available, use the “Save or Restore Hardware Management Policies,” in the “Introduction to Tasks and Service Aids” chapter of the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, to back up the service processor settings saved in NVRAM. If you cannot run the appropriate task, or the service processor settings were not previously backed up, record the service processor settings manually. See “Saving and Restoring Service Processor Settings” on page 770 for information about saving the service processor settings manually. The service processor settings must then be reentered manually after the primary I/O book is replaced. Record the System Name and Bootlists: From the ?Server and Partition Management? screen on the HMC, do the following: 1. Record the managed system’s name. This is the name that appears in the first line under the line that displays the machine type, model, and serial number of the managed system. 2. If the customer has not already backed up the profile data on the HMC, do so now: a. Right-click on the managed system’s name. b. Select the ?Profile Data? task with the ?Backup? option and perform that action. c. Record the filename that was entered as the backup file’s name.3. Record the LPAR mode bootlists. If the managed system is running in full system partition mode, skip to step 4. If the system is running in LPAR mode, continue. Ensure that all partitions have been shut down, then do the following for each partition: a. Right-click on the profile name and select ?Properties?. b. Select the ?Other? tab and change the boot mode to SMS. c. Click on ?OK?. d. Right-click on the profile name and select ?Activate?. e. On the ?Activate Partition? screen, click on the ?Open terminal? checkbox. f. Click on ?OK? to boot the partition to the SMS menus. g. At the SMS main menu, select ?Select Boot Options?, then select ?Select Boot Devices?, then select ?Display Current Settings?. h. Record the boot list found in ?Display Current Settings?. i. Click on the partition name and choose ?Operating system reset?; on the operating system reset screen select ?Hard Reset?. j. Click on ?OK? to shut down the partition. k. If the managed system is not powered down at this point, power it down now. 854 406/90 Service Guide 4. Record the full system partition mode bootlist if one has been set. a. Right-click on the full system partition name, then select ?PowerOnSMSProfile?, click on the ?Open terminal window? box, then click on ?OK? to power the system up to the SMS menus. b. At the SMS main menu, choose ?Select Boot Options?, then choose option 2, ?Select Boot Devices?. Choose option 6, ?Display Current Settings?. Record this boot list. c. Click on the partition name and choose ?Operating system reset?; on the operating system reset screen select ?Hard Reset?, then click on ?OK? to shut down the system. d. Continue to step 5.5. Continue to “Checking the platform firmware level.”Checking the platform firmware level: When the managed system is powered off with standby power still on, open a virtual terminal window on the HMC and access the service processor menus. On the main menu, look for the platform firmware level (which is in the format RHyymmdd or vHyymmdd, where v is the version number) above the main menu on the screen. Record the firmware level.Note: If the firmware is RH020312 or earlier, it is GA1 firmware. If it is between RH020413 and RH021024 (inclusive), or 2Hyymmdd, it is GA2 firmware. If it is RH021025 or later, or 3Hyymmdd or later, it is GA3 firmware. Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. At the back of the system, remove the safety bars from the I/O books and blanks. 3. To ease replacement, record the connector locations of all cables attached to the I/O book. Disconnect all of the cables to the I/O book. 4. If the primary I/O book is being replaced, remove all VPD cards from the old primary I/O book and install them into the new primary I/O book. See “VPD Card” on page 904 for instructions on removing and replacing the VPD cards. 5. Grasp the blue release latches located at the top and bottom of the memory book between your thumb and index finger, and press the tabs together. Actuate both blue handles simultaneously until the book fully unseats from the connector. Pivot the tabs and pull the book straight back. Chapter 9. Removal and Replacement Procedures 855 Inspection You must examine the connectors for any connector damage on the removed book using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If you identify a defect on the book, do not install the replacement book without contacting your next level support for instructions. A damaged connector on a defective book may indicate a damaged system backplane. If an I/O book connector is damaged, then inspect the system backplane for damage. If the removed I/O book is not damaged, carefully remove the new I/O book from the packaging. Be sure to protect the connector and the blue handles when you remove the I/O book from the cardboard box or ESD bag. Do not handle the I/O book by its connector or either of its blue handles. Remove the protective cover from the connector and install it on the defective I/O book.Notes: 1. Do not stack books on top of each other. ESD bags do not provide sufficient stacking protection. 2. Do not rest an I/O book on its connector or on its blue handles at any time.After the I/O book has been removed from the ESD bag, examine the Book VHDM connector using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If any connector shows signs of damage, do not use it. Replacement To replace an I/O book, do the following: 1. Inspect the new I/O book for damage. For more information, see “Inspection.” 2. Unlatch the two blue handles on the replacement book. 3. Taking care not to hit the connector against the CEC cage, use both hands to carefully place the book on the CEC guides. 4. Slide the book in until it reaches the alignment pins. 5. Using even pressure, actuate both blue handles simultaneously until the book is fully seated. 6. Restore the service processor settings previously saved. 7. Repeat this process for all replacement I/O books. 8. If any cables were removed during the removal procedure, plug all cables into their correct locations. 9. Turn the UEPO switch to the ON position. The service processor goes into standby. Standby is indicated by the OK prompt in the operator panel display.Checking the Platform Firmware Level on the FRU Just Installed: The primary I/O book FRU might have a later level of firmware than the HMC recognizes. This causes a ?version mismatch? error message on the HMC. If this happens, the HMC can no longer communicate with the managed system. To determine if this is a potential problem, check the HMC’s software level, as follows: 1. Click Help at the top of the HMC console. 2. Click About the Hardware Management Console. 3. Install the new primary I/O book FRU and follow the procedure in one of the following cases: v Case 1: Version Mismatch Error Displays: If the ?version mismatch? error appears on the HMC, the firmware level on the new primary I/O book must be determined.Note: You need a laptop computer and (cable part number 62H4857) capable of connecting to serial port 2 on the primary I/O book. a. Attach the laptop computer and cable to serial port 2 on the primary I/O book. b. When OK is displayed in the operator panel on the media drawer, press a key on the virtual terminal on the laptop computer. The firmware version is displayed above the main menu, in the format RHyymmdd or vHyymmdd. 856 406/90 Service Guide If the HMC is at release 2 version 1.2 (or earlier), and the system firmware is at GA3, which is any level including RH021025 and later, and 3Hyymmdd and later, there are two options for resolving the version mismatch: – The customer can upgrade to HMC release 3 version 1.0 (or later), also referred to as HMC recovery version 1.3 (or later). OR – Flash the GA2 level of platform firmware on the primary I/O book after it is installed. GA2 in this case might be RH021024; however, if a level of firmware with the name in the format of 2Hyymmdd or later is available, use that level instead of RH021024.v Case 2: Version Mismatch Error Does Not Display: If the ?version mismatch? error does not appear on the HMC, the firmware level of the new primary I/O book must still be determined.Note: You need a laptop computer (and cable part number 62H4857) capable of connecting to serial port 2 on the primary I/O book. a. Open a virtual terminal window on the HMC. b. When OK is displayed in the operator panel on the media drawer, press a key on the virtual terminal on the laptop computer. The firmware version (from the new I/O book) is displayed above the main menu, in the format RHyymmdd or vHyymmdd. – If the HMC is at release 2, no further action is required, (ensure that the system firmware is updated to the latest GA2 firmware level because it is a minimum mandatory level). – If the HMC is at release 3 version 1.0 (or later), also referred to as HMC recovery version 1.3 (or later), no further action is required, (ensure that the system firmware is updated to the latest GA2 firmware level because it is a minimum mandatory level). Chapter 9. Removal and Replacement Procedures 857 Set the I/O Type: Verify, and change if necessary, the I/O type as described in “Change I/O Type” on page 785. Set the System Name: If you are replacing a primary I/O book on a system with an HMC attached, you must set the system name through the Main Menu (for a privileged user) of the service processor menus. Reboot the HMC after setting the system name to allow the HMC to recognize the attached managed system by its new name. If the HMC does not recognize the attached managed system, the default assigned name is unnamed. Restore the Service Processor Settings: If the service processor settings must be reentered manually, do so now. If the service processor settings were saved using the AIX diagnostic service aid, restore them using the service aid after the AIX partition with service authority is booted. Restore the Partition Data on the HMC: Using the HMC, bring the managed system up to the LPAR Ready state. After the system is selectable by its new name on the HMC console, the recover managed system task can be executed at the HMC console to restore partition information. The partition information is copied from the HMC console to the new service processor (part of the primary I/O book) NVRAM and may take 15 to 30 minutes to complete. Wait for the operation to complete as indicated on the HMC. Reenter the Boot Lists on the HMC: Using the HMC, reenter the partitions’ boot lists or the full system partition boot list using the lists that were recorded before the primary I/O book was replaced. Set the Date and Time: Set the system date and time using the operating system. 858 406/90 Service Guide Battery CAUTION: A lithium battery can cause fire, explosion, or a severe burn. Do not recharge, disassemble, heat above 100 degrees C (212 degrees F), solder directly to the cell, incinerate, or expose cell contents to water. Keep away from children. Replace only with the part number specified for your system. Use of another battery may represent a risk of fire or explosion. The battery connector is polarized; do not attempt to reverse the polarity. Dispose of the battery according to local regulations.C9 The battery is located in the primary I/O book. Before removing the primary I/O book to access the battery, follow the procedures titled “Save the Service Processor Settings” on page 854 and “Record the System Name and Bootlists” on page 854. Removal 1. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854). 2. Use a T10 torx driver to remove the screws for the short cover near the connectors. 3. Locate the battery near the spare VPD card connector as shown in the following illustration. L3CUoD 4. Use one finger to gently press the top edge of the battery in and down from the battery socket. Then use your thumb and index finger to pull the battery out of the socket, as shown in the following illustration. Chapter 9. Removal and Replacement Procedures 859 Replacement Replace in reverse order. Insure that the battery polarity is correct: place the battery in the socket with the positive side facing out. After the primary I/O book has been reinstalled, turn the UEPO switch to the ON position. The service processor will come up to standby, indicated by OK in the operator panel in the media drawer. Perform these procedures in the order shown here: 1. “Set the I/O Type” on page 858 2. “Set the System Name” on page 858 3. “Restore the Service Processor Settings” on page 858 4. “Reenter the Boot Lists on the HMC” on page 858 5. “Set the Date and Time” on page 858. 860 406/90 Service Guide Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks Removal The following safety notice applies to power-related field replaceable units. DANGERDo not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit.D021. Locate the DCA to be replaced. 2. Note the state of the DCA LED. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if all the DCA’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Loosen the upper and lower fasteners, using the torque tool (part number 6422789), alternating between the fasteners until the unit can slide out of the slot. 6. Grasp both sides of the DCA and pull it out of the drawer.Replacement 1. Carefully slide the DCA into the slot as far as possible (the back edge of the top of the DCA is about 0.25 inch from the cage bracket).Note: Do not force the DCA against the backplane, as this may damage the connector. 2. Hold the upper part of the DCA against the backplane, and use the torque tool to engage the upper fastener about two turns into the cage bracket. 3. Repeat the previous step for the lower fastener. 4. Alternate tightening the upper and lower fasteners until both are fully torqued in. 5. Check torque on both fasteners. 6. Plug all cables into their correct location in the DCA. 7. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. Chapter 9. Removal and Replacement Procedures 861 Capacitor Books and Blanks Removal The following safety notice applies to power-related field replaceable units. DANGERDo not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit.D021. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. If you have not already done so, open the rear door of the rack unit. 3. Locate the capacitor book or blank to be removed. 4. Loosen the upper and lower fasteners, using the torque tool (part number 6422789), alternating between the fasteners until the unit can slide out of the slot. 5. Grasp both sides of the capacitor book or blank, and pull it out of the drawer.Replacement 1. Carefully slide the capacitor book into the slot as far as possible (the back edge of the top of the capacitor book is about .25 inch from the cage bracket).Note: Do not force the capacitor book against the backplane, as this may damage the connector. 2. Hold the upper part of the capacitor book against the backplane, and use the torque tool to engage the upper fastener about two turns into the cage bracket. 3. Repeat the previous step for the lower fastener. 4. Alternate tightening the upper and lower fasteners until both are fully torqued in. 5. Check torque on both fasteners. 6. If any cables were removed during the removal procedure, plug all cables into their correct locations. 7. Turn on the power as described in “Powering the System On” on page 841. 862 406/90 Service Guide MCM Module (Processor) and Pass Through Modules Attention: All components must be replaced to the location from which they were removed. Before replacing MCMs, check with your support center to ensure you have been directed to replace the MCM(s) by a valid error code in Chapter 5, “Error Code to FRU Index,” on page 413. Handling Static-Sensitive Modules Attention: MCM and L3 modules are sensitive to static electricity discharge. These devices are shipped in antistatic containers to prevent damage caused by electrostatic discharge. Take the following precautions: v Use an antistatic wrist strap while handling the device. v Do not remove the device from the antistatic container until you are ready to install the device in the system unit. v With the device still in its antistatic container, touch it to a metal frame of the system. v Grasp modules by the heatsink. Avoid touching the interposer buttons. v If you need to lay the device down while it is out of the antistatic container: – Lay the MCMs with the posts down on the antistatic mat. – Lay the L3s with the LGA connector down in the antistatic tray.Before picking it up again, touch the antistatic container and the metal frame of the system unit at the same time. v Handle the devices carefully in order to prevent permanent damage. Attention: Before you remove or replace any MCM or L3 cache module, stop, read and understand this entire procedure (MCM Module (Processor) and Pass Through Modules) and the procedure for “L3 Cache Modules” on page 882. Also, check to ensure that the MCM/L3 Label Map is installed. Refer to “Installing the MCM/L3/Passthrough Plug Map” on page 853. Attention: The MCM modules in this system can be attached to the backplane a maximum of three times. Check the MCM/L3 Label Map to confirm the number of times the target module has been unplugged. If it has been unplugged three times, order a new module. If an MCM module is removed for a fourth time, it must be replaced with a new MCM module. New MCMs are shipped with a new MCM/L3 plug count adhesive replacement label. When you install a new L3 cache module, place the new label in the correct position on the MCM/L3 Label Map. If a new MCM/L3 Label Map is needed, see “Tools” on page 1033 for replacement label part number.Note: The MCM module and its associated VPD card are a single FRU, and must be replaced as a set. See “VPD Card” on page 904 for VPD card removal and replacement procedure. Attention: If the base configuration of the system has only one MCM, The pass through modules are replaced with a filler cover that is attached only by the cam locks. Chapter 9. Removal and Replacement Procedures 863 Removal Attention: If more than one module is to be replaced or reseated, begin with the upperpmost module, replace it and then go to module(s) in the next lower position, until the lowermost module has been replaced or reseated. Before any MCM, pass through module, or cover is removed, perform a general cleaning operation, as described in step 3 of the following procedure. 1. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis. See “Fan Chassis” on page 912. 3. Perform a general cleaning operation, using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for the tool part numbers), to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris that the brush generates. 4. Note the location of the module being removed on the MCM/L3 Label Map on the system to confirm the number of times the target module has been unplugged. If it has been unplugged three times, order a new module. Attention: Each MCM must be returned to the location from which it was removed.Note: If this is the first removal/reseat for the module, record the serial number of the module and the date of the action. 5. Determine the type of MCM modules used on your system. The following illustration shows the difference between the aluminum heatsink and copper heatsink MCMs. 1 MCM (with aluminum heatsink) 2 MCM Alignment Hole (on copper heatsink MCMs only) 3 MCM (with copper heatsink) 6. If the MCM modules that you are removing have an aluminum heatsink and a plastic stiffener attached to the backplane, go to substep 10 on page 865. 7. If the MCM modules that you are removing have a copper heatsink and an aluminum stiffener attached to the backplane, do the following:Note: Remove each MCM processor from the processor drawer, one at a time, until all MCM processors that you planned to remove have been removed. 864 406/90 Service Guide 8. Beginning with the upper most module to be removed, loosen the two captive heatsink bushing screws until they are free floating. See the following illustration for heatsink bushing screw locations. 9. Insert the two steel alignment rods through the MCM alignment holes and into the aluminum stiffener. See the following illustration for MCM clearance hole locations. 1 Aluminum Backplane Stiffener 4 MCM Alignment Hole (quantity 2) (on MCM with copper heatsink only) 2 Steel Alignment Rods 5 Captive Heatsink Bushing Screw 3 MCM (with copper heatsink) 6 Locking Cam 10. Ensure that the locking cams at the front of the MCM module are engaged with the module. To adjust the locking cams, use a 5-mm socket wrench for systems containing the aluminum backplane stiffener and an 8-mm socket wrench for systems containing the plastic backplane stiffener. 11. Remove DCA books and blanks 2 through 6, as well as both capacitor books. See “Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks” on page 861 and “Capacitor Books and Blanks” on page 862. Chapter 9. Removal and Replacement Procedures 865 12. Ensure that the actuation bolt behind the MCM module is in the down position. AAFront1243566418Bab777RightSideBack 1 L3 Filler 5 System Clock Card 2 L3 Module Assembly 6 MCM Filler or Pass Through Card 3 MCM Locking Cam 7 System Board 4 MCM Heatsink / Module Assembly 8 Actuation Bolt 13. Use the torque wrench (part number 11P4486, supplied with the system tool kit) to loosen the actuation bolt in the center of the MCM module. If installing a new MCM, remove the actuation bolt. 14. If installing a new MCM, insert the new actuation bolt supplied with the new MCM module. 15. Move the actuation bolt to the up position and hand-tighten to hold the assembly in the up position. 16. Hold the MCM module that you are removing with one hand. Use an 8-mm socket wrench (plastic backplane stiffener) or 5-mm socket wrench (aluminum backplane stiffener) to unlatch the locking cams. 17. If you are removing a MCM with the aluminum heatsink, go to substep 19 on page 867. 18. If you are removing a MCM with the copper heatsink, using the steel alignment rods, previously installed, slide the MCM processors out and away from the system backplane. Notes: a. The steel alignment rods are used to relieve some of the weight of the MCM processor when removing them from the processor drawer. b. The field restocking plan, is to replace the 1.7 GHz copper heatsink MCM (factory installed) with a 1.7 GHz aluminum heatsink MCM. 866 406/90 Service Guide 19. With both hands, grasp the module across the narrow span of the heatsink cap, and pull the MCM module from the system board and place it, posts-down, on the ESD mat. 1 Processor Backplane 2 MCM (aluminum heatsink version) 3 MCM Pass Through 20. Remove the two steel alignment rods and move them to the next MCM processor to be removed. Continue doing the previous steps until all of the MCM processors you plan to remove have been removed. 21. Using the new brush supplied with the new module, follow the procedures described in this step to clean the LGA site on the system backplane. The LGA site is the position from which you removed the MCM module.Note: Before you remove the brush from its protective bag, loosen the bristles by pressing them on the ESD mat until they are 90 degrees from the handle. Chapter 9. Removal and Replacement Procedures 867 Attention: Each time an MCM module is removed from system backplane, the LGA site where the module connects must be cleaned. With the brush supplied with the replacement kit, use only downward strokes, starting in the upper-left location. Using firm pressure, brush left to right, top to bottom, using overlapping strokes as shown in the following illustration. 1122553344................ The brush must be discarded after cleaning only one LGA site. 22. Each time an MCM module (or pass through) is installed, the associated interposer must be inspected. Inspect the MCM (or pass through) interposer, using a magnifying glass (part number 11P4747). If either of the following conditions occur, the interposer is not acceptable and the module must be returned: Shorts If the button, a loop of the button wire, or an individual button wire lead extends more than three times the width of the wire outside of a connector hole chamfer (top side) or counter bore pin indent (bottom side). Missing buttons 23. Record on the system log sheet on the system the date that this MCM has been removed. 868 406/90 Service Guide Returning MCM Modules: The following procedure provides handling instructions for MCM modules to be packaged and returned to the plant of manufacture. 1. If you are replacing an MCM, use the packaging from the new module to return the old module. If the packaging is not available, order the following parts for MCM or pass through protective packaging: v 1 Black Plastic Case (Top and Bottom) 02P3750 v 1 MCM Heat Sink Cap - 7335949 or 7335963 (not required for pass through) v 1 Corrugated Carton - 7335743 (carton and foam assembly including: 1 Bottom Foam Cushion, 1 Top Foam Cushion, 1 Desiccant (Optional for return packaging) v 1 ESD Bag - 7352456 v 1 Foam Sheet - 73329792. Inspect the base of packaging part number 02P3750 and verify to see if an ?AA? is labeled. 3. If the base has an ?AA,? place the MCM or pass through module in the package base. If the base does not have an ?AA,? place one sheet of thin pink foam (part number 7332979) over the base and place the MCM module in the package base. 4. Place top adapter foam piece (part number 7335949) on the heat sink of the MCM module (not required for pass through module). 5. Complete the inner package by placing the top cover over the module and onto the base. Ensure the cover is secured to the base. 6. Reuse ESD bag part number 7352456 and place the packaged module inside the bag along with the desiccant. Seal with tape. The desiccant is optional for returned parts. 7. Place this MCM or pass through assembly in the foam cushion packaging carton (part number 7335743). 8. Seal the carton and ship. Chapter 9. Removal and Replacement Procedures 869 Replacement or New Installation Note: This procedure is to be performed only by trained service personnel. 1. Go to “Methods for Testing MCMs and Pass Through Modules for a Short Circuit” on page 872 and install the MCM/L3 module short-circuit test tool. Use the test tool during the MCM replacement procedure to ensure that the MCMs are installed without a short circuit. If you install an MCM and encounter a short circuit, the MCM you installed must be replaced. 2. If installing a new MCM, insert the new actuation bolt supplied with the new MCM module and move the actuation bolt and spring assembly to the up position and hand-tighten to hold the assembly in the up position. Perform a general cleaning operation, using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for the part numbers), to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris the brush generates. 3. Loosen the captive bolt in the center of the protective cover and remove the cover from the new MCM module, as shown in the following illustration. 4. If you are installing MCMs with the aluminum heatsink, go to substep 5. If you are installing MCMs with the copper heatsink, do the following:Note: The field restocking plan, is to replace the 1.7 GHz copper heatsink MCM (factory installed) with a 1.7 GHz aluminum heatsink MCM. a. Ensure that the stiffener located on the backplane is aluminum and not plastic. b. Insert the two steel alignment rods into the aluminum backplane stiffener. c. Ensure that the locking cams located on the aluminum backplane stiffener are in the open position d. Grasp the module across the narrow span of the heatsink cap. Align the corner chamfer (flat corner) on the MCM module or the pass through to match the outline on the backplane. e. Align the two steel alignment rods with the clearance holes located on each side of the MCM. f. Insert the MCM clearance holes over the two steel alignment rods and slide the MCM module into the socket, and maintain pressure while closing both locking cams. g. Use an 5-mm locking socket wrench to secure the locking cams over the MCM. h. Go to substep 6. 5. Grasp the module across the narrow span of the heatsink cap. Align the corner chamfer (flat corner) on the MCM module or the pass through to match the outline on the backplane. Attention: Verify that the guide pins protrude above the connectors.Carefully push the MCM module into the socket. If your system has the plastic backplane stiffener, use an 8-mm locking socket wrench to secure the locking cams over the MCM. Be sure to maintain pressure on the MCM when securing both locking cams. 6. Loosen the actuation bolt at the rear of the MCM module. 7. Push the actuation bolt in and slide the spring plate assembly to the down position. 870 406/90 Service Guide 8. Use the torque wrench (supplied with the system tool kit, see “Tools” on page 1033 for the tool part number) to tighten the actuation bolt. When the wrench clicks, followed by about 15 degrees of free motion, the bolt is seated. 9. If you are installing MCMs with the copper heatsink, remove the two steel alignment rods and tighten the two captive heatsink bushing screws with a 5 mm socket wrench. Attention: Do not over-torque the bolt. Check the MCM/L3 short-circuit test tool to ensure that short circuit was not created when the MCM was installed. If this is the last MCM you are installing, remove the MCM/L3 module short-circuit test tool and meter. 10. Begin the above procedure again until all MCM modules have been replaced.Note: Remove and install the MCMs with the copper heatsink one at a time, completing the above substeps before servicing another MCM. 11. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854) and replace the VPD card that was provided with the MCM module.L3CUoD 12. Replace the primary I/O book. 13. Replace the DCAs and DCA fillers that you removed earlier. See “Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks” on page 861. 14. Replace the fan chassis. See “Fan Chassis” on page 912. 15. If an MCM or L3 cache module was just added or replaced, go to the service processor menu and select System Information Menu, then select MCM/L3 Interposer Plug Count Menu. If the MCM/L3 Interposer Plug Count Menu is not available, the system firmware level is too old, and the MCM plug count must be recorded as described in “Installing the MCM/L3/Passthrough Plug Map” on page 853 and “Using this Label” on page 853. You can also upgrade the system firmware to the latest level to make the MCM/L3 Interposer Plug Count Menu available, then continue with 16 on page 872. Chapter 9. Removal and Replacement Procedures 871 16. Find the index number that corresponds to the module that was just added or replaced and enter the new plug count that was included in the FRU or MES kit. If the plug count is not included with the FRU, enter the number 7 as the new plug count to account for the number of times the module was plugged during the manufacturing process. If the MCM/L3 was replaced from field spares, enter the number 8. Exit the menu completely before powering on the system. 17. Turn on power as described in “Powering the System On” on page 841.Note: The 4-way processor MCM FRU can have odd-numbered processors or even-numbered processors. If you have just replaced a 4-way MCM module, notify the customer that they can determine their processor numbering by issuing the command lsdev -C on the partition that is using the new processors. If the customer finds that they have new available processors, and four processors in the defined state, they can remove the defined processors by using the rmdev command, and then use the cfgmgr command. Methods for Testing MCMs and Pass Through Modules for a Short Circuit The following testing methods are available for use depending on your testing hardware and system requirements. Use the following list to determine which of the testing methods you should use: Method 1 — Multimeter and Original Capacitor Book (part number 11P1540) Use this testing method if you have a handheld multimeter and the original capacitor book. Use this method only when Method 2 and Method 3 are unavailable. Refer to “Method 1. Testing MCMs and Pass Through Modules for a Short Circuit” on page 873. Method 2 — Multimeter and New Capacitor Book (part number 44P2471) Use this testing method if you have a handheld multimeter and a new capacitor book, or any capacitor books with the necessary LGA test-support features. Refer to “Method 2. Testing MCMs and Pass Through Modules for a Short Circuit” on page 876. Method 3 — New Capacitor Book (part number 44P2471) and LGA Shorts Tester (LGA-ST) (part number 44P4107) This method is required for systems with a 1.5 GHz MCM or higher systems. To perform this procedure, you must have a new capacitor book (or any capacitor books with the necessary LGA test-support features), and the new LGA-ST. Refer to “Method 3. Testing MCMs and Pass Through Modules for a Short Circuit” on page 879. 872 406/90 Service Guide Method 1. Testing MCMs and Pass Through Modules for a Short Circuit Performing this test ensures that a short circuit was not created when installing an MCM or a pass through module. Follow the procedure to install and use the MCM/L3 module short-circuit test tool (part number 44P0209) and a multimeter during MCM replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing these procedures.MCM Installation Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter 9. Removal and Replacement Procedures 873 MCM Installation Shorts Test, Step 2: * See the Fail Criteria value in “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 874 406/90 Service Guide MCM Installation Shorts Test, Step 3: BAre there anyL3 cache modulesto install?YesRemove MeterNORemoveThe Meter &Test CardReinstall the DCAs and reassemblethe systemPower On the System and Perform aslow bootWere anyfaults detected?YesNOReturn System toCustomerUpdate the historyLog on SFPFollow maintenancepackage to isolatefault.If the problempersists contactsupportFollow the L3 cache modulereplacement and and testprocedures *Fold the paper with therecorded data and placeinside the plug trackingchart and then fold thechart * See “L3 Cache Modules” on page 882. Chapter 9. Removal and Replacement Procedures 875 Method 2. Testing MCMs and Pass Through Modules for a Short Circuit Performing this test ensures that a short circuit was not created when installing an MCM or a pass through module. Follow the procedure to install and use a multimeter and capacitor book (part number 44P2471 or later) during MCM replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing these procedures.MCM Installation Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 876 406/90 Service Guide MCM Installation Shorts Test, Step 2: * See the Fail Criteria value in “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter 9. Removal and Replacement Procedures 877 MCM Installation Shorts Test, Step 3: * See “L3 Cache Modules” on page 882. 878 406/90 Service Guide Method 3. Testing MCMs and Pass Through Modules for a Short Circuit You must test the LGA-ST unit before you can begin testing the MCM and pass through modules for a short circuit with the following steps: 1. Plug the LGA-ST test head on the test box into the 9-pin female self-test D-shell, located on the LGA-ST control box, which looks similar to the following: 1 Tester D-shell 5 Toggle switch 2 Test box 6 Self-test D-shell 3 Jack screw 7 Battery access 4 Control box 2. Perform the following steps to ensure that the LGA-ST battery is charged and that the LGA-ST is functioning properly to detect shorts: a. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Forward (to the MCM/L3 PWR/GND position) Off On Off On Off 1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. b. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for the valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Backward (to the L3 PWR/PWR position) Off Off Off On Off 1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. Chapter 9. Removal and Replacement Procedures 879 c. Power off the system. d. Set UEPO to OFF. e. Disengage all DCAs from the back plane. f. Plug the LGA-ST Tester D-Shell into the Capacitor card 9-pin D-Shell and tighten the screws. g. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Forward (to the MCM/L3 PWR/GND position) On Off On Off Off 1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. h. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Backward (to the L3 PWR/PWR position) Off Off On Off Off 1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. 880 406/90 Service Guide 3. Install an MCM or pass through module. 4. Press the toggle switch on the LGA-ST forward to the MCM/L3 PWR/GND position. This action will test the MCM or pass through modules for power-to-ground shorts. Observe the lit LEDs on the LGA-ST and take the corresponding following actions: Green MCM LED is lit This is a pass state. No further action is required. Amber MCM LED is lit This is a fail state. There might be a short across the MCM or pass through voltage and ground. Complete the following steps to correct this condition: a. Remove the MCM or pass through module and inspect the interposer for shorts. b. Replace the MCM or pass through module and repeat step 4. If the fail state persists, replace the MCM or pass through.Green L3 LED is lit This is a pass state. No further action is required. Amber L3 LED is lit This is a fail state. There might be a short between one of the two L3 voltages and ground, which is a failing condition, or the combination of memory books, I/O books, and L3 cache modules in parallel might be creating a low resistance. To ensure that no shorts are present on the L3 modules before power the system on, perform “Method 3. Testing L3 Cache Modules for a Short Circuit” on page 898.5. Install another MCM or pass through module and repeat step 4 until all required MCMs are installed. Chapter 9. Removal and Replacement Procedures 881 L3 Cache Modules Handling Static-Sensitive Modules Attention: MCM and L3 modules are sensitive to static electricity discharge. These devices are shipped in antistatic containers to prevent damage caused by electrostatic discharge. Take the following precautions: v Use an antistatic wrist strap while handling the device. v Do not remove the device from the antistatic container until you are ready to install the device in the system unit. v With the device still in its antistatic container, touch it to a metal frame of the system. v Grasp modules by the heatsink. Avoid touching the interposer buttons. v If you need to lay the device down while it is out of the antistatic container: – Lay the MCMs with the posts down on the antistatic mat. – Lay the L3s with the LGA connector down in the antistatic tray.Before picking it up again, touch the antistatic container and the metal frame of the system unit at the same time. v Handle the devices carefully in order to prevent permanent damage. Attention: Before you remove or replace any L3 cache module, stop, read and understand this entire procedure. Also, check to ensure that the MCM/L3 Label Map is installed, refer to “Installing the MCM/L3/Passthrough Plug Map” on page 853. Attention: The L3 cache modules in this system can be attached to the backplane a maximum of three times. If an L3 cache module is removed for a fourth time, it must be replaced with a new L3 cache module. Check the MCM/L3 Label Map to determine how many times the module has been removed before. New L3 cache modules are shipped with a new MCM/L3 plug count adhesive replacement label. When you install a new L3 cache module, place the new label in the correct position on the MCM/L3 Label Map. If a new MCM/L3 Label Map is needed, see “Tools” on page 1033 for replacement label part number. Removal Attention: If more than one module is to be replaced or reseated, begin with the uppermost module, replace it and then go to the next highest position module, until the lowermost module has been replaced or reseated. 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis. See “Fan Chassis” on page 912. 3. Perform a general cleaning operation using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for tool part numbers) to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris the brush generates. 4. Note the location of the module being removed. Note: Arrange the four 4-position trays on the ESD mat in a pattern that corresponds to the locations of the L3 cache modules on the system backplane. Attention: Each L3 cache module must be returned to the location from which it was removed.Check the MCM/L3 Label Map on the system to confirm the number of times the target module has been unplugged. If it has been unplugged three times, order a new module. If this is the first removal/reseat for the module, record the serial number of the module and the date of the action. 882 406/90 Service Guide 5. Use a 5-mm nut driver to loosen the set screw in the center of the L3 module spring plates as shown in the following illustration..165432 1 Protective Cover 4 Rear Spring Plate 2 L3 Cache Module and Heatsink 5 Front Spring Plate with Set Screw 3 Load Posts 6 L3 Filler (Used if no L3 cache module is installed) 6. Slide both plates up and pull them away from the L3 cache module. 7. Remove the sleeves from the posts. 8. Use a disposable lint-free cloth to clean the sleeves. 9. Pull the L3 cache module straight back off the posts, and place it with the interposer, facing down in the four-position tray from the tool kit. 10. Clean the LGA site using the brush supplied with the new module.Note: Before you remove the brush from its protective bag, loosen the bristles by pressing them on the ESD mat until they are 90 degrees from the handle. Chapter 9. Removal and Replacement Procedures 883 Attention: Each time an L3 cache module is removed from the system backplane, the LGA site where the module connects must be cleaned. With the brush supplied with the replacement kit, use only downward strokes, starting in the upper left corner. Move left to right, top to bottom, using overlapping strokes as shown in the following illustration. 1122553344................ The brush must be discarded after cleaning four L3 sites. 11. Each time an L3 cache module is installed, inspect the interposer before installation. The associated interposer must be inspected, using a magnifying glass. The interposer is not acceptable and the module must be returned if the either of the following conditions occur: Shorts If the button, a loop of the button wire, or an individual button wire lead extends more than three times the width of the wire outside of a connector hole chamfer (top side) or counter bore pin indent (bottom side). Missing buttons 12. Record on the system log sheet the date that this L3 cache module was removed. 884 406/90 Service Guide Returning L3 Modules: The following procedure provides handling instructions for L3 modules to be packaged and returned to the plant of manufacture. 1. If you are replacing an L3 module, use the packaging from the new module to return the old module. If the packaging is not available or does not include all the parts described below, order the following part for L3 module protective packaging: v 1 L3 protective package part number 7335692, which includes: 1 carton, 1 top foam cushion, 1 bottom foam cushion with 4 cavities, 1 ESD bag, 1 desiccant)2. Place bottom foam cushion, wrapped with ESD Bag, inside the corrugated carton. Leaving the ESD bag open, place the L3 module with the cover into the bottom cushion, heatsink down. This will fit snugly into the cavity of the foam. 3. Add the desiccant and seal the ESD bag with tape. The desiccant is optional for returned parts. 4. Place the top foam cushion over the assembly inside the carton. 5. Seal the carton and ship.Replacement or New Installation 1. Go to “Methods for Testing L3 Cache Modules for a Short Circuit” on page 887 and install the MCM/L3 module short-circuit test tool. Use the test tool during the L3 cache module replacement procedure to ensure that the L3 cache modules are installed without a short circuit. If you install an L3 cache module and encounter a short circuit, the L3 cache you installed must be replaced. 2. If installing a new L3 cache module, perform a general cleaning operation, using a sash brush, part number 450732 (not the one included with the FRU kit) and vacuum cleaner to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris the brush generates, then remove the L3 filler from the site where the new L3 cache module is to be located. 3. Loosen the captive thumbscrews on the protective cover, and remove the cover from the new L3 cache module as shown in the following illustration. 4. Align the flat corner on the L3 cache module to match the outline on the system backplane. Carefully slide the L3 cache module straight onto the posts until it meets the system backplane. 5. If you are installing a new L3 cache module, replace the sleeves that you removed earlier with the new sleeves provided with the L3 cache module kit. If you are reseating an existing module, use a lint-free cloth to wipe any dust from the sleeves that you removed earlier. Chapter 9. Removal and Replacement Procedures 885 6. Be sure to align the notches at the upper right corner of each spring plate as shown in the following illustration. Replace the spring plates that you removed earlier with the new spring plates provided with the L3 cache module kit. 1 Notch on upper right corner 2 5-mm set screw 3 Reinforced center 7. Use your 5-mm nut driver to tighten the set screw in the center of the L3 cache module spring plates until the screw is seated. Check the MCM/L3 module short-circuit test tool to ensure that short circuit was not created when the L3 cache module was installed. If this is the last L3 cache module you are installing, remove the MCM/L3 module short-circuit test tool and meter. 8. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854) and replace the VPD card that was provided with the L3 cache module.L3CUoD 886 406/90 Service Guide 9. Replace the primary I/O book. 10. Replace the fan chassis. See “Fan Chassis” on page 912. 11. Turn on power as described in “Powering the System On” on page 841. 12. If an MCM or L3 cache module was just added or replaced, go to the service processor menu and select System Information Menu, then select MCM/L3 Interposer Plug Count Menu. If the MCM/L3 Interposer Plug Count Menu is not available, the system firmware level is too old, and the MCM plug count must be recorded as described in “Installing the MCM/L3/Passthrough Plug Map” on page 853 and “Using this Label” on page 853. You can also upgrade the system firmware to the latest level to make the MCM/L3 Interposer Plug Count Menu available, and then continue with 13. 13. Find the index number that corresponds to the module that was just added or replaced and enter the new plug count that was included in the FRU or MES kit. If the plug count is not included with the FRU, enter the number & as the new plug count to account for the number of times the module was plugged during the manufacturing process. If the MCM/L3 was replaced from field spares, enter the number 8. Exit the menu completely before powering on the system.Methods for Testing L3 Cache Modules for a Short Circuit The following testing methods are available for use depending on your testing hardware and system requirements. Use the following list to determine which of the testing methods you should use: Method 1. Multimeter and Original Capacitor Book (part number 11P1540) Use this testing method if you have a handheld multimeter and the original capacitor book. Use this method only when Method 2 and Method 3 are unavailable. Refer to “Method 1. Testing L3 Cache Modules for a Short Circuit” on page 888. Method 2. Multimeter and New Capacitor Book (part number 44P2471) Use this testing method if you have a handheld multimeter and a new capacitor book, or any capacitor books with the necessary LGA test support features. Refer to “Method 2. Testing L3 Cache Modules for a Short Circuit” on page 893. Method 3. New Capacitor Book (part number 44P2471) and LGA Shorts Tester (LGA-ST) (part number 44P4107) This method is required for systems with a 1.5 GHz MCM or higher systems. To perform this procedure, you must have a new capacitor book, (or any capacitor books with the necessary LGA test support features), and the new LGA-ST. Refer to “Method 3. Testing L3 Cache Modules for a Short Circuit” on page 898. Chapter 9. Removal and Replacement Procedures 887 Method 1. Testing L3 Cache Modules for a Short Circuit Performing this test ensures that a short circuit was not created when an L3-cache module is installed. Follow this procedure to install and use the MCM/L3 module short-circuit test tool (part number 44P0209) and a multimeter during L3-cache module replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing this procedure.L3 Cache Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 888 406/90 Service Guide L3 Cache Shorts Test, Step 2: Enter recorded Fail Criteria *, MLR, 1.9vL3TR in theequation below1.9vL3TR - MLR > Fail CriteriaIs the left Side ofthe equation Greaterthan the Right Side?NOYesRecord Resistance.Call this resistance" 1.9v L3Total Resistance"( 1.9vL3TR)Is the left Sideof the equationgreater than 1Ohm?NOYesInspect Interposerfor shortsAre ShortsVisible?YesReject the L3cache moduleReinstall theL3 cache moduleHas this L3been removed &reinstalledNOYesCNOConnect Meter"V" toTest Card "1.9V L3 LGA" (Blue Jack)Meter "COM" toTest Card either "COMMON RET" (Black Jack)Install another L3cache module[U1.18-P1-Cx]Examples:0.10 > 0.40 Follow NO (Fail)0.90 > 0.40 FollowYES (Pass)Remove L3cache moduleNODYesAre all thememory & I/Odisengaged?DO NOT POWERONContact NextLevel Of SupportAre allThe MemoryDisengaged?Disengage A Memory BookMemory are removed one at a timeNODisengage One I/O BookPrimary I/O Book FirstSecondary I/O Books secondYes Chapter 9. Removal and Replacement Procedures 889 L3 Cache Shorts Test, Step 3: * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. 890 406/90 Service Guide L3 Cache Shorts Test, Step 4: Chapter 9. Removal and Replacement Procedures 891 L3 Cache Shorts Test, Step 5: 892 406/90 Service Guide Method 2. Testing L3 Cache Modules for a Short Circuit Performing this test ensures that a short circuit was not created when an L3-cache module is installed. Follow this procedure to install and use the capacitor book (part number 44P2471) and a multimeter during L3-cache module replacement. Note: This procedure is to be performed by trained personnel only. Ensure the UEPO switch is in the Off position before performing this procedure.L3 Cache Shorts Test, Step 1: Attention: Be sure to record the resistance readings taken during these tests so that you can refer to them if needed. * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter 9. Removal and Replacement Procedures 893 L3 Cache Shorts Test, Step 2: 894 406/90 Service Guide L3 Cache Shorts Test, Step 3: * See “Shorts Test Qualified Service Meters and Fail Criteria” on page 900. Chapter 9. Removal and Replacement Procedures 895 L3 Cache Shorts Test, Step 4: 896 406/90 Service Guide L3 Cache Shorts Test, Step 5: Chapter 9. Removal and Replacement Procedures 897 Method 3. Testing L3 Cache Modules for a Short Circuit You must test the LGA-ST unit before you can begin testing the MCM and pass through modules for a short circuit with the following steps: 1. Plug the LGA-ST test head on the test box into the 9-pin female self-test D-shell, located on the LGA-ST control box, which looks similar to the following: 1 Test head 5 Toggle switch 2 Test box 6 Self-test D-shell 3 Jack screw 7 Battery access 4 Control box 2. Perform the following steps to ensure that the LGA-ST battery is charged and that the LGA-ST is functioning properly to detect shorts: a. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Forward (to the MCM/L3 PWR/GND position) Off On Off On Off 1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. b. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for the valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Backward (to the L3 PWR/PWR position) Off Off Off On Off 1. If Low Battery LED is on, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, the LGA-ST is defective. Replace the unit before proceeding. 898 406/90 Service Guide c. Power off the system. d. Set UEPO to OFF. e. Disengage all DCAs from the back plane. f. Plug the LGA-ST Tester D-Shell into the Capacitor card 9-pin D-Shell and tighten the screws. g. Place the toggle switch into the MCM/L3 PWR/GND position and observe the lit LEDs. Refer to the following table for valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Forward (to the MCM/L3 PWR/GND position) On Off On Off Off 1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. h. Place the toggle switch into the L3 PWR/PWR position and observe the lit LEDs. Refer to the following table for valid LED states: Toggle Switch Position MCM Green LED MCM Amber LED L3 Green LED L3 Amber LED Battery Low Amber LED Action Backward (to the L3 PWR/PWR position) Off Off On Off Off 1. If Low Battery LED is lit, or if no LEDs are lit, replace the battery before proceeding. 2. If the battery appears to be functioning and the LEDs are not lit as shown in this table, disengage the Capacitor book from the system. If the LED states are not as shown in this table after the Capacitor book is disengaged, the LGA-ST is defective. Replace the unit before proceeding. 3. Install an L3 module. Chapter 9. Removal and Replacement Procedures 899 4. Press the toggle switch on the LGA-ST forward to the MCM/L3 PWR/GND position. This action will test the L3 modules for power-to-ground shorts. Observe the lit LEDs on the LGA-ST and take the corresponding following actions: Green L3 LED is lit This is a pass state. No further action is required. Amber L3 LED is lit This is a fail state. There might be a short between one of the two L3 voltages and ground, which is a failing condition, or the combination of memory books, I/O books, and L3 cache modules in parallel might be creating a low resistance. Complete the following steps to correct this condition: a. Disengage memory books followed by I/O books one at a time. b. Repeat step 2a on page 898 through step 2h on page 899 until either the L3 test results in a pass state, or all memory and I/O are disengaged and the L3 test remains in a fail state. c. If the L3 module test remains in a fail state after all memory and I/O are disengaged, remove the L3 module and inspect the interposer for shorts. d. Reinstall the L3 module once and repeat the test. If the fail persists, replace the L3 module.Green MCM LED is lit This is a pass state. No further action is required. Amber MCM LED is lit This is a fail state. There might be a short across the MCM voltage and ground. Perform the “Method 3. Testing MCMs and Pass Through Modules for a Short Circuit” on page 879 and correct any short conditions before powering on.5. Place the toggle switch into the L3 PWR/PWR position. Green and Amber MEM LEDs are not lit Power is removed from these circuits when the switch is in the L3 PWR/PWR position. Green L3 LED is lit This is a pass state. No further action is required. Amber L3 LED is lit This is a fail state. There is a short between the two L3 power domains. Complete the following steps to correct this condition: a. Remove the L3 module and inspect the interposer for shorts. b. Reinstall the L3 module once and repeat the test. If the fail condition persists, replace the L3 module.6. Install another L3 module and repeat step 3 on page 899 through step 5 until all required L3 modules are installed.Shorts Test Qualified Service Meters and Fail Criteria Manufacturer Model Range Accuracy @ 1.0 ohms or less Fail Criteria Fluke 8060A 200 Ohms +/-0.04 +0.40 Fluke 8060A/AA 200 Ohms +/-0.04 +0.40 Fluke 8062A 200 Ohms +/-0.04 +0.40 Fluke 187 500 Ohms +/-0.10 +0.40 Fluke 189 500 Ohms +/-0.10 +0.40 Fluke 83 (original models) 400 Ohms +/-0.10 +0.40 Fluke 85 (original models) 400 Ohms +/-0.10 +0.40 900 406/90 Service Guide Manufacturer Model Range Accuracy @ 1.0 ohms or less Fail Criteria Fluke 87 (original models) 400 Ohms +/-0.10 +0.40 Fluke 87-4 500 Ohms +/-0.10 +0.40 Fluke 89-4 500 Ohms +/-0.10 +0.40 +/-0.20 Accuracy Fluke 10 400 Ohms +/-0.20 +0.50 Fluke 11 400 Ohms +/-0.20 +0.50 Fluke 110 600 Ohms +/-0.20 +0.50 Fluke 111 600 Ohms +/-0.20 +0.50 Fluke 112 600 Ohms +/-0.20 +0.50 Fluke 12 400 Ohms +/-0.20 +0.50 Fluke 12B 400 Ohms +/-0.20 +0.50 Fluke 16 400 Ohms +/-0.20 +0.50 Fluke 175 600 Ohms +/-0.20 +0.50 Fluke 177 600 Ohms +/-0.20 +0.50 Fluke 179 600 Ohms +/-0.20 +0.50 Fluke 18 400 Ohms +/-0.20 +0.50 Fluke 21-2 320 Ohms +/-0.20 +0.50 Fluke 21-3 320 Ohms +/-0.20 +0.50 Fluke 23-2 320 Ohms +/-0.20 +0.50 Fluke 26-3 400 Ohms +/-0.20 +0.50 Fluke 29-2 400 Ohms +/-0.20 +0.50 Fluke 70-2 320 Ohms +/-0.20 +0.50 Fluke 70-3 320 Ohms +/-0.20 +0.50 Fluke 73 320 Ohms +/-0.20 +0.50 Fluke 73-2 320 Ohms +/-0.20 +0.50 Fluke 73-3 320 Ohms +/-0.20 +0.50 Fluke 75 320 Ohms +/-0.20 +0.50 Fluke 75-2 320 Ohms +/-0.20 +0.50 Fluke 75-3 320 Ohms +/-0.20 +0.50 Fluke 76 400 Ohms +/-0.20 +0.50 Fluke 77 320 Ohms +/-0.20 +0.50 Fluke 77-2 320 Ohms +/-0.20 +0.50 Fluke 79-2 400 Ohms +/-0.20 +0.50 Fluke 79-3 400 Ohms +/-0.20 +0.50 Fluke 83 (with CAT III front panel marking) 400 Ohms +/-0.20 +0.50 Fluke 83-3 400 Ohms +/-0.20 +0.50 Fluke 85 (with CAT III front panel marking) 400 Ohms +/-0.20 +0.50 Fluke 85-3 400 Ohms +/-0.20 +0.50 Chapter 9. Removal and Replacement Procedures 901 Manufacturer Model Range Accuracy @ 1.0 ohms or less Fail Criteria Fluke 87 (with CAT III front panel marking) 400 Ohms +/-0.20 +0.50 Fluke 87-3 200 Ohms +/-0.20 +0.50 Capacity Upgrade on Demand (CUoD) VPD Module Use the following instructions to replace the CUoD VPD module. This action should only be taken if the VPD module is listed as a failing FRU. For more information about FRUs and error codes, see Chapter 5, “Error Code to FRU Index,” on page 413. 1. Power off the system and ensure standby power has been removed by moving the red switch to the OFF position. 2. Remove the primary I/O book (for more information, see “I/O Books and I/O Blanks” on page 854) and open the cover to expose the VPD cards. 3. Remove and replace the CUoD VPD card, which is located next to the installed L3 VPD modules, with the procedures in “VPD Card” on page 904. 4. Replace the primary I/O book in the system and connect the cables with the procedures in I/O Books and I/O Blanks “Replacement” on page 856. 5. Power on the system. 6. Move the red switch into the ON position. 7. Contact service support to obtain the correct processor activation codes for the customer. v If the customer is entitled to any processor activations, proceed to step 9. v If the customer is not entitled to any processor activations, service support will give you the zero activation code. Complete step 8. 8. After you complete this step, the procedure is complete for this customer. a. Right-click on the name of the managed system in the Contents area of the Web-based System Manager window. The name of the system displays as the system’s serial number. b. From the menu that displays, select the following options, in the following order: 1) Capacity Upgrade on Demand 2) Processor 3) Processor Activate The Activate CUoD Processors window displays. c. Enter the 34-character string of zeros (0s) into the Activation Code field. d. Reboot the system and return control of the system to the customer. 9. Service support will ask for system information to generate an activation code. To access this information, complete the following steps: a. Right-click on the name of the managed system in the Contents area of the Web-based System Manager window. The name of the system displays as the system’s serial number. b. From the menu that displays, select the following options, in the following order: 1) Capacity Upgrade on Demand 2) Processor 3) Processor Capacity Settings The Processor Capacity Settings window displays. 902 406/90 Service Guide c. Click the Display Order Information button in the Processor Capacity Settings window. This will display the Processor Order Information panel, which appears similar to the following: d. Dictate the contents of each field on this panel to service support.10. After service support generates the activation code, go to the Activate CUoD Processors menu by completing the following steps: a. Right-click the name of the managed system in the Contents area of the Web-based System Manager window. The name of the system displays as the system’s serial number. b. From the menu that displays, select the following options, in the following order: 1) Capacity Upgrade on Demand 2) Processor 3) Processor Activate The Activate CUoD Processors window displays.11. Type the 34-character activation code that service support dictates to you into the Activation Code field. 12. Return to the Processor Capacity Settings window by completing steps a and b in 9 on page 902. Verify that the number of processors shown in this window match the number of processors service support tells you the customer is entitled to. v If the number of processors match, reboot the system and return control of the system to the customer. v If the number of processors does not match, return to the Processor Order Information panel and verify the information with service support. Chapter 9. Removal and Replacement Procedures 903 VPD Card Notes: 1. The MCM module and its associated VPD card are a single FRU and must be replaced as a set. 2. There is only one VPD card for all of the L3 modules. Replacing any L3 module requires replacing the associated VPD card. The VPD cards are located in the primary I/O book. Removal 1. Remove the primary I/O book (see “I/O Books and I/O Blanks” on page 854). 2. Use a T10 torx screw to remove the short cover near the connectors. 3. Locate the VPD card that must be replaced.L3CUoD 4. Remove the VPD card for the associated MCM or L3 modules.Replacement Replace in reverse order with the VPD card that was provided with the new MCM or L3 modules. 904 406/90 Service Guide System Clock Card Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis. See “Fan Chassis” on page 912. 3. Squeeze the tabs on the clock card, and pull the card straight out as shown in the following illustration. 1 Processor Backplane 2 System Clock Card 4. Clean the system board as described at the start of this procedureReplacement Note: Be sure to align the guide feature on the card with the guide feature on the system board. 1. Squeeze the tabs on the clock card and push the card straight in. 2. Replace the fan chassis. See “Fan Chassis” on page 912. 3. Turn on power as described in “Powering the System On” on page 841. Chapter 9. Removal and Replacement Procedures 905 Aluminum Backplane Stiffener Use the following procedure when populating the system with MCM processors that employ the use of copper heatsinks, or if you will be replacing a system backplane prior to November 2003. Removal 1. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the fan chassis as described in“Fan Chassis” on page 912. 3. Perform a general cleaning operation, using a sash brush, (not the one included with the FRU kit) and vacuum cleaner (see “Tools” on page 1033 for the tool part numbers), to remove any accumulated dust and debris that has settled in the processor subsystem. Hold the vacuum hose near the brush to remove any debris that the brush generates. 4. Remove the system clock card as described in “System Clock Card” on page 905 5. Remove the MCM modules as described in “MCM Module (Processor) and Pass Through Modules” on page 863. 6. Remove all LE Modules from the backplane as described in “L3 Cache Modules” on page 882. 7. Using a 5.5 mm deep well socket wrench (part number 39F8452), remove the eight retaining screws that secure the aluminum stiffener to the system backplane. See the following illustration for screw locations. Notes: a. Due to the zero clearance between the aluminum stiffener and the sides of the processor drawer, you must have the 5.5 mm deep well socket to loosen and remove the two outer-side retaining screws. b. (Service Tip) When removing the aluminum stiffener, remove the outer-top retaining screw last. When installing or replacing the aluminum stiffener insert the outer-top retaining screw first. 906 406/90 Service Guide 8. Carefully remove the aluminum stiffener and put it in a safe place. 1 System Backplane 5 Clock Card 2 L3 Cache Assembly 6 MCM pass through Module 3 Aluminum Backplane Stiffener 7 MCM with VPD Card Kit 4 MCM Alignment Rods 8 Aluminum Backplane Stiffener Retaining Screws (quantity 8) Replacement To replace the aluminum stiffener follow the removal procedure in reverse order. Chapter 9. Removal and Replacement Procedures 907 Processor Subsystem Chassis CAUTION: This unit weighs more than 55 kg (121.2 pounds). Material handling systems such as levers, slings, or lifts are required to safely move it. When this is not possible, specially trained persons or services (such as riggers or movers) must be used.C06 Three people are required to handle this operation: one person to push the chassis, and two people to pull the chassis. Removal Attention: All components must be replaced to the location from which they were removed. Before replacing processor subsystem chassis (backplane), check with your support center to ensure you have been directed to replace the processor subsystem chassis (backplane) by a valid error code in Chapter 5, “Error Code to FRU Index,” on page 413. Attention: This process requires: v A safe area reserved to store parts that are removed from the system during this procedure. v The use of a lift tool (part number 09P2481) and lift-plate assembly tool (part number 11P4369). v The following components must be removed from the processor subsystem chassis to reduce its weight to the minimum. – DCA Books – Capacitor Books – I/O Books – Memory Books – Processor Fan Assemblies – MCM(s) (remove the MCMs only if the chassis is to be replaced) – L3 Cache (remove the L3 cache only if the chassis is to be replaced) – System Clock Cardv The lift tool wheels must be chocked to prevent it from moving during the operation. The frame must not be on its casters, and the leveling pads must be engaged to prevent the frame from moving during the operation. 1. Read all safety instructions before performing this operation. Unpack the lift tool and follow assembly instructions included with the tool. Inspect the tool for damage. Do not perform the removal procedure with a damaged lift tool. Replace the blue lift plate with the orange lift plate (part number 11P4369). Orient the two round holes in the lift plate to the side of the lift tool that will be closest to the frame. 2. Turn off power as described in “Powering Off and Powering On the System” on page 838. 3. Open the rack doors and remove them from the rack. 4. Verify that all cables are labeled. If not, label all cables and components along with their location at the front and rear of the system. 5. Unplug all power cables from the line power source. Attention: Because this procedure is performed with power removed from the system, references to using the white or green service buttons on the UEPO switch do not apply during the remainder of this procedure. 6. Unplug any cables attached to the I/O books and processor subsystem DCAs. 7. At the back of the system, remove the safety bars from the I/O books and blanks, using an allen wrench on the captive screws. 8. Remove the I/O books (and blanks if applicable). See “I/O Books and I/O Blanks” on page 854. 9. Remove the DCAs and capacitor books (and blanks if applicable). See “Processor Subsystem DCA (Distributed Converter Assembly) Books and DCA Blanks” on page 861. 10. At the front of the system, remove the safety bars from the memory books and blanks. 908 406/90 Service Guide 11. Remove the memory books (and blanks if applicable). See “Memory Books and Memory Blanks” on page 910. 12. Remove the four fan assemblies. See “Processor Fan Assemblies” on page 912. 13. Remove the fan chassis. See “Fan Chassis” on page 912. 14. Remove the MCM modules. See “MCM Module (Processor) and Pass Through Modules” on page 863. 15. Remove the L3 modules. See “L3 Cache Modules” on page 882. 16. Remove the clock card. See “System Clock Card” on page 905. 17. Remove the media chassis. See “Media Subsystem Chassis” on page 1001. 18. Remove two frame extender mounting bolts from EIA position 17 (10-mm bolt). 19. Remove the UEPO switch. See “Unit Emergency Power Off (UEPO) Card Assembly” on page 850. 20. Using an 8-mm extension socket, remove the screws attaching the hold-down bracket to the vertical EIA rails at the rear of the system. Remove the bracket and store it in a safe place.Note: The rear screws are used to mount the hold-down bracket. 21. Using an 8-mm extension socket, remove the screws attaching the chassis to the vertical EIA rails at the front of the system. 22. Position the lift tool at the front of the rack, and ensure that the extender arms on the lift plate are on the side facing the frame. Align the lift tool such that processor subsystem chassis will not be blocked (at the lip of the orange lift plate) from removal. 23. Raise the plate to a height such that the lift plate is just below the base of the processor subsystem chassis. Release the pins on the lift plate, and fully extend the center portion of the plate toward the processor subsystem chassis and lock it in place. Release pins on the extender arms and position them on both left and right side, arms must fit around the frame extenders and lock in place (do not adjust the lift tool once these arms are locked in place). Engage the lift tool brake. Install wheel chocks on lift tool casters.Note: When you attach the arms to the frame, ensure that any cables are clear of the extender arms. 24. One person should gently push the processor subsystem chassis toward the tool from the rear of the system (do not bend the vertical processor-subsystem chassis support bars). Two people should gently pull and guide the processor subsystem chassis onto the lift plate. The processor subsystem chassis must be removed far enough to clear the front face of the frame. Use the tie-down strap to hold the processor subsystem chassis to the lift plate. Retract the extender arms and center portion of lift plate. 25. Lower the processor subsystem chassis to the bottom position of the lift tool. 26. Remove the wheel chocks and release the brake. Flip the crank handle 180 degrees. Carefully move the lift tool with the processor subsystem chassis to the staging area. The brake and wheel chocks must be in place when transferring the processor subsystem chassis to or from the packaging material.Replacement Note: If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. Attention: Look for small amounts of debris on the connecting surfaces between the backplane and processor modules. Mating the two surfaces is critical for the processor to function properly. 1. Position the lift tool such that the new processor subsystem chassis can be moved onto the lift plate. 2. Install the wheel chocks and engage the brake. Chapter 9. Removal and Replacement Procedures 909 3. Position the plate at the correct level, and move the new processor subsystem chassis onto the tool with the rear side flush with the lift plate. Strap the processor subsystem chassis to the lift plate using the tie-down strap. 4. Position the lift tool as close as possible to the front of the rack. 5. Raise the lift tool until it is level with the rails. 6. Release the pins on the lift plate and fully extend the center portion of the plate toward the frame and lock it in place. Release the pins on the extender arms and position them on both left and right side. The arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). Engage the lift tool brake. Install wheel chocks on lift tool casters.Note: When you attach the arms to the frame, ensure that any cables are clear of the extender arms. 7. Remove the tie-down strap. One person should gently pull the processor subsystem chassis into the frame from the rear of the system (do not bend the vertical processor-subsystem chassis support bars). Two people should gently push and guide the processor subsystem chassis into the frame. Note: Adjust the height of the lift plate, as needed. 8. After the processor subsystem chassis is installed in the frame, lower the lift plate and remove it from the lift tool. Repackage the lift tool. 9. Reinstall all components removed during the removal process by reversing steps 22 on page 909 through 2 on page 908 of the removal procedure.Memory Books and Memory Blanks Removal 1. Turn off the system, and completely remove power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the safety bar from the memory slot. 3. Grasp the blue release latches located at the top and bottom of the memory book between your thumb and index finger, and press the tabs together. Pivot the tabs and pull the book straight back.Inspection You must examine the connector for any connector damage on the removed book using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If you identify a defect on the book, do not install the replacement card without contacting your next level support for instructions. A damaged connector on a defective book may indicate a damaged system backplane. If the memory book connector is damaged, the adjacent memory book may need to be removed in order to inspect the system backplane for damage. If the removed book is not damaged, carefully remove the new book from the packaging. Be sure to protect the connector and the blue handles when you remove the book from the cardboard box or ESD bag. Do not handle the book by its connector or either of its blue handles. Remove the protective cover from the connector and install it on the defective book.Notes: 1. Do not stack books on top of each other. ESD bags do not provide sufficient stacking protection. 2. Do not rest a book on its connector or on its blue handles at any time.After the memory book is removed from the ESD bag, examine the book VHDM connector using the 3X Magnification Glass (part number 11P4747) included in the system toolkit. If any connector shows signs of damage, do not use it. 910 406/90 Service Guide Replacement Note: If there is no book installed in the position next to the one that you are replacing, install a filler (part number 21P8218) in that adjacent location. To replace a book, do the following: 1. Inspect the new book for damage. For more information, see “Inspection” on page 910. 2. Unlatch the two blue handles on the replacement book. 3. Taking care not to hit the connector against the CEC cage, use both hands to carefully place the book on the CEC guides. 4. Slide the book in until it reaches the alignment pins. 5. Using even pressure, actuate both blue handles simultaneously until the book is fully seated. 6. Repeat this process for all replacement books. 7. Reinstall any safety bars that were removed to perform this procedure. 8. If any cables were removed during the removal procedure, plug all cables into their correct locations. 9. Turn on the power as described in “Powering the System On” on page 841.Fan Controller Removal Attention: The fan controller weighs approximately 0.45 Kg (1 lb.) 1. Locate the fan controller to be replaced. 2. Note the state of the fan controller’s LEDs. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if all the fan controller’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Label and remove the plugs from the fan controller. 6. Loosen the two knurled captive screws that attach the fan controller to the fan assembly. 7. Grasp the loosened screws, and pivot and pull the fan controller away from the fan assembly.Replacement 1. Align the metal tabs opposite the captive screws into the corresponding slots in the fan assembly. 2. Pivot gently to seat the autodocking connector. 3. Tighten the knurled captive screws to the fan assembly. 4. Plug all cables into their correct location in the DCA. 5. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure. Chapter 9. Removal and Replacement Procedures 911 Processor Fan Assemblies Removal Attention: The fan controller weighs approximately 5 Kg (11 lb.) 1. Locate the fan assembly to be replaced. 2. Note the state of the fan controller’s LEDs. 3. Press the green Start Service button on the UEPO switch. 4. Proceed only if all the fan controller’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Verify that all cables are labeled for plug location, and unplug the cables from the fan controller on the fan assembly. 6. Remove the fan controller (see “Fan Controller” on page 911). 7. Loosen the two knurled captive screws that attach the fan assembly to the enclosure. 8. Pull the fan assembly away from the enclosure opening.Replacement 1. Position the fan assembly in front of the opening through the spring-loaded doors in the enclosure. 2. Slide the fan assembly into the enclosure opening and tighten the upper and lower fasteners. 3. Replace the fan controller onto the face of the fan assembly. 4. Plug all cables into their correct location in the DCA. 5. Press the white Service Complete button on the UEPO switch to indicate completion of this procedure.Fan Chassis Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the top two fans. See “Processor Fan Assemblies.” 3. Loosen the four retaining screws that attach the fan chassis to the processor subsystem chassis. 4. Pull the fan chassis out of the processor subsystem chassis.Replacement Replace in reverse order. Processor Subsystem Air Filters Attention: To maintain proper airflow, the air filters (part number 11P3781) must be inspected every six months and replaced as necessary. To replace the air filters, it is not necessary to remove power from the system or blower assemblies . Removal To remove, grasp finger tabs on the air filter and pull straight out. Replacement To remove, slide the air filter into the slot. The filter is keyed to ensure correct orientation. 912 406/90 Service Guide I/O Subsystem I/O Subsystem PCI Slot LED Definitions 2264411533 1 I/O LED (Currently Unused) 4 I/O Subsystem Backplane Power On (Green) 2 Auxiliary Power Good (Green) 5 I/O Card Power On LED (Green) 3 I/O Subsystem Backplane Fault (Amber) 6 Failing I/O Card LED (Amber) The following table describes the possible states of the LEDs associated with the I/O subsystem PCI slots. These LEDs are located at the rear of the I/O subsystem. One LED is associated with each PCI slot. Green PCI Slot LED Indication Amber PCI Slot LED PCI Slot Power/Status Definition On Off On/No action The PCI slot power is on and the slot is operational. Do not remove or replace adapters. On Flashing On/Identify The PCI slot power is on, and the slot has been identified by the software or firmware. Do not remove or replace adapters at this time. Off Flashing Off/IdentifyOff/Action The PCI slot power is off, and the slot has been identified by the software or firmware. Perform PCI adapter remove, replace, or install only when directed by the PCI hot-plug procedures. Off Off Off/No power The PCI slot power is off. If the system power is off, perform non-hot-plug PCI adapter remove, replace, or install procedure. Chapter 9. Removal and Replacement Procedures 913 PCI Adapters Note: Linux does not support hot-plugging any hot-pluggable PCI adapters or devices. A system with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter or device assigned to a Linux partition. Follow the non-hot-pluggable adapter or device procedures when replacing a PCI adapter or device in any partition with Linux installed. Most PCI adapters can be removed and replaced, or installed in the system without turning off the power to the entire system. These adapters are referred to as hot-pluggable PCI adapters. Some adapters are not hot-pluggable and power must be removed from the system for adapter removal, replacement, or installation. Though some adapters are not hot-pluggable, they must still be installed in the cassette. Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly before it is reinstalled in a system unit or an I/O drawer. Before you remove or replace an adapter, determine if the PCI adapter you are working with is hot-pluggable. See the PCI Adapter Placement Reference, order number SA23-2504. When assembling, inserting or removing a blindswap cassette, verify that the foam ?T-shaped? EMC gasket strip is firmly attached to the sheet metal bezel. This includes all applications where these gaskets are used (for example, riser cards, blank fillers or any of the various cassette assemblies). Visually inspect the top and bottom of the gasket strips insuring that they are firmly bonded to the sheet metal. Look for any of the following: v gaskets that are starting to unwrap v gaskets lifting or peeling off of the sheet metal v gaskets protruding from the sheet metal bezel when all cassettes, fillers, or risers are plugged into the system. Inspect both the gasket on the cassette or filler that was removed and the gasket visible in the PCI slot. If a bad gasket is found on a cassette, then the entire cassette must be replaced with a new cassette and gasket. If a bad gasket is found on a riser card, then the I/O Planar containing the riser card needs to be replaced. If you are removing a PCI adapter that is: v Not hot-pluggable, go to “Removing a Non-Hot-Pluggable PCI Adapter” on page 920. v Hot-pluggable, go to “Hot-Pluggable PCI Adapter” on page 921. 914 406/90 Service Guide Removing a PCI Adapter Cassette Attention: Perform this procedure only when you are instructed to by the removal and replacement procedures in “Removing a Non-Hot-Pluggable PCI Adapter” on page 920 or “Hot-Pluggable PCI Adapter” on page 921.Note: An adapter or a blank filler must be installed into the PCI adapter cassette before it is reinstalled in a processor subsystem or an I/O subsystem. The following illustration shows the PCI adapter cassette. 1 PCI Adapter Cassette 2 Handle-Release Lever Latch 3 Locking Bar 4 Handle Release Lever 5 Handle 6 PCI Adapter Tailstock Tab 7 Adapter Cassette Pull Tab Chapter 9. Removal and Replacement Procedures 915 To remove a PCI adapter cassette, do the following: 1. Determine which PCI adapter cassette you are removing. 2. Disconnect any cables that are connected to the PCI adapter. 3. Press the handle release lever, then push the handle to the down position. See the following illustration. 1 Locking Bar 2 Handle Release Lever 3 Handle in the Down Position 4. Push in the locking bar until it clicks into the locked position. See the following illustration.Note: In this position, the locking bar locks the handle in the down position and prevents it from being pulled up while the PCI adapter cassette is being removed. 916 406/90 Service Guide 1 Locking Bar 2 PCI Adapter Tailstock Tab 3 Adapter Cassette Pull Tab 5. Position your index finger under the handle and push up the adapter tailstock tab. Grasp the adapter cassette pull tab with the other hand. Pull the PCI adapter cassette from the adapter slot by pulling on the adapter cassette pull tab. Do not pull on the handle when removing the adapter cassette. Notes: a. Pulling on the handle may reengage the adapter connector, preventing the cassette from sliding out. b. Do not force the PCI adapter cassette out of the PCI adapter slot. The cassette will slide out smoothly when this step is performed correctly. c. Do not remove more than one adapter cassette at a time. Chapter 9. Removal and Replacement Procedures 917 Replacing a PCI Adapter Cassette: Attention: Perform this procedure only when you are instructed to from “Removing a Non-Hot-Pluggable PCI Adapter” on page 920 or “Hot-Pluggable PCI Adapter” on page 921. Before you replace the PCI adapter cassette, check the following: v Before installing a PCI adapter cassette in the system, an adapter or blank filler must be installed in each PCI adapter cassette and the top cover for the system must be installed. v Check the adapter in the cassette that you are about to install, and make sure that the adapter is held firmly on all four corners by the retaining arms and ratchet clips. v Review the procedures beginning with “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935 to make sure that the adapter is correctly installed in the cassette.To replace a PCI adapter cassette, do the following: 1. Depress the handle release lever located in the center of the PCI adapter cassette handle. Move the handle down completely to the fully down position. See the following illustration. 1 Cassette Dovetail Rails 2 Handle-Release Lever Latch 3 Handle Release Lever 2. Push the top of the locking bar in until it clicks into the locked position. 918 406/90 Service Guide Note: In this position, the locking bar locks the handle in the down position and prevents it from being pulled up while the PCI adapter cassette is being removed. 3. Examine the PCI adapter slot opening carefully, and note the location of the dovetail rails and the cassette guide rails. A flashlight can be used to enable you to see the dovetail rails and the cassette guide rail slots, as shown in the following illustration. 1 Cassette Left Dovetail Rail 4 Cassette Bottom Guide Rail 2 Cassette Right Dovetail Rail 5 Side Guide Rail Slot on System Board 3 Cassette Side Guide Rail 6 Bottom Guide Rail Slot on System Board 4. Hold the PCI adapter cassette at the PCI adapter slot so that the PCI adapter cassette is perpendicular to the slot and level with the slot opening. 5. Align the bottom edge of the PCI cassette cover with the PCI cassette guide rail on the system board or I/O unit. 6. Slide the cassette partially into the guide. Attention: If the cassette does not slide in smoothly, the cover might not have engaged the PCI cassette guide rail. Carefully pull the adapter back out and start again. 7. As you slide the cassette into the slot, ensure that the dovetail rail on the top of the PCI adapter cassette aligns with the mating slots on any cassettes that might be installed on the left and right sides. 8. Push the cassette in until it is flush with any other installed cassettes. Chapter 9. Removal and Replacement Procedures 919 9. Visually check to ensure that the cassette is parallel with the cassettes to the left or right. If the cassette is not parallel with the neighboring cassettes, the cassette dovetail rails might not be correctly engaged. If the cassette is not correctly aligned, you might need to pull it out so you can see the ends of the dovetail rails. If so, reinsert the cassette, ensuring that the dovetail rails engage on both sides of the cassette.Note: If there are no other cassettes installed, it is not necessary to check the alignment of the dovetail rails. 10. Depress the handle release latch. Lift the handle up until the handle-release lever clicks when it locks into position. Note: To aid you in seating the PCI adapter connector, it may be necessary to press down on the adapter tailstock tab. The PCI adapter is completely installed into the PCI adapter slot when the handle is in the fully raised position and the tailstock is seated into the slots located on the system board. Attention: Do not force the handle into the raised position.Removing a Non-Hot-Pluggable PCI Adapter: To remove a non-hot-pluggable adapter, do the following: 1. Turn off power and remove the power from the system as described in “Powering Off and Powering On the System” on page 838. 2. Determine the slot from which you are removing the adapter. 3. Refer to “Removing a PCI Adapter Cassette” on page 915. When you have completed the PCI adapter cassette removal procedure, return here. 4. If you are installing another adapter in this slot, follow the instructions given in “Replacing a Non-Hot-Pluggable PCI Adapter.” If you are not installing another adapter in this slot, install a blank PCI adapter cassette into the slot. Refer to “Replacing a PCI Adapter Cassette” on page 918. 5. Connect power to the system and turn the power on. 6. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus SystemsDiagnostic Information for Multiple Bus Systems.Replacing a Non-Hot-Pluggable PCI Adapter: Replace the adapter using the following procedure:Note: Because the adapter you are installing is not hot-pluggable, shut down the system and remove power before performing this procedure. 1. If you have just removed an adapter and you are replacing it in the system, use the procedures in “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935 to install the replacement adapter into the PCI adapter cassette. 2. Go to “Replacing a PCI Adapter Cassette” on page 918 to replace the appropriate cassette in the PCI slot, then return here. 3. Connect power to the system. 4. Go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Refer to ?Starting the System Without an HMC Attached? in your installation or service guides. 5. Set the color slide to blue. 6. Ensure that the adapter is configured when the system completes the boot process. 920 406/90 Service Guide Hot-Pluggable PCI Adapter: Before performing the following procedure, read “Safety Notices” on page vii. Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. The following hot-plug procedures take you through removing and replacing hot-plug PCI adapters using software that presents procedures on your display. The LEDs on a PCI adapter cassette are described in either your installation guide or service guide. Removing and Replacing a Hot-Pluggable PCI Adapter: Notes: 1. Use this procedure only when you are replacing an adapter with an identical adapter. If you are replacing an adapter with an adapter that is not identical to the adapter removed, go to “Removing and Replacing a Hot-Pluggable PCI Adapter,” and then to “Installing a New Hot-Pluggable PCI Adapter” on page 923. 2. When a system containing a defective adapter is booted, the system considers that the PCI slot is empty, and adapter LEDs do not light. Under this condition, it is safe to remove the adapter from the system at this time. To replace an adapter, perform the following steps: 1. Determine the slot from which you are removing the adapter. 2. Ensure that any processes or applications that might use the adapter are stopped. Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. 3. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 4. At the command line, type smitty. 5. Select Devices. 6. Select PCI Hot Plug Manager. 7. Select Unconfigure a Device and press Enter. 8. Press F4 to display the Device Names menu. 9. From the menu, select the adapter you are removing. 10. If you are replacing the adapter, go to step 11 on page 922. If you are not replacing the adapter, do the following: a. Use the Tab key to answer NO to Keep Definition. Press Enter. b. Go to step 12 on page 922. Chapter 9. Removal and Replacement Procedures 921 11. Answer YES to Keep Definition. Press Enter. 12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press F3 to return to the PCI Hot-Plug Manager menu. 14. Select List PCI Hot-Plug Slots. 15. If the List PCI Hot-Plug Slots menu indicates that the slot is not empty, go to step 17. 16. If the List PCI Hot-Plug Slots menu indicates that the slot is empty, do the following: a. Press F3 to cancel. b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI slot is empty. c. On the display screen, select the indicated empty slot. Look at the adapter you intend to remove. A slow blinking amber LED located next to the adapter on the PCI riser card indicates that the slot has been identified. Press Enter. The adapter has been put into the action state. d. Disconnect any cables that are connected to the adapter being removed. e. Completely remove the hot-plug adapter from the slot. The LED goes off. f. Go to step 21.17. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed, and then refer to “Removing a PCI Adapter Cassette” on page 915. 18. Install the replacement adapter into the PCI adapter cassette. See “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935. 19. With the replaced adapter installed into the PCI adapter cassette, refer to “Replacing a PCI Adapter Cassette” on page 918. 20. If you installed an adapter into the adapter cassette, connect the appropriate cables and devices to the adapter connectors. 21. Continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 22. If you did not install an adapter into the adapter cassette, go to step 23. If you did install an adapter into the adapter cassette, do the following: a. Press the F3 key to return to the PCI Hot-Plug Manager menu. b. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on the screen. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen.23. If you do not have other adapters to replace, continue with the next step.ORIf you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 14. 24. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. 922 406/90 Service Guide Installing a New Hot-Pluggable PCI Adapter: Note: Before installing a new adapter, ensure the device driver for the adapter is installed. Check with the system administrator if the device driver is not already installed. To install an adapter, perform the following steps: 1. Install the new adapter into a PCI adapter cassette. See “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935. 2. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 3. At the command line, type smitty. 4. Select Devices. 5. Select PCI Hot Plug Manager. 6. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The Add a Hot-Plug Adapter window displays. 7. For adapter placement information, see the PCI Adapter Placement Reference, order number SA23-2504. Select an empty PCI slot for the adapter. 8. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter. 9. Follow the instructions on the screen to install the adapter until the visual indicator (LED) for the specified PCI slot is set to the Action state. 10. Install the new adapter in the adapter slot by performing the following: a. Lower the black handle completely. b. Push the locking bar until you hear a clicking sound. c. Hold the assembly straight on and level with the slot. d. Align the bottom edge of the PCI cassette cover with the PCI adapter guide rail on the I/O backplane. Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover with the ?tick? in the notch of the neighboring cassette. Chapter 9. Removal and Replacement Procedures 923 e. Slide the cassette partially into the guide. f. Ensure that the dovetail on the top track aligns with its mating component(s) on both sides. 1 I/O Subsystem 2 PCI Adapter Cassette g. When the cassette is fully inserted, prepare to activate the handle by lowering the locking bar. Lift the handle all the way up until you hear a click. The PCI adapter should be completely seated. h. Set the color slide to the orange color indicating that the adapter in the cassette is hot-pluggable.11. Connect the appropriate cables and devices to the adapter. 12. Continue to follow the screen instructions until you receive a message that the installation is successful. Successful installation is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press the F3 key to return to the PCI Hot-Plug Manager menu. 14. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on the screen. Successful installation is indicated by the OK message displayed next to the Command field at the top of the screen. 15. If you do not have other adapters to install, continue with the next step. OR If you have other adapters to install, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 6 on page 923. 16. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration.PCI Hot-Plug Manager Access: Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is appropriate to perform them. Note: A PCI adapter is only hot-pluggable if the PCI adapter supports hot-plug applications. See the PCI Adapter Placement Reference, order number SA23-2504. 924 406/90 Service Guide Accessing Hot-Plug Management Functions: Note: Removing or installing a Hot-Pluggable PCI adapter requires the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. To access the hot-plug menus, do the following: 1. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 2. At the command line, type smitty. 3. Select Devices. 4. Select PCI Hot Plug Manager and press Enter. 5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. For a description of the menu options, see below.PCI Hot-Plug Manager Menu: The following options are available from the PCI Hot Plug Manager menu: Note: For information about the PCI slot LED states, refer to either your installation guide or service guide. List PCI Hot-Plug Slots Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the adapter in that slot. Add a PCI Hot-Plug Adapter Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot will go into the Action state and finally into the On state. Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure Devices Added After IPL option to configure the adapter. Replace/Remove a PCI Hot-Plug Adapter Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For this option to work, the adapter must be in the Defined state (see “Unconfigure a Device” option below). You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the Action state. Chapter 9. Removal and Replacement Procedures 925 Identify a PCI Hot-Plug Slot Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. Unconfigure a Device Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use. This step must be completed successfully before starting any removal or replacement operation. If this step fails, the customer must take action to release the device. Configure a Defined Device Allows a new PCI adapter to be configured into the system if software support is already available for the adapter. The selected PCI slot will go into the On state. Install/Configure Devices Added After IPL The system attempts to configure any new devices and tries to find and install any required software from a user-selected source. The add, remove, and replace functions return information to the user indicating whether the operation was successful. If additional instructions are provided on the screen, complete the recommended actions. If the instructions do not resolve the problem, see the following: v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to configure the adapter. v If you receive a warning indicating that needed device packages are not installed, the system administrator must install the specified packages before you can configure or diagnose the adapter. v If you receive a failure message indicating a hardware error, the problem might be either the adapter or the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different adapter in the slot. If you determine that you have failing hardware, call your service representative. 926 406/90 Service Guide PCI Adapter or Blank Filler Removal from a Screw Assembly Type Cassette: Note: Before doing the following procedure determine if your PCI adapter cassette is snapped together or held together with three screws. If your PCI adapter cassette is held together with three screws, follow this procedure. If your PCI adapter cassette is snapped together, see “PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette” on page 935. PCI adapter cassettes that are snapped together, can be identified by the blue grommet and snap button located next to the PCI adapter cassette release handle.Removal: 1. Remove the PCI card assembly from the I/O subsystem (for hot-pluggable adapters see “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 921, for non-hot-pluggable adapters, see “Removing a Non-Hot-Pluggable PCI Adapter” on page 920). 2. Position the assembly on a flat surface, with the tailstock to the left and the card connector toward you. 3. Remove the three screws from the top cover groove, and the two screws at the heel corner. 4. Pop the rear end of the cover off the guide bump and remove the cover. 5. Position the assembly on a flat surface, with the tailstock to the right and the card connector toward you. 6. Remove the three screws from the top of the bezel assembly. 7. Position the assembly on a flat surface, with the tailstock to the left and the card connector toward you. 8. Flex the support arm downward until the top clip on the card arm is accessible. 9. Use a small screwdriver to lift the guide lock, and slide the card arm away from the PCI card until the card end is free. 10. Push the lower-front heel off the card, and remove the PCI card from the assembly. Chapter 9. Removal and Replacement Procedures 927 Replacement: 1. Ensure that you have the following parts: v Bezel and linkage subassembly v Cover v Clip v Insertion Clip (see the following note) v Short card arm v Long card arm v Screws (10--2 are extra)Note: If you are installing a Gigabit Ethernet (FC 2969 type 9U), Ultra3 SCSI (FC 6203 Type 4Y), or 10/100 BaseT Ethernet (FC 4962 Type AF), use the additional clips listed below to help when you are seating the adapter during installation. There are different clips (1) for each adapter type (2): 12 44P0323 Clip for Gigabit Ethernet FC 2969 type 9U 44P0324 Clip for Ultra3 SCSI FC 6203 Type 4Y 44P0321 Clip for 10/100 BaseT Ethernet FC 4962 Type AF 928 406/90 Service Guide 2. Position the bezel and linkage subassembly with the bezel to the left, on a flat ESD surface. 3. Pivot the bezel so that the lower end is away from the linkage subassembly.Note: Ensure that the top front clip is in the uppermost position before continuing. a. If the adapter card is full-height (that is, taller than the tailstock of the card), do the following: 1) Place the top corner of the PCI card so that it seats in the top front clip, as shown in the following diagram: 1 Cassette Top Linkage Bracket 4 Front Heel Clip 2 Top Front Clip 5 Adapter Cassette Bezel 3 Long Adapter 2) Rotate the bottom of the tailstock to fit in the front heel clip. 3) To remove any slack in the fit, use pliers or screwdriver to adjust the top front clip downward.b. If the adapter card is not full-height (that is, shorter than the tailstock of the card), do the following: 1) Place the bottom corner of the PCI card on the front heel clip. 2) To remove any slack between the top and bottom clips, use pliers or screwdriver to adjust the top front clip downward. If the top clip is pushed too far down without engaging the card, remove the top front clip by popping it off the rail. Reattach the clip and start again. Chapter 9. Removal and Replacement Procedures 929 4. If the new PCI adapter does not extend beyond the mark at the top of the linkage subassembly, attach the appropriate short card arm card. If the adapter extends beyond this mark, attach the long card arm to the support arm of the linkage subassembly as follows: 1 Mark at the Top of Linkage Subassembly 5 Short Adapter 2 Top Rear Clip (Shown on the long, and short card arms and middle-spacer arm) 6 Top Front Clip 3 Long Card Arm 7 Long Adapter 4 Short Card Arm Note: Ensure the card arm is fully on the track and clears the end of the support arm of the linkage subassembly. A flat head screwdriver might be needed to push the card arm in the remaining distance. a. Bend the support arm downward, to allow the middle spacer (long cards only) and card arm to slide onto the end of the support arm. b. Slide the card arm until the top-rear and lower clips align with the edge of the card. 930 406/90 Service Guide c. Press the lower clip under the bottom edge of the card. The bottom of the clip should be even with the bottom of the card. d. Slide the top rear clip of the card arm downward to engage the top corner of the card. The card should now be securely stabilized in all the clips, as shown in the following illustration. 1 Short Card Arm Top Clip 6 Short Card Arm Bottom Clip 2 Short Card Arm 7 Short Adapter Card 3 Long Card Arm Top Clip 8 Bottom Front Clip 4 Long Card Arm 9 Adapter Cassette Bezel 5 Long Card Arm Bottom Clip 10 Front Top Clip 5. Tilt the top of the bezel away from the PCI card while lowering the bottom of the tailstock into bottom of the bezel, as shown in the following illustration. 1 Adapter Cassette Bezel 2 Short Card Arm 3 Long Card Arm Chapter 9. Removal and Replacement Procedures 931 6. When the lower corner of the card is fully seated into the bezel, rotate the top of the bezel over the tailstock of the card. 1 Adapter Card 2 Adapter Card (Lower Corner) 3 Front Bottom Clip 4 Adapter Cassette Bezel 7. Position the bezel fully onto the assembly by snapping the three small holes on bezel arm over the three small embossments on the linkage assembly, as shown in the following illustration. 1 Alignment Studs 2 Adapter Card 3 ESD Surface 932 406/90 Service Guide 8. Secure the bezel arm to the linkage assembly with three small screws in the sequence shown in the following illustration. 1 Retaining Screws 9. With the tailstock to the left on a flat surface, position the cover over the linkage assembly. 10. Tuck the top and bottom tabs at the left of the cover under the bezel. 1 Front Top Tab Chapter 9. Removal and Replacement Procedures 933 2 Front Bottom Tab 11. Snap the top-right corner of the cover over the guide bump on the linkage assembly. 12. Align the holes at the top of the cover with those in the linkage subassembly and screw the cover to the linkage subassembly in the sequence shown in the following illustration.15432 13. Snap the tailstock clip onto the tailstock by rotating it onto the bracket while inside the bezel opening. 14. Raise the lever to the horizontal extended position and snap the tailstock clip onto the tailstock tab as follows. 15. Visually inspect the assembly and ensure that the following are correct: v The card moves up and down correctly when the handle is raised and lowered. v The tailstock clip rides within the bezel opening. 934 406/90 Service Guide PCI Adapter or Blank Filler Removal from a Snap-Assembly-Type Cassette: CAUTION: Metal edges might be sharp.C38Note: Before doing the following procedure determine if your PCI adapter cassette is snapped together or held together with three screws. If your PCI adapter cassette is held together with three screws, go to “PCI Adapter or Blank Filler Removal from a Screw Assembly Type Cassette” on page 927. PCI adapter cassettes that are snapped together, can be identified by the blue grommet and snap button located next to the PCI adapter cassette release handle. Use this procedure when you are preparing to install or replace an adapter or an adapter blank filler in the system unit or an I/O drawer. Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly before it is reinstalled in a system unit or an I/O drawer. Determine the slot of the system or I/O drawer in which you plan to install or remove an adapter, and refer to the procedures for removing an adapter from the system. Before beginning this procedure, remove the PCI adapter cassette assembly from the system unit or I/O drawer.Note: It may take approximately 30 to 40 minutes to perform this procedure the first time. This time includes using the instructions in this guide and performing the steps. Thereafter, performing this procedure usually takes approximately 10 minutes. Before performing the following steps, familiarize yourself with the entire procedure. To remove a PCI adapter or blank filler from a cassette assembly, do the following: 1. Place the PCI adapter cassette assembly on a flat work surface with the cover facing up, and the top of the adapter facing you. See the following illustration. 1 Cover 3 Top of Cover 2 Handle Chapter 9. Removal and Replacement Procedures 935 2. Using two fingers, remove the bushing-lock pin from the bushing. The bushing-lock pin can be removed by pulling it out of the bushing with your fingernails. 1 Bushing 2 Bushing-Lock Pin 1 Bushing-Lock Pin 936 406/90 Service Guide 3. Remove the bushing. The bushing can be removed by pulling it out of the PCI adapter cassette assembly with your fingernails. 1 Bushing 4. Turn over the PCI adapter cassette assembly so that the top is facing away from you. 1 Top of Adapter 2 Handle Chapter 9. Removal and Replacement Procedures 937 5. Remove the bezel, as follows: a. Locate the plastic latch fingers in the top part of the cassette. 1 Top of Cassette 2 Bezel 3 Plastic Latch Fingers b. Using one hand, pinch the plastic latch fingers, and with your other hand, carefully lift the top part of the bezel extension out until the tabs clear the slots in the PCI adapter cassette assembly. 1 Slots 2 Tab 3 Bezel Extension 938 406/90 Service Guide c. While holding the bezel extension out, push the plastic cover latch out of the bezel hook, as shown in the following illustration. This action allows the bezel to be removed. 1 Bezel Extension 2 Plastic Cover Latch in Bezel Hook d. On the opposite side of the cassette (cover side), push the cover latch to release the bezel. 1 Cover Side 2 Cover Latch Chapter 9. Removal and Replacement Procedures 939 e. While holding the bezel extension out, carefully slide the bezel off. You might have to work from both sides to loosen the bezel assembly from the cassette assembly. When the bezel is free, slide it completely off the cassette assembly, and set it aside. 1 Bezel Extension 2 Bezel 1 Bezel 940 406/90 Service Guide 6. Turn over the cassette so that the cover is facing up, as shown in the following illustration. Remove the cover from the cassette as follows: a. Slide the cover until it releases from the cassette assembly. Attention: The cover might be tight and difficult to slide. If you grasp the left end (handle end) of the cassette and the right end of the cover, you can use enough force to pull the cover off the PCI adapter cassette assembly. 1 Left End of the Cassette 2 Cassette Cover b. Lift the cover off the assembly, and set it aside. c. Depress the handle release lever to unlock the locking bar. The locking bar is located on the handle, ensure that the handle is pulled into the unlocked position. Raise the handle on the cassette linkage assembly until it locks into the up position (the blank filler or adapter moves downward). 1 Handle Release Lever 2 Locking Bar Chapter 9. Removal and Replacement Procedures 941 7. Remove the metal EMC shield from the top of the tailstock.12 1 Metal EMC Shield 8. Remove the blank filler or adapter that is installed in the cassette linkage assembly.Note: If there is a blank filler in the cassette linkage assembly, as shipped from the manufacturer, there are two adapter arms. One adapter arm is used with short adapters (short adapter arm) and the other adapter arm is used with long adapters (long adapter arm). If you are removing a short adapter or blank filler, see “Short Adapter or Blank Filler Removal” on page 943. If you are removing a long adapter, see “Long Adapter Removal” on page 946. 942 406/90 Service Guide Short Adapter or Blank Filler Removal: To remove a short adapter or blank filler, do the following: 1. Slide the long and short adapter arms away from the adapter or blank filler by doing the following: a. Each adapter arm has a release tab that allows the arm to be moved away from the adapter or blank filler in the cassette assembly. Use your fingernail to lift the tab, to allow each arm to be moved away from the adapter or blank filler. Lift the release tab on the short adapter arm, and push on the slotted tab to release the end of the blank filler.Note: If you plan to install a short adapter, leave the long adapter arm on the cassette linkage assembly. If you plan to install a long adapter, remove both the long and short adapter arms from the cassette linkage assembly in the next step. b. Slide the long and short adapter arms away from the blank filler or adapter. 1 Long Adapter Arm 2 Release Tab 3 Release Tab 4 Short Adapter Arm 5 Slotted Tab Note: Some dual linkage PCI adapter cassettes might contain a middle-spacer arm to be used with long adapter cards only. The middle-spacer arm and its installation can be seen in the following Chapter 9. Removal and Replacement Procedures 943 illustration.213453 1 Short PCI Adapter Card 2 Short Adapter Arm 3 Middle-Spacer Arm 4 Long Adapter Arm 5 Long PCI Adapter Card 944 406/90 Service Guide 2. Remove the adapter or blank filler from the cassette linkage assembly by rotating the bottom of the tailstock out, as shown in the following illustration. Store the adapter or blank filler in a safe place.123 1 Cassette Linkage Assembly 2 Bottom of Tailstock 3 Handle 3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or blank filler in the cassette, go to “Installing a New Hot-Pluggable PCI Adapter” on page 923. Chapter 9. Removal and Replacement Procedures 945 Long Adapter Removal: To remove a long adapter, do the following: 1. Each adapter arm has a release tab that allows the arm to be moved away from the adapter in the cassette assembly. Use your fingernail to lift the tab, to allow the arm to be moved away from the adapter. Lift the release tab on the long adapter arm, and slide it off the cassette linkage assembly. 1 Release Tab 2. Remove the adapter from the cassette linkage assembly by rotating the bottom of the tailstock out, as shown in the following illustration. Store the adapter in a safe place. 1 Cassette Linkage Assembly 3 Bottom of Tailstock 2 Adapter 4 Handle 3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or blank filler in the cassette, go to “Installing a New Hot-Pluggable PCI Adapter” on page 923.Note: If you are going to install a short adapter card, and your cassette linkage has the middle-spacer arm installed, remove the middle-spacer arm and install the short adapter arm, refer to page 1a on page 943. The middle-spacer arm and long adapter arm can be stored on the cassette linkage. See the illustration on page 944. To install a short adapter into a cassette see “Replacing an Adapter in a PCI Adapter Cassette.”Replacing an Adapter in a PCI Adapter Cassette CAUTION: Metal edges might be sharp.C38 946 406/90 Service Guide This procedure is performed when preparing to install a new adapter or a blank filler in the processor subsystem or an I/O drawer. Familiarize yourself with the entire procedure before performing the following steps. 1. Place the empty PCI Adapter Cassette Assembly linkage on a flat work surface in front of you. Position the cassette so that the handle is in the raised (up) position, and on the left, with the top of the linkage facing away from you.Note: Your PCI Adapter Cassette Assembly linkage might not have the short or long adapter retaining arms or the middle-spacer arm installed if you removed them during the removal procedure. In the following illustration, both arms are shown. 1 Handle 2 Cassette Linkage Assembly 3 Retaining Arms 4 Middle-Spacer Arm Chapter 9. Removal and Replacement Procedures 947 Note: If you are installing an adapter that is in the following list, use the additional brackets listed to help when you seat the adapter during installation. There are unique brackets (1) for each adapter type (2):12 Feature Code Description Bracket Part Number 2751 S/390 ESCON Channel PCI Adapter 31L8728 2969 Gigabit Ethernet-SX PCI Adapter 44P2676 2975 10/100/1000 Base-T Ethernet PCI Adapter 44P2676 4962 10/100 Mbps Ethernet PCI Adapter II 44P0321 5700 IBM Gigabit Ethernet-SX PCI-X Adapter 53P5450 5701 IBM 10/100/1000 Base-TX Ethernet PCI-X Adapter 53P5450 5706 IBM 2-Port 10/100/1000 Base-TX Ethernet PCI-X Adapter 53P5450 5707 IBM 2-Port Gigabit Ethernet-SX PCI-X Adapter 53P5450 6203 PCI Dual Channel Ultra3 SCSI Adapter 44P2661 6228 2 Gigabit Fibre Channel Adapter for 64-bit PCI Bus 44P2675 6239 2 Gigabit Fibre Channel PCI-X Adapter 44P3912 948 406/90 Service Guide 2. To determine if you are installing a long adapter or a short adapter, use the following steps. If you know the length of your adapter, skip the following steps and proceed to step 3 on page 951. a. Remove the adapter from its protective packaging and, if present, remove the plastic extension handle from the end of the adapter. b. Place the adapter or a blank filler over the cassette, with the upper-left corner aligned into the top adapter-retaining clip. 54231 1 Top Adapter-Retaining Clip 3 Adapter 2 Cassette Linkage Assembly 4 Bottom of Tailstock 5 Handle Chapter 9. Removal and Replacement Procedures 949 c. The cassette linkage assembly is marked to show long or short adapters similar to the adapter shown in the following illustration. Check the right end of the adapter to determine if it is long enough to fall into the long area of the cassette linkage assembly. Ensure that the corner on the left end of the adapter is still aligned into the top adapter-retaining clip, and determine the length of the adapter.LONGARMARMSHORT12 1 Top Adapter-Retaining Clip 2 Long Adapter-Retaining Arm and Clip 950 406/90 Service Guide 3. Remove the adapter or blank filler and ensure that the handle is in the up position. If you need to move the handle, ensure that the locking bar is pulled into the unlocked position, and then rotate the adapter handle until the handle is in the up position (the cassette linkage assembly will extend down beyond the cassette top). If you are installing a short adapter or blank filler, go to “Short Adapter or Blank Filler Installation.” If you are installing a long adapter, go to “Long Adapter Installation” on page 968.21 1 Handle Release Lever 2 Lock Short Adapter or Blank Filler Installation To install a short adapter or blank filler, do the following: Chapter 9. Removal and Replacement Procedures 951 1. Place the adapter or blank filler into the cassette so that the upper-left corner of the adapter engages the adjustable top adapter-retaining clip as shown in the following illustration.4312 1 Cassette Linkage Assembly 2 Bottom of Tailstock 3 Retaining Clip 4 Handle 952 406/90 Service Guide 2. Rotate the adapter so that the adapter engages the slot in the bottom adapter-retaining clip and the top corner of the adapter is seated into the adjustable top adapter-retaining clip.Note: If the adapter is not a full-height adapter, you must slide the adjustable top adapter-retaining clip downward until the lower edge of the adapter is seated into the slot on the bottom adapter-retaining clip.321 1 Adapter 2 Slot 3 Bottom Retaining Clip Chapter 9. Removal and Replacement Procedures 953 3. Slide the short adapter-retaining arm toward the adapter or blank filler on the cassette linkage rail. If the short-adapter retaining arm has been removed from the cassette linkage rail, install the short-adapter retaining arm and then slide the arm until it contacts the adapter or blank filler as shown in the following illustration.123 1 Short-adapter Retaining Arm 2 Adapter or Blank Filler 3 Cassette Linkage Rail 954 406/90 Service Guide 4. Ensure that the adjustable top retaining clip catches the corner of the adapter as shown in the following illustration: 1 1 Short Adapter Top Retaining Clip 5. Use the lower short adapter arm clip to engage and hold the bottom of the adapter.Note: It might be necessary to apply pressure to engage and hold the bottom of the adapter.1 1 Lower Short Adapter Arm Clip Chapter 9. Removal and Replacement Procedures 955 6. To hold the top of the adapter, slide down the adjustable top adapter-retaining clip on the retaining arm. Ensure that the bottom edge of the adapter is held by the lower part of the adapter retaining arm. 1 Adjustable Top Adapter-Retaining Clip 956 406/90 Service Guide 7. Press the lock on the handle, and rotate the adapter handle until it is in the down position (adapter or blank filler moves up into the cassette assembly). 21 1 Handle Release Lever 2 Lock Ensure that the right end of the cassette linkage rail moves up into the cassette linkage assembly as shown in the following illustration:1 1 Right End of Cassette Linkage Chapter 9. Removal and Replacement Procedures 957 8. Position the adapter and cassette assembly with the handle on the left (in the down position) and the top facing away from you.21 1 Top of Cassette 2 Handle 958 406/90 Service Guide 9. Install the cover on the cassette assembly, as follows: a. Place the cassette cover on the cassette assembly as shown in the following illustration. Slide the cover toward the handle until the hole in the cover aligns with the hole in the cassette assembly. 21 1 Handle 2 Holes Aligned Chapter 9. Removal and Replacement Procedures 959 10. Install the bezel assembly using the following procedure: a. Carefully slide the bezel onto the cassette assembly.1 1 Bezel b. Align and insert the cover arm latch in the hooked notch in the bezel.12 1 PCI Adapter Pull Tab 2 Hooked Notch in the Bezel for the Cover Arm Latch 960 406/90 Service Guide c. Align the top of the bezel assembly into the grooves on the top of the cassette assembly. Push the bezel onto the cassette linkage assembly until the tab on the top of the bezel is seated in the recess of the cassette assembly.1234 1 Recess 3 Tab 2 Groove 4 Bezel Assembly d. Insert the two tabs on the bezel extension into the two slots on the cassette linkage assembly.1 1 Tabs Chapter 9. Removal and Replacement Procedures 961 11. Check for the following: a. Ensure that the extension arm engages the pins on the cassette. You should be able to see the three pins in the holes in the arm. The bezel tab should be seated as shown in the following illustration.111 1 Pin b. Ensure that the cover arm latch is completely pressed into the hooked notch on the bezel as shown.12 1 PCI Adapter Pull Tab 2 Hooked Notch in the Bezel 962 406/90 Service Guide c. Turn the cassette assembly over so the cover is facing up. Check the cover latch to ensure that it is holding the bezel to the cover as shown.1 1 Cover Latch 12. Ensure the holes are aligned, and insert the bushing-lock pin into the bushing as shown. 1 Bushing 2 Bushing-Lock Pin Chapter 9. Removal and Replacement Procedures 963 The following illustration shows the location of the bushing in the cassette.1 1 Location of the Installed Bushing and Bushing-Lock Pin 13. Insert the bushing-lock pin into the hole in the bushing, and push it in until it seats as shown in the following illustration.1 1 Bushing-Lock Pin 964 406/90 Service Guide 14. Depress the handle release lever to ensure the locking bar on the handle is pulled into the unlocked position. Raise the handle on the cassette linkage assembly until it locks into the up position (the blank filler or adapter moves downward). 1 Handle Release Lever 2 Locking Bar 15. To install the metal EMC shield on the adapter bracket, grasp the EMC shield as shown in the following illustration, and do the following: 12 1 Metal EMC Shield 2 Installation Location for the Metal EMC Shield Chapter 9. Removal and Replacement Procedures 965 a. Ensure that the metal EMC shield slides up inside the top of the cassette.12 1 Metal EMC Shield b. The metal EMC shield has clips that slide over the top of the tailstock. Ensure that these clips are holding the metal EMC shield to the tailstock.1 1 Top of Tailstock 966 406/90 Service Guide 16. Depress the handle release lever on the handle, and rotate the adapter handle until the handle is in the down position (adapter or blank filler moves up into the cassette assembly). 1 1 Handle 17. Using the system documentation, determine if the adapter you are installing is hot-swappable. If the adapter is hot-swappable, move the slider on the color indicator to allow the orange color to be visible. If the adapter is not hot-swappable, the blue color is visible.12 1 Orange indicates Hot-Swappable 2 Blue indicates Not Hot-Swappable 18. The adapter is ready to be installed into a system or an I/O drawer. Chapter 9. Removal and Replacement Procedures 967 Long Adapter Installation To install a long adapter, use the following procedure. 1. If installed, remove both long and short adapter retaining arms. See step 1a on page 943 to remove the arms. 2. Install the middle-spacer arm, see the following illustration. Note: Some long adapters are shipped with a middle-spacer arm. If your adapter uses the middle-spacer arm, slide it onto the cassette linkage as shown in the following illustration. If there is an embossed line on the face of the top rail, align the middle-spacer arm as shown in the following illustration. If there is no line embossed on the face of the top rail, align the middle-spacer arm with the leading edge of the adapter’s connector, as shown in the following illustration.465312 1 Embossed Middle-Spacer Alignment Lines 4 Adapter Connector (Leading Edge) 2 Top Cassette Rail 5 Long Adapter Card 3 Middle-Spacer Arm 6 Long Adapter Arm 968 406/90 Service Guide 3. Place the adapter into the cassette so that the upper-left corner of the adapter engages the top adjustable adapter-retaining clip as shown in the following illustration:54231 1 Top Adapter-Retaining Clip 3 Adapter 2 Cassette Linkage Assembly 4 Bottom of Tailstock 5 Handle Chapter 9. Removal and Replacement Procedures 969 4. Rotate the adapter so that the adapter engages the slot in the bottom retaining clip and the top corner of the adapter is seated into the top adjustable adapter-retaining clip.Note: If the adapter is not a full-height adapter, slide the top adjustable adapter-retaining clip downward until the lower edge of the adapter is seated into the slot on the bottom retaining clip.213 1 Adapter 3 Bottom Retaining Clip 2 Slot 970 406/90 Service Guide 5. Install the long adapter-retaining arm onto the cassette linkage rail, as shown in the following illustration, and slide the arm toward the adapter.123 1 Cassette Linkage Assembly 3 Top Adapter Retaining Clip 2 Adapter Retaining Arm Chapter 9. Removal and Replacement Procedures 971 6. Ensure that the top adjustable adapter-retaining clip catches the corner of the adapter as shown in the following illustration. 1 Top Adjustable-Retaining Clip 7. Ensure that the bottom edge of the adapter is held by the groove in the lower part of the adapter-retaining arm.12 1 Top Adjustable-Retaining Clip 2 Long Adapter Retaining Arm Slot 972 406/90 Service Guide 8. To hold the top of the adapter, slide down the top adjustable adapter-retaining clip on the retaining arm. 1 1 Long Adapter Retaining Arm 9. If you installed a middle-spacer arm, ensure that the adapter-retaining clip is seated around the top edge of the adapter. 10. Depress the handle release lever on the handle. Rotate the adapter handle until it is in the down position (adapter or blank filler moves up into the cassette assembly). 21 1 Handle 2 Handle Release Lever Chapter 9. Removal and Replacement Procedures 973 Ensure the right end of the cassette linkage rail moves up into the cassette linkage assembly as shown:1 1 Right End of Cassette Linkage 11. Position the adapter and cassette assembly with the handle on the left (in the down position) and the top facing away from you.12 1 Handle 2 Top of Cassette 974 406/90 Service Guide 12. Install the cover on the cassette assembly by placing the cassette cover on the cassette assembly as shown. Slide the cover toward the handle until the hole in the cover aligns with the hole in the cassette assembly. 21 1 Handle 2 Hole Aligned in the Cover Chapter 9. Removal and Replacement Procedures 975 13. Install the bezel assembly using the following procedure: a. Carefully slide the bezel onto the cassette assembly.1 1 Bezel b. Align and insert the cover arm latch in the hooked notch in the bezel.12 1 PCI Adapter Pull Tab 2 Hooked Notch in the Bezel 976 406/90 Service Guide c. Align the top of the bezel assembly into the grooves on the top of the cassette assembly, and push the bezel onto the cassette linkage assembly until the tab on the top of the bezel is seated in the recess of the cassette assembly.1234 1 Recess 3 Tab 2 Grooves 4 Bezel d. Insert the two tabs on the bezel extension into the two slots on the cassette assembly.1 1 Tabs Chapter 9. Removal and Replacement Procedures 977 14. Check for the following: a. Ensure that the extension arm engages the three pins on the cassette. You can see the pins in the holes in the arm. The bezel tab is seated as shown in the following illustration.111 1 Pins b. Ensure that the cover arm latch is completely pressed into the hooked notch on the bezel as shown.12 1 PCI Adapter Pull Tab 2 Hooked Notch in the Bezel 978 406/90 Service Guide c. Turn the cassette assembly over so that the cover is facing up. Check the cover latch to ensure that it is holding the bezel to the cover as shown.1 1 Cover Latch 15. Position the adapter with the cover side facing up. Ensure that the holes are aligned, and insert the bushing as shown. 1 Bushing 2 Bushing-Lock Pin Chapter 9. Removal and Replacement Procedures 979 1 1 Bushing 2 Bushing-Lock Pin 16. Insert the bushing-lock pin into the hole in the bushing. Push on the pin until it seats.1 1 Location of the Installed Bushing and Bushing-Lock Pin 980 406/90 Service Guide 17. Depress the handle release lever to ensure the locking bar on the handle is pulled into the unlocked position. Raise the handle on the cassette linkage until it locks into the up position (the blank filler or adapter moves downward). 1 Handle 2 Locking Bar 18. To install the metal EMC shield on the adapter bracket, grasp the metal EMC shield as shown in the following illustration, and do the following: 12 1 Metal EMC Shield 2 Adapter Bracket Chapter 9. Removal and Replacement Procedures 981 a. Ensure that the shield slides up inside the top of the cassette.12 1 Metal EMC Shield 2 Adapter Bracket 982 406/90 Service Guide b. The metal EMC shield has clips that slide over the top of the tailstock. Ensure that these clips are holding the EMC shield to the tailstock.1 1 Top of Tailstock 19. Press the lock on the handle and rotate the adapter handle until the handle is in the down position (adapter or blank filler moves up into the cassette assembly). 1 1 Handle 20. Using your system documentation, determine if the adapter you are installing is hot-swappable. If the adapter is hot-swappable, move the slider on the color indicator to allow the orange color to be Chapter 9. Removal and Replacement Procedures 983 visible. If the adapter is not hot-swappable, the blue color is visible.12 1 Orange indicates Hot-Swappable 2 Blue indicates Not Hot-Swappable 21. The adapter is ready to be installed into a system or an I/O drawer. For more information, refer to your system documentation. 984 406/90 Service Guide Replacing a Double-Wide Blind-Swap Adapter Before installing a replacement adapter in a cassette that uses two adapter positions, referred to as a double-wide cassette, perform the following checks: v Check that the screws for the cassette are in place: – Across the male dovetail side – Holding the extension bracket – In the top center holding the cover – In the end of the top trackv Check to ensure that the: – Rail and the metal bezel are aligned – Rail is not caught on the top of the bezel – Tailstock is aligned with the bezelv Check the handle assembly to ensure that the: – Pivot block is not broken – Handle is set to the blue color – Handle gray bar is not brokenv Check that all the EMC brackets are intact and in place. v Check that the light pipes are intact and free from damage. v Check the bezel hole to ensure that the: – Handle is activated – Hole in the tailstock aligns with the hole in the bezelv Check the cover assembly to ensure that the cover end is held by the rear arm of the bezel assembly.To replace the adapter, use the following procedure: 1. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 2. When the handle is completely lowered, push the gray locking cross bar. 3. Hold the assembly straight on with the PCI slots. 4. Place the double-wide blind-swap cassette assembly with the lower leading corner between installed single-wide cassettes. 5. Slide the cassette partially into the guide. 6. Ensure the dovetail on the top track aligns with its mating component(s) on both sides. After the dovetails are engaged, push the cassette fully into the drawer. You will feel some resistance when the cassette is fully inserted. When the cassette is fully inserted, it aligns with the other installed cassettes. 7. If the cassette is fully inserted, press the button on the handle to unlock the gray crossbar. 8. Raise the handle with one hand and at the same time, use your other hand to press down on the metal tab (adapter tailstock tab just below the handle mechanism) until the handle latches. You should hear a click as you lift the handle all the way up. Note: To ensure that the PCI adapter is completely seated, you may have to release the handle and repeat this process while pressing on the tailstock tab. When you are sure the adapter is completely seated, be sure that the handle is latched. 9. Connect power to the system and go to ?MAP 0410: Repair Checkout? in the Diagnostic Information for Multiple Bus Systems. Refer to “Powering the System On” on page 841 if necessary. 10. Ensure that the adapter is configured when the system completes the boot process. Chapter 9. Removal and Replacement Procedures 985 I/O Subsystem RIO Riser Card Removal Note: The RIO Riser Card is part of the I/O Subsystem Backplane FRU. It cannot be ordered separately. 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Disconnect all cables from the RIO riser card. 3. Remove the I/O subsystem backplane, as described in “I/O Subsystem I/O Backplane Assembly” on page 989. 4. Remove the two screws that attach the RIO riser card to the I/O subsystem backplane. 5. Carefully pull the card out of the slot as shown in the following illustration. 1 RIO Riser Card Replacement Replace in reverse order. 986 406/90 Service Guide I/O Subsystem Distributed Converter Assembly (DCA) Notes: 1. If multiple I/O planars or I/O Drawer DCAs are being replaced during a maintenance window, then they must be replaced one at a time in order to ensure the I/O Drawer serial number does not become damaged. After a single part is replaced, the I/O drawer needs to have power applied to it and the system brought to the OK prompt. At this point, the system can be powered off to replace the next part. An algorithm is used (shortly after the UEPO is turned on) to overwrite the new part. If more than one part is replaced at a time, the VPD will not be updated and the system will not be able to boot in LPAR mode. 2. Each I/O subsystem can have up to two DCAs. You can replace each DCA separately. The removal and replacement procedures are identical for each DCA. 3. This procedure applies to the I/O subsystem DCAs. 4. If a subsystem has two DCAs, each is hot-pluggable. If only one needs to be removed, you do not have to power off the system. Do not attempt to replace two DCAs at once or to move DCAs from one I/O drawer to another to troubleshoot a problem. The machine type/serial number will be damaged when these actions are attempted.Removal DANGERDo not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit.D021. Open the rack front door 2. Remove the I/O subsystem front cover. 3. Locate the DCA to be replaced. 4. Proceed only if all the DCA’s LEDs go off or the repair has been scheduled. If the repair has been scheduled, power off the system as described in “Powering Off and Powering On the System” on page 838, then return here. 5. Press the green Start Service button on the UEPO switch. 6. Proceed only if all the DCA’s LEDs go off or the repair has been scheduled (System outage - EPO off), 7. Verify the cable labeling and locations, and then unplug the cables from the DCA. 8. Loosen the left and right captive thumbscrews, using the torque tool (P/N 6422789), alternating between the fasteners until the unit can slide out of the slot. Chapter 9. Removal and Replacement Procedures 987 9. Grasp both sides of the front DCA, and pull it out of the drawer. 1 I/O Subsystem 3 I/O Subsystem Front Cover 2 DCA Captive Thumbscrew (quantity 2) 4 Distributed Converter Assembly (DCA) Replacement 1. Rotate, counter clockwise, the two captive thumbscrews located on the front of the DCA. Ensure that the each thumbscrew has been rotated fully into the counter clockwise position. Attention: Rotating the thumbscrews counter clockwise engages the stop mechanism located on the bottom of the DCA. Failure to do so could result in the system powering off during a concurrent replacement operation. The following illustrations shows a DCA stop mechanism completely engaged. 1 DCA Captive Thumbscrew (quantity 2) 2a Stop Mechanism (disengaged) 2 Stop Mechanism (engaged) 3 Distributed Converter Assembly (DCA) 2. Carefully slide the DCA into the slot as far as possible (the back edge of the top of the DCA is about .25 inch from the cage bracket).Note: Do not force the DCA against the backplane, as this may damage the connector. 3. Hold the upper part of the DCA against the backplane, and use the torque tool to engage the left captive thumbscrew about two turns into the cage bracket. 4. Repeat the previous step for the right captive thumbscrew. 5. Alternate tightening the left and right thumbscrews until both are fully torqued in. 6. Check torque on both thumbscrews. 7. Plug all cables into their correct location in the DCA. 8. Press the white Service Complete button on the UEPO switch. 988 406/90 Service Guide I/O Subsystem I/O Backplane Assembly Removal Note: If multiple I/O planars or I/O Drawer DCAs are being replaced during a maintenance window, then they must be replaced one at a time in order to ensure the I/O Drawer serial number does not become damaged. After a single part is replaced, the I/O drawer needs to have power applied to it and the system brought to the OK prompt. At this point, the system can be powered off to replace the next part. An algorithm is used (shortly after the UEPO is turned on) to overwrite the new part. If more than one part is replaced at a time, the VPD will not be updated and the system will not be able to boot in LPAR mode. 1. Power off the system as described in “Powering Off and Powering On the System” on page 838. 2. Disconnect and label all cables from the adapters on the affected I/O backplane. 3. Remove and label adapters. (See “PCI Adapters” on page 914). 4. Remove the two screws securing the I/O backplane to the subsystem. 5. Insert the I/O subsystem backplane removal tool (part number 44P0549), and pry the tool backward to remove the board from the connector.Note: If the I/O subsystem is in the bottom position of the rack, the I/O subsystem may need to be removed from the rack to allow correct usage of the I/O subsystem backplane removal tool to install a new I/O backplane. 6. Slide the I/O subsystem backplane assembly to the rear.Note: Be sure to hold the board level to ensure that the board can clear the step in the cover and then again to clear the EMC gasket. 7. Remove the assembly. 12 1 Backplane removal tool (part number 11P4789) 2 Backplane removal and insertion tab Chapter 9. Removal and Replacement Procedures 989 Replacement 1. Align the I/O subsystem backplane assembly with the chassis opening and ensure that the roll bar clears the top-edge EMC gasket without damaging it. 2. Slide the I/O subsystem backplane assembly into the chassis. Ensure that the two pins in the chassis align with the bottom of the backplane. Align the backplane with the connector. 3. Continue to slide the backplane in slowly until it makes contact with the connector. 4. Upon contact, insert the insertion tool (part number 11P4789) to cam the backplane into the connector. 5. Pry with the tool until the screw holes on the chassis and the board align. 6. Secure with two screws. 7. Reinstall all adapters and reconnect all cables to the replaced I/O backplane. 990 406/90 Service Guide I/O Subsystem DASD Hard Disk Drive Assembly Note: Linux does not support hot-plugging any hot-pluggable PCI adapters or devices. A system with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter or device assigned to a Linux partition. Follow the non-hot-pluggable adapter or device procedures when replacing a PCI adapter or device in any partition with Linux installed. Removal 1. This procedure is a hot-swap procedure. Refer to the ?“SCSI Hot-Swap Manager” in the “Introduction to Tasks and Service Aids”chapter in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. Use the SCSI Hot-Swap Manager Task to prepare to remove and replace a SCSI disk drive. Then continue with the following steps. 2. Press on the blue latch to release it from hooking the body. 3. Move the black handle all the way down and pull out the assembly. A slight initial resistance is normal. 4. Ensure that the 4-pack has not been damaged. If the DASD assembly was installed incorrectly, the removal could force the metal tab in the 4-pack to bend over the camming slot. 1 DASD 4-Pack Assembly 4 Cover 2 I/O Subsystem Chassis 5 Disk Drive Assembly 3 EMC Filler Shield 6 DASD Filler Chapter 9. Removal and Replacement Procedures 991 Replacement Note: All four bays in each DASD 4-pack must be occupied. If a bay does not have a DASD assembly, it must be occupied by a filler assembly. 1. Inspect the DASD 4-pack to ensure it is not damaged. 2. Ensure that the lower retaining screws are inserted and secured. 3. Ensure that the front retaining screws are inserted and secured without interfering with the DCA. 4. Open fully the latch on the DASD assembly by pressing the blue handle and pulling the black handle. A thump signals that it is fully open. 5. With the handle to the bottom, insert the assembly into the appropriate slot. 6. Slide the assembly forward until you feel some resistance. Verify that the spring on the top is contacting the edge of the 4-pack. This should be the only resistance. 7. Push the assembly forward with slightly more force until you feel more resistance. Verify that the tab on the black handle is located above the cam slot in the 4-pack.Note: If the tab is not aligned with the slot, do not raise the handle. Remove the assembly and start again. 8. Lift the black handle to move the assembly in slightly. 9. Lift the handle and push forward until the blue latch hooks the assembly body. 992 406/90 Service Guide I/O Subsystem DASD 4-Pack Removal 1. Press the green Start Service button on the UEPO switch. 2. Locate the DASD backplane to be replaced. 3. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 4. Remove the front cover from the I/O subsystem by grasping the pins at either side of the cover, pulling them towards the center of the drawer, and simultaneously lifting the cover up and to the front to release it from the I/O subsystem chassis. 5. Remove the DASD disk drive assemblies or DASD filler(s) from the affected 4-pack as shown in the following illustration. 1 DASD 4-Pack Assembly 4 Cover 2 I/O Subsystem Chassis 5 Disk Drive Assembly 3 EMC Filler Shield 6 DASD Filler 6. Remove two M4x0.7 screws at the lower inside front of the DASD 4-pack with a short-blade screwdriver. 7. Remove two M4x0.7 hex socket-head cap screws at the lower inside rear of the DASD 4-pack. 8. Grasp the DASD 4-pack through the cutouts in the top of the I/O subsystem, and pull the DASD 4-pack out of the front of the I/O subsystem.Replacement 1. Perform steps 4 through 8 of the removal instructions in reverse order. 2. Turn on the power as described in “Powering Off and Powering On the System” on page 838. 3. Press the white Service Complete button on the UEPO switch. Chapter 9. Removal and Replacement Procedures 993 I/O Subsystem Fan Removal 1. Press the green Start Service button on the UEPO switch. 2. Locate the fan to be replaced. 3. Turn off the power as described in “Powering Off and Powering On the System” on page 838. 4. Remove the DASD cage from the front of the fan to be removed (see “I/O Subsystem DASD 4-Pack” on page 993). 5. Slide the black retention latch on the fan assembly upward and, with the handle located on the fan, pull the fan from the front of the I/O subsystem. A door automatically closes when the fan is removed, preventing loss of airflow while the fan is out of the enclosure.123 1 DCA (DCA 1 shown) 2 I/O Subsystem Chassis 3 Fan (Fan 4 shown) Replacement Note: All four openings to the front of the fans must be covered either by a DASD 4-pack or by a DASD 4-pack filler. Replace in reverse order. When you have completed the replacement of the DASD fan, turn on the power as described in “Powering Off and Powering On the System” on page 838, and press the white Service Complete button on the UEPO switch. This completes the replacement. 994 406/90 Service Guide I/O Subsystem Chassis Removal Attention: This process requires: v The use of a lift tool (part number 09P2481) and lift-plate assembly tool (part number 11P4369). v The lift tool wheels must be chocked to prevent the chassis from moving during the operation. The frame must not be on its casters, the leveling pads must be engaged to prevent the frame from moving during the operation. 1. Read all safety instructions before performing this operation. Unpack lift tool and follow assembly instructions included with the tool. Inspect the tool for damage (do not perform the remove procedure with a damaged lift tool). Replace the blue lift plate with the orange lift plate (P/N 11P4369). Orient two round holes in lift plate to the side of the lift tool that will be closest to the frame. 2. Turn off power as described in “Powering Off and Powering On the System” on page 838. 3. Unplug all power cables from the line power source. Attention: Because this procedure is performed with power removed from the system, references to using the white or green service buttons on the UEPO switch do not apply during the remainder of this procedure. 4. Open the rack doors and remove them from the rack. 5. Remove the I/O subsystem front cover. 6. If your system is configured with a half I/O drawer filler plate, remove the filler plate. 7. Verify that all cables are labeled. If not, then label all cables and components along with their location at the front and rear of your system. 8. Remove all DASD hard disk drives or DASD fillers as described in the removal section of “I/O Subsystem DASD Hard Disk Drive Assembly” on page 991. 9. Remove all DASD 4-pack or 4-pack filler assemblies as described in the removal section of “I/O Subsystem DASD 4-Pack” on page 993. 10. Remove all DCA units as described in the removal section of “I/O Subsystem Distributed Converter Assembly (DCA)” on page 987. 11. Remove all I/O subsystem fan assemblies as described in the removal section of “I/O Subsystem Fan” on page 994. 12. Remove all PCI adapter cassettes as described in the removal section of “PCI Adapters” on page 914. 13. Remove the I/O backplanes (up to 2) as described in the removal section of “I/O Subsystem I/O Backplane Assembly” on page 989. 14. Remove the screws that attach the I/O subsystem chassis to the rack. 15. Position the lift tool at the front of the rack. Align the lift tool so that I/O subsystem chassis will not be blocked (at the lip of the orange lift plate) from removal. Install wheel chocks on the lift tool casters. Engage the lift tool brake. 16. Raise the plate to a height such that the lift plate is just below the base of the I/O subsystem chassis you are removing. Release pins on the extender arms and position them on both left and right side. Arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 17. One person should gently pull and guide the I/O subsystem chassis onto the lift plate. The I/O subsystem chassis must be removed far enough to clear the front face of the frame. Retract the extender arms. 18. Lower I/O subsystem chassis to the bottom position of the lift tool. 19. Remove the wheel chocks and release the brake. Flip the crank handle 180 degrees. Carefully move the lift tool with the I/O subsystem chassis to the staging area. The brake and wheel chocks must be in place when transferring the I/O subsystem chassis to or from the packaging material. Chapter 9. Removal and Replacement Procedures 995 Front: 1 I/O Subsystem Chassis 6 SCSI DASD Disk Drives 2 Fan assembly 7 DASD-4 Pack Cage 3 DASD 4-Pack Filler 8 DCA 4 DASD Filler 9 Half I/O drawer filler plate 5 Front Cover Rear: 1 PCI I/O Subsystem Chassis 3 Blind-swap cassette or PCI filler with no adapter 2 RIO Riser Card, part of the I/O subsystem backplane assembly 4 PCI I/O Backplane 996 406/90 Service Guide Replacement Notes: 1. If you are replacing this assembly with a new assembly, the manufacturing label with the machine type, model number, and the serial number for the subsystem must be ordered to apply to the new part. For information on replacement labels, see Chapter 10, “Parts Information,” on page 1009. 2. Before beginning the replacement procedure, install rails in the frame if they are not already installed. 1. Position the lift tool such that the new I/O subsystem chassis can be moved onto the lift plate. 2. Install the wheel chocks and engage the brake. 3. Move the new I/O subsystem chassis onto the tool with its rear (PCI adapter end) flush with the lift plate. 4. Position the lift tool as close as possible to the front of the rack. Install wheel chocks on the lift tool casters. Engage the lift tool brake. 5. Raise the plate to a height such that the lift plate is just above the rails onto which the I/O subsystem chassis will be installed. Release pins on the extender arms and position them on both left and right side. Arms must fit around the frame extenders and lock in place (do not adjust the lift tool after these arms are locked in place). 6. One person should gently push the I/O subsystem chassis toward the frame from the front of the system. Adjust the height of the lift plate, as needed. After the I/O subsystem is installed in the frame, retract the extender arms. 7. Lower the lift plate and remove it from the lift tool. Repackage the lift tool. 8. Align the I/O subsystem chassis, then secure it to the rack with retaining screws. 9. Replace the I/O backplanes (up to 2) as described in the replacement section of “I/O Subsystem I/O Backplane Assembly” on page 989. 10. Replace all PCI adapter cassettes or fillers as described in “PCI Adapters” on page 914. 11. Replace all I/O subsystem fan assemblies as described in the replacement section of “I/O Subsystem Fan” on page 994. 12. Replace all DCA units as described in the replacement section of “I/O Subsystem Distributed Converter Assembly (DCA)” on page 987. 13. Replace all DASD 4-pack or 4-pack filler assemblies as described in the replacement section of “I/O Subsystem DASD 4-Pack” on page 993. 14. Replace all DASD hard disk drives or DASD fillers as described in the replacement section of “I/O Subsystem DASD Hard Disk Drive Assembly” on page 991. 15. Reconnect all cables to the front and rear of your system. 16. If your system is configured with a half I/O drawer filler plate, remove the filler plate. 17. Remove the I/O subsystem front cover. Chapter 9. Removal and Replacement Procedures 997 Media Subsystem CD-ROM, DVD-RAM, or Tape Drive Attention: DVD-RAM and 4-mm tape drives can be located in the rear of the media subsystem only if the room is maintained at a nominal room temperature of 24°C (75.2°F). Attention: Do not open the drive; no user adjustments or serviceable parts are inside. CAUTION: All laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation, user maintenance, or prescribed service conditions. Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. Only trained service personnel should perform the inspection or repair of optical fiber cable assemblies and receptacles.C25, C26 Removal 1. Ensure that the SCSI drive is in idle mode (the activity light remains off). 2. Loosen the thumbscrew that holds the SCSI drive assembly in the subsystem (the top screw is for the drive to the right, and the bottom screw is for the left drive).2SCSISCSI314 3. Grasping the thumbscrew and not the bezel, carefully pull the drive assembly out until you can access the cables at the rear. Attention: When performing the next two steps, always disconnect the power cable before disconnecting the SCSI cable. 4. Disconnect the power cable from the rear of the drive assembly. 5. Disconnect the SCSI cable from the rear of the drive assembly. 6. Remove the four mounting screws that hold the drive or blank to the carrier bracket and remove it from the carrier.Replacement Note: Ensure that the cables are routed through the grommet at the rear of the carrier. Attention: When replacing a SCSI drive, always connect the SCSI cable before connecting the power cable. Replace in reverse order. 998 406/90 Service Guide Diskette Drive Removal 1. Ensure that the diskette drive is in idle mode (the activity light remains off). 2. Loosen the thumbscrew that attaches the diskette carrier to the media subsystem.2SCSISCSI314 3. Carefully grasping the thumbscrew and not the bezel, pull the drive assembly out until you can access the cables at the rear. Attention: When performing the next two steps, always disconnect the power cable before disconnecting the diskette signal cable. 4. Disconnect the power cable from the rear of the drive assembly. 5. Disconnect the diskette signal cable from the rear of the drive assembly. 6. Remove the four screws that connect the carrier to the diskette drive.Replacement Replace in reverse order. Attention: When replacing a diskette drive, always connect the diskette signal cable before connecting the power cable. Chapter 9. Removal and Replacement Procedures 999 Operator Panel Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Remove the upper-left and lower-right screws that hold the media subsystem in the rack, and pull the subsystem out no more that two inches. 3. Pry the spring clip on the left side just enough to release the panel while grasping the operator panel by the top and bottom front (do not pull on the bezel). Pull the operator panel out of the chassis. 4. Note the locations and disconnect the cables from the operator panel, and then remove the operator panel as shown in the following illustration. Replacement 1. Remove the VPD module from the old operator panel and insert it in the new operator panel. Make sure you insert the module in the same position as it was in the old panel. 2. Reconnect the cables to the operator panel. 3. Push the operator panel into the chassis until the spring clip engages. 4. Push the chassis into the rack, and replace the two mounting screws. 5. Turn on the power as described in “Powering the System On” on page 841. 1000 406/90 Service Guide Media Subsystem Chassis Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. Label and disconnect all cables from the rear of the media subsystem.28361SCSISCSI754 3. Remove the screw that holds the left side of the media subsystem chassis to the rack. 4. Remove the screw that holds the right side of the media subsystem chassis to the rack.2SCSISCSI314 5. Carefully pull the chassis out of the front of the rack.Replacement Replace in reverse order. If the new media drawer contains a new operator panel, remove the VPD module from the old operator panel and insert it in the new operator panel. Make sure you insert the module in the same position as it was in the old panel. Be sure to move any other parts or features from the old media drawer to the new media drawer if they are not included as part of the new media drawer. Chapter 9. Removal and Replacement Procedures 1001 Media Subsystem Internal Cables Removal 1. Turn off power as described in “Powering Off and Powering On the System” on page 838. 2. If you have not already done so, open the front door of the rack unit. 3. Remove the media subsystem from the rack as described in “Media Subsystem Chassis” on page 1001, and place it on a work surface. 4. Remove the six screws and the eight 7-mm nuts that hold the cover to the subsystem, as shown in the following illustration. 5. Loosen the four captive screws (one for each SCSI drive bay) to free the two truss brackets. 6. Remove the cover. 7. Release the ties that hold the affected cable. If ties must be cut, cut them from the underside of the chassis. 8. Disconnect the cable connector from the chassis. 9. Note the cable routing, and remove the cable from the subsystem chassis.Replacement Replace in reverse order.Note: Be sure to secure the cables with the ties as they were originally installed. Allow enough length for a service loop in case the drives are removed in the future. 1002 406/90 Service Guide p690 Cable Installation and Management In some configurations, cables that are not properly supported might damage the associated connectors. Connector damage can be prevented by following the procedures described in this section. These procedures must be used for installing I/O cables, administrative LAN cables, and High Performance Switch (HPS) cables in a M/T 7040-W42 frame. Note: Planning information for switch cable installation is detailed in ?Appendix D. Cabling the pSeries HPS? of the 45H/W4 Switch Planning, Installation, and Service Guide. This section describes the procedures you must follow when installing a 7040 frame, especially with administrative LAN and High Performance Switch cables. Failure to follow these procedures might damage network components. The objectives of these procedures are to: v Provide HPS cables with continuous support and strain relief. v Minimize the possibility of pin damage to the SNI ports. v Form consistent cable groups. v Maintain efficient cooling.A high-level workflow for cable installation is as follows: 1. Install the frame. 2. Route all signal cables up through the tailgate. 3. Separate cables into groups based on their location and size.Note: If a High Performance Switch is in the configuration, the cables must not be connected to SNIs or switch port connection cards until the weight of each cable is fully supported. 4. Make certain that cable groupings do not interfere with PCI adapter slots. 5. Make certain that the cable bend radius allows clearance for closing the rear door but is not so sharp that it places excessive strain on the cable or the associated connectors. 6. Cable the frame starting with the upper cable connections and working down. 7. Apply tailgate insert clamp supports for the cables from left to right.The following illustration shows the tailgate assembly and standard frame extenders of a properly cabled frame. Refer to the service guide for more parts information. Chapter 9. Removal and Replacement Procedures 1003 Note: Do not connect any High Performance Switch cables (if installed) until instructed to do so. 1. Move the frame into position over the floor tile cutouts. 2. If you have not already done so, install the frame using the appropriate installation instructions. 3. Verify that the cable management bracket (tailgate) is properly installed on the base at the rear of the frame, if not: a. Remove the shipping bracket from the rear of the frame. b. Install the stiffener into the rear cutout on the base of the frame. c. Attach the cable management bracket (tailgate) to the stiffener. d. Make sure that the extender is available for installation after all cables have been installed.4. Route all HMC, Ethernet, I/O, and HPS cables up through the tailgate.Notes: a. Main power cables do not pass through the tailgate assembly. 1 Rear Cutout 4 Cable Management Backet 2 Extender 5 Stiffener 3 Tailgate Insert 1004 406/90 Service Guide b. High Performance Switch cables must not be connected to SNIs or switch port connection cards until the cable weight is supported. The following illustration shows a properly cabled frame. Before starting this part of the procedure: v Verify that all HMC, Ethernet, I/O, and HPS cables (if present) are inside the tailgate. v Cables are clamped into the tailgate from left to right. If HPS cables are present, locate these cables closer to the right side of the tailgate by using the foam-block tailgate inserts. Chapter 9. Removal and Replacement Procedures 1005 – Install one or more foam-block tailgate inserts into any open slot (refer to the tailgate illustration above).Note: Do not connect any HPS cables until instructed to do so. After the preliminary steps are complete: 1. Sort the HPS cables into groups based on SNI port location, but do not clamp or connect cables until instructed to do so. An overview of the next few steps is as follows: a. Switch cables will be clamped into the tailgate in groups of either four or two. b. High Power Switch cables get clamped into the next available tailgate slot. c. Continue this cable pattern working down the frame and clamping to the right in the tailgate.2. Before routing the switch cables, position the HMC, Ethernet, and I/O cables but do not clamp these cables into the tailgate until instructed to do so. a. Route the HMC and Ethernet cables to the appropriate connectors and connect each cable. v If present, make certain the HMC and Ethernet cables do not interfere with the SNI ports.b. Wire tie the HMC and Ethernet cables into a bundle running down the side of the frame. c. Wire tie any I/O cables into a bundle running down the side of the frame.3. If HPS cables are present, without plugging the switch cables into the SNI ports, take the switch cables and raise them into position. a. Check any cable labels and verify that you have the correct cables for each SNI port. b. Make certain that the cable for the upper SNI port extends further than the cable for the lower SNI port. c. Place a wire tie far enough down the cable set so that the cables will have sufficient length for a gentle bend radius. d. Wire tie the cables together.4. Clamp any HPS cables into the next available tailgate slot using a rubber-faced, tailgate insert with the appropriate thickness of rubber. 5. If they are used, clamp the I/O cables into the tailgate using tailgate inserts. Depending on the number and size of the I/O cables, you may have to use one or two tailgate slots and clamping inserts. 6. Clamp the HMC and Ethernet cables into an available tailgate slot using a tailgate insert. 7. Insert foam-block tailgate inserts into any remaining tailgate slots.If you have properly cabled the frame, the cable layout should look like the following illustration. 1006 406/90 Service Guide HMCandEthernetcablesHMCandEthernetcablesI/Ocables(ifused)I/Ocables(ifused)I/Ocables(ifused)I/Ocables(ifused)Switchcablestoframeslots8and7Switchcablestoframeslots6and5Switchcablestoframeslots4and3Switchcablestoframeslot2Switchcablestoframeslot1Switchcablestoframeslots8and7Switchcablestoframeslots6and5Switchcablestoframeslots4and3Switchcablestoframeslot1Switchcablestoframeslot2Cablemanagementbracket(tailgate)Foamblocktailgateinsert Chapter 9. Removal and Replacement Procedures 1007 1008 406/90 Service Guide Chapter 10. Parts Information This chapter contains parts information for the 406/90, which includes: v 406/1R Rack Subsystem v 406/81 Processor Subsystem v 406/1D I/O Subsystem 1009 406/1R Rack Subsystem The rack subsystem consists of the rack, power subsystem, and the integrated battery feature. 17953281064 11 - Integrated Battery Feature (not pictured) 1010 406/90 Service Guide Index Part Number Units Per Assembly Description 1 44P010744P0127 1 Rear DoorRear Door (Accoustic) 2 11P394844P012544P012611P289011P2923 11112 42U FrameCover, side leftCover, side, rightTop hatShipping bar 3 44P042644P0434 1 Front Door Primary RackFront Door Secondary Rack 11P109711P1096 1 Front Latch Primary RackFront Latch Secondary Rack 11P1093 2 Latch Bracket 44P1152 1 Vertical Extender Latch Side 44P1153 1 Extender Bottom Front 44P1154 1 Extender Bottom Rear 11P4106 2 Door Hinge 11P3990 1 Door key 54G2882 12 Screw 11P3535 2 Hinge 2665525 Bolt 4 44P1124 1 Universal Emergency Power Off (UEPO) Switch 5 44P1958 2 Bulk Power Enclosure (BPE) 6 44P1959 4 Bulk Power Distribution (BPD) 7 11P3645 2 Bulk Power Controller (BPC) 11P1597 Y-Cable (CSP) 8 44P1995 6 Bulk Power Regulator (BPR) Note: Do not mix (BPR) 44P1995 in the same system with (BPR) 44P2645. 8 44P2645 6 Bulk Power Regulator (BPR) Notes: 1. To use this BPR your firmware must be at 3H030916 or later.Note: Do not mix (BPR) 44P2645 in the same system with (BPR) 44P1995. 9 11P1787 2 Bulk Power Fan (BPF) 11J5335 2 Bulk Power Assembly Rail 10 11P0287 1 Fan Cover Plate 11 11P3732 Up to 6 Integrated Battery Feature (IBF) 44P2471 1 Capacity card for power subsystem 12 31L8302 (Not Shown) Rack Joining Kit - Contains bolts, washers, and standoffs to be used when bolting two or more rack cabinets together. Chapter 10. Parts Information 1011 406/81 Processor Subsystem (Front) This processor subsystem front view consists of the processor chassis, memory books, fans, and filters. 3214 1012 406/90 Service Guide Index Part Number Units Per Assembly Description 1 00P4015 1 Processor subsystem chassis (with backplane) 2 11P2935 1 to 8 Memory book retainerSee “406/81 Processor Subsystem (Front)” on page 1014 for Memory books part numbers. 3 31L7627 4 Fan Controller (MDA) 07H5349 4 Fans 4 11P3781 2 Air filter Chapter 10. Parts Information 1013 406/81 Processor Subsystem (Front) This processor subsystem front view consists of the memory books, MCM and L3 modules. 1014 406/90 Service Guide Index Part Number CCIN Units Per Assembly Description 1 00P4015 25C0 1 System Backplane 2 NA Protective Cover (Included as part of the L3 Cache Module FRU.) Note: Do not discard the protective cover. Use it when returning L3 FRUs. 3 00P2839 25D9 4 to 16 128 MB (4x32)/400 MHz L3 Cache kitIncludes: v PN 21P7972 400 MHz VPD card (CCIN 25D9) v PN 11P3873 L3 Cache Module v PN 04N6923 Brush v PN 11P4055 Label v PN 44P0360 Mech Spring Kit 00P2849 25F1 4 to 16 128 MB (4x32)/433 MHz L3 Cache kitIncludes: v PN 21P788175 433 MHz VPD card 433 MHz VPD card (CCIN 25F1) v PN 11P3874 433 MHz L3 Cache Module v PN 04N6923 Brush v PN 11P4055 Label v PN 44P0360 Mech Spring Kit 00P6346 500 MHz L3 Cache kitIncludes: v PN 00P5159 VPD Card v PN 04N6923 Pad Cleaning Brush v PN 11P4055 Label 00P6258 25E7 600 MHz L3 Cache kitIncludes: v PN 00P4355 VPD Card v PN 04N6923 Pad Cleaning Brush v PN 11P4055 Label 4 21P4589 Load Post Sleeve 5 11P2788 Rear Spring Plate 6 11P2785 Front Spring Plate with Set Screw 7 00P4668 272D Up to 3 MCM Passthru Module 8 53P4833 25C1 1 Clock Card 9 00P2821 25D5 1 to 4 1.1 GHz 8-way MCM with VPD card kitIncludes: v PN 11P4952 3.2 STND MCM Module (and VPD Smart Card, CCIN 25D5 ) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label 00P2796 25C7 1 to 4 1.3 GHz 4-way MCM with VPD card kitIncludes: v PN 11P4949 3.2 Fast MCM Module(and VPD Smart Card, CCIN 25C7)orPN 11P4950 3.2 Fast MCM Module(and VPD Smart Card, CCIN 25C7) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label Chapter 10. Parts Information 1015 Index Part Number CCIN Units Per Assembly Description 00P2818 25D3 1 to 4 1.3 GHz 8-way MCM with VPD card kitIncludes: v PN 11P4951 3.2 Fast MCM Module (and VPD Smart Card, CCIN 25D3) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label 00P4629 25CF 1 to 4 1.5 GHz 8-way processorIncludes: v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label 00P4687 25E9 1.7 GHz 8-way processorIncludes: v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label 00P2833 526C Up to 3 CUoD 1.1 GHz Processor MCM Includes: v MCM Module (and VPD Smart Card, CCIN 527C) v etc... 00P2834 526D Up to 3 CUoD 1.3 GHz Processor MCMIncludes: v MCM Module (and VPD Smart Card, CCIN 527A) v PN 04N6923 Brush v PN 11P3876 Screw v PN 11P4055 Label 80P3442 528B CUoD 1.3 GHz MCM and VPD smart chip card 00P5882 526B525E CUoD 1.5 GHz Processor MCMIncludes: v PN 00P4351 Capacity Card v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label 80P3428 529B CUoD 1.5 GHz MCM and VPD smart chip card 00P5883 525F526E CUoD 1.7 GHz Processor MCMIncludes: v PN 00P4353 Capacity Card v PN 04N6923 Brush v PN 11P3867 Screw v PN 11P4055 Label 80P3438 529C CUoD 1.7 GHz MCM and VPD smart chip card 80P2941 25B0 CUoD 1.9 GHz MCM and VPD smart chip card 80P2936 25CC 1.9 GHz L3 VPD card 53P6170 527C 1 CUoD 1.1 GHz Processor MCM Capacity Card 1016 406/90 Service Guide Index Part Number CCIN Units Per Assembly Description 53P6168 527A 1 CUoD 1.3 GHz Processor MCM Capacity Card 80P3708 527D CUoD 1.9 GHz Processor Capacity Card 10 11P2793 Up to 12 L3 Filler 11 53P560353P562797P346897P3489 53P5604 53P562897P347197P349253P560653P563097P347497P349553P560853P563297P347997P350097P348497P350597P403297P4035 305C305D30E030E1302A 302B30E2 30E330E4 30E5305E305F302C302D30E630E730E830E930EE30EF 1 to 8 4 GB Memory Book (Inner)4 GB Memory Book (Outer)4 GB Memory Book (Inner)4 GB Memory Book (Outer)8 GB Memory Book (Inner)8 GB Memory Book (Outer)8 GB Memory Book (Inner)8 GB Memory Book (Outer)16 GB Memory Book (Inner)16 GB Memory Book (Outer)16 GB Memory Book (Inner)16 GB Memory Book (Outer)32 GB Memory Book (Inner)32 GB Memory Book (Outer)32 GB Memory Book (Inner)32 GB Memory Book (Outer)64 GB Memory Book (Inner)64 GB Memory Book (Outer)128 GB Memory Book (Inner)128 GB Memory Book (Outer) 53P424253P4273 53P424753P427853P425253P4283 53P425953P429053P426653P4297 307A308A307B308B307C308C307D308D307E308E 1 to 8 4 GB Memory Book I, 567 MHz4 GB Memory Book O, 567 MHz8 GB Memory Book I, 567 MHz8 GB Memory Book O, 567 MHz16 GB Memory Book I, 567 MHz16 GB Memory Book O, 567 MHz32 GB Memory Book I, 567 MHz32 GB Memory Book O, 567 MHz64 GB Memory Book I, 567 MHz64 GB Memory Book O, 567 MHz 53P425453P428597P347697P349753P426153P429297P348197P3502 30BD30C630EA30EB30C130CA30EC30ED CUoD 8/16 GB Memory Book ICUoD 8/16 GB Memory Book OCUoD 8/16 GB Memory Book ICUoD 8/16 GB Memory Book OCUoD 16/32 GB Memory Book ICUoD 16/32 GB Memory Book OCUoD 16/32 GB Memory Book ICUoD 16/32 GB Memory Book O 12 44P4420 1 Aluminum Backplane Stiffener 13 44P4415 2 Steel Alignment Rods 14 Use the 1.7 GHz MCM FRU part number when ordering this FRU. MCM (with copper heat sync) Chapter 10. Parts Information 1017 406/81 Processor Subsystem (Rear) This processor subsystem rear view consists of the RIO books, DCAs, and capacitor books, MCM and L3 modules, as well as the media subsystem. 1018 406/90 Service Guide Index Part Number CCIN Units Per Assembly Description 1 53P1573 2738 1 Fourth I/O book without RIO-2 ports 53P4554 2806 1 Fourth I/O book with RIO-2 ports 11P2936 I/O book retainer 2 44P3024 1 Distributed converter assembly DCA1 11P1543 1 Cable assembly (DCA1) P00 11P1544 1 Cable assembly (DCA1) P01Attention: Do not plug DCA PN 11P4741 into the U1.18-P1-V1 location. 3 44P3024 1 Distributed Converter Assembly DCA2 11P1545 1 Cable assembly (DCA2) P00 11P1546 1 Cable assembly (DCA2) P01Attention: Do not plug DCA PN 11P4741 into the U1.18-P1-V2 location. 4 11P1540 1 Capacitor card 44P2471 1 Capacitor card with LGA test support 5 44P3024 1 Distributed converter assembly DCA3 11P1547 1 Cable assembly (DCA3) P00 11P1548 1 Cable assembly (DCA3) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V4 (DCA3) location. 6 44P3024 1 Distributed converter assembly DCA4 11P1549 1 Cable assembly (DCA4) P00 11P1550 1 Cable assembly (DCA4) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V5 (DCA4) location. 7 44P3024 1 Distributed converter assembly DCA5 11P1551 1 Cable assembly (DCA5) P00 11P1552 1 Cable assembly (DCA5) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V6 (DCA5) location. 8 11P1540 1 Capacitor card 44P2471 1 Capacitor card with LGA test support 9 44P3024 1 Distributed converter assembly DCA6 44P4085 1 Cable assembly (DCA6) P00 44P4086 1 Cable assembly (DCA6) P01Attention: DCA PN 11P4741 can be plugged into U1.18-P1-V8 (DCA6) location. 10 09P5687 273C 1 Primary I/O book without RIO-2 ports 09P3462 273D Primary I/O book with RIO-2 ports Chapter 10. Parts Information 1019 Index Part Number CCIN Units Per Assembly Description 11P2936 I/O book retainer 16G8095 1 TOD Battery (inside primary I/O book) 44P0206 1 9-9 pin serial cable connects the managed system primary I/O book to the HMC 31L7196 1 9-25 pin serial cable connects the managed system primary I/O book to HMC async adapter 11 53P1573 2738 1 Third I/O book without RIO-2 ports 53P4554 2806 1 Third I/O book with RIO-2 ports 11P2936 1 I/O book retainer 12 53P1573 2738 1 Secondary I/O book without RIO-2 ports 53P4554 2806 1 Secondary I/O book with RIO-2 ports 11P2936 I/O book retainer 1020 406/90 Service Guide 406/81 Media Subsystem 12345678910 Chapter 10. Parts Information 1021 Index Part Number Units Per Assembly Description 1 24L108905G2882 1 Operator PanelScrew 2 See Note 1 Optional Media Device (SCSI Drive) 3 See Note 1 or 2 CD-ROM Drive (SCSI Drive) 4 76H4091 1 Diskette Drive 5 44P1799 1 Operator panel cable (J05) 6 44P18005420242 1 Diskette Drive Signal Cable (J04)Cable Tie 7 11P42155420242 2 Media Device SCSI Cables from J01 to rear drives, J03 to front drivesCable Tie 8 11P42145420242 2 Media Device Power Cables (J00 to rear drives, J02 to front drives)Cable Tie 9 See Note 1 Optional Media Device (SCSI Drive) 10 See Note 1 Optional Media Device (SCSI Drive) 44P2566 Up to 3 Media Bay Cover 44P1055 Cassette, PCI adapter 44P3592 Cassette, PCI adapter (type 2) 97P2686 1 Ultra320 SCSI 44P1055 1 Ultra320 SCSI cassette 44P0239 1 BSM Note: See Diagnostic Information for Multiple Bus Systems for part numbers. 1022 406/90 Service Guide Power and SCSI Cables to the Media Subsystem 123546 Chapter 10. Parts Information 1023 Index Part Number Units Per Assembly Description 1 11P4301 1 Cable Connecting the Operator Panel to the Media Subsystem 2 11P4300 1 Cable Connecting the Diskette Drive to the Media Subsystem 3 11P2352 1 Power Cable for Media Subsystem Front Components 4 11P4303 1 SCSI Cable for Media Subsystem Rear Components 5 11P4303 1 SCSI Cable for Media Subsystem Front Components 6 11P2351 1 Power Cable for Media Subsystem Rear Components 1024 406/90 Service Guide 406/1D I/O Subsystem The system can contain up to eight I/O subsystems. This section lists detail for one I/O subsystem. Depending on your configuration, the quantities of parts listed may increase by the actual number of I/O subsystems in your system. Front Index Part Number Units Per Assembly Description 1 44P0705 1 I/O Subsystem Chassis 07H5247 2 Frame Rail 2 11P4624 1 to 4 Fan assembly 3 44P1277 1 to 4 DASD 4-Pack Filler 4 11P3662 Up to 15 DASD Filler 5 44P0189 1 Front Cover 6 See Note 1 to 16 SCSI DASD Disk Drives 7 11P485511P243611P3457 4 DASD-4 Pack Cage/Backplane AssemblyScrew - used on the DASD backplane assemblyScrew 11P485511P3667 4 DASD-4 Pack Cage/Backplane AssemblyScrew - used on the DASD backplane assembly 8 11P3582 2 DCA Note: See Diagnostic Information for Multiple Bus Systems for part numbers. Chapter 10. Parts Information 1025 Rear 123456 Index Part Number Units Per Assembly Description 1 44P0705 1 PCI I/O Subsystem Chassis 1 44P2285 1 PCI-X I/O Subsystem Chassis 2 NA 2 RIO Riser Card, part of the I/O subsystem backplane assembly 3 11P4861 1 to 20 PCI Adapter Cassette Assembly 44P0239 PCI Adapter Cassette Assembly 11P4089 1 to 20 PCI Cassette Filler 4 44P1055 1 to 20 Blind-swap cassette, for standard PCI adapter 44P1390 As needed Blind-swap cassette, for two-adapter-wide PCI adapter 44P3320 As needed Blind-swap cassette, for Gigabit Ethernet PCI adapter 5 44P032344P032444P032144P0406 As needed Adapter clip, used with blind-swap cassette: for Gigabit Ethernet FC 2969 type 9U for Ultra3 SCSI FC 6203 Type 4Y for 10/100 BaseT Ethernet FC 4962 Type AF for Gigabit Fibre Channel PCI FC 6228 Type 4W 11P4861 1 to 20 PCI filler, for slots with no adapter 6 44P1414 1 to 2 PCI I/O Backplane 44P1970 1 to 2 PCI-X I/O Backplane (Not Shown) 21P5454 RIO-2 Remote I/O-2 Cable, 1.2 M 1026 406/90 Service Guide Power Cabling Chapter 10. Parts Information 1027 Index Part Number Units Per Assembly Description 1 NA11P1598 Bulk Power Distribution on the A side of the system (BPD A Side)Cable A35-BPC-P00 to B35-BPC-P00 2 NA11P1598 Bulk Power Distribution on the B side of the system (BPD B Side)Cable B35-BPC-P00 to A35-BPC-P00 3 NA 1 A35-BPC-P03 to Bulk Power Fan P1-F1, Cable attached to FAN (BPF-A) 4 NA 1 B35-BPC-P03 to Bulk Power Fan P2-F1, Cable attached to FAN (BPF-B) 5 11P4449 1 B35-BPC-P08 to DCA-1-P01, Red 6 11P4448 1 A35-BPC-P08 to DCA-1-P00, Red with white stripe 7 11P4451 1 B35-BPC-P09 to DCA-2-P01, Red 8 11P4450 1 A35-BPC-P09 to DCA-2-P00, Red with white stripe 9 11P4453 1 B35-BPC-P10 to DCA-3-P01, Red 10 11P4452 1 A35-BPC-P10 to DCA-3-P00, Red with white stripe 11 11P4455 1 B35-BPD1-P08 to DCA-4-P01, Red 12 11P4454 1 A35-BPD1-P08 to DCA-4-P00, Red with white stripe 13 11P4457 1 B35-BPD1-P09 to DCA-5-P01, Red 14 11P4456 1 A35-BPD1-P09 to DCA-5-P00, Red with white stripe 14a 1 A35-BPD1-P06 to DCA-6-P01, Red with white stripe 14b 1 A35-BPD1-P06 to DCA-6-P00, Red 15 to 17 11P4734 1 I/OB9-P00 to Media P02 16 to 18 11P4735 1 I/OB9-P01 to Media P00 19, 20,21,22 NA B35-BPDx-P0x to I/O Subsystem #x “RIO Cables and I/O Power Cables” on page 1029 23 11P4462 1 B35-BPC-P05 to (AMD2-P01) MDA2-J02, Red 24 11P4461 1 A35-BPC-P05 to (AMD2-P00) MDA2-J01, Red with white stripe 25 11P4460 1 B35-BPC-P04 to (AMD1-P01) MDA1-J02, Red 26 11P4459 1 A35-BPC-P04 to (AMD1-P00) MDA1-J01, Red with white stripe 27 11P4463 1 A35-BPC-P06 to (AMD3-P00) MDA3-J01, Red with white stripe 28 11P4464 1 B35-BPC-P06 to (AMD3-P01) MDA3-J02, Red 29 11P4466 1 A35-BPC-P07 to (AMD4-P01) MDA4-J02, Red 30 11P4465 1 B35-BPC-P07 to (AMD4-P00) MDA4-J01, Red with white stripe 31 11P1596 1 B35-BPC-P02 to UEPO J01, Red 32 11P1595 1 A35-BPC-P02 to UEPO J00, Red with white stripe 33 05N6740 1 A35-BPR-P00 to A13-IBF1-P00B35-BPR-P00 to B13-IBF2-P00 34 05N6740 1 A35-BPR-P00 to A13-IBF1-P00B35-BPR-P00 to B13-IBF2-P00 35 11P2998 1 A35-BPR-P00 to A-IBF3-P00 (Secondary Frame) 35 11P4248 1 B35-BPR-P00 to B-IBF3-P00 (Secondary Frame) 1028 406/90 Service Guide RIO Cables and I/O Power Cables Index Number FRU Part Number Units Per Assembly Description 11P2355 1 RIO cable 0.5-meter, per drawer 11P0272 Up to 2 RIO cable 2-meter, per drawer 44H9137 Up to 2 RIO cable 15-meter, per drawer 00P5238 Up to 2 RIO-2 cable 1.75-meter 00P5239 Up to 2 RIO-2 cable 2.5-meter 21P5477 Up to 2 RIO to RIO-2 cable 6-meter, per drawer 53P5242 1 RIO to RIO-2 cable 1-meter, per drawer 53P5243 Up to 2 RIO to RIO-2 cable 3-meter, per drawer 53P5454 1 RIO-2 to RIO-2 cable 1-meter, per drawer 53P2675 Up to 2 RIO-2 to RIO-2 cable 8-meter, per drawer 53P2676 Up to 2 RIO-2 to RIO-2 cable 3.5-meter, per drawer 11P4490 1 I/O Subsystem 1 power cable (DCA1-P01) 11P4491 1 I/O Subsystem 1 power cable (DCA1-P00) 11P4492 1 I/O Subsystem 1 power cable (DCA2-P01) 11P4493 1 I/O Subsystem 1 power cable (DCA2-P00) 11P4494 1 I/O Subsystem 2 power cable (DCA1-P01) 11P4495 1 I/O Subsystem 2 power cable (DCA1-P00) 11P4496 1 I/O Subsystem 2 power cable (DCA2-P01) 11P4497 1 I/O Subsystem 2 power cable (DCA2-P00) 11P4498 1 I/O Subsystem 3 power cable (DCA1-P01) 11P4499 1 I/O Subsystem 3 power cable (DCA1-P00) 11P4500 1 I/O Subsystem 3 power cable (DCA2-P01) 11P4501 1 I/O Subsystem 3 power cable (DCA2-P00) 11P4502 1 I/O Subsystem 4 power cable, primary rack (DCA1-P01) 11P4503 1 I/O Subsystem 4 power cable, primary rack (DCA1-P00) 11P4504 1 I/O Subsystem 4 power cable, primary rack (DCA2-P01) 11P4505 1 I/O Subsystem 4 power cable, primary (DCA2-P00) 11P4514 1 I/O Subsystem 4 power cable, secondary rack (DCA1-P01) 11P4515 1 I/O Subsystem 4 power cable, secondary rack (DCA1-P00) 11P4516 1 I/O Subsystem 4 power cable, secondary rack (DCA2-P01) 11P4517 1 I/O Subsystem 4 power cable, secondary rack (DCA2-P00) 11P4506 1 I/O Subsystem 5 power cable (DCA1-P01) 11P4507 1 I/O Subsystem 5 power cable (DCA1-P00) 11P4508 1 I/O Subsystem 5 power cable (DCA2-P01) 11P4509 1 I/O Subsystem 5 power cable (DCA2-P00) 11P4510 1 I/O Subsystem 6 power cable (DCA1-P01) 11P4511 1 I/O Subsystem 6 power cable (DCA1-P00) 11P4512 1 I/O Subsystem 6 power cable (DCA2-P01) 11P4513 1 I/O Subsystem 6 power cable (DCA2-P00) 44P0190 1 I/O Subsystem 7 power cable (DCA1-P01) 44P0191 1 I/O Subsystem 7 power cable (DCA1-P00) 44P0192 1 I/O Subsystem 7 power cable (DCA2-P01) 44P0193 1 I/O Subsystem 7 power cable (DCA2-P00) 44P0194 1 I/O Subsystem 8 power cable (DCA1-P01) 44P0195 1 I/O Subsystem 8 power cable (DCA1-P00) 44P0196 1 I/O Subsystem 8 power cable (DCA2-P01) Chapter 10. Parts Information 1029 Index Number FRU Part Number Units Per Assembly Description 44P0197 1 I/O Subsystem 8 power cable (DCA2-P00) 1030 406/90 Service Guide Keyboards and Mouse Index Number FRU Part Number Units Per Assembly Description 1 24P0440 1 Keyboard, 101 US English (ID 103P) USB 24P0450 1 Keyboard, 101 French (ID 189) USB 24P0458 1 Keyboard, 101 Italian (ID 142) USB 24P0453 1 Keyboard, 101 German/Austrian (ID 129) USB 24P0476 1 Keyboard, 101 UK English (ID 166) USB 24P0470 1 Keyboard, 101 Spanish (ID 172) USB 24P0459 1 Keyboard, 101 Japanese (ID 194) USB 24P0444 1 Keyboard, 101 Brazilian Portuguese (ID 275) USB 24P0431 1 Keyboard, 101 Swedish, Finnish (ID 153) USB 24P0471 1 Keyboard, 101 LA Spanish (ID 171) USB 1 Keyboard, 101 Korean (ID 413) USB 24P0446 1 Keyboard, 101 Chinese (ID 467) USB 24P0466 1 Keyboard, 101 Russian (ID 443) USB 2 09N5532 1 Three Button Mouse (USB) 8841 Chapter 10. Parts Information 1031 System Power Cables Index Number FRU Part Number Units Per Assembly Description 1 11P0916 2 Line Power Cord, US, Canada, Japan, 30 AMP 480 V ac 1 11P0367 2 Line Power Cord, US, Canada, Japan, 60 AMP 240 V ac 1 11P0914 2 Line Power Cord, US, Chicago, 30 AMP 480 V ac 1 11P0365 2 Line Power Cord, US, Chicago, 60 AMP 240 V ac 1 11P0918 2 World Trade Power Cord, 4.27m (14 ft.), 8AWG, high voltage 1032 406/90 Service Guide Tools The following tools are needed to perform service actions on the 406/90. These tools can be ordered through your normal tools supplier. Index Part Number Units Per Assembly Description 1 46H9635 1 Tool box 2 11P4486 1 Torque tool MCM 3 04N6923 1 Sash brush 4 11P4747 1 Magnifying glass 5 11P4654 1 L3 cache tray 6 44P0549 1 I/O backplane insertion/removal tool 7 6422789 1 DCA removal tool 8 31L8313 1 Leveling tool (for rack) 9 47H0182 1 Torx, T10 (this tool might not be available in the shipped tool kit. A T10 tip used with a replacable driver handle can be substituted) 10 11P4861 1 Blank cassette 11 11P3990 1 Door key 12 44P0182 1 DASD service tool 13 11P4914 1 MCM/L3/Pass-through plug-location map (tri-fold chart) should be installed in the primary rack 14 11P4055 NA Plug count adhesive replacement label 15 44P0209 1 Test card asm, MCM/L3 Short-To-GND 11P473211P473411P4735 1 Cabe asmCabe asmCabe asm 16 00P7023 NA Vacuum cleaner (110 Vac) 36P243536P243636P2437 NA Vacuum cleaner (220 Vac)OMEGA FILTER 0.3 MICRON CORD OMEGA UK PLUG 17 07H6824 12 Bracket, insert 18 07H6823 2 Bracket, 11 x 6 insert 19 11P1262 4 Bracket, asm w/ foam 20 07H6827 4 Blank filler plate - 3 21 07H6826 3 Blank filler plate - 1 22 54G2882 16 Screws 23 31L7174 1 Hook-and-loop fastener (roll) 24 08J5557 1 Wheel chocks 25 44P0144 Tailgate assembly 26 44P4107 1 Shorts Tester 27 00P5758 1 PCI short adapter Chapter 10. Parts Information 1033 Fiber Optic Connector Cleaning Tool Kit To clean fiber optic connectors in the field a fiber optic cleaning procedure has been developed. Use order number SY27-2604 to order the fiber optic cleaning procedure. Along with the cleaning procedure you should order the following: v Fiber Optic Cleaning Kit 46G6844 v Extra Alcohol Saturated Pads 9900679 The kits can be ordered through Mechanicsburg. The kit contains some parts which can be replenished by ordering direct from Texwipe, which are the lint-free cloth pads (Texwipe p/n TX 304) and microswaps (TX 732). 1034 406/90 Service Guide Chapter 11. SRNs for Linux The x in the following group of SRNs is encoded as follows: 8 4 2 1 | | | | | | | Replace all FRUs listed | | Hot swap supported | Software might be the cause Reserved Note: Some SRNs in this chapter may have 4 rather than 3 digits after the dash (-). Use the physical location codes and FRU numbers listed on the diagnostics Problem Report Screen.Failing Function Codes (FFCs) are not used in this group of SRNs. For more detailed information refer to “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Service Request Number Description and Action 651-88x The CEC or SPCN reported an error. Action: Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support. 651-89x The CEC or SPCN reported an error. Action: Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support. 652-88x The CEC or SPCN reported a non-critical error. Action: Schedule deferred maintenance. Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support. 652-89x The CEC or SPCN reported a non-critical error. Action: Schedule deferred maintenance. Refer to the entry MAP in this system unit’s system service guide, with the 8 digit error and location codes, for the necessary repair action. If the 8 digit error and location codes were NOT reported, then call support. A00-FF0 Description: Error log analysis is unable to determine the error. The error log indicates the following physical FRU locations as the probable causes. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-00x Description: Error log analysis indicates an error detected by the CPU, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-01x Description: CPU internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-02x Description: CPU internal cache or cache controller error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-03x Description: External cache parity or multi-bit ECC error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-04x Description: External cache ECC single-bit error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-05x Description: System bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-06x Description: Time-out error waiting for I/O. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1035 Service Request Number Description and Action A01-07x Description: System bus parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A01-08x Description: System bus protocol/transfer error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-00x Description: Error log analysis indicates an error detected by the memory controller, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-01x Description: Uncorrectable Memory Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-02x Description: ECC correctable error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-03x Description: Correctable error threshold exceeded. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-04x Description: Memory Control subsystem internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-05x Description: Memory Address Error (invalid address or access attempt). Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-06x Description: Memory Data error (Bad data going to memory). Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-07x Description: Memory bus/switch internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-08x Description: Memory time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-09x Description: System bus parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-10x Description: System bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-11x Description: System bus protocol/transfer error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-12x Description: I/O Host Bridge time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-13x Description: I/O Host Bridge address/data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-15x Description: System support function error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A02-16x Description: System bus internal hardware/switch error. A03-00x Description: Error log analysis indicates an error detected by the I/O device, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-01x Description: I/O Bus Address parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-02x Description: I/O Bus Data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1036 406/90 Service Guide Service Request Number Description and Action A03-03x Description: I/O bridge/device time-out, access or other error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-04x Description: I/O bridge/device internal error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-05x Description: I/O Error on non-PCI bus. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-06x Description: Mezzanine bus address parity error. A03-07x Description: System bus address parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-08x Description: Mezzanine bus data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-09x Description: System bus data parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-10x Description: Mezzanine bus time-out, transfer or protocol error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-11x Description: System bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-12x Description: Error on System bus. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-13x Description: I/O Expansion bus parity error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-14x Description: I/O Expansion bus time-out error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-15x Description: I/O Expansion bus connection failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A03-16x Description: I/O Expansion unit not in an operating state. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-00x Description: Error log analysis indicates an environmental and power warning, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-01x Description: Sensor indicates a fan has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-02x Description: System shutdown due to a fan failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-03x Description: Sensor indicates a voltage outside normal range. Action: Use “MAP 1520: Power” on page 192. A05-04x Description: System shutdown due to voltage outside normal range. Action: Use “MAP 1520: Power” on page 192. Chapter 11. SRNs for Linux 1037 Service Request Number Description and Action A05-05x Description: Sensor indicates an abnormally high internal temperature. Action: Verify that: 1. The room ambient temperature is within the system operating environment. 2. There is unrestricted air flow around the system. 3. All system covers are closed. 4. There are no fan failures.If none of these problems exist, then proceed with “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-06x Description: System shutdown due to abnormally high internal temperature. Action: Verify that: 1. The room ambient temperature is within the system operating environment. 2. There is unrestricted air flow around the system. 3. All system covers are closed. 4. There are no fan failures.If none of these problems exist, then proceed with “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-07x Description: Sensor indicates a power supply has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-08x Description: System shutdown due to power supply failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-09x Description: Sensor detected a FRU that has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-10x Description: System shutdown due to FRU that has failed. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-14x Description: System shutdown due to power fault with an unspecified cause. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-16x Description: System shutdown due to internal power supply failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-19x Description: System shutdown due to Fan failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-21x Description: System shutdown due to Over temperature condition. Action: Verify the following: v The room ambient temperature is within the system operating environment. v There is unrestricted air flow around the system. v All system covers are closed.If all conditions are met, then use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-22x Description: System shutdown due to over temperature and fan failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A05-24x Description: Power Fault specifically due to internal battery failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-00x Description: Error log analysis indicates an error detected by the Service Processor, but the failure could not be isolated. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1038 406/90 Service Guide Service Request Number Description and Action A0D-01x Note: Ensure that the system IPLROS and Service Processor are at the latest firmware level before removing any parts from the system. Description: Time-out communication response from Service Processor. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-02x Description: I/O (I2C) general bus error. Note: If you are servicing a 9076/Nxx system, go to the Service Processor Error Log (SVP). Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-03x Description: Secondary I/O (I2C) general bus error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-04x Description: Internal Service Processor memory error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-05x Description: Service Processor error accessing special registers. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-06x Description: Service Processor reports unknown communication error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-07x Description: Internal service processor firmware error or incorrect version. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-08x Description: Other internal Service Processor hardware error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-09x Description: Service Processor error accessing Vital Product Data EEPROM. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-10x Description: Service Processor error accessing Operator Panel. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-11x Description: Service Processor error accessing Power Controller. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-12x Description: Service Processor error accessing Fan Sensor. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-13x Description: Service Processor error accessing Thermal Sensor. Note: If you are servicing a 9076/Nxx system, go to the Service Processor Error Log (SVP).Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-14x Description: Service Processor error accessing Voltage Sensor. Note: If you are servicing a 9076/Nxx system, go to the Service Processor Error Log (SVP).Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-17x Description: Service Processor error accessing serial port. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-18x Description: Service Processor detected NVRAM error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-19x Description: Service Processor error accessing Real Time Clock/Time-of-Day Clock. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-20x Description: Service Processor error accessing scan controller/hardware. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-21x Description: Service Processor detect error with Time-of-Day Clock backup battery. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-23x Description: Loss of heart beat from Service Processor. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter 11. SRNs for Linux 1039 Service Request Number Description and Action A0D-24x Description: Service Processor detected a surveillance time-out. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-25x Description: Power Control Network general connection failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-26x Description: Power Control Network node failure. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-29x Description: Service Processor error accessing Power Control Network. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-30x Description: Non-supported hardware. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-31x Description: Error detected while handling an attention/interrupt from the system hardware. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-33x Description: Array of Logic Built in Self Test Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-34x Description: Wire Test Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-35x Description: Mainstore or Cache IPL Diagnostic Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-36x Description: Other IPL Diagnostic Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-37x Description: Clock or PLL Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-38x Description: Hardware Scan or Initialization Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-39x Description: Chip ID Verification Error. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A0D-40x Description: FRU Presence/Detect Error (Mis-Plugged). Action: Use“MAP 1230: Linux Platform Error Problem Resolution” on page 163. A10-100 Description: The resource is unavailable due to an error. System is operating in degraded mode. Action: Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A10-200 Description: The resource was marked failed by the platform. The system is operating in degraded mode. Action: Schedule maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A10-210 Description: The processor has been deconfigured. The system is operating in degraded mode. Action: Schedule maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-00x Description: A non-critical error has been detected. Error log analysis indicates an error detected by the CPU, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-01x Description: A non-critical error has been detected, a CPU internal error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-02x Description: A non-critical error has been detected, a CPU internal cache or cache controller error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1040 406/90 Service Guide Service Request Number Description and Action A11-03x Description: A non-critical error has been detected, an external cache parity or multi-bit ECC error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-04x Description: A non-critical error has been detected, an external cache ECC single-bit error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-05x Description: A non-critical error has been detected, a system bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-06x Description: A non-critical error has been detected, a time-out error waiting for an I/O device. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-07x Description: A non-critical error has been detected, a system bus parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-08x Description: A non-critical error has been detected, a system bus protocol/transfer error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-50x Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-510 Description: Resource has been deconfigured and is no longer in use due to a trend toward an unrecoverable error. Action: The system is operating in a degraded mode. Schedule maintenance. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-520 Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: Try removing the processes from the failing processor and retry the deconfiguration, use the ha_star -C command. An alternative is reboot, and the processor will be deconfigured. Then the system can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-530 Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Run-time processor deconfiguration can be enabled to deconfigure the processor. Action: To enable run-time processor deconfiguration, use the chdev -a cpuguard=enable -l sys0 command. Then to retry the deconfiguration, use the ha_star -C command. An alternative is reboot, and the processor will be deconfigured. Then the system can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-540 Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter 11. SRNs for Linux 1041 Service Request Number Description and Action A11-550 Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A11-560 Description: Recoverable errors on a resource indicate a potential for unrecoverable errors. The resource cannot be deconfigured and is still in use. The problem may be corrected by array bit steering. Use “MAP 1235: System Array Self-Repair Problem Resolution for Linux” on page 168. A11-570 Description: A processor has been deconfigured due to a trend toward an unrecoverable error. The system is operating in a degraded mode. The problem may be corrected by array bit steering. Use “MAP 1235: System Array Self-Repair Problem Resolution for Linux” on page 168. A11-580 Description: Recoverable errors on a processor indicate a potential for an unrecoverable error. The resource cannot be deconfigured and is still in use. The problem may be corrected by array bit steering. Use “MAP 1235: System Array Self-Repair Problem Resolution for Linux” on page 168. A12-00x Description: A non-critical error has been detected. Error log analysis indicates an error detected by the memory controller, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-01x Description: A non-critical error has been detected, an uncorrectable memory error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-02x Description: A non-critical error has been detected, an ECC correctable error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-03x Description: A non-critical error has been detected, a correctable error threshold exceeded. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-04x Description: A non-critical error has been detected, a memory control subsystem internal error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-05x Description: A non-critical error has been detected, a memory address error (invalid address or access attempt). Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-06x Description: A non-critical error has been detected, a memory data error (bad data going to memory). Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-07x Description: A non-critical error has been detected, a memory bus/switch internal error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-08x Description: A non-critical error has been detected, a memory time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-09x Description: A non-critical error has been detected, a system bus parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1042 406/90 Service Guide Service Request Number Description and Action A12-10x Description: A non-critical error has been detected, a system bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-11x Description: A non-critical error has been detected, a system bus protocol/transfer error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-12x Description: A non-critical error has been detected, an I/O host bridge time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-13x Description: A non-critical error has been detected, a I/O host bridge address/data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-15x Description: A non-critical error has been detected, a system support function error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-16x Description: A non-critical error has been detected, a system bus internal hardware/switch error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A12-50x Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-00x Description: A non-critical error has been detected, a error log analysis indicates an error detected by the I/O device, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-01x Description: A non-critical error has been detected, an I/O bus address parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-02x Description: A non-critical error has been detected, an I/O bus data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-03x Description: A non-critical error has been detected, an I/O bus time-out, access or other error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-04x Description: A non-critical error has been detected, an I/O bridge/device internal error. Action: Schedule deferred maintenance. Use v. A13-05x Description: A non-critical error has been detected, an I/O error on non-PCI bus. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-06x Description: A non-critical error has been detected, a mezzanine bus address parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-07x Description: A non-critical error has been detected, a system bus address parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-08x Description: A non-critical error has been detected, a mezzanine bus data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter 11. SRNs for Linux 1043 Service Request Number Description and Action A13-09x Description: A non-critical error has been detected, a system bus data parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-10x Description: A non-critical error has been detected, a mezzanine bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-11x Description: A non-critical error has been detected, a system bus time-out error Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-12x Description: A non-critical error has been detected, an error on system bus. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-13x Description: A non-critical error has been detected, an I/O expansion bus parity error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-14x Description: A non-critical error has been detected, an I/O expansion bus time-out error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-15x Description: A non-critical error has been detected, an I/O expansion bus connection failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-16x Description: A non-critical error has been detected, an I/O expansion unit not in an operating state. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A13-50x Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-01x Description: Sensor indicates a fan is turning too slowly. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-03x Description: Sensor indicates a voltage outside normal range. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-05x Description: Sensor indicates an abnormally high internal temperature. Action: Verify that: 1. The room ambient temperature is within the system operating environment. 2. There is unrestricted air flow around the system. 3. All system covers are closed. 4. There are no fan failures.If none of these problems exist, then proceed with “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-07x Description: Sensor indicates a power supply has failed. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-09x Description: Sensor indicates a FRU has failed. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-11x Description: Sensor detected a redundant fan failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1044 406/90 Service Guide Service Request Number Description and Action A15-12x Description: Sensor detected redundant power supply failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-13x Description: Sensor detected a redundant FRU that has failed. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-14x Description: Power fault due to unspecified cause. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-16x Description: Internal power supply failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-17x Description: Internal redundant power supply failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-19x Description: Fan failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-20x Description: Non-critical cooling problem, loss of redundant fan. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-21x Description: Over temperature condition. Action: Verify the following: v The room ambient temperature is within the system operating environment. v There is unrestricted air flow around the system. v All system covers are closed.If all conditions are met, then use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-22x Description: Fan failure and Over temperature condition. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-23x Description: Non-critical power problem, loss of redundant power supply. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-24x Description: Power Fault specifically due to internal battery failure. Action: Schedule deferred maintainence. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A15-50x Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-00x Description: A non-critical error has been detected. Error log analysis indicates an error detected by the Service Processor, but the failure could not be isolated. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-01x Note: Ensure that the system IPLROS and Service Processor are at the latest firmware level before removing any parts from the system. Description: A non-critical error has been detected. Time Out communication response from Service Processor Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-02x Description: A non-critical error has been detected, an I/O (I2C) general bus error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-03x Description: A non-critical error has been detected, a secondary I/O (I2C) general bus error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter 11. SRNs for Linux 1045 Service Request Number Description and Action A1D-04x Description: A non-critical error has been detected, an internal service processor memory error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-05x Description: A non-critical error has been detected, a service processor error accessing special registers. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-06x Description: A non-critical error has been detected, a service processor reports unknown communication error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-07x Description: A non-critical error has been detected,: Internal service processor firmware error or incorrect version. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-08x Description: A non-critical error has been detected, an other internal service processor hardware error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-09x Description: A non-critical error has been detected, a service processor error accessing vital product data EEPROM. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-10x Description: A non-critical error has been detected, a service processor error accessing operator panel. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-11x Description: A non-critical error has been detected, a service processor error accessing power controller. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-12x Description: A non-critical error has been detected, a service processor error accessing fan sensor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-13x Description: A non-critical error has been detected, a service processor error accessing a thermal sensor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-14x Description: A non-critical error has been detected, a service processor error accessing voltage sensor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-17x Description: A non-critical error has been detected, a service processor error accessing serial port. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-18x Description: A non-critical error has been detected, a service processor detected NVRAM error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-19x Description: A non-critical error has been detected, a service processor error accessing real time clock/time-of-day clock. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-20x Description: A non-critical error has been detected: Service processor error accessing scan controller/hardware. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-21x Description: A non-critical error has been detected, a service processor detected error with time-of-day clock backup battery. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1046 406/90 Service Guide Service Request Number Description and Action A1D-23x Description: A non-critical error has been detected: Loss of heart beat from Service Processor. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-24x Description: A non-critical error has been detected, a service processor detected a surveillance time-out. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-25x Description: A non-critical error has been detected, a power control network general connection failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-26x Description: A non-critical error has been detected, a power control network node failure. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-29x Description: A non-critical error has been detected, a service process error accessing power control network. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-30x Description: A non-critical error has been detected: Non-supported hardware. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-31x Description: A non-critical error has been detected: Error detected while handling an attention/interrupt from the system hardware. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-33x Description: A non-critical error has been detected: Array of Logic Built in Self Test Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-34x Description: A non-critical error has been detected: Wire Test Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-35x Description: A non-critical error has been detected: Mainstore or Cache IPL Diagnostic Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-36x Description: A non-critical error has been detected: Other IPL Diagnostic Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-37x Description: A non-critical error has been detected: Clock or PLL Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-38x Description: A non-critical error has been detected: Hardware Scan or Initialization Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-39x Description: A non-critical error has been detected: Chip ID Verification Error. Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. A1D-40x Description: A non-critical error has been detected: Presence/Detect Error (Mis-Plugged). Action: Schedule deferred maintenance. Use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. Chapter 11. SRNs for Linux 1047 Service Request Number Description and Action A1D-50x Description: Recoverable errors on resource indicate a trend toward an unrecoverable error. However, the resource could not be deconfigured and is still in use. The system is operating with the potential for an unrecoverable error. Action: If repair is not immediately available, reboot and the resource will be deconfigured. Then operations can continue in a degraded mode. To repair use “MAP 1230: Linux Platform Error Problem Resolution” on page 163. 1048 406/90 Service Guide Appendix A. Environmental Notices Product Recycling and Disposal Components of the system unit, such as structural parts and circuit boards, can be recycled where recycling facilities exist. Companies are available to disassemble, reutilize, recycle, or dispose of electronic products. Contact your account representative for more information. This system unit contains parts such as circuit boards, cables, electromagnetic compatibility gaskets and connectors which may contain lead and copper/beryllium alloys that require special handling and disposal at end of life. Before this unit is disposed, these materials must be removed and recycled or discarded according to applicable regulations. This book contains specific information on each battery type where applicable. This product may contain a sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or lithium ion battery. Consult your user manual or service manual for specific battery information. The battery must be recycled or disposed of properly. Recycling facilities may not be available in your area. For information on disposal of batteries, contact your local waste disposal facility. 1049 Detteproduktkanindeholdeetforsegletbatteri,derindeholderbly,nikkel-kadmium,nikkel-metal-hydrid,litiumellerlitium-ion.Dererflereoplysningerombatterietibruger-ellerservicevejledningen.Batterietmåikkekasseressammenmeddetalmindeligeaffald.Batterietskalkasseresihenholdtildelokalebestemmelser.Spørgeventueltkommunenstekniskeforvaltning(Miljøafdelingen).Ditproductbevatmogelijkeenafgeslotenbatterijvanhettypelood/zuur,nikkel/cadmium,nikkel/metaalhydride,lithium,oflithiumionen.Raadpleeghethandboekvoordegebruikerofhetservicehandboekvoorinformatieoverdebatterij.Debatterijmoetcorrectwordengerecycledofweggegooid.MogelijkzijnerinuwregiogeenKCA-inzamelingspunten.Meerinformatieoverhetverwerkenvangebruiktebatterijenkuntuverkrijgenbijuwafvalverwerkingsbedrijf.DiesesProduktenthälteineauslaufsichereBlei-,Nickel-Cadmium-,Nickel-metallhydrid-,Lithium-oderLithiumionenbatterie.SpezifischeInformationenzurBatterieentnehmenSiebittedemBenutzer-oderServicehandbuch.DieBatteriemusswiederverwertetoderalsSondermüllentsorgtwerden.Esistmöglich,dassesinIhrerNähekeineWiederverwertungsanlagegibt.WeitereInformationenzurEntsorgungvonBatterienerhaltenSievonIhremörtlichenMüllentsorgungsunternehmen. 1050 406/90 Service Guide Questoprodottopotrebbecontenereunabatteriaalpiombo,alnichel-cadmio,all’idrurodinichel,allitiooagliionidilitio.Consultareilmanualeutenteoilmanualediassistenzaperinformazionispecifichesullabatteria.Labatteriadeveessereappropriatamentericiclataosmaltita.Ilservizioperilriciclaggiopotrebberononesseredisponibilenellavostrazona.Perinformazionisullosmaltimentodellebatterie,contattarel’aziendaprepostaperlosmaltimentodeirifiuti. Appendix A. Environmental Notices 1051 Produktenkaninnehållaettförseglatbatteriavnågonavtypernablyackumulator,nickel-kadmium,nickel-metallhydrid,litiumellerlitiumjon.DetaljeradbatteriinformationfinnsIanvändar-ellerservicehandboken.Batterietmåstelämnastillåtervinningellerkastaspåettmiljösäkertsätt.Detkanskeintefinnsnågraåtervinningsmöjligheterdärdubor.Närdetgälleromhändertagandeavbatterierberviattfåhänvisatillmyndigheternasanvisningarirespektiveland. 1052 406/90 Service Guide Appendix A. Environmental Notices 1053 Acoustical Noise Emissions The equivalent continuous A-weighted sound pressure level at workstations (emission sound pressure level at the 1-meter bystander positions) does not exceed 70 dB(A). Der Geräuschpegel der Einheit ist kleiner oder gleich 70 db(A). Declared Acoustical Noise Emissions Acoustical Characteristic Product Configuration Declared A-Weighted Sound Power Level, LWAd (B) Declared A-Weighted Sound Pressure Level, LpAm (dB) Operating Idle Operating Idle Processor Subsystem 6.15 6.15 446 446 Notes: 1. Noise levels cited are for the typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media drawer, and two I/O drawers). 2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4 reduction. That is, the noise level of a single A-Frame with Slimline covers is about the same as the noise level of four A-Frames with acoustical covers. 3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the 1-meter bystander positions;1 B =10 dB. 4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296. 1054 406/90 Service Guide Appendix B. Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer’s representative for information on the products and services currently available in your area. Any reference to the manufacturer’s product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user’s responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any country where such provisions are inconsistent with local law: THIS MANUAL IS PROVIDED ?AS IS? WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions; therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Information concerning products made by other than the manufacturer was obtained from the suppliers of those products, their published announcements, or other publicly available sources. The manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products made by other than the manufacturer. Questions on the capabilities of products made by other than the manufacturer should be addressed to the suppliers of those products. 1055 1056 406/90 Service Guide Appendix C. Service Processor Setup and Test Attention: This procedure applies to modems attached to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). It does not affect the operation of the modem attached to the Hardware Management Console (HMC). The call-out function is normally handled by the Service Focal Point application running on the HMC. Note: The call-out and surveillance options are disabled in partitioned systems. For your convenience, an example of a basic service processor setup checklist is included here. Your setup may include more or fewer of the available features, so you can adjust this checklist for your own application. Service Processor Setup Checklist 1. Shut down the managed system, and wait for the OK in the physical operator panel. 2. Open a virtual terminal window on the HMC. 3. Start the service processor menus. 4. Set the system name. 5. Enable Surveillance. 6. Configure call-in/call-out. 7. Attach modems if necessary. If modem attachment is not necessary, proceed to step 8. To attach a modem, do the following: a. Exit the service processor menus. b. Disable the 350 V dc outputs from the Bulk Power Assembly (BPA) by placing the UEPO switch in the off position. Attention: With the UEPO off, power is still present within the BPA. c. Attach the cables and modems. d. Place the UEPO switch in the on position.8. Test both of the following: v Call-In, go to “Testing Call-In” on page 1058 v Call-Out, go to “Testing Call-Out” on page 1058 Your service processor is now ready for use. 1057 Testing the Service Processor Setup This section contains sample testing procedures to help ensure your service processor setup is working. These tests include communicating with the server operating system. Before you start, ensure that the necessary serial port(s) is configured. If you need assistance, refer to “Serial Port Configuration” on page 1059. The server should be powered off as a result of the “Service Processor Setup Checklist” on page 1057. Testing Call-In 1. At your remote terminal, call in to your server. Your server answers and offers you the service processor Main Menu after requesting your privileged-access password. 2. Select System Power Control. 3. Select Power-On System. When you are asked if you wish to continue powering on the system, type Y. 4. After the system firmware and operating system have initialized the server, the login prompt displays at your remote terminal if you set up seamless modem transfer. This may take several minutes. When the login prompt displays, you have successfully called the service processor. 5. Type logout to disconnect from the operating system. The message No Carrier displays on your remote terminal. 6. Call your server again. The operating system answers and offers you the login prompt. If these tests are successful, call-in is working. 7. Log in and type shutdown -F to shut down your server. 8. The message No Carrier displays on your remote terminal.Testing Call-Out During the service processor setup, you entered your phone numbers for the pager and customer voice. These numbers are used for this test. 1. Your remote terminal is disconnected as a result of the Call-In test. 2. Call your server again. 3. At the service processor Main Menu, select Call-In/Call-Out Setup menu, then select Call-Out test. This action causes a simulated error condition for the purposes of this test. 4. After a few moments, a message displays, regarding an illegal entry. Press Enter to clear the message and return to the main menu. 5. When your telephone rings, answer the call. You should hear the sound of a telephone being dialed. This is your computer trying to page you.If this test is successful, call-out is working correctly. 1058 406/90 Service Guide Serial Port Configuration To configure the serial port on an AIX system, do the following from an AIX console: 1. Log in as root user. 2. To determine if you have any serial ports already configured, type: lsdev -Cc tty If no serial ports are configured, none are listed. If you want to configure serial ports that are not listed, continue with the remaining steps. 3. Identify the serial port(s) with the modem(s). 4. Type smit tty. 5. Select add tty. 6. Select RS232. 7. Select Baud rate 9600 or higher. 8. Select login enable and set the flow control to RTS. 9. Commit the selections and set up any other needed serial ports. 10. Exit SMIT. Appendix C. Service Processor Setup and Test 1059 1060 406/90 Service Guide Appendix D. Modem Configurations Note: This appendix applies only to modems attached to the serial ports (S1 and S2) on the primary I/O book (location U1.18-P1-H2). These configuration files do not apply to the modem attached to the Hardware Management Console (HMC). Sample Modem Configuration Files This appendix contains information about several sample modem configuration files that either work directly with your modem or provide a good starting point for a custom setup. The sample modem configuration files are located in the firmware in /usr/share/modems subdirectory (if your server is using AIX) with the following names. A listing of each specific file is included in this appendix. Generic Modem Configuration Files AIX File Name Service Processor Firmware File Name modem_z.cfg modem_z_sp modem_z0.cfg modem_z0_sp modem_f.cfg modem_f_sp modem_f0.cfg modem_f0_sp modem_f1.cfg modem_f1_sp Specific Modem Configuration Files AIX File Name Service Processor Firmware File Name modem_m0.cfg modem_m0_sp modem_m1.cfg modem_m1_sp Use the following selection procedures and your modem manual to determine which of the configuration files is suitable for your use. 1061 Configuration File Selection Use the following steps to select a configuration file: 1. Is your modem an IBM 7852-400? If Yes, use modem configuration file modem_m0.cfg and go to step 7. Note: The IBM 7852-400 modem has DIP switches on the right side of the unit. See “IBM 7852-400 DIP Switch Settings” on page 1064 for the correct switch settings. If No, continue with step 2. 2. Is your modem an IBM 7857-017? If Yes, use modem configuration file modem_m1.cfg and go to step 7. Note: The IBM 7857-017 modem has two telephone line connections on the back of the unit. One is marked LL (for Leased Line), and the other is marked PTSN (for Public Telephone Switched Network). The service processor expects to use the modem on the public network, so the telephone line should attach to the PTSN connector. If No, continue with step 3. 3. Does your modem respond to the extended command set (prefixed with &)? If Yes, go to step 5. If No, continue with step 4. 4. Does your modem respond to: v ATZ reset command or v ATZn reset commands, where n can be 0, 1, and so on? If ATZ, configuration file modem_z.cfg is recommended. If ATZn, configuration file modem_z0.cfg is recommended.Go to step 7. 5. Does your modem command set include a test for V.42 error correction at the remote modem (often called “Auto-Reliable Mode”)? If Yes, disable this test. You can use sample configuration files /usr/share/modem_m0.cfg or /usr/share/modem_m1.cfg as models to help you create a file for your particular modem. See “Customizing the Modem Configuration Files” on page 1064 for more information. Go to step 7. If No, go to step 6. 6. Does your modem respond to: v AT&F reset command or v AT&Fn reset commands, where n can be 0, 1, and so on?If AT&F, configuration file modem_f.cfg is recommended. If AT&Fn, configuration file modem_f0.cfg or modem_f1.cfg is recommended, depending on which provides the hardware flow control profile. 7. You have completed selection of the configuration file. If your modem configuration selection is not available in the Service Processor Modem Configuration Menu, you must access it through the Configure Remote Maintenance Policy Service Aid. If you find it necessary to adjust any of these configuration files, use the manual provided with your modem to accomplish that task. It is recommended you select settings that enable hardware flow control and respond to DTR. Note: Some older modems do not respond to the X0 or &R1 commands. Edit out these commands from the modem configuration file if yours is such a modem. See your modem manual for more information. 1062 406/90 Service Guide Some modems, such as the IBM 7857-017, are not designed for the paging function. Although they can be used for paging, they return an error message when they do not get the expected response from another modem. Therefore, even though the paging was successful, the error message causes the service processor to retry, continuing to place pager calls for the number of retries specified in the Call-Out Policy Setup Menu. These retries result in redundant pages. Examples for Using the Generic Sample Modem Configuration Files The following table contains information to help you determine which modem configuration file to use with various modems. Modem Setup Z Setup Z0 (Rare) Setup F Setup F0 Setup F1 AT&T DataPort 2001 (Ring interrupt only on first ring) X Bocamodem 1440E X Hayes Smart Modem 300 X IBM 5841 X IBM 5843 X IBM 7851 X IBM 7852-10 X IBM 7855 X USRobotics 36.6K Sportster X Zoom V.32 X Appendix D. Modem Configurations 1063 Customizing the Modem Configuration Files You can create your own modem configuration files or modify the samples provided. After you customize your modem configuration files, you must access them through the Configure Remote Maintenance Policy Service Aid rather than from the service processor menus. If you have already set up your serial ports, line speeds, authorizations, and telephone numbers from the service processor menus, use the service aid to specify your customized modem configuration files. If you have not already set up your serial ports, line speeds, authorizations, and telephone numbers from the service processor menus, use the service aids to set them while you specify your customized modem configuration files. To disable Auto-Reliable Mode testing of the remote modem, use the sample modem configuration file /usr/share/modems/modem_f.cfg as a model that you can modify, as follows: 1. Find the necessary command in your modem manual. 2. Copy the /usr/share/modems/modem_f.cfg file to a new file with a different name (for example, modem_fx.cfg). 3. In the new file (modem_fx.cfg), change the line Send "ATE0T\r" to Send "ATcccE0T\r" where ccc is the added command as specified in your modem manual, as follows. Change the third line of each of the following stanzas: v condout v condin v ripo4. Save the changes.IBM 7852-400 DIP Switch Settings If you are using a 7852-400 modem to enable service processor communications, for proper operation, the dual inline package (DIP) switches must be set according to the following table: Switch Position Function 1 Up Force DTR 2 Up Flow Control &E4 3 Down Result Codes Enabled 4 Down Modem Emulation Disabled 5 Up Auto Answer Enabled 6 Up Maximum Throughput Enabled 7 Up RTS Normal Functions 8 Down Enable Command Mode 9 Down Remote Digital Loopback Test Enabled 10 Up Dial-Up Line Enabled 11 *Up AT Responses Enabled (Extended Responses Disabled) 12 *Down Asynchronous Operation 13 UP 28.8KB Line Speed 14 Up 15 Up CD and DSR Normal Functions 16 Up 2-Wire Leased Line Enabled * Only switches 11 and 12 are changed from the factory default settings. 1064 406/90 Service Guide Xon/Xoff Modems Some early modems assume software flow control (Xon/Xoff) between the computer and the modem. Modems with this design send extra characters during and after the transmitted data. The service processor cannot accept these extra characters. If your configuration includes such a modem, your functional results may be unpredictable. The sample modem configuration files included in this appendix do not support these modems, so custom configuration files are necessary. Anchor Automation 2400E is an example of such a modem. If you experience unexplainable performance problems that may be due to Xon/Xoff characters, it is recommended that you upgrade your modem. Ring Detection Most modems produce an interrupt request each time they detect a ring signal. Some modems generate an interrupt only on the first ring signal that they receive. AT&T DataPort 2001 is an example of such a modem. The service processor uses the ring interrupt request to count the number of rings when Ring Indicate Power-On (RIPO) is enabled. If your modem produces an interrupt on only the first ring, set Ring Indicate Power-On to start on the first ring. Otherwise, you can choose to start Ring Indicate Power-On on any ring count. Terminal Emulators The service processor is compatible with simple ASCII terminals, and therefore compatible with most emulators. When a remote session is handed off from the service processor to the operating system, agreement between terminal emulators becomes important. The server’s operating system will have some built-in terminal emulators. You may also have a commercially available terminal emulation. It is important that the local and host computers select the same or compatible terminal emulators so that the key assignments and responses match, ensuring successful communications and control. For best formatting, choose line wrap in your terminal emulator setup. Recovery Procedures Situations such as line noises and power surges can sometimes cause your modem to enter an undefined state. When it is being used for dial-in, dial-out or ring indicate power-on, your modem is initialized each time one of these actions is expected. If one of these environmental conditions occur after your modem has been initialized, it might be necessary to recover your modem to a known state. If your modem communicates correctly with remote users, it is probably in control. It may be wise to occasionally change some of the functional settings and then change them back, just for the sense of security that the modem is communicating, and to ensure it has been initialized recently. If your system is particularly difficult to access physically, another strategy is to protect it with an Uninterruptible Power Source (UPS) and a phone-line surge protector. In case recovery becomes necessary, shut down your system using established procedures. Disconnect the power cable and press the power button to drain capacitance while power is disconnected. Disconnect and reconnect modem power, and then reconnect system power to completely reinitialize your system. Appendix D. Modem Configurations 1065 Transfer of a Modem Session Because many modem command variations exist, the sample modem configuration files located at the end of this appendix have been written to capture the largest number of workable modem settings. The modem command &Dn (where n is a number) generally sets the modem response to the Data Terminal Ready (DTR) signal from the server’s serial port. The desired response is that the modem will hold a connection while DTR is enabled, and drop the connection when DTR is released. Using this mechanism, the server hangs up on a connection under normal conditions. Consult your modem manual for its specific response scheme for the &Dn command. Two strategies are available for dealing with the modem’s response to DTR: v Recovery v PreventionBefore you use one of these strategies, determine if your server’s modem is set up correctly to respond to DTR. With the remote terminal connected to serial port 1 and defined as the Primary Console Device, there are two tests you can perform: v Will the modem drop the connection after the System initialization complete message displays at the remote terminal? If Yes, the modem is set up correctly. If No, try another &Dn setting for your server’s modem. See your modem manual for this information. The &Dn command appears in three places each in three of the sample modem configuration files, as follows: v Will the server’s modem disconnect when the power drops? You can make this observation at the remote terminal by commanding your server to shut down and power off. (Use the AIX command shutdown -F.) Watch for the message NO CARRIER on your remote terminal. If Yes, this is the correct response. The modem is set up correctly. If No, try another &Dn setting for your server’s modem. See your model manual for this information. The &Dn command appears in three places each in three of the sample modem configuration files. Only the following sample modem configuration files contain the &Dn command (in three places each): – modem_f.cfg – modem_f0.cfg – modem_f1.cfgIf you are using modem_z.cfg or modem_z0.cfg, you cannot control DTR response. If your remote terminal does not disconnect after logging off, you must command the remote terminal emulator to hang up. This then breaks the connection. 1066 406/90 Service Guide Recovery Strategy The recovery strategy consists of making two calls to establish a remote session. This solution is the easiest to implement and allows more freedom for configuring your server’s serial ports. To set up a remote terminal session, dial into the service processor and start the system. After the operating system is loaded and initialized, the connection will be dropped. At this point, call the server back and the operating system will answer and offer you the login prompt. Prevention Strategy The disconnect is caused by the operating system when it initializes the Primary Console. The tests listed in “Transfer of a Modem Session” on page 1066 are conducted with the remote terminal selected as the primary console to manifest the modem’s response to DTR transitions. v If a local ASCII terminal or a graphics console is to be a permanent part of your server, then make one of them the primary console. Your remote terminal will no longer experience the connection loss. v If a local console is not a permanent part of your server, you can still assign either the unused graphics console or the unused serial port as the primary console. This gives you the desired seamless connection at your remote terminal.If you choose to use the unused serial port as the primary console, some initialization traffic will be sent to any serial device attached to that port. As a result, that serial device’s connection and function could be affected. These impacts may make that port unattractive for devices other than a temporary local ASCII terminal. Appendix D. Modem Configurations 1067 Modem Configuration Sample Files Sample File modem_m0.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP: modem_m0 # # FUNCTIONS: Modem configuration file specifically for IBM 7852-400 # modem with Auto-Reliable feature. This feature must be turned off # for Catcher calls. This example uses the AT&F reset command to # choose the factory defaults. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # The modem has configuration switches. They should be set to the # factory default settings, except switches 11 and 12. These must be # to UP ("AT" responses) and DOWN (Asynchronous operation), respectively. ICDelay 1 DefaultTO 10 CallDelay 120 # # %N Call-Out phone number %R Return phone number # # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F&E2E0T\r" # Reset to factory defaults # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "AT&SF1&S0S9=1\r" # DSR independent of CD # Force DSR on. # CD respond time=100ms expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=0\r" # Numeric response code # Auto-Answer off expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or 1068 406/90 Service Guide "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F&E2E0T\r" # Reset to factory defaults. # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "AT&SF1&S0S9=1\r" # DSR independent of CD. # Force DSR on. # CD respond time=100ms expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=2\r" # Numberic response code # Answer on 2nd ring expect "0\r" timeout 2 # Confirm commands successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pickup 2nd ring or timeout # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number expect "0\r" timeout 60 # Confirm successful command. delay 2 # Wait before hanging up. send "ATH0\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&F&E2E0T\r" # Reset to factory defaults. # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command. send "AT&SF1&S0S9=1\r" # DSR independent of CD. # Force DSR on. # CD respond time=100ms Appendix D. Modem Configurations 1069 expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=0\r" # Numeric response code # Auto Answer OFF expect "0\r" timeout 2 # Confirm commands successful. done # error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done 1070 406/90 Service Guide Sample File modem_m1.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP modem_m1 # # FUNCTIONS: Modem configuration file specifically for IBM 7857-017 modem with # Auto-Reliable feature. This feature must be turned off for Catcher calls. # This example uses the AT&F reset command to choose the factory defaults. # # To allow dial commands for digital pagers, it is necessary to reduce # the number of characters in the dial command. Each comma (delay) has # been set to 6 seconds (S8=6) for that reason. # # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # ICDelay 1 DefaultTO 10 CallDelay 120 # # %N Call-Out phone number %R Return phone number # # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2S8=6\r" # Trellis modulation disabled # Retrain with adaptive rate # Set ,=6second expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "ATV0X0S0=0\r" # Numeric response code # AT compatible messages # Auto-Answer disabled expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. expect "1\r" busy "7\r" timeout 60 # Expect a connection response. done retry: send "A/" # Repeat the previous command. expect "1\r" busy "7\r" timeout 60 # Expect a connection response. done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. Appendix D. Modem Configurations 1071 send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2\r" # Trellis modulation disabled # Retrain with adaptive rate expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "ATV0X0S0=2\r" # Numeric response code # AT compatible messages # Answer on 2nd ring expect "0\r" timeout 2 # Confirm commands successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "1\r" timeout 60 done page: send "ATD%N,%R\r" # %N = pager call center number # commas=6sec wait time to # enter paging number. # %R = return number expect "0\r" or "3\r" timeout 30 # Confirm successful command. delay 2 # Wait before hanging up. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2\r" # Trellis modulation disabled # Retrain with adaptive rate expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command. send "ATV0X0S0=0\r" # Numeric response code # AT compatible messages # Auto-Answer disabled expect "0\r" timeout 2 # Confirm commands successful. done # error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "4\r" or "3\r" delay 2 done 1072 406/90 Service Guide Sample File modem_z.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z # # FUNCTIONS: Modem configuration file for many early Hayes* compatible modems. # This example uses the ATZ reset command to choose the factory defaults. # This setup will work for many modems, but it is required for early vintage # modems which respond to neither the ATZ0 reset command nor the extended (&) # commands. Refer to your modem manual. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # Z Reset to factory defaults Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 Appendix D. Modem Configurations 1073 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. 1074 406/90 Service Guide error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Appendix D. Modem Configurations 1075 Sample File modem_z0.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z0 # # FUNCTIONS: Modem configuration file for some early Hayes* compatible modems. # This example uses the ATZ0 reset command to choose the factory defaults. # This setup is recommended for modems that will respond to the ATZ0 command # and which do not respond to the extended (&) commands. Refer to your modem # manual. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # Z0 Reset. Restore Profile 0 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" 1076 406/90 Service Guide timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. Appendix D. Modem Configurations 1077 done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done 1078 406/90 Service Guide Sample File modem_f.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F # # FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F reset command to choose the factory defaults. # This set up is preferred for modems with extended (&) commands. For early # vintage modems, setup Z or Z0 is recommended. If your modem responds to # the extended (&) commands and to factory default choices (&Fn), setup file # F0 or F1 is recommended. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F Reset to default profile Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # &C1 Detect CD &D2 Respond to DTR (often the default) # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&FQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done Appendix D. Modem Configurations 1079 connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&FQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&FQ0T\r" # Reset to factory defaults. 1080 406/90 Service Guide ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Appendix D. Modem Configurations 1081 Sample File modem_f0.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F0 # # FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F0 reset command to choose the factory defaults. # This set up is preferred for modems with extended (&) commands. For early # vintage modems, setup Z or Z0 is recommended. If your modem responds to # the extended (&) commands and to factory default choices (&Fn), but doesn’t # work properly with this setup file, setup F1 is recommended. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F0 Reset. Restore profile 0 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # &C1 Detect CD &D2 Respond to DTR (often the default) # &R1 Ignore RTS (CTS) # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. 1082 406/90 Service Guide done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. Appendix D. Modem Configurations 1083 done ripo: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done 1084 406/90 Service Guide Sample File modem_f1.cfg # # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F1 # # FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F1 reset command to choose the factory defaults. # This set up is for modems with extended (&) commands and which do not work # properly with setup F0. For early vintage modems, setup Z or Z0 is # recommended. # # * Trademark of Hayes Microcomputer Products, Inc. # # (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # # # If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F1 Reset. Restore profile 1 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # # &C1 Detect CD &D2 Respond to DTR (often the default) # &R1 Ignore RTS (CTS) # # %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) # # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK # # PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done Appendix D. Modem Configurations 1085 connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done 1086 406/90 Service Guide ripo: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Appendix D. Modem Configurations 1087 1088 406/90 Service Guide Appendix E. Ground Path The ground path for the 406/90 is shown in the following figure. 1089 1090 406/90 Service Guide Index Numerics406/90 overview 1 Aabout this book xiii acousticalnoiseemissions 1054 addingFRUs 9 serviceable event comments 8 air filters 912 AIX location codes 66, 71 aluminum backplane stiffener 906 attention LED 22 HMC 22 automatic call homeoverview 5 Bbatterydisposal, recycling 1049 removal 859 bay locations 20 bay locations, DASD 26 boot list 762 boot mode menu 780 boot problems 409 bypass interlock, room EPO 16, 850 Ccable installation and management procedures 1003 cablingI/O subsystem power 30, 33 rulesRIO 40 RIO-2 40 call-intesting 1058 call-in/call-out setup menu 795 call-outtesting 1058 call-out (call-home) 815 call-out policy setup menu 800 capacitor books and capacitor blanks 862 CD-ROM removal 998 change general access password 774 change privileged access password 774 chassisfan 912 checkpoints 369 description 145 firmware 379 service processor 372 CoD transactions 752 code, error 413 codes, spcn informational 755 codes, VPD informations 756 common service processor error codes 737 configurationclient 767 modemcustomizing files 1064 file selection 1062 sample files 1061 specific files 1061 NIM server 766 service processorserial port 1059 configurationsmodem 1061 configurememory 812 processor 812 console mirroring 816 enable/disable 774 customer account setup menu 801 customizing filesmodemconfiguration 1064 Ddata flow 11 DCA books and DCA blanks 861 DCA power cables 30 deconfigurememory 812 processor 812 description, ipl flow 369 diagnosticsconsiderations 759 loading 764 online 760 concurrent mode 762, 763 maintenance mode 763 overview 145 runningservice mode 761 running diagnostics on S1 port 148 service mode 760 standalone 764, 765 considerations 764 NIM server 766 partition considerations 764 using 759 disable console mirroring 774 diskette drive removal 999 documentationAIX 763 DVD-RAM removal 998 1091 Eenable console mirroring 774 entering open firmware 13 entry MAP 145, 149 EPO bypass interlock 16, 850 error code to FRU index 413 error code, B1xx 8FF0 752 error codescommon firmware 737 firmware/POST 488 HMC surveillance 756 operator panel 419 POST 413 problem determination-generated 757 scan dump 736 service processor 516 SPCN 420 system firmware 736 error logsservice processor 817 extended error datamanaging 8 external cables 1029 Ffan assembly 912 fan chassis 912 fan controller 911 fan power cables 30 file selectionmodemconfiguration 1062 firmware 806 frame (power subsystem controller firmware update) 809 level 806 systemupdates 805 update 806 firmware checkpoints 379 firmware/POST error codes 488 flow, data 11 flow, IPL 369 focal point procedures, service 145 FRUhot-pluggable 842 isolation 147 updating information 8 FRU index 413 FRU LEDsactivating 9 deactivating 9 FRUsadding 9 replacing 9 Ggeneral user menu 771 Hhandling static-sensitive devices 837 handling static-sensitive L3 modules 863, 882 handling static-sensitive MCM modules 863, 882 hardware service functionsactivating and deactivating FRU LEDs 9 HMCattention LED 22 cabling 59 resetting the system attention LED 22 HMC (hardware management console)system error access 21 HMC systemresetting attention LED 22 HMC system error access 21 hot-pluggable FRUs 842 II/O blanks 854 I/O books 854 I/O subsystem backplane assembly 989 I/O subsystem chassis 995 I/O subsystem DASD 4-pack 993 I/O subsystem disk drive assembly 991 I/O subsystem I/O bus adapter 994 I/O subsystem power cables 33 I/O subsystem RIO riser card 986 IBF 37 IBF power cables 30 index, error code to FRU 413 indicators, POST 12 information, reference 1 installcorrective service on the frame 810 reconfiguration procedure for SNI adapters 811 installation and managementswitch and network cables 1003 integrated battery feature 35 introducing the system 1 IPLflow 369 phases 369 isolation, FRU 147 Kkeyboards 1031, 1032 keysnumeric 1 key 12 numeric 5 12 numeric 6 12 numeric 8 key 13 Llanguage selection menu 794 laser compliance statement ix, 998 laser safety information ix LCD progress indicator log 818 1092 406/90 Service Guide LEDindicatormenu 788 LED status 21 Linux SRNs 1035 location codeformat 64 location codes 64, 71 AIX 66 physical 64 locations 13 AIX 68 DASD bays 26 DASD SCSI IDs 26 I/O subsystem front 24, 26 I/O subsystem rear 25 location code reference tables 68 logical 64 media bays 20 media SCSI IDs 20 media subsystem operator panel 21 media, front 20 media, rear 20 physical 64, 68 processor subsystem front 13 processor subsystem rear 18, 19 Mmaintenance analysis procedures 145, 149 MAP 1230, platform error problem resolution 163 MAP 1235, system array self-repair problem resolution 168 quick entry MAP 151 maintenance analysis procedures (MAPs) 149 hot-plug MAP 188 minimum configuration MAP 318 power MAP 192 problem determination MAP 158 managingextended error data 8 MAP hot-plug 188 media subsystem cable removal 1002 media subsystem operator panel locations 21 media subsystem, front 20 media subsystem, rear 20 memoryconfigure 812 deconfigure 812 memory books and memory blanks 910 memory configuration/deconfiguration menu 783, 784 menuboot mode 780 call-in/call-out 795 call-out policy setup 800 customer account setup 801 generaluser 771 language selection 794 LED indicator 788 main 772 menu (continued)memory configuration/deconfiguration 783 modem configuration 796 OS surveillancesetup 775 power control network utilities 785 privileged user 772 processor configuration/deconfiguration 782 reboot/restart policy setup 779 ring indicate 778 serial port selection 797 serial port speed setup 798 setup 773 system information 781 system power control 778 telephone setup 798 messages, service processor checkpoints 372 minimum configuration MAP 318 modemconfigurationcustomizing files 1064 file selection 1062 sample files 1061 specific files 1061 configurations 1061 modem sessiontransfer 1066 modem_f.cfg, sample file 1079 modem_f0.cfg, sample file 1082 modem_f1.cfg, sample file 1085 modem_z.cfg, sample file 1073 modem_z0.cfg, sample file 1076 NNIM serverconfiguration 766 client 767 standalone diagnostics 766 noise emissionsacoustical 1054 non-HMC attached system error access 22 Oonline diagnosticsconcurrent mode 762, 763 maintenance mode 763 modes of operation 760 service mode 760 open firmware, entering 13 operating considerationsonline and standalone diagnostics 759 operational phasesservice processor 820 operator panel error codes 419 operator panel removal 1000 OS defined restart policy 779 overview 1 diagnostics 145 service focal point 4 Index 1093 overview (continued)system 1 Pparametersservice processorservice mode 801 partitionsconsiderationsstandalone 764 partskeyboard 1031, 1032 parts information 1009 external cables 1029 media subassembly 1021 power subsystem 1010 processor subsystem 1012, 1014, 1018 rack subsystem 1010 subassembly 1025 passwords 774 general, changing 774 privileged, changing 774 PCIslot LED definitions 913 PCI adapters 914 accessing hot-plug functions 925 hot-plug manager access 924 hot-pluggable adapters 921 hot-pluggable manager menu 925 install a short adapter into cassette 951 installing a long adapter into a cassette 968 installing a new hot-pluggable adapter 923 long adapter removal 946 removing adapter from cassette 935 removing and replacing hot-pluggable adapter 921 removing non-hot-pluggable adapter 920 removing PCI adapter cassette 915 replacing a PCI adapter cassette 918 replacing an adapter in a cassette 946 replacing non-hot-pluggable adapter 920 short adapter removal 943 phases, ipl 369 physical location codes 64, 71 platform error problem resolution, MAP 1230 163 POST 413 POST indicators 12 POST keysnumeric 1 12 numeric 5 key 12 numeric 6 key 12 numeric 8 key 13 power cablesDCAs 30 fans 30 I/O subsystem 33 IBF 30 power cables, I/O subsystem 33 power control network utilities menu 785 power MAP 145, 192 power source 133 power subsystem 29 power supply fan removal 987 power supply removal 987 power-onmethods 802 power-on self-test 12 powering off 838 powering on 838, 841 powering on and off 838 powering on with service processor 841 preface xiii preventive maintenance 142 air filters 142 primary console 1067 primary I/O book 854 privileged user menus 772 problem determination MAP 158 problem determination-generated error codes 757 problem isolation MAP 145 problems, boot 409 procedurescable installation and management 1003 procedures, removal and replacement 834 procedures, service focal point 145 processorconfigure 812 deconfigure 812 fans 912 processor configuration/deconfiguration menu 782 processor fan assembly 912 processor fan controller 911 processor fans 912 processor subsystem chassis 908 product disposal 1049 Qquick entry MAP 145, 151 Rracksecondary 37, 38, 39, 43, 48, 49, 52, 53 reboot/restart policy setup menu 779 recoveryprocedures 1065 strategy 1067 recycling 1049 reference information 1 reference tablesAIX location code 68 physical location code 68 related publications xiv removal and replacement introduction 837 removal and replacement procedures 834 air filters 912 battery 859 capacitor books and capacitor blanks 862 CD-ROM drive 998 DCA books and DCA blanks 861 diskette drive 999 1094 406/90 Service Guide removal and replacement procedures (continued)DVD-RAM drive 998 I/O blanks 854 I/O books 854 I/O subsystem backplane assembly 989 I/O subsystem chassis 995 I/O subsystem DASD 4-pack 993 I/O subsystem DASD assembly 991 I/O subsystem I/O bus adapter 994 I/O subsystem RIO riser card 986 media subsystem cables 1002 memory books and memory blanks 910 operator panel 1000 power supply 987 power supply fan 987 primary I/O book 854 processor subsystem chassis 908 removal procedures 863 L3 cache modules 882 mcm filler covers 863 mcm modules 863 pass through modules 863 replacement procedures 870 I/O book 856 L3 cache modules 885 mcm modules 870 memory modules 911 replacingFRUs 9 resetting the system attention LED 22 HMC 22 restart policyOS defined 779 supplemental 779 restart recovery 803 RETAIN 801 retries 800 ring indicate power-on menu 778 RIOcabling 40 rules 40 RIO-2cabling 40 rules 40 room EPO bypass interlock 16, 850 running online diagnostics with no attached HMC 762 running standalone diagnostics with no attached HMC 765 SS1 port testingnative serial port 148 safetyelectrical vii safety notices vii laser compliance statement ix safety statement, laser 998 sample filesmodemconfigurations 1061 scan dump messages 736 scan log dump policy 776 SCSI IDs 20, 26 self-test, power-on 12 serial portselection menu 797 snoop setup menu 776 speed setup menu 798 service focal pointactivating and deactivating FRU LEDs 9 adding FRUs 9 call home overview 5 closing a serviceable event 8 enabling surveillance notifications 6 overview 4 procedures 145 replacing FRUs 9 saving and managing extended error data 8 serviceable events 7 setting up surveillance 6 settings 5 testing error reporting 5 updating FRU information 8 viewing and adding serviceable event comments 8 viewing error details 8 viewing service processor error details 8 viewing serviceable event details 7 viewing serviceable event partition information 9 viewing serviceable events 7 service inspection guide 142 service processor 769 accesslocal 769 remote 770 boot mode menu 780 call-intest 1057 call-in/call-out setup menu 795 call-outtest 1057 call-out call-home 815 call-out policy setup menu 800 checklist 1057 checkpoints 372 console mirroring 816 customer account setup menu 801 error logs 817 general user menu 771 language selection menu 794 main menu 772 memory configuration/deconfiguration menu 783, 784 menus 769 modem configuration menu 796 monitoring - surveillance 813 operational phases 820 OS surveillance setup menu 775 parametersservice mode 801 passwords 774 power control network utilities menu 785 Index 1095 service processor (continued)privileged user menus 772 procedures in service mode 801 processor configuration/deconfiguration menu 782 reboot/restart policy setup menu 779 reboot/restart recovery 803 ring indicate power-on menu 778 serial portconfiguration 1059 serial port selection menu 797 serial port snoop setup menu 776 serial port speed setup menu 798 settingssaving and restoring 770 setup 1057 setup checklist 1057 setup menu 773 system information menu 781 system power control menu 778 telephone setup menu 798 test 1057 service processor checkpoints 372 service processor error codes 516 serviceable eventviewing partition information 9 serviceable eventsadding comments 8 selecting and viewing 7 updating FRU information 8 viewing comments 8 viewing details 7 viewing error details 8 viewing service processor error details 8 working with 7 setup and testservice processor 1057 setup menu 773 SPCN error codes 420 spcn informational codes 755 specifc filesmodemconfiguration 1061 specifications 129 distribution, weight 135 environmental 135 height 133 noise emissions 135 processor subsystem 129 weight 133, 134 SRNs for Linux 1035 standalone diagnostics 764, 765 NIM server 766, 767 partition considerations 764 supplemental restart policy 779 surveillancemonitoring 813 switch cableinstallation and management procedures 1003 systemattention LED 21 HMC attached system error access 21 system (continued)non-HMC attached system error access 22 resetting attention LED 22 system array self-repair problem resolution , MAP 1235 168 system firmware update messages 736 system firmware updates 805 system information menu 781 system introduction 1 system management services 823 boot devices, select 831 boot options, select 829 error log, view 825 exiting SMS 834 IPL, remote setup 826 IPL, setup 826 multiboot startup 834 password, change options 825 password, privileged-access 825 SCSI settings, change 828 select console 828 select language 824 settings, display current 833 settings, restore default 834 system power control menu 778 system power-on methods 802 system specifications 129 system, powering 841 Ttelephone setup menu 798 terminal emulators 1065 terminal typeidentifying 760 testing the setupcall-in 1058 call-out 1058 trademarks xiv transactions, CoD 752 transfer of a modem session 1066 UUEPO card assembly 850 UEPO locations 16 UEPO removal and replacement 850 unit emergency power off card assembly 850 updatingframe firmware 809 FRU information 8 Vviewingserviceable event comments 8 serviceable event details 7 serviceable event error details 8 serviceable event partition information 9 VPD informational codes 756 1096 406/90 Service Guide Wweb sitesfirmware updates 806 microcode updates 806 Index 1097 1098 406/90 Service Guide Vos remarques sur ce document / Technical publication remark form Titre / Title : Bull ESCALA PL 3200R Service Guide Nº Reférence / Reference Nº : 86 A1 82EF 03 Daté / Dated : March 2004 ERREURS DETECTEES / ERRORS IN PUBLICATION AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION Vos remarques et suggestions seront examinées attentivement. Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète. Your comments will be promptly investigated by qualified technical personnel and action will be taken as required. If you require a written reply, please furnish your complete mailing address below. NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : Remettez cet imprimé à un responsable BULL ou envoyez-le directement à : Please give this technical publication remark form to your BULL representative or mail to: BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Technical Publications Ordering Form Bon de Commande de Documents Techniques To order additional publications, please fill up a copy of this form and send it via mail to: Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à : BULL CEDOC ATTN / Mr. L. CHERUBIN 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Phone / Téléphone : +33 (0) 2 41 73 63 96 FAX / Télécopie +33 (0) 2 41 73 60 19 E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr Or visit our web sites at: / Ou visitez nos sites web à: http://www.logistics.bull.net/cedoc http://www–frec.bull.com http://www.bull.com CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] [ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : PHONE / TELEPHONE : FAX : E–MAIL : For Bull Subsidiaries / Pour les Filiales Bull : Identification: For Bull Affiliated Customers / Pour les Clients Affiliés Bull : Customer Code / Code Client : For Bull Internal Customers / Pour les Clients Internes Bull : Budgetary Section / Section Budgétaire : For Others / Pour les Autres : Please ask your Bull representative. / Merci de demander à votre contact Bull. BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 82EF 03 ORDER REFERENCE Utiliser les marques de découpe pour obtenir les étiquettes. Use the cut marks to get the labels. 86 A1 82EF 03 Service Guide ESCALA PL 3200R 86 A1 82EF 03 Service Guide ESCALA PL 3200R 86 A1 82EF 03 Activation SAS (Serial-Attached SCSI) RAID pour les modèles E1-700, E1-705, E3-700 ou E3-705 REFERENCE 86 F1 89FF 03 ESCALA Power7 Activation SAS (Serial-Attached SCSI) RAID pour les modèles E1-700, E1-705, E3-700 ou E3-705 Les publications pour ESCALA Power7 concernent les modèles suivants : - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M6-705 (Power 770 / 9117-MMC) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala M7-705 (Power 780 / 9179-MHC) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E1-705 (Power 710 / 8231-E1C) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E3-705 (Power 730 / 8231-E2C) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) - Bull Escala E4-705 (Power 740 / 8205-E6C) Les références au modèle Power 755 / 8236-E8C sont sans objet. Hardware October 2011 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 F1 89FF 03 L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui prohibent, sans s’y limiter, des actions comme la copie, la distribution, la modification et la création de produits dérivés. Copyright Bull SAS 2011 Imprimé en France Marques déposées Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs. Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des marques et/ou des brevets. La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit des marques et/ou des brevets. Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS ne pourra pas être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce manuel, ni pour tout dommage p ouvant résulter de son application. Table des matières Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . . v Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Activation SAS RAID et bloc de batteries de cache pour le modèle 31E/2B, 31E/1C ou 31E/2C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Activation SA RAID - Nouveautés. . . . . . . . . . . . . . . . . . . . . . . . . 3 Installation du fond de panier de stockage - 6 disques SFF/DVD SATA (FC 5267 et FC EJ0D). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Installation du fond de panier de stockage - 3 disques SFF/DVD SATA/Bande HH (FC 5263 et FC EJ0E). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe (FC 5268 et FC EJ0F) . . . . . . . . . 13 Retrait et remise en place des adaptateurs et batteries SAS RAID . . . . . . . . . . 23 Retrait et remplacement du fond de panier d'unités de disque . . . . . . . . . . . . . . . . . . 23 Retrait du fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C, 31E/2C. . . . . . . . . 23 Remplacement du fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C, 31E/2C . . . . . . 29 Retrait et remplacement du fond de panier système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . 34 Retrait du fond de panier système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . . . . 34 Remplacement du fond de panier système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . 45 Considérations relatives à la maintenant du sous-système SAS 31E/2B . . . . . . . . . . . . . . . 60 Considérations de maintenance pour le bloc de batterie de cache . . . . . . . . . . . . . . . . 61 Retrait de la carte d'activation RAID/cache31E/2B . . . . . . . . . . . . . . . . . . . . . . 63 Remplacement de la carte d'activation RAID/cache31E/2B . . . . . . . . . . . . . . . . . . . 64 Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C. . . . . . . . . 66 Remplacement du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C . . . . . . 70 Retrait et installation de la carte de batterie de cache . . . . . . . . . . . . . . . . . . . . . 73 Retrait et remplacement du bloc de batterie de cache . . . . . . . . . . . . . . . . . . . . . 80 Procédures communes aux dispositifs installables . . . . . . . . . . . . . . . . 83 Avant de commencer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 Identification d'un composant défaillant. . . . . . . . . . . . . . . . . . . . . . . . . . 86 Voyants du panneau de commande . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Identification d'un composant défaillant sur un système AIX ou une partition logique . . . . . . . . . 88 Localisation d'un composant défaillant sur un système AIX ou une partition logique. . . . . . . . . 88 Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 88 Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 89 Identification d'un composant défaillant sur un système Linux ou une partition logique. . . . . . . . . 89 Localisation d'un composant défaillant sur un système Linux ou une partition logique . . . . . . . . 89 Localisation d'un composant défaillant sur un système ou une partition logique Linux . . . . . . . . 89 Activation du voyant associé au composant défaillant . . . . . . . . . . . . . . . . . . . 90 Désactivation du voyant d'un composant défaillant . . . . . . . . . . . . . . . . . . . . 90 Localisation d'un composant défaillant sur un système ou une partition logique Virtual I/O Server . . . . . 90 Identification d'un composant à l'aide du Virtual I/O Server . . . . . . . . . . . . . . . . . 91 Démarrage du système ou de la partition logique . . . . . . . . . . . . . . . . . . . . . . 91 Démarrage d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . 91 Démarrage d'un système ou d'une partition logique via la console HMC. . . . . . . . . . . . . . 93 iii Démarrage d'un système ou d'un serveur virtuel via la console SDMC . . . . . . . . . . . . . . 93 Arrêt d'un système ou d'une partition logique. . . . . . . . . . . . . . . . . . . . . . . . 93 Arrêt d'un système non géré par une console HMC ou une console SDMC . . . . . . . . . . . . . 94 Arrêt d'un système via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . . 95 Arrêt d'un système via la console SDMC . . . . . . . . . . . . . . . . . . . . . . . . 96 Retrait et remplacement des capots sur le système 31E/2B, 31E/1C ou 31E/2C. . . . . . . . . . . . . 96 Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . . . 96 Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . . . . 97 Mise en position de maintenance ou de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C. . . . . . . 98 Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire . . . . . . . 98 Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire . . . . . . 99 Déconnexion des cordons d'alimentation du système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . 101 Connexion des cordons d'alimentation au système 31E/2B, 31E/1C ou 31E/2C . . . . . . . . . . . . 101 Installation d'un dispositif via la console HMC . . . . . . . . . . . . . . . . . . . . . . . 102 Retrait d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . . . . 103 Remplacement d'un composant via la console HMC . . . . . . . . . . . . . . . . . . . . . 103 Vérification du composant installé . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX 104 Vérification d'un composant installé sur un système Linux ou une partition logique. . . . . . . . . . 107 Vérification d'un composant installé via les programmes de diagnostic autonome . . . . . . . . . . 107 Vérification d'un composant installé à l'aide de la console HMC (Hardware Management Console). . . . . 108 Activation/Désactivation des voyants via la console HMC . . . . . . . . . . . . . . . . . 109 Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 109 Activation ou désactivation d'un voyant d'identification . . . . . . . . . . . . . . . . . 110 Affichage des événements réparables à l'aide de la console HMC . . . . . . . . . . . . . . . 111 Vérification du composant installé via la console SDMC . . . . . . . . . . . . . . . . . . . 111 Activation/Désactivation de voyants via la console de gestion Systems Director . . . . . . . . . . 112 Désactivation d'un voyant d'avertissement système ou de partition . . . . . . . . . . . . . 112 Activation ou désactivation de voyants d'identification via la console SDMC . . . . . . . . . . 113 Visualisation d'événements réparables via la console de gestion Systems Director. . . . . . . . . . 113 Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à l'aide des outils Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . . . . . . 114 Installation d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . . 117 Remplacement d'un composant via la console SDMC . . . . . . . . . . . . . . . . . . . . . 117 Retrait d'un composant via la console SDMC. . . . . . . . . . . . . . . . . . . . . . . . 118 Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120 Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120 Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121 Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124 Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127 iv Activation SAS (Serial-attached SCSI) RAID Avis aux lecteurs canadiens Le présent document a été traduit en France. Voici les principales différences et particularités dont vous devez tenir compte. Illustrations Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la France. Terminologie La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au besoin. IBM France IBM Canada ingénieur commercial représentant agence commerciale succursale ingénieur technico-commercial informaticien inspecteur technicien du matériel Claviers Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier français-canadien, de type QWERTY. OS/2 - Paramètres canadiens Au Canada, on utilise : v les pages de codes 850 (multilingue) et 863 (français-canadien), v le code pays 002, v le code clavier CF. Nomenclature Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre clavier. v Recommandations à l'utilisateur Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur (instructions d'utilisation, manuels de référence et manuels d'entretien). Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou plusieurs des mesures suivantes : v Réorienter l'antenne réceptrice ; v Déplacer l'équipement par rapport au récepteur ; v Éloigner l'équipement du récepteur ; v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent sur des circuits distincts ; v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien serrés ; v Vérifier la mise en place des obturateurs sur les connecteurs libres. Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des câbles blindés mis à la terre, à travers des filtres si nécessaire. En cas de besoin, adressez-vous à votre détaillant. Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des modifications non autorisées ont été effectuées sur l'équipement. vi Activation SAS (Serial-attached SCSI) RAID L'obligation de corriger de telles interférences incombe à l'utilisateur. Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus amples renseignements. Brevets Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos demandes de renseignements relatives aux permis d'utilisation au directeur général des relations commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7. Assistance téléphonique Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des publications IBM, contactez IBM direct au 1 800 465-1234. Avis aux lecteurs canadiens vii viii Activation SAS (Serial-attached SCSI) RAID Consignes de sécurité Les consignes de sécurité peuvent être imprimées tout au long de ce guide. v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles. v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de certaines circonstances réunies. v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une unité, un système ou des données. Consignes de sécurité relatives au commerce international Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez claires pour vous. Informations sur les appareils à laser Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des lasers ou des diodes électroluminescentes (LED). Conformité aux normes relatives aux appareils à laser Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique. ix DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les cordons d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les cordons d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) x Activation SAS (Serial-attached SCSI) RAID DANGER Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Consignes de sécurité xi ATTENTION : Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment, procédez comme suit : v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette configuration, procédez comme suit : – Retirez toutes les unités de la position 32U et plus. – Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire. – Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en dessous du niveau 32U. v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier. v Vérifiez l'itinéraire envisagé pour éliminer tout risque. v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre armoire. v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm. v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles sont bien fixés. v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute. v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement. v N'utilisez pas de rampe inclinée à plus de dix degrés. v Dès que l'armoire est à son nouvel emplacement, procédez comme suit : – Abaissez les quatre vérins de mise à niveau. – Installez des équerres de stabilisation sur l'armoire. – Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie inférieure à la partie supérieure de l'armoire. v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez l'armoire à celle-ci. (R002) (L001) (L002) xii Activation SAS (Serial-attached SCSI) RAID (L003) ou Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité. ATTENTION : Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM, DVD-RAM ou module à laser. Notez les informations suivantes : v Ne retirez pas les capots. En ouvrant le produit à laser, vous vous exposez au rayonnement dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur. v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation des commandes, ainsi que les procédures décrites dans le présent manuel. (C026) Consignes de sécurité xiii ATTENTION : Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027) ATTENTION : Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028) ATTENTION : Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez connaissance des informations suivantes. Rayonnement laser lorsque le capot est ouvert. Evitez toute exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide d'instruments optiques. (C030) ATTENTION : Cette pile contient du lithium. Pour éviter tout risque d'explosion, n'essayez pas de la recharger et ne la faites pas brûler. Ne pas : v ___ la jeter à l'eau v ___ l'exposer à une température supérieure à 100 °C v ___ chercher à la réparer ou à la démonter Ne la remplacez que par une pile agréée. Pour le recyclage ou la mise au rebut, reportez-vous à la réglementation en vigueur. (C003a) Informations sur l'alimentation électrique et sur le câblage relatives au document GR-1089-CORE du NEBS (Network Equipment-Building System) Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document GR-1089-CORE du NEBS (Network Equipment-Building System) : Cet équipement peut être installé : v dans des infrastructures de télécommunications réseau v aux endroits préconisés dans les directives NEC (National Electrical Code). Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie métallique au câblage du réseau extérieur. Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités. Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif externe de protection contre les surtensions (SPD). Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne de retour de la batterie en courant continu ne doit pas être connectée à la masse. xiv Activation SAS (Serial-attached SCSI) RAID Activation SAS RAID et bloc de batteries de cache pour le modèle 31E/2B, 31E/1C ou 31E/2C Vous pouvez être amené à configurer des disques RAID (Redundant Array of Independent Disks) sur les unités de disque SAS (serial-attached SCSI) du boîtier système. Suivez les instructions de cette rubrique pour installer sur le système les dispositifs d'activation RAID. Les instructions suivantes permettent d'installer des cartes d'activation SAS RAID afin de préparer le système à l'utilisation d'unités de disque SAS dans une configuration RAID. 1 2 Activation SAS (Serial-attached SCSI) RAID Activation SA RAID - Nouveautés Prenez connaissance des nouveautés et des modifications significatives apportées à "Activation SAS RAID" depuis la dernière mise à jour de cet ensemble de rubriques. Octobre 2011 v Ajout d'informations relatives aux systèmes 31E/1C et 31E/2C. 3 4 Activation SAS (Serial-attached SCSI) RAID Installation du fond de panier de stockage - 6 disques SFF/DVD SATA (FC 5267 et FC EJ0D) Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de stockage - 6 disques SFF/DVD SATA. Cette procédure s'applique à l'installation de nouveaux dispositifs. Avant de commencer Remarque : FC 5267 s'applique à 31E/2B et FC EJ0D s'applique à 31E/1C ou 31E/2C. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Pourquoi et quand exécuter cette tâche Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 102. Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un composant via la console SDMC», à la page 117. Procédure Effectuez les étapes suivantes pour installer les composants inclus avec le dispositif de fond de panier de stockage - 6 disques SFF/DVD SATA. 1. Effectuez les tâches prérequises telles qu'elles sont décrites dans «Avant de commencer», à la page 83. Important : Ne coupez pas d'alimentation du système. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93). 3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96. 5 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 86. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) ou 7. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Installez l'interposeur système. Pour des instructions, voir Remplacement hors tension de l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C. 9. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remplacement du fond de panier de l'unité de disque 31E/2B, 31E/1C ou 31E/2C. 10. Installez les unités de disque. v Pour les unités de disque internes, voir Remplacement de l'unité de disque interne dans un système 31E/2B, 31E/1C ou 31E/2C. v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD. 6 Activation SAS (Serial-attached SCSI) RAID 11. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage extra-plate sur le modèle 31E/2B, 31E/1C, ou 31E/2C. 12. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97. 13. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99. 14. Rebranchez la source d'alimentation. 15. Démarrez le système comme indiqué dans «Démarrage du système ou de la partition logique», à la page 91. 16. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Installation du fond de panier de stockage - 6 disques SFF/DVD SATA (FC 5267 et FC EJ0D) 7 8 Activation SAS (Serial-attached SCSI) RAID Installation du fond de panier de stockage - 3 disques SFF/DVD SATA/Bande HH (FC 5263 et FC EJ0E) Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de stockage - 3 disques SFF/DVD SATA/Bande HH. Cette procédure s'applique à l'installation de nouveaux dispositifs. Avant de commencer Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Pourquoi et quand exécuter cette tâche Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 102. Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un composant via la console SDMC», à la page 117. Procédure 1. Effectuez les tâches prérequises telles qu'elles sont décrites dans «Avant de commencer», à la page 83. Important : Ne coupez pas d'alimentation du système. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93). 3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96. 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 86. 6. Débranchez le système pour l'isoler de la source d'alimentation. 9 Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) ou 7. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Installez l'interposeur système. Pour des instructions, voir Remplacement de l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C avec mise hors tension. 9. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remise en place du fond de panier de l'unité de disque 31E/2B, 31E/1C ou 31E/2C. 10. Installez les unités de disque. v Pour les unités de disque internes, voir Remise en place de l'unité de disque interne dans un système 31E/2B, 31E/1C ou 31E/2C . v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD. 11. Installez l'unité de bande SAS. Pour des instructions, voir Installation d'une unité de bande SAS dans le système 31E/2B, 31E/1C ou 31E/2C . 12. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97. 10 Activation SAS (Serial-attached SCSI) RAID 13. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99. 14. Rebranchez la source d'alimentation. 15. Démarrez le système comme indiqué dans «Démarrage du système ou de la partition logique», à la page 91. 16. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Installation du fond de panier de stockage - 3 disques SFF/DVD SATA/Bande HH (FC 5263 et FC EJ0E) 11 12 Activation SAS (Serial-attached SCSI) RAID Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe (FC 5268 et FC EJ0F) Utilisez cette procédure pour installer les composants inclus avec le dispositif de fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe. Cette procédure s'applique à l'installation de nouveaux dispositifs. Avant de commencer Remarque : FC 5267 s'applique à 31E/2B et FC EJ0F s'applique à 31E/1C ou 31E/2C. FC EJ0F ne dispose pas de la carte d'activation RAID. Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de commande», à la page 87. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Avertissement : Si vous effectuez la maintenance d'un composant défaillant, voir les procédures de maintenance suivantes : v Pour la carte d'activation RAID, voir Retrait de la carte d'activation 31E/2B RAID/cache et Remise en place de la carte d'activation 31E/2B RAID/cache. v Pour le contrôleur de stockage RAID/cache, voir Retrait du contrôleur de stockage 31E/2B, 31E/1C ou 31E/2C RAID/cache et Remise en place du contrôleur de stockage 31E/2B, 31E/1C ou 31E/2C RAID/cache. Pourquoi et quand exécuter cette tâche Avant d'installer ou de remplacer un dispositif, assurez-vous que le logiciel requis pour prendre en charge ce dispositif est installé sur votre système. Si votre système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un dispositif via la console HMC», à la page 102. 13 Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour installer un dispositif sur le serveur. Pour plus d'instructions, voir «Installation d'un composant via la console SDMC», à la page 117. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83. Important : Ne coupez pas d'alimentation du système. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93). 3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96. 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 86. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 7. Attachez le bracelet antistatique. 14 Activation SAS (Serial-attached SCSI) RAID Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Pour 31E/2B, localisez la carte d'activation RAID (P1-C12), le contrôleur de stockage RAID/cache (P1-C18) et la carte de batterie de cache (P1-C13). 9. Pour 31E/1C ou 31E/2C, localisez le contrôleur de stockage RAID/cache (P1-C18) et la carte de batterie de cache (P1-C13). Figure 1. Emplacement d'installation de la carte d'activation RAID, du contrôleur de stockage et de la carte de batterie de cache pour 31E/2B Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe 15 10. Pour 31E/2B, installez la carte d'activation RAID en procédant comme suit : Remarque : Pour 31E/1C ou 31E/2C, passez à l'étape 11, à la page 17 a. Alignez la carte d'activation RAID (A) sur l'emplacement dans le système. La carte d'activation RAID comporte une plaque métallique, comme illustré ci-après. b. Faites glisser la carte d'activation RAID dans le système jusqu'au verrouillage du loquet, comme indiqué dans l'illustration suivante. Figure 2. Emplacement d'installation du contrôleur de stockage et de la carte de batterie de cache pour 31E/1C ou 31E/2C 16 Activation SAS (Serial-attached SCSI) RAID 11. Installez le contrôleur de stockage RAID/cache en procédant comme suit : a. Alignez les contrôleurs de stockage RAID et cache sur l'emplacement dans le système. b. Faites glisser le contrôleur de stockage RAID/cache ou cache dans le système jusqu'au verrouillage du loquet comme indiqué dans l'illustration suivante. Figure 3. Modèle monté en armoire : remplacement de la carte d'activation RAID Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe 17 12. Installez la carte de batterie de cache en procédant comme suit : Figure 4. Modèle monté en armoire : remplacement des contrôleurs de stockage RAID/cache pour 31E/2B Figure 5. Modèle monté en armoire : remplacement des contrôleurs de stockage RAID/cache pour 31E/1C ou 31E/2C 18 Activation SAS (Serial-attached SCSI) RAID Avertissement : Laissez la carte de batterie de cache déconnectée pendant au moins 60 secondes avant de connecter la nouvelle carte de batterie. a. Faites glisser la carte de batterie de cache dans les glissières de montage du contrôleur jusqu'à ce qu'elle s'insère dans le connecteur de batterie. b. Une fois que la batterie est correctement positionnée dans le connecteur, ramenez la languette en position verrouillée pour immobiliser la batterie sur le connecteur. Figure 6. Remise en place de la carte de batterie de cache pour 31E/2B Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe 19 13. Remplacez l'interposeur système par l'interposeur système à haute fonctionnalité. Pour des instructions, voir Remplacement hors tension de l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C. 14. Installez le fond de panier de l'unité de disque. Pour des instructions, voir Remplacement du fond de panier de l'unité de disque 31E/2B, 31E/1C ou 31E/2C. 15. Installez les unités de disque. v Pour les unités de disque internes, voir Remplacement de l'unité de disque interne dans un système 31E/2B, 31E/1C ou 31E/2C. v Pour une unité SSD, voir Installation d'une unité de disque ou d'une unité SSD. 16. Installez l'unité de DVD SATA. Pour des instructions, voir Installation d'une unité de stockage extra-plate sur le modèle 31E/2B, 31E/1C, ou 31E/2C. 17. Installez le câble de connexion d'E-S. Figure 7. Remise en place de la carte de batterie de cache pour 31E/1C ou 31E/2C 20 Activation SAS (Serial-attached SCSI) RAID 18. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97. 19. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99. 20. Rebranchez la source d'alimentation. Figure 8. Câble de connexion d'E-S SAS pour 31E/2B Figure 9. Câble de connexion d'E-S SAS pour 31E/1C ou 31E/2C Installation du fond de panier de stockage - 6 disques SFF/DVD SATA/RAID 175 Mo/Adaptateur d'E-S double/Port SAS externe 21 21. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 91). 22. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. 22 Activation SAS (Serial-attached SCSI) RAID Retrait et remise en place des adaptateurs et batteries SAS RAID Il se peut que vous deviez retirer ou remettre en place les adaptateurs SAS RAID et les batteries. La présente section comprend les procédures vous permettant d'effectuer ces tâches. Remarque : Le remplacement du dispositif incombe à l'utilisateur. Vous pouvez soit exécuter la tâche vous-même, soit faire appel à un prestataire agréé. Il vous imputera probablement les frais correspondants. Retrait et remplacement du fond de panier d'unités de disque Utilisez cette procédure pour retirer, remplacer ou installer le fond de panier d'unités de disques sur les serveurs 7/10 et 7/30 (31E/2B, 31E/1C et 31E/2C) et Les figures suivantes représentent le fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C ou 31E/2C. Retrait du fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C, 31E/2C Cette procédure permet de retirer le fond de panier d'unités de disque du serveur. Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour retirer le fond de panier d'unité de disque du serveur. Pour obtenir des informations sur l'utilisation de la console HMC en vue de retirer le fond de panier d'unité de disque, voir Retrait d'un composant via la console HMC. Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour retirer le fond de panier d'unité de disque du serveur. Pour des instructions, voir Retrait d'un composant via la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de panier d'unité de disque : Figure 10. Fond de panier d'unités de disques Figure 11. Fond de panier d'unités de disques avec bande magnétique 23 Avertissement : Veillez à retirer le panneau de commande avant le fond de panier d'unité de disque pour ne pas endommager les câbles du panneau de commande. Procédure 1. Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de commande. 2. Effectuez les tâches prérequises telles qu'elles sont décrites dans Avant de commencer. 3. Effectuez l'une des opérations suivantes : a. Si vous retirez le fond de panier d'unité de disques en vue d'une mise à niveau système ou dans le cadre d'une autre procédure, passez à l'étape 4. b. Si vous retirez le fond de panier d'unités de disques en raison d'une défaillance du système, consultez le journal de maintenance pour identifier le composant défaillant. Voir Identification d'un composant défaillant. 4. Arrêtez le système comme indiqué dans Arrêt d'un système ou d'une partition logique. 5. Mettez le système en position de maintenance, comme indiqué dans Mise en position de maintenance d'un modèle monté en armoire. 6. Débranchez le système pour l'isoler de la source d'alimentation, comme décrit dans Déconnexion des cordons d'alimentation. Remarque : Si le système est équipé d'un second bloc d'alimentation, vérifiez que la source d'alimentation a été débranchée du système. (L003) ou 7. Retirez le capot d'accès, comme indiqué dans Retrait du capot d'accès d'un modèle monté en armoire. 24 Activation SAS (Serial-attached SCSI) RAID 8. Aidez-vous des voyants de maintenance pour identifier le composant, comme indiqué dans Identification d'un composant. 9. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 10. Si vous devez remplacer le fond de panier en vue d'une mise à niveau ou d'un incident système, procédez comme suit afin de retirer les unités de disques, en fonction de votre fond de panier : Remarque : Si vous retirez un fond de panier comportant des unités de disques, le fond de panier est lourd. Assurez-vous de pouvoir effectuer cette procédure en toute sécurité. a. Pour retirer l'unité de disque du fond de panier, procédez comme suit: Remarque : Notez l'emplacement de chaque unité de disque retirée, de manière à remettre en place les unités de disque à leur emplacement d'origine. 1) Déverrouillez la poignée de l'unité de disques (A) et tirez-la vers vous pour la dégager. 2) Placez une main sous l'unité de disques et tirez avec précaution l'unité de disques du système, comme indiqué dans la figure 12. Figure 12. Retrait de l'unité de disque du fond de panier Retrait et remise en place des adaptateurs et batteries SAS RAID 25 11. Retirez l'unité de stockage en appuyant sur le taquet (A) et faites-la glisser hors du système. Voir la figure 2 ou 3 en fonction de la configuration mécanique de votre système. Pour plus d'informations sur le retrait des différents types d'unités de stockage, voir Retrait d'une unité de stockage SAS sur les modèles 31E/2B, 31E/1C ou 31E/2C. Remarque : L'unité est maintenue en place par une pièce d'attache par friction. Il n'y a aucune vis ou rivet qui la maintient en place. Veillez à retirer l'unité avec précaution afin de ne pas faire tomber la pièce. Figure 13. Retrait de l'unité de stockage extra-plate du fond de panier d'unités de disques 26 Activation SAS (Serial-attached SCSI) RAID 12. Retirez le panneau de commande et le câble d'interface du fond de panier. Pour plus d'informations, voir Panneau de commande et câbles d'interface. 13. Déverrouillez le fond de panier de l'unité de disques en tirant les poignées (A) afin de les mettre en position ouverte, au centre du fond de panier, comme indiqué dans la figure 15, à la page 28 et figure 16, à la page 28. Figure 14. Retrait de l'unité de stockage SAS du fond de panier d'unités de disques Retrait et remise en place des adaptateurs et batteries SAS RAID 27 14. Extrayez le fond de panier du serveur. 15. Si vous avez retiré le fond de panier dans le cadre d'une autre procédure, reprenez cette procédure. Pour remettre en place le fond de panier, voir Remplacement du fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C, 31E/2C. Figure 15. Retrait du fond de panier comportant des unités de disque Figure 16. Retrait du fond de panier ne comportant pas d'unités de disque 28 Activation SAS (Serial-attached SCSI) RAID Remplacement du fond de panier d'unités de disques sur les modèles 31E/2B, 31E/1C, 31E/2C Utilisez cette procédure pour remplacer le fond de panier d'unités de disque. Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur. Pour plus d'informations, voir la rubrique Remplacement d'un composant via la console HMC. Si le système est géré par la console SDMC (Systems Director Management Console), utilisez la console SDMC pour effectuer la procédure de remplacement du fond de panier d'unité de disque sur le serveur. Pour obtenir des instructions, voir la rubrique Remplacement d'un composant via la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de panier d'unité de disque : Procédure 1. Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de commande. 2. Effectuez les tâches prérequises telles qu'elles sont décrites dans Avant de commencer. 3. Arrêtez le système comme indiqué dans Arrêt d'un système ou d'une partition logique. 4. Mettez le système en position de maintenance, comme indiqué dans Mise en position de maintenance d'un modèle monté en armoire. 5. Débranchez le système pour l'isoler de la source d'alimentation, comme décrit dans Déconnexion des cordons d'alimentation. Remarque : Ce système est équipé d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) ou Retrait et remise en place des adaptateurs et batteries SAS RAID 29 6. Retirez le capot d'accès, comme indiqué dans Retrait du capot d'accès d'un modèle monté en armoire. 7. Aidez-vous des voyants de maintenance pour identifier le composant, comme indiqué dans Identification d'un composant. 8. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 9. Installez le panneau de commande et le cordon d'interface. Pour plus d'informations, voir Panneau de commande et câbles d'interface. Remarque : Vérifiez que le câble d'interface n'est pas connecté lors de l'installation du panneau de commande. 10. Vérifiez la présence éventuelle d'un capot de protection sur les broches sur la partie inférieure droite. Si un capot de protection est installé, retirez-le. 11. Soulevez délicatement le fond de panier sur les deux côtés et alignez-le avec son emplacement dans le serveur. 12. Insérez le fond de panier fermement dans le serveur. 13. Fixez le fond de panier à l'aide des taquets de verrouillage (A), comme indiqué dans les figures suivantes. 30 Activation SAS (Serial-attached SCSI) RAID 14. Réinstallez les unités de disques si vous les avez préalablement retirées. Utilisez les informations d'emplacement que vous avez enregistrées lors du retrait des unités pour les installer dans les emplacements appropriés. Pour remplacer les unités de disque, procédez comme suit : Figure 17. Installation d'un fond de panier sans unités de disque dans un modèle monté en armoire Figure 18. Installation d'un fond de panier doté d'unités de disque dans un modèle monté en armoire Retrait et remise en place des adaptateurs et batteries SAS RAID 31 a. Tenez l'unité de disque par les bords hauts et bas pour insérer le lecteur dans l'unité d'extension. Ne la tenez pas par la poignée. b. Faites glisser complètement l'unité de disques dans l'unité centrale ou l'unité d'extension, puis verrouillez l'unité dans son emplacement en poussant la poignée (A), comme indiqué dans la figure suivante. 15. Replacez l'unité de stockage Slimline en procédant comme suit : a. Alignez la languette (a) sur le côté de l'unité de stockage. b. Alignez les petits taquets dans les emplacements de l'unité de stockage (B) et insérez l'unité et le taquet en même temps. Voir la figure 4 ou 5 en fonction de la configuration mécanique de votre système. Pour plus d'informations sur le remplacement des différents types d'unités de stockage, voir Retrait d'une unité de stockage SAS sur les modèles 31E/2B, 31E/1C ou 31E/2C. Figure 19. Installation des unités de disque du fond de panier 32 Activation SAS (Serial-attached SCSI) RAID 16. Remplacez le capot d'accès, comme décrit dans Installation du capot d'accès sur un système 31E/2B, 31E/1C, 31E/2C monté en armoire. Figure 20. Installation de l'unité de stockage SAS Figure 21. Installation de l'unité de stockage extra-plate Retrait et remise en place des adaptateurs et batteries SAS RAID 33 17. Mettez le système en position de fonctionnement, comme indiqué dans Mise en position de fonctionnement d'un système 31E/2B, 31E/1C, 31E/2C monté en armoire. 18. Connectez les cordons d'alimentation au système, comme décrit dans Connexion des cordons d'alimentation. 19. Démarrez le système ou la partition logique. Voir Démarrage du système ou de la partition logique. 20. Vérifiez le composant installé. v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Retrait et remplacement du fond de panier système 31E/2B, 31E/1C ou 31E/2C Découvrez comment retirer ou remplacer le fond de panier système. Pourquoi et quand exécuter cette tâche Les procédures suivantes expliquent comment retirer et remplacer le fond de panier système. Retrait du fond de panier système 31E/2B, 31E/1C ou 31E/2C Prenez connaissance des instructions de retrait du fond de panier système d'un serveur en vue du remplacement d'un composant défaillant ou dans le cadre d'une autre procédure de maintenance. Avant de commencer Remarques : v Avant de commencer cette procédure, vérifiez que vous disposez des droits de connexion en tant que fournisseur de services agréé. Les différents niveaux d'autorisation sont décrits dans Niveaux d'autorisation ASMI. v Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour de plus amples informations, voir Voyant du panneau de commande. Pourquoi et quand exécuter cette tâche Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer le fond de panier système. Pour obtenir des instructions, voir Retrait d'un composant à l'aide de la console SDMC. Si votre système est géré par la console SDMC (Systems Director Management Console), utilisez-la pour retirer le fond de panier système. Pour des instructions, voir Retrait d'un composant à l'aide de la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le fond de panier système : Procédure 1. Accédez à l'interface ASMI. L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface 34 Activation SAS (Serial-attached SCSI) RAID ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI, passez à l'étape suivante. 2. Vérifiez et enregistrez le type de boîtier et le code dispositif. Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre placée sur l'unité centrale. Remarque : Le code dispositif du fond de panier système 31E/2B, 31E/1C ou 31E/2C est 78AB-001. 3. Vérifiez et notez les paramètres du serveur. Avant de remplacer cette unité remplaçable sur site (FRU), vérifiez et enregistrez tous les paramètres du serveur étant donné que vous devrez peut-être mettre à jour le système après avoir remplacé l'unité FRU. Cela peut être effectué via l'interface ASMI (Advanced System Management Interface). a. Recherchez et notez le niveau de microprogramme du serveur indiqué sur le bord droit de la fenêtre d'état, la zone dans laquelle se trouve le bouton Déconnexion. Vous devrez peut-être mettre à jour le système avec le niveau de code de microprogramme de serveur le plus récent après avoir remplacé l'unité FRU. b. Vérifiez et notez les paramètres du processeur de service que vous avez précédemment définis à l'aide de l'interface ASMI. Vous devrez réinitialiser les paramètres du processeur de service après la mise à jour de cette unité FRU, faute de quoi les paramètres par défaut seront utilisés. Vous devez noter les paramètres suivants : v Paramètre de nom de système v Paramètres de contrôle de l'alimentation système v Paramètres de l'aide à la maintenance système v Paramètres de configuration système v Paramètres des services réseau v Paramètres des options de performances v Paramètres de profil de connexion v Paramètres de liste des unités d'amorçage v Paramètres de carte Ethernet hôte c. Notez tous les paramètres du processeur de service que vous avez éventuellement définis à l'aide des commandes du système d'exploitation. 4. Arrêtez les partitions et mettez le système hors tension. L'entretien de cette unité remplaçable sur site ne peut pas s'effectuer en simultané. Chaque partition doit être arrêtée et le système mis hors tension pour poursuivre la réparation. Le client doit donner l'autorisation d'arrêter toutes les partitions en cours d'exécution et de mettre le système hors tension. Si le client n'autorise pas l'arrêt des partitions ou la mise hors tension du système, la procédure de maintenance sera retardée. Si le client utilise une secondaire pour gérer le système, assurez-vous qu'il ne met pas le système sous tension à ce moment là. Pour obtenir des instructions, voir Arrêt du système ou de la partition logique. 5. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système est équipé en option d'un second bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) Retrait et remise en place des adaptateurs et batteries SAS RAID 35 ou 6. Débranchez les câbles externes du système. Etiquetez et débranchez les câbles externes connectés au système dont vous effectuez la maintenance. 7. Placez le système en position de maintenance, comme indiqué dans la rubrique Mise en position de maintenance d'un système 31E/2B, 31E/1C ou 31E/2C monté en armoire. 8. Retirez le capot d'accès comme indiqué dans la rubrique Retrait du capot d'accès d'un système 31E/2B, 31E/1C ou 31E/2C monté en armoire. 9. Retirez les modules de mémoire. Pour obtenir des instructions, voir Retrait du module de mémoire 31E/2B, 31E/1C ou 31E/2C. 10. Retirez les adaptateurs GX. Pour obtenir des instructions, voir Retrait des cartes GX sur 31E/2B, 31E/1C ou 31E/2C. 11. Retirez les cartes PCI. Pour obtenir des instructions, voir Retrait hors tension d'une carte PCI sur le serveur 31E/2B, 31E/1C ou 31E/2C. 12. Retirez la carte Ethernet hôte (31E/2B). Pour obtenir des instructions, voir Retrait hors tension de la carte Ethernet hôte 31E/2B. 13. Retirez le régulateur de tension. Pour obtenir des instructions, voir Retrait du régulateur de tension 31E/2B, 31E/1C ou 31E/2C. 14. Retirez les ventilateurs. Pour obtenir des instructions, voir Retrait des ventilateurs 31E/2B, 31E/1C ou 31E/2C. 15. Retirez les blocs d'alimentation. Pour obtenir des instructions, voir Retrait hors tension des blocs d'alimentation 31E/2B, 31E/1C ou 31E/2C. 16. Débranchez tous les câbles internes du fond de panier système. Exemples de câbles à débrancher : a. Débranchez le câble du panneau de commande (A). 36 Activation SAS (Serial-attached SCSI) RAID b. Débranchez le câble SAS (A). Figure 22. Débranchement du câble du panneau de commande Figure 23. Débranchement du câble SAS Retrait et remise en place des adaptateurs et batteries SAS RAID 37 c. Débranchez les câbles de l'unité de stockage (A). d. Retirez le câble SPCN à l'arrière du châssis (31E/2C). 17. Retirez le boîtier de l'unité de disque : a. Saisissez les loquets de l'unité de disque et tirez cette dernière vers le haut pour déverrouiller le boîtier de l'unité de disque. b. Saisissez les poignées, puis tirez le boîtier d'unité de disque vers vous. Figure 24. Débranchement des câbles de l'unité de stockage 38 Activation SAS (Serial-attached SCSI) RAID 18. Retirez le régulateur de tension et le répartiteur d'air. Figure 25. Retrait du boîtier d'unité de disque Figure 26. Retrait du régulateur de tension et du répartiteur d'air (31E/2B) Retrait et remise en place des adaptateurs et batteries SAS RAID 39 19. Retirez le logement de barrette DIMM : a. Retirez les quatre vis qui maintiennent le boîtier de barrette DIMM (A). b. Retirez le boîtier de barrette DIMM en le soulevant vers le haut. Figure 27. Retrait du régulateur de tension et du répartiteur d'air (31E/1C ou 31E/2C) 40 Activation SAS (Serial-attached SCSI) RAID 20. Retirez le câble du ventilateur avant. 21. Retirez le boîtier de ventilation : Figure 28. Retrait du boîtier de barrette DIMM Figure 29. Retrait du câble du ventilateur avant Retrait et remise en place des adaptateurs et batteries SAS RAID 41 a. Desserrez les deux vis (A) qui maintiennent le boîtier de ventilation. b. Retirez la vis (B) qui maintient le boîtier de ventilation à l'extérieur du boîtier du système. c. Retirez le boîtier de ventilation en le soulevant vers le haut. 22. Retirez le commutateur de verrouillage d'alimentation. Pour obtenir des instructions, voir Retrait hors tension du commutateur de verrouillage d'alimentation 31E/2B, 31E/1C ou 31E/2C. 23. Retirez la carte de données techniques essentielles. Pour obtenir des instructions, voir Retrait de la carte de données techniques essentielles 31E/2B, 31E/1C ou 31E/2C. 24. Retirez l'interposeur de stockage. Pour obtenir des instructions, voir Retrait hors tension de l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C. 25. Retirez le second régulateur de tension, le cas échéant. Pour obtenir des instructions, voir Retrait du régulateur de tension 31E/2B, 31E/1C ou 31E/2C. 26. Retirez le support métallique du second régulateur de tension : a. Retirez les deux vis qui maintiennent le support au châssis. b. Soulevez le support du châssis. 27. Retirez les modules du processeur système. Pour obtenir des instructions, voir Retrait du module processeur système 31E/2B, 31E/1C ou 31E/2C. 28. Retirez le fond de panier système. Remarque : Retirez le fond du panier système avec précaution. Ne le soulevez pas à l'aide des modules fixés. a. Retirez les vis (A). Remarque : Ne retirez aucune des vis Torx. Retirez uniquement les cinq vis à tête hexagonale et à fente. Figure 30. Retrait du boîtier de ventilation 42 Activation SAS (Serial-attached SCSI) RAID b. Saisissez le fond de panier système par les deux bras de chargement du dissipateur thermique (B). Figure 31. Retrait du fond de panier système (cinq vis pour 31E/2B) Figure 32. Retrait du fond de panier système (quatre vis pour 31E/1C ou 31E/2C) Retrait et remise en place des adaptateurs et batteries SAS RAID 43 c. Soulevez l'avant du fond de panier système pour dégager les chevilles de positionnement (6,35 mm environ). d. Faites coulisser le fond de panier vers l'avant du système de façon à ce que ses connecteurs laissent apparaître le châssis arrière. e. Une fois les connecteurs dégagés du châssis, soulevez l'arrière du fond de panier système et retirez ce dernier en l'inclinant. Figure 33. Retrait du fond de panier système (31E/2B) 44 Activation SAS (Serial-attached SCSI) RAID Remplacement du fond de panier système 31E/2B, 31E/1C ou 31E/2C Informations sur le remplacement du fond de panier système dans le cadre d'une opération de maintenance. Pourquoi et quand exécuter cette tâche Avant d'effectuer cette procédure, vous devez avoir effectué le retrait. Pour connaître les instructions de la procédure de retrait, voir Retrait du fond de panier système 31E/2B, 31E/1C ou 31E/2C. Si votre système est géré par la console HMC (Hardware Management Console), utilisez-la pour remplacer le fond de panier système. Pour plus d'informations, voir la rubrique Remplacement d'un composant à l'aide de la console HMC. Si votre système est géré par la console Systems Director Management Console (SDMC), utilisez-la pour remplacer le fond de panier système. Pour obtenir des instructions, voir Remplacement d'un composant à l'aide de la console SDMC. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour remplacer le fond de panier système : Figure 34. Retrait du fond de panier système (31E/1C ou 31E/2C) Retrait et remise en place des adaptateurs et batteries SAS RAID 45 Avertissement : Attachez un bracelet antistatique à une surface métallique non peinte de votre matériel afin d'empêcher d'éventuelles décharges électrostatiques d'endommager celui-ci. Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer le produit de son emballage antistatique et d'installer ou de remettre en place le matériel. Procédure 1. Remplacez le fond de panier système. Remarque : Remplacez le fond de panier système avec précaution. a. Saisissez avec précaution le fond de panier système par les deux bras de chargement du dissipateur thermique (A). b. Abaissez le fond de panier système vers l'avant du boîtier du système. c. Abaissez l'arrière du fond de panier système (1), puis faites-le glisser vers l'arrière du système jusqu'à ce que les connecteurs laissent apparaître les ouvertures dans le boîtier. d. Abaissez l'avant du fond de panier système (2) sur les broches d'alignement (b). Figure 35. Remplacement du fond de panier système (31E/2B) 46 Activation SAS (Serial-attached SCSI) RAID e. Serrez les vis (C) qui maintiennent le fond de panier système au boîtier du système. Figure 36. Remplacement du fond de panier système (31E/1C ou 31E/2C) Figure 37. Remplacement du fond de panier système (cinq vis pour 31E/2B) Retrait et remise en place des adaptateurs et batteries SAS RAID 47 f. Rebranchez tous les câbles Ethernet qui ont été précédemment retirés. 2. Remettez en place les modules du processeur système. Pour obtenir des instructions, voir Remplacement du module processeur système 31E/2B, 31E/1C ou 31E/2C. 3. Remplacez le support métallique du second régulateur de tension : a. Alignez le support sur les broches du châssis b. Replacez les deux vis pour maintenir le support au châssis. 4. Remplacez le second régulateur de tension, le cas échéant. Pour obtenir des instructions, voir Remplacement du régulateur de tension 31E/2B, 31E/1C ou 31E/2C. 5. Remplacez le commutateur de verrouillage d'alimentation. Pour obtenir des instructions, voir Remplacement hors tension du commutateur de verrouillage d'alimentation 31E/2B, 31E/1C ou 31E/2C. 6. Remplacez l'interposeur de stockage. Pour obtenir des instructions, voir Remplacement hors tension de l'interposeur de stockage 31E/2B, 31E/1C ou 31E/2C. 7. Remplacez la carte de données techniques essentielles. Pour obtenir des instructions, voir Remplacement de la carte de données techniques essentielles 31E/2B, 31E/1C ou 31E/2C. 8. Remplacez le boîtier de ventilation : a. Faites glisser le boîtier vers le bas afin que les trois encoches (A) et (C) s'adaptent sur les goupilles de montage (B) et (D). b. Serrez la vis (F) pour fixer au côté du châssis. c. Serrez les deux vis captives (E) pour fixer au fond du châssis. Figure 38. Remplacement du fond de panier système (quatre vis pour 31E/1C ou 31E/2C) 48 Activation SAS (Serial-attached SCSI) RAID 9. Connectez le câble du ventilateur (A) en l'enfonçant dans le connecteur correspondant. 10. Remplacez le logement de barrette DIMM : Figure 39. Remplacement du boîtier de ventilation Figure 40. Connexion du câble du ventilateur Retrait et remise en place des adaptateurs et batteries SAS RAID 49 a. Positionnez le boîtier de la barrette DIMM en le baissant et en vous assurant que sa marque d'alignement (A) et le guide 'alignement (B) sont alignés. b. Insérez et serrez les six vis (C) qui maintiennent le boîtier de barrette DIMM. 11. Remplacez le régulateur de tension et le répartiteur d'air. a. Abaissez le régulateur de tension (a) pour le positionner et serrez les trois vis (B) qui le maintiennent en place. b. Abaissez le répartiteur d'air (C) pour le positionner, puis appuyez fermement jusqu'à ce qu'il se mette en place. Figure 41. Remplacement du boîtier de barrette DIMM 50 Activation SAS (Serial-attached SCSI) RAID 12. Remplacez le boîtier de l'unité de disque : Figure 42. Remplacement du répartiteur d'air du régulateur de tension (31E/2B) Figure 43. Remplacement du répartiteur d'air du régulateur de tension (31E/1C ou 31E/2C) Retrait et remise en place des adaptateurs et batteries SAS RAID 51 a. Les loquets de l'unité de disque étant ouverts, poussez le boîtier de l'unité de disque à l'intérieur du système. b. Poussez les loquets de l'unité de disque vers le système jusqu'à ce qu'ils soient verrouillés. 13. Connectez les câbles internes au fond de panier : a. Connectez les câbles de l'unité de stockage. Connectez les câbles de l'unité de stockage, le cas échéant, en appuyant fermement sur les fiches du câble pour les insérer dans les prises correspondantes (A). Figure 44. Remplacement du boîtier de l'unité de disque 52 Activation SAS (Serial-attached SCSI) RAID b. Connectez le câble SAS, le cas échéant, en appuyant fermement sur sa fiche (A) pour l'insérer dans la prise correspondante. Figure 45. Connexion des câbles de l'unité de stockage Figure 46. Connexion du câble SAS Retrait et remise en place des adaptateurs et batteries SAS RAID 53 c. Connectez le câble du panneau de commande en appuyant fermement sur ses fiches pour les insérer dans les prises correspondantes (A). d. Replacez le câble SPCN à l'arrière du châssis (31E/2C). 14. Remplacez les blocs d'alimentation. Pour obtenir des instructions, voir Remplacement hors tension des blocs d'alimentation 31E/2B, 31E/1C ou 31E/2C ou Remplacement sous tension des blocs d'alimentation 31E/2B, 31E/1C ou 31E/2C. 15. Remplacez les ventilateurs. Pour obtenir des instructions, voir Remplacement des ventilateurs 31E/2B, 31E/1C ou 31E/2C. 16. Remplacez les modules de mémoire. Pour obtenir des instructions, voir Remplacement du module de mémoire 31E/2B, 31E/1C ou 31E/2C. 17. Remplacez le régulateur de tension. Pour obtenir des instructions, voir Remplacement du régulateur de tension31E/2B, 31E/1C ou 31E/2C. 18. Remplacez la carte Ethernet hôte (31E/2B). Pour obtenir des instructions, voir Remplacement hors tension de la carte Ethernet hôte du modèle 31E/2B. 19. Remplacez les cartes GX. Pour obtenir des instructions, voir Remplacement des cartes GX pour 31E/2B, 31E/1C ou 31E/2C. 20. Remplacez les cartes PCI. Pour obtenir des instructions, voir Remplacement hors tension d'une carte PCI sur le serveur 31E/2B, 31E/1C ou 31E/2C. 21. Remplacez le capot d'accès comme indiqué dans la rubrique Installation du capot d'accès d'un système 31E/2B, 31E/1C ou 31E/2C monté en armoire. 22. Placez le système en position de fonctionnement, comme indiqué dans la rubrique Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire. 23. Rebranchez tous les cordons d'alimentation et d'interface. 24. Démarrez le système ou la partition logique. Voir Démarrage du système ou de la partition logique. 25. Restaurez la connectivité réseau. Figure 47. Connexion du câble du panneau de commande 54 Activation SAS (Serial-attached SCSI) RAID Le processeur de service doit être connecté au réseau pour être accessible à partir des console de gestion. Comment l'adresse IP du processeur de service sera-t-elle gérée ? Obtenez automatiquement une adresse IP à partir d'un serveur DHCP (Dynamic Host Configuration Protocol) Passez à l'étape suivante. Configurez une adresse IP statique Passez à l'étape 27. 26. Restaurez les paramètres du processeur de service. Activez l'accès réseau sur le processeur de service. Si la connexion réseau utilise le protocole DHCP (Dynamic Host Configuration Protocol) pour établir une adresse IP, et utilise les Consoles de gestion comme serveur DHCP, aucune configuration supplémentaire n'est nécessaire pour activer l'accès réseau sur le processeur de service. Procédez comme suit : a. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant que le câble console de gestion est connecté au port console de gestion du fond de panier système. b. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises. Remarque : ne démarrez pas le système à ce stade. Pouvez-vous accéder au réseau de maintenance ? Oui Non Passez à l'étape 29, à la page 56. Passez à l'étape 32, à la page 56. 27. Restaurez les paramètres du processeur de service. L'interface ASMI (Advanced System Management Interface) est l'interface d'accès au processeur de service qui est requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du produit, la configuration du processeur de service, le contrôle de l'alimentation système et la configuration des ports réseau du processeur de service. L'interface ASMI est également désignée comme l'ensemble des menus du processeur de service. L'interface ASMI est accessible via HTTPS. La connexion réseau utilisant des affectations d'adresses IP statiques, procédez comme suit. a. Pour identifier les ports réseau du processeur de service, vous pouvez utiliser le panneau de commande en mode manuel. Reportez-vous aux fonctions de maintenance pour savoir comment utiliser le panneau de commande dans ce cas. b. Connectez un client disposant d'un navigateur Web directement aux ports réseau du processeur de service que vous avez notés précédemment. Il peut s'agir d'une URL similaire aux exemples suivants : v https://169.254.2.147 v https://169.254.3.147 c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises. Remarque : ne démarrez pas le système à ce stade. d. Connectez-vous à l'interface ASMI à l'aide de l'ID utilisateur admin et du mot de passe par défaut admin. e. A l'invite, modifiez le mot de passe de l'ID administrateur et le mot de passe de l'ID utilisateur général. f. Pour configurer l'accès réseau, procédez comme suit. 1) Cliquez sur Configuration de réseau sous le noeud Services réseau. 2) Dans le panneau de droite, configurez les interfaces réseau eth0 et eth1, sélectionnez Statique comme Type d'adresse IP, puis définissez un nom d'hôte, une adresse IP et un masque de sous-réseau. Retrait et remise en place des adaptateurs et batteries SAS RAID 55 28. Restaurez les paramètres du processeur de service. La connexion réseau étant maintenant configurée pour utiliser des affectations d'adresses IP statiques, essayez d'accéder aux ports réseau du processeur de service. Pouvez-vous accéder au réseau de maintenance ? Oui Non Passez à l'étape suivante. Passez à l'étape 33, à la page 57. 29. Lancez l'interface ASMI. L'interface ASMI (Advanced System Management Interface) est l'interface du processeur de service requise pour effectuer des tâches de maintenance générales et de niveau administrateur, telles que la lecture des journaux d'erreurs du processeur de service, la lecture des données essentielles du produit, la configuration du processeur de service et le contrôle de l'alimentation système. L'interface ASMI est également désignée comme l'ensemble des menus du processeur de service. Pour accéder à l'interface ASMI, voir Accès à l'interface ASMI. Une fois que vous avez accédé à l'interface ASMI, passez à l'étape suivante. 30. Définissez le type de boîtier et le code dispositif. Le numéro de série du boîtier est un numéro à sept chiffres imprimé sur une étiquette code barre placée sur l'unité centrale. Lorsque le fond de panier système est remplacé, le numéro de série à sept chiffres du boîtier et le code dispositif doivent être entrés manuellement et stockés dans le nouveau fond de panier système à l'aide de l'interface ASMI (Advanced System Management Interface). Remarque : Le code dispositif du fond de panier système 31E/2B, 31E/1C ou 31E/2C est 78AB-001. Remarque : Pour effectuer cette opération, vous devez disposer du niveau d'autorisation Prestataire de services agréé (ID : celogin). Pour entrer le numéro de série du système : a. Dans la fenêtre d'accueil de l'interface ASMI, indiquez votre ID utilisateur et votre mot de passe, et cliquez sur Connexion. b. Dans la zone de navigation, développez Configuration système et Données techniques essentielles du programme. c. Sélectionnez Boîtiers du système. Dans le panneau de droite, les boîtiers du système actuel s'affichent. d. Sur le système concerné, enregistrez le numéro de série du boîtier imprimé l'étiquette code barre apposée sur l'unité centrale. e. Entrez le numéro de série du boîtier enregistré dans la zone Numéro de série du boîtier de l'unité centrale. f. Entrez le numéro du code dispositif dans la zone Code dispositif/Numéro de séquence. g. Cliquez sur Sauvegarder les paramètres pour mettre à jour et enregistrer le numéro de série du boîtier et le code dispositif. 31. Passez à l'étape 35, à la page 57. 32. Restaurez les paramètres du processeur de service. Si la connexion réseau utilise DHCP pour établir une adresse IP, mais n'utilise pas de en tant que serveur DHCP, procédez comme suit : a. Effectuez toutes les configurations de réseau requises pour permettre au serveur DHCP d'identifier une adresse IP et de l'affecter au processeur de service. b. Assurez-vous que le processeur de service est connecté au réseau de service existant en vérifiant que le câble réseau est connecté au port réseau. c. Le cas échéant, connectez tous les câbles d'alimentation système en les branchant dans les prises. Remarque : Ne démarrez pas le système à ce stade. 56 Activation SAS (Serial-attached SCSI) RAID Pouvez-vous accéder au réseau de maintenance ? Oui Non Accédez à 29, à la page 56 Passez à l'étape suivante. 33. Restaurez les paramètres du processeur de service. Vous avez indiqué que vous n'êtes toujours pas en mesure d'accéder au réseau de service. Vous devrez réinitialiser le processeur de service en appuyant sur le commutateur (E) du panneau de commande. 34. Passez à l'étape 25, à la page 54. 35. Restaurez les paramètres du processeur de service. Modifiez le mot de passe de l'ID utilisateur d'administrateur lorsque vous y êtes invité en procédant de la manière suivante. a. Dans la zone de navigation de l'interface ASMI, développez Profil de connexion. b. Sélectionnez Modifier le mot de passe sous Profil de connexion. Figure 48. Commutateur du panneau de commande Retrait et remise en place des adaptateurs et batteries SAS RAID 57 c. Indiquez les informations requises, puis cliquez sur Continuer. Par mesure de sécurité, vous êtes invité à entrer votre mot de passe d'utilisateur en cours dans le champ Mot de passe en cours pour l'ID utilisateur en cours. Remarque : Les mots de passe peuvent être une combinaison quelconque de 64 caractères alphanumériques au maximum. Le mot de passe par défaut pour l'ID utilisateur d'administrateur est admin. Après votre première connexion à l'interface ASMI, le mot de passe admin doit être modifié. Par mesure de sécurité, vous êtes invité à entrer le mot de passe de votre ID utilisateur en cours dans le champ Mot de passe en cours de l'utilisateur en cours. 36. Restaurez les paramètres du processeur de service. Définissez le nom du système. Le nom du système peut être modifié pour toute chaîne ASCII autorisée. Il ne doit pas nécessairement correspondre au format numéro de série_modèle_type de machine. Pour modifier le nom du système, procédez comme suit. a. Dans la zone de navigation de l'interface, développez Configuration système. b. Sélectionnez Nom du système sous Configuration système. c. Entrez le nom souhaité. d. Cliquez sur Sauvegarder les paramètres pour mettre à jour le nom du système avec la nouvelle valeur. Le nouveau nom du système s'affiche dans le cadre d'état, zone dans laquelle le bouton Déconnexion se trouve. Si une autre méthode, telle que la console de gestion est utilisée pour modifier le nom du système, le cadre d'état ne reflète pas la modification. 37. Restaurez les paramètres du processeur de service. Pour définir l'heure, procédez comme suit. a. Dans la zone de navigation de l'interface, développez Configuration système. b. Sélectionnez Heure locale sous Configuration système. c. Si le système est hors tension, le volet droit affiche un formulaire qui indique la date (mois, jour et année) et l'heure (heures, minutes, secondes). d. Modifiez la date ou l'heure (ou les deux) et cliquez sur Sauvegarder les paramètres. 38. Restaurez les paramètres du processeur de service. Vérifiez que le niveau du microprogramme de serveur est identique à la valeur enregistrée avant le remplacement du fond de panier système. Si la valeur est différente, elle doit être corrigée. Pour plus d'informations sur la correction du microprogramme, voir Obtention des correctifs de microprogramme. 39. Restaurez les paramètres du processeur de service. Entrez à nouveau les paramètres suivants qui ont été précédemment modifiés via l'interface ASMI (Advanced System Management Interface), sauf si vous souhaitez utiliser les paramètres par défaut. v Paramètre de nom de système v Paramètres de contrôle de l'alimentation système v Paramètres de l'aide à la maintenance système v Paramètres de configuration système v Paramètres des services réseau v Paramètres des options de performances v Paramètres de profil de connexion v Paramètres de liste des unités d'amorçage v Paramètres de carte Ethernet hôte 40. Restaurez les paramètres du processeur de service. 58 Activation SAS (Serial-attached SCSI) RAID a. Redéfinissez les paramètres du processeur de service que vous aviez éventuellement définis via les commandes du système d'exploitation. Vous avez enregistré ces paramètres au début de cette procédure. b. Si vous choisissez de réinitialiser le mot de passe d'accès aux Consoles de gestion, procédez comme suit : Pour modifier le mot de passe d'accès à la console HMC, choisissez l'une des méthodes suivantes ; 1) A partir de l'interface graphique de la console HMC (méthode recommandée) : a) Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation. b) Cliquez deux fois sur le dossier Serveurs. c) Sélectionnez un serveur dans le panneau central. d) Sous le dossier Opérations, sélectionnez Modifier le mot de passe. e) Indiquez les informations souhaitées et cliquez sur OK. 2) Sur la ligne de commande de la console HMC, saisissez : /usr/hmcrbin/chsyspwd -m système_géré -t access --passwd --newpasswd nouveau_mot_de_passe où : v La valeur de système_géré correspond au nom du système géré du nouveau processeur de service. v Aucune valeur n'est indiquée pour passwd-- de façon à permettre l'authentification. v La valeur de newpasswd est celle de nouveau_mot_de_passe. Pour modifier le mot de passe d'accès à la console SDMC, procédez comme suit : 1) Allez dans l'onglet Resources et cliquez sur Hosts. 2) Sélectionnez un serveur dans le panneau de droite. 3) Dans le menu Actions, cliquez sur Operations > Change Password. 4) Indiquez les informations souhaitées et cliquez sur OK. 41. Récupérez les données de partition sur le processeur de service. Les données de profil stockées sur le serveur géré ont été supprimées ou endommagées. Pour restaurer les données de profil à l'aide de la console HMC, procédez comme suit. a. Développez le dossier Gestion de systèmes dans le volet de l'arborescence de navigation. b. Cliquez deux fois sur le dossier Serveurs. c. Dans le panneau central, sélectionnez un serveur en cochant la case correspondante. d. Dans le dossier Configuration, si vous avez accès au dossier Gestion des données de partition, sélectionnez Restaurer. Pour restaurer les données de profil à l'aide de la console SDMC, procédez comme suit : a. Allez dans l'onglet Resources et cliquez sur Hosts. b. Sélectionnez un serveur dans le panneau de droite. c. Dans le menu Actions, cliquez sur System Configuration > Manage Partition Data > Restore. d. Sélectionnez l'option de restauration souhaitée et cliquez sur OK. 42. Mettez à jour les données de produit essentielles. Si votre système exécute et n'est pas géré par les Consoles de gestion, vous devez mettre à jour la source de chargement après avoir remplacé le fond de panier système. Pour mettre à jour la source de chargement sans l'aide des Consoles de gestion, procédez comme suit : a. Chargez le support optique du microcode sous licence I_BASE_01 dans l'unité qui est définie pour le système et accédez-y pour effectuer un IPL (procédure de chargement initial). Remarque : Cet IPL peut prendre plusieurs minutes. Retrait et remise en place des adaptateurs et batteries SAS RAID 59 b. Une fois l'écran de sélection de groupe de langues affiché, appuyez sur la touche Entrée pour sélectionner la langue par défaut 2924 (anglais). Remarque : Pour modifier la langue, tapez celle qui apparaît sur le support du système d'exploitation, puis appuyez sur la touche Entrée. Confirmez la langue en appuyant sur la touche Entrée. c. A partir de l'écran d'installation du microcode sous licence, sélectionnez l'option 2 (utilisation des outils de maintenance en mode dédié) et appuyez sur la touche Entrée. Utilisez les outils de maintenance en mode dédié pour vous assurer que les unités de disque, y compris la source de chargement, génèrent correctement des rapports et qu'il n'y a pas d'erreurs susceptibles d'empêcher l'exécution de l'IPL à partir du disque. Lorsque toutes les erreurs ont été résolues et que le disque source de chargement génère correctement des rapports, passez à l'étape suivante. d. Dans l'écran d'utilisation des outils de maintenance en mode dédié (DST), sélectionnez l'option 2 (utilisation des unités de disque) et appuyez sur la touche Entrée. e. Sélectionnez l'option 8 (mise à jour des données produit essentielles du système) pour identifier l'emplacement de l'unité de disque source de chargement. Appuyez sur Entrée. L'écran de mise à jour des données produit essentielles du système s'affiche. f. Appuyez sur la touche Entrée pour confirmer que vous souhaitez que les données produit essentielles soient enregistrées. g. Appuyez deux fois sur F12 (Précédent). h. Tapez 1 et appuyez sur la touche Entrée pour quitter l'écran des outils de maintenance en mode dédié. Utilisez le panneau de configuration système pour retarder l'arrêt du système. Passez le mode IPL sur B et démarrez le système pour effectuer un IPL à partir du disque. 43. Restaurez les paramètres de liste d'amorçage et de carte Ethernet hôte. La liste des unités d'amorçage n'est pas présente sur un nouveau fond de panier système. Par conséquent, il est nécessaire de créer les unités d'amorçage en démarrant les menus des services SMS. Voir Sélection des options d'amorçage pour obtenir des informations sur la configuration de la liste des unités d'amorçage. Une fois le système d'exploitation démarré, vous pouvez utiliser la Commande bootlist pour vérifier la liste des unités d'amorçage d'un système ou d'une partition, ou y apporter des modifications supplémentaires. Pour restaurer les paramètres de la carte Ethernet hôte, voir Configuration de ports physiques sur une carte Ethernet hôte. 44. Vérifiez la réparation. Pour plus d'informations, voir Vérification d'une réparation. Considérations relatives à la maintenant du sous-système SAS 31E/2B Il existe plusieurs éléments à consulter avant d'effectuer la maintenance des fonctions au sein du sous-système SAS pour le système 31E/2B. Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS si le voyant des données en cache sur la carte de P1-C18 ou P1-C13 clignote. Pour voir ce voyant, vous devez arrêter le système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche sur le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la perte de données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des données en cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant du sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batteries de cache lorsque Yes est affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour les systèmes Linux et IBM® i) ou sur l'écran de statut de la commande (pour les systèmes AIX). 60 Activation SAS (Serial-attached SCSI) RAID Avant d'effectuer la maintenance, vérifiez les configurations d'E-S de chaque partition logique sur le système. Il est important de comprendre les différences entre les diverses configurations et les considérations qui en découlent en matière de maintenance. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations d'adaptateur d'E-S v Configuration des grappes de disques RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Considérations de maintenance pour le bloc de batterie de cache Pour éviter la perte de données, suivez les procédures appropriées avant de remplacer le bloc de batterie de cache sur la carte Cache RAID. Remarque : La maintenance simultanée de la batterie de l'adaptateur n'est pas prise en charge. Même en cas de batteries défaillantes ou manquantes, la carte associée à la batterie doit être retirée en premier. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS et l'accessibilité des données du système. Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS si le voyant des données en cache sur la carte de P1-C18 ou P1-C13 clignote. Pour voir ce voyant, vous devez arrêter le système et retirer le capot de service. Une icône d'interdiction de toucher (une main barrée) s'affiche sur le voyant, indiquant la présence possible de données dans le cache de l'adaptateur. Pour éviter la perte de données, assurez-vous tout d'abord que le système a été arrêté correctement. Si le voyant des données en cache clignote, reportez-vous à l'une des rubriques suivantes avant de retirer tout composant du sous-système SAS. Vous pouvez remplacer en toute sécurité le bloc de batteries de cache lorsque Yes est affiché en regard de Battery pack can be safely replaced sur l'écran d'information de batterie (pour les systèmes Linux et IBM i) ou sur l'écran de statut de la commande (pour les systèmes AIX). La figure suivante indique l'emplacement du voyant de présence de données en cache dans les sous-systèmes SAS 31E/2B. Retrait et remise en place des adaptateurs et batteries SAS RAID 61 Le tableau suivant décrit les états du voyant de présence des données en cache et les éléments à prendre en considération avant de tenter une opération de maintenance. Tableau 1. Indications du voyant et actions Carte de batterie de cache (Un-P1-C13) Contrôleur de stockage RAID/cache (Un-P1-C18) Indication Action désactivée désactivée La mémoire cache n'est pas active. Les actions de maintenance peuvent s'effectuer. désactivée clignote La mémoire cache est active sur C18. La mémoire cache peut avoir été vidée sur C13 en cas de défaillance de C18. Essayez de mettre le système sous tension et de l'éteindre correctement avant d'effectuer toute action de maintenance. clignote désactivée La mémoire cache est active sur C13. La mémoire cache peut avoir été vidée sur C18 en cas de défaillance de C13. Essayez de mettre le système sous tension et de l'éteindre correctement avant d'effectuer toute action de maintenance. clignote clignote La mémoire cache est active sur les contrôleurs doubles. Essayez de mettre le système sous tension et de l'éteindre correctement avant d'effectuer toute action de maintenance. Si cela ne parvient pas à effacer les voyants de cache actifs, contactez le support. Voir Contacter le support et la maintenance IBM. Figure 49. Voyant des données en cache 62 Activation SAS (Serial-attached SCSI) RAID Retrait de la carte d'activation RAID/cache31E/2B Utilisez cette procédure pour retirer des cartes d'activation RAID (Redundant Array of Independent Disks) et cache avant d'installer une carte d'activation nouvelle ou mise à niveau. Pourquoi et quand exécuter cette tâche Si le système est géré par la console HMC (Hardware Management Console), utilisez-la pour retirer la carte du serveur. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 103. Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour retirer la carte du serveur. Pour des instructions, voir «Retrait d'un composant via la console SDMC», à la page 118. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer la carte d'activation : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93). 3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96. 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 86. 6. Débranchez le système pour l'isoler de la source d'alimentation. 7. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Identifiez les cartes d'activation RAID et cache à retirer. Remarque : Si une unité de bande en option est installée, elle doit être déplacée pour pouvoir accéder à la carte d'activation RAID. 9. Vérifiez le voyant de validité des données de cache situé sur la carte de batterie de cache de l'autre côté du système. Remarque : La carte de batterie alimente les données de cache placées sur la carte RAID. Si le voyant clignote, vous risquez de perdre des données en cache lors du retrait de cette carte. 10. Appuyez sur le loquet et faites glisser la carte en dehors du système, comme illustré. Retrait et remise en place des adaptateurs et batteries SAS RAID 63 11. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97. 12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99. 13. Rebranchez la source d'alimentation. 14. Démarrez le système, comme décrit dans Mise sous tension. 15. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Que faire ensuite Remplacez les cartes d'activation RAID et cache. Pour obtenir des instructions, voir «Remplacement de la carte d'activation RAID/cache31E/2B». Remplacement de la carte d'activation RAID/cache31E/2B Utilisez cette procédure pour remplacer des cartes d'activation RAID (Redundant Array of Independent Disks) et cache. Pourquoi et quand exécuter cette tâche Vous devez avoir exécuté la procédure «Retrait de la carte d'activation RAID/cache31E/2B», à la page 63. Pour remplacer une carte d'activation RAID/cache, procédez comme suit : Figure 50. Retrait de cartes d'activation RAID et cache d'une unité centrale montée en armoire 64 Activation SAS (Serial-attached SCSI) RAID Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93). 3. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 4. Retirez le capot d'accès, comme indiqué à la rubrique «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96. 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 86. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) ou 7. Attachez le bracelet antistatique. Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Identifiez l'emplacement de la carte d'activation RAID/cache Retrait et remise en place des adaptateurs et batteries SAS RAID 65 9. Alignez la carte d'activation RAID/cache sur l'emplacement dans le système. 10. Faites glisser complètement la carte d'activation RAID/cache dans le système jusqu'au verrouillage du taquet (A). 11. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97. 12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99. 13. Rebranchez la source d'alimentation. 14. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 91). 15. Vérifiez le composant installé ou réparé v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C Utilisez cette procédure pour retirer un contrôleur de stockage RAID (Redundant Array of Independent Disks) et cache avant d'installer un contrôleur de stockage nouveau ou mis à niveau. Vous pouvez également utiliser cette procédure pour retirer un contrôleur de stockage RAID-10. Pourquoi et quand exécuter cette tâche Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de commande», à la page 87. Figure 51. Remplacement des cartes d'activation RAID et cache sur une unité centrale montée en armoire 66 Activation SAS (Serial-attached SCSI) RAID Si le système est géré par la console HMC (Hardware Management Console), utilisez la console HMC pour retirer les contrôleurs de stockage RAID et cache du serveur. Pour plus d'informations, voir «Retrait d'un composant via la console HMC», à la page 103. Si le système est géré par la console Systems Director Management Console (SDMC), utilisez SDMC pour retirer les contrôleurs de stockage RAID et cache du serveur. Pour des instructions, voir «Retrait d'un composant via la console SDMC», à la page 118. Si vous ne disposez pas d'une console HMC ou SDMC, procédez comme suit pour retirer le contrôleur de stockage RAID/cache : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93). 3. Mettez le système en position de maintenance, comme décrit dans «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 4. Retirez le capot d'accès, comme indiqué dans «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96. 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 86. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) ou 7. Attachez le bracelet antistatique. Retrait et remise en place des adaptateurs et batteries SAS RAID 67 Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Identifiez les contrôleurs de stockage RAID et cache à retirer. Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS sur votre système si le voyant de la carte dans C18-C18 ou P1-C13 clignote. Ce voyant indique l'écriture de données en cache. Pour éviter la perte de données, voir Affichage des informations relatives aux batteries rechargeables. Vous pouvez remplacer le bloc de batterie de cache sans danger si la mention Yes apparaît en regard du bloc de batterie. Figure 52. Emplacement d'installation de la carte d'activation RAID (P1 à C12), du contrôleur de stockage (P1 à C18) et de la carte de batterie de cache (P1 à C13) pour 31E/2B 68 Activation SAS (Serial-attached SCSI) RAID 9. Appuyez sur le taquet (A) et tirez sur la poignée (B) dans les directions indiquées, puis faites glisser la carte hors du système, comme l'illustre la figure suivante. Figure 53. Emplacement d'installation du contrôleur de stockage (P1 à C18) et de la carte de batterie de cache (P1 à C13) pour 31E/1C ou 31E/2C Figure 54. Retrait de contrôleurs de stockage RAID et cache d'une unité centrale montée en armoire pour 31E/2B Retrait et remise en place des adaptateurs et batteries SAS RAID 69 10. Réinstallez le capot d'accès. Voir «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97 11. Pour un système monté en armoire, mettez le système en position de fonctionnement, comme décrit dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99. 12. Rebranchez la source d'alimentation. 13. Démarrez le système (voir «Démarrage du système ou de la partition logique», à la page 91). 14. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Que faire ensuite Pour remplacer le contrôleur de stockage RAID/cache, voir «Remplacement du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C». Remplacement du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C Utilisez cette procédure pour remplacer des contrôleurs de stockage RAID et cache. Vous pouvez également utiliser cette procédure pour remplacer un contrôleur de stockage RAID-10. Pourquoi et quand exécuter cette tâche Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir «Voyants du panneau de commande», à la page 87. Figure 55. Retrait de contrôleurs de stockage RAID et cache d'une unité centrale montée en armoire pour 31E/1C ou 31E/2C 70 Activation SAS (Serial-attached SCSI) RAID Vous devez avoir exécuté la procédure «Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C», à la page 66. Pour remplacer des contrôleurs de stockage RAID et cache, procédez comme suit : Procédure 1. Effectuez les tâches prérequises, comme décrit dans la rubrique «Avant de commencer», à la page 83. 2. Arrêtez le système (voir «Arrêt d'un système ou d'une partition logique», à la page 93). 3. Mettez le système en position de maintenance, comme décrit dans «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 4. Retirez le capot d'accès, comme indiqué dans «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96. 5. Utilisez les voyants de maintenance pour vous aider à identifier le composant, comme indiqué dans «Identification d'un composant défaillant», à la page 86. 6. Débranchez le système pour l'isoler de la source d'alimentation. Remarque : Ce système peut être équipé d'un deuxième bloc d'alimentation. Avant de poursuivre cette procédure, vérifiez que la source d'alimentation du système est complètement déconnectée. (L003) ou 7. Attachez le bracelet antistatique. Retrait et remise en place des adaptateurs et batteries SAS RAID 71 Avertissement : v Portez un bracelet antistatique relié à une surface métallique non peinte pour protéger le matériel contre les risques de décharge électrostatique. v Si vous portez un bracelet antistatique, respectez toutes les consignes de sécurité relatives aux dangers électriques. Un bracelet antistatique permet de réguler l'électricité statique. Il ne réduit et n'augmente en aucun cas les risques d'électrocution liés à l'utilisation ou à la manipulation d'appareils électriques. v Si vous ne disposez pas d'un bracelet antistatique, touchez une surface métallique non peinte du système pendant au moins 5 secondes avant de déballer un produit de son emballage antistatique, d'installer ou de remplacer du matériel. 8. Identifiez l'emplacement des contrôleurs de stockage RAID et cache. 9. Alignez les contrôleurs de stockage RAID et cache sur l'emplacement dans le système. 10. Faites glisser complètement les contrôleurs de stockage RAID et cache dans le système jusqu'au verrouillage du loquet (A). Figure 56. Remplacement des contrôleurs de stockage RAID/cache pour 31E/2B 72 Activation SAS (Serial-attached SCSI) RAID 11. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97. 12. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99. 13. Rebranchez la source d'alimentation. 14. Démarrez le système comme indiqué dans «Démarrage du système ou de la partition logique», à la page 91. 15. Vérifiez le composant installé : v Si vous avez installé le composant pour toute autre raison, vérifiez le composant installé comme décrit dans Vérification du composant installé. Retrait et installation de la carte de batterie de cache Pour effectuer la maintenance de la carte de batterie de cache, procédez comme suit : 1. Commencez avec le système sous tension. Pour éviter la perte de données, la carte de batterie de cache doit être à l'état d'erreur avant d'être retirée du système. Si votre système est hors tension, rallumez-le avant de continuer. 2. Choisissez votre système d'exploitation : v AIX : Passez à l'étape 3. v Linux : Passez à l'étape 5. v IBM i : Passez à l'étape 7. 3. Sous AIX, mettez la carte de batterie de cache à l'état d'erreur (afin d'éviter d'éventuelles pertes de données) en procédant comme suit : a. Accédez au gestionnaire de grappes de disques SAS IBM, comme indiqué ci-dessous : Figure 57. Remplacement des contrôleurs de stockage RAID/cache pour 31E/1C ou 31E/2C Retrait et remise en place des adaptateurs et batteries SAS RAID 73 1) A l'invite de commande, tapez smit, puis appuyez sur Entrée. 2) Sélectionnez Devices. 3) Sélectionnez Disk Array. 4) Sélectionnez IBM SAS Disk Array. 5) Sélectionnez IBM SAS Disk Array Manager dans le menu comprenant les options de configuration et de gestion du contrôleur IBM SAS RAID. b. Sélectionnez Diagnostics and Recovery Options. c. Sélectionnez l'option Controller Rechargeable Battery Maintenance. d. Sélectionnez l'option Force Controller Rechargeable Battery Error. e. Sélectionnez le contrôleur SAS RAID IBM dont vous souhaitez remplacer la batterie. Cette option met la batterie à l'état d'erreur, ce qui nécessite son remplacement. f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir la procédure Affichage des informations relatives aux batteries rechargeables. Vous pouvez remplacer la carte de batterie de cache sans danger si la mention Yes apparaît en regard de l'option Battery pack can be safely replaced. Vous devrez peut-être sélectionner à plusieurs reprises l'option permettant d'afficher les informations relatives à la batterie rechargeable du contrôleur car plusieurs minutes peuvent s'écouler avant que le remplacement de la carte de batterie de cache puisse s'effectuer sans danger. Avertissement : Ne tentez pas de retirer les composants associés au sous-système SAS sur votre système si le voyant de la carte dans C18-C18 ou P1-C13 clignote. Ce voyant indique l'écriture de données en cache. g. Vérifiez que le voyant de présence de données en cache ne clignote pas. Affichage des informations relatives aux batteries rechargeables : a. Accédez au gestionnaire de grappes de disques SAS IBM en suivant la procédure précédente. b. Sélectionnez Diagnostics and Recovery Options. c. Sélectionnez l'option Controller Rechargeable Battery Maintenance. d. Sélectionnez l'option Displaying Rechargeable Battery Information. e. Sélectionnez le contrôleur IBM SAS RAID. 4. Passez à l'étape 8. 5. Mettez la carte de batterie de cache à l'état d'erreur sous Linux (afin d'éviter d'éventuelles pertes de données) en procédant comme suit : a. Exécutez l'utilitaire iprconfig en saisissant iprconfig. b. Sélectionnez Work with disk unit recovery. c. Sélectionnez Work with resources containing cache battery packs. d. Sélectionnez votre carte et tapez 2. Appuyez ensuite sur Entrée pour mettre la batterie à l'état d'erreur. Remarque : Cette option permet de mettre la batterie à l'état d'erreur, ce qui nécessite son remplacement. e. Si vous êtes sûr de devoir mettre la batterie à l'état d'erreur, tapez c pour confirmer. Si vous ne souhaitez pas mettre la batterie à l'état d'erreur, tapez q pour annuler. f. Déterminez si vous pouvez remplacer la carte de batterie de cache sans risque. Voir Affichage des informations relatives aux batteries rechargeables. g. Vous pouvez remplacer la carte de batterie de cache sans danger si la mention Yes apparaît en regard de l'option Battery pack can be safely replaced. Vous devrez peut-être sélectionner à plusieurs reprises l'option permettant d'afficher les informations relatives à la batterie rechargeable du contrôleur car plusieurs minutes peuvent s'écouler avant que le remplacement de la carte de batterie de cache puisse s'effectuer sans danger. 6. Passez à l'étape 8. 74 Activation SAS (Serial-attached SCSI) RAID 7. Mettez la carte de batteries de cache à l'état d'erreur sous (afin d'éviter d'éventuelles pertes de données) en appliquant la procédure suivante au système ou à la partition logique qui comporte l'adaptateur : a. Vous devez avoir ouvert une session avec au minimum les droits d'accès aux outils de maintenance. b. Tapez strsst sur la ligne de commande et appuyez sur Entrée. c. Sur l'écran System Service Tools (SST) Sign On, tapez votre ID utilisateur et votre mot de passe d'accès aux outils de maintenance. Appuyez sur Entrée. d. Sur l'écran System Service Tools (SST), sélectionnez Start a Service Tool et appuyez sur Entrée. e. Dans l'écran Start a Service Tool, sélectionnez Hardware Service Manager et appuyez sur la touche Entrée. f. Sélectionnez Work with resources containing cache battery packs dans l'écran Hardware Service Manager. g. Sélectionnez Force battery pack into error state pour l'adaptateur d'entrée-sortie que vous utilisez dans l'écran Work with Resources containing cache battery packs, puis appuyez sur Entrée. h. Dans l'écran Force Battery Packs Into Error State, vérifiez que l'adaptateur d'entrée-sortie approprié est sélectionné et appuyez sur la touche de fonction correspondant à votre choix pour confirmer. i. Revenez à l'écran Work with Resources containing cache battery packs et sélectionnez Display battery information. Assurez-vous que la zone Safe to replace cache battery a la valeur Yes. Cette opération peut prendre plusieurs minutes et vous devrez peut-être appuyer sur le bouton d'actualisation pour que la zone soit mise à jour. 8. Placez le système en position de maintenance, comme indiqué dans la section «Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 98. 9. Retirez le capot d'accès de maintenance. Voir «Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 96 10. Vérifiez le voyant de la batterie de cache en examinant le voyant de la carte de batterie de cache. v Si le voyant clignote, passez à l'étape suivante. v Si le voyant ne clignote pas, passez à l'étape 12. Retrait et remise en place des adaptateurs et batteries SAS RAID 75 11. Arrêt du retrait de la carte de batterie de cache. Le voyant clignotant indique que la batterie conserve les données en cache pour les cartes d'activation RAID et cache. Ces données seront perdues si la batterie est retirée à ce stade. Effectuez cette procédure de maintenance sans remplacer la batterie, puis effectuez-la de nouveau pour vous assurer que toutes les données en cache sont enregistrées sur le disque avant de remplacer la batterie. Passez à l'étape 18. 12. Retirez la carte de batterie de cache en procédant comme suit : Avertissement : Ne retirez pas la batterie si la DEL (B) clignote ; cela signifie qu'il existe des données en cache. a. Enfoncez la partie métallique (A) pour retirer la carte. b. Retirez la carte de batterie de cache des glissières de montage afin de l'extraire du contrôleur. Figure 58. Voyant des données en cache 76 Activation SAS (Serial-attached SCSI) RAID 13. Installez la carte de batterie de cache en procédant comme suit : Figure 59. Retrait de la carte de batterie de cache pour31E/2B Figure 60. Retrait de la carte de batterie de cache pour 31E/1C ou 31E/2C Retrait et remise en place des adaptateurs et batteries SAS RAID 77 Avertissement : Laissez la carte de batterie de cache déconnectée pendant au moins 60 secondes avant de connecter la nouvelle batterie. Il s'agit du temps minimum nécessaire à l'adaptateur pour détecter que la batterie a été remplacée. a. Faites glisser la carte de batterie de cache dans les glissières de montage du contrôleur jusqu'à ce qu'elle s'insère dans le connecteur de batterie. b. Alignez les trous d'alignement (B) sur ceux du châssis afin que la carte s'insère correctement. Figure 61. Installation de la carte de batterie de cache pour31E/2B 78 Activation SAS (Serial-attached SCSI) RAID 14. Redémarrez le cache d'écriture de l'adaptateur en procédant comme suit : Choisissez votre système d'exploitation : v AIX : Passez à l'étape 15. v Linux : Passez à l'étape 16. v IBM i : Passez à l'étape 17. 15. Procédez comme suit : a. Accédez au gestionnaire de grappes de disques IBM SAS en suivant la procédure de la section 3a. b. Sélectionnez Diagnostics and Recovery Options. c. Sélectionnez Controller Rechargeable Battery Maintenance. d. Sélectionnez Start Adapter Cache. e. Sélectionnez le contrôleur dont vous venez de remplacer la batterie et appuyez sur Entrée. f. Passez à l'étape 18. 16. Procédez comme suit : a. Exécutez l'utilitaire iprconfig en saisissant iprconfig. b. Sélectionnez Work with disk unit recovery. c. Sélectionnez Work with resources containing cache battery packs. d. Démarrez le cache de l'adaptateur d'E-S sur l'adaptateur correspondant à la batterie que vous venez de remplacer en saisissant 3, puis en appuyant sur Entrée. e. Passez à l'étape 18. 17. Procédez comme suit : a. Revenez à l'écran Work with Resources containing Cache Battery Packs en utilisant les informations obtenues à l'étape 7a à 7f et sélectionnez Start IOA Cache. Appuyez sur Entrée. b. Vérifiez que vous obtenez bien le message Cache was started. Figure 62. Installation de la carte de batterie de cache pour 31E/1C ou 31E/2C Retrait et remise en place des adaptateurs et batteries SAS RAID 79 c. Passez à l'étape 18. 18. Replacez le capot d'accès, comme indiqué à la rubrique «Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C», à la page 97. 19. Pour un système monté en armoire, mettez-le en position de fonctionnement, comme indiqué dans «Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire», à la page 99 20. Fermez le volet arrière de l'unité dont vous effectuez la maintenance. 21. Vérifiez que la nouvelle ressource fonctionne correctement, comme indiqué dans la rubrique «Vérification du composant installé», à la page 104. Retrait et remplacement du bloc de batterie de cache Cette procédure permet de remplacer le bloc de batterie de cache sur une carte de batterie. Avant de commencer Identifiez le système sur lequel vous travaillez à l'aide du processus d'identification du système afin d'activer l'indicateur de système (bleu). Pour plus d'informations, voir Voyants du panneau de commande. Avertissement : Avant de poursuivre ces procédures, déterminez si vous pouvez remplacer le bloc de batterie de cache sans risque. Voir Affichage des informations sur la batterie rechargeable. Vous pouvez remplacer le bloc de batterie de cache sans danger si la mention Yes apparaît en regard de l'option Battery pack can be safely replaced. Avertissement : Pour maintenir la disponibilité du système et éviter d'éventuelles pertes de données, vous devez comprendre que ces fonctions ont des conséquences sur la configuration de SAS (serial-attached SCSI) et l'accessibilité des données du système. Consultez la rubrique Sous-système SAS pour comprendre les différentes remarques sur la configuration et la maintenance SAS. Avant toute opération d'installation ou de maintenance d'un dispositif, consultez les configurations d'E-S pour chaque partition logique de votre système. Il est important de comprendre les différences entre les diverses configurations et les considérations de maintenance qui en résultent. En outre, tenez compte des incidences possibles sur les domaines suivants : v Partitions logiques et leurs affectations à l'adaptateur d'E-S v Configurations de grappes RAID v Emplacements de disque physique v Chemin d'accès à votre unité d'amorçage Pourquoi et quand exécuter cette tâche Procédez comme suit pour remplacer un bloc de batterie n'acceptant pas la maintenance simultanée : Procédure 1. Retirez le contrôleur du système Pour plus d'informations, voir «Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C», à la page 66. 2. Placez le contrôleur sur une surface protégée des décharges électrostatiques. 3. Débranchez le connecteur de batterie (A) de son connecteur sur la carte, en tirant la languette de verrouillage tout en tirant doucement sur la fiche. La fiche est connectée à la carte mère d'une seule façon ; elle ne peut donc être insérée de manière incorrecte lors de la procédure de remplacement. 80 Activation SAS (Serial-attached SCSI) RAID Remarque : Laissez le bloc de batterie de cache déconnecté pendant au moins 60 secondes avant de connecter la nouvelle batterie. Il s'agit du temps minimum nécessaire à la carte pour détecter que la batterie a été remplacée. Connecteur de la batterie (A) (B) Bloc de batterie du cache 4. La batterie est montée sur une plaque en plastique noir avec deux taquets, maintenue par des vis de butée à l'aide de goupilles de retenue. Pour retirer le bloc de batterie, pressez le dispositif de retenue de chaque goupille, puis soulevez le taquet de la plaque de montage noire afin de la libérer des deux broches de retenue. 5. Installez la batterie sur les supports de façon à ce que les loquets s'enclenchent. Figure 63. Retrait de la batterie de cache Retrait et remise en place des adaptateurs et batteries SAS RAID 81 6. Connectez le connecteur du bloc de batteries de cache (B) à la carte. La fiche est connectée à la carte mère d'une seule façon ; elle ne peut donc être insérée de manière incorrecte. 7. Réinstallez l'adaptateur et redémarrez le cache. Pour plus d'informations, voir les procédures suivantes : v «Retrait et installation de la carte de batterie de cache», à la page 73 v «Retrait du contrôleur de stockage RAID/cache sur le système 31E/2B, 31E/1C ou 31E/2C», à la page 66. Figure 64. Remplacement de la batterie de cache 82 Activation SAS (Serial-attached SCSI) RAID Procédures communes aux dispositifs installables Cette section contient toutes les procédures communes concernant l'installation, le retrait et le remplacement de fonctionnalités. Avant de commencer Respectez ces précautions lorsque vous installez, retirez ou remplacez des dispositifs ou des composants. Pourquoi et quand exécuter cette tâche Ces précautions visent à créer un environnement sûr pour la maintenance de votre système, mais ne proposent pas de procédure de maintenance du système. Les procédures d'installation, de retrait et de remplacement fournissent les processus étape par étape pour la maintenance de votre système. 83 DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les cordons d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les capots de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les cordons d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les cordons d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. 84 Activation SAS (Serial-attached SCSI) RAID (D005a) DANGER Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Avant de commencer toute procédure de remplacement ou d'installation, procédez comme suit : Procédures communes aux dispositifs installables 85 Procédure 1. Si vous installez un nouveau dispositif, vous devez disposer des logiciels requis pour la prise en charge de ce nouveau dispositif. 2. Si vous exécutez une procédure d'installation ou de remplacement qui risque d'affecter la sécurité des données, vous devez disposer, dans la mesure du possible, d'une copie de sauvegarde récente de votre système ou de la partition logique (systèmes d'exploitation, logiciels sous licence et données). 3. Passez en revue la procédure d'installation ou de remplacement relative au dispositif ou au composant. 4. Notez la correspondance des couleurs utilisées sur le système. La couleur bleue ou ocre sur les composants matériels indique un point de contact que vous pouvez utiliser pour retirer ou installer le composant sur le système, ouvrir ou fermer un levier, etc. La couleur ocre peut également désigner un composant que vous pouvez retirer ou remplacer alors que le système ou la partition logique est sous tension. 5. Vous devez vous munir d'un tournevis à lame plate moyen, d'un tournevis cruciforme et d'une paire de ciseaux. 6. Si certains composants sont incorrects, manquants ou visiblement endommagés, procédez comme suit : v Si vous remplacez un composant, contactez votre fournisseur de services ou le support technique. v Si vous installez une fonctionnalité, contactez l'un des services suivants : – Le fournisseur de services ou le support technique 7. Si vous rencontrez des difficultés lors de l'installation, contactez votre fournisseur de services, ou le support technique. 8. Si vous installez du nouveau matériel dans une partition logique, vous devez planifier et comprendre les implications inhérentes au partitionnement du système. Pour plus d'informations, consultez la rubrique Partitionnement logique. Identification d'un composant défaillant Ces instructions permettent d'apprendre à localiser et identifier un composant défaillant ou à retirer, ou encore de localiser l'emplacement d'installation d'un nouveau composant sur votre système ou unité d'extension en utilisant une méthode adaptée à votre système. Pourquoi et quand exécuter cette tâche Pour les serveurs systèmes dotés du processeur POWER7, les voyants peuvent être utilisés pour identifier ou vérifier un composant à retirer, installer ou en cours de maintenance. La combinaison de la fonction d'identification et du voyant d'incident (couleur ambre) montre l'emplacement d'une unité remplaçable sur site (unité FRU). Lorsque vous retirez une unité FRU, vérifiez d'abord que vous travaillez sur l'unité FRU correcte à l'aide de la fonction d'identification dans la console de gestion ou dans une autre interface utilisateur. Lorsque vous retirez une unité FRU dans la console de gestion du matériel, la fonction d'identification est activée et désactivée automatiquement aux moments opportuns. La fonction d'identification produit le clignotement du voyant jaune. Lorsque vous désactivez la fonction d'identification, le voyant revient à son état précédent. Pour les composants dotés d'un bouton de maintenance bleu, la fonction d'identification définit les informations du voyant pour le bouton de maintenance de sorte que lorsque le bouton est activé, les voyants corrects du composant clignotent. Si vous avez besoin d'utiliser la fonction d'identification, utilisez les procédures suivantes. 86 Activation SAS (Serial-attached SCSI) RAID Voyants du panneau de commande Ces informations peuvent s'utiliser comme un guide sur les voyants et boutons du panneau de commande. Le panneau de commande est doté de voyants qui indiquent les différents états du système. v A : Bouton de mise sous tension v B : Voyant d'alimentation – Un voyant fixe indique que l'unité est sous alimentation système complète. – Un voyant clignotant indique que l'unité est sous alimentation de mode veille. Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe. Durant cette période de transition, le clignotement peut s'accélérer. v C : Voyant d'identification du boîtier – Une lumière constante indique l'état d'identification, qui permet d'identifier une pièce. – L'absence de lumière indique un fonctionnement normal du système. v D: Voyant d'information système – L'absence de lumière indique un fonctionnement normal du système. – Un voyant allumé signifie qu'une intervention est requise sur le système. v E : Port USB v F : Voyant de panne de boîtier – Un voyant fixe indique une panne dans l'unité centrale. – L'absence de lumière indique un fonctionnement normal du système. v G : Ecran Fonctions/Données v H : Bouton de décrémentation v I : Bouton Entrée Figure 65. Panneau de commande Procédures communes aux dispositifs installables 87 v J : Bouton d'incrémentation v K : Bouton de réinitialisation Concepts associés Identification d'un composant défaillant Ces instructions permettent de savoir comment localiser et identifier un composant défaillant sur un système ou une unité d'extension en utilisant la méthode propre à votre système. Identification d'un composant défaillant sur un système AIX ou une partition logique Les instructions permettent de savoir comment rechercher un composant défaillant, puis d'activer le voyant de ce composant sur un système ou une partition logique exécutant le système d'exploitation AIX. Localisation d'un composant défaillant sur un système AIX ou une partition logique Il se peut que vous deviez utiliser les outils AIX, avant d'activer le voyant dans le cadre de la recherche d'un composant défaillant. Procédure 1. Connectez-vous en tant qu'utilisateur root ou celogin-. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée. 5. Dans les précédents diagnostics, choisissez Affichage du journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics qui contient une liste chronologique des événements s'affiche. 6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur Entrée. 7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal. 8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée. 9. Retournez à la ligne de commande. Que faire ensuite Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce dernier. Pour plus d'informations, voir «Activation du voyant associé au composant défaillant». Activation du voyant associé au composant défaillant Ces instructions permettent d'identifier physiquement l'emplacement d'un composant pour lequel vous intervenez. Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. 6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant. 7. Retournez à la ligne de commande. 88 Activation SAS (Serial-attached SCSI) RAID Désactivation du voyant d'un composant défaillant Cette procédure permet d'éteindre un voyant allumé dans le cadre d'une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour le désactiver, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement, puis appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. Lorsque le voyant d'un composant défaillant est activé, le caractère I précède le code d'emplacement. 6. Sélectionnez Validation. Le programme désactive l'intervention du système ainsi que le voyant associé au composant défaillant. 7. Retournez à la ligne de commande. Identification d'un composant défaillant sur un système Linux ou une partition logique Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous pouvez activer ou désactiver les voyants pour localiser un composant ou effectuer une opération de maintenance. Localisation d'un composant défaillant sur un système Linux ou une partition logique Si l'aide à la maintenance a été installée sur un système ou une partition logique, vous devez activer les voyants pour localiser un composant. Pourquoi et quand exécuter cette tâche Pour l'activer, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et appuyez sur Entrée. 3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant. Localisation d'un composant défaillant sur un système ou une partition logique Linux Pour retrouver le code d'emplacement d'un composant défaillant si vous avez oublié cette information, utilisez la procédure décrite dans cette rubrique. Pourquoi et quand exécuter cette tâche Pour localiser le composant défaillant sur un système ou une partition logique, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez grep diagela /var/log/platform et appuyez sur Entrée. Procédures communes aux dispositifs installables 89 3. Recherchez l'entrée la plus récente contenant un code SRC (System Reference Code). 4. Notez les informations d'emplacement. Activation du voyant associé au composant défaillant Si vous connaissez le code d'emplacement du composant défaillant, activez le voyant pour vous aider à rechercher le composant à remplacer. Pourquoi et quand exécuter cette tâche Pour l'activer, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez /usr/sbin/usysident -s identify -l et appuyez sur Entrée. 3. Repérez le voyant d'incident pour identifier le boîtier qui renferme le composant défaillant. Résultats Désactivation du voyant d'un composant défaillant Après avoir terminé la procédure de retrait et de remplacement, vous devez désactiver le voyant du composant défaillant. Pourquoi et quand exécuter cette tâche Pour le désactiver, procédez comme suit : Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez /usr/sbin/usysident -s normal -l et appuyez sur Entrée. Localisation d'un composant défaillant sur un système ou une partition logique Virtual I/O Server Vous pouvez utiliser les outils Virtual I/O Server (VIOS) avant d'activer le voyant, pour rechercher un composant défaillant. Procédure 1. Connectez-vous en tant qu'utilisateur root ou celogin-. 2. A l'invite, tapez diagmenu et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche et appuyez sur Entrée. 4. Sélectionnez Affichage des résultats de tests de diagnostic précédents, puis appuyez sur Entrée. 5. Dans l'écran Affichage des résultats de tests de diagnostic précédents, sélectionnez Affichage du journal des diagnostics abrégé. L'écran Affichage du journal des diagnostics apparaît. Il contient la liste des événements triée dans l'ordre chronologique. 6. Dans la colonne T, recherchez l'entrée S la plus récente. Sélectionnez la ligne, puis appuyez sur Entrée. 7. Sélectionnez Validation. Le programme affiche les détails de l'entrée du journal. 8. Notez l'emplacement et le code SRN figurant vers la fin de l'entrée. 9. Retournez à la ligne de commande. 90 Activation SAS (Serial-attached SCSI) RAID Résultats Utilisez les informations de localisation du composant défaillant pour activer le voyant qui identifie ce dernier. Pour plus d'informations, voir «Identification d'un composant à l'aide du Virtual I/O Server». Identification d'un composant à l'aide du Virtual I/O Server Ces instructions permettent d'activer le voyant afin de repérer l'emplacement physique d'un composant à l'aide du Virtual I/O Server (VIOS). Procédure 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diagmenu et appuyez sur Entrée. 3. Dans le menu Sélection des fonctions, choisissez Sélection d'une tâche. Appuyez sur Entrée. 4. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement. Appuyez sur Entrée. 5. Dans la liste des voyants, sélectionnez le code d'emplacement associé au composant défaillant et appuyez sur Entrée. 6. Sélectionnez Validation. Le programme allume le voyant d'incident associé au composant défaillant. 7. Retournez à la ligne de commande. Démarrage du système ou de la partition logique La présente section explique comment démarrer un système ou une partition logique après avoir effectué une action de maintenance ou une mise à niveau du système. Démarrage d'un système non géré par une console HMC ou une console SDMC Vous pouvez utiliser le bouton d'alimentation ou l'interface ASMI pour démarrer un système qui n'est pas géré par une console HMC (Hardware Management Console) ni par une console Systems Director Management Console (SDMC). Pourquoi et quand exécuter cette tâche Pour démarrer un système non géré par une console HMC ou SDMC, procédez comme suit : Procédure 1. Si nécessaire, ouvrez le volet avant de l'armoire. 2. Avant d'appuyer sur le bouton d'alimentation du panneau de commande, vérifiez que le système est bien relié à une source d'alimentation : v Tous les cordons d'alimentation système doivent être reliés à une source d'alimentation. v Le voyant d'alimentation clignote lentement, comme indiqué dans la figure suivante. v Comme indiqué dans la figure ci-après, la partie supérieure de l'écran doit afficher la mention 01 V=F. 3. Appuyez sur le bouton d'alimentation (A) du panneau de commande (voir figure suivante). Procédures communes aux dispositifs installables 91 v A : Bouton de mise sous tension v B : Voyant d'alimentation – Un voyant fixe indique que l'unité est sous alimentation système complète. – Un voyant clignotant indique que l'unité est sous alimentation de mode veille. Remarque : Une période d'environ 30 secondes s'écoule entre le moment où vous appuyez sur le bouton de mise sous tension et celui où le voyant d'alimentation cesse de clignoter pour devenir fixe. Durant cette période de transition, le clignotement peut s'accélérer. v C : Voyant d'identification de boîtier – Une lumière fixe indique l'état d'identification du boîtier ou d'une ressource du boîtier. – Aucune lumière signifie qu'aucune ressource du boîtier n'est identifiée. v D : Voyant d'incident – L'absence de lumière indique un fonctionnement normal du système. – Un voyant fixe indique qu'une intervention est requise sur le système. v E : Port USB v F : Voyant de panne de boîtier – Un voyant fixe correspond à un voyant de panne actif sur le système. – L'absence de lumière indique un fonctionnement normal du système. v G : Ecran Fonctions/Données v H : Bouton de décrémentation v I : Bouton Entrée v J : Bouton d'incrémentation v K : Bouton de réinitialisation 4. Une fois que vous avez appuyé sur le bouton d'alimentation, notez les éléments suivants : v Le voyant d'alimentation commence à clignoter plus rapidement. v Les ventilateurs système sont activés après environ 30 secondes et commencent à tourner plus rapidement. Figure 66. Panneau de commande 92 Activation SAS (Serial-attached SCSI) RAID v L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du système. Le voyant de mise sous tension arrête de clignoter et reste fixe, indiquant que le système est sous tension. Que faire ensuite Conseil : Si le système ne démarre pas alors que vous avez appuyé sur le bouton d'alimentation, utilisez la procédure suivante pour démarrer le système à l'aide de l'interface ASMI (Advanced System Management Interface) : 1. Lancez l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Accès à l'interface ASMI. 2. Démarrez le système à l'aide de l'interface ASMI. Pour plus d'informations, reportez-vous à la rubrique Mise sous tension et arrêt du système. Démarrage d'un système ou d'une partition logique via la console HMC Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour démarrer le système ou la partition logique une fois les câbles requis installés et connectés à une source d'alimentation. Pourquoi et quand exécuter cette tâche Pour obtenir des instructions sur l'utilisation de la console HMC, voir Gestion de la console HMC (Hardware Management Console). Pour plus d'informations sur le démarrage d'une partition logique, reportez-vous à la rubrique Partitionnement logique. Pour plus d'informations sur le démarrage du système, reportez-vous à la rubrique Mise sous tension du système géré. L'écran du panneau de commande affiche les jauges (ou points de contrôle) pendant le démarrage du système. Lorsque le voyant d'alimentation s'arrête de clignoter et reste fixe, cela signifie que le système est sous tension. Démarrage d'un système ou d'un serveur virtuel via la console SDMC Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console) pour démarrer le système ou le serveur virtuel une fois les câbles requis installés et les cordons d'alimentation connectés à une alimentation. Pourquoi et quand exécuter cette tâche Pour obtenir des instructions sur l'utilisation de la console SDMC, voir Gestion et configuration de la console SDMC. Pour des instructions sur le démarrage d'un serveur virtuel, voir Gestion de serveurs virtuels. Pour des instructions sur l'arrêt et le redémarrage de serveurs virtuels, voir Arrêt et redémarrage de serveurs virtuels. L'écran du panneau de commande affiche les jauges, ou points de contrôle, pendant le démarrage du système. Lorsque le voyant de mise sous tension du panneau de commande cesse de clignoter et reste fixe, le système est sous tension. Arrêt d'un système ou d'une partition logique La présente section explique comment arrêter un système ou une partition logique dans le cadre d'une mise à niveau du système ou d'une procédure de maintenance. Procédures communes aux dispositifs installables 93 Pourquoi et quand exécuter cette tâche Avertissement : Si vous utilisez le bouton de mise sous tension ou les commandes de la console HMC (Hardware Management Console) pour arrêter le système, les fichiers de données risquent d'être endommagés. Par ailleurs, le système risque de mettre plus de temps à démarrer la prochaine fois si toutes les applications n'ont pas été fermées avant l'arrêt du système. Pour arrêter le système ou la partition logique, sélectionnez la procédure appropriée. Arrêt d'un système non géré par une console HMC ou une console SDMC Pour effectuer une autre tâche, vous devez arrêter le système. Ces instructions indiquent comment arrêter le système à l'aide du bouton d'alimentation ou de l'interface ASMI. Avant de commencer Avant d'arrêter le système, procédez comme suit : 1. Vérifiez que tous les travaux sont terminés et fermez toutes les applications. 2. Assurez-vous que le système d'exploitation est arrêté. Avertissement : Si vous ne le faites pas, vous risquez de perdre des données. 3. Si une partition logique Virtual I/O Server (VIOS) est en cours d'exécution, assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pourquoi et quand exécuter cette tâche La procédure ci-dessous explique comment arrêter un système non géré par une console HMC (Hardware Management Console). Procédure 1. Connectez-vous au système sous un nom d'utilisateur bénéficiant des droits d'accès nécessaires pour exécuter la commande shutdown ou pwrdwnsys (mettre le système hors tension). 2. A l'invite, tapez la commande appropriée : v Si le système exécute le système d'exploitation AIX, tapez shutdown. v Si votre système exécute le système d'exploitation Linux, tapez shutdown -h now. v Si votre système exécute le système d'exploitation , tapez PWRDWNSYS. Si votre système est partitionné, utilisez la commande PWRDWNSYS pour arrêter chaque partition secondaire. Ensuite, utilisez la commande PWRDWNSYS pour arrêter la partition principale. La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de mise sous tension commence à clignoter lentement et le système passe en veille. 3. Sur la ligne de commande Linux, tapez shutdown -h now. La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de mise sous tension commence à clignoter lentement et le système passe en veille. 4. Sur la ligne de commande Linux, tapez shutdown -h now. La commande arrête le système d'exploitation. Une fois le système d'exploitation arrêté, le voyant de mise sous tension commence à clignoter lentement et le système passe en veille. 5. Pour restaurer l'état du système à l'issue de l'opération d'installation ou de remplacement, notez le type et le mode de l'IPL. 6. Mettez les interrupteurs d'alimentation des unités reliées au système sur Off. 7. Débranchez tous les cordons d'alimentation connectés à l'unité des socles de prise de courant. Vérifiez que vous débranchez également les cordons d'alimentation des périphériques (imprimantes et unités d'extension, par exemple). 94 Activation SAS (Serial-attached SCSI) RAID Important : Le système est équipé d'un second bloc d'alimentation. Avant de continuer la procédure, vérifiez que toutes les sources d'alimentation ont été débranchées du système. (L003) ou Arrêt d'un système via la console HMC Vous pouvez utiliser l'interface utilisateur de la console HMC (Hardware Management Console) pour arrêter le système ou une partition logique. Pourquoi et quand exécuter cette tâche Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous arrêtez sa dernière partition logique en cours d'exécution. Si vous définissez les propriétés du système géré sur la console HMC de façon à empêcher la mise hors tension automatique du système, vous devez utiliser cette procédure pour mettre hors tension le système géré. Avertissement : Dans la mesure du possible, arrêtez les partitions logiques en cours d'exécution avant de mettre le système géré hors tension. Si vous n'arrêtez pas d'abord les partitions logiques, vous risquez de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual I/O Server (VIOS), assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants : v Administrateur central v Technicien de maintenance v Opérateur v Responsable produit Procédures communes aux dispositifs installables 95 Procédez comme suit pour arrêter le système via la console HMC. Procédure 1. Dans la zone de navigation, développez le dossier Gestion de systèmes. 2. Cliquez sur l'icône Serveurs. 3. Dans la zone de contenu, sélectionnez le système géré. 4. Sélectionnez Tâches, Opérations, puis Mise hors tension 5. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK. Information associée Arrêt et redémarrage de partitions logiques Arrêt d'un système via la console SDMC Vous pouvez utiliser l'interface utilisateur de la console SDMC (Systems Director Management Console) pour arrêter le système ou un serveur virtuel. Pourquoi et quand exécuter cette tâche Par défaut, le système géré est configuré pour se mettre hors tension automatiquement lorsque vous arrêtez le dernier serveur virtuel en cours d'exécution. Si vous définissez les propriétés du système géré sur la console SDMC de façon à empêcher la mise hors tension automatique du système, vous devez utiliser cette procédure pour mettre hors tension le système géré. Avertissement : Si possible, arrêtez les serveurs en cours d'exécution sur le système géré avant de mettre ce dernier hors tension. Si vous n'arrêtez pas d'abord les serveurs virtuels, vous risquez de provoquer leur arrêt anormal et de perdre des données. Si vous utilisez une partition logique Virtual I/O Server (VIOS) assurez-vous que tous les clients sont déconnectés ou qu'ils peuvent accéder à leurs unités par un autre moyen. Pour mettre le système géré hors tension, vous devez utiliser l'un des profils utilisateur suivants : v Administrateur central v Technicien de maintenance v Opérateur v Responsable produit Procédez comme suit pour arrêter le système via la console SDMC. Procédure 1. Dans la zone des ressources Power Systems sélectionnez le système géré que vous voulez mettre hors tension. 2. Dans le menu Actions, sélectionnez Opérations > Mise hors tension. 3. Sélectionnez le mode de mise hors tension approprié et cliquez sur OK. Retrait et remplacement des capots sur le système 31E/2B, 31E/1C ou 31E/2C Les présentes instructions permettent de retirer, remettre en place ou installer des capots pour accéder à des composants ou effectuer des opérations de maintenance. Retrait du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C Cette procédure décrit comment retirer le capot d'accès. 96 Activation SAS (Serial-attached SCSI) RAID Procédure 1. Desserrez la vis moletée (A) située à l'arrière du capot. 2. Faites glisser le capot (B) vers l'arrière de l'unité centrale. Lorsque la partie avant du capot d'accès n'est plus sur le bord du cadre supérieur, soulevez le capot et retirez-le de l'unité centrale. Avertissement : Lorsque le capot d'accès pour la maintenance est retiré, le système est mis hors tension. Installation du capot d'accès du système 31E/2B, 31E/1C ou 31E/2C Cette procédure décrit comment installer le capot d'accès. Procédure 1. Placez le capot d'accès (A) sur le haut du système, à environ 25 mm du rebord du châssis supérieur. 2. Maintenez le capot d'accès contre l'unité centrale et faites-le glisser vers l'avant du système. Les languettes du capot d'accès glissent sous le rebord du boîtier supérieur. 3. Alignez la vis moletée (B) située à l'arrière du capot d'accès avec le trou à l'arrière du boîtier du système. 4. Serrez les vis moletées afin de fixer le capot d'accès. Figure 67. Retrait du capot d'accès Procédures communes aux dispositifs installables 97 Mise en position de maintenance ou de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le cadre d'une opération de maintenance ou pour accéder à des composants internes. Mise en position de maintenance du système 31E/2B, 31E/1C ou 31E/2C monté en armoire Utilisez cette procédure pour placer le système monté en armoire en position de maintenance. Avant de commencer Remarques : v Lorsque vous placez l'armoire en position de maintenance, il est essentiel de positionner correctement toutes les plaques de stabilisation pour empêcher l'armoire de basculer. Assurez-vous qu'il n'y ait qu'une seule unité centrale en position de maintenance à la fois. v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale lorsque vous extrayez celle-ci de l'armoire. v Une fois les glissières de l'unité centrale complètement étendues, les taquets de sécurité s'enclenchent, ce qui empêche l'unité de trop sortir de son emplacement. Procédure 1. Ouvrez le volet avant de l'armoire. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. Figure 68. Installation du capot d'accès 98 Activation SAS (Serial-attached SCSI) RAID 3. Tout en maintenant enfoncés les taquets de déverrouillage droit et gauche de l'unité centrale, tirez l'unité centrale hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et verrouillées. Mise en position de fonctionnement du système 31E/2B, 31E/1C ou 31E/2C monté en armoire Utilisez cette procédure pour placer le système monté en armoire en position de fonctionnement. Avant de commencer Lors du placement en position de fonctionnement du système, veillez à ne pas bloquer ou plier les câbles situés à l'arrière du système lorsque vous poussez l'unité dans l'armoire. Procédure 1. Déverrouillez les taquets de sécurité bleus (A) en les levant vers le haut. Figure 69. Placement du système en position de maintenance Procédures communes aux dispositifs installables 99 2. Poussez l'unité centrale dans l'armoire jusqu'à ce que les deux taquets de déverrouillage de l'unité soient en position. Figure 70. Déverrouillage des taquets de sécurité des glissières Figure 71. Mise en position de fonctionnement du système 100 Activation SAS (Serial-attached SCSI) RAID 3. Fermez le volet avant de l'unité dont vous effectuez la maintenance. Déconnexion des cordons d'alimentation du système 31E/2B, 31E/1C ou 31E/2C Utilisez ces procédures pour déconnecter les cordons d'alimentation du système. Procédure 1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Le système est peut-être équipé de deux blocs d'alimentation. Avant de continuer toute procédure de retrait et de remplacement, vérifiez que toutes les deux sources d'alimentation ont été débranchées du système. Connexion des cordons d'alimentation au système 31E/2B, 31E/1C ou 31E/2C Utilisez ces procédures pour connecter les cordons d'alimentation au système. Procédure 1. Ouvrez le volet arrière de l'unité dont vous effectuez la maintenance. 2. Identifiez l'unité système dont vous effectuez la maintenance dans l'armoire. 3. Débranchez tous les cordons d'alimentation de l'unité dont vous effectuez la maintenance. Vérifiez que les cordons d'alimentation passent par les poignées. Figure 72. Retrait des cordons d'alimentation Procédures communes aux dispositifs installables 101 Installation d'un dispositif via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de maintenance, y compris l'installation d'un nouveau dispositif ou composant. Avant de commencer Pour employer l'interface utilisateur de la console HMC pour installer un dispositif ou un composant dans une unité centrale ou une unité d'extension gérée par une console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs. 2. Sélectionnez le système géré sur lequel vous allez installer le composant. 3. Dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel. 4. Sélectionnez Ajout d'une FRU (unité remplaçable sur site). 5. Dans la fenêtre Installation/ajout/suppression de matériel, sélectionnez le système ou le boîtier dans lequel vous installez le dispositif. 6. Sélectionnez dans la liste le type de dispositif que vous souhaitez installer et cliquez sur Suivant. 7. Sélectionnez le code d'emplacement pour l'installation du dispositif et cliquez sur Ajouter. 8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'installation du dispositif. Remarque : La console HMC peut afficher des instructions expliquant comment installer le dispositif. Dans ce cas, suivez ces instructions pour installer le dispositif. Figure 73. Connexion des cordons d'alimentation 102 Activation SAS (Serial-attached SCSI) RAID Retrait d'un composant via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer de nombreuses opérations de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Pour employer l'interface utilisateur de la console HMC afin de retirer un composant dans une unité centrale ou une unité d'extension gérée par une console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs. 2. Sélectionnez le système géré sont vous souhaitez retirer un composant. 3. Sélectionnez l'une des options suivantes. v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel. v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des tâches, développez, Maintenabilité > Matériel > Retrait de FRU. 4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Retrait de FRU. 5. Dans la fenêtre Installation/ajout/suppression de matériel - Retrait de FRU, sélectionnez le type de FRU, sélectionnez le système ou le boîtier duquel vous souhaitez retirer le composant. 6. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant. 7. Sélectionnez l'emplacement du composant que vous retirez et cliquez sur Ajouter. 8. Une fois que le composant figure dans la liste des actions en attente, cliquez sur Lancer la procédure et suivez les instructions de retrait du composant. Remarque : La console HMC peut afficher des instructions de l'Information Center expliquant comment retirer le composant. Dans ce cas, suivez ces instructions pour retirer le composant. Remplacement d'un composant via la console HMC Vous pouvez utiliser la console HMC (Hardware Management Console) pour effectuer diverses tâches de maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez ces instructions. Si vous échangez un composant dans le cadre d'une autre procédure à l'aide de la version 7 ou d'une version ultérieure de la console HMC, procédez comme suit : Procédure 1. Dans la zone de navigation, développez Gestion de systèmes > Serveurs. 2. Sélectionnez le système géré dont vous souhaitez remplacer un composant. 3. Sélectionnez l'une des options suivantes. v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, dans la zone des tâches, développez Maintenabilité > Matériel > Tâches de modification de matériel. v Si vous remplacez un composant dans le cadre d'une opération de maintenance, depuis la zone des tâches, développez, Maintenabilité > Matériel > Remplacement de FRU. Procédures communes aux dispositifs installables 103 4. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Remplacement de FRU. 5. Dans la fenêtre Remplacement de matériel - Remplacement de FRU, Sélectionnez le type de FRU, sélectionnez dans la liste le système ou le boîtier dans lequel vous souhaitez échanger un composant. 6. Dans le menu, sélectionnez le type du composant et cliquez sur Suivant. 7. Dans le menu, sélectionnez le code emplacement du composant et cliquez sur Ajouter. 8. Une fois l'unité remplaçable sur site FRU placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'échange du dispositif. Remarque : Il se peut que la console HMC ouvre une fenêtre d'instructions pour le remplacement du composant. Dans ce cas, suivez les instructions pour remplacer le composant. Vérification du composant installé Vous pouvez vérifier un nouveau composant installé ou remplacé sur votre système, partition logique ou unité d'extension à l'aide du système d'exploitation, de diagnostics autonomes ou de la console HMC (Hardware Management Console). Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique AIX Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils dans le système d'exploitation AIX pour vérifier que ce dispositif ou composant est reconnu par le système ou la partition logique. Pourquoi et quand exécuter cette tâche Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé, sélectionnez la procédure appropriée : v Vérification du dispositif installé via AIX v Vérification du composant de remplacement via AIX Vérification du dispositif installé via le système d'exploitation AIX : 1. Connectez-vous en tant qu'utilisateur root. 2. A l'invite, tapez diag et appuyez sur Entrée. 3. Sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. 4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures suivantes : v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des ressources et appuyez sur Entrée. v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les ressources et appuyez sur Entrée. 6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en répondant aux invites éventuelles. 7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun incident n’a été détecté ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est 104 Activation SAS (Serial-attached SCSI) RAID installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et revenez en mode de fonctionnement normal. Vérification du composant de remplacement via le système d'exploitation AIX : Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé, procédez comme suit : 1. Avez remplacé le composant à l'aide du système d'exploitation AIX ou des opérations de remplacement à chaud du service de diagnostic en ligne ? Non : Passez à l'étape 2. Oui : Passez à l'étape 5. 2. Le système est-il hors tension ? Non : Passez à l'étape 4. Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus d'informations, consultez la rubrique Exécution d'une initialisation lente. 3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation AIX s'affiche ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête. Avez-vous vu l'invite de connexion AIX ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement et de démarrage du système d'exploitation. Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le composant. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Passez à l'étape 4. 4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite s'affiche, passez à l'étape 5. Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez comme suit : a. Sélectionnez la ressource, puis appuyez sur Entrée. b. Sélectionnez Validation. c. Suivez les instructions affichées. d. Si le message Souhaitez-vous revoir l'erreur affichée précédemment ? s'affiche, cliquez sur Oui et appuyez sur Entrée. e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services. f. Si aucun code SRN ne s'affiche, passez à l'étape 5. 5. Testez le composant en procédant comme suit : a. A l'invite, tapez diag et appuyez sur Entrée. b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. Procédures communes aux dispositifs installables 105 d. Sélectionnez Toutes les ressources ou sélectionnez les programmes de diagnostic du composant pour tester uniquement le composant remplacé ou les unités qui y raccordées et appuyez sur Entrée. Le menu Action corrective sur ressource s'est-il affiché ? Non : Passez à l'étape 6. Oui : Passez à l'étape 7. 6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est terminée. v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. Conseil : Le voyant du composant passe de l'état Incident à l'état normal. Passez à l'étape 9. 7. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant passe à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource s'est-il affiché de nouveau ? Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. Oui : Passez à l'étape 8. 8. Dans le menu Action corrective sur ressource, le composant parent ou enfant de la ressource que vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Le voyant du composant passe de l'état Incident à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. 9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures précédentes), restaurez la valeur initiale des paramètres. 10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ? Non : Passez à l'étape 11, à la page 107. Oui : Passez à l'étape 12, à la page 107. 106 Activation SAS (Serial-attached SCSI) RAID 11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu démarrer le système d'exploitation ? Non : Contactez votre fournisseur de services. La procédure est terminée. Oui : Passez à l'étape 12. 12. Les voyants sont-ils encore allumés ? v Non. Cela marque la fin de la procédure. v Oui : Désactivez les voyants. Pour plus d'informations, consultez la rubrique suivante : Modification des indicateurs de service. Vérification d'un composant installé sur un système Linux ou une partition logique Si vous avez installé un nouveau composant, utilisez les instructions de cette section pour savoir comment vérifier que le système reconnaît ce composant. Pourquoi et quand exécuter cette tâche Pour vérifier un composant que vous venez d'installer ou de remplacer consultez la rubrique «Vérification d'un composant installé via les programmes de diagnostic autonome». Vérification d'un composant installé via les programmes de diagnostic autonome Si vous avez installé ou remplacé un composant, vérifiez que le système le reconnaît. Vous pouvez utiliser les programmes de diagnostic autonomes pour vérifier un composant installé sur un système AIX ou Linux, une unité d'extension ou une partition logique. Avant de commencer v Si le serveur est directement connecté à un autre serveur ou à un réseau, vérifiez que la communication avec les autres serveurs est interrompue. v Les programmes de diagnostic autonomes doivent utiliser l'ensemble des ressources de la partition logique. Aucune autre activité ne peut être exécutée sur la partition. v Les programmes de diagnostic autonomes doivent accéder à la console système. Pourquoi et quand exécuter cette tâche Pour accéder aux programmes de diagnostic, utilisez le CD-ROM approprié ou le serveur NIM (Network Installation Management). La procédure suivante explique comment utiliser les programmes de diagnostic depuis le CD-ROM. Pour obtenir plus d'informations sur l'exécution de diagnostics à partir du serveur NIM, voir Exécution de diagnostics autonomes à partir d'un serveur NIM. Pour utiliser les programmes de diagnostic autonomes, procédez comme suit : Procédure 1. Arrêtez les travaux et les applications, puis arrêtez le système d'exploitation sur le système ou la partition logique. 2. Retirez les bandes, les disquettes et les CD-ROM. 3. Mettez l'unité centrale hors tension. L'étape suivante permet d'initialiser le serveur ou la partition logique à partir du CD-ROM des programmes de diagnostic autonomes. Si l'unité de disques optique n'est pas configurée comme unité d'amorçage sur le serveur ou la partition logique que vous utilisez, procédez comme suit : a. Lancez l'interface ASMI. Pour plus d'informations sur l'utilisation de l'interface ASMI, reportez-vous à la rubrique Accès à l'interface ASMI. Procédures communes aux dispositifs installables 107 b. Dans le menu principal de l'interface ASMI, cliquez sur Contrôle d'alimentation/redémarrage. c. Cliquez sur Mettre le système sous/hors tension. d. Dans le menu déroulant des modes d'initialisation de la partition logique AIX ou Linux sélectionnez l'option Amorçage du mode service à partir de la liste par défaut. e. Cliquez sur Sauvegarder les paramètres et mettre sous tension. Dès que l'unité de disques optique est sous tension, insérez le CD-ROM des programmes de diagnostic autonomes. f. Passez à l'étape 5. 4. Mettez l'unité centrale sous tension et insérez immédiatement le CD-ROM de programme de diagnostic autonome dans l'unité de disque optique. 5. Après l'affichage du voyant POST clavier sur la console système et avant l'affichage du dernier voyant POST (haut-parleur), appuyez sur la touche numérique 5 de la console système pour indiquer qu'une initialisation en mode service doit être effectuée au moyen de la liste des unités d'amorçage en mode service par défaut. 6. Tapez le mot de passe requis. 7. Dans l'écran des instructions d'exécution des tests de diagnostic, appuyez sur Entrée. Conseil : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Remarque : Si le système a généré un code SRN ou un autre code de référence alors que vous tentiez de le démarrer, contactez votre fournisseur de service pour obtenir de l'aide. 8. Si le type de terminal est requis, sélectionnez l'option Initialisation du terminal du menu Sélection des fonctions pour initialiser le système d'exploitation. 9. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. 10. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 11. Lorsque le menu Sélection des tests de diagnostic, mode étendu s'affiche, sélectionnez Toutes les ressources ou testez uniquement le composant remplacé ou les unités qui y sont rattachées en sélectionnant les programmes de diagnostic appropriés puis en appuyant sur Entrée. 12. Le message Test terminé, Aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. v Oui : Passez à l'étape 13. 13. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures précédentes), restaurez la valeur initiale des paramètres. 14. Si les voyants sont toujours allumés, procédez comme suit : a. Dans le menu Sélection des tâches, sélectionnez Indicateurs d'identification et d'avertissement pour désactiver les voyants d'identification et d'avertissement et appuyez sur Entrée. b. Sélectionnez Mettre l'indicateur d'avertissement système à NORMAL, puis appuyez sur Entrée. c. Sélectionnez Mettre tous les indicateurs d'identification à NORMAL, puis appuyez sur Entrée. d. Sélectionnez Validation. Remarque : Les voyants d'identification et d'incident passent de l'état Incident à l'état Normal. e. Retournez à la ligne de commande. Vérification d'un composant installé à l'aide de la console HMC (Hardware Management Console) Si vous avez installé ou remplacé un composant, utilisez la console HMC (Hardware Management Console) pour mettre à jour les enregistrements de la console HMC une fois que vous avez effectué une 108 Activation SAS (Serial-attached SCSI) RAID opération de maintenance sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure. Pourquoi et quand exécuter cette tâche Pour vérifier un composant installé, procédez comme suit : Procédure 1. Sur la console HMC, recherchez d'éventuels événements d'action de maintenance ouverts dans le journal des événements de maintenance. Pour plus d'informations, reportez-vous à la rubrique «Affichage des événements réparables à l'aide de la console HMC», à la page 111. 2. Des événements d'action de maintenance sont-ils ouverts ? Non : Si le voyant d'avertissement système est encore allumé, utilisez la console HMC pour le désactiver. Consultez la rubrique «Activation/Désactivation des voyants via la console HMC». La procédure est terminée. Oui : Passez à l'étape suivante. 3. Notez la liste des événements d'action de maintenance ouverts. 4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils les mêmes que ceux que vous avez collectés auparavant ? v Non : Sélectionnez l'une des options suivantes : – Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond et passez à l'étape suivante. – Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez votre fournisseur de services. v Oui : Passez à l'étape suivante. 5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence l'événement d'action de maintenance. 6. Cliquez sur Fermer l'événement. 7. Ajoutez des commentaires sur l'événement réparable. Incluez toute autre information propre à cet événement. Cliquez sur OK. 8. Avez-vous remplacé, ajouté ou modifié une unité remplaçable sur site (FRU) de l'événement d'action de maintenance ouvert ? v Non : Sélectionnez l'option Aucune FRU remplacée pour cet événement réparable et cliquez sur OK pour fermer l'événement d'action de maintenance. v Oui : Procédez comme suit : a. Dans la liste des unités remplaçables sur site, sélectionnez une unité à mettre à jour. b. Cliquez deux fois sur l'unité, puis mettez à jour les informations correspondantes. c. Cliquez sur OK pour refermer l'événement d'action de maintenance. 9. Si les incidents persistent, contactez votre fournisseur de services. Activation/Désactivation des voyants via la console HMC Utilisez cette procédure pour activer ou désactiver des voyants via le Point focal de service pour la console HMC (Hardware Management Console). Pourquoi et quand exécuter cette tâche Choisissez l'une des options suivantes : v «Désactivation d'un voyant d'avertissement système ou de partition» v «Activation ou désactivation d'un voyant d'identification», à la page 110 Désactivation d'un voyant d'avertissement système ou de partition : Procédures communes aux dispositifs installables 109 Pourquoi et quand exécuter cette tâche Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement. Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient. Procédure 1. Dans la zone de navigation, ouvrez le dossier Gestion de systèmes. 2. Ouvrez Serveurs et sélectionnez le système approprié. 3. Dans la zone de contenu, cochez la case correspondant à la partition concernée. 4. Sélectionnez Tâches, Opérations, puis Gestion des voyants d'avertissement système. 5. Sélectionnez la partition appropriée. 6. Sélectionnez Désactivation du voyant d'avertissement système dans le menu Action. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Un message de vérification indique que le voyant d'avertissement système a été désactivé. v Un message indique que le système présente encore des incidents non résolus. v Un message indique que le voyant d'avertissement système ne peut pas être activé. 7. Sélectionnez l'une des partitions logiques dans le tableau du bas, puis sélectionnez Désactivation du voyant de partition dans le menu Opérations sur la partition. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Une vérification indique que le voyant de la partition logique a été désactivé. v Un message indique que la partition logique présente encore des incidents non résolus. v Un message indique que le voyant de la partition logique ne peut pas être activé. Activation ou désactivation d'un voyant d'identification : Pourquoi et quand exécuter cette tâche Le système comporte plusieurs voyants d'identification de composants, tels que les boîtiers ou les unités remplaçables sur site (FRU). On les appelle donc voyants d'identification. Vous pouvez activer ou désactiver les types de voyants d'identification suivants : v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique (boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le nouveau matériel. v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports ouverts sur plusieurs adaptateurs. Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme suit : Procédure 1. Dans la zone de navigation, ouvrez l'élément Gestion de systèmes. 2. Sélectionnez Serveurs. 3. Dans la zone de contenu, cochez la case correspondant au système concerné. 4. Sélectionnez Tâches, Opérations, Statut du voyant, puis Voyant d'identification. 110 Activation SAS (Serial-attached SCSI) RAID 5. Pour activer ou désactiver le voyant d'identification d'un boîtier, sélectionnez le boîtier dans le tableau, puis cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. 6. Pour activer ou désactiver le voyant d'identification d'une unité FRU, choisissez un boîtier dans le tableau, puis sélectionnez Sélectionné > Liste des FRU. 7. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. Affichage des événements réparables à l'aide de la console HMC Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et historique de maintenance. Pourquoi et quand exécuter cette tâche Pour pouvoir consulter des événements réparables et les informations qui leur sont associées, vous devez avoir l'un des rôles suivants : v Administrateur central v Technicien de maintenance v Opérateur v Responsable produit v Visionneur Pour afficher des événements réparables, procédez comme suit : Procédure 1. Dans la zone de navigation, sélectionnez Gestion de la maintenance. 2. Sélectionnez Gestion des événements réparables. 3. Sélectionnez les critères des événements réparables à visualiser, puis cliquez sur OK. La fenêtre Présentation d'un événement réparable s'affiche. La liste comporte les événements réparables correspondant à vos critères de sélection. La barre de menus vous propose des actions réalisables sur les événements réparables. 4. Sélectionnez une ligne dans la fenêtre Présentation d'un événement réparable, puis choisissez Sélectionné > Affichage des détails La fenêtre Détails de l'événement réparable affiche des informations détaillées sur l'événement réparable. Le tableau du haut comporte des informations telles que le numéro d'incident et le code de référence. Le tableau du bas indique les unités remplaçables sur site (FRU) associées à cet événement. 5. Sélectionnez l'erreur pour laquelle vous souhaitez consulter les commentaires et l'historique, puis procédez comme suit : a. Sélectionnez Actions > Visualisation des commentaires. b. Pour quitter les commentaires, cliquez sur Fermeture. c. Sélectionnez Actions > Affichage de l'historique de maintenance. La fenêtre Historique de maintenance comporte l'historique de maintenance associé à l'erreur sélectionnée. d. Pour quitter l'historique de maintenance, cliquez sur Fermeture. 6. Quand vous avez terminé, cliquez sur Annulation dans les fenêtres Détails de l'événement réparable et Présentation de l'événement réparable pour les refermer. Vérification du composant installé via la console SDMC Si vous avez installé ou remplacé un composant, utilisez la console Systems Director Management Console (SDMC) pour mettre à jour vos enregistrements SDMC lorsqu'une opération de maintenance est terminée sur le serveur. Si vous vous êtes servi de codes de référence, de symptômes ou de codes d'emplacement lors de l'opération, repérez les enregistrements à utiliser pendant la procédure. Procédures communes aux dispositifs installables 111 Pourquoi et quand exécuter cette tâche Pour vérifier un composant installé, procédez comme suit : Procédure 1. Depuis la console SDMC, recherchez dans le journal des événements les opérations de maintenance ouvertes. Pour plus de détails, voir «Visualisation d'événements réparables via la console de gestion Systems Director», à la page 113. 2. Des événements d'action de maintenance sont-ils ouverts ? Non : Si le voyant d'avertissement système est encore allumé, utilisez la console SDMC pour l'éteindre. Pour plus d'informations, voir «Activation/Désactivation de voyants via la console de gestion Systems Director». La procédure est terminée. Oui : Passez à l'étape suivante. 3. Notez la liste des événements d'action de maintenance ouverts. 4. Examinez les détails de chacun de ces événements. Les codes d'erreur associés à ces derniers sont-ils les mêmes que ceux que vous avez collectés auparavant ? v Non : Sélectionnez l'une des options suivantes : – Examinez les autres événements réparables pour en trouver un dont le code d'erreur correspond et passez à l'étape suivante. – Si le contenu du journal ne correspond pas à ce que vous avez collecté précédemment, contactez votre fournisseur de services. v Oui : Passez à l'étape suivante. 5. Dans la fenêtre Erreur associée à cet événement réparable, sélectionnez et mettez en évidence l'événement d'action de maintenance. 6. Cliquez sur Supprimer ou Ignorer. Remarque : Ces options ne sont disponibles qu'à partir du journal des événements. Activation/Désactivation de voyants via la console de gestion Systems Director Utilisez cette procédure pour activer ou désactiver des voyants via la console Systems Director Management Console. Pourquoi et quand exécuter cette tâche Choisissez l'une des options suivantes : v «Désactivation d'un voyant d'avertissement système ou de partition» v «Activation ou désactivation de voyants d'identification via la console SDMC», à la page 113 Désactivation d'un voyant d'avertissement système ou de partition : Pourquoi et quand exécuter cette tâche Vous pouvez désactiver un voyant d'avertissement système ou un voyant de partition logique. Par exemple, vous pouvez constater qu'un incident n'est pas prioritaire et peut être résolu ultérieurement. Cependant, si vous souhaitez être averti en cas d'incident, vous devez désactiver le voyant d'avertissement système pour qu'il puisse être activé quand un nouvel incident survient. Procédure 1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié. 2. Sélectionnez Actions > Service et support > Matériel > Voyant d'avertissement système. 3. Sélectionnez Désactivation du voyant d'avertissement système. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Un message de vérification indique que le voyant d'avertissement système a été désactivé. 112 Activation SAS (Serial-attached SCSI) RAID v Un message indique que le système présente encore des incidents non résolus. v Un message indique que le voyant d'avertissement système ne peut pas être activé. 4. Sélectionnez l'un des serveurs virtuels, puis sélectionnez Désactivation du voyant d'avertissement système. La fenêtre de confirmation qui s'affiche fournit les informations suivantes : v Un message de vérification indique que le voyant d'avertissement système a été désactivé. v Un message indique que la partition logique présente encore des incidents non résolus. v Un message indique que le voyant du serveur virtuel ne peut pas être activé. Activation ou désactivation de voyants d'identification via la console SDMC : Pourquoi et quand exécuter cette tâche Le système comporte différents voyants d'identification de composants, tels que les boîtiers ou les unités remplaçables sur site (FRU). On les appelle donc voyants d'identification. Vous pouvez activer ou désactiver les types de voyants d'identification suivants : v Voyant d'identification d'un boîtier : Si vous souhaitez ajouter un adaptateur dans un tiroir spécifique (boîtier), vous devez connaître les type, modèle et numéro de série (MTMS) du tiroir. Pour déterminer si vous disposez des informations MTMS appropriées du tiroir nécessitant le nouvel adaptateur, vous pouvez activer le voyant d'un tiroir et vérifier que le MTMS correspond au tiroir qui doit accueillir le nouveau matériel. v Voyant d'identification d'une unité FRU associée à un boîtier : Pour connecter un câble à un adaptateur d'E-S, vous pouvez activer le voyant de l'adaptateur s'il s'agit d'une unité FRU, puis vérifier sur le matériel où le câble doit être fixé. Cette opération est particulièrement utile en présence de ports ouverts sur plusieurs adaptateurs. Pour activer ou désactiver le voyant d'identification d'un boîtier ou d'une unité FRU, procédez comme suit : Procédure 1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié. 2. Sélectionnez Actions > Service et support > Matériel > Voyant d'identification. 3. Sous Voyant d'identification, sélectionnez la fenêtre Boîtier puis l'unité centrale ou le boîtier. 4. Pour activer ou désactiver un voyant d'identification, cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. 5. Pour activer ou désactiver le voyant d'identification d'une unité FRU, sélectionnez un système ou un boîtier dans le tableau, puis sélectionnez Liste des unités FRU. 6. Sélectionnez une ou plusieurs unités FRU dans le tableau, et cliquez sur Activation du voyant ou Désactivation du voyant. Le voyant associé s'allume ou s'éteint. Visualisation d'événements réparables via la console de gestion Systems Director Utilisez cette procédure pour afficher un événement réparable, y compris les détails, commentaires et historique de maintenance. Pourquoi et quand exécuter cette tâche Pour afficher des événements réparables, procédez comme suit : Procédure 1. Dans l'onglet Ressources, sélectionnez l'hôte ou le serveur virtuel approprié. 2. Sélectionnez Actions > Etat et santé du système > Journal des événements. 3. Facultatif : Vous pouvez limiter les critères de recherche d'événement à l'aide du menu de filtrage des événements. Procédures communes aux dispositifs installables 113 4. Sélectionnez une ligne dans la fenêtre des événements puis sélectionnez Actions > Propriétés La fenêtre Propriétés s'ouvre, affichant les informations détaillées relatives à l'événement réparables. Le tableau indique des informations telles que le numéro d'incident, le code de référence et les unités remplaçables sur site (FRU) associées à l'événement. Vérification d'un dispositif installé ou d'un composant remplacé sur un système ou une partition logique à l'aide des outils Virtual I/O Server Si vous avez installé un dispositif ou remplacé un composant, il est recommandé d'utiliser les outils Virtual I/O Server (VIOS) pour vérifier que ce dispositif ou composant est reconnu par le système ou la partition logique. Pourquoi et quand exécuter cette tâche Pour vérifier le fonctionnement d'un dispositif récemment installé ou d'un composant de remplacement, sélectionnez la procédure appropriée : v Vérification du dispositif installé via VIOS v Vérification du composant de remplacement via VIOS Vérifiez le dispositif installé via VIOS : 1. Connectez-vous en tant qu'utilisateur root. 2. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée. 3. Sélectionnez Routines de diagnostic avancé et appuyez sur Entrée. 4. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. 5. Dans le menu Sélection des tests de diagnostic, mode étendu, exécutez l'une des procédures suivantes : v Pour tester une seule ressource, sélectionnez la ressource que vous venez d'installer dans la liste des ressources et appuyez sur Entrée. v Pour tester toutes les ressources disponibles du système d'exploitation, sélectionnez Toutes les ressources et appuyez sur Entrée. 6. Sélectionnez Validation, puis attendez la fin de l'exécution des programmes de diagnostic en répondant aux invites éventuelles. 7. Les programmes de diagnostic ont-ils été exécutés complètement avant d'afficher le message Aucun incident n’a été détecté ? v Non : Si un code SRN (Service Request Number - numéro de demande d'intervention) ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou qu'un câble ne soit pas bien connecté. Passez en revue les procédures d'installation pour vérifier que le nouveau dispositif est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système fonctionne en mode de partitionnement logique, notez la partition logique sur laquelle le dispositif est installé. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Le nouveau dispositif est installé correctement. Quittez les programmes de diagnostic et revenez en mode de fonctionnement normal. Vérifiez le composant de remplacement via VIOS : Pour vérifier le fonctionnement d'un dispositif ou d'un composant de remplacement récemment installé, procédez comme suit : 1. Avez-vous remplacé le composant à l'aide de VIOS ou des opérations de remplacement à chaud du service de diagnostic en ligne ? Non : Passez à l'étape 2, à la page 115. 114 Activation SAS (Serial-attached SCSI) RAID Oui : Passez à l'étape 5. 2. Le système est-il hors tension ? Non : Passez à l'étape 4. Oui : Si le système prend en charge l'initialisation lente, configurez la fonction. Pour plus d'informations, reportez-vous à la rubrique Exécution d'une initialisation lente. 3. Démarrez le système et attendez que l'invite de connexion du système d'exploitation VIOS s'affiche ou que l'activité apparente du système sur le panneau de commande ou l'écran s'arrête. Avez-vous vu l'invite de connexion VIOS ? v Non : Si un code SRN ou un autre code de référence s'affiche, il est possible qu'un adaptateur ou un câble ne soit pas bien connecté. Passez en revue les procédures de remplacement pour vérifier que le nouveau composant est installé correctement. Si vous ne parvenez pas à corriger l'incident, rassemblez tous les codes SRN et autres codes de référence éventuels. Si le système ne démarre pas ou que l'invite de connexion ne s'affiche pas, consultez la rubrique : Incidents de chargement et de démarrage du système d'exploitation. Si le système est partitionné, notez la partition logique sur laquelle vous avez remplacé le composant. Pour obtenir de l'aide, contactez votre fournisseur de services. v Oui : Passez à l'étape 4. 4. A l'invite, tapez diag —a et appuyez sur Entrée pour vérifier s'il manque des ressources. Si une invite s'affiche, passez à l'étape 5. Si le menu Sélection des tests de diagnostic s'affiche avec un M en regard d'une ressource, procédez comme suit : a. Sélectionnez la ressource, puis appuyez sur Entrée. b. Sélectionnez Validation. c. Suivez les instructions affichées. d. Si un message vous demande si vous souhaitez revoir l'erreur affichée précédemment, cliquez sur Oui et appuyez sur Entrée. e. Si le système génère un code SRN, une carte ou un câble est probablement mal connecté. Si aucun incident évident n'apparaît, notez le code SRN et contactez votre fournisseur de services pour obtenir de l'aide. f. Si aucun code SRN ne s'affiche, passez à l'étape 5. 5. Testez le composant en effectuant les opérations suivantes : a. Sur la ligne de commande, tapez diagmenu et appuyez sur Entrée. b. Dans le menu Sélection des fonctions, sélectionnez Programmes de diagnostic, mode étendu et appuyez sur Entrée. c. Dans le menu Sélection du mode de diagnostic, sélectionnez Vérification du système et appuyez sur Entrée. d. Sélectionnez Toutes les ressources ou les programmes de diagnostic du composant pour tester uniquement le composant remplacé ou les périphériques qui y sont raccordées et appuyez sur Entrée. Le menu Action corrective sur ressource s'est-il affiché ? Non : Passez à l'étape 6. Oui : Passez à l'étape 7, à la page 116. 6. Le message Test terminé, aucun incident n'a été détecté s'est-il affiché ? v Non : Il existe encore un incident. Contactez votre fournisseur de services. La procédure est terminée. v Oui : Si l'incident ne figure pas dans le journal des erreurs, sélectionnez Consignation d'action corrective dans le menu Sélection des tâches pour mettre à jour le journal des erreurs AIX. Si la Procédures communes aux dispositifs installables 115 réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à l'action ne s'affiche pas dans la liste des ressources, sélectionnez sysplanar0 et appuyez sur Entrée. Conseil : Le voyant du composant passe de l'état Incident à l'état normal. Passez à l'étape 9. 7. Dans le menu Action corrective sur ressource, sélectionnez la ressource du composant remplacé. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Sur les systèmes équipés d'un voyant correspondant au composant défaillant, le voyant passe à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. L'écran Action corrective sur ressource s'est-il affiché de nouveau ? Non : Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. Oui : Passez à l'étape 8. 8. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource que vous venez de remplacer, si nécessaire. En cas de test réussi sur une ressource en mode de vérification du système, dont une entrée figure dans le journal des erreurs AIX, le menu Action corrective sur ressource apparaît. Pour mettre à jour le journal des erreurs AIX pour indiquer qu'un composant détectable par le système a été remplacé, procédez comme suit. Remarque : Le voyant du composant passe de l'état Incident à l'état normal. a. Dans le menu Action corrective sur ressource, sélectionnez le composant parent ou enfant de la ressource remplacée. Si la réparation consistait à remettre en place un câble ou une carte, sélectionnez la ressource correspondante. Si la ressource associée à votre action n'apparaît pas dans la liste des ressources, sélectionnez sysplanar0. Appuyez sur Entrée. b. Sélectionnez Validation une fois les sélections effectuées. c. Si l'écran Aucun incident détecté apparaît, passez à l'étape 9. 9. Si vous avez modifié les paramètres du processeur de service ou du réseau (voir procédures précédentes), restaurez la valeur initiale des paramètres. 10. Avez-vous exécuté des procédures de remplacement à chaud avant cette procédure ? Non : Passez à l'étape 11. Oui : Passez à l'étape 12. 11. Démarrez le système d'exploitation (système ou partition logique en mode normal). Avez-vous pu démarrer le système d'exploitation ? Non : Contactez votre fournisseur de services. La procédure est terminée. Oui : Passez à l'étape 12. 12. Les voyants sont-ils encore allumés ? v Non. La procédure est terminée. v Oui : Désactivez les voyants. Pour plus d'informations, consultez l'une de ces rubriques : Modification des indicateurs de service 116 Activation SAS (Serial-attached SCSI) RAID Installation d'un composant via la console SDMC Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses tâches de maintenance, y compris l'installation d'une nouvelle unité remplaçable sur site (FRU) ou d'un nouveau composant. Avant de commencer Pour utiliser l'interface utilisateur de la console SDMC pour installer un composant sur un système ou une unité d'extension géré(e) par une console SDMC, procédez comme suit : Procédure 1. Dans la zone des ressources Power Systems sélectionnez le système sur lequel vous voulez installer un composant. 2. Dans le menu Actions, développez Service and Support > Matériel > Tâches de modification de matériel > Ajout de FRU. 3. Dans l'onglet d'ajout de FRU, sélectionnez le système ou le type de boîtier. 4. Sélectionnez dans le menu le type de FRU que vous souhaitez installer et cliquez sur Suivant. 5. Sélectionnez le code d'emplacement pour l'installation du composant et cliquez sur Ajouter. 6. Une fois l'unité remplaçable sur site (FRU) placée dans la catégorie des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'installation du composant. Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour l'installation du composant. Dans ce cas, suivez les instructions pour installer le composant. Remplacement d'un composant via la console SDMC Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses tâches de maintenance, y compris le remplacement d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Si vous procédez à un échange de composant dans le but de solutionner un événement réparable, suivez ces instructions. Procédure 1. Dans la zone des ressources Power Systems, sélectionnez le système sur lequel vous souhaitez remplacer un composant. 2. Sélectionnez l'une des options suivantes : v Si vous remplacez un composant en dehors du cadre d'une opération de maintenance, depuis le menu Actions, développez Service and Support > Matériel > Remplacement de FRU. v Si vous procédez à un échange de composant dans le but de corriger un événement réparable, voir Démarrage d'une réparation. 3. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Remplacement de FRU. 4. Dans l'onglet Remplacement de FRU, sélectionnez l'unité centrale ou d'extension sur laquelle vous souhaitez remplacer un composant. 5. Sélectionnez dans le menu le type de composant que vous souhaitez remplacer et cliquez sur Suivant. 6. Sélectionnez dans le menu le code d'emplacement du composant que vous souhaitez remplacer et cliquez sur Ajouter. Procédures communes aux dispositifs installables 117 7. Une fois l'unité remplaçable sur site (FRU) placée dans la fenêtre des actions en attente, cliquez sur Lancer la procédure et suivez les instructions d'installation du composant. Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le remplacement du composant. Dans ce cas, suivez les instructions pour remplacer le composant. Retrait d'un composant via la console SDMC Vous pouvez utiliser la console SDMC (Systems Director Management Console) pour effectuer diverses tâches de maintenance, y compris le retrait d'une unité remplaçable sur site (FRU) ou d'un composant. Pourquoi et quand exécuter cette tâche Pour utiliser l'interface utilisateur de la console SDMC pour retirer un composant sur un système ou une unité d'extension géré(e) par une console SDMC, procédez comme suit : Procédure 1. Dans la zone des ressources Power Systems, sélectionnez le système duquel vous souhaitez retirer un composant. 2. Sélectionnez l'une des options suivantes : v Si vous installez un nouveau composant dans le cadre d'une mise à niveau, depuis le menu Actions développez Service and Support > Matériel > Tâches de modification de matériel. v Si vous retirez un composant dans le cadre d'une opération de maintenance, depuis le menu Actions développez Service and Support > Matériel > Remplacement de FRU. 3. Pour installer un nouveau composant dans le cadre d'une mise à niveau, sélectionnez Ajout de FRU. Pour remplacer un composant dans le cadre d'une opération de maintenance, sélectionnez Remplacement de FRU. 4. Dans la fenêtre d'ajout ou de remplacement de FRU, sélectionnez le système ou l'unité d'extension duquel ou de laquelle vous souhaitez retirer le composant. 5. Sélectionnez dans la liste le type de composant que vous souhaitez retirer et cliquez sur Suivant. 6. Sélectionnez l'emplacement du composant, puis cliquez sur Ajouter. 7. Une fois que le composant figure dans la fenêtre des actions en attente, cliquez sur Lancer la procédure et suivez les instructions de retrait du composant. Remarque : Il se peut que la console SDMC ouvre une fenêtre d'instructions pour le retrait du composant. Dans ce cas, suivez ces instructions pour retirer le composant. 118 Activation SAS (Serial-attached SCSI) RAID Remarques Le présent document peut contenir des informations ou des références concernant certains produits, logiciels ou services du fabricant non annoncés dans ce pays. Le présent document peut contenir des informations ou des références concernant certains produits, logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier lui-même le fonctionnement de tout produit, programme ou service. Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences, veuillez en faire la demande par écrit au fabricant. Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas applicable. Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement. Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels décrits dans ce document. Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites relève de votre seule responsabilité. Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation de sa part, tout ou partie des informations qui lui seront fournies. Les données de performance indiquées dans ce document ont été déterminées dans un environnement contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont applicables à leur environnement d'exploitation. Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces produits. 119 Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif. Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans préavis. Les tarifs appliqués peuvent varier selon les revendeurs. Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées avant la mise à disposition des produits décrits. Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite. Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur n'apparaissent pas à l'écran. Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites, même partiellement, sans l'autorisation écrite d'IBM. Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est exploitable dans aucun autre but. Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension, de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés. Restrictions liées à l'utilisation de connexion Ethernet Ce produit n'est pas destiné à être connecté, directement ou indirectement, par quelque moyen que ce soit, à des interfaces de réseaux de télécommunications publiques. Marques IBM, le logo IBM et ibm.com sont des marques d'International Business Machines aux Etats-Unis et/ou dans certains autres pays. Les autres noms de produits et de services peuvent être des marques d'IBM ou d'autres sociétés. La liste actualisée de toutes les marques d'IBM est disponible sur la page Web Copyright and trademark information à http://www.ibm.com/legal/copytrade.shtml. Linux est une marque déposée de Linus Torvalds aux Etats-Unis et/ou dans certains autres pays. Les autres noms de produits et de services peuvent être des marques d'IBM ou d'autres sociétés. Bruits radioélectriques Lorsque vous connectez un moniteur à l'équipement, vous devez utiliser le câble fourni à cet effet, ainsi que toute unité de suppression des interférences. 120 Activation SAS (Serial-attached SCSI) RAID Remarques sur la classe A Les avis de conformité de classe A suivants s'appliquent aux serveurs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes. Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. IBM ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones commerciales et industrielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Remarques 121 Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus : Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI (Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle, il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - République populaire de Chine Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. 122 Activation SAS (Serial-attached SCSI) RAID Avis d'interférences électromagnétiques (EMI) - Taïwan Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus. Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer. Avis d'interférences électromagnétiques (EMI) - Corée Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Remarques 123 Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Avis d'interférences électromagnétiques (EMI) - Russie Remarques sur la classe B Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones résidentielles. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière. Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet incident en appliquant une ou plusieurs des mesures suivantes : v Réorienter ou repositionner l'antenne de réception. v Eloigner l'appareil du récepteur. 124 Activation SAS (Serial-attached SCSI) RAID v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct. v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de l'aide. Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones résidentielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Remarques 125 Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - Corée Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 126 Activation SAS (Serial-attached SCSI) RAID IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Dispositions Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes. Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise, ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données, logiciels ou droits de propriété intellectuelle contenus dans ces publications. Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les instructions susmentionnées n'ont pas été respectées. Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à l'exportation. LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Disk drives REFERENCE 86 A1 14FG 01 ESCALA Power7 Disk drives This publication concerns the following models: - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) References to Power 755 / 8236-E8C models are irrelevant. Hardware September 2010 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A1 14FG 01 The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull SAS 2007-2010 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. The information in this document is subject to change without notice. Bull will not be liable for errors ontained herein, or for incidental or consequential damages in connection with the use of this material. c Contents Safety notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Disk drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 What's new in Disk drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Installing a disk drive or solid-state drive . . . . . . . . . . . . . . . . . . . . . 5 Solid-state drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Installing a disk drive or solid-state drive with the AIX system or logical partition powered on . . . . . . . 10 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 Complete the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 Installing a disk drive or solid-state drive with the Linux system or logical partition powered on . . . . . . 17 Installing a disk drive using Linux kernel 2.4 . . . . . . . . . . . . . . . . . . . . . . . 19 Installing a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Installing the disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . 25 Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Verify that the new disk drive is operational . . . . . . . . . . . . . . . . . . . . . . 30 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 Installing a disk drive or solid-state drive with the Virtual I/O Server system or logical partition powered on . . 31 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on. . . . . . . 37 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Prepare to install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 Configuring a disk drive in an system or logical partition. . . . . . . . . . . . . . . . . . . 45 Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 47 Determining the IBM i disk drive protection status . . . . . . . . . . . . . . . . . . . . . 48 Installing a disk drive or solid-state drive with the system or logical partition power turned off . . . . . . . 49 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 Install the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 Removing a disk drive or solid-state drive . . . . . . . . . . . . . . . . . . . . 57 Removing a disk drive from a system or expansion unit controlled by IBM i . . . . . . . . . . . . . 57 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 Preparing to remove a disk drive from a system or logical partition controlled by IBM i. . . . . . . . . 61 Replacing a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 Replacing a disk drive with the AIX system or logical partition powered on. . . . . . . . . . . . . . 63 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 Remove a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65 Replace a disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 Complete the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 Replacing a disk drive with the Linux system or logical partition powered on . . . . . . . . . . . . . 72 Replacing the disk drive using Linux kernel 2.4 . . . . . . . . . . . . . . . . . . . . . . 74 Prepare to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 74 Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 © Copyright IBM Corp. 2010 iii Replace the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 Replacing the disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . 83 Prepare to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 83 Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 Replace the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Perform the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 Replacing a disk drive with the IBM i system or logical partition powered on . . . . . . . . . . . . . 89 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 Replace the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94 Complete the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 Replacing a disk drive using hot spare in an IBM i system or logical partition . . . . . . . . . . . . . 97 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Remove the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Perform the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Replacing a disk drive with the system or logical partition powered off. . . . . . . . . . . . . . . 106 Prepare the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Complete the procedure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 Replacing a SCSI disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114 Remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114 Replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116 Perform the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Removing or installing a disk-drive filler. . . . . . . . . . . . . . . . . . . . . 121 Removing a disk-drive filler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121 Installing a disk-drive filler. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123 Disk drive locations and concurrent maintenance lights . . . . . . . . . . . . . . 125 PCIe RAID and SSD SAS adapter locations . . . . . . . . . . . . . . . . . . . . . . . . 125 Disk drive locations and concurrent maintenance lights for an 8202-E4B or 8205-E6B system . . . . . . . . 125 Disk drive locations and concurrent maintenance lights for 8231-E2B. . . . . . . . . . . . . . . . 127 Disk drive locations and concurrent maintenance lights for 8233-E8B and 8236-E8C . . . . . . . . . . . 129 Disk drive locations and concurrent maintenance lights for models 9117-MMB or 9179-MHB. . . . . . . . 130 Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure . . . . . . . . . . . 131 Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133 Disk drive locations and concurrent maintenance lights for the 5802 expansion unit. . . . . . . . . . . 135 Related information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 Preparing a disk drive for use in an AIX system or logical partition . . . . . . . . . . . . . . . . 137 Preparing to remove a disk drive from a system or expansion unit controlled by AIX . . . . . . . . . . 137 Rebuilding data on a replacement disk drive using AIX . . . . . . . . . . . . . . . . . . . . 137 Rebuilding data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . . 137 Rebuilding data on a disk drive using Linux kernel 2.4 . . . . . . . . . . . . . . . . . . . 138 Rebuilding data on a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . 139 Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 140 Configuring a disk drive in an system or logical partition . . . . . . . . . . . . . . . . . . 141 Preparing to remove a disk drive from a system or logical partition controlled by IBM i . . . . . . . . 143 Determining the IBM i disk drive protection status. . . . . . . . . . . . . . . . . . . . . 144 Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system. . . . . 145 Internal disk drive sharing on model 8202-E4B or 8205-E6B system. . . . . . . . . 147 Internal disk drive sharing on an 8233-E8B or 8236-E8C . . . . . . . . . . . . . . 149 iv Disk drives Internal disk drive sharing in model 9117-MMB or 9179-MHB . . . . . . . . . . . . 151 Installing the external SAS port on model 8202-E4B or 8205-E6B . . . . . . . . . . 153 Installing the external SAS port on model 8231-E2B. . . . . . . . . . . . . . . . 155 Installing the external SAS port on an 8233-E8B or 8236-E8C. . . . . . . . . . . . 157 Installing the external SAS port on a 9117-MMB or 9179-MHB . . . . . . . . . . . 165 Replacing a SAS conduit card in a 5802 expansion unit with power off . . . . . . . 167 Removing a SAS conduit card from a 5802 expansion unit with power off . . . . . . 175 Replacing a SAS expander card in a 5802 expansion unit with power off . . . . . . 183 Removing a SAS expander card from a 5802 expansion unit with power off . . . . . 185 Common procedures for installable features . . . . . . . . . . . . . . . . . 187 Before you begin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189 Identifying a failing part. . . . . . . . . . . . . . . . . . . . . . . . . . . . 193 Control panel LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193 Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . . 194 Locating a failing part in an AIX system or logical partition. . . . . . . . . . . . . . . . . . 194 Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 194 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 195 Identifying a failing part in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 195 Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . . 195 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 196 Identifying a failing part in a Linux system or logical partition. . . . . . . . . . . . . . . . . . 196 Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 196 Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . . 197 Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 197 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 197 Locating a failing part in a Virtual I/O Server system or logical partition . . . . . . . . . . . . . . 198 Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . 198 Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . 199 Starting a system that is not managed by a Hardware Management Console . . . . . . . . . . . . . 199 Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . . 200 Stopping a system or logical partition. . . . . . . . . . . . . . . . . . . . . . 201 Stopping a system that is not managed by a Hardware Management Console. . . . . . . . . . . . . 201 Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . . 202 Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 205 Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 205 Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 205 System covers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207 Removing and replacing covers for the 8202-E4B or 8205-E6B . . . . . . . . . . . . . . . . . . 207 Removing the front cover on a rack-mounted 8202-E4B or 8205-E6B system . . . . . . . . . . . . 207 Removing the front cover on a stand-alone 8202-E4B or 8205-E6B system . . . . . . . . . . . . . 207 Installing the front cover on a rack-mounted 8202-E4B or 8205-E6B system. . . . . . . . . . . . . 208 Contents v Installing the front cover and front door on a stand-alone 8202-E4B or 8205-E6B system . . . . . . . . 209 Removing the service access cover on a rack-mounted 8202-E4B or 8205-E6B system . . . . . . . . . 210 Removing the service access cover on a stand-alone 8202-E4B or 8205-E6B system . . . . . . . . . . 211 Installing the service access cover on a rack-mounted 8202-E4B or 8205-E6B system. . . . . . . . . . 213 Installing the service access cover on a stand-alone 8202-E4B or 8205-E6B system. . . . . . . . . . . 213 Removing and replacing covers for the 8231-E2B . . . . . . . . . . . . . . . . . . . . . . 215 Removing the service access cover for the 8231-E2B . . . . . . . . . . . . . . . . . . . . 215 Installing the service access cover for the 8231-E2B. . . . . . . . . . . . . . . . . . . . . 216 Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . . . . . . . . . 217 Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . 217 Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 218 Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . 219 Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 220 Removing and replacing covers for the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . 221 Removing the front cover from the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . 221 Installing the front cover on the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . . 222 Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225 Placing a rack-mounted 8231-E2B system into the service position. . . . . . . . . . . . . . . . . 225 Placing the rack-mounted 8231-E2B system into the operating position . . . . . . . . . . . . . . . 226 Placing a rack-mounted 8202-E4B or 8205-E6B system into the service position . . . 227 Placing the rack-mounted 8202-E4B or 8205-E6B system into the operating position 229 Service and operating positions . . . . . . . . . . . . . . . . . . . . . . . . 231 Placing the 8202-E4B or 8205-E6B system into the service or operating position . . . . . . . . . . . . 231 Placing a rack-mounted 8202-E4B or 8205-E6B system into the service position . . . . . . . . . . . 231 Placing the rack-mounted 8202-E4B or 8205-E6B system into the operating position . . . . . . . . . . 232 Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 234 Placing a rack-mounted 8231-E2B system into the service position. . . . . . . . . . . . . . . . 234 Placing the rack-mounted 8231-E2B system into the operating position . . . . . . . . . . . . . . 235 Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 236 Placing a 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . 236 Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . 239 Power cords. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241 Disconnecting the power cords from the 8202-E4B or 8205-E6B system . . . . . . . . . . . . . . . 241 Connecting the power cords to the 8202-E4B or 8205-E6B system . . . . . . . . . . . . . . . . . 242 Disconnecting the power cords from the 8231-E2B . . . . . . . . . . . . . . . . . . . . . . 243 Connecting the power cords to the 8231-E2B . . . . . . . . . . . . . . . . . . . . . . . . 244 Installing a feature by using the Hardware Management Console . . . . . . . . . . 247 Removing a part by using the Hardware Management Console . . . . . . . . . . . . . . . . . 247 Exchanging a part by using the Hardware Management Console . . . . . . . . . . . . . . . . . 247 Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . 249 Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . . 249 Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 251 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 252 Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 252 Verifying an installed part by using stand-alone diagnostics. . . . . . . . . . . . . . . . . . . 253 Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . . 254 Activating and deactivating LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . 255 Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 255 Activating or deactivating identify LED . . . . . . . . . . . . . . . . . . . . . . . 255 Viewing serviceable events. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256 Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools 256 vi Disk drives Verifying a repair. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261 Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262 Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . . 265 Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267 Verifying the repair from the HMC . . . . . . . . . . . . . . . . . . . . . . . . . . . 267 Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269 Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . . 273 Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . . 277 Activating and deactivating LEDs. . . . . . . . . . . . . . . . . . . . . . . . 283 Deactivating a system attention LED or partition LED using the HMC . . . . . . . . . . . . . . . 283 Activating or deactivating an identify LED using the HMC . . . . . . . . . . . . . . . . . . . 284 Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface 284 Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . . 285 Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 287 Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288 Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288 Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288 Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 292 Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 295 Contents vii viii Disk drives Safety notices Safety notices may be printed throughout this guide: v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to people. v CAUTION notices call attention to a situation that is potentially hazardous to people because of some existing condition. v Attention notices call attention to the possibility of damage to a program, device, system, or data. World Trade safety information Several countries require the safety information contained in product publications to be presented in their national languages. If this requirement applies to your country, a safety information booklet is included in the publications package shipped with the product. The booklet contains the safety information in your national language with references to the U.S. English source. Before using a U.S. English publication to install, operate, or service this product, you must first become familiar with the related safety information in the booklet. You should also refer to the booklet any time you do not clearly understand any safety information in the U.S. English publications. German safety information Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der Bildschirmarbeitsverordnung geeignet. Laser safety information IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs. Laser compliance IBM servers may be installed inside or outside of an IT equipment rack. © Copyright IBM Corp. 2010 ix DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER x Disk drives Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Safety notices xi CAUTION: Removing components from the upper positions in the rack cabinet improves rack stability during relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a room or building: v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you received it. If this configuration is not known, you must observe the following precautions: – Remove all devices in the 32U position and above. – Ensure that the heaviest devices are installed in the bottom of the rack cabinet. – Ensure that there are no empty U-levels between devices installed in the rack cabinet below the 32U level. v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from the suite. v Inspect the route that you plan to take to eliminate potential hazards. v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the documentation that comes with your rack cabinet for the weight of a loaded rack cabinet. v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.). v Ensure that all devices, shelves, drawers, doors, and cables are secure. v Ensure that the four leveling pads are raised to their highest position. v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement. v Do not use a ramp inclined at more than 10 degrees. v When the rack cabinet is in the new location, complete the following steps: – Lower the four leveling pads. – Install stabilizer brackets on the rack cabinet. – If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest position to the highest position. v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent. Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the pallet. (R002) (L001) (L002) xii Disk drives (L003) or All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive, DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information: v Do not remove the covers. Removing the covers of the laser product could result in exposure to hazardous laser radiation. There are no serviceable parts inside the device. v Use of the controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation exposure. (C026) Safety notices xiii CAUTION: Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. (C027) CAUTION: This product contains a Class 1M laser. Do not view directly with optical instruments. (C028) CAUTION: Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following information: laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. (C030) Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: The equipment is suitable for installation in the following: v Network telecommunications facilities v Locations where the NEC (National Electrical Code) applies The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect these interfaces metallically to OSP wiring. Note: All Ethernet cables must be shielded and grounded at both ends. The ac-powered system does not require the use of an external surge protection device (SPD). The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal shall not be connected to the chassis or frame ground. xiv Disk drives Disk drives Learn about installing, removing, and replacing a disk drive. © Copyright IBM Corp. 2010 1 2 Disk drives What's new in Disk drives Read about new or changed information in Disk drives since the previous update of this topic collection. September 2010 The following updates are made to the content: v Added information for Internal disk drive sharing on model 8231-E2B . v Added information for Internal disk drive sharing on model 8202-E4B or 8205-E6B . v Added information for “Disk drive locations and concurrent maintenance lights for 8231-E2B” on page 127. v Added information for “Disk drive locations and concurrent maintenance lights for an 8202-E4B or 8205-E6B system” on page 125. v Added information for “PCIe RAID and SSD SAS adapter locations” on page 125. v Added a procedure for “Installing the external SAS port on model 8231-E2B” on page 155. v Added a procedure for “Installing the external SAS port on model 8202-E4B or 8205-E6B” on page 153. March 2010 The following updates are made to the content: v Added information for “Disk drive locations and concurrent maintenance lights for models 9117-MMB or 9179-MHB” on page 130. v Added a procedure for “Internal disk drive sharing in model 9117-MMB or 9179-MHB” on page 151. February 2010 v Added information for servers that contain the POWER7™ processor. © Copyright IBM Corp. 2010 3 4 Disk drives Installing a disk drive or solid-state drive You can install a Small Computer System Interface (SCSI), serial-attached SCSI (SAS) disk drive, or solid-state drive (SSD) in the server or expansion unit. Use the procedures in this topic to perform this task. The system or logical partition can either be turned off or turned on when you install the disk drive. Identify the system for performing the installation or an upgrade. Turn on the System Identify (BLUE) LED. Solid-state drives To install and configure solid-state drives (SSD), you must review the supported machine type model, adapter, and configuration details. Solid®-state drives, also known as flash drives, follow many of the same rules as a regular hard disk drive (HDD). For example, SSDs physically resemble, install in the same manner, and in most of the same slots as HDDs. However, some restrictions and configuration rules that apply to SSD are different from HDD. The following tables can help you determine which operating system, adapter, machine type model, and disk drive configuration rules apply to your system. After you have reviewed the configuration rules, you can install the drive as you would any other disk drive. For more information, see “Installing a disk drive or solid-state drive.” Restriction: Solid-state drives must be part of a RAID array or System Mirroring (IBM i). Select the operating system for the server or partition where you want to install drives from the following list: v Solid-state drive rules for the AIX® operating system v Solid-state drive rules for the Linux® operating system v Solid-state drive rules for the IBM i operating system Attention: Before installing SSD into your system, you must ensure that you have all the prerequisite updates for both the SSD and the adapters. To check the prerequisites, go to the IBM Prerequisite website: http://www-912.ibm.com/e_dir/eServerPrereq.nsf. Table 1. Solid-state drive rules for the AIX operating system Machine type model Adapter Mixing rules 5886 v (5904) - PCI-X DDR 1.5GB Cache SAS RAID Adapter v (5906 and 5908) - PCI-X DDR 1.5GB Cache SAS RAID Adapter (BSC) v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v Solid-state drives and hard disk drives cannot be mixed in this drive enclosure. v A maximum of eight drives are supported in this configuration. v Only one 5886 can be attached to the adapter, or pair of adapters, when solid-state drives are installed. v Same rules apply when using dual SAS Raid adapters connected to 5886. © Copyright IBM Corp. 2010 5 Table 1. Solid-state drive rules for the AIX operating system (continued) Machine type model Adapter Mixing rules 8233-E8B and 8236-E8C v (5679) - SAS RAID Enablement: internal adapter v (5904) - PCI-X DDR 1.5GB Cache SAS RAID Adapter v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD can never be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk drive backplane if the backplane is not split. v SSD and HDD can both be present in a split backplane if each side of the split backplane has only SSD or HDD. Restriction: Split backplane for these models is not supported with dual SAS Raid adapters. Remember: SSD must be on one side of the split backplane and HDD must be on the other side of the split backplane. SSD and HDD drive types cannot be mixed within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. 9117-MMB and 9179-MHB v (5652) - Disk/Media Backplane v (5662) - 175 MB Cache RAID - Dual IOA Enablement Card v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk-drive backplane. Remember: You cannot mix SSD and HDD drive types within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. 5802 (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same system disk drive backplane. v SSD and HDD can both be present in a split backplane if each side of the split backplane has only SSD or HDD. SSD must be on one side of the split backplane and HDD must be on the other side of the split backplane. v SSD and HDD cannot be mixed on a pair of 5903 adapters. v A maximum of nine SSDs are supported on a pair of 5903 adapters. 6 Disk drives Table 2. Solid-state drive rules for the Linux operating system Machine type model Adapter Mixing rules 5886 v (5904) - PCI-X DDR 1.5GB Cache SAS RAID Adapter v (5906 and 5908) - PCI-X DDR 1.5GB Cache SAS RAID Adapter (BSC) v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v Solid-state drives and hard disk drives cannot be mixed in this drive enclosure. v A maximum of eight drives are supported in this configuration. v Only one 5886 can be attached to the adapter, or pair of adapters, when solid-state drives are installed. v Same rules apply when using dual SAS Raid adapters connected to 5886. 8233-E8B and 8236-E8C v (5679) - SAS RAID Enablement: internal adapter v (5904) - PCI-X DDR 1.5GB Cache SAS RAID Adapter v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD can never be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk drive backplane if the backplane is not split. v SSD and HDD can both be present in a split backplane if each side of the split backplane has only SSD or HDD. Restriction: Split backplane for these models is not supported with dual SAS Raid adapters. Remember: SSD must be on one side of the split backplane and HDD must be on the other side of the split backplane. SSD and HDD drive types cannot be mixed within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. 9117-MMB and 9179-MHB v (5652) - Disk/Media Backplane v (5662) - 175 MB Cache RAID - Dual IOA Enablement Card v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk-drive backplane. Remember: You cannot mix SSD and HDD drive types within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. Installing a disk drive or solid-state drive 7 Table 2. Solid-state drive rules for the Linux operating system (continued) Machine type model Adapter Mixing rules 5802 (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same system disk drive backplane. v SSD and HDD can both be present in a split backplane if each side of the split backplane has only SSD or HDD. SSD must be on one side of the split backplane and HDD must be on the other side of the split backplane. v SSD and HDD cannot be mixed on a pair of 5903 adapters. v A maximum of nine SSDs are supported on a pair of 5903 adapters. Table 3. Solid-state drive rules for the IBM i operating system Machine type model Adapter Mixing rules 5886 v (5904) - PCI-X DDR 1.5GB Cache SAS RAID Adapter v (5906 and 5908) - PCI-X DDR 1.5GB Cache SAS RAID Adapter (BSC) v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v Solid-state drives and hard disk drives cannot be mixed in this drive enclosure. v A maximum of eight drives are supported in this configuration. v Only one 5886 can be attached to the adapter, or pair of adapters, when solid-state drives are installed. v Same rules apply when using dual SAS Raid adapters connected to 5886. 8233-E8B and 8236-E8C v (5679) - SAS RAID Enablement: internal adapter v (5904) - PCI-X DDR 1.5GB Cache SAS RAID Adapter v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk-drive backplane. Remember: You cannot mix SSD and HDD drive types within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. 8 Disk drives Table 3. Solid-state drive rules for the IBM i operating system (continued) Machine type model Adapter Mixing rules 9117-MMB and 9179-MHB v (5652) - Disk/Media Backplane v (5662) - 175 MB Cache RAID - Dual IOA Enablement Card v (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk-drive backplane. Remember: You cannot mix SSD and HDD drive types within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. 5802 (5903) - PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same system disk drive backplane. v SSD and HDD can both be present in a split backplane if each side of the split backplane has only SSD or HDD. SSD must be on one side of the split backplane and HDD must be on the other side of the split backplane. v SSD and HDD cannot be mixed on a pair of 5903 adapters. v A maximum of nine SSDs are supported on a pair of 5903 adapters. Installing a disk drive or solid-state drive 9 Related tasks “Installing a disk drive or solid-state drive with the AIX system or logical partition powered on” You can install a disk drive or solid-state drive in a system or expansion unit while the AIX operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this section to perform this task. “Installing a disk drive or solid-state drive with the Linux system or logical partition powered on” on page 17 You might need to install a disk drive or solid-state drive in a system or expansion unit while the Linux operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this section to perform this task. “Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on” on page 37 Learn how to install a disk drive or solid-state drive in a system or expansion unit while the IBM i operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this information to perform this task. “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on page 49 Learn how to install a disk drive in a system or logical partition that controls the disk drive location with the power turned off. Use the procedure in this information to perform this task. Related information Cabling the 5802 disk subsystem Installing a disk drive or solid-state drive with the AIX system or logical partition powered on You can install a disk drive or solid-state drive in a system or expansion unit while the AIX operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this section to perform this task. Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with the AIX system or logical partition powered on” on page 63. This procedure is intended for installing a new or upgraded disk drive. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following Web sites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Microcode downloads. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the server. For information about using the HMC to install a feature, see “Installing a feature by using the Hardware Management Console” on page 247. If you do not have an HMC, complete the following steps to install a disk drive from the system or partition that controls the disk drive. If you are installing Solid State Drives, review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. 10 Disk drives DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: The powered-on procedure is a complex set of steps that must be performed precisely as instructed to avoid potential data loss or system damage. You might also consider the powered-off procedure, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on page 49. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack. 3. If applicable, remove the front cover. 4. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and concurrent maintenance lights” on page 125. Installing a disk drive or solid-state drive 11 Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can choose a different disk drive placement depending on your data protection strategy. 5. Record the position where the new disk drive will be installed. For example, the next available disk drive slot might be P3-D4. 6. Perform the following steps to install a disk drive. Install the disk drive 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. Press Enter to continue. 4. On the Function Selection display, select Task Selection. 5. On the Tasks Selection display, select Hot Plug Task. 6. Select SCSI and SCSI RAID Hot Plug Manager. 7. Select Attach a Device to a SCSI Hot Swap Enclosure Device. A list of empty slots in the SCSI hot swap enclosure device is shown. 8. Select the slot where you want to install the disk drive and press Enter. The disk drive slot enters the Identify state. The concurrent maintenance light for the slot begins flashing. To view the locations of the concurrent maintenance lights, select the model into which you are installing a disk drive. For a list of models, see “Disk drive locations and concurrent maintenance lights” on page 125. 9. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then move on to step Insert the disk drive. Note: The disk drive filler is a single piece that does not attach to the new disk drive. Important: . Pull the disk-drive filler from the slot that you want to use, as shown in Figure 1 on page 13 Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 12 Disk drives 10. If the slot you want to use contains a disk-drive filler, complete the following step. Remove the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure. 11. If the slot you want to use contains a disk-drive filler, complete the following step. Loosen, and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of the filler. Figure 1. Removing a disk-drive filler from a model . Figure 2. Removing a disk-drive bezel from a disk-drive filler Installing a disk drive or solid-state drive 13 12. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 13. Remove the disk drive from the static-protective package. 14. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive will not slide into the system or expansion unit. 15. If the disk drive you are installing does not have a bezel preinstalled, then complete the following step. Attach the disk-drive bezel (A) to the disk drive, as shown in the following figure. 16. If the disk drive you are installing does not have a bezel preinstalled, then complete the following step. Press in and then tighten the thumbscrew (B) to secure the bezel (A) to the disk drive. 17. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. 18. Slide the disk drive into the system unit until it stops, as shown in Figure 4 on page 15. 19. Push the disk drive handle in until it locks. The light on the disk drive will turn on. Figure 3. Installing a disk-drive bezel on a disk drive 14 Disk drives Figure 4. Installing a disk drive in a model Figure 5. Installing a small-form factor disk drive Installing a disk drive or solid-state drive 15 20. Press Enter. 21. If you are installing more than one disk drive, repeat step 1 on page 12 through step 20 until all disk drives are installed. 22. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu. 23. Select Configure Added/Replaced Devices. 24. Exit to the command line. Complete the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit, or rack door. 3. Verify the installed part. For instructions, see “Verifying an installed feature or replaced part in an AIX system or logical partition” on page 249. 4. To prepare the newly installed disk drive for use, go to “Preparing a disk drive for use in an AIX system or logical partition” on page 137. Figure 6. Installing a small-form factor disk drive in the 5803 expansion unit 16 Disk drives Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems™ servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Installing a disk drive or solid-state drive with the Linux system or logical partition powered on You might need to install a disk drive or solid-state drive in a system or expansion unit while the Linux operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this section to perform this task. Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with the Linux system or logical partition powered on” on page 72. This procedure is intended for installing a new or upgraded disk drive. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following Web sites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Microcode downloads. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the server. For information about using the HMC to install a feature, see “Installing a feature by using the Hardware Management Console” on page 247. If you do not have an HMC, complete the following steps to install a disk drive from the system or partition that controls the disk drive. If you are installing Solid State Drives, review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. Installing a disk drive or solid-state drive 17 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: Be sure you read these instructions carefully and understand the complexity of the powered-on procedure before you begin. If any of the following steps apply to you, go to “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on page 49. v You are installing a disk drive that will be connected with the 6203 or 6204 SCSI adapter. v You do not feel comfortable using the powered-on procedure. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Prepare the system 1. Perform the prerequisite tasks as described in “Before you begin” on page 189. 2. If applicable, remove the system unit door, expansion unit door, or the rack door. 3. If applicable, remove the front cover. v If you have a front cover from any other models, remove the front cover. 18 Disk drives 4. Look at the front of the system or expansion unit and determine the next available disk drive position. For information, see “Disk drive locations and concurrent maintenance lights” on page 125. Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can choose a different disk drive placement depending on your data protection strategy. 5. Record the position where the new disk drive will be installed. For example, the next available disk drive position might be P3-D4. Install a SCSI disk drive in a system or expansion unit using Linux kernel 2.4 or Linux kernel 2.6. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Installing a disk drive using Linux kernel 2.4 If you are installing a disk drive in a location that is controlled by Linux kernel 2.4, follow the procedure in this section to perform this task. Installing a disk drive 1. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then move on to step 7 on page 21. Note: The disk drive filler is a single piece that does not attach to the new disk drive. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. a. Pull the disk-drive filler from the slot, as shown in Figure 7 on page 20. Installing a disk drive or solid-state drive 19 b. Pull off the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure. c. Loosen, and then pull out on the thumbscrew (C) to release the disk-drive bezel (B) from the front edge of the filler. 2. Find the package that contains the new disk drive. Figure 7. Removing a disk-drive filler from model Figure 8. Removing the disk-drive bezel 20 Disk drives Attention: Disk drives are fragile. Handle with care. 3. Remove the disk drive from the static-protective package. Record the serial number (use the last 4 digits) of the disk drive. 4. Unlock the disk drive handle by squeezing and pulling it out toward you. If the handle is not all the way out, the disk drive will not slide into the system or expansion unit. 5. If your disk drive does not have a bezel preinstalled, follow these steps to install the disk-drive bezel on the new disk drive: a. Attach the bezel (A) to the disk drive, as shown in the following figure. b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. 6. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 7. Slide the disk drive into the system. Slide the disk drive all the way in to the system or expansion unit and then lock it in place by pushing in the disk drive handle, as shown in Figure 10 on page 22. Figure 9. Installing a disk-drive bezel Installing a disk drive or solid-state drive 21 Figure 10. Installing a disk drive in a model Figure 11. Installing a small-form factor disk drive 22 Disk drives Note: The concurrent maintenance light stops blinking and remains on when you have installed the disk drive correctly. To view the locations of the concurrent maintenance lights, select the appropriate model from the following list: v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures” on page 133 v “Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure” on page 131 v “Disk drive locations and concurrent maintenance lights for the 5802 expansion unit” on page 135 8. If you are installing more than one disk drive, repeat step 1 on page 19 through step 7 on page 21, until all disk drives are installed. 9. Log in as root user. 10. Wait 30 seconds after installing the disk drive, and then type iprconfig on the command line of the Linux session. Press Enter. The Work with Disk Units display is shown. 11. Select Display disk hardware status from the Work with Disk Units display. Press Enter. A Display Disk Hardware Status display is shown, similar to the following figure. Figure 12. Installing a small-form factor disk drive in the 5803 expansion unit Installing a disk drive or solid-state drive 23 12. Locate the disk drive that you installed. The serial number might be 0. If the disk drive is displayed as a model 50 disk, you are done. If not, follow these steps: a. Find the /dev/sdx name associated with the disk drive that you installed. b. Exit to the command line. c. Type sg_map on the command line and press Enter. d. Using the output from running the sg_map command, correlate the /dev/sdx device to a /dev/sgx device. e. Type sg_inq /dev/sgx on the command line using the /dev/sgx value from the previous step. Press Enter. f. Type iprconfig on the command line and press Enter. g. Select Display disk hardware status from the Work with Disk Units display. Press Enter. h. The serial number for the newly installed disk drive will be displayed on the Display Hardware Status display. 13. Record the resource name, when you configure the disk drive. Perform the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit or rack front door. 3. Configure the newly installed disk drive for your environment. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. Display Disk Hardware Status Serial Vendor Product Resource Hardware Number ID ID Model Name Status 03132041 IBM 2780001 001 /dev/ipr3 Operational 000015A5 IBM ST373453LC 079 DPY/Active E3V1WEAB IBM IC35L036UCDY10-0 070 DPY/Active E3V0J55B IBM IC35L036UCDY10-0 079 DPY/Active E3V1W1UB IBM IC35L036UCDY10-0 070 DPY/Active 04B81193 IBM 3C98D203 210 /dev/sdi Operational 000227CE IBM ST336605LC 020 /dev/sdh Operational 00000001 IBM 570B001 001 /dev/ipr2 Operational 00006719 IBM ST336607LC 020 /dev/sdd Operational E3V0E77B IBM IC35L036UCDY10-0 020 /dev/sde Operational 0007E0C4 IBM ST373453LC 020 /dev/sdf Operational 00000001 IBM 5702001 001 /dev/ipr1 Operational 03060038 IBM 5703001 001 /dev/ipr0 Operational More... Press Enter to continue. e=Exit q=Cancel r=Refresh f=PageDn b=PageUp d=Display disk unit details p=Display device parity status Figure 13. Display Disk Hardware Status display 24 Disk drives Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Installing the disk drive using Linux kernel 2.6 If you are installing a disk drive in a location that is controlled by Linux kernel 2.6, follow the procedure in this section to perform this task. Install the disk drive 1. Log in as root user. 2. Type lsslot on the command line of the Linux session and press Enter. A display like the following figure, is shown. 3. Note the Linux Name for the slot that contains the adapter under which you want to install the disk drive. 4. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 5. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press Enter. 6. Select Concurrent add device from the Work with Disk Unit Recovery display. Press Enter. A Concurrent Device Add display, like the following figure, is shown. 7. Type option 1 next to the location into which you want to install the disk drive and press Enter. The Verify Device Concurrent Add display is shown. x0537p1:~ # lsslot # Slot Description Linux Name Device(s) U787A.001.DNZ00XV-P1-C1 Logical I/O Slot 0000:00:02.0 ethernet U787A.001.DNZ00XV-P1-C2 Logical I/O Slot 0000:00:02.2 pci U787A.001.DNZ00XV-P1-C3 Logical I/O Slot 0001:00:02.2 pci1069,b166 U787A.001.DNZ00XV-P1-C4 Logical I/O Slot 0000:00:02.6 pci1069,b166 U787A.001.DNZ00XV-P1-C5 Logical I/O Slot 0001:00:02.6 pci1014,0180 U787A.001.DNZ00XV-P1-T5 Logical I/O Slot 0000:00:02.4 ethernet U787A.001.DNZ00XV-P1-T7 Logical I/O Slot 0000:00:02.3 usb U787A.001.DNZ00XV-P1-T10 Logical I/O Slot 0001:00:02.4 scsi U787A.001.DNZ00XV-P1-T12 Logical I/O Slot 0001:00:02.3 ide U9111.520.10D3C6C-V1-C0 Virtual I/O Slot 30000000 vty Figure 14. Find the Linux Name for the location for which you want to install the disk drive. Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 15. Example Concurrent Device Remove display Installing a disk drive or solid-state drive 25 A concurrent maintenance light indicates the slot where the disk drive can be installed. 8. Press Enter on the Verify Device Concurrent Add display to verify the location. 9. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then move on to step 16 on page 28. Note: The disk drive filler is a single piece that does not attach to the new disk drive. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. a. Pull the disk-drive filler from the slot as shown in Figure 16. b. Pull off the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure. Figure 16. Removing a disk-drive filler from model 26 Disk drives c. Loosen, and then pull out on the thumbscrew (C) to release the disk-drive bezel (B) from the front edge of the filler. 10. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 11. Remove the disk drive from the static-protective package. Record the serial number (use the last four digits) of the disk drive. 12. Unlock the disk drive handle by squeezing and pulling it out toward you. If the handle is not all the way out, the disk drive cannot slide into the system or expansion unit. 13. If your disk drive does not have a bezel preinstalled, follow these steps to install the disk-drive bezel on the new disk drive: a. Attach the bezel (A) to the disk drive, as shown in the following figure. Figure 17. Removing a disk-drive bezel from the disk drive. Installing a disk drive or solid-state drive 27 b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. 14. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 15. The Complete Device Concurrent Add display is shown. 16. Slide the disk drive into the system. Slide the disk drive all the way in into the system or expansion unit and then lock it in place by pushing in the disk drive handle, as shown in Figure 19 on page 29. Figure 18. Attaching the disk-drive bezel to the disk drive. 28 Disk drives Figure 19. Installing a disk drive in a model Figure 20. Installing a small-form factor disk drive Installing a disk drive or solid-state drive 29 17. Press Enter. The concurrent maintenance light stops flashing and remains on when you have installed the disk drive properly. To view the locations of the concurrent maintenance lights, select the appropriate model from the following list: v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures” on page 133 v “Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure” on page 131 v “Disk drive locations and concurrent maintenance lights for the 5802 expansion unit” on page 135 18. If you are installing more than one disk drive, repeat step 2 on page 25 through step 17 until all disk drives are installed. Verify that the new disk drive is operational To verify that the new disk drive is operational, follow these steps: 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Display hardware status. The Display Hardware Status display is shown, like the following figure. Figure 21. Installing a small-form factor disk drive in the 5803 expansion unit 30 Disk drives 4. Verify that the disk drive that you installed is shown on this display. Perform the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit, or rack front door. 3. Configure the newly installed disk drive for your environment. For information, go to the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Installing a disk drive or solid-state drive with the Virtual I/O Server system or logical partition powered on You can install a disk drive or solid-state drive in a system or expansion unit while the Virtual I/O Server operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this information to perform this task. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following websites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Microcode downloads. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 22. Example Display Hardware Status Installing a disk drive or solid-state drive 31 If your system is managed by the HMC, use the HMC to perform the steps for installing a feature in the server. For information about using the HMC to install a feature, see “Installing a feature by using the Hardware Management Console” on page 247. If you do not have an HMC, perform the following steps to install a disk drive from the system or partition that controls the disk drive. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: The powered-on procedure is a complex set of steps that must be performed precisely as instructed to avoid potential data loss or system damage. You might also consider the powered-off procedure, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on page 49. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. Remove the front covers. For instructions, see “System covers” on page 207 32 Disk drives 3. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and concurrent maintenance lights” on page 125. Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can select a different disk drive placement depending on your data protection strategy. 4. Record the position where the new disk drive can be installed. For example, the next available disk drive slot might be P3-D4. 5. Perform the following steps to install a disk drive. Install the disk drive 1. Log in as root user. 2. At the command line, type diagmenu and press Enter. 3. Press Enter to continue. 4. On the Function Selection display, select Task Selection. 5. On the Tasks Selection display, select Hot Plug Task. 6. Select SCSI and SCSI RAID Hot Plug Manager. 7. Select Attach a Device to a SCSI Hot Swap Enclosure Device. A list of empty slots in the SCSI hot swap enclosure device is shown. 8. Select the slot where you want to install the disk drive and press Enter. The disk drive slot enters the Identify state. The concurrent maintenance light for the slot begins flashing. To view the locations of the concurrent maintenance lights, select the model you are installing a disk drive in to. For a list of models see, “Disk drive locations and concurrent maintenance lights” on page 125. 9. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then move on to step Insert the disk drive. Note: The disk drive filler is a single piece that does not attach to the new disk drive. Important: . Pull the disk-drive filler from the slot that you want to use, as shown in Figure 23 on page 34 Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. Installing a disk drive or solid-state drive 33 10. If the slot you want to use contains a disk-drive filler, perform the following step. Remove the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure. 11. If the slot you want to use contains a disk-drive filler, perform the following step. Loosen, and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of the filler. Figure 23. Removing a disk-drive filler from a model Figure 24. Removing a disk-drive bezel from a disk-drive filler 34 Disk drives 12. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 13. Remove the disk drive from the static-protective package. 14. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 15. If the disk drive you are installing does not have a bezel preinstalled, then perform the following step. Attach the disk-drive bezel (A) to the disk drive, as shown in the following figure. 16. If the disk drive you are installing does not have a bezel preinstalled, then perform the following step. Press in and then tighten the thumbscrew (B) to secure the bezel (A) to the disk drive. 17. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. 18. Slide the disk drive into the system unit until it stops, as shown in Figure 26 on page 36. 19. Push the disk drive handle in until it locks. The light on the disk drive turns on. Figure 25. Installing a disk-drive bezel on a disk drive Installing a disk drive or solid-state drive 35 Figure 26. Installing a disk drive in a model Figure 27. Installing a small-form factor disk drive 36 Disk drives 20. Press Enter. 21. If you are installing more than one disk drive, repeat step 1 on page 33 through step 20 until all disk drives are installed. 22. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu. 23. Select Configure Added/Replaced Devices. 24. Exit to the command line. Perform the procedure 1. Replace the front cover. For instructions on replacing the cover, see “System covers” on page 207. 2. Verify the installed part. For instructions, see “Verifying the installed part” on page 249. Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on Learn how to install a disk drive or solid-state drive in a system or expansion unit while the IBM i operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this information to perform this task. Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with the IBM i system or logical partition powered on” on page 89. This procedure is intended for installing a new or upgraded disk drive. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following websites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Microcode downloads. Figure 28. Installing a small-form factor disk drive in the 5803 expansion unit Installing a disk drive or solid-state drive 37 v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. If your system is managed by the HMC, use the HMC to perform the steps for installing a feature in the server. For information about using the HMC to install a feature, see “Installing a feature by using the Hardware Management Console” on page 247. If you do not have an HMC, perform the following steps to install a disk drive from the system or partition that controls the disk drive. If you are installing Solid State Drives, review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid potential data loss or system damage. You might also consider the powered-off procedure, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on page 49. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. 38 Disk drives Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. Determine the next available disk drive position for your system or expansion unit. For information, see “Disk drive locations and concurrent maintenance lights” on page 125. 3. Record the position where the new disk drive can be installed. For example, the next available disk drive slot might be P3-D4. 4. If your system is managed by a Hardware Management Console (HMC), use the HMC to perform the steps for installing the disk drive. For instructions, see “Installing a feature by using the Hardware Management Console” on page 247. If you do not have an HMC, do the following steps to install a disk drive. Prepare to install the disk drive 1. Record the physical location where the new disk drive can be installed. The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position of the disk drive in the system unit or expansion unit. Find the U787A.001.AAAXXXX location information from the control panel display on the system unit or expansion unit. If you cannot find the location information from the control panel, find the feature code (FC) and sequence number (SN or SEQ) on the label that is shown on the front of the system unit or expansion unit, as shown in the following figure. In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label, AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and P3–D4 is the slot where you are installing the disk drive as shown in “Disk drive locations and concurrent maintenance lights” on page 125. In the example, U5074.001.AAAXXXX-DB3-D32, you would install a disk drive in the expansion unit with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32. 2. Sign on to service level authority. 3. Type strsst on the command line of the IBM i session and press Enter. 4. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 5. Select Start a service tool from the System Service Tools (SST) display. Press Enter. 6. Select Hardware service manager from the Start a Service Tool display. Press Enter. 7. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then move on to step “Install the disk drive” on page 41. Figure 29. Example of a system or expansion unit label Installing a disk drive or solid-state drive 39 Note: The disk drive filler is a single piece that does not attach to the new disk drive. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. a. Pull the disk-drive filler from the slot, as shown in the following figure. b. Pull off the disk drive bezel latch (A) from the front of the filler, as shown in the following figure. Figure 30. Removing a disk-drive filler 40 Disk drives c. Loosen, and then pull out the thumbscrew (C) to release the disk-drive bezel (B) from the front edge of the filler. Install the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 2. Remove the disk drive from the protective package. Record the serial number (use the last four digits) of the disk drive. 3. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does does not slide into the system or expansion unit. 4. Follow these steps to install the disk-drive bezel on the new disk drive: a. Attach the bezel (A) to the disk drive, as shown in the following figure. Figure 31. Removing a disk-drive bezel Installing a disk drive or solid-state drive 41 b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. 5. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 6. Slide the disk drive halfway into the system or expansion unit. 7. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter. The Device Concurrent Maintenance is displayed, as shown in the following figure. 8. In the Specify either physical location or resource name field, type the physical location that you had previously recorded in this procedure, as shown in Figure 33. 9. Select option 2 (Install device) for the Action to be performed. Figure 32. Installing a disk-drive bezel on a disk drive Device Concurrent Maintenance Type the choices, then press Enter. Specify either physical location or resource name. Physical location: U787A.001.1234567-P3-D4 or Device resource name: Specify action as 1=Remove device 2=Install device Action to be performed . . . . . . . .: 2 Enter a time value between 00 and 19. Time delay needed in minutes . . . . .: 01 F3=Exit F5=Refresh F12=Cancel Figure 33. Example Device Concurrent Maintenance display 42 Disk drives 10. Set the time delay for one minute - 01. Important: Do not press Enter at this time. 11. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you are installing. To view the locations of the concurrent maintenance lights, select the appropriate model from the following list: v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures” on page 133 v “Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure” on page 131 v “Disk drive locations and concurrent maintenance lights for the 5802 expansion unit” on page 135 12. Press Enter on the console. Important: When you press Enter, after a one minute delay, the light comes on and begins to flash rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by pushing in the disk drive handle. 13. When the concurrent maintenance light starts blinking, slide the disk drive all the way in and lock it in place by closing the latch, as shown in the following figure. Note: The light stops blinking and remains on when you have installed the disk drive properly. Figure 34. Sliding the disk drive in to the system unit Installing a disk drive or solid-state drive 43 14. If you are installing more than one disk drive, perform step 1 on page 39 through 13 on page 43 until all disk drives are installed. Figure 35. Installing the small-form factor disk drive into the system Figure 36. Installing a small-form factor disk drive in the 5803 expansion unit 44 Disk drives Perform the procedure 1. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press Enter. 2. When the Hardware Service Manager display is shown, press F12 twice and then press Enter. 3. To perform this procedure, go to “Configuring a disk drive in an system or logical partition.” Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Configuring a disk drive in an system or logical partition To configure a newly installed disk drive into an existing device parity set, follow these steps: If you want to start a new device parity set or change the protection on your disks, see Manage device parity protection or Start device parity protection in the Disk Management section of the IBM i Information Center website at http://www.ibm.com/systems/i/infocenter . 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Restriction: The service tools password is case sensitive. 3. Select Work with Disk Units on the System Service Tools display. Press Enter. 4. Select Display disk configuration on the Work with Disk Units display. Press Enter. 5. Select Display non-configured units on the Display disk configuration display. Press Enter. 6. The disk drive that you installed is listed. The serial number should match the serial number you recorded (the last 4 digits). Note: The new disk drive might take up to five minutes to appear in the list. If the disk drive is not listed, ensure that the disk drive was properly installed. 7. Press F12 twice. 8. Select Work with disk configuration on the Work with Disk Units display. Press Enter. 9. Are you adding a disk drive to an existing device parity set? v Yes: Continue with the next step. v No: Go to step 23 on page 46 for mirroring or no protection, go to 1 on page 47 for hot spare. Tip: Select No for mirroring, hot spare or no protection. 10. Select Work with device parity protection on the Work with Disk Configuration display. Press Enter. 11. Select Include unit in device parity protection on the Work with Device Parity Protection display. Press Enter. 12. Did the Include Device Parity Protection Failed display appear? v Yes: Continue with the next step. v No: Go to step 15 on page 46. 13. The following conditions must be met before you can start device parity protection. Installing a disk drive or solid-state drive 45 v If there are enough disk drives available to create a new parity set, the drives are eligible for Start Device Parity Protection operation and not for the Include Device Parity Protection operation. v All disk drives in a parity set must be the same capacity with a minimum number of three or four disk drives and maximum of 10 or 18 disk drives in the resulting parity set, depending on the disk drive type. v Not all disk drives attached to an advanced function input output processor have reported to the system. Retry the operation. v The type or model of the disk drives cannot be used for the requested operation. When these conditions are met, press F12 to return to the Work with Disk Configuration display. 14. Select Include unit in device parity protection on the Work with disk configuration display. Press Enter. 15. On the Include Disk Units in Device Parity Protection display, select the disk drives to be included in Device Parity Protection. Type a 1 in the Option column. Press Enter. 16. Press Enter to confirm your choice. The configuration starts and might take 30 to 60 minutes to complete. 17. When the device parity protection is complete, the message “Selected drives have been included successfully” is shown on the Work with device parity protection display. Press F12. 18. Select Display disk configuration on the Work with disk configuration display. Press Enter. 19. Select Display disk configuration status on the Display disk configuration display. 20. If you have more than one ASP, determine the auxiliary storage pool (ASP) to which you want to add the disk drive. 21. Press F12 twice to get to the Work with Disk Units display. 22. Select Work with disk configuration on the Work with Disk Units display. Press Enter. 23. Select the option to add units to ASPs and balance data on the Work with disk configuration display. Press Enter. 24. Specify the number of the ASP to which you want to add the disk drive. The system ASP is ASP 1. Press Enter. Notes: a. If ASP is mirror protected, disk drives must be added in pairs of the same size. b. When one or more pairs of disk drives are added to a mirrored ASP, they automatically become mirror protected. c. To start mirrored protection on an ASP, refer to Recovering your system in the IBM i Information Center. To access the IBM i Information Center Web site, go to http://www.ibm.com/systems/i/ infocenter and select from the following options: v If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup and recovery > Recovering your system v If you select V5R4, expand i5/OS Information Center > Systems management > Backup and recovery > Backing up and recovering your system > Recovering your system 25. The Confirm Add Units display is shown showing the configuration your system will have when the add operation completes. Note: If you have selected the wrong ASP, press F12 to change your options. Press the Enter key to continue. The add process will take several minutes to complete. 26. When the message Selected units have been added successfully is shown, press F3 three times and press Enter to return to the Main Menu display. 27. If you have a printer, print the configuration list. A service provider might refer to the configuration list in the future. a. You need to sign on with at least service level authority 46 Disk drives b. Type strsst on the Main Menu command line. Press Enter. c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case sensitive. d. Select Start a service tool on the System Service Tools (SST) display. Press Enter. e. Select Hardware service manager on the Start a Service Tool display. Press Enter. f. Press F6 (print the configuration) from the Hardware Service Manager display. g. To return to the Main Menu display, press F3 (Exit) twice and then press Enter. h. Place the configuration list where you can refer to it in the future. 28. If you are a service provider performing this task, return to the procedure that sent you here. 1. Select Work with hot spare protection from the Work with disk configuration menu. 2. Select Start device parity protection - RAID 5 with hot spare or Start device parity protection - RAID 6 with hot spare depending on the level of parity protection that is desired. 3. Return to the procedure that sent you here. Rebuilding data on a replacement disk drive using IBM i Learn how to rebuild data on replacement disk. To rebuild data on the replacement disk drive, complete the following steps: 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case sensitive. 3. Select Work with disk units from the Start System Service Tools (SST) display. Press Enter. 4. Select Work with disk unit recovery from the Work with Disk Units display. Press Enter. 5. If you determined that the disk drive you replaced had device parity protection (RAID), follow these steps to restore device parity protection (RAID) on the disk drive: a. Select Rebuild disk unit data on the Work with Disk unit recovery display. Press Enter. b. Select 1 to rebuild the disk drive displayed (the disk drive displayed is the disk drive that you removed) on the Rebuild Disk Unit Data display. Press Enter. c. Press Enter on the Confirm Rebuild Disk Unit Data display. The rebuild process might take several minutes to complete. d. Press F5 to refresh the display until the Percent complete shows 5%. e. When the display shows at least 5% complete, you can either continue to monitor this display to completion, or press F3 (Exit) to return to the Work with disk units display. f. Press F3 (Exit) to return to the System service tools display. g. Press F3 (Exit) to return to the Exit SST display and press Enter. h. Return to the procedure that sent you here. 6. If the disk drive that you replaced had mirrored protection, follow these steps: a. Select Replace configured unit on the Work with Disk unit recovery display. Press Enter. b. Select the configured disk drive that you are exchanging (suspended drive) on the Select Configured Unit to Replace display. Press Enter. c. Select the disk drive that you just installed on the Select Replacement Unit display. This drive has a non-configured status. Installing a disk drive or solid-state drive 47 Note: In some cases, it might take several minutes for a new disk drive to display. Repeat these steps until the new drive is shown. Press Enter. d. Press Enter on the Confirm Replace of Configured Unit display to confirm your choice for replacement. The replacement process might take several minutes to complete. When the process is complete, the Work with Disk unit recovery display is shown. e. Press F3 (Exit) to return to the Work with disk units display. f. Select Display disk configuration on the Work with disk units display. g. Select Display disk configuration status on the Display Disk Configuration display. Mirrored status shows Resuming. When complete, the mirrored status shows Active. This process might take several minutes to complete. You can either monitor this display to completion, or press F3 (Exit) three times, and then press Enter to return to the IBM i main menu. Determining the IBM i disk drive protection status To determine the protection status of the disk drives in your IBM i system or logical partition, follow these steps from the System Service Tools (SST) display: Tip: If you do not know the protection status of the disk drives and you cannot get to the System Service Tools display, contact your next level of support. 1. Sign on with at least service level authority. 2. Type strsst on the command line of the IBM i session and press Enter. 3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case-sensitive. 4. Select Work with disk units from the System Service Tools (SST) display. Press Enter. 5. Select Display Disk Configuration from the Work with Disk Units display. Press Enter. 6. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter. A list of each auxiliary storage pool (ASP) displays, showing the disk drives that are members of the ASP. The Status column shows one of the following protection statuses for the ASP: v Mirrored. These ASPs are configured for mirrored protection. If the failing disk drive has mirrored protection, record the status of both disk drives in the mirrored pair. You need this information in the recovery procedure. v Unprotected. An ASP with a status of Unprotected might contain disk drives that are device parity protected. Record the status of the failing disk drive as it is shown in the Status column of the Display Disk Configuration Status display. A disk drive with one of the following statuses has device parity protection only if all other disk drives in the array are operational: – DPY/Active – DPY/Failed – DPY/HDW Failure – DPY/Degraded – DPY/Power Loss – DPY/Not Ready – DPY/Unknown A disk drive with any other status (such as DPY/Rebuild or DPY/Unprotected) does not have device parity protection. Examples: 48 Disk drives – Suppose one disk drive has a status of DPY/Failed while all other disk drives have a status of DPY/Unprotected. In this case, only the disk drive with the DPY/Failed status is protected. Servicing the failed disk drive does not result in data loss. Removing any of the unprotected disk drives might result in data loss. – All disk drives have a status of DPY/Active or DPY/Degraded, therefore, all disk drives are protected. In this case, exchanging any disk drive does not result in data loss. Tip: For more information about the meaning of a status for any disk drive, use the Help key on the display. Installing a disk drive or solid-state drive with the system or logical partition power turned off Learn how to install a disk drive in a system or logical partition that controls the disk drive location with the power turned off. Use the procedure in this information to perform this task. Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive with the system or logical partition powered off” on page 106. This procedure is intended for installing a new or upgraded disk drive. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following websites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Microcode downloads. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. If your system is managed by the HMC, use the HMC to perform the steps for installing a feature in the server. For information about using the HMC to install a feature, see http://publib.boulder.ibm.com/ infocenter/powersys/v3r1m5/topic/p7haj/hmcinstall.htm. If you do not have an HMC, perform the following steps to install a disk drive from the system or partition that controls the disk drive. If you are installing Solid State Drives, review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. Installing a disk drive or solid-state drive 49 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. Note: Do not disconnect the power source when you stop the system or logical partition to install a disk drive or solid-state drive. 2. If applicable, remove the system unit door or expansion unit door, or open the rack. 3. If applicable, remove the front cover. 4. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and concurrent maintenance lights” on page 125. Note: When you have a system with an expansion unit, you must fill the slot positions in the system unit first. However, you can select a different disk drive placement depending on your data protection strategy. 5. Stop the system or the logical partition that controls the disk drive location. For instructions, see http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/crustopsys.htm. 50 Disk drives Install the disk drive 1. If the slot you want to use contains a disk-drive filler, store the filler for future use, or discard it, then move on to step Insert the disk. Note: The disk drive filler is a single piece that does not attach to the new disk drive. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. a. Pull the disk-drive filler from the slot you want to use, as shown in Figure 37. b. Pull off the disk-drive bezel latch (A) from the front of the filler, as shown in the following figure. Figure 37. Removing a disk-drive filler from a model Installing a disk drive or solid-state drive 51 c. Loosen, and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of the filler. 2. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 3. Remove the disk drive from its protective package. Record the serial number (use the last four digits) of the disk drive. 4. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 5. If your disk drive does not have a bezel preinstalled, follow these steps to install the disk-drive bezel on the new disk drive: a. Attach the bezel (A) to the disk drive, as shown in the following figure. Figure 38. Removing a disk-drive bezel 52 Disk drives b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. 6. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 7. Slide the disk drive into the system or expansion unit until it stops. Note: When installing a disk drive, the drive must be fully seated and all the way into the system. 8. Push the disk drive handle in until it locks, as shown in Figure 40 on page 54. Figure 39. Installing a disk-drive bezel Installing a disk drive or solid-state drive 53 Figure 40. Installing a disk drive in the Figure 41. Installing a small-form factor disk drive 54 Disk drives 9. If you are installing more than one disk drive, repeat step 1 on page 51 through step 8 on page 53 until all disk drives are installed. Perform the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit, or rack front door. 3. If the system power cables were removed, reconnect them. 4. Start the system or logical partition. For instructions, see “Starting the system or logical partition” on page 199. 5. To configure the newly installed disk drive, select the procedure for the operating system that controls the disk drive. v “Preparing a disk drive for use in an AIX system or logical partition” on page 137 v For information about configuring the newly installed disk drive for a Linux environment, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Figure 42. Installing a small-form factor disk drive in the 5803 expansion unit Installing a disk drive or solid-state drive 55 56 Disk drives Removing a disk drive or solid-state drive You might need to remove a disk drive or solid-state drive from the sever or expansion unit. This section includes information about the disk-drive enclosure and procedures so that you can perform these tasks. To remove and replace a failed disk drive, go to “Replacing a disk drive” on page 63. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing a disk drive. For instructions, see “Removing a part by using the Hardware Management Console” on page 247. Remove a disk drive from a system or expansion unit that is controlled by different operating systems. “Replacing a disk drive with the AIX system or logical partition powered on” on page 63 “Replacing a disk drive with the Linux system or logical partition powered on” on page 72 “Replacing a disk drive with the IBM i system or logical partition powered on” on page 89 Removing a disk drive from a system or expansion unit controlled by IBM i Learn how to remove a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive from a system or expansion unit with an IBM i operating system or a logical partition that controls the disk drive location. Use the procedure in this information to perform this task. Attention: If you are removing a disk drive in order to install a new or upgraded disk drive, see “Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on” on page 37. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. If your system is managed by the Hardware Management Console (HMC), use the HMC to perform the steps for removing disk drives from the server. For information about using the HMC to remove memory modules, see “Removing a part by using the Hardware Management Console” on page 247. If you do not have an HMC, perform the following steps to remove the disk drive. © Copyright IBM Corp. 2010 57 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) 1. Before removing a disk drive, back up all the data from the disk drive or the array that the drive is a part of, and then move the data from the disk drive that you want to remove. For instructions, choose one of the following options: v Move and remove disk units v Recovering your system in the IBM i Information Center To access the IBM i Information Center website, go to http://www.ibm.com/systems/i/infocenter and select from the following options: – If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup and recovery > Recovering your system – If you select V5R4, expand i5/OS Information Center > Systems management > Backup and recovery > Backing up and recovering your system > Recovering your system 2. Prepare to remove the disk drive from the system configuration as described in “Preparing to remove a disk drive from a system or logical partition controlled by IBM i” on page 61. 3. Locate the disk drive to be removed. See “Disk drive locations and concurrent maintenance lights” on page 125. 4. Stop the system or logical partition that controls the disk drive location. For instructions, see “Stopping a system or logical partition” on page 201. 58 Disk drives Prepare the system 1. If applicable, remove the system unit door, expansion unit door, or open the rack. 2. If applicable, remove the front cover. v If you have a front cover from any other models, remove the front cover. Remove the disk drive 1. Unlock the disk drive handle by squeezing it and pulling it out toward you. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. Figure 43. Removing a disk drive Removing a disk drive or solid-state drive 59 2. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 3. If a disk-drive filler is available, follow these steps: Figure 44. Removing a small-form factor disk drive Figure 45. Removing a small-form factor disk drive from the 5803 expansion unit 60 Disk drives Note: Refill the disk drive slots with either another disk drive or a disk-drive filler. Filling the disk drive slot can help ensure proper airflow for cooling and help maintain optimal EMI compliance. a. Attach the disk-drive bezel to the filler. b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. c. Install the disk-drive filler in the system or expansion unit. Perform the procedure 1. Install or close the front door of the system, expansion unit, or rack. 2. Reconnect all system power cables. 3. Reconnect the system to the power supply by plugging it into the system. 4. Start the system or logical partition. For instructions, see “Starting the system or logical partition” on page 199. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Preparing to remove a disk drive from a system or logical partition controlled by IBM i Learn the actions you need to take to properly remove a disk drive. Before removing a disk drive from a system or expansion unit that is controlled by IBM i, follow these steps: 1. Determine the protection status of the disk drive that you are replacing. For instructions, see “Determining the IBM i disk drive protection status” on page 48. 2. The service provider must perform one of the following tasks: v Save data and initialize the disk drive. v Initialize the disk drive and copy to disk. 3. For mirrored disk drives on an IBM i system or logical partition, follow these steps to verify that the disk drive you are replacing has been suspended: a. Sign on with service level authority. b. Type strsst on the command line of the IBM i session and press Enter. c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case sensitive. d. Select Work with disk units from the System Service Tools (SST) display. Press Enter. e. Select Display Disk Configuration from the Work with Disk Units display. Press Enter. f. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter. g. Does the disk drive that is mirrored to the disk drive that you are replacing (same drive number) have a status of Active? v No: The replacement must be performed by a service provider. v Yes: Does the disk drive that you are replacing have a status of Suspended? – Yes: Continue with the next step. Removing a disk drive or solid-state drive 61 – No: Follow these steps to suspend mirrored protection on the disk drive that you are replacing: 1) Press F3 from the Display Disk Configuration display to return to the Work with Disk Units display. 2) Select Work with Disk Unit Recovery from the Work with Disk Units display. Press Enter. 3) Select Suspend mirrored protection from the Work with Disk Unit Recovery display. Press Enter. 4) Select the option to suspend the disk drive that you are replacing from the Suspend Mirrored Protection display. Press Enter. h. Press F3 repeatedly to exit System Service Tools and return to the main menu. 62 Disk drives Replacing a disk drive You might need to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in the server or expansion unit. Use the procedures in this section to perform this task. Note: These instructions (the first 3 sections) can also be used if you are replacing an SAS disk drive. To remove and replace a disk drive in a different enclosure, use the information for that enclosure. Use this procedure to remove a failed disk drive and replace it with the same type of disk drive. If you plan to remove a disk drive and install a different type of disk drive, remove the existing disk drive as described in “Removing a disk drive or solid-state drive” on page 57, then install the new disk drive as described in “Installing a disk drive or solid-state drive” on page 5. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing a disk drive. For instructions, see “Exchanging a part by using the Hardware Management Console” on page 247. Your system or logical partition can be powered off or powered on when you replace the disk drive. If the disk drive that is being replaced is in the AIX or Linux root volume group (rootvg) and does not have disk protection replace your disk with the powered-off procedure. Replacing a disk drive with the AIX system or logical partition powered on You might need to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or expansion unit while the AIX operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this section to perform this task. Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive with the AIX system or logical partition powered on” on page 10. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. © Copyright IBM Corp. 2010 63 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid potential data loss or system damage. You might also consider the powered-off procedure, which is a simpler task that involves less risk of data loss or system damage. If the system or logical partition that controls the disk drive you are replacing is at the AIX command line, you can use this powered-on procedure. If the disk drive that you are replacing is in the root volume group (rootvg) and it is not protected by either Redundant Array of Independent Disks (RAID) or mirroring, or to use the powered-off procedure, go to “Replacing a disk drive with the system or logical partition powered off” on page 106. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. Prepare to remove the disk drive from the system or expansion unit as described in “Preparing to remove a disk drive from a system or expansion unit controlled by AIX” on page 137. 3. If applicable, remove the system unit door, expansion unit door, or the rack door. 64 Disk drives 4. If applicable, remove the front cover. v If you have a front cover from any other models, remove the front cover. To replace a Just a Bunch of Disks (JBOD) disk drive attached to any SCSI or SCSI RAID adapter, see the procedure in “Replacing a SCSI disk drive” on page 114. Remove a disk drive 1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure, then return here and perform the following steps. 2. Log in as root user. 3. At the command line, type diag and press Enter. 4. Press Enter to continue. 5. On the Function Selection display, select Task Selection. 6. Select Hot Plug Task. 7. Select SCSI and SCSI RAID Hot Plug Manager or RAID Hot Plug Devices, depending on the adapter that your drive is attached to. Tip: One or both of these options might be available, depending on the configuration of your system. If the drive you want to replace is not listed when you follow the steps for the SCSI and SCSI RAID Hot Plug Manager, go to step 8. v If you have a RAID disk drive attached to a 2498 PCI SCSI RAID adapter, continue with step 8. v If you have a JBOD disk drive attached to any SCSI or SCSI RAID adapter, go to “Replacing a SCSI disk drive” on page 114. v If you have RAID drives attached to any other adapter, go to the PCI-X SCSI RAID Controller Reference Guide for AIX. This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/ hardware_docs/scsipciadapters.htm. 8. Select RAID Hot Plug Devices. 9. On the Resource Selection List, select the adapter that is connected to the array that contains the disk drive you want to replace. Press Enter. 10. Select Commit. 11. On the Identify and Remove Resources menu, select Remove. Press Enter. A list of the disk drives that you can remove is shown. If the disk drive you want to remove does not appear in the list, ask your system administrator to put the disk drive in the Failed state before continuing with this procedure. For information, see AIX System Management Guide: Operating System and Devices. This guide is available at the System p® and AIX Information Center (http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp). 12. Select the drive that you want to remove and press Enter. The concurrent maintenance light for the slot will flash rapidly. Attention: If the light does not flash rapidly, do not remove the disk drive. To view the locations of the concurrent maintenance lights, go to “Disk drive locations and concurrent maintenance lights” on page 125. 13. Unlock the disk drive handle by squeezing it and pulling it out toward you as shown in Figure 46 on page 66. Replacing a disk drive 65 Figure 46. Removing a disk drive from a model Figure 47. Removing a small-form factor disk drive 66 Disk drives 14. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. The concurrent maintenance light for the slot turns off when you remove the disk drive. 15. Does your new disk drive have a bezel preinstalled? v No: Continue with the procedure from here. v Yes: Go to step 6 on page 69 of the replacement procedure. Important: If the disk drive bezel does not have a thumbscrew, it is not removable and is a serial-attached SCSI (SAS) disk drive. The replacement drive should come with an attached bezel. Loosen and pull out on the thumbscrew (B) to release the bezel (C) from the disk drive, as shown in the following figure. Figure 48. Removing a small-form factor disk drive from the 5803 expansion unit Replacing a disk drive 67 Replace a disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 2. Remove the disk drive from its protective package. 3. Unlock the handle of the replacement disk drive by squeezing and pulling it out toward you. If the handle is not all the way out, the disk drive cannot slide into the system. 4. If your new disk drive does not have a bezel preinstalled, then perform the following step. Attach the disk-drive bezel (A) to the replacement disk drive, as shown in the following figure. 5. If your new disk drive does not have a bezel preinstalled, then perform the following step. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. Figure 49. Removing a disk-drive bezel 68 Disk drives 6. Install the disk drive as shown in Figure 51 on page 70. a. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. b. Slide the disk drive into the system until it stops. Important: Ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. c. Push the disk drive handle in until it locks. Figure 50. Installing a disk-drive bezel Replacing a disk drive 69 Figure 51. Installing a disk drive in a model Figure 52. Installing a small-form factor disk drive 70 Disk drives 7. Press Enter. The disk drive slot exits the Remove state and enters the Normal state. 8. Exit to the RAID Hot-Plug Devices menu. Press F3 to return. 9. Exit to the Task selection display. 10. Select Log Repair Action. 11. Select the disk drive that you replaced and then press Enter. 12. Select Commit after you have made your selection and then press Enter. 13. Exit to the command line. Complete the procedure 1. Replace the front cover. 2. Install or close the front door of the system, expansion unit or rack. 3. To prepare the disk drive for use, go to “Rebuilding data on a replacement disk drive using AIX” on page 137. Figure 53. Installing a small-form factor disk drive in the 5803 expansion unit Replacing a disk drive 71 Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing a disk drive with the Linux system or logical partition powered on You might need to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or expansion unit while the Linux operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this section to perform this task. Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive with the Linux system or logical partition powered on” on page 17. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. 72 Disk drives DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: Read these instructions carefully and understand the complexity of the powered-on procedure before you begin. If the system or logical partition that controls the disk drive you are replacing is at the Linux command line, you can use this powered-on procedure. Go to “Replacing a disk drive with the system or logical partition powered off” on page 106, if: v You are installing a disk drive that will be connected with the 6203 or 6204 SCSI adapter v You do not feel comfortable using the powered-on procedure Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. Is the disk drive you are replacing in a redundant array of independent disk (RAID) 0 array or is the disk drive you are replacing in a failed RAID level 5 or 10 array? v If yes, you must delete the disk array before replacing the disk. For instructions, see PCI-X SCSI RAID Controller Reference Guide for Linux. Replacing a disk drive 73 This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/ infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm. v If no, your drive is configured as Just a Bunch of Disks (JBOD) or is configured within a degraded array. Proceed to the next step. 3. If the disk drive you are replacing is in a redundant array of independent disk (RAID) Level 0 disk array or in a failed RAID Level 5 or RAID Level 10 disk array, you must delete the disk array before replacing the disk. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/ infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm. 4. If applicable, remove the system unit door or expansion unit door or open the rack front door. 5. If applicable, remove the front cover. v If you have a front cover from any other models, remove the front cover. To replace a disk drive in a location with the Linux system or logical partition that controls the disk drive location powered on, select the appropriate procedure from the following list: Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing the disk drive using Linux kernel 2.4 Learn how to replace a disk drive in a location that is controlled by a system or logical partition that is running Linux kernel 2.4. Prepare to remove the disk drive 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The Work with Disk Units display is shown. 3. From the Work with Disk Units display, select Analyze log. Press Enter. The Kernel Messages Log display is shown, like the following display. 4. From the Kernel Messages Log display, select Use vi to view most recent IBM Storage error messages. Press Enter. 5. Find the device error log entry for the disk drive you want to replace. Kernel Messages Log Select one of the following: 1. Use vi to view most recent IBM Storage error messages 2. Use vi to view IBM Storage error messages 3. Use vi to view all kernel error messages 4. Use vi to view sisconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. Use vi to view IBM Storage boot time messages Selection: e=Exit Figure 54. Kernel Messages Log 74 Disk drives 6. Record the location information for the disk drive. The location information has the form of P1–I2/Z1–A5. 7. Exit to the Work with Disk Units display. 8. Select Display disk hardware status from the Work with Disk Units display. Press Enter. 9. Record the hardware status of the disk drive you want to replace. 10. Type d (Display disk unit details) next to the disk drive you want to replace. 11. On the Display Disk Unit Details display, type 5 (Display hardware resource information details) next to the disk drive you want to replace. The Disk Unit Hardware Resource Information Details display is shown, like the following display. 12. Record the physical location information. 13. Exit to the Work with Disk Units display. 14. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. Remove the disk drive 1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure, then return here and perform the following steps. 2. From the Work with Disk Units display, select Work with disk unit recovery. Press Enter. 3. From the Work with Disk Unit Recovery display, select Device Concurrent maintenance. Press Enter. A Device Concurrent Maintenance display is shown, like the following display. Disk Unit Hardware Resource Information Details Manufacturer. . . . . . . . : IBM Machine Type and Model. . . : IC35L073UCDY10-0 Firmware Version. . . . . . : 53323846 Model . . . . . . . . . . . : 020 Serial Number . . . . . . . : E6VKUNPD Capacity. . . . . . . . . . : 73.41 GB Resource Name . . . . . . . : /dev/sda Physical location: Location . . . . . . . : U1.9-P1/Z1-A8 PCI Bus. . . . . . . . . . : 257 PCI Device . . . . . . . . : 1 SCSI Host Number . . . . . : 0 SCSI Channel. . . . . . . : 0 SCSI Id. . . . . . . . . . : 8 SCSI Lun. . . . . . . . . : 0 Press Enter to continue. e=Exit q=Cancel v=Display Extended VPD Figure 55. Disk Unit Hardware Resource Information Details display Replacing a disk drive 75 4. Enter either the physical location information or the PCI or SCSI location for the disk drive you want to replace. Press Enter. 5. Specify 1 (Remove device) for the action to be performed. 6. Enter a time value in minutes, that is the time required by you to remove the disk drive. The concurrent maintenance light indicates that you can remove the disk drive. To view the locations of the concurrent maintenance lights, select the appropriate model from the following list: v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures” on page 133 7. Press Enter. A Device Concurrent Maintenance Action display is shown, like the following display. 8. Enter c to confirm that you want to remove the disk drive at the specified location. A Device Concurrent Maintenance Action In Progress display is shown, like the following display. Device Concurrent Maintenance Type the choices, then press Enter. Specify either Physical Location or PCI/SCSI location. Physical Location: U0.1-P1-I4/Z1-A8 OR PCI/SCSI . . . PCI Bus: PCI Device: SCSI Channel: SCSI Id: SCSI Lun: Specify action as 1=Remove device 2=Install device Action to be performed . . . . . . . . . : 1 Enter a time value between 01 and 19. Time needed in minutes . . . . . . . . . : 1 Or leave blank and press Enter to cancel Figure 56. Example Device Concurrent Maintenance display Device Concurrent Maintenance Action Removal of device: Location: U0.1-P1-I4/Z1-A8 You have 1 minute to perform the operation During this time, your system can seem unresponsive c=Confirm q=Cancel Figure 57. Example Device Concurrent Maintenance Action display Device Concurrent Maintenance Action In Progress Please remove the device located in: Location: U0.1-P1-I4/Z1-A8 Figure 58. Example Device Concurrent Maintenance Action In Progress display 76 Disk drives Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 9. Squeeze or lift the handle of the disk drive out toward you before you remove the disk drive, as shown in Figure 59. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. Figure 59. Removing a disk drive from a model or 7311-D20 Replacing a disk drive 77 10. Support the bottom of the disk drive as you slide it out of the system. Do not hold the disk drive by the handle. Figure 60. Removing a small-form factor disk drive Figure 61. Removing a small-form factor disk drive from the 5803 expansion unit 78 Disk drives Note: The concurrent maintenance light for the slot stops flashing and remains off when you remove the disk drive. 11. Press Enter. 12. Does your disk drive have a bezel preinstalled? v No: Continue with the procedure from here. v Yes: Move on to step 5 on page 80 of the replace procedure. Loosen and pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive, as shown in the following figure. Replace the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 2. Remove the disk drive from its protective package. 3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install the disk-drive bezel on the replacement disk drive: a. Attach the bezel (A) to the disk drive, as shown in the following figure. Figure 62. Removing a disk-drive bezel Replacing a disk drive 79 b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. 5. Select Device Concurrent maintenance from the Work with Disk Unit Recovery display. Press Enter. 6. Enter either the physical location code or the PCI/SCSI location from which you removed the disk drive. Press Enter. 7. Specify action as 2 (Install device). 8. Enter a time value in minutes, that is time required by you to install the new disk drive. The concurrent maintenance light for the slot indicates that you can install the disk drive. 9. Press Enter. A Device Concurrent Maintenance Action display is shown, like the following display. 10. Enter c to confirm that you want to install the disk drive in the specified location. A Device Concurrent Maintenance Action In Progress display like the following sample is shown. Figure 63. Attaching a bezel to the disk drive Device Concurrent Maintenance Action Installation of device: Location: U0.1-P1-I4/Z1-A8 You have 1 minute to perform the operation During this time, your system can seem unresponsive c=Confirm q=Cancel Figure 64. Example Device Concurrent Action Maintenance display 80 Disk drives 11. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 12. Slide the disk drive all the way in into the system or expansion unit and then lock it in place by pushing in the disk drive handle as shown in Figure 66. Device Concurrent Maintenance Action In Progress Please insert the device located in: Location: U0.1-P1-I4/Z1-A8 Figure 65. Example Device Concurrent Maintenance Action In Progress display Figure 66. Installing a disk drive in a model or 7311-D20 Replacing a disk drive 81 The concurrent maintenance light for the slot stops flashing and remains on when you have installed the disk drive properly. Figure 67. Installing a small-form factor disk drive Figure 68. Installing a small-form factor disk drive in the 5803 expansion unit 82 Disk drives Perform the procedure 1. Rebuild the data on the replacement disk drive. For instructions, go to “Rebuilding data on a replacement disk drive using Linux” on page 137. 2. Replace the front cover. 3. Install or close the system, expansion unit, or rack front door. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing the disk drive using Linux kernel 2.6 Learn how to replace a disk drive in a location that is controlled by a system or logical partition that is running Linux kernel 2.6. Prepare to remove the disk drive 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown. 4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter. 5. Find the entry in the log for the disk drive you want to replace. 6. Record the location information for the disk drive. Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0 is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit). 7. Return to the command line. 8. Type the following command: ls -ld /sys/class/scsi_host/host#/device Where # is the SCSI host number. Press Enter. Kernel Messages Log Select one of the following: 1. View most recent ipr error messages 2. View ipr error messages 3. View all kernel error messages 4. View iprconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. View ipr boot time messages Selection: e=Exit Figure 69. Kernel Messages Log Replacing a disk drive 83 9. Record the PCI location information. Note: The PCI location information has the form of 61:01:0:2. 10. Type iprconfig on the command line and press Enter. The IBM Power RAID Configuration Utility display is shown. 11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter. The Display Hardware Status display is shown. 12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status. 13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. 14. Type option 1 (Display hardware resource information details) next to the disk drive you want to replace. Press Enter. The Disk Hardware Resource Information Details is displayed like the shown in the following sample. Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 70. Example Display Hardware Status 84 Disk drives 15. Record the physical location information. 16. Return to the IBM Power RAID Configuration Utility display. Remove the disk drive 1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure, then return here and perform the following steps. 2. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 3. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A Concurrent Device Remove display is shown, like the following sample. 4. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter. 5. The Verify Device Concurrent Remove display is shown. The concurrent maintenance light turns on for that disk drive slot. To view the locations of the concurrent maintenance lights, select the appropriate model from the following list: Disk Unit Hardware Resource Information Details Manufacturer . . . . . . . . . . . . . . : IBM Product ID . . . . . . . . . . . . . . . : ST336607LC Firmware Version . . . . . . . . . . . . : 43353048 (C50H) Serial Number. . . . . . . . . . . . . . : 00006719 Capacity . . . . . . . . . . . . . . . . : 36.40 GB Resource Name. . . . . . . . . . . . . . : /dev/sdd Physical location PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0 SCSI Host Number . . . . . . . . . . . . : 2 SCSI Channel . . . . . . . . . . . . . . : 0 SCSI Id. . . . . . . . . . . . . . . . . : 4 SCSI Lun . . . . . . . . . . . . . . . . : 0 Extended Details FRU Number . . . . . . . . . . . . . . . : 00P2676 More... Press Enter to Continue e=Exit q=Cancel f=PageDn b=PageUp Figure 71. Example Disk Hardware Resource Information Details display Concurrent Device Remove Choose a single location for remove operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- sdc 0000:41:01.0.1/0:8:0 Physical Disk Active 0000:41:01.0.1/0:5: Empty sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sda 0000:41:01.0.1/0:3:0 Physical Disk Active 0000:41:01.0.1/1:8: Empty sdf 0000:41:01.0.1/1:5:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active 0001:61:01.0.2/0:8: Empty sdh 0001:61:01.0.2/0:9:0 Physical Disk Active sdg 0001:61:01.0.2/0:3:0 Physical Disk Active Figure 72. Example Concurrent Device Remove display Replacing a disk drive 85 v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures” on page 133 6. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive you want to replace, then press Enter. The identify light turns on for the disk drive. Note: Ensure that the disk drive is not in use to prevent loss of data. 7. The Device Concurrent Remove display is shown. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 8. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive as shown in Figure 73. If the handle is not all the way out, the disk drive does not slide out of the system or expansion unit. 9. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 10. If your replacement disk drive does not have a bezel preinstalled, then perform the following step. Loosen and pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive, as shown in the following figure. Figure 73. Removing a disk drive from a model or 7311-D20 86 Disk drives 11. Press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the disk drive. Replace the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle them with care. 2. Remove the disk drive from its protective package. 3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install the disk-drive bezel on the replacement disk drive: a. Attach the bezel (A) to the disk drive, as shown in Figure 75 on page 88. b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. Figure 74. Removing a disk-drive bezel Replacing a disk drive 87 5. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 6. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter. A Concurrent Device Add display like the following sample is shown. 7. Type option1 (Select) next to the location from which you removed the disk drive. The Verify Device Concurrent Add display is shown. 8. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add display is shown. 9. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 10. Slide the disk drive all the way into the system or expansion unit and then lock it in place by pushing in the disk drive handle, as shown in Figure 77 on page 89. Figure 75. Attaching a bezel to the disk drive Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 76. Example Concurrent Device Add display 88 Disk drives 11. Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is installed. Perform the procedure 1. Rebuild the data on the replacement disk drive. For instructions, go to “Rebuilding data on a replacement disk drive using Linux” on page 137. 2. Replace the front cover. 3. Install or close the system, expansion unit, or rack front door. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing a disk drive with the IBM i system or logical partition powered on Learn how to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or expansion unit while the IBM i operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this information to perform this task. Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive with the IBM i system or logical partition powered on” on page 37. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. Figure 77. Installing a disk drive in a model or 7311-D20 Replacing a disk drive 89 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid potential data loss or system damage. You might also consider the powered-off procedure, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Replacing a disk drive with the system or logical partition powered off” on page 106. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. If applicable, remove the system unit door, expansion unit door, or the rack door. 3. If applicable, remove the front cover. v If you have a front cover from any other models, remove the front cover. 90 Disk drives Remove the disk drive 1. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and perform that procedure, then return here and perform the following steps. 2. Locate the disk drive to be replaced and record the location information, then use the location information to activate the identify light for that disk drive. For instructions, see “Identifying a failing part in an IBM i system or logical partition” on page 195. 3. Determine if the disk drive to be removed is a non-configured unit by performing the following steps: a. Type strsst on the command line of the IBM i session and press Enter. b. Type your service tools user ID and service tools password and press Enter. c. Select Work with disk units and press Enter. d. Select Display Disk Configuration and press Enter. e. Select Display non-configured units and press Enter. Is the failed drive a non-configured unit? v No: Continue to Verify disk is suspended. v Yes: Continue to next step. f. Determine if the disk drive to be removed was replaced by a hot spare when it failed. An xxxx9031 error logged close to the same time of the original disk failure indicates the automatic Rebuild of the data to a hot spare device. 4. Verify that the disk drive you are replacing has been suspended before continuing with this procedure. For instructions, see “Preparing to remove a disk drive from a system or logical partition controlled by IBM i” on page 61. 5. Did you find the service action log entry related to this problem? v No: Exit the service action log and go to step 7. v Yes: Continue with the next step. 6. Select the concurrent maintenance option. This option fills in the disk drive location on the concurrent maintenance display. Go to step 9 on page 92. 7. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter. The Device Concurrent Maintenance display is shown, as shown in the following figure. 8. If the physical location does not appear in the Device Concurrent Maintenance display, type the physical location where you are replacing the disk drive as shown in the preceding example, Figure 78. Device Concurrent Maintenance Type the choices, then press Enter. Specify either physical location or resource name. Physical location: U787A.001.1234567-P3-D4 or Device resource name: Specify action as 1=Remove device 2=Install device Action to be performed . . . . . . . .: 1 Enter a time value between 00 and 19. Time delay needed in minutes . . . . .: 01 F3=Exit F5=Refresh F12=Cancel Figure 78. Example Device Concurrent Maintenance display Replacing a disk drive 91 The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location information from the control panel display on the system unit or expansion unit. If you cannot get the information from the control panel, find the feature code (FC) and sequence number (SN or SEQ) on the label that appears on the front of the system unit or expansion unit as shown in the following figure. In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label, AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and concurrent maintenance lights” on page 125. In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32. 9. Select option 1 (Remove device) for the Action to be performed. 10. Set the time delay for one minute: 01. Important: Do not press Enter at this time. 11. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you are replacing. To view the locations of the concurrent maintenance lights, select the appropriate model from the following list: v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures” on page 133 Important: When you press Enter, after a 1-minute delay, this light comes on and begins to flash rapidly. You then have nine seconds to remove the disk drive. 12. Press Enter on the console. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 13. When the concurrent maintenance light starts blinking, squeeze and pull the handle of the disk drive out toward you before you remove the disk drive, as shown in the following figure. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. Figure 79. Example system or expansion unit label 92 Disk drives 14. Support the bottom of the disk drive as you slide it out of the system. Do not hold the disk drive by the handle. Note: The concurrent maintenance light will stop blinking and remain off when the disk drive is removed. 15. Loosen and pull out on the thumbscrew B to release the disk-drive bezel C from the disk drive, as shown in the following figure. Important: If the disk drive bezel does not have a thumbscrew it is not removable and is a SAS disk drive. The replacement drive should come with an attached bezel. Figure 80. Removing a disk drive Replacing a disk drive 93 Replace the disk drive 1. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press F12. 2. The physical locations you entered in step 8 on page 91 of this procedure might still appear on the display. If not, retype the physical location where you will be installing the new disk drive. In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with type 9406, Model 520, sequence number 1234567, and disk drive position P3–D4. In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit with type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32. 3. Select option 2 (Install device) for the Action to be performed. 4. Set the time delay for one minute:01. Important: Do not press Enter at this time. 5. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you are replacing. Important: When you press Enter, after a one minute delay, this light comes on and begins to blink rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by pushing in the disk drive handle. 6. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 7. Remove the disk drive from its protective package. Figure 81. Removing a disk-drive bezel 94 Disk drives 8. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk drive. If the handle is not all the way out, the disk drive can not slide into the system or expansion unit. 9. Follow these steps to install the disk-drive bezel on the replacement disk drive: a. Attach the bezel A to the disk drive, as shown in the following figure. b. Press in and then tighten the thumbscrew B to secure the bezel to the disk drive. 10. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 11. Slide the disk drive halfway into the system or expansion unit. 12. Press Enter on the console. 13. When the light starts blinking, slide the disk drive all the way in. Then, lock the disk drive in place by pushing in the disk drive handle, as shown in the following figure. Note: The light stops blinking and remains on when you have installed the disk drive properly. Figure 82. Installing a disk-drive bezel Replacing a disk drive 95 Complete the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit or rack front door. 3. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press Enter. 4. If you return to the Service Action Log display, exit the service action log. 5. When the Hardware Service Manager display is shown, press F3. 6. To complete this procedure, go to “Rebuilding data on a replacement disk drive using IBM i” on page 47. Figure 83. Installing a disk drive 96 Disk drives Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing a disk drive using hot spare in an IBM i system or logical partition You can use the hot spare function to replace a disk drive in a system or expansion unit while the IBM i operating system or logical partition that controls the disk drive location is powered on. Use the procedure in this section to perform this task. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Note: Some of the figures in these procedures might not look exactly like the system or expansion unit Replacing a disk drive 97 that you have. However, the steps to perform the task are the same. Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. Remove the front cover and door, if you need to, to access the disk drives. For instructions on removing the cover, see “System covers” on page 207. 3. Locate the disk drive to be replaced and record the location information, then use the location information to activate the identify light for that disk drive. For instructions, see “Identifying a failing part in an IBM i system or logical partition” on page 195 Remove the disk drive 1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure, then return here and perform the following steps. 2. Type strsst on the command line of the IBM i session and press Enter. 3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display, and press Enter. 4. Select Work with disk units from the System Service Tools display, press Enter. 5. Select Display Disk Configuration from the Work with Disk Units display, and press Enter. 6. Select Display non-configured units from the Display Disk Configuration display, and press Enter. 7. Is the failed drive a non-configured unit? v No: Go to “Replacing a disk drive with the IBM i system or logical partition powered on” on page 89. v Yes: Determine if the disk drive to be removed was replaced by a hot spare when it failed. A xxxx9031 error logged close to the same time of the original disk failure indicates the automatic Rebuild of the data to a hot spare device, then go on to the next step. 8. Is the disk drive to be removed controlled by the load source adapter? v No: Go to step 11. v Yes: Go to the next step. 9. Is the disk drive to be removed located in a valid load source position? v No: Go to “Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system” on page 145. v Yes: Go to step 11. 10. Choose the concurrent maintenance option. This option fills in the disk drive location on the concurrent maintenance display. Go to step 13 on page 99. 11. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter. The Device Concurrent Maintenance display is shown, as shown in the following figure. 98 Disk drives 12. If the physical location does not appear in the Device Concurrent Maintenance display, type the physical location where you will be replacing the disk drive as shown in the preceding example, Figure 84. The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location information from the control panel display on the system unit or expansion unit. If you cannot get the information from the control panel, find the feature code (FC) and sequence number (SN or SEQ) on the label that appears on the front of the system unit or expansion unit as shown in the following figure. In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label, AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and concurrent maintenance lights” on page 125. In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32. 13. Select option 1 (Remove device) for the Action to be performed. 14. Set the time delay for one minute: 01. Important: Do not press Enter at this time. 15. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you are replacing. To view the locations of the concurrent maintenance lights, select the appropriate model from the following list: v “Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures” on page 133 Important: When you press Enter, after a one minute delay, this light comes on and begins to blink rapidly. You then have nine seconds to remove the disk drive. Device Concurrent Maintenance Type the choices, then press Enter. Specify either physical location or resource name. Physical location: U787A.001.1234567-P3-D4 or Device resource name: Specify action as 1=Remove device 2=Install device Action to be performed . . . . . . . .: 1 Enter a time value between 00 and 19. Time delay needed in minutes . . . . .: 01 F3=Exit F5=Refresh F12=Cancel Figure 84. Example Device Concurrent Maintenance display Figure 85. Example system or expansion unit label Replacing a disk drive 99 16. Press Enter on the console. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. 17. When the concurrent maintenance light starts blinking, squeeze and pull the handle of the disk drive out toward you before you remove the disk drive, as shown in the following figure. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. Figure 86. Removing a disk drive 100 Disk drives 18. Support the bottom of the disk drive as you slide it out of the system. Do not hold the disk drive by the handle. Note: The concurrent maintenance light will stop blinking and remain off when the disk drive is removed. 19. Loosen and pull out on the thumbscrew B to release the disk-drive bezel C from the disk drive, as shown in the following figure. Important: If the disk drive bezel does not have a thumbscrew it is not removable and is a SAS disk drive. The replacement drive should come with an attached bezel. Figure 87. Removing a small-form factor disk drive from the 5803 expansion unit Replacing a disk drive 101 Replace the disk drive 1. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press F12. 2. The physical locations you entered in step 12 on page 99 of this procedure might still appear on the display. If not, retype the physical location where you will be installing the new disk drive. In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with type 9406, Model 520, sequence number 1234567, and disk drive position P3–D4. In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit with type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32. 3. Select option 2 (Install device) for the Action to be performed. 4. Set the time delay for one minute:01. Important: Do not press Enter at this time. 5. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you are replacing. Important: When you press Enter, after a one minute delay, this light comes on and begins to blink rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by pushing in the disk drive handle. 6. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 7. Remove the disk drive from its protective package. Figure 88. Removing a disk-drive bezel 102 Disk drives 8. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk drive. If the handle is not all the way out, the disk drive can not slide into the system or expansion unit. 9. Follow these steps to install the disk-drive bezel on the replacement disk drive: a. Attach the bezel A to the disk drive, as shown in the following figure. b. Press in and then tighten the thumbscrew B to secure the bezel to the disk drive. 10. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 11. Slide the disk drive halfway into the system or expansion unit. 12. Press Enter on the console. 13. When the light starts blinking, slide the disk drive all the way in. Then, lock the disk drive in place by pushing in the disk drive handle, as shown in the following figure. Note: The light stops blinking and remains on when you have installed the disk drive properly. Figure 89. Installing a disk-drive bezel Replacing a disk drive 103 Perform the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit or rack front door. 3. Return to the console and wait until the Concurrent Maintenance Results display is shown, and press Enter if you used the concurrent maintenance process above, otherwise continue with the next step. Figure 90. Installing a disk drive Figure 91. Installing a small-form factor disk drive in the 5803 expansion unit 104 Disk drives 4. If you return to the Hardware Service Manager (HSM), exit the HSM. 5. Select Work with disk units from the Start System Service Tools display. 6. Select Non-configured disk units from the Work with Disk Units display, and press Enter. 7. Is the new disk unit displayed with a status of non-configured? Note: It may take several minutes for the new disk drive to display. v No: Contact your next level of support. v Yes: Continue with the next step. 8. Press F3 to return to the Work with Disk Units display. 9. Select Work with disk unit recovery from the Work with Disk Units display, and press Enter. 10. Select Disk unit problem recovery procedure, and press Enter. 11. Select Initialize and format disk unit, and press Enter. 12. Select the new disk and press Enter. Note: This will take several minutes to complete. 13. When the disk drive is initialized and formatted, press F3 to return to the Work with Disk Units display. 14. Select Start hot spare, and press Enter. 15. Select the IOA with the new disk, then press Enter and press Enter again. 16. Does the new hot spare disk drive have a smaller capacity than previous hot spare disk drive? Refer to the resource name identified in the xxxx9031 error which was logged close to the same time of the original disk failure. v No: The procedure is complete. Press F3 to return to the System service tools display. v Yes: Go to the next step. 17. Select Display Device Parity Status, and press Enter. If the new disk drive is not active and hot spare protected, do not continue. Call your next level of support. Restriction: You must complete the following steps to ensure your data is protected. The following steps will allow you to exchange the two disk drives so that the larger disk drive will become the hot spare, and the smaller disk drive will become a parity set member. 18. Fail the previous hot spare disk drive by unlocking the disk drive handle by squeezing it and pulling it out toward you. 19. Pull the drive out about one inch and wait ten seconds. 20. Push the disk drive back into the slot and close the handle. 21. Return to step 3 on page 104 and start hot spare on the larger drive. Replacing a disk drive 105 Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing a disk drive with the system or logical partition powered off You might need to install a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or logical partition that controls the disk drive location with the power off. Use the procedure in this section to perform this task. Attention: If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive with the system or logical partition power turned off” on page 49. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. 106 Disk drives DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Prepare the system 1. Perform prerequisite tasks as described in “Before you begin” on page 189. 2. If you cannot start the system or logical partition that controls the disk drive location, or if the system or logical partition are not powered on, go to step 6 on page 108. 3. Identify the disk drive to be replaced and record the location information. For information, see “Identifying a failing part” on page 193. 4. Prepare to remove the disk drive from the system or expansion unit as follows: v If the system or logical partition that controls the disk drive location is running AIX, follow the steps in “Preparing to remove a disk drive from a system or expansion unit controlled by AIX” on page 137. v If the system or logical partition that controls the disk drive location is running IBM i, follow the steps in “Preparing to remove a disk drive from a system or logical partition controlled by IBM i” on page 61. Replacing a disk drive 107 v If the system or logical partition that controls the disk drive location is running Linux and the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/ infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm. 5. Stop the system or logical partition that controls the disk drive location. For instructions, see “Stopping a system or logical partition” on page 201. 6. If you stopped the logical partition, do not disconnect the power source. If you stopped the system, disconnect the power source from the system by unplugging the system. Note: This system might be equipped with a second power supply. Before continuing with this procedure, ensure that all power sources to the system have been completely disconnected. (L003) or 7. Remove the system unit door or expansion unit door or open the rack front door. For instructions, select the appropriate procedure from the following list: v “Removing and replacing the expansion unit cover or door” on page 205 8. Remove the front cover. Remove the disk drive 1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure, then return here and perform the following steps. 2. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 92 on page 109. 108 Disk drives Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. Figure 92. Removing a disk drive from a model or 7311-D20 Replacing a disk drive 109 3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 4. If your replacement disk drive does not have a bezel preinstalled, then perform the following step. Loosen and then pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive, as shown in the following figure. Important: If the disk drive bezel does not have a thumbscrew it is not removable and is a SAS disk drive. The replacement drive should come with an attached bezel. Figure 93. Removing a small-form factor disk drive from the 5803 expansion unit 110 Disk drives Replace the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 2. Remove the disk drive from its protective package. 3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk drive. If the handle is not all the way out, the disk drive will not slide into the system or expansion unit. 4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install the disk-drive bezel on the replacement disk drive: a. Attach the bezel (A) to the disk drive, as shown in the following figure. b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. Figure 94. Removing a disk-drive bezel Replacing a disk drive 111 5. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 6. Slide the disk drive into the system or expansion unit until it stops. Note: It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. 7. Push the disk drive handle in until it locks, as shown in Figure 96 on page 113. Figure 95. Attaching a disk-drive bezel 112 Disk drives Complete the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit or rack front door. 3. Reconnect all system power cables. 4. Reconnect the system to the power supply by plugging it in. Figure 96. Installing a disk drive in a model or 7311-D20 Figure 97. Installing a small-form factor disk drive in the 5803 expansion unit Replacing a disk drive 113 5. Start the system or logical partition. For instructions, see “Starting the system or logical partition” on page 199. 6. If the system or logical partition that controls the disk drive you replaced is running AIX, verify that the disk drive is working as described in “Verifying an installed feature or replaced part in an AIX system or logical partition” on page 249. If it is not possible to run online diagnostics, for example if you replaced the root volume group (rootvg) disk drive, use standalone diagnostics as described in “Verifying an installed part by using stand-alone diagnostics” on page 253. 7. To rebuild the data on the replacement disk drive, select the operating system for the system or logical partition that controls the disk drive: v “Rebuilding data on a replacement disk drive using AIX” on page 137 v “Rebuilding data on a replacement disk drive using IBM i” on page 47 v “Rebuilding data on a replacement disk drive using Linux” on page 137 Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing a SCSI disk drive Learn how to replace a Small Computer System Interface (SCSI) disk drive in a system or expansion unit. Use the procedure in this information to perform this task. Remove the disk drive 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. Press Enter to continue. 4. On the Function Selection display, select Task Selection. 5. On the Task Selection display, select Hot Plug Task. 6. Select SCSI and SCSI RAID Hot Plug Manager. 7. Select Identify a Device Attached to a SCSI Hot Swap Enclosure Device. 8. Select the disk drive you want to replace and press Enter. The disk drive slot enters the Identify state, indicated by the flashing concurrent maintenance light. To view the locations of the concurrent maintenance lights, go to “Disk drive locations and concurrent maintenance lights” on page 125. 9. Exit to the SCSI and SCSI RAID Hot Plug Manager display. 10. On the SCSI and SCSI RAID Hot Plug Manager display, select Replace/Remove a Device Attached to a SCSI Hot Swap Enclosure Device. A list of all slots from which you can remove a disk drive is shown. 11. Select the slot that contains the disk drive you want to remove. The disk drive slot enters the Remove state and power is removed from the slot. The concurrent maintenance light for the slot begins flashing rapidly. 12. Unlock the disk drive. Unlock the disk drive handle by squeezing it and pulling it out toward you as shown in the following figures. 114 Disk drives 13. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. Note: The concurrent maintenance light for the slot turns off when you remove the disk drive. 14. Loosen and pull out on the thumbscrew B to release the bezel C from the disk drive. Figure 98. Removing a disk drive from a model or 7311-D20 Figure 99. Removing a small-form factor disk drive from the 5803 expansion unit Replacing a disk drive 115 Replace the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 2. Remove the disk drive from its protective package. 3. Unlock the handle of the replacement disk drive by squeezing and pulling it out toward you. If the handle is not all the way out, the disk drive cannot slide into the system. 4. Attach disk-drive bezel A to the replacement disk drive, as shown in the following figure. Figure 100. Removing a disk-drive bezel 116 Disk drives 5. Press in and then tighten thumbscrew B to secure bezel A to the disk drive. 6. Install the disk drive as follows: a. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. b. Slide the disk drive into the system until it stops. Note: It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. c. Push the disk drive handle in until it locks, as shown in Figure 102 on page 118. Figure 101. Attaching a disk-drive bezel Replacing a disk drive 117 7. Press Enter. 8. Exit to the SCSI and SCSI RAID Hot-Plug Manager menu. 9. Select Configure Added/Replaced Devices. Press Enter. 10. Select the disk drive that you replaced and then press Enter. The disk drive is now configured. 11. Exit to the Task Selection display. Figure 102. Installing a disk drive in a model a model or 7311-D20 Figure 103. Installing a small-form factor disk drive in the 5803 expansion unit 118 Disk drives 12. Select Log Repair Action. 13. Select the disk drive that you replaced and then press Enter. 14. Select Commit after you have made your selection and then press Enter. 15. Exit to the command line. Perform the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit, or rack front door. 3. Verify that the disk drive is working. For instructions, see “Verifying an installed feature or replaced part in an AIX system or logical partition” on page 249. If it is not possible to run online AIX diagnostics, for example if you replaced the root volume group (rootvg) disk drive, use stand-alone diagnostics as described in “Verifying an installed part by using stand-alone diagnostics” on page 253. 4. To prepare the disk drive for use, go to “Rebuilding data on a replacement disk drive using AIX” on page 137. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Replacing a disk drive 119 120 Disk drives Removing or installing a disk-drive filler You might need to remove or install a Small Computer System Interface (SCSI) disk-drive filler in the server or expansion unit. This section includes procedures so that you can perform these tasks. Note: Refill the disk drive slots with either another disk drive or a disk-drive filler. Filling the disk drive slot will help ensure proper air flow for cooling and help maintain optimal EMI compliance. Removing a disk-drive filler Learn how to remove a disk-drive filler from the server or expansion unit. Use the procedure in this information to perform this task. Note: Some of the figures in this procedure might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. 1. Remove the system unit door, expansion unit door, or open the rack front door. For instructions, see “Removing and replacing the expansion unit cover or door” on page 205. 2. Remove the front cover. For instructions on removing the cover, see “System covers” on page 207. If you have a front cover from any other models, remove the front cover. 3. Identify the disk-drive filler you want to remove. 4. Pull the disk-drive filler from the slot. Pull the disk-drive filler from the slot that you want to use, as shown in Figure 104 on page 122 and Figure 105 on page 122. Attention: v Attach a wrist strap to an unpainted surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted surface of the system for a minimum of 5 seconds. © Copyright IBM Corp. 2010 121 Figure 104. Removing a disk-drive filler from model 8233-E8B or 8236-E8C Figure 105. Removing a disk-drive filler from model 9117-MMB or 9179-MHB 122 Disk drives Installing a disk-drive filler You might need to install a disk-drive filler in the server or expansion unit. Use the procedure in this section to perform this task. Note: Some of the figures in this procedure might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. 1. Remove the system unit door, expansion unit door, or open the rack front door. For instructions, see “Removing and replacing the expansion unit cover or door” on page 205. 2. Remove the front cover. For instructions on removing the cover, see “System covers” on page 207. If you have a front cover from any other models, remove the front cover. 3. Install the disk-drive filler in the system or expansion unit, as shown in the following figures. Figure 106. Installing a disk-drive filler for model 8233-E8B or 8236-E8C Removing or installing a disk-drive filler 123 4. Replace the front cover. 5. Install or close the system, expansion unit or rack front door. Related concepts Removing and replacing covers and doors Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Related reference “Removing and replacing the expansion unit cover or door” on page 205 You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Figure 107. Installing a disk-drive filler for model 9117-MMB or 9179-MHB 124 Disk drives Disk drive locations and concurrent maintenance lights Find out where the disk drives and concurrent maintenance lights are located on different models. Locate your system or expansion unit to find the disk drive locations and where the concurrent maintenance lights are located. PCIe RAID and SSD SAS adapter locations Learn about the PCIe RAID and SSD SAS adapter locations and the location of the concurrent maintenance lights on the PCIe RAID and SSD SAS adapter. The figure Figure 108 shows the PCIe RAID and SSD SAS adapter locations. Disk drive locations and concurrent maintenance lights for an 8202-E4B or 8205-E6B system Learn about the disk drive locations and understand where the concurrent maintenance lights are located for your system. The following figures show the disk drive locations and concurrent maintenance lights for the 8202-E4B or 8205-E6B. Concurrent maintenance lights are located above the latch handle on the disk drives. Figure 108. PCIe RAID and SSD SAS adapter locations © Copyright IBM Corp. 2010 125 Figure 109. Disk drive locations for the rack model Figure 110. Disk drive locations for the tower model 126 Disk drives Disk drive locations and concurrent maintenance lights for 8231-E2B Learn about the disk drive locations and the location of the concurrent maintenance lights on the system. The following figures show the disk drive locations and concurrent maintenance lights for the 8231-E2B. Concurrent maintenance lights are located above the latch handle on the disk drives. Figure 111. Concurrent maintenance light locations for the rack model Figure 112. Concurrent maintenance light locations for the tower model Disk drive locations and concurrent maintenance lights 127 Figure 113. Disk drive locations for base model with six SFF bays Figure 114. Disk drive locations for model with three SFF bays (optional) Figure 115. Concurrent maintenance light locations for base model with six SFF bays 128 Disk drives Disk drive locations and concurrent maintenance lights for 8233-E8B and 8236-E8C Learn the disk drive locations and where the concurrent maintenance lights are located for your system. The following figures show the disk drive locations and concurrent maintenance lights for the 8233-E8B and 8236-E8C. Concurrent maintenance lights are located above the latch handle on the disk drives. Figure 116. Concurrent maintenance light locations for model with three SFF bays (optional) Figure 117. Disk drive locations Disk drive locations and concurrent maintenance lights 129 Disk drive locations and concurrent maintenance lights for models 9117-MMB or 9179-MHB Learn the disk drive locations for your system. The following figures show the disk drive locations and concurrent maintenance lights for the IBM model 9117-MMB or 9179-MHB. Figure 118. Concurrent maintenance light locations Figure 119. Disk drive locations 130 Disk drives Disk drive locations and concurrent maintenance lights for the 5886 disk enclosure Learn the disk drive locations and where the concurrent maintenance lights are located for your disk enclosure. The following figure shows the disk drive locations for the 5886 disk enclosure. Figure 120. Concurrent maintenance light locations Disk drive locations and concurrent maintenance lights 131 The following figure shows the locations of the disk-drive concurrent maintenance lights for the 5886 disk-drive enclosure. Figure 121. Disk drive locations for the 5886 disk enclosure Figure 122. Front view of disk-drive enclosure concurrent maintenance lights for model 5886 132 Disk drives Disk drive locations and concurrent maintenance lights for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures Learn the disk drive locations and where the concurrent maintenance lights are located for your disk-drive enclosure. The availability of disk drive locations depends on the number of SCSI interface cards installed in the SCSI disk-drive enclosure. If the next available disk drive position is the first slot in a disk-drive enclosure, such as D21, D26, or D46, look at the disk-drive controller to verify that the enclosure is connected to a SCSI bus port. The controller end of the cable is clearly labeled with the disk-drive enclosure to which it connects. If the enclosure is not connected to a disk-drive controller, you must either install the disk drive in a different disk-drive enclosure or connect the enclosure to a SCSI bus port on a disk-drive controller. You might need to add a disk-drive controller before you can add disk drives to the expansion unit. The following figures show the disk drive locations for the front and back of the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures. Figure 123. Front view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures Disk drive locations and concurrent maintenance lights 133 The following figures show the locations of the disk-drive concurrent maintenance light A for the models 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures. Figure 124. Back view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures Figure 125. Front view of SCSI disk-drive enclosure concurrent maintenance lights for model 5786, 5787, 7031-D24, and 7031-T24 Figure 126. Back view of SCSI disk-drive enclosure concurrent maintenance lights for model 5786, 5787, 7031-D24, and 7031-T24 134 Disk drives Disk drive locations and concurrent maintenance lights for the 5802 expansion unit Learn the disk drive locations and where the concurrent maintenance lights are located for your expansion unit. The following figure shows the disk drive locations for the 5802. Concurrent maintenance lights are located at the bottom of the latch handle on each disk drive. Related information Cabling the 5802 disk subsystem Figure 127. Disk drive locations Disk drive locations and concurrent maintenance lights 135 136 Disk drives Related information Locate additional disk-drive related information and instructions you might need. Preparing a disk drive for use in an AIX system or logical partition Configure the newly installed disk drive for your environment. If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference Guide for AIX. If the disk drive is attached to another controller, see the AIX System Management Guide: Operating System and Devices. These guides are available at the System p and AIX Information Center. (http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp) Preparing to remove a disk drive from a system or expansion unit controlled by AIX Learn the actions you need to take to properly remove a disk drive. Before removing a disk drive from a system or expansion unit that is controlled by AIX, ensure that all data on that disk drive or the array that contains the drive is backed up and removed from the disk drive and that the disk drive is in the Defined state. If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference Guide for AIX. If the disk drive is attached to another controller, see the information for that controller or the AIX System Management Guide: Operating System and Devices. These guides are available at the System p and AIX Information Center. (http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp) Rebuilding data on a replacement disk drive using AIX To rebuild data on the replacement disk drive, refer to the information for the controller to which the disk drive is attached. v If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference Guide for AIX. v If the disk drive is attached to a PCI SCSI RAID adapter (feature code 2498), see the PCI 4–Channel Ultra3 SCSI RAID Adapter Reference Guide. For more information, see the AIX System Management Guide: Operating System and Devices.These guides are available at the System p and AIX Information Center. (http://publib.boulder.ibm.com/infocenter/ pseries/v5r3/index.jsp) Rebuilding data on a replacement disk drive using Linux Use this procedure to rebuild data on the replacement disk drive. To rebuild data on the replacement disk drive, complete the steps below. For an unprotected disk drive If the disk drive you are replacing is in a RAID Level 0 disk array or in a failed RAID Level 5 or RAID Level 10 disk array, perform these tasks: 1. Re-create the disk array. © Copyright IBM Corp. 2010 137 2. Re-create the file systems on the disk array. 3. Copy the data back to the restored disk array from your backup media. For information on these tasks, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/infocenter/pseries/ v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm. For a disk drive in a RAID Level 5 or RAID Level 10 disk array, select the appropriate procedure: Rebuilding data on a disk drive using Linux kernel 2.4 Complete the steps listed here to rebuild your data. If a hot-spare disk drive was available and protected the disk array throughout the disk drive failure, you can configure the newly installed disk drive as a hot spare. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/ infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm. If a hot spare rebuild was not initiated by the controller, you need to initiate a rebuild on the newly installed disk drive. To initiate the rebuild, follow these steps: 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The Work with Disk Units display is shown. 3. From the Work with Disk Units display, select Work with disk unit recovery. The Work with Disk Unit Recovery display is shown. 4. From the Work with Disk Unit Recovery display, select Rebuild disk unit data. A Rebuild Disk Unit Data display similar to the following figure is shown below. 5. Type option 1 (Rebuild) next to the disk drive you want to rebuild and press Enter. Note: Performing a rebuild on a disk drive overwrites the data currently on the disk drive. A Confirm Rebuild Disk Unit Data display similar to the following figure shown below. Rebuild Disk Unit Data Select the units to be rebuilt Type choice, press Enter. 1=Rebuild Vendor Product Serial PCI PCI SCSI SCSI SCSI Option ID ID Number Bus Dev Bus ID Lun IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0 e=Exit q=Cancel f=PageDn b=PageUp Figure 128. Example Rebuild Disk Unit Data display 138 Disk drives 6. To confirm that you want to rebuild the data on the disk drive, press Enter. The message that the rebuild has begun is shown at the bottom of the Work with Disk Unit Recovery display. The rebuild process might take several minutes to complete. Rebuilding data on a disk drive using Linux kernel 2.6 Learn the steps to rebuild your data on a disk drive. If a hot-spare disk drive was available and protected the disk array throughout the disk drive failure, you can configure the newly installed disk drive as a hot spare. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/ infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm. If a hot-spare rebuild was not initiated by the controller, you need to initiate a rebuild on the newly installed disk drive. To initiate the rebuild, follow these steps: 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press Enter. The Work with Disk Unit Recovery display is shown. 4. From the Work with Disk Unit Recovery display, select Rebuild disk unit data. A Rebuild Disk Unit Data display similar to the following is shown. Confirm Rebuild Disk Unit Data Rebuilding the disk unit data could take several minutes for each unit selected. Press Enter to confirm having the data rebuilt. Press q=Cancel to return and change your choice. Vendor Product Serial PCI PCI SCSI SCSI SCSI Option ID ID Number Bus Dev Bus ID Lun 1 IBM IC35L036UCDY10-0 E3V0J55B 353 01 0 5 0 q=Cancel f=PageDn b=PageUp Figure 129. Example Confirm Rebuild Disk Unit Data display Related information 139 5. Type option 1 (Rebuild) next to the disk drive you want to rebuild and press Enter. Note: Performing a rebuild on a disk drive overwrites the data currently on the disk drive. A Confirm Rebuild Disk Unit Data display similar to the following is shown. 6. To confirm that you want to rebuild the data on the disk drive, press Enter. The message that the rebuild has begun is shown at the bottom of the Work with Disk Unit Recovery display. The rebuild process might take several minutes to complete. Rebuilding data on a replacement disk drive using IBM i Learn how to rebuild data on replacement disk. To rebuild data on the replacement disk drive, complete the following steps: 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case sensitive. 3. Select Work with disk units from the Start System Service Tools (SST) display. Press Enter. 4. Select Work with disk unit recovery from the Work with Disk Units display. Press Enter. Rebuild Disk Unit Data Select the disks to be rebuilt Type choice, press Enter. 1=Rebuild OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ------------- 1 0000:58:01.0.0/0:4:0 RAID Array Member Failed e=Exit q=Cancel t=Toggle Figure 130. Example Rebuild Disk Unit Data display Confirm Rebuild Disk Unit Data Rebuilding the disk unit data might take several minutes for each disk selected. Press Enter to confirm having the data rebuilt. q=Cancel to return and change your choice. OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- --------------- 1 0000:58:01.0.0/0:4:0 RAID Array Member Failed q=Cancel t=Toggle Figure 131. Example Confirm Rebuild Disk Unit Data display 140 Disk drives 5. If you determined that the disk drive you replaced had device parity protection (RAID), follow these steps to restore device parity protection (RAID) on the disk drive: a. Select Rebuild disk unit data on the Work with Disk unit recovery display. Press Enter. b. Select 1 to rebuild the disk drive displayed (the disk drive displayed is the disk drive that you removed) on the Rebuild Disk Unit Data display. Press Enter. c. Press Enter on the Confirm Rebuild Disk Unit Data display. The rebuild process might take several minutes to complete. d. Press F5 to refresh the display until the Percent complete shows 5%. e. When the display shows at least 5% complete, you can either continue to monitor this display to completion, or press F3 (Exit) to return to the Work with disk units display. f. Press F3 (Exit) to return to the System service tools display. g. Press F3 (Exit) to return to the Exit SST display and press Enter. h. Return to the procedure that sent you here. 6. If the disk drive that you replaced had mirrored protection, follow these steps: a. Select Replace configured unit on the Work with Disk unit recovery display. Press Enter. b. Select the configured disk drive that you are exchanging (suspended drive) on the Select Configured Unit to Replace display. Press Enter. c. Select the disk drive that you just installed on the Select Replacement Unit display. This drive has a non-configured status. Note: In some cases, it might take several minutes for a new disk drive to display. Repeat these steps until the new drive is shown. Press Enter. d. Press Enter on the Confirm Replace of Configured Unit display to confirm your choice for replacement. The replacement process might take several minutes to complete. When the process is complete, the Work with Disk unit recovery display is shown. e. Press F3 (Exit) to return to the Work with disk units display. f. Select Display disk configuration on the Work with disk units display. g. Select Display disk configuration status on the Display Disk Configuration display. Mirrored status shows Resuming. When complete, the mirrored status shows Active. This process might take several minutes to complete. You can either monitor this display to completion, or press F3 (Exit) three times, and then press Enter to return to the IBM i main menu. Configuring a disk drive in an system or logical partition To configure a newly installed disk drive into an existing device parity set, follow these steps: If you want to start a new device parity set or change the protection on your disks, see Manage device parity protection or Start device parity protection in the Disk Management section of the IBM i Information Center website at http://www.ibm.com/systems/i/infocenter . 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Restriction: The service tools password is case sensitive. 3. Select Work with Disk Units on the System Service Tools display. Press Enter. 4. Select Display disk configuration on the Work with Disk Units display. Press Enter. 5. Select Display non-configured units on the Display disk configuration display. Press Enter. Related information 141 6. The disk drive that you installed is listed. The serial number should match the serial number you recorded (the last 4 digits). Note: The new disk drive might take up to five minutes to appear in the list. If the disk drive is not listed, ensure that the disk drive was properly installed. 7. Press F12 twice. 8. Select Work with disk configuration on the Work with Disk Units display. Press Enter. 9. Are you adding a disk drive to an existing device parity set? v Yes: Continue with the next step. v No: Go to step 23 on page 46 for mirroring or no protection, go to 1 on page 47 for hot spare. Tip: Select No for mirroring, hot spare or no protection. 10. Select Work with device parity protection on the Work with Disk Configuration display. Press Enter. 11. Select Include unit in device parity protection on the Work with Device Parity Protection display. Press Enter. 12. Did the Include Device Parity Protection Failed display appear? v Yes: Continue with the next step. v No: Go to step 15 on page 46. 13. The following conditions must be met before you can start device parity protection. v If there are enough disk drives available to create a new parity set, the drives are eligible for Start Device Parity Protection operation and not for the Include Device Parity Protection operation. v All disk drives in a parity set must be the same capacity with a minimum number of three or four disk drives and maximum of 10 or 18 disk drives in the resulting parity set, depending on the disk drive type. v Not all disk drives attached to an advanced function input output processor have reported to the system. Retry the operation. v The type or model of the disk drives cannot be used for the requested operation. When these conditions are met, press F12 to return to the Work with Disk Configuration display. 14. Select Include unit in device parity protection on the Work with disk configuration display. Press Enter. 15. On the Include Disk Units in Device Parity Protection display, select the disk drives to be included in Device Parity Protection. Type a 1 in the Option column. Press Enter. 16. Press Enter to confirm your choice. The configuration starts and might take 30 to 60 minutes to complete. 17. When the device parity protection is complete, the message “Selected drives have been included successfully” is shown on the Work with device parity protection display. Press F12. 18. Select Display disk configuration on the Work with disk configuration display. Press Enter. 19. Select Display disk configuration status on the Display disk configuration display. 20. If you have more than one ASP, determine the auxiliary storage pool (ASP) to which you want to add the disk drive. 21. Press F12 twice to get to the Work with Disk Units display. 22. Select Work with disk configuration on the Work with Disk Units display. Press Enter. 23. Select the option to add units to ASPs and balance data on the Work with disk configuration display. Press Enter. 24. Specify the number of the ASP to which you want to add the disk drive. The system ASP is ASP 1. Press Enter. Notes: a. If ASP is mirror protected, disk drives must be added in pairs of the same size. 142 Disk drives b. When one or more pairs of disk drives are added to a mirrored ASP, they automatically become mirror protected. c. To start mirrored protection on an ASP, refer to Recovering your system in the IBM i Information Center. To access the IBM i Information Center Web site, go to http://www.ibm.com/systems/i/ infocenter and select from the following options: v If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup and recovery > Recovering your system v If you select V5R4, expand i5/OS Information Center > Systems management > Backup and recovery > Backing up and recovering your system > Recovering your system 25. The Confirm Add Units display is shown showing the configuration your system will have when the add operation completes. Note: If you have selected the wrong ASP, press F12 to change your options. Press the Enter key to continue. The add process will take several minutes to complete. 26. When the message Selected units have been added successfully is shown, press F3 three times and press Enter to return to the Main Menu display. 27. If you have a printer, print the configuration list. A service provider might refer to the configuration list in the future. a. You need to sign on with at least service level authority b. Type strsst on the Main Menu command line. Press Enter. c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case sensitive. d. Select Start a service tool on the System Service Tools (SST) display. Press Enter. e. Select Hardware service manager on the Start a Service Tool display. Press Enter. f. Press F6 (print the configuration) from the Hardware Service Manager display. g. To return to the Main Menu display, press F3 (Exit) twice and then press Enter. h. Place the configuration list where you can refer to it in the future. 28. If you are a service provider performing this task, return to the procedure that sent you here. 1. Select Work with hot spare protection from the Work with disk configuration menu. 2. Select Start device parity protection - RAID 5 with hot spare or Start device parity protection - RAID 6 with hot spare depending on the level of parity protection that is desired. 3. Return to the procedure that sent you here. Preparing to remove a disk drive from a system or logical partition controlled by IBM i Learn the actions you need to take to properly remove a disk drive. Before removing a disk drive from a system or expansion unit that is controlled by IBM i, follow these steps: 1. Determine the protection status of the disk drive that you are replacing. For instructions, see “Determining the IBM i disk drive protection status” on page 48. 2. The service provider must perform one of the following tasks: v Save data and initialize the disk drive. v Initialize the disk drive and copy to disk. 3. For mirrored disk drives on an IBM i system or logical partition, follow these steps to verify that the disk drive you are replacing has been suspended: Related information 143 a. Sign on with service level authority. b. Type strsst on the command line of the IBM i session and press Enter. c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case sensitive. d. Select Work with disk units from the System Service Tools (SST) display. Press Enter. e. Select Display Disk Configuration from the Work with Disk Units display. Press Enter. f. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter. g. Does the disk drive that is mirrored to the disk drive that you are replacing (same drive number) have a status of Active? v No: The replacement must be performed by a service provider. v Yes: Does the disk drive that you are replacing have a status of Suspended? – Yes: Continue with the next step. – No: Follow these steps to suspend mirrored protection on the disk drive that you are replacing: 1) Press F3 from the Display Disk Configuration display to return to the Work with Disk Units display. 2) Select Work with Disk Unit Recovery from the Work with Disk Units display. Press Enter. 3) Select Suspend mirrored protection from the Work with Disk Unit Recovery display. Press Enter. 4) Select the option to suspend the disk drive that you are replacing from the Suspend Mirrored Protection display. Press Enter. h. Press F3 repeatedly to exit System Service Tools and return to the main menu. Determining the IBM i disk drive protection status To determine the protection status of the disk drives in your IBM i system or logical partition, follow these steps from the System Service Tools (SST) display: Tip: If you do not know the protection status of the disk drives and you cannot get to the System Service Tools display, contact your next level of support. 1. Sign on with at least service level authority. 2. Type strsst on the command line of the IBM i session and press Enter. 3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case-sensitive. 4. Select Work with disk units from the System Service Tools (SST) display. Press Enter. 5. Select Display Disk Configuration from the Work with Disk Units display. Press Enter. 6. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter. A list of each auxiliary storage pool (ASP) displays, showing the disk drives that are members of the ASP. The Status column shows one of the following protection statuses for the ASP: v Mirrored. These ASPs are configured for mirrored protection. If the failing disk drive has mirrored protection, record the status of both disk drives in the mirrored pair. You need this information in the recovery procedure. 144 Disk drives v Unprotected. An ASP with a status of Unprotected might contain disk drives that are device parity protected. Record the status of the failing disk drive as it is shown in the Status column of the Display Disk Configuration Status display. A disk drive with one of the following statuses has device parity protection only if all other disk drives in the array are operational: – DPY/Active – DPY/Failed – DPY/HDW Failure – DPY/Degraded – DPY/Power Loss – DPY/Not Ready – DPY/Unknown A disk drive with any other status (such as DPY/Rebuild or DPY/Unprotected) does not have device parity protection. Examples: – Suppose one disk drive has a status of DPY/Failed while all other disk drives have a status of DPY/Unprotected. In this case, only the disk drive with the DPY/Failed status is protected. Servicing the failed disk drive does not result in data loss. Removing any of the unprotected disk drives might result in data loss. – All disk drives have a status of DPY/Active or DPY/Degraded, therefore, all disk drives are protected. In this case, exchanging any disk drive does not result in data loss. Tip: For more information about the meaning of a status for any disk drive, use the Help key on the display. Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system To start hot spare protection on your load source adapter you will need to determine the status of the disk drive in the load source slot and configure or exchange that disk drive, which might require you to power down the system. 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case sensitive. 3. Determine the protection status of all disks controlled by the load source adapter. Do all disk drives that are a member of a parity sets or mirror sets have a status of active? v No: Replace the disk using the following procedure in, “Replacing a disk drive with the IBM i system or logical partition powered on” on page 89. v Yes: Continue to the next step. Restriction: There must be a disk drive of the same or larger capacity located in a load source capable slot to act as a hot spare for the active load source disk drive. The next steps will have you change the disk drive in the desired slot if needed. Attention: If the disk drive acting as hot spare for the load source is of smaller capacity you can suffer data loss. 4. Determine whether a non-configured disk drive of the same or larger capacity is located in a load source capable slot but is not the active load source. Only parallel SCSI places a restriction on the location of the load source. Any SAS, Fibre Channel, or virtual disk drive does not have this restriction. Related information 145 v No non-configured drive is located in a load source capable slot. Continue with the next step v Yes a non-configured drive is located in a load source capable slot. Return to the procedure that sent you here, and continue to the next step of that procedure. 5. Choose the disk drive to be installed or exchanged with the drive in the load source slot by doing the following: a. Select a disk drive in a parity set located in a load source capable slot, but is not the active load source drive, and record the location of this disk drive. b. Select a non-configured disk drive of the same or larger capacity from either a non-load source slot or a failed drive you are replacing with a non-configured disk drive. 6. Stop the system. For instructions, see “Stopping a system or logical partition” on page 201. 7. Exchange the disk drives from the two locations you recorded in step 5 by removing both disk drives and placing them into the other disk drives slot. 8. Start the system. For instructions, see “Starting the system or logical partition” on page 199. 9. Return to the procedure that sent you here and continue with the next step in that procedure. 146 Disk drives Internal disk drive sharing on model 8202-E4B or 8205-E6B system Learn how to split the internal disks in the system unit enclosure into groups, which you can manage separately. Use the following information to perform the task. DASD backplane supports Split DASD mode. FC 5631 split DASD enablement card is used to put DASD backplane in the Split DASD mode in which 3 SFF DASD on left (from front view) are assigned to the split DASD enablement card (CCIN 2BE0), and 3 SFF DASD on right are assigned to the integrated storage controller (CCIN 572C). A PCIe or PCIX SAS adapter (such as FC 5901, FC5904, or FC5912) can access the 3 SFF DASD through an external AI SAS cable, as shown in Figure 132 on page 148. Important: This configuration is not supported with the 5630 backplane feature code. Use the following information after the adapter is installed. For more information, see PCI adapters. Perform the prerequisite tasks as described in “Before you begin” on page 189. Note: Internal cable FC 3670 for an 8202-E4B or 8205-E6B must be installed to enable the split DASD configuration. For installation instructions, see “Installing the external SAS port on model 8202-E4B or 8205-E6B” on page 153. Perform the following tasks to split the internal disks in the system unit enclosure into groups, which you can manage separately. 1. Stop and turn off the power to the system, see Stopping the system or logical partition. 2. Cable the system by connecting the SAS cable to the adapter and then to the internal SAS connection, as shown in Figure 132 on page 148. © Copyright IBM Corp. 2010 147 3. Start the system, see “Starting the system or logical partition” on page 199. 4. Verify that the feature is installed and working, see “Verifying the installed part” on page 249. After performing this procedure, three disks in the system enclosure is managed by the split DASD enablement card and three disks is managed by the internal system. You can now manage the disks as you would manage any other disks. Notes: The second connector on the adapter can be used to attach a disk expansion or media expansion drawer. Figure 132. SAS adapter to internal SAS disk slots through the system external SAS port 148 Disk drives Internal disk drive sharing on an 8233-E8B or 8236-E8C You might need to cable and understand the function of this feature. Use the following information to complete these tasks. DASD backplane supports Split DASD mode. FC 3669 internal SAS cable for Split DASD mode is used to put DASD backplane in the Split DASD mode in which 4 SFF DASD on left (from front view) are assigned to the integrated SAS controller, and 4 SFF DASD on right are assigned to the external rear SAS port. A PCIe or PCIX SAS adapter (such as FC 5901, FC5904, or FC5912) can access the right 4 SFF DASD via an external AI SAS cable, as shown in the figure below. The following information is for use after the adapter is installed. Install the adapter and then return here. See the PCI adapters topic for more information, PCI adapters. You should also review the tasks in “Before you begin” on page 189 if you have not already done so. Note: Internal cable FC3669 for model 8233-E8B or 8236-E8C must be installed to enable this configuration. For installation instructions, see Installing the external SAS port on a model 8233-E8B or 8236-E8C This feature allows you to split the internal disks in the system unit enclosure into groups which you can manage separately. 1. Stop and power off the system if it is not already, see “Stopping a system or logical partition” on page 201. 2. Cable the system by connecting the SAS cable to the adapter and then to the internal SAS connection, as shown in the following graphic. © Copyright IBM Corp. 2010 149 3. Start the system, see “Starting the system or logical partition” on page 199. 4. Verify that the feature is installed and working, see “Verifying the installed part” on page 249. Once you have completed the procedure, four of the disks in the system enclosure will be managed by the PCI SAS adapter and four will be managed by the integrated SAS controller in the system unit. Now, you can manage the disks as you would manage any other disks. Related information SAS RAID controllers for AIX on the 8233-E8B and 8236-E8C Notes: v The second connector on the adapter can be used to attach a disk expansion or media expansion drawer. Figure 133. SAS adapter to internal SAS disk slots through the system external SAS port 150 Disk drives Internal disk drive sharing in model 9117-MMB or 9179-MHB You might need to cable and understand the function of this feature. Use the following information to complete these tasks. Review the disk drive modes and supported disk drive locations, as shown in Table 1. The following figure shows the disk drive locations for the IBM model 9117-MMB or 9179-MHB. Table 4. Disk drive modes and supported disk drive locations Disk drive mode SAS Port Card Adapter SAS Cable Disk drive location RAID internal drives None Integrated Dual RAID controller None D1, D2, D3, D4, D5, and D6 RAID internal and external drives SAS External Port feature (FC1819) - SAS Cable Assembly for SAS Port Integrated Dual RAID controller v YI Cable (FC3686) - System to SAS Enclosure, Single Controller/Dual Path 1.5 meter v YI Cable (FC3687) - System to SAS Enclosure, Single Controller/Dual Path 3 meter D1, D2, D3, D4, D5, and D6 Dual Split Backplane None v Integrated SAS controller (SAS A) v Integrated SAS controller (SAS B) None v D1, D2, and D3 v D4, D5, and D6 Triple Split Backplane SAS Internal Port feature (FC1815) - SAS Cable for triple split disk drive backplane v Integrated SAS controller (SAS A) v Integrated SAS controller (SAS B) v (FC5901) - PCIe Dual-x4 SAS Adapter AI Cable (FC3679) - Adapter to Internal drive 1 meter v D1 and D4 v D2 and D5 v D3 and D6 Figure 134. Disk drive locations © Copyright IBM Corp. 2010 151 The following information is for use after the FC5901 SAS Storage Adapter is installed. Install the adapter and then return here. See the PCI adapters topic for more information, PCI adapters. You should also review the tasks in “Before you begin” on page 189 if you have not already done so. This feature allows you to split the internal disks in the system unit enclosure into groups which you can manage separately. 1. Stop and power off the system if it is not already, see “Stopping a system or logical partition” on page 201. 2. Cable a single system unit enclosure by doing the following: a. Attach the cable to the SAS port on the rear bulkhead of the system unit enclosure to the top port in the SAS Storage Adapter, as shown in the following figure. Restriction: Internal disk drive sharing is only available when internal cable feature FC1815 is installed from the DASD backplane to the read bulkhead of the system unit enclosure. Also, feature FC5662 175 MB cache RAID - dual IOA enablement card must not be installed. The SAS Storage Adapter can be in any of the other slots that support it. b. Secure any extra cable. 3. Start the system, see “Starting the system or logical partition” on page 199. 4. Verify that the feature is installed and working, see “Verifying the installed part” on page 249. With this function installed, 2 of the 6 of the disks in the system enclosure can be managed by the SAS Storage Adapter. Note: The Removable Media device is always controlled by a separate embedded SAS controller on the system planar. Now, you can manage the disks as you would manage any other disks. 152 Disk drives Installing the external SAS port on model 8202-E4B or 8205-E6B Use this procedure to install an external SAS port as a part of upgrading your disk-drive backplane, or to enable internal disk sharing. Perform the prerequisite tasks as described in “Before you begin” on page 189. Perform the following steps to install an external SAS port into the server: 1. Stop the system if it is running. For instructions on stopping the system, see “Stopping a system or logical partition” on page 201. 2. Remove all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette. 3. If necessary, remove the existing SAS cable or filler, see Figure 135. a. Press the side tab and pull out the SAS shroud (A) from the SAS connection on the system backplane. b. Open the cable clamp (B) on the side wall. The cable clamp opens from the top. c. Pull the SAS cable (D) out of the system. 4. Install the new SAS cable, see Figure 136 on page 154. a. Connect the external port to the rear of the system until it snaps in place. b. Open the cable clamp (C) on the side wall, placing the cable (A) through the clamp. The cable clamp opens from the top. Figure 135. Removing the SAS cable © Copyright IBM Corp. 2010 153 c. Close the cable clamp (C) and continue to slide the cable towards the SAS connection on the system backplane. d. Push the SAS shroud (D) into the SAS connection on the system backplane until it snaps in place. 5. Replace all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette. 6. Start the system. See http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/ crustartsys.htm You can now attach other supported SAS features to the system. Figure 136. Replacing the SAS cable 154 Disk drives Installing the external SAS port on model 8231-E2B Use this procedure to install an external SAS port as a part of upgrading your disk-drive backplane, or to enable internal disk sharing. Perform the prerequisite tasks as described in “Before you begin” on page 189. Perform the following steps to install the external SAS port into the server: 1. Stop the system if it is running. For instructions on stopping the system, see “Stopping a system or logical partition” on page 201. 2. Remove all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette. 3. If necessary, remove the existing SAS cable or filler, see Figure 137. a. Press the side tab and pull the SAS shroud out of the SAS connection on the system backplane. b. Open the cable clamp on the side wall and pull out the cable (B). The cable clamp opens from the top. c. Push the external SAS port down (A) and then pull the SAS cable out of the system. 4. Install the new SAS cable. See Figure 138 on page 156. a. Connect the external port to the back of the system (B) until it snaps in place. b. If necessary, open the cable clamp on the side wall, placing the cable through the clamp. The cable clamp opens from the top. c. Close the cable clamp and continue to slide the cable (A) back to the SAS connection on the system backplane. d. Push the SAS shroud into the SAS connection (B) on the system backplane until it snaps in place. Figure 137. Removing the SAS cable © Copyright IBM Corp. 2010 155 5. Replace all PCI adapter cassettes. See PCI adapter single-width, fourth-generation cassette. 6. Start the system. See http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/topic/p7haj/ crustartsys.htm You can now attach other supported SAS features to the system. Figure 138. Replacing the SAS cable 156 Disk drives Installing the external SAS port on an 8233-E8B or 8236-E8C You might need to install an external SAS port as a part of upgrading your disk-drive backplane, or to enable internal disk sharing. Use the procedure in this section to perform this task. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps to install the SAS feature 3699 into the server. For instructions, see “Removing a part by using the Hardware Management Console” on page 247. If you do not have an HMC, complete the following steps to install SAS feature 3699 into the server: 1. Perform the prerequisite tasks described in “Before you begin” on page 189. 2. Stop the system. For instructions, see “Stopping a system or logical partition” on page 201. 3. If the system is rack-mounted, place the system into the service position. For instructions, see “Placing a 8233-E8B or 8236-E8C in the service position” on page 236. 4. Remove the service access cover. For instructions, see “Removing the service access cover from an 8233-E8B or 8236-E8C system” on page 219. 5. Remove the system processor assembly or filler in position P1-C16 by lifting the handle (A) and lifting the assembly out of the system, as shown in the following figure. Note: You only need to remove the one assembly or filler in P1-C16. To remove a filler, grasp the filler in the center and lift the filler out of the system. 6. Remove the fans and fan cage from the system by doing the following steps: a. To remove the fans, squeeze the tab (A) and lift the fan out of the fan cage as shown in the following figure. © Copyright IBM Corp. 2010 157 b. To remove the air baffles, grasp and pull the air baffle out of the fan cage, as shown in the following figure. Figure 139. Removing the fan from the 8233-E8B or 8236-E8C fan cage Figure 140. Removing the air baffles from the 8233-E8B or 8236-E8C fan cage 158 Disk drives c. To remove the fan cage, loosen the two thumbscrews (A) and lift the fan cage (B) out of the system as shown in the following figure. 7. Slide the front of the cable, with the male connector, though the opening between the processor cage and the side of the system. Tip: Refer to Figure 142 on page 160 to complete the next few steps. 8. Insert the pins (A) on the side of the system into the slots (B) on the SAS cable assembly. 9. Slide the cable assembly back until the pins (A) are seated and the latch (C) snaps in place. Tip: The port on the back of the cable assembly should be seated in the slot at the back of the system. 10. Attach the cable bracket (H) by placing the notch on the bracket over the post on the side of the system, aligning the screw hole (I) with the hole on the side of the system, and securing the cable bracket with the screw. Note: The bracket should fit snugly over the SAS cable assembly. Figure 141. Removing the fan cage from the 8233-E8B or 8236-E8C system Installing the external SAS port on an 8233-E8B or 8236-E8C 159 11. Depending on the disk-drive backplane you have, complete one of the following choices: Tip: Refer to Figure 143 on page 161 to complete this step. v If you can see the SAS connection in the center of the disk-drive backplane (D), attach the cable to the connection and continue to step 12 on page 161. v If you can see the SAS cable shroud (E) extending from the disk-drive backplane, continue to step 11a. Figure 142. Attaching the SAS cable 160 Disk drives a. Locate the SAS cable shroud (E) on the disk-drive backplane and pull the back of the shroud away from the side of the system to unseat the shroud. b. Pull the shroud out of the SAS port. c. Attach the shroud to the male SAS connector (F) on the cable assembly by sliding the shroud over the top of the cable connector (F) from behind. The SAS connector on the cable should fit into the two slots at the front of the shroud, and the connector should extend away from the shroud so it can be seated in the port. d. Slide the cable and shroud (G) into the SAS port until it is firmly seated in place. 12. Replace the fans and the fan cage, by doing the following steps: a. To install the fan cage (A) into the system, align the locator pins through the holes, and tighten the two thumbscrews (B), as shown in the following figure. Figure 143. SAS cable connections Installing the external SAS port on an 8233-E8B or 8236-E8C 161 b. To install the air baffles, insert the air baffles into their location in the fan cage, and push down until they lock into place, as shown in the following figure. Figure 144. Installing the fan cage in the 8233-E8B or 8236-E8C system Figure 145. Installing the air baffles in the 8233-E8B or 8236-E8C fan cage 162 Disk drives c. To install the fans, insert the fans into their location in the fan cage, and push down on the fan until it locks into place, as shown in the following figure. 13. Install the system processor (B) and secure both latches (C), as shown in the following figure. 14. Replace the service access cover. For instructions, see “Installing the service access cover on an 8233-E8B or 8236-E8C system” on page 220. Figure 146. Installing the fans in the 8233-E8B or 8236-E8C fan cage Installing the external SAS port on an 8233-E8B or 8236-E8C 163 15. Place the system back into the operating position. For instructions, see “Placing the 8233-E8B or 8236-E8C in the operating position” on page 239. You may now attach supported SAS features to your system. Return to the instructions that sent you here or start your system. For instructions, see “Starting the system or logical partition” on page 199. 164 Disk drives Installing the external SAS port on a 9117-MMB or 9179-MHB Use this procedure to install an external SAS port as a part of upgrading your disk-drive backplane, or to enable internal disk sharing. Complete the following steps to install the external SAS port into the server: 1. Perform the prerequisite tasks described in “Before you begin” on page 189. 2. Stop the system. See “Stopping a system or logical partition” on page 201. 3. Remove all PCI adapter cassettes. See PCI adapter single-width, fourth generation cassette. 4. If necessary, remove the existing SAS cable or filler. a. Press the side tab and pull the SAS shroud (A) out of the SAS connection on the system backplane. b. Open the cable clamp on the side wall and pull the cable out. The cable clamp opens from the top. c. Push the external SAS port down (B) and then pull the SAS cable out of the system. 5. Install the new SAS cable. a. Connect the external port to the back of the system (B) until it snaps in place. b. If necessary, open the cable clamp on the side wall, placing the cable through the clamp. The cable clamp opens from the top. c. Close the cable clamp and continue to slide the cable back to the SAS connection on the system backplane. d. Push the SAS shroud (A) into the SAS connection on the system backplane until it snaps in place. Figure 147. Removing the SAS cable © Copyright IBM Corp. 2010 165 6. Replace all PCI adapter cassettes. See PCI adapter single-width, fourth generation cassette. 7. Start the system. See “Starting the system or logical partition” on page 199 You may now attach other supported SAS features to your system. Figure 148. Replacing the SAS cable 166 Disk drives Replacing a SAS conduit card in a 5802 expansion unit with power off Learn how to replace a SAS conduit card in a 5802 expansion unit with power off. Review the requirements in “Before you begin” on page 189. If your system is managed by the Hardware Management Console (HMC), use the HMC to replace the card. For instructions see, “Removing a part by using the Hardware Management Console” on page 247 1. Stop the system if it is running. For instructions see, “Stopping a system or logical partition” on page 201. 2. Disconnect the power source from the system by unplugging the system. Attention: This system might be equipped with a second power supply. You must disconnect the power source from the system by disconnecting all power cords to prevent system damage during this procedure. (L003) or 3. Place the card connector (C) onto the on the midplane connection (D) using the guide track (B) to ensure a good connection. Then lift the handle (A) to seat and secure the SAS conduit card to the midplane as shown in the following figure. © Copyright IBM Corp. 2010 167 4. Place the midplane and SAS conduit card back into the system by doing the following steps: a. Align the midplane with the track and ensure the slot (A) on the midplane attaches to the post (B) in the system. b. Secure the thumbscrews (C) and (D) on the SAS conduit card. c. Secure the thumbscrews (E) on the sides of the midplane. Figure 149. Installing the SAS conduit card into the midplane 168 Disk drives 5. Attach the power cord cover and secure the thumbscrew (A) to the system. Figure 150. Installing the SAS conduit card and the midplane Figure 151. Installing the power cord plate Replacing a SAS conduit card in a 5802 expansion unit with power off 169 6. Carefully slide the enclosure management controller into the system and close the latch (A) as shown in the following figure. 7. Carefully lift and align the I/O planar with the slot at the back of the expansion unit. 8. Insert the I/O planar firmly into the server, as shown in the following figure. 9. Secure the I/O planar with the locking tabs (A) as shown in the following figure. Figure 152. Replacing the enclosure management controller 170 Disk drives 10. To replace the power supply, with the locking handle (A) in the open position, push the power supply into the expansion unit as shown in the following figure. 11. Close the locking handle (A) until the power supply locks into position. Figure 153. Install the I/O planar on a rack mounted model Replacing a SAS conduit card in a 5802 expansion unit with power off 171 12. To replace the disk drive backplane, lift the backplane along the two sides and align with the guide rails in the expansion unit. CAUTION: The backplane may be heavy. Ensure that you can safely perform this task before you begin. 13. Using your hand to support the bottom of the backplane, slide it into the expansion unit as shown in the following figure. 14. Secure the backplane in place by moving the locking tabs from the open position (B) to the locked position (A). Figure 154. Replacing a power supply 172 Disk drives 15. Replace all disk drive units and fillers, if you removed them earlier. a. Support the bottom of the disk drive unit as you align it with the guide rails in the backplane. b. Slide the disk drive unit all the way into the backplane. c. Push the handle (A) toward the disk drive unit to lock it into place. Figure 155. Installing the backplane on a rack-mounted model Replacing a SAS conduit card in a 5802 expansion unit with power off 173 16. Replace all expander cards and fillers, if you removed them earlier. a. Supporting the bottom of the expander card with your hand, align it with the guide rails in the backplane. b. Slide the expander card all the way into the backplane. c. Push the handle (A) toward the card to lock it into place. 17. Start the system. For instructions see, “Starting the system or logical partition” on page 199. Continue with any other service actions you need to perform. v For information on connecting the system see Connecting your expansion units v For information an installing the PCI adapters into the planar, see Model 5802 and 5877 expansion units, PCI adapters, and cassettes. 174 Disk drives Removing a SAS conduit card from a 5802 expansion unit with power off Learn how to remove a SAS conduit card from a 5802 (rack-mounted) expansion unit if you need to replace a failing part with power off. If your system is managed by the Hardware Management Console (HMC) use the HMC to remove the SAS conduit card. For instructions see, “Removing a part by using the Hardware Management Console” on page 247. 1. Perform the prerequisite tasks as described in “Before you begin” on page 189. 2. Identify the card you are going to remove. For instructions see, “Identifying a failing part” on page 193. 3. Stop the system. For instructions see, “Stopping a system or logical partition” on page 201. 4. Remove the disk drives and fillers as shown in the following figure. a. Squeeze the latch of the disk drive and pull the handle (A) toward you to release the drives. b. Supporting the bottom of the disk drive with your hand, slide it out of the backplane. 5. Remove the port cards and fillers as shown in the following figure. a. Grasp the end of the latching handle (A) and pull it upward to unlock the port card. b. Using your hand to support the bottom of the port card, slide it out of the backplane. c. Place the port card on an electrostatic discharge (ESD) surface. Figure 156. Removing the disk drives from the backplane © Copyright IBM Corp. 2010 175 6. Remove the backplane as shown in the following figure. CAUTION: The backplane might be heavy. Ensure that you can safely complete the procedure. a. Squeeze the blue latches (A) to unlock the handles. b. Pull the handles (B) outward in the direction shown to release the backplane. c. Slide the backplane out of the enclosure, using your hand to support the bottom of the backplane. Figure 157. Removing the port card from the backplane 176 Disk drives 7. Disconnect both offline converter assembly (OCA) power cords by doing the following: a. On the front of both OCAs, slide the power cord holders (A) to the left. b. Label and disconnect both OCA power cords (B) from each OCA. Figure 158. Removing the backplane from the enclosure Removing a SAS conduit card from a 5802 expansion unit with power off 177 8. Remove the offline converter assemblies by doing the following steps: a. Pull the OCA handle (A) all the way down in the direction shown to unseat the OCA. b. Grasp both sides of the OCA and pull it out of the enclosure. Note: Use 2 hands to support the OCA as it is pulled out of the enclosure. c. Repeat these steps to remove both OCAs. Figure 159. Disconnecting the OCA power cords 178 Disk drives 9. Remove the enclosure management controller by doing the following steps: a. Label and disconnect any cables connected to the enclosure management controller card. b. Pinch the latch (A) to unlock the handle (B). c. Pull the handle outward in the direction shown to unseat the EMC card. d. Slide the enclosure management controller card straight out of the enclosure. Figure 160. Remove the offline converter assemblies Removing a SAS conduit card from a 5802 expansion unit with power off 179 10. Remove the power cord plate by turning the thumbscrew (A) and pulling the plate away from the system. 11. Disconnect the cables from the back of the I/O planar. Figure 161. Remove the enclosure management controller Figure 162. Removing the power cord plate 180 Disk drives 12. Squeeze the release latches (A) and pull the levers (B) out away from the system, as shown in the following figure. 13. Slide the planar out of the system. 14. Remove the SAS conduit card and midplane from the enclosure by doing the following steps: a. Pull the spring plungers (A) up and turn them one quarter to the left. b. Turn the captive screws (B) and (C) counterclockwise until they are completely loosened. c. Pull the SAS conduit and midplane component out of the enclosure. Figure 163. Removing the I/O planar Removing a SAS conduit card from a 5802 expansion unit with power off 181 15. Push the handles (A) on the conduit card down to disengage the card from the midplane, and remove the card. Continue with replacing the SAS expander card or return to the procedure that sent you here. Figure 164. Removing the SAS conduit card and midplane Figure 165. Removing the SAS conduit card from the midplane 182 Disk drives Replacing a SAS expander card in a 5802 expansion unit with power off Learn how to replace a SAS expander card in a 5802 expansion unit with power off. Review the requirements in “Before you begin” on page 189. If your system is managed by the Hardware Management Console (HMC) use the HMC to replace the card. For instructions see, “Removing a part by using the Hardware Management Console” on page 247 1. Before installing or reinstalling a SAS expander card, inspect the signal connector pins to ensure that the pins are not bent or damaged. Attention: Bent pins damage the backplane receptacle. Be careful not to touch or damage the signal connector pins. 2. Slide the card completely into the slot and push the lever (A) down to secure the card as shown in the following figure. 3. Start the system. For instructions see, “Starting the system or logical partition” on page 199. 4. Verify the presence of the newly installed card. For instructions see, “Verifying the installed part” on page 249. Continue with any other service actions you need to perform. Figure 166. Installing the SAS expander card © Copyright IBM Corp. 2010 183 184 Disk drives Removing a SAS expander card from a 5802 expansion unit with power off Learn how to remove a SAS expander card from a 5802 (rack-mounted) expansion unit if you need to replace a failing part with power off. If your system is managed by the Hardware Management Console (HMC) use the HMC to remove the SAS expander card. For instructions see, “Removing a part by using the Hardware Management Console” on page 247. 1. Perform the prerequisite tasks as described in “Before you begin” on page 189. 2. Identify the card you are going to remove. For instructions see, “Identifying a failing part” on page 193. 3. Lift the handle (A) and pull the card out of the system, as shown in the following figure. Continue with replacing the SAS expander card or return to the procedure that sent you here. Figure 167. Removing the SAS expander card. © Copyright IBM Corp. 2010 185 186 Disk drives Common procedures for installable features This section contains all the common procedures that are related to installing, removing, and replacing features. © Copyright IBM Corp. 2010 187 188 Disk drives Before you begin Observe these precautions when you are installing, removing, or replacing features and parts. These precautions are intended to create a safe environment to service your system and do not provide steps for servicing your system. The installation, removal and replacement procedures provide the step-by-step processes required to service your system. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER © Copyright IBM Corp. 2010 189 Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Before you begin a replacement or installation procedure, perform these tasks: 1. If you are installing a new feature, ensure that you have the software required to support the new feature. To do this, go to the following Web site: IBM Prerequisite 2. If you are performing an installation or replacement procedure that might put your data at risk, ensure, wherever possible, that you have a current backup of your system or logical partition (including operating systems, licensed programs, and data). 3. Review the installation or replacement procedure for the feature or part. 4. Note the significance of color on your system. 190 Disk drives Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also indicate that the part can be removed and replaced with the system or logical partition power on. 5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of scissors. 6. If parts are incorrect, missing, or visibly damaged, do the following: v If you are replacing a part, contact the provider of your parts or next level of support. v If you are installing a feature, contact one of the following service organizations: – The provider of your parts or next level of support. – In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL) at 1–800–300–8751. In countries and regions outside of the United States, use the following Web site to locate your service and support telephone numbers: http://www.ibm.com/planetwide 7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller, or your next level of support. 8. If you are installing new hardware in a logical partition, you need to understand and plan for the implications of partitioning your system. For information, see Logical Partitioning. Before you begin 191 192 Disk drives Identifying a failing part Use these instructions to learn how to locate and identify a failing part on your system or expansion unit using the appropriate method for your system. For IBM Power Systems servers that contain the POWER7 processor, the light-emitting diodes (LEDs) can be used to identify or verify a part that you are servicing. The fault (amber) LED indicates an error and corresponds to the location code in the system reference code (SRC). The LED is activated and deactivated automatically. If you need to use the identify function, use the following procedures. Control panel LEDs Use this information as a guide to the control panel LEDs and buttons. The control panel has LEDs that indicate various system status. v A: Power-on button v B: Power LED – A constant light indicates full system power to the unit. – A blinking light indicates standby power to the unit. Note: There is approximately a 30-second transition period from the time the power-on button is pressed to when the power LED goes from blinking to solid. During the transition period, the LED might blink faster. v C: Enclosure identify light Figure 168. Control panel © Copyright IBM Corp. 2010 193 – A constant light indicates the identify state, which is used to identify a part. – No light indicates that the system is operating normally. v D: System information light – No light indicates that the system is operating normally. – Light on indicates that the system requires attention. v E: USB port v F: Enclosure fault roll-up light – A constant light indicates a fault in the system unit. – No light indicates that the system is operating normally. v G: Function/Data display v H: Decrement button v I: Enter button v J: Increment button v K: Pinhole reset button Related concepts Identifying a failing part Use these instructions to learn how to locate and identify a failing part on your system or expansion unit using the appropriate method for your system. Identifying a failing part in an AIX system or logical partition Use these instructions to learn how to locate a failing part, and then activate the indicator light for that part on a system or logical partition running the AIX operating system. Locating a failing part in an AIX system or logical partition You might need to use AIX tools, before activating the indicator light, to locate a part that is failing. 1. Log in as root user or celogin-. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. Select Display Previous Diagnostic Results and press Enter. 5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. The Display Diagnostic Log display shows a chronological list of events. 6. Look in the T column for the most recent S entry. Select this row in the table and press Enter. 7. Select Commit. The details of this log entry are shown. 8. Record the location information and the SRN value shown near the end of the entry. 9. Exit to the command line. Use the location information for the failing part to activate the indicator light that identifies the failing part. “Activating the indicator light for the failing part.” Activating the indicator light for the failing part Use these instructions to help physically identify the location of a part you are servicing. 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators and press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. 194 Disk drives 6. Select Commit. This turns on the system attention and indicator light for the failing part. 7. Exit to the command line. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators and press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. When a light is activated for a failing part, an I character precedes the location code. 6. Select Commit. This turns off the system attention and indicator light for the failing part. 7. Exit to the command line. Identifying a failing part in an IBM i system or logical partition You can activate or deactivate the indicator light by using IBM i to assist in locating a failing part. Activating the failing-part indicator light You can search the service action log for an entry that matches the time, reference code, or resource of a problem, and then activate the indicator light for a failing part. 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point™ utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. If location information is available, select option 6 (Indicator on) to turn on the failing part's indicator light. Identifying a failing part 195 Tip: If the failing part does not contain a physical indicator light, a higher-level indicator light is activated. For example, the indicator light for the backplane or unit that contains the failing part might be lit. In this case, use the location information to locate the actual failing part. 12. Look for the enclosure indicator light to locate the enclosure that contains the failing part. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. Select option 7 (Indicator off) to turn off the indicator light. 12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all problems have been resolved. 13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display. Identifying a failing part in a Linux system or logical partition If the service aids have been installed on a system or logical partition, you can activate or deactivate the indicator lights to locate a part or compete a service action. Locating a failing part in a Linux system or logical partition If the service aids have been installed on a system or logical partition, you need to activate the indicator lights to locate a part. To activate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter. 196 Disk drives 3. Look for the system attention light to identify the enclosure that contains the failing part. Related information Service and productivity tools for Linux on POWER systems IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. Finding the location code of a failing part in a Linux system or logical partition To retrieve the location code of the failing part, if you do not know the location code, use the procedure in this topic. To locate the failing part in a system or logical partition follow these steps: 1. Log in as root user. 2. At the command line, type grep diagela /var/log/platform and press Enter. 3. Look for the most recent entry that contains a system reference code (SRC). 4. Record the location information. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on POWER® systems Activating the indicator light for the failing part If you know the location code of the failing part, activate the indicator light to help you locate which part to replace. To activate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter. 3. Look for the system attention light to identify the enclosure that contains the failing part. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on POWER systems Deactivating the failing-part indicator light After you complete a removal and replacement procedure, you must deactivate the failing-part indicator light. To deactivate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s normal -l and press Enter. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on POWER systems Identifying a failing part 197 Locating a failing part in a Virtual I/O Server system or logical partition You can use Virtual I/O Server (VIOS) tools, before activating the indicator light, to locate a part that is failing. 1. Log in as root user or celogin-. 2. At the command line, type diagmenu and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. Select Display Previous Diagnostic Results and press Enter. 5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. A Display Diagnostic Log display appears. This display contains a chronological list of events. 6. Look in the T column for the most recent S entry. Select this row in the table and press Enter. 7. Choose Commit. The details of this log entry are shown. 8. Record the location information and the SRN value shown near the end of the entry. 9. Exit to the command line. Use the location information for the failing part to activate the indicator light that identifies the failing part. For instructions, see “Identifying a part by using the Virtual I/O Server.” Identifying a part by using the Virtual I/O Server Use these instructions to turn on the indicator light to help you physically locate a part by using the Virtual I/O Server (VIOS). 1. Log in as root user. 2. At the command line, type diagmenu and press Enter. 3. From the Function Selection menu, select Task Selection. Press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators. Press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. 6. Select Commit. This turns on the system attention and indicator light for the failing part. 7. Exit to the command line. 198 Disk drives Starting the system or logical partition Learn how to start a system or logical partition after performing a service action or system upgrade. Starting a system that is not managed by a Hardware Management Console You can use the power button or the Advanced System Management Interface to start a system that is not managed by a Hardware Management Console. To start a system that is not managed by a Hardware Management Console (HMC), follow these steps: 1. Open the front rack door, if necessary. 2. Before you press the power button on the control panel, ensure that power is connected to the system unit as follows: v All system power cables are connected to a power source. v The power-on light, as shown in the following figure, is slowly blinking. v The top of the display, as shown in the following figure, shows 01 V=F. 3. Press the power button (A), as shown in the following figure, on the control panel. v A: Power-on button v B: Power LED – A constant light indicates full system power to the unit. – A blinking light indicates standby power to the unit. Figure 169. Control panel © Copyright IBM Corp. 2010 199 Note: There is approximately a 30-second transition period from the time the power-on button is pressed to when the power LED goes from blinking to solid. During the transition period, the LED might blink faster. v C: Enclosure identify light – A constant light indicates the identify state, which is used to identify a part. – No light indicates that the system is operating normally. v D: Attention light – No light indicates that the system is operating normally. – A blinking light indicates that the system requires attention. v E: USB port v F: Enclosure fault roll-up light – A constant light indicates a fault in the system. – No light indicates that the system is operating normally. v G: Function/Data display v H: Decrement button v I: Enter button v J: Increment button v K: Pinhole reset button 4. Observe the following after pressing the power button: v The power-on light begins to blink faster. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed. v Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. The power-on light on the control panel stops blinking and remains on, indicating that system power is on. Tip: If pressing the power button does not start the system, do the following steps to start the system using the Advanced System Management Interface (ASMI): 1. Access the ASMI. For instructions, see Accessing the ASMI. 2. Start the system using the ASMI. For instructions, see Powering the system on and off. Starting a system or logical partition by using the Hardware Management Console You can use the Hardware Management Console (HMC) user interface to start the system or logical partition after the required cables are installed and the power cables are connected to a power source. For instructions on working with the HMC, see Managing the Hardware Management Console. For instructions on starting a logical partition, see Logical partitioning. For instructions on starting the system, see Powering on the managed system. Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. When the power-on light on the control panel stops blinking and remains on, the system power is on. 200 Disk drives Stopping a system or logical partition Learn how to stop a system or logical partition as a part of a system upgrade or service action. Attention: Using either the power-on button on the control panel or entering commands at the Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data files. Also, the next time you start the system, it might take longer if all applications are not ended before stopping the system. To stop the system or logical partition, select the appropriate procedure. Related tasks Shutting down logical partitions using the Integrated Virtualization Manager Use this procedure to shut down a logical partition or the entire managed system that is managed using the Integrated Virtualization Manager. Stopping a system that is not managed by a Hardware Management Console You might need to stop the system to perform another task. Use these instructions to stop the system using the power button or Advanced System Management Interface. Before you stop the system, follow these steps: 1. If an Integrated xSeries® Adapter (IXA) is present on the system, shut it down using IBM i options. 2. Ensure that all jobs are completed and end all applications. 3. Ensure that the operating system is stopped. Attention: Failure to do so can result in the loss of data. 4. If a Virtual I/O Server (VIOS) logical partition is running, ensure that all clients are shut down or that the clients have access to their devices using an alternate method. The following procedure describes how to stop a system that is not managed by a Hardware Management Console (HMC). 1. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down System) command. 2. At the command line, enter one of the following commands: v If your system is running the AIX operating system, type shutdown. v If your system is running the Linux operating system, type shutdown -h now. v If your system is running the IBM i operating system, type PWRDWNSYS. If your system is partitioned, use the PWRDWNSYS command to power down each of the secondary partitions. Then, use the PWRDWNSYS command to power down the primary partition. The command stops the operating system. The system power turns off, the power-on light begins to slowly flash, and the system goes into a standby state. 3. Record the IPL type and the IPL mode from the control panel display to help you return the system to this state when the installation or replacement procedure is completed. 4. Set the power switches of any devices connected to the system to off. 5. Unplug any power cables that are attached to the unit from electrical outlets. Ensure that you unplug power cables from peripheral devices, such as printers and expansion units. Important: The system is equipped with a second power supply. Before continuing with this procedure, ensure that all power sources to the system have been disconnected. © Copyright IBM Corp. 2010 201 (L003) or Stopping a system by using the Hardware Management Console You can use the Hardware Management Console (HMC) user interface to stop the system or a logical partition. Use the following steps to accomplish this task. By default, the managed system is set to power off automatically when you shut down the last running logical partition on the managed system. If you set the managed system properties on the HMC so that the managed system does not power off automatically, you must use this procedure to power off your managed system. Attention: If possible, shut down the running logical partitions on the managed system before powering off the managed system. Powering off the managed system without shutting down the logical partitions first causes the logical partitions to shut down abnormally and can cause data loss. If you use a Virtual I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to their devices using an alternate method. To power off a managed system, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer 1. In the Navigation area, expand the Systems Management folder. 2. Click the Servers icon. 3. In the Contents area, select the managed system. 202 Disk drives 4. Select Tasks, then Operations, and then Power Off 5. Select the appropriate power-off mode and click OK. Related information Shutting down and restarting logical partitions Stop the system 203 204 Disk drives Removing and replacing the expansion unit cover or door You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Removing the front cover on the 7314-G30 or 5796 You might need to remove the front cover to perform service to the system. To remove the front cover, follow these steps: 1. Remove the two thumbscrews (B) located on the left and right of the cover. 2. Press in on the two latches (C) located left and right of the cover to release the cover. 3. Lift out the cover (A) and away from the chassis. Tip: If an airflow block is present on an unpopulated side of the chassis do not remove it unless you are populating that side of the chassis. Installing the front cover on the 7314-G30 or 5796 Use this procedure to install the front cover after installing the system or performing a service action. If you only have one side of the chassis populated, ensure that the airflow block is present on the unpopulated side. 1. Insert the posts on the bottom of the cover (A) into the hooks on the chassis of the enclosure. Figure 170. Remove the front cover for model 7314-G30 or 5796 © Copyright IBM Corp. 2010 205 2. Press in on the two latches (C) on the right and left of the front cover. 3. Press the cover onto the chassis and release the latches. The cover securely snaps into place. 4. Replace the two thumbscrews (B) into the slots on the left and right of the front cover. Figure 171. Front cover on a model 7314-G30 or 5796 206 Disk drives System covers Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Removing and replacing covers for the 8202-E4B or 8205-E6B Use these instructions to remove, replace, or install covers to access components or perform service. Removing the front cover on a rack-mounted 8202-E4B or 8205-E6B system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Remove the two screws that secure the system to the rack. 2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Removing the front cover on a stand-alone 8202-E4B or 8205-E6B system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Open the front door (A) by swinging it towards the left as shown in the following figure. Figure 172. Removing the front cover © Copyright IBM Corp. 2010 207 2. Press latch (B) and pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Installing the front cover on a rack-mounted 8202-E4B or 8205-E6B system Use this procedure to install the cover to access components or perform service. To install the front cover, follow these steps: 1. Push the cover onto the system. 2. Gently push the cover in until the four cover clips are seated in their respective mounting posts as shown in the following figure. The cover snaps into place and has an indentation where you can hold onto it more easily. Figure 173. Removing the front cover 208 Disk drives Installing the front cover and front door on a stand-alone 8202-E4B or 8205-E6B system Use this procedure to install the cover to access components or perform service. To install the front cover and front door, complete the following steps. 1. Align the cover (A) until the two cover tabs are seated into the slots on the base plate (B), as shown in the following figure. 2. Rotate the cover up and towards the system until the release latch is seated into its respective slot. The cover has an indentation where you can hold onto it more easily. Figure 174. Removing the front cover System covers 209 3. Hold the front door at an approximate 120 degree angle to the system as show in the following figure. Engage the latches. Removing the service access cover on a rack-mounted 8202-E4B or 8205-E6B system Use this procedure to remove the service access cover. 1. Loosen the two thumbscrews (A) located at the back of the cover. Figure 175. Installing the front cover Figure 176. Installing the front door 210 Disk drives 2. Slide the cover toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. Removing the service access cover on a stand-alone 8202-E4B or 8205-E6B system Use this procedure to remove the service access cover. 1. Loosen the thumbscrew (A) located at the back of the outside cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Figure 177. Removing the service access cover System covers 211 3. Loosen the two thumbscrews (A) located at the back of the inside cover. 4. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Figure 178. Installing the outside service access cover Figure 179. Removing the inside service access cover 212 Disk drives Installing the service access cover on a rack-mounted 8202-E4B or 8205-E6B system Use this procedure to install the service access cover. 1. Place the service access cover on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (A) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Installing the service access cover on a stand-alone 8202-E4B or 8205-E6B system Use this procedure to install the service access cover. 1. Place the inside service access cover (A) on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Figure 180. Installing the service access cover System covers 213 5. Place the outside service access cover (A) on top of the system. 6. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 7. Align the thumbscrew (B) located on the back of the service access cover with the hole on the back of the system chassis. 8. Tighten the thumbscrew to secure the service access cover. Figure 181. Installing the inside service access cover 214 Disk drives Removing and replacing covers for the 8231-E2B Use these instructions to remove, replace, or install covers to access components or perform service. Removing the service access cover for the 8231-E2B Use this procedure to remove the service access cover. 1. Loosen the thumbscrew (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Figure 182. Installing the outside service access cover System covers 215 Attention: For proper cooling and air flow, re-install the cover before turning on the system. Operating the system without the cover for more than 30 minutes could damage the system components. Installing the service access cover for the 8231-E2B Use this procedure to install the service access cover. 1. Place the service access cover (A) on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the thumbscrew (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Figure 183. Removing the service access cover 216 Disk drives Removing and replacing covers on the 8233-E8B or 8236-E8C system Use these instructions to remove, replace, or install covers to access components or perform service. Removing the front cover from an 8233-E8B or 8236-E8C system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Remove the two screws (A) that secure the system to the rack (B) as shown in the following figure. 2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Figure 184. Installing the service access cover System covers 217 Installing the front cover on an 8233-E8B or 8236-E8C system Use this procedure to install the cover after accessing components or performing service. To install the front cover, follow these steps: 1. Align the cover with the system. 2. Gently push the cover in until the four cover clips are seated in their respective mounting posts (B) on the system. 3. Replace the two screws (C) that secure the system to the rack (A). Figure 185. Removing the front cover 218 Disk drives Removing the service access cover from an 8233-E8B or 8236-E8C system Use this procedure to remove the service access cover to perform service or to gain access to internal components. To remove the service access cover, follow these steps: 1. Place the system into the service position. For instructions, see “Placing a 8233-E8B or 8236-E8C in the service position” on page 236. 2. Loosen the two thumbscrews (A) located at the back of the cover. 3. Slide the cover (B) toward the back of the system unit. When the front of the service access cover clears the upper frame ledge, lift the cover up and off the system unit. Attention: For proper cooling and airflow, install the cover before starting the system. Operating the system without the cover for more than 30 minutes could damage the system components. Figure 186. Installing the front cover System covers 219 Installing the service access cover on an 8233-E8B or 8236-E8C system Use this procedure to install the service access cover after performing service or accessing internal components. To install the service access cover, follow these steps: 1. Place the service access cover (A) on the top of the system unit, approximately 25 mm (1 in.) from the front of the system unit. 2. Hold the service access cover against the system unit, and slide it toward the front of the system. The tabs on the service access cover slide beneath the upper chassis ledge, and the two screws align with the screw holes at the back of the system unit. Important: Ensure that the fan LED cables do not get caught on the front edge of the service access cover as you move it forward. 3. Tighten the screws (B) located at the back of the cover. Figure 187. Removing the service access cover 220 Disk drives Removing and replacing covers for the 9117-MMB or 9179-MHB Use these instructions to remove, replace, or install covers to access components or perform service. Removing the front cover from the 9117-MMB or 9179-MHB Use this procedure to remove the cover to access components or perform service. To remove the front cover follow these steps: 1. If necessary, open the front rack door. 2. Pull out the latches (A) located on both sides of the cover as shown in the following figure. Figure 188. Installing the service access cover System covers 221 3. Pull off the cover (B) to remove it from the system unit. Installing the front cover on the 9117-MMB or 9179-MHB Use this procedure to install the cover after accessing components or performing service. To install the front cover, follow these steps: 1. Position the cover (A) on the front of the system unit so that the four pins on the system match the four holes on the back of the cover. Figure 189. Removing the front cover 222 Disk drives 2. Press the tabs (B) to snap the cover into position. 3. Close the front rack door. Figure 190. Installing the front cover System covers 223 224 Disk drives Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing a rack-mounted 8231-E2B system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. While holding the system unit release latches down on both the left and right sides, pull the system unit out from the rack until the rails are fully extended and locked. Figure 191. Placing the system in the service position © Copyright IBM Corp. 2010 225 Placing the rack-mounted 8231-E2B system into the operating position Use this procedure to place the rack-mounted system into the operating position. When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches by sliding them toward the front of the system. 2. Push the system unit back into the rack until both system-unit release latches have locked into position. 3. Close the front rack door of the unit you are servicing. Figure 192. Placing the system into the operating position 226 Disk drives Placing a rack-mounted 8202-E4B or 8205-E6B system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. 4. While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Figure 193. Placing the system in the service position © Copyright IBM Corp. 2010 227 228 Disk drives Placing the rack-mounted 8202-E4B or 8205-E6B system into the operating position Use this procedure to place the rack-mounted system into the operating position. When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (A) by sliding them upward from the system. The rail safety latches are located near the middle of the system. 2. Push the system unit (B) back into the rack until both system-unit release latches have locked into position. 3. Replace and tighten the two screws (A) that secure the system unit (B) to the rack. 4. Close the front rack door of the unit you are servicing. Figure 194. Placing the system into the operating position © Copyright IBM Corp. 2010 229 Figure 195. Placing the system into the operating position 230 Disk drives Service and operating positions Use these procedures to place IBM Power Systems servers into the service or operating position. Placing the 8202-E4B or 8205-E6B system into the service or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing a rack-mounted 8202-E4B or 8205-E6B system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. © Copyright IBM Corp. 2010 231 4. While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Placing the rack-mounted 8202-E4B or 8205-E6B system into the operating position Use this procedure to place the rack-mounted system into the operating position. Figure 196. Placing the system in the service position 232 Disk drives When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches (A) by sliding them upward from the system. The rail safety latches are located near the middle of the system. 2. Push the system unit (B) back into the rack until both system-unit release latches have locked into position. 3. Replace and tighten the two screws (A) that secure the system unit (B) to the rack. 4. Close the front rack door of the unit you are servicing. Figure 197. Placing the system into the operating position Service and operating positions 233 Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing a rack-mounted 8231-E2B system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. While holding the system unit release latches down on both the left and right sides, pull the system unit out from the rack until the rails are fully extended and locked. Figure 198. Placing the system into the operating position 234 Disk drives Placing the rack-mounted 8231-E2B system into the operating position Use this procedure to place the rack-mounted system into the operating position. When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the rail safety latches by sliding them toward the front of the system. 2. Push the system unit back into the rack until both system-unit release latches have locked into position. Figure 199. Placing the system in the service position Service and operating positions 235 3. Close the front rack door of the unit you are servicing. Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing a 8233-E8B or 8236-E8C in the service position Use this procedure to perform service or gain access to internal components by placing the rack-mounted system or expansion unit in the service position. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Figure 200. Placing the system into the operating position 236 Disk drives DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER Service and operating positions 237 Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) To place the rack-mounted system or expansion unit into the service position, follow these steps: 1. If necessary, open the front rack door. 2. Remove the two screws (A) that secure the system unit to the rack as shown in the following figure. 3. Release the rack latches (B) on both the left and right sides as shown in the following figure. 238 Disk drives 4. If required, unplug the cables from the back of the system or expansion unit before you pull the unit out from the rack. Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you pull the unit out from the rack. 5. Slowly pull the system or expansion unit out from the rack until the rails are fully extended and locked. Ensure that the rails are fully extended. When the rails are fully extended, the rail safety latches lock into place. This action prevents the system or expansion unit from being pulled out too far. Placing the 8233-E8B or 8236-E8C in the operating position Use this procedure to place the system or expansion unit in the operating position to make the unit available for use. To place the system into the operating position follow these steps: Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. 1. Simultaneously release the blue rail safety latches (B), located near the front of each rail, and push the system or expansion unit into the rack as shown in the following figure. Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you push the unit back into the rack. Figure 201. Releasing the rack latches Service and operating positions 239 2. Replace and tighten the two thumbscrews (C) that secure the system or expansion unit (A) to the rack as shown in the following figure. 3. Close the front rack door. Figure 202. Releasing the rail safety latches Figure 203. Replacing the thumbscrews 240 Disk drives Power cords Use these procedures to disconnect and connect the power cords on IBM Power Systems servers that contain the POWER7 processor. Disconnecting the power cords from the 8202-E4B or 8205-E6B system Use these procedures to disconnect the power cords from the 8202-E4B or 8205-E6B system. 1. For a rack-mounted system, open the back rack door on the unit you are servicing and identify the system unit you are servicing in the rack. 2. For both a rack-mounted system and a stand-alone system, disconnect all power cords from the unit you are servicing. This system might be equipped with two power supplies. Before continuing with any remove and replace procedures that require the power off, ensure that both power sources to the system have been completely disconnected. Figure 204. Removing the power cords for a rack-mounted system © Copyright IBM Corp. 2010 241 Connecting the power cords to the 8202-E4B or 8205-E6B system Use this procedure to connect the power cords. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. Ensure that the cords are threaded through the handles. Figure 205. Removing the power cords for a stand-alone system 242 Disk drives 3. Close the back rack door. Disconnecting the power cords from the 8231-E2B Use these procedures to disconnect the power cords from the8231-E2B system. 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. This system might be equipped with two power supplies. Before continuing with any remove and replace procedures that require the power off, ensure that both power sources to the system have been completely disconnected. Figure 206. Connecting the power cords Power cords 243 Connecting the power cords to the 8231-E2B Use these procedures to connect the power cords to the8231-E2B system. 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Ensure that the cords are threaded through the handles. Figure 207. Removing the power cords 244 Disk drives Figure 208. Connecting the power cords Power cords 245 246 Disk drives Installing a feature by using the Hardware Management Console You can use the Hardware Management Console to perform many service actions, including the installation of a new feature or part. To use the Hardware Management Console user interface to install a feature or part into a system or expansion unit that is managed by an HMC, follow these steps: 1. In the navigation area, expand Systems Management ? Servers. 2. Select the managed system you will install the part in. 3. In the Tasks area expand Serviceability ? Hardware. 4. Select Add FRU (field replaceable unit). 5. In the Add/Install/Remove Hardware window select the system or enclosure into which you are installing the feature. 6. Select the type of feature you are installing from the menu and click Next 7. Select the location code for where you will install the feature, and click Add. 8. After the FRU is placed in the pending actions category click Launch Procedure and follow the instructions to install the feature. Note: The HMC might open external instructions for installing the feature. If so, follow those instructions to install the feature. Removing a part by using the Hardware Management Console You can use the Hardware Management Console (HMC) to perform many service actions, including the removal of a field replaceable unit (FRU) or part. To use the Hardware Management Console user interface to remove a part in a system or expansion unit that is managed by an HMC, follow these steps: 1. In the navigation area, expand Systems Management ? Servers. 2. Select the managed system from which you are removing a part. 3. In the Tasks area, expand Serviceability ? Hardware. 4. Select Remove FRU. 5. In the Add/Install/Remove Hardware - Remove FRU, Select FRU Type window, select the system or enclosure from which you are removing the part. 6. Select the type of part you are removing from the menu and click Next. 7. Select the location of the part you are removing and click Add. 8. After the part is listed in the Pending Actions display click Launch Procedure and follow the instructions to remove the part. Note: The HMC might open the information center instructions for removing the part. If so, follow those instructions to remove the part. Exchanging a part by using the Hardware Management Console You can use the Hardware Management Console (HMC) to perform many service actions, including exchanging a field replaceable unit (FRU) or part. © Copyright IBM Corp. 2010 247 If you are exchanging a part to repair a serviceable event follow those instructions. If you are exchanging a part as a part of any other procedure using HMC Version 7 or later use these steps: 1. In the navigation area, expand Systems Management ? Servers. 2. Select the managed system in which you are exchanging a part. 3. In the Tasks area expand Serviceability ? Hardware. 4. Select Exchange FRU. 5. In the Replace Hardware - Replace FRU, Select FRU Type window select the system or enclosure in which you will exchange a part from the list. 6. Select the type of part you will exchange from the menu and click Next. 7. Select the location code of the part you will exchange from the menu and click Add. 8. After the FRU is placed in the pending actions category click Launch Procedure and follow the instructions to exchange the feature. Note: The HMC might open external instructions for replacing the part. If so, follow those instructions to replace the part. 248 Disk drives Verifying the installed part You can verify a newly installed or replaced part on your system, logical partition, or expansion unit using the operating system, stand-alone diagnostics, or the Hardware Management Console (HMC). Verifying an installed feature or replaced part in an AIX system or logical partition If you installed feature or replaced a part, you might want to use the tools in AIX to verify that the feature or part is recognized by the system or logical partition. To verify the operation of a newly installed feature or replacement part, select the appropriate procedure: v Verify the installed feature using AIX v Verifying the replaced part using AIX Verify the installed feature by using AIX: 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. Select Advanced Diagnostics Routines and press Enter. 4. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 5. When the Advanced Diagnostic Selection menu appears, do one of the following: v To test a single resource, select the resource that you just installed from the list of resources and press Enter. v To test all the resources available to the operating system, select All Resources and press Enter. 6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts that appear. 7. Did the diagnostics run to completion and display the message No trouble was found? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the installation procedures to ensure that the new feature is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system is running in logical partitioning (LPAR) mode, note the logical partition in which you installed the feature. Contact your service provider for assistance. v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to normal operations. Verify the replacement part by using AIX: To verify the operation of a newly installed feature or replacement part, follow these steps: 1. Did you use either the AIX operating system or the online diagnostics service aid concurrent (hot-swap) service to replace the part? No: Go to step 2. Yes: Go to step 5 on page 250. 2. Is the system powered off? No: Go to step 4 on page 250. Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see Performing a slow boot. © Copyright IBM Corp. 2010 249 3. Start the system and wait until the AIX operating system login prompt is displayed or until apparent system activity on the operator panel or display has stopped. Did the AIX login prompt display? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the procedures for the part that you replaced to ensure that the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system does not start or you have no login prompt, see: Problems with loading and starting the operating system. If the system is partitioned, note the logical partition in which you replaced the part. Contact your service provider for assistance. v Yes: Go to step 4. 4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a command prompt, go to step 5. If the Diagnostic selection menu is shown with M appearing next to any resource, follow these steps: a. Select the resource and press Enter. b. Select Commit. c. Follow any instructions that are shown. d. If the Do you want to review the previously displayed error? message is shown, select Yes and press Enter. e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record the SRN and contact your service provider for assistance.. f. If no SRN is shown, go to step 5. 5. Test the part by doing the following steps: a. At the command line, type diag and press Enter. b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. c. From the Diagnostic Mode Selection menu, select System Verification and press Enter. d. Select All Resources, or select the diagnostics for the individual part to test only the part you replaced and any devices that are attached to the part you replaced and press Enter. Did the Resource Repair Action menu appear? No: Go to step 6. Yes: Go to step 7. 6. Did the Testing Complete, No trouble was found message appear? v No: There is still a problem. Contact your service provider. This ends the procedure. v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0 and press Enter. Tip: This action changes the indicator light for the part from the fault state to the normal state. Go to step 9 on page 251. 7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: On systems with an indicator light for the failing part, this action changes the indicator light to the normal state. 250 Disk drives a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the resource list, select sysplanar0 and press Enter. b. Select Commit after you make your selections. Did another Resource Repair Action display appear? No: If the No Trouble Found display appears, go to step 9 Yes: Go to step 8. 8. Select the parent or child of the resource for the replaced part from the Resource Repair Action menu if necessary. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: This action changes the indicator light for the part from the fault state to the normal state. a. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was to reseat a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the resource list, select sysplanar0 and press Enter. b. Select Commit after you make your selections. c. If the No Trouble Found display appears, go to step 9. 9. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the values they had prior to servicing the system. 10. Did you do any hot-plug procedures before doing this procedure? No: Go to step 11. Yes: Go to step 12. 11. Start the operating system, with the system or logical partition in normal mode. Were you able to start the operating system? No: Contact your service provider. This ends the procedure. Yes: Go to step 12. 12. Are the indicator lights still on? v No. This ends the procedure. v Yes. Turn off the lights. See the following for instructions: Changing service indicators. Verifying the installed part in an IBM i system or logical partition If you have installed a new feature or part, verify that the system recognizes the feature or part by using the IBM i system service tools. To verify the installed part, follow these steps: 1. Deactivate the failing item indicator light. For instructions, see “Deactivating the failing-part indicator light” on page 196. 2. Sign on with at least service level authority. 3. On the command line of the IBM i session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service Focal Point Utilities to get to the Dedicated Service Tools (DST) display. 4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display and press Enter. Verifying the installed part 251 Note: The service tools password is case-sensitive. 5. Select Start a service tool from the System Service Tools (SST) display and press Enter. 6. Select Hardware service manager from the Start a Service Tool display and press Enter. 7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager display and press Enter. This option allows you to display and work with logical resources. Logical hardware resources are the functional resources of the system used by the operating system. With the Logical Hardware Resources display, you can show logical hardware resource status or information, and associated packaging hardware resources. Use the online Help information to better understand specific functions, fields, or symbols. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. Select option 7 (Indicator off) to turn off the indicator light. 12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all problems have been resolved. 13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display. Verifying the installed part in a Linux system or logical partition If you have installed a new part, learn how to verify that the system recognizes the part. To verify the newly installed or replaced part, continue with “Verifying an installed part by using stand-alone diagnostics” on page 253. 252 Disk drives Verifying an installed part by using stand-alone diagnostics If you have installed or replaced a part, verify that the system recognizes the new part. You can use stand-alone diagnostics to verify an installed part in a Linux system, expansion unit, or logical partition. v If this server is directly attached to another server or attached to a network, ensure communications with the other servers have stopped. v The stand-alone diagnostics require use of all of the logical partition resources. No other activity can be running on the logical partition. v The stand-alone diagnostics require access to the system console. You access these diagnostics from a CD-ROM or from the Network Installation Management (NIM) server. This procedure describes how to use the diagnostics from a CD-ROM. For information on running diagnostics from the Network Installation Management (NIM) server, see Running stand-alone diagnostics from a Network Installation Management server. To use stand-alone diagnostics, follow these steps: 1. Stop all jobs and applications and then stop the operating system on the system or logical partition. 2. Remove all tapes, diskettes, and CD-ROM. 3. Turn off the system unit power. The next step boots the server or logical partition from the stand-alone diagnostics CD-ROM. If the optical drive is not available as the boot device on the server or logical partition on which you are working, follow these steps: a. Access the ASMI. See Accessing the ASMI for information on using the ASMI. b. On the ASMI main menu, click on Power/Restart Control. c. Click Power On/Off System. d. Select the Service mode boot from default boot list option in the AIX or Linux logical partition mode boot drop-down menu. e. Click Save settings and power on. As soon as the optical drive has power, insert the standalone diagnostic CD-ROM. f. Go to step 5. 4. Turn on the system unit power and immediately insert the diagnostics CD-ROM into the optical drive. 5. After the keyboard POST indicator displays on the system console and before the last POST indicator (speaker) displays, press the numeric 5 key on the system console to indicate that a service mode boot should be initiated using the default-service mode boot list. 6. Enter any requested password. 7. At the Diagnostic Operating Instructions display, press Enter. Tip: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Note: If you received an SRN or any other reference code when you attempted to start the system, contact your service provider for assistance.. 8. If the terminal type is requested, select the Initialize Terminal option on the Function Selection menu to initialize the operating system. 9. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. 10. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 11. When the Advanced Diagnostic Selection menu appears, select All Resources, or test only the part you replaced, and any devices that are attached to the part you replaced, by selecting the diagnostics for the individual part and press Enter. 12. Did the Testing Complete, No trouble was found message appear? Verifying the installed part 253 v No: There is still a problem. Contact your service provider. v Yes: Go to step 13. 13. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the value they had prior to servicing the system. 14. If the indicator lights are still on, follow these steps: a. Select Identify and Attention Indicators from the Task Selection menu to turn off the system attention and indicator lights and press Enter. b. Select Set System Attention Indicator to NORMAL and press Enter. c. Select Set All Identify Indicators to NORMAL and press Enter. d. Choose Commit. Note: This changes the system attention and identify indicators from the Fault state to the Normal state. e. Exit to the command line. Verifying the installed part by using Hardware Management Console If you have installed or replaced a part, use the Hardware Management Console (HMC) to update your HMC records after you have completed a service action on your server. If you have reference codes, symptoms, or location codes that you used during the service action, locate the records for use during this procedure. To verify the installed part, complete these steps: 1. At the HMC, examine the service action event log for any open service action events. See “Viewing serviceable events” on page 256 for details. 2. Are there any service action events that are open? No: If the system attention LED is still on, use the HMC to turn off the LED. See “Activating and deactivating LEDs” on page 255. This ends the procedure. Yes: Continue with the next step. 3. Record the list of open service action events. 4. Examine the details of the open service action event. Is the error code associated with this service action event the same as you gathered earlier. v No: Select one of the following options: – Review the other serviceable events, find one that does match, and continue with the next step. – If the log does not match what you had gathered earlier, contact your service provider. v Yes: Continue with the next step. 5. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 6. Click Close Event. 7. Add comments for the serviceable event. Include any unique additional information. Click OK. 8. Did you replace, add, or modify a field replaceable unit (FRU) of the open service action event? v No: Select the No FRU Replaced for this Serviceable Event option, and click OK to close the service action event. v Yes: Perform the following steps: a. From the FRU list, select a FRU that you need to update. b. Double-click the FRU and update the FRU information. c. Click OK to close the service action event. 9. If you continue to have problems, contact your service provider. 254 Disk drives Activating and deactivating LEDs Use this procedure to activate or deactivate LEDs using Service Focal Point for the HMC. Choose from the following: v “Deactivating a system attention LED or partition LED” v “Activating or deactivating identify LED” Deactivating a system attention LED or partition LED You can deactivate a system attention LED or a logical partition LED. For example, you might determine that a problem is not a high priority and decide to repair the problem at a later time. However, you want to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be activated again if another problem occurs. 1. In the navigation area, open Systems Management. 2. Open Servers and select the appropriate system. 3. In the content area, check the box for the appropriate Partition. 4. Select Tasks, then Operations, and then Manage Attention LED. 5. Select the appropriate Partition. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED. Activating or deactivating identify LED The system provides several LEDs that help identify various components, such as enclosures or field replaceable units (FRUs), in the system. For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. In the navigation area, open Systems Management. 2. Select Servers. 3. In the content area, check the box for the appropriate System. 4. Select Tasks, then Operations, then LED Status, and then Identify LED. Verifying the installed part 255 5. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 6. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected ? List FRUs. 7. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Viewing serviceable events Use this procedure to view a serviceable event, including details, comments, and service history. To view serviceable events and other information about the events, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer v Viewer To view serviceable events, follow these steps: 1. In the navigation area, select Service Management. 2. Select Manage Serviceable Events. 3. Select the criteria for the serviceable events that you want to view, and click OK. The Serviceable Event Overview window opens. The list shows all serviceable events that match your selection criteria. You can use the menu options to perform actions on the serviceable events. 4. Select a line in the Serviceable Event Overview window, and select Selected ? View Details The Serviceable Event Details window opens, showing detailed information about the serviceable event. The upper table shows information, such as problem number and reference code. The lower table shows the field replaceable units (FRUs) associated with this event. 5. Select the error for which you want to view comments and history, and follow these steps: a. Select Actions ? View Comments. b. When you are finished viewing the comments, click Close. c. Select Actions ? View Service History. The Service History window opens, showing service history associated with the selected error. d. When you are finished viewing the service history, click Close. 6. When you are finished, click Cancel twice to close the Serviceable Event Details window and the Serviceable Event Overview window. Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools If you installed feature or replaced a part, you might want to use the tools in Virtual I/O Server (VIOS) to verify that the feature or part is recognized by the system or logical partition. To verify the operation of a newly installed feature or replacement part, select the appropriate procedure: v Verify the installed feature using VIOS v Verifying the replaced part using VIOS Verify the installed feature by using VIOS: 1. Log in as root user. 2. At the command line, type diagmenu and press Enter. 256 Disk drives 3. Select Advanced Diagnostics Routines and press Enter. 4. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 5. When the Advanced Diagnostic Selection menu appears, do one of the following: v To test a single resource, select the resource that you just installed from the list of resources and press Enter. v To test all the resources available to the operating system, select All Resources and press Enter. 6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts that appear. 7. Did the diagnostics run to completion and display the message No trouble was found? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the installation procedures to ensure that the new feature is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system is running in LPAR mode, note the logical partition in which you installed the feature. Contact your service provider for assistance. v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to normal operations. Verify the replacement part by using VIOS: To verify the operation of a newly installed feature or replacement part, follow these steps: 1. Did you replace the part by using either VIOS or the online diagnostics service aid's concurrent (hot-swap) service operation? No: Go to step 2. Yes: Go to step 5 on page 258. 2. Is the system powered off? No: Go to step 4. Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see Performing a slow boot. 3. Start the system and wait until the VIOS operating system login prompt displays or until apparent system activity on the operator panel or display has stopped. Did the VIOS login prompt display? v No: If an SRN or other reference code is displayed, suspect a loose adapter or cable connection. Review the procedures for the part that you replaced to ensure that the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system does not start or you have no login prompt, see: Problems with loading and starting the operating system. If the system is partitioned, note the logical partition in which you replaced the part. Contact your service provider for assistance. v Yes: Go to step 4 4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a command prompt, go to step 5 on page 258. If the Diagnostic selection menu is shown with M appearing next to any resource, follow these steps: a. Select the resource and press Enter. b. Select Commit. c. Follow any instructions that are shown. d. If a Do you want to review the previously displayed error? message is shown, select Yes and press Enter. Verifying the installed part 257 e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record the SRN and contact your service provider for assistance.. f. If no SRN is shown, go to 5. 5. Test the part by doing the following: a. At the command line, type diagmenu and press Enter. b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. c. From the Diagnostic Mode Selection menu, select System Verification and press Enter. d. Select All Resources, or select the diagnostics for the individual part to test only the part you replaced, and any devices that are attached to the part you replaced and press Enter. Did the Resource Repair Action menu appear? No: Go to step 6. Yes: Go to step 7. 6. Did the Testing Complete, No trouble was found message appear? v No: There is still a problem. Contact your service provider. This ends the procedure. v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the Resource List, select sysplanar0 and press Enter. Tip: This action changes the indicator light for the part from the fault state to the normal state. Go to step 9 on page 259 7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: On systems with a indicator light for the failing part, this changes the indicator light to the normal state. a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the Resource List, select sysplanar0. Press Enter. b. Select Commit after you make your selections. Did another Resource Repair Action display appear? No: If the No Trouble Found display appears, go to step 9 on page 259 Yes: Go to step 8. 8. Select the parent or child of the resource for the replaced part from the Resource Repair Action menu if necessary. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: This changes the indicator light for the part from the fault state to the normal state. a. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was to reseat a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the Resource List, select sysplanar0. Press Enter. b. Select Commit after you make your selections. c. If the No Trouble Found display appears, go to step 9 on page 259. 258 Disk drives 9. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the values they had prior to servicing the system. 10. Did you do any hot-plug procedures before doing this procedure? No: Go to step 11. Yes: Go to step 12. 11. Start the operating system, with the system or logical partition in normal mode. Were you able to start the operating system? No: Contact your service provider. This ends the procedure. Yes: Go to step 12. 12. Are the indicator lights still on? v No. This ends the procedure. v Yes. Turn off the lights. See one of the following for instructions: Changing service indicators Verifying the installed part 259 260 Disk drives Verifying a repair Use these procedures to verify hardware operation after making repairs to the system. Choose from the following options: v To verify the repair of a system that is currently powered off, go to step 1. v To verify the repair of a system that is currently powered on with no operating system loaded, go to step 3. v To verify the repair of a system that is currently powered on and that has an operating system loaded, go to step 5. 1. Power on the server and all attached I/O enclosures. Did all the enclosures power on? Yes: Go to step 3.? Go to step 3. No: Continue with the next step. 2. Choose from the following options: v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate and replace the next field-replaceable unit (FRU). v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that an enclosure would not power on and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 3. Load the operating system. Did the operating system load successfully? Yes: Go to step 5. No: Continue with the next step. 4. Choose from the following options: v If the original problem was a failing disk drive that contained the operating system software, go to step 5. v If the original problem was that the operating system would not load and you have another FRU to replace, go to your FRU locations section to locate the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that the operating system would not load and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that the operating system would not load and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 5. Choose from the following options: v “Verifying the repair in AIX” on page 262 v “Verifying the repair in Linux” on page 267 v “Verifying a repair using an IBM i system or logical partition” on page 265 © Copyright IBM Corp. 2010 261 Verifying the repair in AIX You can use this procedure to verify that a repair is complete using the AIX operating system. Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed. 1. Did you replace a disk drive in the root volume group? No Go to step 3. Yes Continue with the next step. 2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM) server. Did you encounter any problems? No Reinstall the operating system and continue with step 5. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 3. Did you replace a FRU with the power turned on and concurrently with system operations? No Go to step 5. Yes Continue with the next step. 4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? Yes Go to step 6. No Go to step 7 on page 263. Note: The AIX diagnostic service aid was used if a resource was removed using the Hot Plug task. 5. Note: If any FRUs have been removed that should be reinstalled, reinstall them now. Perform the following steps: 1. If the system is not powered on, power it on now. 2. Perform a slow boot. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. 4. Did you encounter any problems? No Continue at step 6. Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 6. If the Resource Repair Action menu is already displayed, go to step 9 on page 263; otherwise, perform the following steps: 262 Disk drives 1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with the next step. 7. Perform the following steps: 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select System verification. 5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? No Continue with the next step. Yes Go to step 9. 8. Did the Testing Complete, no trouble was found menu (801010) display? Yes Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this action will set it back to the normal state. Go to step 11 on page 264. No If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on that resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: Verifying a repair 263 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? No If the No Trouble Found menu displays, go to step 11. Yes Continue with the next step. 10. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: 1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. 3. If the No Trouble Found menu displays, continue with the next step. 11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This information does not apply to the PCI-X RAID adapter or cache. No Go to the close of call procedure. Yes Continue with the next step. 12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the following steps: 264 Disk drives 1. On the PCI SCSI Disk Array Manager display, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. If you see a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When the recovery is complete, exit the operating system. 9. Go to the close of call procedure. Verifying a repair using an IBM i system or logical partition Use this procedure to verify a repair using the IBM i operating system. 1. Was the system powered off during the repair? Yes: Continue with the next step. No: Continue with step 3. 2. Perform the following tasks: a. Verify that the power cable is plugged into the power outlet. b. Verify that power is available at the customer's power outlet. 3. Was the partition powered off during the repair? Yes: Continue with the next step. No: Continue with step 6. 4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type mode and speed options in the Service functions). 5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the system complete the IPL? Yes: Continue with the next step. No: This might be a new problem. Go to the Start of call procedure. This ends the procedure. 6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O adapter, or storage device replaced? Yes: Continue with step 10. No: Continue with the next step. 7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any reference codes that are related to this IPL? Yes: Continue with the next step. No: If the problem was related to removable media or communications, perform the verification procedures in the Service functions to verify that the problem is corrected. Then return the system to the customer and have the customer verify the system date and time. This ends the procedure. Verifying a repair 265 8. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure. This ends the procedure. 9. Are there any other failing items that remain to be replaced? Yes: Replace the next failing item listed for this reference code. This ends the procedure. No: Contact your next level of support for assistance. This ends the procedure. 10. Was concurrent maintenance performed on an optical storage unit? Yes: The product activity log and service action log, in most cases, contain a reference code for the optical storage unit when concurrent maintenance is performed. You can ignore this reference code. Perform the following: - Perform the verification procedures in the Service functions topic to verify that the problem is corrected. - Return the system to the customer and have the customer verify the system date and time. This ends the procedure. No: Continue with the next step. 11. Use the service action log to look for any new reference codes (see Using the Service Action Log). Are there any new reference codes? Yes: Continue with the next step. No: Go to step 14. 12. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure to determine the cause of the problem. This ends the procedure. 13. Are there any other failing items that need to be replaced? Yes: Replace the next failing item listed for the reference code. This ends the procedure.. No: Contact your next level of support for assistance. This ends the procedure. 14. Are you working with a tape device? Yes: Perform the verification procedures in the Service functions to verify that the problem is corrected. After the verification test is complete, the tape device description will be set to the failed state because a resource change was detected. Perform the following tasks: - Vary the tape device description off and then on. - Return the system to the customer and have the customer verify the system date and time. Then go to Verifying the repair from the HMC. This ends the procedure. No: Continue with the next step. 15. Are you working with an IOP or an IOA? Yes: Use the display hardware configuration service function to check for any missing or failed hardware: - On the command line, enter the STRSST (Start System Service Tools command). If you cannot get to SST, select DST. Do not IPL the system or partition to get to DST. - On the Start Service Tools Sign On display, enter the user ID with the service authority and password. - Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System bus resources. - Select the function key for Include nonreporting resources. - If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has not been fixed. Continue to the next failing item in the failing item list. This ends the procedure. 266 Disk drives No: Perform the verification procedures in the Service functions topics to verify that the problem is corrected. Resources that usually vary on automatically during an IPL, or that were previously varied on manually, might need to be varied on again after the verification procedures are complete. Return the system to the customer and have the customer verify the system date and time. This ends the procedure. Verifying the repair in Linux You can use this procedure to verify that a repair is complete using the Linux operating system. 1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM) server. See Running the stand-alone diagnostics from CD-ROM. Did you encounter any problems? No Reboot the operating system and continue with the close of call procedure. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis and repair the new problem. Verifying the repair from the HMC Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer using the hardware management console (HMC). Follow this checklist before performing the procedures: v You return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the HMC online repair procedures, ensure that the original serviceable event is now closed. 1. Is an HMC used to manage the server that you are servicing? v Yes: Continue with the next step. v No: Return to the “Verifying a repair” on page 261. This ends the procedure. 2. Are you closing a service event that was a repair on the HMC personal computer? v Yes: Continue with the next step. v No: Go to step 4 on page 268. 3. Power on the HMC. Did the power-on process complete without errors? v Yes: Ensure that the HMC can be used to perform server management tasks, and return the HMC to normal operations. Go to “Closing a service call” on page 269. This ends the procedure. v No: Go to the HMC isolation procedures. This ends the procedure. Verifying a repair 267 4. Log into the HMC as the service representative. If invalid user or invalid password displays, get the correct login information from the system administer. 1. If logged on the System Manager, select Exit from the Console, located on the System Manager window. 2. Login to the System Manager with the following: User identification service Password service mode 5. View Serviceable Event Details. 1. In the navigation area, select the Service Applications icon. 2. In the navigation area, select the Service Focal Point icon. 3. In the Contents area, select Manage Serviceable Events. 4. Designate the set of Serviceable Events you want to view. When you are finished, click OK. The Service Event Overview window opens. Note: Only the events that match all of the criteria you specify are shown. 6. Close open or delayed events. 1. Select the problem to close, on the Service Event Overview window. 2. Select the menu Selected, located on the menu bar. 3. Click Close Event. 4. Enter your comments in the Serviceable Event Comments window, and click Close Event. 5. Close all events associated with the problem on which you were working. 7. Did the Service Event Overview window contain the event or events on which you were working? v Yes: Return the HMC to normal operations. Go to “Closing a service call” on page 269. This ends the procedure. v No: Go to Detecting problems. This ends the procedure. 268 Disk drives Closing a service call Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable events might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and no problems require additional service actions. v If the repair was done using the Hardware Management Console (HMC) online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. Is the server managed by an HMC? v Yes: Continue with the next step. v No: Do one of the following steps: – If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using Integrated Virtualization Manager” on page 277. – If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call using AIX or Linux” on page 273. 2. On the HMC, open Manage Serviceable Events and examine the service action event log for any open service action events. 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 283. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32 on page 272 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. © Copyright IBM Corp. 2010 269 Is the error code that is associated with this service action event the same as recorded in Step 1 on page 269? v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 269? v Yes: Go to step 11. v No: Continue with the next step. 10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 269 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 32 on page 272. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? v Yes: Continue with the next step. v No: Go to step 17 on page 271. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16. 15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of partitions obtained in step 11 in future steps, the list is empty. Go to step 17 on page 271. 270 Disk drives 17. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. 20. Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU information. Go to step 23. 22. Select the No FRU Replaced for this Serviceable Event option. 23. Click OK to close the service action event. 24. Is the list of all partitions you recorded in step 11 on page 270 empty? v Yes: Go to step 32 on page 272. v No: Continue with the next step. 25. Does the list of all partitions you recorded in step 11 on page 270 contain more than one entry? v Yes: Continue with the next step. v No: Go to step 32 on page 272. 26. Is the error class recorded in step 25 AIX? v Yes: Continue with the next step. v No: Go to step 32 on page 272. 27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 270, except the partition you were using to debug the original problem. 28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. Closing a service call 271 29. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 30. Exit from diagnostics in this partition and return to the AIX prompt. 31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 270 been processed? v Yes: Continue with the next step. v No: Go to step 24 on page 271 to process the next partition in the list you recorded in step 11 on page 270. 32. Have all the serviceable events recorded in step 4 on page 269 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 269 and process the next service action event in the list of serviceable events recorded in step 4 on page 269. 33. While processing all service action events, were you directed to step 14 on page 270? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 283. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 34. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 270, except the partition you were using to debug the original problem. 35. From the HMC virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 36. When the diagnostic operating instructions are displayed, do the following steps: 272 Disk drives 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 37. Exit from diagnostics in this partition and return to the AIX prompt. 38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 270 been processed? v Yes: Continue with the next step. v No: Go to step 34 on page 272 to process the next partition in the list you recorded in step 14 on page 270. 39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 283. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Closing a service call using AIX or Linux If the server is not connected to an Hardware Management Console (HMC) and not using Integrated Virtualization Manager (IVM), perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and that no problems require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? v Yes: Go to step 4 on page 274 v No: Continue with the next step. Closing a service call 273 2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that were removed during problem analysis that you want to put back into the system? Note: If the system planar or battery has been replaced and you are loading diagnostics from a server over a network, it might be necessary for the customer to set the network boot information for this system before diagnostics can be loaded. The system time and date information should also be set when the repair is completed. v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3 v No: Continue with the next step. 3. Is the system or logical partition that you are performing a repair action on running the AIX operating system? v Yes: Continue with the next step. v No: Go to step 5. 4. Does the system or logical partition you are performing a repair action on have AIX installed? Note: Answer no to this question if you have just replaced a hard disk in the root volume group. v Yes: Go to step 7. v No: Continue with the next step. 5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a Network Installation Management (NIM) server. Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to Running the stand-alone diagnostics from CD on a server without an HMC attached. For instructions on running stand-alone diagnostics from a NIM server, go to Running the stand-alone diagnostics from a Network Installation Management server. Did you encounter any problems? v Yes: Go to problem analysis. v No: Continue with the next step. 6. The system hardware is functioning correctly. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 283. This completes the repair. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. This might require you to reboot the operating system. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 7. Complete the following steps: 274 Disk drives 1. If the system supports slow boot (see Performing a slow boot), do a slow boot on the system. If the system does not support slow boot, do a normal boot. 2. Power on the system. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. Did the AIX Login Prompt display? v Yes: Continue with the next step. v No: Go to problem analysis. 8. If the Resource Repair Action menu is already displayed, go to 12 on page 276; otherwise, do the following steps: 1. Log into the operating system, either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an system request number (SRN) displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with 9. 9. Complete the following steps: 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select Problem determination. 5. When the Advanced Diagnostic Selection menu displays, select the All Resources option, or test the FRUs you exchanged, and any devices that are attached to the FRUs you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? v Yes: Go to step 13 on page 276. v No: Continue with the next step. 10. Did the TESTING COMPLETE, no trouble was found menu (801010) display? v Yes: Continue with the next step. v No: There is still a problem. Go to problem analysis. 11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. Closing a service call 275 If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this will set it back to the normal state. Go to step 14. 12. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following steps: 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? v Yes: Continue with the next step. v No: If the No Trouble Found menu displays, go to step 14. 13. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following steps: 1. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. Did the No Trouble Found menu display? v Yes: Continue with the next step. v No: Go to problem analysis. 14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. 276 Disk drives Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This does not refer to the PCI-X RAID adapter or cache. v Yes: Continue with the next step. v No: Go to step 16. 15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following steps: 1. On the PCI SCSI Disk Array Manager dialog, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next dialog, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating system. 9. Go to 16. 16. The system hardware is functioning correctly. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. Closing a service call using Integrated Virtualization Manager Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. Closing a service call 277 2. On the IVM, open Manage Serviceable Events and look at existing serviceable events. 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 283. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 280 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in step 1 on page 277? v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 277? v Yes: Go to step 11. v No: Continue with the next step. 10. Is the FRU you replaced and recorded in step 1 on page 277 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 30 on page 280. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 278 Disk drives 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? v Yes: Continue with the next step. v No: Go to step 17. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 278. Go to step 16. 15. Add the partition list obtained in step 11 on page 278 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11 on page 278. If you are referred to the list of partitions obtained in step 11 on page 278 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Manage Serviceable Events window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. 20. Add or update FRU information: Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. Click OK to close the service action event. 22. Is the list of all partitions you recorded in step 11 on page 278 empty? v Yes: Go to step 30 on page 280. v No: Continue with the next step. Closing a service call 279 23. Does the list of all partitions you recorded in step 11 on page 278 contain more than one entry? v Yes: Continue with the next step. v No: Go to step 30. 24. Is the error class recorded in step 23? v Yes: Continue with the next step. v No: Go to step 30. 25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 278, except the partition you were using to debug the original problem. 26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 27. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 28. Exit from diagnostics in this partition and return to the AIX prompt. 29. Have all the partitions in the list of all partitions you recorded in step 11 on page 278 been processed? v Yes: Continue with the next step. v No: Go to step 25 to process the next partition in the list you recorded in step 11 on page 278. 30. Have all the serviceable events recorded in step 4 on page 278 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 278 and process the next service action event in the list of serviceable events recorded in step 4 on page 278. 31. While processing all service action events, were you directed to step 14 on page 279? 280 Disk drives v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 283. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 32. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 279, except the partition you were using to debug the original problem. 33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 34. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 35. Exit from diagnostics in this partition and return to the AIX prompt. 36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 279 been processed? v Yes: Continue with the next step. v No: Go to step 32 to process the next partition in the list you recorded in step 14 on page 279. 37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 283. This completes the repair. Return the system to the customer. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Closing a service call 281 282 Disk drives Activating and deactivating LEDs You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the Hardware Management Console (HMC) or the Advanced System Management Interface (ASMI). For IBM Power Systems servers that contain the POWER7 processor, the LEDs can be used to identify or verify a part that you are servicing. The error and identify function (amber) LED indicates an error and corresponds to the location code in the system reference code (SRC). The LED is activated and deactivated automatically. Additionally, the following procedures can also be used to activate and deactivate LEDs. v “Deactivating a system attention LED or partition LED using the HMC” v “Activating or deactivating an identify LED using the HMC” on page 284 v “Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface” on page 284 v “Activating or deactivating an identify LED using the Advanced System Management Interface” on page 285 Deactivating a system attention LED or partition LED using the HMC You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not a high priority and you decide to repair the problem at a later time. If you want to be alerted if another problem occurs, you must deactivate the system attention LED so that it can be activated again if another problem occurs. To deactivate a system attention LED using the HMC, complete the following steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select View System Attention. The system attention LED window opens. The selected system and its LED state are displayed in the upper part of the window. The logical partition and its LED state are displayed in the lower part of the window. From the system attention LED window, you can deactivate both the system attention LED and the logical partition LED. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED. © Copyright IBM Corp. 2010 283 Activating or deactivating an identify LED using the HMC You can activate or deactivate an identify LED for components attached to the system. The system provides several LEDs that help identify various components, such as enclosures or field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on. 3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens. 4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select Selected ? List FRUs. 6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface You can deactivate a system attention LED or a logical partition LED using the Advanced System Management Interface (ASMI). The system attention indicator provides a visual signal that the system as a whole requires attention or service. Each system has a single system attention indicator. When an event occurs that either needs your intervention or that of service and support, the system attention indicator lights continuously. The system attention indicator is turned on when an entry is made in the service processor error log. The error entry is transmitted to the system error log and the operating system error log. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To turn off the system attention indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention Indicator. 3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error message is displayed. 284 Disk drives Activating or deactivating an identify LED using the Advanced System Management Interface You can activate or deactivate an identify LED using the Advanced System Management Interface (ASMI). You can specify the location code of any indicator to view or modify its current state. If you provide the wrong location code, the advanced system manager attempts to go to the next higher level of the location code. The next level is the base-level location code for that field replaceable unit (FRU). For example, a user types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no other level is available. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To change the current state of an indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location code. 3. In the right pane, enter the location code of the FRU and click Continue. 4. Select the preferred state from the list. 5. Click Save settings. Activating and deactivating LEDs 285 286 Disk drives Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer's representative for information on the products and services currently available in your area. Any reference to the manufacturer's product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not grant you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to Web sites not owned by the manufacturer are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this product and use of those Web sites is at your own risk. The manufacturer may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment. Information concerning products not produced by this manufacturer was obtained from the suppliers of those products, their published announcements or other publicly available sources. This manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products not produced by this manufacturer. Questions on the capabilities of products not produced by this manufacturer should be addressed to the suppliers of those products. All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only. © Copyright IBM Corp. 2010 287 The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are subject to change without notice. Dealer prices may vary. This information is for planning purposes only. The information herein is subject to change before the products described become available. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. If you are viewing this information in softcopy, the photographs and color illustrations may not appear. The drawings and specifications contained herein shall not be reproduced in whole or in part without the written permission of the manufacturer. The manufacturer has prepared this information for use with the specific machines indicated. The manufacturer makes no representations that it is suitable for any other purpose. The manufacturer's computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check the manufacturer's support websites for updated information and fixes applicable to the system and related software. Trademarks IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others. Electronic emission notices Class A Notices The following Class A statements apply to the IBM servers that contain the POWER7 processor and its features unless designated as electromagnetic compatibility (EMC) Class B in the feature information. Federal Communications Commission (FCC) statement Note: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. 288 Disk drives Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate the equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class A digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Warning: This is a Class A product. In a domestic environment, this product may cause radio interference, in which case the user may be required to take adequate measures. VCCI Statement - Japan The following is a summary of the VCCI Japanese statement in the box above: This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a domestic environment, radio interference may occur, in which case, the user may be required to take corrective actions. Notices 289 Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) Electromagnetic Interference (EMI) Statement - People's Republic of China Declaration: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may need to perform practical action. Electromagnetic Interference (EMI) Statement - Taiwan The following is a summary of the EMI Taiwan statement above. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information: 290 Disk drives Electromagnetic Interference (EMI) Statement - Korea Please note that this equipment has obtained EMC registration for commercial use. In the event that it has been mistakenly sold or purchased, please exchange it for equipment certified for home use. Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. Notices 291 New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Electromagnetic Interference (EMI) Statement - Russia Class B Notices The following Class B statements apply to features designated as electromagnetic compatibility (EMC) Class B in the feature installation information. Federal Communications Commission (FCC) statement This equipment has been tested and found to comply with the limits for a Class B digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference in a residential installation. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instructions, may cause harmful interference to radio communications. However, there is no guarantee that interference will not occur in a particular installation. If this equipment does cause harmful interference to radio or television reception, which can be determined by turning the equipment off and on, the user is encouraged to try to correct the interference by one or more of the following measures: v Reorient or relocate the receiving antenna. v Increase the separation between the equipment and receiver. v Connect the equipment into an outlet on a circuit different from that to which the receiver is connected. v Consult an IBM-authorized dealer or service representative for help. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for any radio or television interference caused by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate this equipment. 292 Disk drives This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class B digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class B Information Technology Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for typical residential environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com VCCI Statement - Japan Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) Notices 293 IBM Taiwan Contact Information Electromagnetic Interference (EMI) Statement - Korea Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 294 Disk drives IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Terms and conditions Permissions for the use of these publications is granted subject to the following terms and conditions. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved. You may not distribute, display or make derivative works of these publications, or any portion thereof, without the express consent of the manufacturer. Commercial Use: You may reproduce, distribute and display these publications solely within your enterprise provided that all proprietary notices are preserved. You may not make derivative works of these publications, or reproduce, distribute or display these publications or any portion thereof outside your enterprise, without the express consent of the manufacturer. Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either express or implied, to the publications or any information, data, software or other intellectual property contained therein. The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer, the above instructions are not being properly followed. You may not download, export or re-export this information except in full compliance with all applicable laws and regulations, including all United States export laws and regulations. THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR PURPOSE. Bull 2104 Model DU3 Expandable Storage Plus Operator’s Guide 86 A1 16EF 01 ORDER REFERENCE Bull 2104 Model DU3 Expandable Storage Plus Operator’s Guide Hardware December 2000 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 16EF 01 ORDER REFERENCE The following copyright notice protects this book under the Copyright laws of the United States of America and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull S.A. 1992, 2000 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. AIX is a registered trademark of International Business Machines Corporation, and is being used under licence. UNIX is a registered trademark in the United States of America and other countries licensed exclusively through the Open Group. Year 2000 The product documented in this manual is Year 2000 Ready. The information in this document is subject to change without notice. Groupe Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material. Contents Communications Statements . . . . . . . . . . . . . . . . . . . v Federal Communications Commission (FCC) Statement . . . . . . . . . . v Japanese Voluntary Control Council for Interference (VCCI) Statement. . . . . . v Korean Government Ministry of Communication (MOC) Statement . . . . . . . v New Zealand Compliance Statement . . . . . . . . . . . . . . . . . v International Electrotechnical Commission (IEC) Statement . . . . . . . . . vi Avis de conformité à la réglementation d’Industrie Canada . . . . . . . . . vi Industry Canada Compliance Statement. . . . . . . . . . . . . . . . vi United Kingdom Telecommunications Requirements. . . . . . . . . . . . vi European Union (EU) Statement . . . . . . . . . . . . . . . . . . vi Radio Protection for Germany . . . . . . . . . . . . . . . . . . . vi Taiwan Class A Compliance Statement . . . . . . . . . . . . . . . . vii Safety Notices. . . . . . . . . . . . . . . . . . . . . . . . ix Definitions of Safety Notices. . . . . . . . . . . . . . . . . . . . ix Acoustic Noise Declaration . . . . . . . . . . . . . . . . . . . . ix Electrostatic Discharge (ESD) . . . . . . . . . . . . . . . . . . xi About This Book . . . . . . . . . . . . . . . . . . . . . . xiii If You Need More Information . . . . . . . . . . . . . . . . . . . xiii Trademarks. . . . . . . . . . . . . . . . . . . . . . . . . xiii Numbering Convention . . . . . . . . . . . . . . . . . . . . . xiv Chapter 1. Using the 2104 Model DU3 or TU3 . . . . . . . . . . . . . 1 2104 Model DU3 . . . . . . . . . . . . . . . . . . . . . . . 2 2104 Model TU3. . . . . . . . . . . . . . . . . . . . . . . . 5 Controls and Lights. . . . . . . . . . . . . . . . . . . . . . . 7 Disk Drive Module Lights. . . . . . . . . . . . . . . . . . . . 8 Fan-and-Power-Supply Assembly and Fan Assembly Lights and Switch . . . . 11 SCSI Interface Card Lights. . . . . . . . . . . . . . . . . . . 14 Switch Card. . . . . . . . . . . . . . . . . . . . . . . . 16 Identifying 2104 Disk Enclosures. . . . . . . . . . . . . . . . . . 20 Identifying Disk Drive Modules. . . . . . . . . . . . . . . . . . . 21 Security . . . . . . . . . . . . . . . . . . . . . . . . . . 22 Ordering Cover Lock Keys . . . . . . . . . . . . . . . . . . . 23 Chapter 2. Adding Disk Drive Modules . . . . . . . . . . . . . . . 25 Before Adding a Disk Drive Module . . . . . . . . . . . . . . . . . 27 To Add a Disk Drive Module . . . . . . . . . . . . . . . . . . . 29 Chapter 3. Exchanging FRUs . . . . . . . . . . . . . . . . . . 35 Exchanging Disk Drive Modules . . . . . . . . . . . . . . . . . . 36 Before Exchanging a Disk Drive Module . . . . . . . . . . . . . . 37 To Exchange a Disk Drive Module . . . . . . . . . . . . . . . . 38 Exchanging a Fan-and-Power-Supply Assembly. . . . . . . . . . . . . 45 Before Exchanging a Fan-and-Power-Supply Assembly . . . . . . . . . 45 iii To Exchange a Fan-and-Power-Supply Assembly . . . . . . . . . . . 46 Exchanging a Fan Assembly . . . . . . . . . . . . . . . . . . . 55 Before Exchanging a Fan Assembly. . . . . . . . . . . . . . . . 55 To Exchange a Fan Assembly. . . . . . . . . . . . . . . . . . 56 Exchanging a SCSI Interface Card Assembly. . . . . . . . . . . . . . 62 Before Exchanging the SCSI Interface Card Assembly. . . . . . . . . . 63 To Exchange a SCSI Interface Card Assembly . . . . . . . . . . . . 64 Exchanging the Switch Card Assembly. . . . . . . . . . . . . . . . 73 Before Exchanging the Switch Card Assembly . . . . . . . . . . . . 73 To Exchange the Switch Card Assembly . . . . . . . . . . . . . . 74 Reporting Problems . . . . . . . . . . . . . . . . . . . . . . 80 Appendix A. Operating with RISC Systems . . . . . . . . . . . . . 81 SCSI Adapters and Cables. . . . . . . . . . . . . . . . . . . . 81 SCSI Service Aids. . . . . . . . . . . . . . . . . . . . . . . 81 Identifying a 2104 and its Disk Drive Modules . . . . . . . . . . . . . 81 Problem Determination . . . . . . . . . . . . . . . . . . . . . 82 Checking the SCSI Error Log . . . . . . . . . . . . . . . . . . . 82 Configuring a 2104 to the using system . . . . . . . . . . . . . . . 82 Unconfiguring a 2104 from the using system . . . . . . . . . . . . . . 82 Web Support Page . . . . . . . . . . . . . . . . . . . . . . 82 Related Publications . . . . . . . . . . . . . . . . . . . . . . 82 Appendix B. Translated Safety Notices . . . . . . . . . . . . . . . 83 Index . . . . . . . . . . . . . . . . . . . . . . . . . . 117 iv 2104 Models DU3 and TU3 Operator’s Guide Communications Statements The following statements apply to this product. The statements for other products intended for use with this product appear in their accompanying manuals. Federal Communications Commission (FCC) Statement This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Neither the provider nor the manufacturer is responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user’s authority to operate the equipment. This device complies with Part 15 of FCC Rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Japanese Voluntary Control Council for Interference (VCCI) Statement This product is a Class A Information Technology Equipment and conforms to the standards set by the Voluntary Control Council for Interference by Information Technology Equipment (VCCI). In a domestic environment, this product might cause radio interference, in which event the user might be required to take adequate measures. Korean Government Ministry of Communication (MOC) Statement Please note that this device has been approved for business purposes with regard to electromagnetic interference. If you find that this device is not suitable for your use, you can exchange it for one that is approved for non-business purposes. New Zealand Compliance Statement This is a Class A product. In a domestic environment this product might cause radio interference, in which event the user might be required to take adequate measures v International Electrotechnical Commission (IEC) Statement This product has been designed and built to comply with (IEC) Standard 60950. Avis de conformité à la réglementation d’Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. Industry Canada Compliance Statement This Class A digital apparatus complies with IECS-003. United Kingdom Telecommunications Requirements This apparatus is manufactured to the International Safety Standard EN60950 and as such is approved in the U.K. under approval number NS/G/1234/J/100003 for indirect connection to public telecommunications systems in the United Kingdom. European Union (EU) Statement This product is in conformity with the protection requirements of EU council directive 89/336/EEC on the approximation of the laws of the Member States relating to electromagnetic compatibility. Neither the provider nor the manufacturer can accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of option cards not supplied by the manufacturer. This product is in conformity with the EU council directive 73/23/EEC on the approximation of the laws of the Member States relating to electrical equipment designed for use within certain voltage limits. This conformity is based on compliance with the following harmonized standard: EN60950. Radio Protection for Germany Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) vom 30. August 1995 (bzw. der EMC EG Richtlinie 89/336): Dieses Gerät ist berechtigt in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Konformitätserklärung nach Paragraph 5 des EMVG ist die: IBM Deutschland Informationssysteme GmbH, 70548 Stuttgart. Informationen in Hinsicht EMVG Paragraph 3 Abs. (2) : Das Gerät erfüllt die Schutzanforderungen nach EN 50082-1 und EN 55022 Klasse A. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: ?Warnung: dies ist eine Einrichtung der Klasse A. Diese Einrichtung kann im vi 2104 Models DU3 and TU3 Operator’s Guide Wohnbereich Funkstörungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Massnahmen durchzuführen und dafür aufzukommen.? EN 50082-1 Hinweis: ?Wird dieses Gerät in einer industriellen Umgebung betrieben (wie in EN 50082-2 festgelegt), dann kann es dabei eventuell gestört werden. In solch einem Fall ist der Abstand bzw. die Abschirmung zu der industriellen Störquelle zu vergrössern.? Anmerkung: Um die Einhaltung des EMVG sicherzustellen sind die Geräte, wie in den Handbüchern angegeben, zu installieren und zu betreiben. Taiwan Class A Compliance Statement Communications Statements vii viii 2104 Models DU3 and TU3 Operator’s Guide Safety Notices For a translation of the danger and caution notices contained in this book, see “Appendix B. Translated Safety Notices” on page 83. Definitions of Safety Notices A danger notice indicates the presence of a hazard that has the potential of causing death or serious injury. A danger notice appears on page 50. A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor personal injury. Caution notices appear on pages 47, 49, 58, 67, 69, and 76. An attention notice indicates an action that could cause damage to a program, device, system, or data. Acoustic Noise Declaration The equivalent continuous A-weighted sound pressure level at the bystander’s position measured for a 2104 Model DU3 or TU3 does not exceed 68 dB(A). This level has been measured under the following conditions: v Rack-mounted unit: – The 2104 Model DU3 had fourteen disk drive modules installed. – The 2104 Model DU3 was installed in a 7202 Rack with the bottom of the 2104 Model DU3 system at EIA position 17. – The remainder of the rack was filled with units to which power was not connected. v Deskside unit: – The 2104 Model TU3 had fourteen disk drive modules installed. These levels were measured using a procedure in accordance with ISO standard DIS7779, Measurement of Airborne Noise Emitted by Computer and Business Equipment. The equipment was installed and operated as described in Appendix C.8 of that standard. ix x 2104 Models DU3 and TU3 Operator’s Guide Electrostatic Discharge (ESD) Attention: When you handle field-replaceable units (FRUs) and other computer parts, take these precautions to avoid static damage: v Limit your movement. Movement can cause static electricity to build up around you. v Always touch computer parts carefully. Hold adapters and memory-modules by their edges. Never touch any exposed circuits. v Prevent people who are not correctly grounded from touching computer parts. v Before you install a new part, touch the static-protective package that contains the part against an unpainted metal part of the 2104 or using system for at least two seconds. This action reduces static electricity in the package and in your body. v Remove the part from its package and, if possible, install it directly into the 2104 without putting the part down. If you need to put the part down, first place the static-protective package that contained the part onto a smooth, level surface, then place the part onto the package. Do not place the part onto any metal surface. xi xii 2104 Models DU3 and TU3 Operator’s Guide About This Book This book is for people who operate a system that has one or more 2104 Model DU3 or TU3 connected to it. Throughout this book, service representative means a person who has been authorized by your organization to maintain your 2104 Model DU3 or TU3. Chapter 1 describes the 2104 Models DU3 and TU3, their controls, and how to use them. Chapter 2 describes how to add disk drive modules to a 2104 Model DU3 or TU3 that is already installed as part of your system. Chapter 3 describes what to do if you have a problem with your 2104 Model DU3 or TU3. It tells you how to replace particular failed components, such as disk drive modules, and what to report if you need to call your service representative for assistance. Appendix A gives additional information about using a 2104 Model DU3 or TU3 that is attached to a RISC system. Appendix B gives a translation of all danger and caution notices contained in this book. An index is provided at the back of this book. If You Need More Information Other books that you might need are: v Expandable Storage Plus : 2104 Model DU3 Installation Guide, 86 A1 15EF v Expandable Storage Plus : 2104 Model TU3 Installation Guide, v Expandable Storage Plus : 2104 Models DU3 and TU3 Service Guide, 86 A1 17EF v The Operator Guide for your system v The User’s Guide for your using-system SCSI attachment (for example, your SCSI adapter) v The Site and Hardware Planning Information for your system v The Problem Solving Guide and Reference for your system. Trademarks The following items are trademarks of International Business Machines Corporation in the United States, or other countries, or both: v AIX v IBM v RS/6000 xiii v TM Numbering Convention In this book: KB means 1 000 bytes. MB means 1 000 000 bytes. GB means 1 000 000 000 bytes. xiv 2104 Models DU3 and TU3 Operator’s Guide Chapter 1. Using the 2104 Model DU3 or TU3 Important: The installation of this product is a customer responsibility. Attention: Before continuing with any of the actions described in this book, please refer to “Safety Notices” on page ix. This section describes the Expandable Storage Plus: 2104 Models DU3 and TU3, their controls, and how to use them. A 2104 Model DU3 or TU3 can be attached to any supported RS/6000™ or IBM server TM computer that provides support for any of the Small Computer System Interface (SCSI) adapters listed in “SCSI Adapters and Cables” on page 81. The 2104 Models DU3 and TU3 look like this: 1 2104 Model DU3 The 2104 Model DU3 is a rack-mounted SCSI disk enclosure that can be installed into a standard Electrical Industries Association (EIA) 19-inch rack. A fully configured 2104 Model DU3 looks like this: Figure 1. 2104 Model TU3 (Left) and 2104 Model DU3 (Right) 2 2104 Models DU3 and TU3 Operator’s Guide Figure 2. 2104 Model DU3 from the Front (Top) and Back (Bottom) Chapter 1. Using the 2104 Model DU3 or TU3 3 At the front of a 2104 Model DU3 there are 14 device slots 1. Each slot must contain either a SCSI disk drive module, or a dummy disk module. Each disk drive module connects to a backplane which is mounted vertically about halfway between the front and back of the 2104 Model DU3. Note: To run I/O at least one SCSI disk drive module must be present in each 2104 Model DU3. The 14 device slots are divided into two groups of seven slots. Between the two groups of seven slots there is a SCSI bridge card2. The SCSI bridge card assigns the disk drive modules to a SCSI bus, in accordance with the setting of the SCSI bus split switch on the switch card 5. On the back of the backplane are connectors for two fan-and-power-supply assemblies3, two SCSI interface cards4, and a switch card5. The 2104 Model DU3 has either two fan-and-power-supply assemblies or one fan-and-power-supply assembly and one fan assembly. Either option provides all the necessary power and cooling for the 2104 Model DU3. It is better, however, to have two fan-and-power-supply assemblies, rather than one fan-and-power-supply assembly and one fan assembly, because then the amount of power required from each power supply is reduced, and, if one power supply fails, power for the 2104 disk enclosure continues to be supplied by the other power supply. The SCSI interface cards4 are used to connect host machines to the 2104 Model DU3. These cards contain logic that provides information about what is happening in the 2104 Model DU3 and the status of components within it. The switch card5 contains switches that control which SCSI bus mode is selected, how the 2104 Model DU3 is supplied with power, and what enclosure services are enabled. It also contains a rotary switch that sets the ID of the 2104 Model DU3. Options for configuring a 2104 Model DU3 are described in the Expandable Storage Plus: 2104 Model DU3 Installation Guide. 4 2104 Models DU3 and TU3 Operator’s Guide 2104 Model TU3 The 2104 Model TU3 is a deskside SCSI disk enclosure. A fully configured 2104 Model TU3 looks like this: Figure 3. 2104 Model TU3 from the Front (Left) and Back (Right) Chapter 1. Using the 2104 Model DU3 or TU3 5 At the front of a 2104 Model TU3 are 14 device slots 1. Each slot must contain either an SCSI disk drive module, or a dummy module. Each disk drive module connects to a backplane that is mounted vertically about halfway between the front and back of the 2104 Model TU3. Note: To run I/O at least one SCSI disk drive module must be present in each 2104 Model TU3. The 14 device slots are split into two groups of seven slots. Between the two groups of slots, there is a SCSI bus bridge card 2. The SCSI bus bridge card assigns the disk drive modules to a SCSI bus, in accordance with the setting of the SCSI bus split switch on the switch card5. On the back of the backplane are connectors for two fan-and-power-supply assemblies3, two SCSI interface cards4, and a switch card5. The 2104 Model TU3 can have either two fan-and-power-supply assemblies or one fan-and-power-supply assembly and one fan assembly. Either option provides all the necessary power and cooling for the 2104 Model TU3. It is better, however, to have two fan-and-power-supply assemblies, rather than one fan-and-power-supply assembly and one fan assembly, because then the amount of power required from each power supply is reduced, and, if one power supply fails, power for the 2104 disk enclosure continues to be supplied by the other power supply. The SCSI interface cards4 are used to connect host machines to the 2104 Model TU3. These cards contain logic that provides information about what is happening in the 2104 Model TU3, and controls the operation of the subsystem. The switch card5 contains switches that control which SCSI bus mode is selected, how the 2104 Model TU3 is supplied with power, and what enclosure services are enabled. It also contains a rotary switch that is used to set the ID of the 2104 Model TU3. Options for configuring a 2104 Model TU3 are described in the Expandable Storage Plus: 2104 Model TU3 Installation Guide. 6 2104 Models DU3 and TU3 Operator’s Guide Controls and Lights This section describes the switches and lights on a 2104 Model DU3 or TU3. The 2104 has no main power switch. However, each fan-and-power-supply assembly has an On/Standby switch. You can see the 2104 lights at the front of the 2104 Model DU3 or TU3. Power light 1 The Power light, which is green, comes on and stays on continuously when power is supplied to the 2104 by one or both of the fan-and-power-supply assemblies within the 2104. Check light 2 The Check light, which is amber, comes on if a failure occurs in the 2104. Note: The 2104 might be able to continue operating correctly even when the failure of a single part has been detected. Figure 4. Disk Enclosure Lights on a 2104 Model TU3 (Left) and a 2104 Model DU3 (Right) Chapter 1. Using the 2104 Model DU3 or TU3 7 Disk Drive Module Lights The lights on the disk drive modules that are installed in a 2104 Model DU3 are visible at the front of the 2104 Model DU3. However, to see the lights on the disk drive modules that are installed in a 2104 Model TU3, open the front cover: 1. If necessary, unlock the cover by using the key provided. 2. Grip the right-hand side of the front cover, and pivot the cover to the left. Figure 5. 2104 Model TU3 with Cover Opened 8 2104 Models DU3 and TU3 Operator’s Guide Each disk drive module has two lights: 1Activity light The Activity light, which is green, shows the following conditions: Off The SCSI link to the disk drive is not active. Flashing This disk drive module is active and a command is in progress. 2Check light The Check light, which is amber, shows the following conditions: Off Normal operating condition Permanently On One of the following conditions: v The service aid has set Remove for this disk drive module (See “Appendix A. Operating with RISC Systems” on page 81). v The disk drive has reported a Predictive Failure Analysis (PFA) error; this error indicates that the disk drive has had an excessive number of internally recovered errors. v The disk drive is faulty; this is a Conner/Intel SCSI-Accessed Fault-Tolerant Enclosures (SAF-TE) function. Flashing (2 seconds on, 2 seconds off) The Check light has been set by a SCSI service aid to identify the position of this disk drive module. Figure 6. Disk Drive Module Lights Chapter 1. Using the 2104 Model DU3 or TU3 9 Flashing (0.25 seconds on, 0.25 seconds off) The disk drive is part of an array that is being rebuilt (this is a SAF-TE function). Note:SAF-TE services are not used with RS/6000 or IBM server pSeries computers. 10 2104 Models DU3 and TU3 Operator’s Guide Fan-and-Power-Supply Assembly and Fan Assembly Lights and Switch Either one or two fan-and-power-supply assemblies 1 are installed in the back of a 2104 Model DU3 or TU3. The power switch and lights are visible at the back of the 2104. Figure 7. Fan-and-Power-Supply Assemblies in a 2104 Model TU3 (Left) and 2104 Model DU3 (Right) Chapter 1. Using the 2104 Model DU3 or TU3 11 A fan-and-power-supply assembly has the following lights and switch: AC PWR light 1 The AC PWR light, which is green, comes on when the mainline power supply is connected to the fan-and-power-supply assembly. DC PWR light 2 The DC PWR light, which is green, comes on when this fan-and-power-supply assembly is supplying power to the 2104. CHK light 3 The CHK light, which is amber, indicates either that there is a failure in the fan-and-power-supply assembly or that the DC On/Standby switch is set to Standby. Note: This light flashes when the failure in the fan-and-power-supply assembly is a fan failure. DC On/Standby switch 4 The DC On/Standby switch connects dc electrical power from the fan-and-power-supply assembly to the disk drives and other components in the 2104. This switch must be set to on for the fan-and-power-supply unit to start. If the DC On/Standby switch on either of the two fan-and-power-supply assemblies is set to on, power in a 2104 Model DU3 or TU3 unit is switched on automatically if all of the following conditions exist: v Input power to the 2104 is present. v The fan-and-power-supply assembly is fully home in its slot. Figure 8. Fan-and-Power-Supply Assembly Lights and Switch 12 2104 Models DU3 and TU3 Operator’s Guide v Either the power control switch on the switch card is set to on, or terminator power is active in an external SCSI connection. A fan assembly has one light: CHK light 1 The CHK light, which is amber, flashes when the fan fails. Figure 9. Fan Assembly Chapter 1. Using the 2104 Model DU3 or TU3 13 SCSI Interface Card Lights Up to two SCSI interface 1cards can be installed in the back of a 2104. Figure 10. SCSI Interface Cards in a 2104 Model TU3 (Left) and 2104 Model DU3 (Right) 14 2104 Models DU3 and TU3 Operator’s Guide An SCSI interface card has the following lights: TERM POWER light 1 The TERM POWER light, which is green, is lit when an active SCSI connection is present. LVD/SE light 2 The LVD/SE light indicates the type of host SCSI cable. It is green, and when lit indicates an active Low Voltage Differential (LVD) SCSI connection. If this light is off and the TERM POWER light is on, there is an active single-ended (SE) SCSI connection. ACTIVE light 3 The ACTIVE light, which is green, is lit when the using system is communicating with the SCSI Enclosure Services (SES) processor or the disk drives. RESET light 4 The RESET light, which is green, comes on when a Power On Reset (POR) or a SCSI Bus Reset occurs. This light is switched off by the SES processor. FAULT light 5 The FAULT light, which is amber, comes on if the SCSI interface card fails. Figure 11. SCSI Interface Card Assembly Lights Chapter 1. Using the 2104 Model DU3 or TU3 15 Switch Card A Switch Card 1 is installed in the back of a 2104. Figure 12. Switch Card in a 2104 Model TU3 (Left) and 2104 Model DU3 (Right) 16 2104 Models DU3 and TU3 Operator’s Guide Refer to Figure 13 to identify the switches on the switch card. The switch card contains the following switches that are accessible to the operator. Note: The ?default logical setting? of a switch is the setting that is used if the switch card assembly is not present when the 2104 is switched on, or when a reset operation occurs. Power Control switch 1 When this switch is set to Off, the 2104 automatically switches off or on when the using system is switched off or on. When this switch is set to On, the 2104 is powered up or down by the DC On/Standby switch on a fan-and-power-supply assembly. The default logical setting for this switch is On. Drive Autostart switch 2 The disk drive motors are controlled by the combination of this switch and the Delay Motor Start Switch 5. Figure 13. Switch Card Chapter 1. Using the 2104 Model DU3 or TU3 17 When this switch is set to On, and the Delay Motor Start Switch 5 is set to Off, the disk drive motors do not start until a START MOTOR command is issued. The timing sequence of disk motor startup is under the control of the using system software. When this switch is set to Off, and the Delay Motor Start Switch 5 is set to On, the disk drives are set to Delay Motor Start mode. The disk motor startup delay time is different for each disk drive and is usually computed by multiplying 12 seconds by its SCSI ID. For example, the disk drive motor of a disk drive having a SCSI ID equal to two will start 24 (2 x 12) seconds after power is applied to the 2104. When this switch is set to Off, and the Delay Motor Start Switch 5 is set to Off, the disk drives are set to Normal Start mode. The disk drive motors will start when power is applied to the 2104. The effect of both this switch and the Delay Motor Start Switch 5 being set to On is undefined. The default logical setting for this switch is On. Enable Enclosure Services switch 3 When this switch is set to On, the enclosure services can operate. When the switch is to Off, no response occurs to any request to use the enclosure services. The default logical setting for this switch is On. Select Enclosure Services switch 4 This switch selects which enclosure services can be used — SAF-TE or SES. When set to Off, it selects SAF-TE. When set to On, it selects SES. The default logical setting for this switch is On.Note:SAF-TE services are not used with RS/6000 or IBM server pSeries computers. Delay Motor Start switch 5 Refer to the explanation of the Drive Autostart Switch 2. The default logical setting for this switch is Off. Box ID switch 6 This 10-position rotary switch indicates the ID of the 2104. The operator selects the required setting which can be used by the SES Inquiry command and the SAF-TE buffer 1 command. The default logical setting for this switch is 0.Note:SAF-TE services are not used with RS/6000 or IBM server pSeries computers. 18 2104 Models DU3 and TU3 Operator’s Guide The switch card has other switches that are accessible only when the switch card is removed from the 2104. They should only be changed when the 2104 is being converted from a Model DU3 to a Model TU3, or from a Model TU3 to a Model DU3. SCSI Address switch 7 When this switch is set to Off, which is its default setting in a 2104 Model DU3, the SCSI addresses of the disk drives are 0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, from left to right. When the switch is set to On, these addresses are reversed. Note: This setting, On, is not supported in a 2104 Model TU3 or 2104 Model DU3. Orientation switch 8 This switch should be set to Off in a 2104 Model DU3, and to On in a 2104 Model TU3. Its setting determines the position of the Power and Check lights on the 2104. SCSI Bus Split switch 9 This switch controls the SCSI bus configuration. When the switch is set to Off, the 2104 is configured as a single SCSI bus. When the switch is set to On, the 2104 is configured as a dual (or split) SCSI bus. In a 2104 Model DU3, with this switch set to On and the SCSI address switch set to Off, the SCSI addresses of the disk drives are 0, 1, 2, 3, 4, 5, 6, from left to center, on one SCSI bus, and 8, 9, 10, 11, 12, 13, 14, from center to right, on the other SCSI bus. The default logical setting for this switch is Off. Reserved 10 This switch should be set to Off. Chapter 1. Using the 2104 Model DU3 or TU3 19 Identifying 2104 Disk Enclosures The Box ID switch on the switch card is a 10-position rotary switch that indicates the ID of the 2104. Its setting is used by the SES Inquiry command or the SAF-TE buffer 1 command. Labels are supplied with the 2104 to enable the operator to identify each 2104. It is recommended that the label that represents the ID set by the Box ID switch is attached to the front of the 2104. 20 2104 Models DU3 and TU3 Operator’s Guide Identifying Disk Drive Modules You can identify a disk drive module by the serial number that appears on a label 1 on the front of the module: This label also shows the size of the disk drive. Each disk drive module also has a SCSI address that is related to its position in the 2104. The addresses used are 0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13 and 14. These addresses are usually assigned, in sequence, from left to right in the 2104 Model DU3 and from top to bottom in the 2104 Model TU3. Figure 14. Disk Drive Module Label Chapter 1. Using the 2104 Model DU3 or TU3 21 Security On the 2104 Model TU3, a lock 1 on the front cover provides physical security for the disk drives. You do not need to remove this cover during normal operations. If you lock the cover, it cannot be opened for servicing without the use of the key. You do not need to lock the cover for it to stay closed. The 2104 Model DU3 has no cover or lock. Figure 15. 2104 Model TU3 Lock 22 2104 Models DU3 and TU3 Operator’s Guide Ordering Cover Lock Keys For protection against unauthorized key duplication, the cover lock is a high-security lock. Keys (see Figure 16) for these locks are a factory-restricted series and duplicate keys are not available through normal commercial channels. The serial number of the lock is stamped on each key. Make a note of this number. The additional key supplied, and your note of its number, should be stored in separate secure areas. If you lose the key, or it becomes faulty, a replacement key can be purchased from the Illinois Lock Company. Their address is: Illinois Lock Company, 301, W. Hintz Road, Wheeling, Illinois 60090. Phone:(800) 299-5880 (inside USA) 001-847-537-1800 (from outside USA) FAX:001-847-537-1881 e-mail:illock@aol.com website:www.illinoislock.com Figure 16. Cover Lock Keys Chapter 1. Using the 2104 Model DU3 or TU3 23 24 2104 Models DU3 and TU3 Operator’s Guide Chapter 2. Adding Disk Drive Modules This chapter describes how to add disk drive modules to a 2104 that is already installed as part of your system. You can do this only if: 1. You are authorized by your organization. 2. You have the correct disk drive module. Refer to Figure 17. This disk drive module has, at the front, a handle 1 that can be pulled upward to open. The disk drive module should also be the correct size. The size of a disk drive module is shown on a label 2 at the front of the disk drive module. The label is shown in more detail in Figure 14 on page 21. Attention: v Disk drive modules are fragile. Handle them with care. Keep them well away from strong magnetic fields. v Any slot that has no disk drive module installed must contain a dummy disk drive module. The dummy module ensures that the correct airflow is maintained around the disk drive modules in the other slots. If a slot remains empty, overheating might occur. You do not need to remove power from the 2104 when adding a disk drive module. Figure 17. Disk Drive Module Handle and Label 25 After you have added a disk drive module to a 2104, you must add it into your system software configuration by using the system programs. Refer to “Appendix A. Operating with RISC Systems” on page 81. 26 2104 Models DU3 and TU3 Operator’s Guide Before Adding a Disk Drive Module 1. Identify the slot for the new disk drive module by using the configuration planning information for your system. A dummy disk drive module should be in the slot. Refer to Figure 18. Use the SCSI Device Identification and Removal service aid to set the disk drive slot to Insert. If you then remove the dummy disk drive module from the identified slot, an amber light on the backplane should be visible. This verifies that you have selected the correct slot into which to add the new disk drive module. This light goes off when a disk drive module is added. If it does not, it is visible as the amber Check light on the disk drive module that you have added. Figure 18. Dummy Disk Drive Module Chapter 2. Adding Disk Drive Modules 27 2. If you are adding a disk drive module into a 2104 Model TU3, open the front cover of the unit: a. If necessary, unlock the cover using the key provided. b. Grip the right-hand side of the cover and pivot the cover to the left. Figure 19. Disk Drive Modules in a 2104 Model TU3 28 2104 Models DU3 and TU3 Operator’s Guide To Add a Disk Drive Module The diagrams in the following instructions show a 2104 Model DU3. The steps for adding a disk drive module into a 2104 Model TU3 are the same, but the parts are turned through 90 degrees. 1. Pull the dummy module out of the 2104. Attention: Do not leave the slot empty for more than 30 minutes. If you do, the 2104 and its disk drive modules might overheat and be damaged. Figure 20. Removing a Dummy Module Chapter 2. Adding Disk Drive Modules 29 2. With one hand supporting the base of the disk drive module and the other hand holding the handle 1, insert the module and push it into the slot. When the handle touches the front surface of the enclosure, the module stops. However, the module is not yet fully home. Figure 21. Adding a Disk Drive Module 30 2104 Models DU3 and TU3 Operator’s Guide 3. While continuing to push the disk drive module into the slot, slowly close the handle 1 until it stops with a click. This action pushes the module fully home. Attention: The fans might decrease in speed. 4. Verify that the disk drive module that you have just installed is aligned with the sides of the 2104, and that there is no gap between this module and the modules that are next to it. Verify also that the front edge of this disk drive module aligns with the front edges of the modules next to it. If the disk drive module is not correctly aligned, remove it (see step 2 on page 39) and reinstall it (steps 2 through 3 above). Figure 22. Closing the Handle of a Disk Drive Module Chapter 2. Adding Disk Drive Modules 31 5. If the amber check light 2 comes on, first verify that the new disk drive module has been correctly installed, and then run diagnostics in Problem Determination mode to check the error log. See “Appendix A. Operating with RISC Systems” on page 81. Note: The green activity light 1 does not come on until disk activity begins. Figure 23. Disk Drive Module Lights 32 2104 Models DU3 and TU3 Operator’s Guide 6. If you have just added a disk drive module to a 2104 Model TU3, close the cover and, if required, lock it with the key provided. 7. Add the new disk drive module into your system software configuration, using the system programs. Refer to “Appendix A. Operating with RISC Systems” on page 81. Figure 24. Closing the Cover of a 2104 Model TU3 Chapter 2. Adding Disk Drive Modules 33 34 2104 Models DU3 and TU3 Operator’s Guide Chapter 3. Exchanging FRUs This chapter describes what to do if you need to exchange a Field Replaceable Unit (FRU) of your 2104 Model DU3 or TU3. A disk drive module, a fan-and-power-supply assembly, a fan assembly, a SCSI interface card, a switch card, and a SCSI bus bridge card are all FRUs in the 2104 Model DU3 or TU3. If you need to exchange the SCSI bus bridge card, please refer to Expandable Storage Plus: 2104 Models DU3 and TU3 Service Guide 35 Exchanging Disk Drive Modules Sometimes, you can exchange a faulty disk drive module without the assistance of a service representative. You can do this only if: v You are authorized by your organization. v You have the correct disk drive module. Refer to Figure 25. This disk drive module has, at the front, a handle 1 that can be pulled upward to open. The disk drive module should also be the correct size. The size of a disk drive module is shown on a label 2 at the front of the disk drive module. The label is shown in more detail in Figure 14 on page 21. Attention: v Disk drive modules are fragile. Handle them with care. Keep them well away from strong magnetic fields. v Any slot that has no disk drive module installed must contain a dummy disk drive module. The dummy module ensures that the correct airflow is maintained around the disk drive modules in the other slots. If a slot remains empty, overheating might occur. You do not need to remove power from the 2104 when exchanging a disk drive module. After you have exchanged a disk drive module in a 2104, you must restore it into your system by using the system programs. Figure 25. Disk Drive Module Handle and Capacity 36 2104 Models DU3 and TU3 Operator’s Guide Before Exchanging a Disk Drive Module 1. Use the system error log to identify the faulty disk drive module. “Appendix A. Operating with RISC Systems” on page 81 describes how to do this if you have a RISC-based system. 2. Find the failing disk drive module by using the location code supplied in the error log. 3. Use the SCSI Device Identification and Removal service aid to set Remove on the failing disk drive module. (“Appendix A. Operating with RISC Systems” on page 81 describes how to do this if you have a RISC-based system.) 4. Check the lights on this disk drive module. (“Disk Drive Module Lights” on page 8 describes where these are, and how to see them.) If the Check light is on, exchange the disk drive module for a new one using the procedure described in the following section. Chapter 3. Exchanging FRUs 37 To Exchange a Disk Drive Module 1. If the failing disk drive module is in a 2104 Model TU3, open the front cover of the 2104: a. If necessary, unlock the cover, using the key provided. b. Grip the cover at the right and pivot it to the left. Figure 26. Disk Drive Modules in a 2104 Model TU3 38 2104 Models DU3 and TU3 Operator’s Guide The diagrams in the following instructions show a 2104 Model DU3. The steps for exchanging a disk drive module in a 2104 Model TU3 are the same, but the parts are turned through 90 degrees. 2. Pull upward on the trigger 1 and pull outward the handle 2 of the failing disk drive module. This action moves the module partly out of the slot. Wait for about 20 seconds to allow the disk drive motor time to spin down. Figure 27. Opening the Disk Drive Module Handle Chapter 3. Exchanging FRUs 39 3. Carefully pull the module out of the 2104. Support the module with your other hand as you take it from the 2104. Attention:The fan speed might increase. Do not leave the slot empty for more than 30 minutes. If you do, the 2104, and its disk drive modules, might overheat and be damaged. Figure 28. Removing a Disk Drive Module 40 2104 Models DU3 and TU3 Operator’s Guide 4. With one hand supporting the base of the replacement module and the other hand holding the handle 1, insert the module and push it into the slot. When the handle touches the front surface of the enclosure, the module stops. Note that the disk drive module is not yet fully home. Figure 29. Inserting a Disk Drive Module Chapter 3. Exchanging FRUs 41 5. While continuing to push the disk drive module into the slot, slowly close the handle 1 until it stops with a click. This action pushes the module fully home. Attention:The fans might decrease their speed. 6. Verify that the disk drive module that you have just installed is aligned with the sides of the 2104 and there is no gap between this module and the modules that are next to it. Verify also that the front edge of this disk drive module aligns with the front edges of the modules next to it. If the disk drive module is not correctly aligned, remove it (see step 2 on page 39) and reinstall it (steps 4 through 6 above). Figure 30. Closing the Handle 42 2104 Models DU3 and TU3 Operator’s Guide 7. If the amber check light 2 comes on, first verify that the new disk drive module has been correctly installed, then run diagnostics in Problem Determination mode to check the error log. See “Appendix A. Operating with RISC Systems” on page 81. Note: The green activity light 1 comes on for about one second after the disk drive module has been installed. The light then goes out and comes on again only when disk I/O activity begins. Figure 31. Disk Drive Module Lights Chapter 3. Exchanging FRUs 43 8. If you have exchanged a disk drive module in a 2104 Model TU3, close the cover and, if required, lock it with the key provided. 9. Restore the disk drive module into your system, using the system programs. Refer to “Appendix A. Operating with RISC Systems” on page 81. Figure 32. Closing the Cover of a 2104 Model TU3 44 2104 Models DU3 and TU3 Operator’s Guide Exchanging a Fan-and-Power-Supply Assembly You can exchange a faulty fan-and-power-supply assembly if: v You have a correct spare fan-and-power-supply assembly or fan assembly. v Another fan-and-power-supply assembly is still present in the 2104. Note: If your 2104 contains just one fan-and-power-supply assembly (and one fan assembly), refer to Expandable Storage Plus: 2104 Models DU3 and TU3 Service Guide for more guidance about replacing the fan-and-power-supply assembly. You do not need to remove power from the 2104 when exchanging a fan-and-power-supply assembly. Before Exchanging a Fan-and-Power-Supply Assembly v Check the lights on this fan-and-power-supply assembly (“Fan-and-Power-Supply Assembly and Fan Assembly Lights and Switch” on page 11 describes these lights and how to see them.). Ensure that the DC On/Standby switch is set to On. If the CHK light is on, exchange the fan-and-power-supply assembly by using the procedure in the following section. Figure 33. A Fan-and-Power-Supply Assembly Chapter 3. Exchanging FRUs 45 To Exchange a Fan-and-Power-Supply Assembly Note: The illustrations in the following instructions show a 2104 Model DU3. The steps for exchanging a fan-and-power-supply assembly in a 2104 Model TU3 are the same, but the parts are turned through 90 degrees. 1. Set the DC On/Standby switch 1 on the fan-and-power-supply assembly to Standby. Note: The DC PWR light might stay on for a short time. Wait until it is off before you go to the next step. Figure 34. Fan-and-Power-Supply DC On/Standby Switch 46 2104 Models DU3 and TU3 Operator’s Guide 2. Unplug the mainline power cable from the failing fan-and-power-supply assembly. CAUTION: Ensure that the mainline power cable has been removed from the failing fan-and-power-supply before continuing. Figure 35. Unplugging the Mainline Power Cable Chapter 3. Exchanging FRUs 47 3. Unscrew the two thumbscrews on the fan-and-power-supply assembly. Figure 36. Unscrewing the Thumbscrews on the Fan-and-Power-Supply Assembly 48 2104 Models DU3 and TU3 Operator’s Guide 4. Pull the fan-and-power-supply assembly from the 2104. The speed of the other fan in the 2104 might increase. Attention: Do not leave the space empty for more than 30 minutes; if you do, the 2104, and the disk drive modules, might overheat and be damaged. CAUTION: Do not insert hands or tools into the empty space where the fan-and-power-supply assembly was. Figure 37. Removing the Fan-and-Power-Supply Assembly Chapter 3. Exchanging FRUs 49 5. Push the replacement fan-and-power-supply assembly fully into the 2104. DANGER Do not plug a power cable into the fan-and-power-supply assembly until the assembly is fully home and its thumbscrews are fully tightened. Figure 38. Inserting a Replacement Fan-and-Power-Supply Assembly 50 2104 Models DU3 and TU3 Operator’s Guide 6. Tighten the two thumbscrews by turning them clockwise. Figure 39. Tightening the Screws on a Fan-and-Power-Supply Assembly Chapter 3. Exchanging FRUs 51 7. Plug the mainline power cable into the new fan-and-power-supply assembly. Verify that the green AC PWR light 1 comes on immediately. Figure 40. Plugging in the Mainline Power Cable 52 2104 Models DU3 and TU3 Operator’s Guide 8. Set the DC On/Standby switch 1 on the new fan-and-power-supply assembly to On. Note: The fan speeds might decrease. Figure 41. Fan-and-Power-Supply DC On/Standby Switch Chapter 3. Exchanging FRUs 53 9. Verify that the green DC PWR light 1 comes on within five seconds. If it does not, or the amber CHK light 2 comes on, verify that the new fan-and-power-supply assembly is correctly installed. If it is, report the problem to your service representative. Figure 42. Fan-and-Power-Supply Lights 54 2104 Models DU3 and TU3 Operator’s Guide Exchanging a Fan Assembly You can exchange a faulty fan assembly if: v You have a correct spare fan assembly. v Another fan-and-power-supply assembly is present in the 2104. You do not need to remove power from the 2104 when exchanging a fan assembly. Before Exchanging a Fan Assembly v Look at the Check light on this fan assembly (see “Fan-and-Power-Supply Assembly and Fan Assembly Lights and Switch” on page 11). If the Check light is flashing, exchange the fan assembly by using the procedure in the following section. Figure 43. Fan Assembly Chapter 3. Exchanging FRUs 55 To Exchange a Fan Assembly The diagrams in the following instructions show a 2104 Model DU3. The steps for exchanging a fan assembly in a 2104 Model TU3 are the same, but the parts are turned through 90 degrees. 1. Verify that the amber CHK light 1 is flashing. Figure 44. Fan Assembly Light 56 2104 Models DU3 and TU3 Operator’s Guide 2. Unscrew the two thumbscrews. Figure 45. Unscrewing the Thumbscrews on a Fan Assembly Chapter 3. Exchanging FRUs 57 3. Pull the fan assembly from the 2104. Note: The speed of the other fan in the 2104 might increase. Attention: Do not leave the space empty for more than 30 minutes; if you do, the 2104, and its disk drive modules, might overheat and be damaged. CAUTION: Do not insert hands or tools into the empty fan assembly opening. Figure 46. Pulling out a Fan Assembly 58 2104 Models DU3 and TU3 Operator’s Guide 4. Push the replacement fan assembly fully into the 2104. Note: The speed of the other fan in the 2104 might decrease. Figure 47. Inserting a Fan Assembly Chapter 3. Exchanging FRUs 59 5. Tighten the two thumbscrews. Figure 48. Tightening the Thumbscrews 60 2104 Models DU3 and TU3 Operator’s Guide 6. Verify that the amber CHK light 1 is off. Figure 49. Fan Assembly Light Chapter 3. Exchanging FRUs 61 Exchanging a SCSI Interface Card Assembly Attention: The SCSI interface card is sensitive to electrostatic discharge (ESD). Use the tools and procedures defined by your organization to protect such parts. You can exchange a faulty SCSI interface card assembly if: v You are authorized by your organization. v You have a correct spare SCSI interface card assembly. You do not need to remove power from the 2104 when exchanging an SCSI interface card assembly. Figure 50. SCSI Interface Card Assembly 62 2104 Models DU3 and TU3 Operator’s Guide Before Exchanging the SCSI Interface Card Assembly Check the lights on the SCSI interface card. If the FAULT light is on, exchange the SCSI interface card assembly using the procedure in the following section. Figure 51. FAULT Light on an SCSI Interface Card Assembly Chapter 3. Exchanging FRUs 63 To Exchange a SCSI Interface Card Assembly The diagrams in the following instructions show a 2104 Model DU3. The steps for exchanging a SCSI interface card assembly in a 2104 Model TU3 are the same, but the parts are turned through 90 degrees. 1. Unscrew and unplug the SCSI cable. Figure 52. Removing the SCSI Cable 64 2104 Models DU3 and TU3 Operator’s Guide 2. Unscrew the thumbscrew 1 on the failing SCSI interface card. Figure 53. Unscrewing the SCSI Interface Card Chapter 3. Exchanging FRUs 65 3. Open the lever fully by moving it to the right. This action unplugs the card assembly from the backplane. Figure 54. Opening the Lever 66 2104 Models DU3 and TU3 Operator’s Guide 4. Pull the SCSI interface card assembly out of the 2104. CAUTION: Do not insert hands or tools into the empty space above the fan-and-power-supply assembly. Figure 55. Pulling the SCSI Interface Card Out Chapter 3. Exchanging FRUs 67 5. Push the replacement SCSI interface card assembly into the 2104. Figure 56. Pushing the Replacement SCSI Interface Card In 68 2104 Models DU3 and TU3 Operator’s Guide 6. Move the lever to the left. CAUTION: As you push the assembly fully home, the lever automatically moves towards its closed position. Ensure that your fingers do not become pinched between the lever and the assembly. Figure 57. Closing the Lever Chapter 3. Exchanging FRUs 69 7. Tighten the thumbscrew. Figure 58. Tightening the Thumbscrew 70 2104 Models DU3 and TU3 Operator’s Guide 8. Plug the SCSI connector into the SCSI interface card, and tighten the retaining screws. Figure 59. Plugging in the SCSI Cable Chapter 3. Exchanging FRUs 71 9. Check that the green TERM POWER light 1 comes on within five seconds. It indicates that the TERM POWER from a host bus adapter is activated. If it does not, check that the new SCSI interface card has been correctly installed. Figure 60. SCSI Interface Card Assembly TERM POWER Light 72 2104 Models DU3 and TU3 Operator’s Guide Exchanging the Switch Card Assembly Attention: The switch card is sensitive to electrostatic discharge (ESD). Use the tools and procedures defined by your organization to protect such parts. You can exchange a faulty switch card assembly if: v You are authorized by your organization. v You have a correct spare switch card assembly. You do not need to remove power from the 2104 when exchanging the switch card. Before Exchanging the Switch Card Assembly Notes: 1. The settings of the switches on the replacement switch card must be the same as those on the original card. See “Switch Card” on page 16 for details of the switch settings. 2. If the switch card is removed while the 2104 is on, the 2104 might be powered off. Figure 61. Switch Card Chapter 3. Exchanging FRUs 73 To Exchange the Switch Card Assembly The diagrams in the following instructions show a 2104 Model DU3. The steps for exchanging the switch card assembly in a 2104 Model TU3 are the same, but the parts are turned through 90 degrees. 1. Unscrew the thumbscrew1 on the failing switch card assembly. Figure 62. Unscrewing the Thumbscrew on the Switch Card Assembly 74 2104 Models DU3 and TU3 Operator’s Guide 2. Pull the lever up. This action unplugs the switch card assembly from the backplane. Note: If the 2104 is switched on, it might switch off when you unplug the switch card assembly. Figure 63. Opening the Lever Chapter 3. Exchanging FRUs 75 3. Pull the switch card assembly out of the 2104. CAUTION: Do not insert hands or tools into the empty space between the power supply assemblies. Figure 64. Remove the Switch Card 76 2104 Models DU3 and TU3 Operator’s Guide 4. Push the replacement switch card assembly into the disk enclosure. Figure 65. Pushing the Switch Card Assembly into the 2104 Chapter 3. Exchanging FRUs 77 5. Push the lever down. Figure 66. Closing the Lever 78 2104 Models DU3 and TU3 Operator’s Guide 6. Tighten the thumbscrew. Figure 67. Tightening the Thumbscrew Chapter 3. Exchanging FRUs 79 Reporting Problems When you report a problem that has occurred in a 2104, it is important that you report the following information, in addition to the error information (the SRN) given to you by your operating system: The machine type number 2104 The machine model number DU3 or TU3 The machine serial number xx-xxxxx The serial number can be read from the label at the bottom right of the front of the 2104, or from the label on the back of the 2104. 80 2104 Models DU3 and TU3 Operator’s Guide Appendix A. Operating with RISC Systems This appendix contains information that is specific to Expandable Storage Plus disk enclosures that are attached to RISC systems. SCSI Adapters and Cables An RS/6000 or IBM server pSeries computer uses one of the following SCSI adapters to connect to the 2104 Model DU3 or 2104 Model TU3: v PCI 3-Channel Ultra2 SCSI RAID Adapter (Feature Code 2494 Type Number 4-T) v PCI Dual Channel Ultra2 SCSI Adapter (Feature Code 6205 Type Number 4-R) v PCI Single-Ended Ultra SCSI Adapter (Feature Code 6206 Type Number 4-K) v PCI Single-Ended Plus SCSI RAID Adapter (Feature Code 6208 Type Number 4_A) SCSI cables supported by these adapters are: Adapter (Feature Code) Cable Length Feature Code Part Number 6205 or 2494 20 meters 9320 09L3307 10 meters 9310 09L3305 5 meters 9305 09L3303 3 meters 9303 09L3301 1 meter 9301 09L3299 6206 3 meters 9313 09L3309 2415 3 meters 9323 09L3311 SCSI Service Aids Service aids are available on the using system to help you service the 2104. Service aids described in the Expandable Storage Plus: 2104 Models DU3 and TU3 Service Guide are: v Format Media v Certify Media v SCSI Device Identification and Removal v Download Microcode Identifying a 2104 and its Disk Drive Modules You can identify a 2104 and the disk drive modules installed in it either by the location code contained in system messages that refer to that unit, or by using the SCSI Device Identification and Removal service aid. 81 See “Location Codes” in the operator guide for your system for general information about location codes. Problem Determination For some problems, your system diagnostics tell you to check the lights on the system to determine which part, if any, has a fault. “Controls and Lights” on page 7 of this book shows the position of the lights. The diagnostic programs tell you what to report to your service representative. (“Reporting Problems” on page 80 describes other information you should report.) Checking the SCSI Error Log You can check the error log by running diagnostics in Problem Determination mode. They check the error log and present messages about any problems that have been logged. Your system manuals describe how to run the diagnostic programs. Configuring a 2104 to the using system Use the cfgmgr command to configure or reconfigure a 2104 to the using system. Unconfiguring a 2104 from the using system To remove a 2104 from the using system, use the following command: rmdev -l [enclosurenumber] -d where [enclosurenumber] is the enclosure device that was generated by the cfgmgr command (for example, [ses0], [ses1], [ses2]). Remove the ses_healthcheck job from the system cron table. Web Support Page You can find the web support page at: http://www.rs6000.ibm.com/support Related Publications v Diagnostic Information for Micro Channel Bus Systems, SA23-2765 v Diagnostic Information for Multiple Bus Systems, SA38-0509 v Site and Hardware Planning Information,SA38-0508 v Adapters, Devices and Cable Information for Micro Channel Bus Systems, SA23-2764 v Adapters, Devices and Cable Information for Multiple Bus Systems, SA38-0516 82 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices This appendix contains the danger and caution notices that are used in the various books relating to the Expandable Storage Plus: 2104 Models DU3 and TU3. The notices are shown in English and in various other languages. Danger notice A danger notice calls attention to a situation that is potentially lethal or extremely hazardous to people. Caution notice A caution notice calls attention to a situation that is potentially hazardous to people because of some existing condition. Always use safe working procedures whenever you work on machines. Use your own judgment to identify safety conditions that these notices do not describe. Danger Notices DANGER In the following step you are going to remove the power cables. These cables are live if the rack power distribution unit or uninterruptable power supply (UPS) unit is still switched on. DANGER Do not try to open the covers of the fan-and-power-supply assembly. DANGER Do not plug a power cable into the fan-and-power-supply assembly until the assembly is fully home and its thumbscrews are fully tightened. DANGER An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to that system. It is the customers’s responsibility to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. During an electrical storm, do not disconnect cables for display stations, printers, telephones, or station protectors for communication lines. 83 Caution Notices CAUTION: This unit may have two linecords. To remove all power, disconnect both linecords. CAUTION: This unit weighs 38.5 kg. CAUTION: Do not remove cover, do not service, no serviceable parts. CAUTION: Double pole/neutral fusing CAUTION: A ?Standby? condition is indicated by the symbol to the right of ?DC? directly above the switch, SW1. When SW1 is toggled to the right position directly under the ?Standby? symbol, the unit’s AC-power is not shut off. CAUTION: The stabilizer must be correctly attached to the bottom front of the rack to prevent the rack from tipping forward while the 2104 Model DU3 is being removed from the rack. Do not pull out or install any unit if a stabilizer is not attached to the rack. CAUTION: The stabilizer must be correctly attached to the bottom front of the rack to prevent the rack from tipping forward while the 2104 Model DU3 is being installed into the rack. Do not remove or install any unit if a stabilizer is not attached to the rack. CAUTION: A 2104 Model DU3 weighs up to 38.5 kg (85 lb) with the maximum number of disk drive modules installed. Do not attempt to lift the 2104 into the rack unless all the disk drive modules have been removed. CAUTION: A 2104 Model DU3 weighs up to 38.5 kg (85 lb) with the maximum number of disk drive modules installed. Do not attempt to remove the 2104 from the rack unless all the disk drive modules have been removed. CAUTION: Do not attempt to lift the 2104 by yourself. Ask another person for aid. CAUTION: Do not insert hands or tools into the empty space that contained the fan assembly. 84 2104 Models DU3 and TU3 Operator’s Guide CAUTION: Do not insert hands or tools into the empty space that contained the fan-and-power-supply assembly. CAUTION: Do not insert hands or tools into the empty space that contained the switch card assembly. CAUTION: Do not insert hands or tools into the empty space that contained the SCSI interface card assembly. CAUTION: This product is equipped with a 3-wire power cable and plug for the user’s safety. Use this power cable in conjunction with a correctly grounded electrical outlet to avoid an electrical shock. CAUTION: Do not touch the power outlet or the power outlet face plate with anything other than test probes before you have completed this safety check. CAUTION: If the reading is not infinity, do not proceed. Make the necessary corrections to the wiring before you continue. Do not switch on the branch circuit CB until all the above steps are satisfactorily completed. CAUTION: A 2104 Model TU3 can weigh up to 54.5 kg (120 lb) with the maximum number of disk drive modules installed. Do not attempt to lift one without help from a second person. CAUTION: Do not use the handles of the fan or fan-and-power-supply assemblies to carry the 2104. These handles are not intended to support the weight of the unit. CAUTION: As you push the assembly fully home, the lever automatically moves toward its closed position. Ensure that your fingers do not become pinched between the lever and the assembly. Appendix B. Translated Safety Notices 85 86 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 87 88 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 89 90 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 91 92 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 93 94 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 95 96 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 97 98 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 99 100 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 101 102 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 103 104 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 105 106 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 107 108 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 109 110 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 111 112 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 113 114 2104 Models DU3 and TU3 Operator’s Guide Appendix B. Translated Safety Notices 115 116 2104 Models DU3 and TU3 Operator’s Guide Index A AC PWR light, fan-and-power- supply assembly 12 Active light SCSI interface card 15 Active light, SCSI interface card 15 activity light, disk drive module 9 B Box ID switch interface card 18 C check light disk drive module 9 disk enclosure 7 fan assembly 13 power supply assembly 12 check light, power supply assembly 12 configurations deskside 5 rack-mounted 2 controls disk drive module 8 Subsystem 7 cover locks 22 D DC On/Standby switch fan-and-power-supply assembly 12 DC PWR light, fan-and-power- supply assembly 12 Delay Motor Start switch switch card 18 Delay Motor Start switch, interface card 18 disk drive module adding 25 exchanging 36 identifying 21 lights 8 Drive Autostart switch switch card 17 E Enable Enclosure Services switch switch card 18 Enable Enclosure Services switch, interface card 18 F fan-and-power- supply assembly lights 11 Power light 12 fan assembly exchanging 55 lights 11 Fault light SCSI interface card 15 Fault light, SCSI interface card 15 I identifying disk drive modules 21 interface card lights 14 K keys, reordering 23 L lights 2104 power 7 disk drive module 8 enclosure check 7 fan-and-power- supply assembly 11 fan assembly 11 interface card 14 Subsystem 7 switch card 16 locks deskside unit covers 22 LVD/SE light SCSI interface card 15 LVD/SE light, SCSI interface card 15 M model number, machine 80 N numbers, machine 80 O Orientation switch switch card 19 P Power Control switch switch card 17 power light 2104 7 Power light fan-and-power-supply assembly 12 Power–Good light fan-and-power- supply assembly 12 power supply assembly 12 power-supply assembly exchanging 45 power switch fan-and-power- supply assembly 12 problems dealing with 35 117 problems (continued) reporting 80 R Reset light SCSI interface card 15 Reset light, SCSI interface card 15 Rotary switch interface card 18 S SCSI Address switch switch card 19 SCSI Bus Split switch switch card 19 SCSI Cable type SCSI interface card 15 SCSI Interface Card exchanging 62 security 22 Select Enclosure Services switch switch card 18 Select Enclosure Services switch, interface card 18 serial number, machine 80 Subsystem configuration 2, 5 controls 7 lights 7 power control 7 turning power on and off 7 switch card lights 16 Switch Card exchanging 73 system identifying 80 T TERM POWER light SCSI interface card 15 TERM POWER light, SCSI interface card 15 turning power on and off 7 type number, machine 80 118 2104 Models DU3 and TU3 Operator’s Guide Vos remarques sur ce document / Technical publication remark form Titre / Title : Bull 2104 Model DU3 Expandable Storage Plus Operator’s Guide Nº Reférence / Reference Nº : 86 A1 16EF 01 Daté / Dated : December 2000 ERREURS DETECTEES / ERRORS IN PUBLICATION AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION Vos remarques et suggestions seront examinées attentivement. Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète. Your comments will be promptly investigated by qualified technical personnel and action will be taken as required. If you require a written reply, please furnish your complete mailing address below. NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : Remettez cet imprimé à un responsable BULL ou envoyez-le directement à : Please give this technical publication remark form to your BULL representative or mail to: BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Technical Publications Ordering Form Bon de Commande de Documents Techniques To order additional publications, please fill up a copy of this form and send it via mail to: Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à : BULL CEDOC ATTN / MME DUMOULIN 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Managers / Gestionnaires : Mrs. / Mme : C. DUMOULIN +33 (0) 2 41 73 76 65 Mr. / M : L. CHERUBIN +33 (0) 2 41 73 63 96 FAX : +33 (0) 2 41 73 60 19 E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr Or visit our web sites at: / Ou visitez nos sites web à: http://www.logistics.bull.net/cedoc http://www–frec.bull.com http://www.bull.com CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] [ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : PHONE / TELEPHONE : FAX : E–MAIL : For Bull Subsidiaries / Pour les Filiales Bull : Identification: For Bull Affiliated Customers / Pour les Clients Affiliés Bull : Customer Code / Code Client : For Bull Internal Customers / Pour les Clients Internes Bull : Budgetary Section / Section Budgétaire : For Others / Pour les Autres : Please ask your Bull representative. / Merci de demander à votre contact Bull. BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 16EF 01 ORDER REFERENCE PLACE BAR CODE IN LOWER LEFT CORNER Utiliser les marques de découpe pour obtenir les étiquettes. Use the cut marks to get the labels. 86 A1 16EF 01 2104 Model DU3 Expandable Storage Plus Operator’s Guide 86 A1 16EF 01 2104 Model DU3 Expandable Storage Plus Operator’s Guide 86 A1 16EF 01 Disk drives for the E2-700, E2-700T, E2- 705, E2-705T, E4-700, E4-700T or E4- 705 REFERENCE 86 A1 02FH 01 ESCALA Power7 Disk drives for the E2-700, E2-700T, E2-705, E2-705T, E4-700, E4-700T or E4-705 The ESCALA Power7 publications concern the following models: - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M6-705 (Power 770 / 9117-MMC) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala M7-705 (Power 780 / 9179-MHC) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E1-705 (Power 710 / 8231-E1C) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E3-705 (Power 730 / 8231-E2C) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) - Bull Escala E4-705 (Power 740 / 8205-E6C) References to Power 755 / 8236-E8C models are irrelevant. Hardware October 2011 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A1 02FH 01 The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull SAS 2011 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or f or incidental or consequential damages in connection with the use of this material. Contents Safety notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Disk drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 What's new in Disk drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Solid-state drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Installing a disk drive or solid-state drive in an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the AIX system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 10 Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 10 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 11 Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 12 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the Linux system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . 13 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 15 Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 16 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 17 Verifying that the new disk drive is operational . . . . . . . . . . . . . . . . . . . . . . 18 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the Virtual I/O Server system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . 20 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 22 Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 22 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 23 Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 24 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the IBM i system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 27 Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 28 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 29 Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 30 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 Installing a disk drive or solid-state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the system or logical partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 Installing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 33 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 35 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 Removing a disk drive or solid-state drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Removing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the AIX system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 38 Preparing the system for removing a disk drive by using the HSM. . . . . . . . . . . . . . . . 39 © Copyright IBM Corp. 2010, 2011 iii Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 40 Removing the disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C using Linux kernel 2.6 with the Linux system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . 41 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 43 Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 46 Removing a disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C or an expansion unit controlled by IBMi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 48 Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 49 Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 51 Removing a disk drive using hot-spare in an IBM i system or logical partition from an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 Preparing to remove a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 52 Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 54 Removing a SCSI disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . 55 Preparing to remove the SCSI disk drive by using the HSM . . . . . . . . . . . . . . . . . . 55 Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 56 Removing a disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the system or logical partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58 Removing the disk drive from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . 60 Replacing a disk drive or solid state drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the AIX system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 65 Completing the replacing the disk drive by using the HSM . . . . . . . . . . . . . . . . . . 66 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C by using Linux kernel 2.6, with the power to the Linux system or logical partition turned on . . . . . . . . . . . . . . . . . . . . 67 Preparing the system to replace a disk drive . . . . . . . . . . . . . . . . . . . . . . . 68 Preparing to replace a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 69 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 69 Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 70 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the IBM i system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 Preparing to replace the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 72 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 73 Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 74 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C using hot-spare in an IBM i system or logical partition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 Preparing to replace the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 75 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 75 Completing the procedure by using the HSM . . . . . . . . . . . . . . . . . . . . . . . 76 Replacing a SCSI disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . . 77 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 77 Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 78 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C with the system or logical partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Replacing a disk drive in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . . . 80 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 iv Disk drives Installing the external SAS port in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 Removing the external SAS port from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Replacing the external SAS port in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 Internal disk drive sharing on an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system 93 Removing or installing a disk drive filler . . . . . . . . . . . . . . . . . . . . . 95 Removing a disk drive filler from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. . . . . . . . . . . . 95 Installing a disk drive filler in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . . 96 Removing or installing a disk drive bezel. . . . . . . . . . . . . . . . . . . . . 99 Removing a disk drive bezel from a disk drive or filler. . . . . . . . . . . . . . . . . . . . . 99 Installing a disk drive bezel in a disk drive or filler . . . . . . . . . . . . . . . . . . . . . 100 Disk drive locations and service indicators . . . . . . . . . . . . . . . . . . . 103 PCIe RAID and SSD SAS adapter locations . . . . . . . . . . . . . . . . . . . . . . . . 103 Disk drive locations and service indicators for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . 103 Disk drive locations and service indicators for the 5886 disk-drive enclosure . . . . . . . . . . . . . 105 Disk-drive locations and service indicators for the 5887 disk-drive enclosure . . . . . . . . . . . . . 107 Disk drive locations and service indicators for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures 108 Disk-drive locations and service indicators for the 5802 expansion unit . . . . . . . . . . . . . . . 110 Related information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 Preparing a disk drive for use in an AIX system or logical partition . . . . . . . . . . . . . . . . 113 Preparing to remove a disk drive from a system or expansion unit controlled by AIX . . . . . . . . . . 113 Rebuilding data on a replacement disk drive using AIX . . . . . . . . . . . . . . . . . . . . 113 Removing a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . . 113 Preparing to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 113 Removing the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116 Replacing the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121 Rebuilding data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . . 122 Rebuild data on a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . 122 Preparing to remove a disk-drive from a system or logical partition controlled by IBM i . . . . . . . . . 124 Determining the IBM i disk-drive protection status. . . . . . . . . . . . . . . . . . . . . . 125 Configuring a disk drive in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 126 Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system. . . . . . 128 Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 129 Replacing a SAS conduit card in a 5802 expansion unit with the power turned off . . . . . . . . . . . 130 Removing a SAS conduit card from a 5802 expansion unit with the power turned off . . . . . . . . . . 137 Replacing a SAS expander card in a 5802 expansion unit with the power turned off. . . . . . . . . . . 146 Removing a SAS expander card from a 5802 expansion unit with power turned off . . . . . . . . . . . 146 Common procedures for installable features . . . . . . . . . . . . . . . . . 149 Before you begin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151 Identifying a part. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155 Control panel LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155 Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . . 156 Locating a failing part in an AIX system or logical partition. . . . . . . . . . . . . . . . . . 156 Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 157 Contents v Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 157 Identifying a failing part in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 157 Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . . 157 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 158 Identifying a failing part in a Linux system or logical partition. . . . . . . . . . . . . . . . . . 159 Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 159 Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . . 159 Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 159 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 159 Locating a failing part in a Virtual I/O Server system or logical partition . . . . . . . . . . . . . . 160 Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . 160 Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . 161 Starting a system that is not managed by a Hardware Management Console or or a Systems Director Management Console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161 Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . . 162 Starting a system or virtual server with the Systems Director Management Console. . . . . . . . . . . 163 Stopping a system or logical partition. . . . . . . . . . . . . . . . . . . . . . 165 Stopping a system that is not managed by an HMC or an SDMC . . . . . . . . . . . . . . . . . 165 Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . . 166 Stopping a system with the Systems Director Management Console . . . . . . . . . . . . . . . . 167 Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 169 Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 169 Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 169 System covers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171 Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . 171 Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 171 Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . . . 172 Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 172 Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system. . . . . 173 Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 174 Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . 175 Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 177 Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 177 Removing and replacing covers for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 179 Removing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . 179 Installing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . 180 Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . . . . . . . . . 181 Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . 181 Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 182 Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . 183 Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 184 Removing and replacing covers for the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC. . . . . . . . . 185 Removing the front cover from the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . 185 Removing the front cover with the cover removal tool. . . . . . . . . . . . . . . . . . . . 186 Installing the front cover on the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . . 191 Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . . 193 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . . 194 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197 vi Disk drives Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199 Service and operating positions . . . . . . . . . . . . . . . . . . . . . . . . 201 Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position . . . . . 201 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . 201 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . 202 Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position . . . . . . . . 204 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . 204 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . 205 Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 207 Placing the 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . 207 Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . 209 Power cords. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211 Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . . . 211 Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . 212 Disconnecting the power cords from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 213 Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . . . . 214 Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . 217 Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . . 217 Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 219 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 220 Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 220 Verifying an installed part by using stand-alone diagnostics. . . . . . . . . . . . . . . . . . . 221 Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . . 222 Activating and deactivating LEDs by using the HMC . . . . . . . . . . . . . . . . . . . . 223 Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 223 Activating or deactivating an identify LED . . . . . . . . . . . . . . . . . . . . . . 223 Viewing serviceable events by using the HMC . . . . . . . . . . . . . . . . . . . . . . 224 Verifying the installed part by using Systems Director Management Console . . . . . . . . . . . . . 224 Activating and deactivating LEDs by using the Systems Director Management Console . . . . . . . . 225 Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 225 Activating or deactivating an identify LED by using the SDMC . . . . . . . . . . . . . . . 225 Viewing serviceable events by using the Systems Director Management Console . . . . . . . . . . . 226 Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools 226 Verifying a repair. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231 Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232 Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . . 235 Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237 Verifying the repair from the management console . . . . . . . . . . . . . . . . . . . . . . 237 Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239 Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . . 243 Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . . 247 Activating and deactivating LEDs. . . . . . . . . . . . . . . . . . . . . . . . 253 Deactivating a system attention LED or partition LED using the management console . . . . . . . . . . 253 Activating or deactivating an identify LED using the management console. . . . . . . . . . . . . . 254 Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface 254 Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . . 255 Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257 Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258 Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258 Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258 Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262 Contents vii Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 265 viii Disk drives Safety notices Safety notices may be printed throughout this guide: v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to people. v CAUTION notices call attention to a situation that is potentially hazardous to people because of some existing condition. v Attention notices call attention to the possibility of damage to a program, device, system, or data. World Trade safety information Several countries require the safety information contained in product publications to be presented in their national languages. If this requirement applies to your country, a safety information booklet is included in the publications package shipped with the product. The booklet contains the safety information in your national language with references to the U.S. English source. Before using a U.S. English publication to install, operate, or service this product, you must first become familiar with the related safety information in the booklet. You should also refer to the booklet any time you do not clearly understand any safety information in the U.S. English publications. German safety information Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der Bildschirmarbeitsverordnung geeignet. Laser safety information IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs. Laser compliance IBM servers may be installed inside or outside of an IT equipment rack. © Copyright IBM Corp. 2010, 2011 ix DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER x Disk drives Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. Serial-Attached SCSI RAID Enablement in E2-700, E2-700T, E2-705, E2-705T, E4- 700, E4-700T or E4-705 REFERENCE 86 A1 90FF 03 ESCALA Power7 Serial-Attached SCSI RAID Enablement in E2-700, E2- 700T, E2-705, E2-705T, E4-700, E4-700T or E4-705 The ESCALA Power7 publications concern the following models: - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M6-705 (Power 770 / 9117-MMC) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala M7-705 (Power 780 / 9179-MHC) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E1-705 (Power 710 / 8231-E1C) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E3-705 (Power 730 / 8231-E2C) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) - Bull Escala E4-705 (Power 740 / 8205-E6C) References to Power 755 / 8236-E8C models are irrelevant. Hardware October 2011 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A1 90FF 03 The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull SAS 2011 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or f or incidental or consequential damages in connection with the use of this material. Contents Safety notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii SAS RAID enablement and cache battery pack for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 What's new in SAS RAID enablement . . . . . . . . . . . . . . . . . . . . . . . 3 Installing the Storage Backplane - 6 SFF Bays/SATA DVD (FC 5618). . . . . . . . . . 5 Installing the 3x3 split-drive bays no RAID 5/6 support (FC 5631 and FC EJ02) . . . . . 9 Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 Removing and replacing SAS RAID adapters and batteries . . . . . . . . . . . . . 23 Removing and replacing the disk drive backplane . . . . . . . . . . . . . . . . . . . . . . 23 Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane . . . . . . . . . . . 24 Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane . . . . . . . . . . . 29 Removing and replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane. . . . . . . . . 33 Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane . . . . . . . . . . . . 33 Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane. . . . . . . . . . . . . 44 8202-E4B or 8205-E6B SAS subsystem service considerations . . . . . . . . . . . . . . . . . . . 59 Service considerations for the cache battery pack. . . . . . . . . . . . . . . . . . . . . . 60 Removing the 8202-E4B or 8205-E6B RAID enablement card . . . . . . . . . . . . . . . . . . . 61 Replacing the 8202-E4B or 8205-E6B RAID enablement card . . . . . . . . . . . . . . . . . . . 64 Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller . . . . . . . . . 66 Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller . . . . . . . . . 69 Removing and replacing the cache battery card . . . . . . . . . . . . . . . . . . . . . . . 70 Removing and replacing a cache battery pack . . . . . . . . . . . . . . . . . . . . . . . . 75 Common procedures for installable features . . . . . . . . . . . . . . . . . . . 79 Before you begin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Identifying a part . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Control panel LEDs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . 83 Locating a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . 83 Activating the indicator light for the failing part . . . . . . . . . . . . . . . . . . . . . 83 Deactivating the failing-part indicator light. . . . . . . . . . . . . . . . . . . . . . . 83 Identifying a failing part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 84 Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 84 Deactivating the failing-part indicator light. . . . . . . . . . . . . . . . . . . . . . . 84 Identifying a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 85 Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 85 Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . 85 Activating the indicator light for the failing part . . . . . . . . . . . . . . . . . . . . . 86 Deactivating the failing-part indicator light. . . . . . . . . . . . . . . . . . . . . . . 86 Locating a failing part in a Virtual I/O Server system or logical partition. . . . . . . . . . . . . . 86 Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . 86 Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Starting a system that is not managed by a Hardware Management Console or or a Systems Director Management Console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . 88 Starting a system or virtual server with the Systems Director Management Console . . . . . . . . . . 89 © Copyright IBM Corp. 2010, 2011 iii Stopping a system or logical partition . . . . . . . . . . . . . . . . . . . . . . . . . . 89 Stopping a system that is not managed by an HMC or an SDMC . . . . . . . . . . . . . . . . 89 Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . 90 Stopping a system with the Systems Director Management Console . . . . . . . . . . . . . . . 91 Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . 91 Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 92 Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . . . . 92 Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . 93 Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . 94 Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . 95 Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 96 Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . 98 Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 98 Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position . . . . . 100 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . 100 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . 101 Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . . . 103 Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . 105 Installing a feature by using the Hardware Management Console . . . . . . . . . . . . . . . . . 105 Removing a part by using the Hardware Management Console . . . . . . . . . . . . . . . . . 106 Replacing a part by using the Hardware Management Console. . . . . . . . . . . . . . . . . . 106 Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . 107 Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . 109 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . 110 Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . 110 Verifying an installed part by using stand-alone diagnostics . . . . . . . . . . . . . . . . . . 111 Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . 112 Activating and deactivating LEDs by using the HMC . . . . . . . . . . . . . . . . . . . 113 Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . 113 Activating or deactivating an identify LED . . . . . . . . . . . . . . . . . . . . . 113 Viewing serviceable events by using the HMC . . . . . . . . . . . . . . . . . . . . . 114 Verifying the installed part by using Systems Director Management Console . . . . . . . . . . . . 114 Activating and deactivating LEDs by using the Systems Director Management Console . . . . . . . 115 Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . 115 Activating or deactivating an identify LED by using the SDMC . . . . . . . . . . . . . . 115 Viewing serviceable events by using the Systems Director Management Console . . . . . . . . . . 116 Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116 Verifying a repair . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119 Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120 Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . 123 Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125 Verifying the repair from the management console . . . . . . . . . . . . . . . . . . . . . 125 Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126 Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . 131 Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . 135 Activating and deactivating LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . 139 Deactivating a system attention LED or partition LED using the management console . . . . . . . . . 139 Activating or deactivating an identify LED using the management console. . . . . . . . . . . . . 140 Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141 Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . 141 Installing a part by using the Systems Director Management Console . . . . . . . . . . . . . . . 142 Replacing a part by using the Systems Director Management Console . . . . . . . . . . . . . . . 142 Removing a part by using the Systems Director Management Console . . . . . . . . . . . . . . . 143 Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145 Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146 Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146 Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146 iv Power Systems: Serial-attached SCSI RAID enablement Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150 Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153 Contents v vi Power Systems: Serial-attached SCSI RAID enablement Safety notices Safety notices may be printed throughout this guide: v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to people. v CAUTION notices call attention to a situation that is potentially hazardous to people because of some existing condition. v Attention notices call attention to the possibility of damage to a program, device, system, or data. World Trade safety information Several countries require the safety information contained in product publications to be presented in their national languages. If this requirement applies to your country, a safety information booklet is included in the publications package shipped with the product. The booklet contains the safety information in your national language with references to the U.S. English source. Before using a U.S. English publication to install, operate, or service this product, you must first become familiar with the related safety information in the booklet. You should also refer to the booklet any time you do not clearly understand any safety information in the U.S. English publications. German safety information Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der Bildschirmarbeitsverordnung geeignet. Laser safety information IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs. Laser compliance IBM servers may be installed inside or outside of an IT equipment rack. © Copyright IBM Corp. 2010, 2011 vii DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER viii Power Systems: Serial-attached SCSI RAID enablement Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Safety notices ix CAUTION: Removing components from the upper positions in the rack cabinet improves rack stability during relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a room or building: v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you received it. If this configuration is not known, you must observe the following precautions: – Remove all devices in the 32U position and above. – Ensure that the heaviest devices are installed in the bottom of the rack cabinet. – Ensure that there are no empty U-levels between devices installed in the rack cabinet below the 32U level. v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from the suite. v Inspect the route that you plan to take to eliminate potential hazards. v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the documentation that comes with your rack cabinet for the weight of a loaded rack cabinet. v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.). v Ensure that all devices, shelves, drawers, doors, and cables are secure. v Ensure that the four leveling pads are raised to their highest position. v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement. v Do not use a ramp inclined at more than 10 degrees. v When the rack cabinet is in the new location, complete the following steps: – Lower the four leveling pads. – Install stabilizer brackets on the rack cabinet. – If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest position to the highest position. v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent. Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the pallet. (R002) (L001) (L002) x Power Systems: Serial-attached SCSI RAID enablement (L003) or All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive, DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information: v Do not remove the covers. Removing the covers of the laser product could result in exposure to hazardous laser radiation. There are no serviceable parts inside the device. v Use of the controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation exposure. (C026) Safety notices xi CAUTION: Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. (C027) CAUTION: This product contains a Class 1M laser. Do not view directly with optical instruments. (C028) CAUTION: Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following information: laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. (C030) CAUTION: The battery contains lithium. To avoid possible explosion, do not burn or charge the battery. Do Not: v ___ Throw or immerse into water v ___ Heat to more than 100°C (212°F) v ___ Repair or disassemble Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local regulations. In the United States, IBM has a process for the collection of this battery. For information, call 1-800-426-4333. Have the IBM part number for the battery unit available when you call. (C003) Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: The equipment is suitable for installation in the following: v Network telecommunications facilities v Locations where the NEC (National Electrical Code) applies The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect these interfaces metallically to OSP wiring. Note: All Ethernet cables must be shielded and grounded at both ends. The ac-powered system does not require the use of an external surge protection device (SPD). The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal shall not be connected to the chassis or frame ground. xii Power Systems: Serial-attached SCSI RAID enablement SAS RAID enablement and cache battery pack for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C You might need to configure a redundant array of independent disks (RAID) on your serial-attached SCSI (SAS) disk drives in your system enclosure. Use these instructions to install features to enable RAID on your system. The following instructions will enable you to install SAS RAID enablement cards to prepare your system, to use SAS disk drives in a RAID configuration. To find information on installing adapters or configuring RAID on your system, see the following topics: v To view the PCI adapters, see PCI adapters v To view the SAS RAID controllers for AIX® operating system, see SAS RAID controllers for AIX v To view the SAS RAID controllers for IBM i operating system, see SAS RAID controllers for IBM i and the IBM i Information Center web site at http://www.ibm.com/systems/i/infocenter . v To view the SAS RAID controllers for Linux operating system, see SAS RAID controllers for Linux © Copyright IBM Corp. 2010, 2011 1 2 Power Systems: Serial-attached SCSI RAID enablement What's new in SAS RAID enablement Read about new or significantly changed information in SAS RAID enablement since the previous update of this topic collection. October 2011 v Added information for 8202-E4C and 8205-E6C servers. © Copyright IBM Corp. 2010, 2011 3 4 Power Systems: Serial-attached SCSI RAID enablement Installing the Storage Backplane - 6 SFF Bays/SATA DVD (FC 5618) Use this procedure to install the parts included with feature Storage Backplane - 6 SFF Bays/SATA DVD. This procedure is intended for installing new features. Attention: If you are servicing a failing part, see the following service procedure: v For the RAID storage controller, see “Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller” on page 66 and “Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller” on page 69. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following websites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Fix Central. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. Attention: To maintain system availability and prevent possible data loss, it is important to understand that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility of the system. Review the SAS subsystem topic to understand the various SAS configuration and service considerations. Before servicing or installing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas: v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the server. For instructions, see “Installing a feature by using the Hardware Management Console” on page 105. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for installing a feature in the server. For instructions, see “Replacing a part by using the Systems Director Management Console” on page 142. If you do not have an HMC or an SDMC, complete the following procedure. 1. Complete the prerequisite tasks described in “Before you begin” on page 79. 2. Stop the system, as described in “Stopping a system or logical partition” on page 89. 3. Remove the front cover. v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 92. v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 92. © Copyright IBM Corp. 2010, 2011 5 4. For a rack-mounted system, place the system into the service position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page 100. 5. Remove the service access cover. v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 95. v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 96. 6. Disconnect the power source from the system by unplugging the system. Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 7. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 8. Install the disk drive backplane. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane. 9. Install the disk drives. 6 Power Systems: Serial-attached SCSI RAID enablement v For internal disk drive, see Replacing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. v For solid-state drive, see Installing a disk drive or solid-state drive. 10. Install the SATA DVD drive. For instructions, see Installing a Slimline media device in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. 11. Replace the service access cover. v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 12. For a rack-mounted system, put the system into the operating position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 13. Replace the front cover. v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. 14. Reconnect the power source. 15. Start the system as described in “Starting the system or logical partition” on page 87. 16. Verify the installed part: v If you replaced the part because of a service action, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Installing the Storage Backplane - 6 SFF Bays/SATA DVD (FC 5618) 7 8 Power Systems: Serial-attached SCSI RAID enablement Installing the 3x3 split-drive bays no RAID 5/6 support (FC 5631 and FC EJ02) Use this procedure to install the parts included with feature 3x3 split-drive bays No RAID 5/6 support. Note: The FC 5631 is for the 8202-E4B or 8205-E6B, and the FC EJ02 is for the 8202-E4C or 8205-E6C. Attention: This procedure is intended for installing a split-drive bay capability. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following websites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Fix Central. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. Attention: To maintain system availability and prevent possible data loss, it is important to understand that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility of the system. Review the SAS subsystem topic to understand the various SAS configuration and service considerations. Before servicing or installing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas: v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the server. For instructions, see “Installing a feature by using the Hardware Management Console” on page 105 If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for installing a feature in the server. For instructions, see “Replacing a part by using the Systems Director Management Console” on page 142. If you do not have an HMC or an SDMC, complete the following procedure. 1. Complete the prerequisite tasks described in “Before you begin” on page 79. 2. Stop the system, as described in “Stopping a system or logical partition” on page 89. 3. For a rack-mounted system, place the system into the service position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page 100. 4. Remove the service access cover. v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 95. v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 96. © Copyright IBM Corp. 2010, 2011 9 5. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in “Identifying a part” on page 81. 6. Disconnect the power source from the system by unplugging the system. Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 7. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 8. Locate the RAID-10 storage controller slots. 10 Power Systems: Serial-attached SCSI RAID enablement 9. Align the RAID-10 storage controller (A) with the slot in the system. Figure 1. Slot locations of RAID-10 storage controller (P1-C19) for 8202-E4B or 8205-E6B Figure 2. Slot locations of RAID-10 storage controller (P1-C19) for 8202-E4C or 8205-E6C Installing the 3x3 split-drive bays no RAID 5/6 support (FC 5631 and FC EJ02) 11 10. Slide the RAID-10 storage controller into the system until the tab (B) locks into place as shown in the following figure. 11. Install the disk drive backplane. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane. 12. Install the disk drives. v For internal disk drive, see Replacing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. v For solid-state drive, see Installing a disk drive or solid-state drive. 13. Install the SATA DVD drive. For instructions, see Installing a Slimline media device in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. 14. Install the external SAS port. For instructions, see Installing the external SAS port on an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. 15. Replace the service access cover. v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 16. For a rack-mounted system, put the system into the operating position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 17. Replace the front cover. v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. Figure 3. Rack-mounted model RAID-10 storage controller replacement 12 Power Systems: Serial-attached SCSI RAID enablement 18. Reconnect the power source. 19. Start the system as described in “Starting the system or logical partition” on page 87. 20. Verify the installed part or repair. v If you replaced the part because of a service action, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Installing the 3x3 split-drive bays no RAID 5/6 support (FC 5631 and FC EJ02) 13 14 Power Systems: Serial-attached SCSI RAID enablement Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) Use this procedure to install the parts included with feature Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA. This procedure is intended for installing new features. Note: The FC 5630 is for the 8202-E4B or 8205-E6B, and the FC EJ01 is for the 8202-E4C or 8205-E6C. Attention: If you are servicing a failing part, see the following service procedures: v For the enablement card, see Removing enablement card and Replacing enablement card. v For the RAID/cache storage controller, see Removing storage controller and Replacing storage controller. Attention: To maintain system availability and prevent possible data loss, it is important to understand that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility of the system. Review the SAS subsystem topic to understand the various SAS configuration and service considerations. Before servicing or installing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas: v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM Prerequisite. If the required software is not installed, see the following web sites to download it, and then install it before continuing: v To download firmware and software updates and fixes, see Fix Central. v To download Hardware Management Console (HMC) updates and fixes, see Hardware Management Console Support and downloads. If your system is managed by the HMC, use the HMC to complete the steps for installing a feature in the server. For instructions, see “Installing a feature by using the Hardware Management Console” on page 105. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for installing a feature in the server. For instructions, see “Replacing a part by using the Systems Director Management Console” on page 142. If you do not have an HMC or an SDMC, complete the following procedure. 1. Complete the prerequisite tasks described in “Before you begin” on page 79. 2. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in “Identifying a part” on page 81. 3. Stop the system, as described in “Stopping a system or logical partition” on page 89. 4. For a rack-mounted system, place the system into the service position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page 100. © Copyright IBM Corp. 2010, 2011 15 5. Disconnect the power source from the system by unplugging the system. Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 6. Remove the service access cover. v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 95. v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 96. 7. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 8. For 8202-E4B or 8205-E6B system, locate the RAID enablement card slot (P1–C13), RAID/cache storage controller slot (P1-C19), and cache battery card (P1-C14), and go to step 10 on page 17. 16 Power Systems: Serial-attached SCSI RAID enablement 9. For 8202-E4C or 8205-E6C system, locate the RAID/cache storage controller slot (P1-C19) and cache battery card (P1-C14), and go to step 11 on page 18. 10. Install RAID enablement card for 8202-E4B or 8205-E6B, as follows: Note: The 8202-E4C or 8205-E6C does not have the enablement card. a. Align the RAID enablement card (A) with the slot (B) in the system. Figure 4. Slot location for RAID enablement card, RAID/cache storage controller, cache battery card Figure 5. Slot location for RAID/cache storage controller, cache battery card Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) 17 b. Slide the RAID enablement card into the system until the tab locks into place as shown in the following figure. 11. Install RAID/cache storage controller as follows: a. Align the RAID/cache storage controller (A) with the slot in the system. b. Slide the RAID/cache storage controller into the system until the tab (B) locks into place as shown in the following figure. Figure 6. Rack-mounted model RAID enablement card replacement 18 Power Systems: Serial-attached SCSI RAID enablement 12. Install the cache battery card as follows: a. Slide the cache battery card (A) into the mounting guides on the controller until it is seated in the battery connector. b. After the battery is seated in the connector, move the lever to the latched position to fully seat the battery into the connector. Figure 7. Rack-mounted model RAID/cache storage controller replacement Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) 19 13. Remove the disk drives. v For internal disk drives, see Removing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. v For solid-state drives, see Removing a disk drive or solid-state drive. 14. Remove the disk drives backplane [CCIN 2BD5]. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane. 15. Install the high-function disk drive backplane [CCIN 2BD6]. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane. 16. Replace the disk drives in to the high-function disk drive backplane. v For internal disk drives, see Replacing the internal disk drive in an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. v For solid-state drives, see Installing a disk drive or solid-state drive. 17. Replace the service access cover. See v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 18. For a rack-mounted system, put the system into the operating position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 19. Replace the front cover. v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. 20. Reconnect the power source. Figure 8. Rack-mounted model RAID cache battery card replacement 20 Power Systems: Serial-attached SCSI RAID enablement 21. Start the system as described in “Starting the system or logical partition” on page 87. 22. Verify the installed part: v If you replaced the part because of a service action, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Installing the Storage Backplane - 8 SFF Bays/175MB RAID/Dual IOA (FC 5630 and FC EJ01) 21 22 Power Systems: Serial-attached SCSI RAID enablement Removing and replacing SAS RAID adapters and batteries You might need to remove or replace the SAS RAID adapters and batteries. This section includes procedures for these tasks. Note: Replacing this feature is a customer task. You can perform this task yourself, or contact an authorized service provider to perform the task for you. You might be charged a fee by the authorized service provider for this service. Removing and replacing the disk drive backplane Use these procedures to remove, replace, or install the disk drive backplane in the IBM Power 720 Express (8202-E4B and 8202-E4C) and IBM Power 740 Express (8205-E6B and 8205-E6C) servers. The following figures show the disk drive backplanes of a rack-mounted system and a stand-alone system. Figure 9. Disk drive backplane of a rack-mounted system © Copyright IBM Corp. 2010, 2011 23 Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane Use this procedure to remove a disk drive backplane from the server. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for removing the disk drive backplane from the server. For information about using the HMC to remove the disk drive backplane, see Removing a part using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for removing the disk drive backplane from the server. For instructions, see Removing a part using the Systems Director Management Console. If you do not have an HMC or SDMC, complete the following steps to remove the disk drive backplane: Attention: Ensure that you remove the control panel before you remove the disk drive backplane to prevent damage to the control panel cables. 1. Identify the system that you will be working on by using the identify the system process to turn on the system locate (blue) indicator. For more information, see Control panel LEDs. 2. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in “Identifying a part” on page 81. 3. Complete the prerequisite tasks as described in “Before you begin” on page 79. 4. Stop the system, as described in “Stopping a system or logical partition” on page 89. 5. Remove the front cover: v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 92. Figure 10. Disk drive backplane of a stand-alone system 24 Power Systems: Serial-attached SCSI RAID enablement v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 92. 6. For a rack-mounted system, place the system into the service position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page 100. 7. Disconnect the power source from the system by unplugging the system, as described in “Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 103. Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 8. Remove the service access cover: v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 95. v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 96. 9. Attach the wrist strap. Removing and replacing SAS RAID adapters and batteries 25 Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 10. If you need to replace the backplane because of an upgrade or system failure, complete the following steps to remove the disk drives, depending on your backplane. Note: If you remove a backplane that is populated with disk drives, the backplane will be heavy. Ensure that you can safely complete the procedure. a. To remove the disk drive from the backplane, perform the following steps: Note: Record the location of each disk drive you remove so that the disk drive can be replaced in the same location again. 1) Unlock the disk drive handle (A) and pull it toward you to unseat the disk drive. 2) Place one hand under the disk drive and carefully pull the drive out of the system as shown in Figure 11. 11. Remove the media device by pressing in on the latch (A) and sliding the media device out of the system, as shown in the following figure. For information about removing the different types of media devices, see Removing and replacing media devices in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. Figure 11. Removing the disk drive from the backplane 26 Power Systems: Serial-attached SCSI RAID enablement 12. Disconnect the cables from the back of the disk drive backplane. If you have connected an external serial-attached SCSI (SAS) cable, ensure that you disconnect the SAS cable from the disk drive backplane. For more information, see Installing the external SAS port on model 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. 13. Unlock the disk drive backplane by pulling the handles (A) open from the center of the backplane as shown in Figure 13 on page 28 and Figure 14 on page 28. Figure 12. Removing the media device from the disk drive backplane Removing and replacing SAS RAID adapters and batteries 27 14. Pull the backplane out of the server. Figure 13. Removing the backplane that is populated with disk drives Figure 14. Removing the backplane that is not populated disk drives 28 Power Systems: Serial-attached SCSI RAID enablement If you removed the backplane as a part of another procedure, continue with that procedure. To replace the backplane or install a new backplane, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane. Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane Use this procedure to replace a disk drive backplane. If you are replacing an existing disk drive backplane as part of a service procedure and want to remove the disk drive backplane, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C disk drive backplane. If your system is managed by the Hardware Management Console (HMC), use the HMC to complete the steps for replacing the disk drive backplane in the server. For instructions, see Exchanging a part using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to complete the steps for replacing the disk drive backplane in the server. For instructions, see Replacing a part using the Systems Director Management Console. If you do not have an HMC or SDMC, complete the following steps to replace the disk drive backplane: 1. Check for a protective cover that might be present on the connector pins, in the lower-middle part at the rear of the backplane. If a protective cover is present, remove the cover. 2. Check for the locking tabs (A) shown in the following figures. If the tabs are in locked position, open the tabs. 3. Carefully lift the backplane along two sides and align it with the slot in the server. 4. Insert the backplane firmly into the server. 5. Secure the backplane with the locking tabs (A), as shown in the following figures. Removing and replacing SAS RAID adapters and batteries 29 6. Replace the disk drives, if you removed them earlier. Use the information that you recorded when you removed the drives to install them in the correct locations. To replace the disk drives, perform the following steps: Figure 15. Installing the backplane that is not populated with disk drives in a rack-mounted model Figure 16. Installing the backplane that is populated with disk drives in a rack-mounted model 30 Power Systems: Serial-attached SCSI RAID enablement a. Hold the disk drive by the top and bottom edges as you position the drive and insert it into the expansion unit. Do not hold the disk drive by the handle. b. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle in until it locks, as shown in the following figure. 7. Replace the Slimline media device by inserting it in the slot and pushing it in until the lever (B) locks in place, as shown in the following figure. For information about replacing the different types of media devices, see Removing and replacing media devices in the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. Figure 17. Installing the backplane disk drives Removing and replacing SAS RAID adapters and batteries 31 8. If required, connect the cable to the backplane. For more information, see Installing the external SAS port on an 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. 9. If required, install the RAID enablement card. For more information, see SAS RAID enablement and cache battery pack for the model 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. 10. Replace the service access cover: v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 11. For a rack-mounted system, place the system into the operating position as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 12. Connect the power cords to the system, as described in Connecting the power cords. 13. Replace the front cover: v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. 14. Start the system or logical partition. See “Starting the system or logical partition” on page 87. 15. Verify the installed part: v If you replaced the part because of a failure, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Figure 18. Installing the media device 32 Power Systems: Serial-attached SCSI RAID enablement Removing and replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane Learn how to remove or replace the system backplane. The following procedures describe the removal and replacement of the system backplane. Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane Learn how to remove the system backplane from a server to replace a failing part or as part of another service procedure. Notes: v Before you begin this procedure, ensure that you have authority to log in as an authorized service provider. The different authority levels are described in ASMI authority levels. v Identify the system that you will be working on by using the identify the system process to turn on the system locate (blue) indicator. For more information, see Control panel LEDs. If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the system backplane. For instructions, see Exchanging a part using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the system backplane. For instructions, see Exchanging a part using the Systems Director Management Console. If you do not have an HMC or an SDMC, complete the following steps to remove the system backplane: 1. Access the Advanced System Management Interface (ASMI). The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, refer to Accessing the Advanced System Management Interface. After you have accessed the ASMI, continue with the next step. 2. Check and record the System Enclosure Type and Feature Code. The enclosure serial number is a seven-digit number printed on a bar-coded label on the system unit. Note: The feature code number for 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane is 78AA-001. 3. Check and record the server Host Ethernet Adapter (HEA) device settings. v For HMC: a. Expand the Systems Management folder in the navigation tree pane. b. Double click the Servers folder. c. Select a server in the central panel. d. On the Tasks menu, click Hardware Information > Adapters > Host Ethernet. The Host Ethernet Adapters window opens. e. Select the required physical port and click Configure. The HEA Physical Port Configuration window opens. f. Record the value in the Pending Port Group Multi-Core Scaling value field. v For SDMC: a. Go to the Resources tab and click Hosts. Removing and replacing SAS RAID adapters and batteries 33 b. Select a server in the right pane. c. On the Actions menu, click Hardware Information > Adapters > Host Ethernet. The Host Ethernet Adapters window opens. d. Select the required physical port and click Configure. The HEA Physical Port Configuration window opens. e. Record the value in the Pending Port Group Multi-Core Scaling value field. 4. Check and record the server settings. Prior to replacing this field-replaceable unit (FRU), check and record all server settings since you might need to update the system after you replace this FRU. This can be done through the Advanced System Management Interface (ASMI). a. Check and record the server firmware level indicated on the right edge of the status frame, the area where the Log out button is located. You might need to update the system to the latest server firmware code level after you replace this FRU. b. Check and record the service processor settings you previously set using the ASMI. You will need to reset these service processor settings after you update this FRU or the default settings will be used. Settings to record include the following: v System Name setting v System Power Control settings v System Service Aids settings v System Configuration settings v Network Services settings v Performance Option settings v Login Profile settings settings v Boot List settings v Host Ethernet Adapter settings c. Record any service processor settings you might have set using the operating system commands. 5. Stop the system or logical partition. For instructions, see Stopping a system or logical partition. 6. Disconnect the power source from the system by unplugging the system. Note: This system is optionally equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 34 Power Systems: Serial-attached SCSI RAID enablement 7. Label and disconnect the external cables connected to the field-replaceable unit (FRU) you are servicing. 8. Place the system into the service position. For instructions, see Placing a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position.. 9. Remove the service access cover: v For a rack-mounted system, see Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. v For a stand-alone system, see Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system. 10. Remove the memory riser. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C memory riser. 11. Remove the GX adapters. For instructions, see Removing GX adapters for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. 12. Remove the PCI adapters. For instructions, see Removing a PCI adapter from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C server with the power off. 13. Remove the Host Ethernet Adapter (8202-E4B or 8205-E6B). For instructions, see Removing the 8202-E4B or 8205-E6B Host Ethernet Adapter with the power off. 14. Remove all the voltage regulator modules. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C voltage regulator module. 15. Remove the middle fans. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C middle fans. 16. Remove the power supplies. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C power supply with the power off or Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C power supply with the power on. 17. Disconnect the internal cables from backplane. Remove the SPCN cable from the chassis rear (8202-E4C or 8205-E6C). Removing and replacing SAS RAID adapters and batteries 35 18. Remove the disk drive cage: a. Grasp the disk drive latches and pull then outward. b. Grasp the handles and pull the disk drive cage toward you. Figure 19. Disconnecting the internal cables 36 Power Systems: Serial-attached SCSI RAID enablement 19. Remove the PCI-riser air divider. Firmly grasp the PCI-riser air divider and lift it upward. 20. Remove the PCI riser (optional): Figure 20. Removing the disk drive cage Figure 21. Removing the PCI riser air divider Removing and replacing SAS RAID adapters and batteries 37 a. Loosen the two thumbscrews (A) by turning them counterclockwise. b. Unlock the PCI riser latches (B) by pressing the end of the latches and lifting them upward. c. Remove the PCI riser by pulling it upward. 21. Remove the disk-drive cage shield and voltage regulator guide and air divider. a. Remove the five screws that secure the disk-drive cage shield and the voltage regulator guide and air divider. b. Remove the disk drive cage shield and the voltage regulator guide and air divider by lifting them upward. Figure 22. Removing the PCI riser 38 Power Systems: Serial-attached SCSI RAID enablement 22. Remove the dual inline memory module (DIMM) cage: a. Remove the four screws that secure the DIMM cage (View A). Figure 23. Removing the disk-drive cage shield (8202-E4B or 8205-E6B) Figure 24. Removing the disk-drive cage shield (8202-E4C or 8205-E6C) Removing and replacing SAS RAID adapters and batteries 39 b. Remove the DIMM cage by lifting it upward. 23. Disconnect the front fan cable. Figure 25. Removing the DIMM cage 40 Power Systems: Serial-attached SCSI RAID enablement 24. Remove the middle fan cage: a. Remove the screws (B) that secure the fan cage. b. Remove the two screws (A) that secure the fan cage on the outside of the system chassis. c. Remove the fan cage by lifting it upward. Figure 26. Disconnecting the front fan cable Removing and replacing SAS RAID adapters and batteries 41 25. Remove the power interlock switch. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C power interlock switch with power off. 26. Remove the system processor module. For instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system processor module. 27. Remove the metal guide bracket for the voltage regulator module. a. Locate the three screws securing the bracket to the system sidewall. b. Remove the three screws. 28. Remove the system backplane. Note: Use care when removing the system backplane. Do not lift the system backplane by any of the attached modules. a. Label and disconnect all the Ethernet cables that are present. b. Remove the four screws (A). Note: Do not remove any of the Torx screws. Remove only the four hex and slot combination screws. Figure 27. Remove the middle fan cage 42 Power Systems: Serial-attached SCSI RAID enablement c. Grip the system backplane by both heat-sink load arms (B). Figure 28. Removing the system backplane (8202-E4B or 8205-E6B) Figure 29. Removing the system backplane Removing and replacing SAS RAID adapters and batteries 43 d. Lift the front of the system backplane up to clear the front alignment pins (approximately 6.35 mm (1/4 in.)). e. Slide the system backplane slightly toward the front of the system so that the connectors on the system backplane clear the rear chassis. f. After the connectors are clear of the chassis, lift the rear of the system backplane and remove the system backplane at an angle. Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane Learn how to replace the system backplane that is part of a service action. To complete this procedure, you must have already completed the removal procedure. For the removal procedure instructions, see Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane. If your system is managed by the Hardware Management Console (HMC), use the HMC to replace the system backplane. For instructions, see Exchanging a part using the Hardware Management Console. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to replace the system backplane. For instructions, see Exchanging a part using the Systems Director Management Console. If you do not have an HMC or an SDMC, complete the following steps to replace the system backplane: Attention: Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge from damaging your hardware. When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. If you do not have a wrist strap, prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 1. Replace the system backplane. Note: Use care when you replace the system backplane. a. Carefully grasp the replacement system backplane by both of the heat-sink load arms (A). b. Lower the system backplane at an angle toward the front of the system chassis. c. Lower the rear of the system backplane, and then slide it toward the rear of the system until the connectors clear the openings in the chassis (see inset). d. Lower the front of the system backplane onto the alignment pins (B). 44 Power Systems: Serial-attached SCSI RAID enablement e. Tighten the four screws (C) that secure the system backplane to the system chassis. f. Reconnect all of the Ethernet cables that were previously removed. Figure 30. Installing the system backplane (8202-E4B or 8205-E6B) Figure 31. Installing the system backplane Removing and replacing SAS RAID adapters and batteries 45 2. Install the metal guide bracket for the voltage regulator module. a. Align the three holes on the bracket to the three holes along the system sidewall. b. Tighten the three screws. 3. Replace the system processor module. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system processor module. 4. Replace the power interlock switch. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C power interlock switch with the power off. 5. Replace the fan cage: a. Lower the fan cage into position ensuring that alignment groove (A) and alignment pin (B) are correctly aligned. b. Tighten the two screws (D) that secure the fan cage to the system chassis. c. Tighten the screws (C) that secure the fan cage. 6. Connect the front fan cable. Figure 32. Installing the fan cage 46 Power Systems: Serial-attached SCSI RAID enablement 7. Replace the dual inline memory module (DIMM) cage: a. Lower the DIMM cage into position. b. Insert and tighten the four screws that secure the DIMM cage (View A). Figure 33. Connecting the front fan cable Removing and replacing SAS RAID adapters and batteries 47 8. Replace the plastic voltage regulator module guide (A). a. Lower the plastic guide into position. b. Insert and tighten the three screws that secure the guide. 9. Replace the disk-drive cage shield (B). a. Lower the disk-drive cage shield into position. b. Insert and tighten the two screws that secure the disk-drive cage shield. Figure 34. Replacing the DIMM cage 48 Power Systems: Serial-attached SCSI RAID enablement 10. Replace the PCI riser: a. Lower the PCI riser into position. Figure 35. Replacing the disk-drive cage shield (8202-E4B or 8205-E6B) Figure 36. Replacing the disk-drive cage shield (8202-E4C or 8205-E6C) Removing and replacing SAS RAID adapters and batteries 49 b. Press the PCI riser latches (A) down until they lock into place. c. Tighten the two thumbscrews (D). 11. Replace the PCI-riser air divider. a. Lower the PCI-riser air divider and press it into place. Figure 37. Replacing the PCI riser 50 Power Systems: Serial-attached SCSI RAID enablement 12. Replace the disk drive cage: a. With the disk drive latches extended, push the disk drive cage into the system. b. Push the disk drive latches toward system until they lock. Figure 38. Replacing the PCI riser air divider Removing and replacing SAS RAID adapters and batteries 51 13. Connect the internal cables to the backplane. Replace the SPCN cable to the chassis rear (8202-E4C or 8205-E6C). Figure 39. Replacing the disk drive cage Figure 40. Connecting the internal cables 52 Power Systems: Serial-attached SCSI RAID enablement 14. Replace the power supplies. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C power supply with the power off or Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C power supply with the power on. 15. Replace the fans. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C fans. 16. Replace the memory riser. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C memory riser. 17. Replace all the voltage regulator modules. For instructions, see Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C voltage regulator module. 18. Replace the Host Ethernet Adapter (8202-E4B or 8205-E6B). For instructions, see Replacing the 8202-E4B or 8205-E6B Host Ethernet adapter with the power off. 19. Replace the GX adapters. For instructions, see Replacing GX adapters for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C. 20. Replace the PCI adapters. For instructions, see Replacing a PCI adapter from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C server with the power off. 21. Replace the service access cover: v For a rack-mounted system, see Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. v For a stand-alone system, see Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system. 22. Replace the front cover: v For a rack-mounted system, see Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. v For a stand-alone system, see Installing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system. 23. Place the system in the operating position. For instructions, see Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position. 24. Start the system or logical partition. See Starting the system or logical partition. 25. Restore network connectivity. The service processor must connect to the network to be accessible from the management console. How will the IP Address of the Service Processor be managed? Obtain an IP address automatically from a Dynamic Host Configuration Protocol (DHCP) server Go to next step. Configure a static IP address Go to step 27 on page 54 26. Restore service processor settings. Enable network access to the service processor. If the network connection uses the Dynamic Host Configuration Protocol (DHCP) to establish an IP address, and uses the Management consoles as the DHCP server, no additional configuration is necessary to enable network access to the service processor. Perform the following steps: a. Ensure that the service processor is connected to the existing service network by verifying that the management console cable is connected to the management console port on the system backplane. b. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Can you access the service network? Yes No Go to step 29 on page 54. Go to step 32 on page 55 Removing and replacing SAS RAID adapters and batteries 53 27. Restore service processor settings. The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, controlling the system power, and setting service processor network ports. The ASMI may also be referred to as the service processor menus. The ASMI can be accessed through https. As the network connection uses static IP address assignments, perform the following steps. a. To know your service processor network ports, if necessary, you can use the control panel in manual mode. Refer to service functions to get control panel documentation in that case. b. Connect a client with a Web browser directly to the service processor network ports you previously noted. It could be something like the following URLs: v https://169.254.2.147 v https://169.254.3.147 c. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. d. Log on to the ASMI with the admin User ID and admin default Password. e. Change the admin user ID's Password and the general user ID's Password when prompted. f. To configure network access perform the following steps. 1) Click Network Configuration under the Network Services node. 2) On the right pane, configure eth0 and eth1 network interfaces, select Static for the Type of IP Address field, and set and a Host name, an IP address, a Subnet mask. 28. Restore system processor settings. With the network connection now configured to use static IP address assignments, try to access the service processor network ports. Can you access the service network? Yes No Go to next step. Go to step 33 on page 55 29. Access the ASMI. The Advanced System Management Interface (ASMI) is the interface to the service processor that is required to perform general and administrator-level service tasks, such as reading service processor error logs, reading vital product data, setting up the service processor, and controlling the system power. The ASMI might also be referred to as the service processor menus. To access the ASMI, refer to Accessing the Advanced System Management Interface. After you have accessed the ASMI, continue with the next step. 30. Set System Enclosure Type and Feature Code. The enclosure serial number is a seven-digit number printed on a bar-coded label on the system unit. When the system backplane is replaced, the seven-digit enclosure serial number and feature code number needs to be manually inputted and stored in the new system backplane using the Advanced System Management Interface (ASMI). Note: The feature code number for 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system backplane is 78AA-001. Note: To perform this operation, your authority level must be Authorized Service Provider (ID: celogin). To input the system serial number: a. On the ASMI Welcome pane, specify your user ID and password, and click Log in. b. In the navigation area, expand System Configuration and Program Vital Product Data. c. Select System Enclosures. In the right pane, the current system enclosures are displayed. 54 Power Systems: Serial-attached SCSI RAID enablement d. On the system being serviced, record the enclosure serial number printed on the bar-coded label on the system unit. e. Enter the enclosure serial number recorded from the system unit in the Enclosure serial number field. f. Enter the feature code number in the Feature Code/Sequence Number field. g. Click Save settings to update and save the enclosure serial number and the feature code. 31. Go to step 35 on page 56 32. Restore service processor settings. If the network connection uses DHCP to establish an IP address, but does not use a management console as the DHCP server, perform the following steps. a. Complete any network configuration necessary to allow the DHCP server to recognize and assign an IP address to the service processor. b. Ensure that the service processor is connected to the existing service network by verifying that the network cable is connected to the network port. c. If not already connected, connect all system power cables by plugging them into power outlets. Note: Do not start the system at this time. Can you access the service network? Yes No Go to 29 on page 54 Go to next step. 33. Restore service processor settings. You have indicated that you are still not able to access the service network. You will need to reset the service processor by pressing the pin-hole switch (E) on the operator panel. Removing and replacing SAS RAID adapters and batteries 55 34. Got to step 25 on page 53. 35. Restore service processor settings. Change the password of the admin user ID when prompted by performing the following steps. a. In the navigation area of the ASMI, expand Login Profile. b. Select Change Password under Login Profile. c. Specify the required information, and click Continue. As a security measure, you are required to enter your current user password in the Current password for current user ID field. Note: Passwords can be any combination of up to 64 alphanumeric characters. The default password for the admin User ID is admin. After your initial login to the ASMI, the admin password must be changed. As a security measure, you are required to enter the password for your user ID in the Current password for current user field. 36. Restore service processor settings. Figure 41. Pin-hole switch on the operator information panel 56 Power Systems: Serial-attached SCSI RAID enablement Set the system name. The system name can be changed to any valid ASCII string. It does not have to follow the initialized machine type_model_serial number format. To change the system name, complete the following steps. a. In the navigation area of the ASMI, expand System Configuration. b. Select System Name under System Configuration. c. Enter the desired system name. d. Click Save settings to update the system name to the new value. The new system name is displayed in the status frame, the area where the Log out button is located. If another method, such as the Management consoles, are used to change the system name, the status frame does not reflect the change. 37. Restore service processor settings. To set the time-of-day, complete the following steps. a. In the navigation area of the ASMI, expand System Configuration. b. Select Time of Day under System Configuration. c. If the system is powered off, the right pane displays a form that shows the current date (month, day, and year) and time (hours, minutes, seconds). d. Change either the date or the time or both, and click Save settings. 38. Restore service processor settings. Verify if the server firmware level is same as the value recorded prior to replacing the system backplane. If the value is different then it must be fixed. For information on how to fix the firmware, see Getting firmware fixes. 39. Restore the server Host Ethernet Adapter (HEA) device settings. v For HMC: a. Expand the Systems Management folder in the navigation tree pane. b. Double click the Servers folder. c. Select a server in the central panel. d. On the Tasks menu, click Hardware Information > Adapters > Host Ethernet. The Host Ethernet Adapters window opens. e. Select the required physical port and click Configure. The HEA Physical Port Configuration window opens. f. Enter the value in the Pending Port Group Multi-Core Scaling value field and click OK. v For SDMC: a. Go to the Resources tab and click Hosts. b. Select a server in the right pane. c. On the Actions menu, click Hardware Information > Adapters > Host Ethernet. The Host Ethernet Adapters window opens. d. Select the required physical port and click Configure. The HEA Physical Port Configuration window opens. e. Enter the value in the Pending Port Group Multi-Core Scaling value field and click OK. 40. Restore service processor settings. Reenter any of the following settings that were previously changed through the Advanced System Management Interface (ASMI), unless you want to use the default settings. v System name setting v System Power Control settings v System Service Aids settings v System Configuration settings v Network Services settings Removing and replacing SAS RAID adapters and batteries 57 v Performance Options settings v Login Profile settings v Boot List settings v Host Ethernet Adapter settings 41. Restore service processor settings. a. Reset any service processor settings that you may have set using operating system commands. You recorded these settings at the beginning of this procedure. b. If you choose to reset the Management consoles Access password, perform the following: To change the HMC access password, select one of the following methods: 1) From the HMC GUI (preferred method): a) Expand the Systems Management folder in the navigation tree pane. b) Double click the Servers folder. c) Select a server in the central panel. d) Under the Operations folder, select Change Password. e) Provide the desired information and click OK. 2) From the HMC command line, type: /usr/hmcrbin/chsyspwd -m managedsystem -t access --passwd --newpasswd newpassword where: v The value for managedsystem is the new service processor's managed system name. v No value for --passwd is entered thereby allowing authentication. v The value for newpasswd is the newpassword value. To change the SDMC access password, follow these steps: 1) Go to the Resources tab and click Hosts. 2) Select a server in the right pane. 3) On the Actions menu, click Operations > Change Password. 4) Provide the desired information and click OK. 42. Recover partition data on the service processor. Profile data stored in the managed server has been cleared or corrupted. To recover profile data using the HMC, complete the following steps. a. Expand the Systems Management folder in the navigation tree pane. b. Double click the Servers folder. c. Use the check box to select a server, in the central panel. d. In the Configuration folder if you have the Manage Partition Data folder, select Restore. To recover profile data using the SDMC, complete the following steps: a. Go to the Resources tab and click Hosts. b. Select a server in the right pane. c. On the Actions menu, click System Configuration > Manage Partition Data > Restore. d. Select the desired restore option and click OK. 43. Update vital product data. If your system is running IBM i and is not managed by the Management consoles, you must update the load source after replacing the system backplane. To update the load source without using the Management consoles, do the following steps: a. Load the I_BASE_01 Licensed Internal Code optical media into the device that is defined for the system and access it to perform an initial program load (IPL). Note: This IPL can take several minutes. 58 Power Systems: Serial-attached SCSI RAID enablement b. After the Select a Language Group display is shown, press Enter to select the default language feature 2924 (English). Note: To change the language feature, type the language feature that appears on the media that contains the operating system, and then press Enter. Confirm the language feature by pressing Enter. c. From the Install Licensed Internal Code display, select option 2 (Work with Dedicated Service Tools (DST)) and press Enter. Use the Dedicated Service Tools to make sure the disk units, including the load source, are reporting correctly and that there are no errors that might prevent the IPL from disk. When all errors have been resolved and the load source disk is reporting correctly, continue with the next step. d. From the Work with Dedicated Service Tools (DST) display, select option 2 (Work with disk units) and press Enter. e. Select option 8 (Update system vital product data) to identify the location of the load source disk unit. Press Enter. The Update System Vital Product Data display is shown. f. Press Enter to confirm that you want the vital product data to be written. g. Press F12 (Previous) twice. h. Type 1 and press Enter to exit DST. Use the system control panel to delay turning off the system. Change the IPL mode to B, and power on the system to perform an IPL from disk. 44. Restore the boot list and the Host Ethernet Adapter settings. The boot list is not present on a new system backplane, therefore; it is necessary to create the boot devices by booting into the system management services (SMS) menus. See Selecting boot options for information on setting the boot device list. After the operating system is booted, the bootlist command can be used to verify, or make additional changes to, the list of boot devices for a system or partition. To restore the Host Ethernet Adapter settings, see Configuring physical ports on a Host Ethernet Adapter. 45. Verify the repair. For more information, see Verifying a repair. 8202-E4B or 8205-E6B SAS subsystem service considerations There are several considerations to review before servicing features within the SAS subsystem for the 8202-E4B or 8205-E6B system. Attention: Do not attempt to remove any parts related to the SAS subsystem if the cache data present LED on the card in either P1-C19 or P1-C14 is flashing. To see this LED, you must shut down the system and remove the service cover. The LED shows a no-touch icon (a hand with a slash through it), and indicates that there is possible data in cache on the adapter. To prevent data loss, first make sure that the system has been shut down correctly. If either cache data present LED is flashing, refer to one of the following topics before removing any SAS subsystem parts. You can safely replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced on the Battery Information panel (for Linux and IBM i systems) or COMMAND STATUS panel (for AIX systems). Before servicing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas. v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device Removing and replacing SAS RAID adapters and batteries 59 Service considerations for the cache battery pack To prevent data loss, follow proper procedures before replacing the cache battery pack on the Cache RAID cards. Note: Concurrent maintenance of adapter battery is not supported. Even with failed or missing batteries, the card associated with the battery must first be removed. For more information, see Removing and replacing SAS RAID adapters and batteries. Attention: To maintain system availability and prevent possible data loss, you must understand that these features have implications on the SAS configuration and data accessibility of the system. Attention: Do not attempt to remove any parts related to the SAS subsystem if the cache data present LED on the card in either P1-C19 or P1-C14 is flashing. To see this LED, you must shut down the system and remove the service cover. The LED shows a no-touch icon (a hand with a slash through it), and indicates that there is possible data in cache on the adapter. To prevent data loss, first make sure the system has been shut down correctly. If either cache data present LED is flashing, refer to one of the following topics before removing any SAS subsystem parts. You can safely replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced on the Battery Information panel (for the Linux and IBM i systems) or COMMAND STATUS panel (for AIX systems). The following figure shows the location of the cache data present LED in the 8202-E4B or 8205-E6B SAS subsystem. The following table describes the states of the cache data present LED and what to consider before attempting a service action. Figure 42. Cache data present LED 60 Power Systems: Serial-attached SCSI RAID enablement Table 1. LED indications and actions Cache battery card (Un-P1-C14) RAID/Cache storage controller (Un-P1-C19) Indication Action Off Off Cache is not active. Service actions can proceed. Off Flashing Cache is active on C19. Cache might have been flushed on C14 if C19 is failed. Attempt to power on and perform a proper shutdown prior to any service action. Flashing Off Cache is active on C14. Cache might have been flushed on C19 if C14 is failed. Attempt to power on and perform a proper shutdown prior to any service action. Flashing Flashing Cache is active on the dual controllers. Attempt to power on and perform a proper shutdown prior to any service action. If this is unsuccessful in clearing the cache active LEDs, contact support. See Contacting IBM service and support. Removing the 8202-E4B or 8205-E6B RAID enablement card Use this procedure to remove a Redundant Array of Independent Disks (RAID) enablement card before installing a new or upgraded enablement card. Attention: To maintain system availability and prevent possible data loss, it is important to understand that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility of the system. Review the SAS subsystem topic to understand the various SAS configuration and service considerations. Before servicing or installing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas: v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device If your system is managed by the Hardware Management Console (HMC), use the HMC to remove the enablement card from the server. For instructions, see “Removing a part by using the Hardware Management Console” on page 106. If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the enablement card from the server. For instructions, see “Removing a part by using the Systems Director Management Console” on page 143. If you do not have an HMC or an SDMC, complete the following steps to remove the enablement card: 1. Complete the prerequisite tasks described in “Before you begin” on page 79. 2. Stop the system, as described in “Stopping a system or logical partition” on page 89. Removing and replacing SAS RAID adapters and batteries 61 3. Ensure the system is powered down. Check the operator panel to see that the power LED is flashing on or off. 4. Remove the front cover. v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 92. v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 92. 5. For a rack-mounted system, place the system into the service position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page 100. 6. Remove the service access cover. v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 95. v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 96. 7. Check the status of the green Write Cache LEDs on cards located in P1-C19 and P1-C14 of Figure 1. v If both the green cache LEDs are flashing, replace the system cover and follow the “Starting the system or logical partition” on page 87 before returning here. v If both LEDs are off, proceed to the next step. v If by chance, one cache LED is flashing and the other is not, contact your service provider for assistance. 8. Disconnect the power source from the system by unplugging the system. Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 62 Power Systems: Serial-attached SCSI RAID enablement 9. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 10. Locate the RAID enablement card that you want to remove. Note: If an optional tape drive is installed, then move the tape drive to access the RAID enablement card. 11. Press the tab and slide the card out of the system as shown in the following figures. The RAID enablement card is attached to a plastic plate. You must slide the plastic plate out of the system to remove the RAID enablement card. Figure 43. Slot location for RAID enablement card (P1-C13) Removing and replacing SAS RAID adapters and batteries 63 To replace the RAID enablement card, see “Replacing the 8202-E4B or 8205-E6B RAID enablement card.” 12. Replace the service access cover. v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 13. For a rack-mounted system, put the system into the operating position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 14. Replace the front cover. v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. 15. Reconnect the power source. 16. Start the system as described in “Starting the system or logical partition” on page 87. 17. Verify the installed part: v If you replaced the part because of a service action, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Replacing the 8202-E4B or 8205-E6B RAID enablement card Use this procedure to replace an existing Redundant Array of Independent Disks (RAID) enablement card. Figure 44. RAID enablement card removal from a rack-mounted system unit 64 Power Systems: Serial-attached SCSI RAID enablement You must have already completed the procedure “Removing the 8202-E4B or 8205-E6B RAID enablement card” on page 61. To replace a RAID enablement card, complete the following steps: 1. Locate the slot for the RAID enablement card. Note: If an optional tape drive is installed, then move the tape drive to access the RAID enablement card. 2. Align the RAID enablement card with the slot in the system. 3. Slide the RAID enablement card fully into the system until the tab (A) locks into place. 4. Replace the service access cover: v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 5. For a rack-mounted system, put the system into the operating position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 6. Replace the front cover: v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. 7. Reconnect the power source. 8. Start the system as described in “Starting the system or logical partition” on page 87. 9. Verify the installed part: Figure 45. RAID enablement card replacement in a rack-mounted system unit Removing and replacing SAS RAID adapters and batteries 65 v If you replaced the part because of a service action, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller Use this procedure to remove a Redundant Array of Independent Disks (RAID) storage controller before installing a new or upgraded storage controller. You can also use this procedure to remove a RAID-10 storage controller. Identify the system that you will be working on by using the identify the system process to turn on the system locate (blue) indicator. For more information, see “Control panel LEDs” on page 81. Attention: To maintain system availability and prevent possible data loss, it is important to understand that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility of the system. Review the SAS subsystem topic to understand the various SAS configuration and service considerations. Before servicing or installing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas: v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device If your system is managed by the Hardware Management Console (HMC), use the HMC to remove RAID/cache storage controller from the server. For instructions, see “Removing a part by using the Hardware Management Console” on page 106 If your system is managed by the IBM Systems Director Management Console (SDMC), use the SDMC to remove the RAID/cache storage controller from the server. For instructions, see “Removing a part by using the Systems Director Management Console” on page 143. If you do not have an HMC or an SDMC, complete the following steps to remove RAID/cache storage controller: 1. Complete the prerequisite tasks described in “Before you begin” on page 79. 2. Stop the system, as described in “Stopping a system or logical partition” on page 89. 3. Remove the front cover. v For a rack-mounted system, see “Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 92. v For a stand-alone system, see “Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 92. 4. For a rack-mounted system, place the system into the service position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position” on page 100. 5. Remove the service access cover. v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 95. 66 Power Systems: Serial-attached SCSI RAID enablement v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 96. 6. Use the service indicator light-emitting diodes (LEDs) to help identify the part, as described in “Identifying a part” on page 81. 7. Disconnect the power source from the system by unplugging the system. Note: This system is equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. (L003) or 8. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 9. Locate the RAID/cache storage controller that you want to remove. 10. Pull the tab (A) out and slide the card (B) out of the system as shown in the following figure. Removing and replacing SAS RAID adapters and batteries 67 To replace the RAID/cache storage controller, see “Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller” on page 69. 11. Replace the service access cover: v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 12. For a rack-mounted system, put the system into the operating position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 13. Replace the front cover: v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. 14. Reconnect the power source. 15. Start the system as described in “Starting the system or logical partition” on page 87. 16. Verify the installed part: v If you replaced the part because of a service action, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Figure 46. RAID/cache storage controller removal from a rack-mounted system unit 68 Power Systems: Serial-attached SCSI RAID enablement Replacing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller Use this procedure to replace an existing Redundant Array of Independent Disks (RAID) storage controller. You can also use this procedure to replace a RAID-10 storage controller. Identify the system that you will be working on by using the identify the system process to turn on the system locate (blue) indicator. For more information, see “Control panel LEDs” on page 81. You must have already completed the procedure “Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller” on page 66. To replace a RAID/cache storage controller, complete the following steps: 1. Locate the slot for the RAID/cache storage controller. 2. Align the RAID/cache storage controller with the slot in the system. 3. Slide the RAID/cache storage controller (A) fully into the system until the tab (B) locks into place. 4. Replace the service access cover: v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 5. For a rack-mounted system, put the system into the operating position, as described in “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 6. Replace the front cover. Figure 47. RAID/cache storage controller replacement in a rack-mounted system unit Removing and replacing SAS RAID adapters and batteries 69 v For a rack-mounted system, see “Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 93. v For a stand-alone system, see “Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 94. 7. Reconnect the power source. 8. Start the system as described in “Starting the system or logical partition” on page 87 9. Verify the installed part: v If you replaced the part because of a service action, verify the installed part as described in Verifying a repair. v If you installed the part for any other reason, verify the installed part as described in Verifying the installed part. Removing and replacing the cache battery card To service the cache battery card, complete the following procedure: Attention: To maintain system availability and prevent possible data loss, it is important to understand that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility of the system. Review the SAS subsystem topic to understand the various SAS configuration and service considerations. Before servicing or installing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas: v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device Complete the following steps to remove and install the cache battery card: 1. Begin with power on. To prevent data loss, the cache battery card must be in an error state before it is removed from the system. If you have powered the system off, power it on before you continue. 2. Select your operating system: v AIX: Go to step 3. v Linux: Go to step 5. v IBM i: Go to step 7. 3. Force the cache battery card into an error state on AIX (to prevent possible data loss), as follows: a. Navigate to the IBM SAS Disk Array Manager by using the following steps: 1) At the command prompt, type smit, and press Enter. 2) Select Devices. 3) Select Disk Array. 4) Select IBM SAS Disk Array. 5) Select IBM SAS Disk Array Manager from the menu with options for configuring and managing the IBM SAS RAID Controller. b. Select Diagnostics and Recovery Options. c. Select Controller Rechargeable Battery Maintenance. d. Select Force Controller Rechargeable Battery Error. 70 Power Systems: Serial-attached SCSI RAID enablement e. Select the IBM SAS RAID Controller for the battery you want to replace. This option places the battery into the error state, which requires it to be replaced. f. Determine whether it is safe to replace the cache battery card. See the Displaying rechargeable battery information procedure. It is safe to replace the cache battery card when Yes is displayed next to Battery pack can be safely replaced. You might need to reselect the option Display Controller Rechargeable Battery Information multiple times as it might take several minutes before it is safe to replace the cache battery card. g. Verify that the Cache Present LED is not flashing. Displaying rechargeable battery information a. Navigate to the IBM SAS Disk Array Manager by using the previous procedure. b. Select Diagnostics and Recovery Options. c. Select Controller Rechargeable Battery Maintenance. d. Select Display Controller Rechargeable Battery Information. e. Select IBM SAS RAID Controller. 4. Go to step 8. 5. Force the cache battery card into an error state on the Linux operating system (to prevent possible data loss), as follows: a. Run the iprconfig utility by typing iprconfig. b. Select Work with disk unit recovery. c. Select Work with resources containing cache battery packs. d. Select your adapter and type 2. Then press Enter to force the battery error. Note: This option places the battery into the error state, which requires it to be replaced. e. If you are sure you want to force a battery error, type c to confirm. If you do not want to force a battery error, type q to cancel. f. Determine whether it is safe to replace the cache battery card. See Displaying rechargeable battery information. It is safe to replace the cache battery card when Yes is displayed next to Battery pack can be safely replaced. You might need to reselect the option Display Controller Rechargeable Battery Information multiple times, because it might take several minutes before it is safe to replace the cache battery card. 6. Go to step 8. 7. Force the cache battery card into an error state on IBM i (to prevent possible data loss), on the system or partition that contains the adapter as follows: a. Be sure that you are signed on to the system with at least service level authority. b. Type strsst on the command line and press Enter. c. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. d. Select Start a Service Tool from the System Service Tools (SST) display and press Enter. e. Select Hardware Service Manager from the Start a Service Tool display and press Enter. f. Select Work with resources containing cache battery packs from the Hardware Service Manager display and press Enter. g. Select Force battery pack into error state for the I/O card that you are working with from the Work with Resources containing cache battery packs display and press Enter. h. On the Force Battery Packs Into Error State display, verify that the correct I/O adapter has been selected and press the function key that confirms your choice. i. Return to the Work with Resources containing cache battery packs display and select Display battery information. Ensure that the field Safe to replace cache battery is set to Yes. This might take several minutes and you might need to press the refresh key to see the field update. Removing and replacing SAS RAID adapters and batteries 71 8. Place a rack-mounted system into operating position. See “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 9. Remove the service access cover: v For a rack-mounted system, see “Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 95. v For a stand-alone system, see “Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 96. 10. Check cache data present LED (A) by examining the LED on the cache battery card. v If the LED is flashing, go to next step. v If the LED is not flashing, go to step 12. 11. Do not remove the cache battery card if the LED is flashing. The flashing LED indicates that there is cached data on the battery that has not been saved. This data will be lost if the battery is removed at this time. Complete this service procedure without replacing the battery, and then retry it to ensure all cache data is saved before the battery is replaced. Go to step 18. 12. Remove the cache battery card as follows: Attention: Do not remove the cache battery card if the cache data present LED is flashing because this indicates cache data exists. a. Disengage the cache battery card (B) from the connector (A). b. Slide the cache battery card out of the mounting guides and remove it from the system. c. After the card has been removed from the system, remove the battery pack from the battery card: 1) Unplug the battery pack from the battery card. 2) Grasp the retaining clips and squeeze them closed to allow the battery pack to lift off of the retaining clips. Figure 48. Cache data present LED 72 Power Systems: Serial-attached SCSI RAID enablement If you are replacing the cache battery pack, go to the “Removing and replacing a cache battery pack” on page 75, and then return here and continue with the next step. 13. Replace the cache battery card as follows: Attention: Ensure that the cache battery card is disconnected for at least 60 seconds before you connect the new battery. This is the minimum amount of time needed for the adapter to recognize that the battery has been replaced. a. Slide the cache battery card into the mounting guides on the controller until it is seated in the battery connector. Figure 49. Removing the cache battery card Removing and replacing SAS RAID adapters and batteries 73 14. Power on the system and restart the adapter's write cache by doing the following steps: Select your operating system: v AIX: Go to step 15. v Linux: Go to step 16. v IBM i: Go to step 17. 15. Perform the following steps: a. Navigate to the IBM SAS Disk Array Manager by using the information in step 3a. b. Select Diagnostics and Recovery Options. c. Select Controller Rechargeable Battery Maintenance. d. Select Start Adapter Cache. e. Select the controller with the battery you just replaced and press Enter. f. Go to step 18. 16. Perform the following steps: a. Run the iprconfig utility by typing iprconfig. b. Select Work with disk unit recovery. c. Select Work with resources containing cache battery packs. d. Start the I/O adapter cache on the adapter for the battery that you just replaced by typing 3 and press Enter. e. Go to step 18. 17. Perform the following steps: a. Return to the Work with Resources containing Cache Battery Packs display using the information in step 7a - 7f and select the Start IOA cache. Press Enter. b. Ensure that you get the message Cache was started. c. Go to step 18. Figure 50. Installing the cache battery card 74 Power Systems: Serial-attached SCSI RAID enablement 18. Install the service access cover: v For a rack-mounted system, see “Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system” on page 98. v For a stand-alone system, see “Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system” on page 98. 19. Place into operating position. See “Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position” on page 101. 20. Close the back door on the unit you are servicing. 21. Verify that the new resource is working correctly, as described in “Verifying the installed part” on page 107. Removing and replacing a cache battery pack Use this procedure to replace the cache battery pack on a battery card. Identify the system that you will be working on by using the identify the system process to turn on the system locate (blue) indicator. For more information, see “Control panel LEDs” on page 81. Attention: Before continuing with this procedures, determine that it is safe to replace the cache battery pack. See Displaying rechargeable battery information. It is safe to replace the cache battery pack when Yes is displayed next to Battery pack can be safely replaced. Attention: To maintain system availability and prevent possible data loss, it is important to understand that these features have implications on the serial-attached SCSI (SAS) configuration and data accessibility of the system. Review the SAS subsystem topic to understand the various SAS configuration and service considerations. Before servicing or installing any features, review the I/O configurations for each logical partition on your system. It is important to understand the differences between the various configurations and the resulting service considerations. In addition, consider possible impacts to the following areas: v Logical partitions and their I/O adapter assignments v RAID array configurations v Physical disk locations v Path to your boot device Complete the following steps to replace a nonconcurrent maintainable battery pack: 1. Remove the controller from the system. For more information, see “Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller” on page 66. 2. Place the controller on a surface that is electrostatic-discharge protected. 3. Unplug the battery connector (A) from its connector on the adapter, squeezing the retaining latch while gently pulling on the plug. The plug connects to the board in only one way; therefore, it cannot be inserted incorrectly during the replacement procedure. Note: Ensure that the cache battery pack is disconnected for at least 60 seconds before connecting the new battery. This is the minimum amount of time needed for the adapter to recognize that the battery has been replaced. Removing and replacing SAS RAID adapters and batteries 75 (A) Battery connector (B) Cache battery pack 4. The battery is mounted on a black plastic plate with two tabs, secured by white plastic standoffs with retaining tabs. To remove the battery assembly, remove the mounting plate from the standoffs, one standoff at a time. 5. Push in the standoff retaining tab and pull up on the mounting tab until the standoff releases through the hole in the mounting tab. Repeat for the second standoff. Note: If the retaining tab is difficult to push in, turn the white standoff so its retaining tab faces away from the battery pack. 6. Install the battery on to the posts until the latches click into place. Figure 51. Removing the cache battery 76 Power Systems: Serial-attached SCSI RAID enablement 7. Reinsert the pins into the rivets from the rear of the adapter. 8. Connect the cache battery pack connector (A) to the adapter. The plug connects to the adapter in only one way; therefore it cannot be inserted incorrectly. 9. Reinstall the adapter and restart the cache. For more information, see the following procedures: v “Removing and replacing the cache battery card” on page 70. v “Removing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C RAID/cache storage controller” on page 66. Figure 52. Replacing the cache battery Removing and replacing SAS RAID adapters and batteries 77 78 Power Systems: Serial-attached SCSI RAID enablement Common procedures for installable features This section contains all the common procedures related to installing, removing and replacing features. Before you begin Observe these precautions when you are installing, removing, or replacing features and parts. These precautions are intended to create a safe environment to service your system and do not provide steps for servicing your system. The installation, removal, and replacement procedures provide the step-by-step processes required to service your system. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER © Copyright IBM Corp. 2010, 2011 79 Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Before you begin a replacement or installation procedure, perform these tasks: 1. If you are installing a new feature, ensure that you have the software required to support the new feature. To do this, go to the following Web site: IBM Prerequisite 2. If you are performing an installation or replacement procedure that might put your data at risk, ensure, wherever possible, that you have a current backup of your system or logical partition (including operating systems, licensed programs, and data). 3. Review the installation or replacement procedure for the feature or part. 4. Note the significance of color on your system. 80 Power Systems: Serial-attached SCSI RAID enablement Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also indicate that the part can be removed and replaced with the system or logical partition power on. 5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of scissors. 6. If parts are incorrect, missing, or visibly damaged, do the following: v If you are replacing a part, contact the provider of your parts or next level of support. v If you are installing a feature, contact one of the following service organizations: – The provider of your parts or next level of support. – In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL) at 1–800–300–8751. In countries and regions outside of the United States, use the following Web site to locate your service and support telephone numbers: http://www.ibm.com/planetwide 7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller, or your next level of support. 8. If you are installing new hardware in a logical partition, you need to understand and plan for the implications of partitioning your system. For information, see Logical Partitioning. Identifying a part Use these instructions to learn how to identify the location of a failed part, the location of a part to be removed, or the location to install a new part on your system or expansion unit using the appropriate method for your system. For IBM Power Systems™ servers that contain the POWER7® processor, the light-emitting diodes (LEDs) can be used to identify or verify the location of a part that you are removing, servicing, or installing. The combination identify and fault LED (amber color) shows the location of a field replaceable unit (FRU). When removing a FRU, first verify whether you are working on the correct FRU by using the identify function in the management console or other user interface. When removing a FRU under the direction of the hardware management console, the identify function is activated and deactivated automatically at the correct times. The identify function causes the amber LED to blink. When you turn off the identify function, the LED returns to the state it was previously. For parts that have a blue service button, the identify function sets LED information for the service button so that when the button is pressed, the correct LEDs on that part blink. If you need to use the identify function, use the following procedures. Control panel LEDs Use this information as a guide to the control panel LEDs and buttons. The control panel has LEDs that indicate various system status. Common procedures for installable features 81 v A: Power-on button v B: Power LED – A constant light indicates full system power to the unit. – A blinking light indicates standby power to the unit. Note: There is approximately a 30-second transition period from the time the power-on button is pressed to when the power LED goes from blinking to solid. During the transition period, the LED might blink faster. v C: Enclosure identify light – A constant light indicates the identify state, which is used to identify a part. – No light indicates that the system is operating normally. v D: System information light – No light indicates that the system is operating normally. – Light on indicates that the system requires attention. v E: USB port v F: Enclosure fault roll-up light – A constant light indicates a fault in the system unit. – No light indicates that the system is operating normally. v G: Function/Data display v H: Decrement button v I: Enter button v J: Increment button v K: Pinhole reset button Figure 53. Control panel 82 Power Systems: Serial-attached SCSI RAID enablement Related concepts Identifying a failing part Use these instructions to learn how to locate and identify a failing part on your system or expansion unit using the appropriate method for your system. Identifying a failing part in an AIX system or logical partition Use these instructions to learn how to locate a failing part, and then activate the indicator light for that part on a system or logical partition running the AIX operating system. Locating a failing part in an AIX system or logical partition You might need to use AIX tools, before activating the indicator light, to locate a part that is failing. 1. Log in as root user or celogin-. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. Select Display Previous Diagnostic Results and press Enter. 5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. The Display Diagnostic Log display shows a chronological list of events. 6. Look in the T column for the most recent S entry. Select this row in the table and press Enter. 7. Select Commit. The details of this log entry are shown. 8. Record the location information and the SRN value shown near the end of the entry. 9. Exit to the command line. Use the location information for the failing part to activate the indicator light that identifies the failing part. See “Activating the indicator light for the failing part.” Activating the indicator light for the failing part Use these instructions to help physically identify the location of a part you are servicing. 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators and press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. 6. Select Commit. This turns on the system attention and indicator light for the failing part. 7. Exit to the command line. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators and press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. When a light is activated for a failing part, an I character precedes the location code. 6. Select Commit. This turns off the system attention and indicator light for the failing part. 7. Exit to the command line. Common procedures for installable features 83 Identifying a failing part in an IBM i system or logical partition You can activate or deactivate the indicator light by using IBM i to assist in locating a failing part. Activating the failing-part indicator light You can search the service action log for an entry that matches the time, reference code, or resource of a problem, and then activate the indicator light for a failing part. 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point™ utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. If location information is available, select option 6 (Indicator on) to turn on the failing part's indicator light. Tip: If the failing part does not contain a physical indicator light, a higher-level indicator light is activated. For example, the indicator light for the backplane or unit that contains the failing part might be lit. In this case, use the location information to locate the actual failing part. 12. Look for the enclosure indicator light to locate the enclosure that contains the failing part. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. 84 Power Systems: Serial-attached SCSI RAID enablement Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. Select option 7 (Indicator off) to turn off the indicator light. 12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all problems have been resolved. 13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display. Identifying a failing part in a Linux system or logical partition If the service aids have been installed on a system or logical partition, you can activate or deactivate the indicator lights to locate a part or compete a service action. Locating a failing part in a Linux system or logical partition If the service aids have been installed on a system or logical partition, you need to activate the indicator lights to locate a part. To activate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter. 3. Look for the system attention light to identify the enclosure that contains the failing part. Related information Service and productivity tools for Linux on POWER systems IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. Finding the location code of a failing part in a Linux system or logical partition To retrieve the location code of the failing part, if you do not know the location code, use the procedure in this topic. To locate the failing part in a system or logical partition follow these steps: 1. Log in as root user. 2. At the command line, type grep diagela /var/log/platform and press Enter. 3. Look for the most recent entry that contains a system reference code (SRC). 4. Record the location information. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power Systems servers Common procedures for installable features 85 Activating the indicator light for the failing part If you know the location code of the failing part, activate the indicator light to help you locate which part to replace. To activate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter. 3. Look for the system attention light to identify the enclosure that contains the failing part. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power Systems servers Deactivating the failing-part indicator light After you complete a removal and replacement procedure, you must deactivate the failing-part indicator light. To deactivate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s normal -l and press Enter. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power Systems servers Locating a failing part in a Virtual I/O Server system or logical partition You can use Virtual I/O Server (VIOS) tools, before activating the indicator light, to locate a part that is failing. 1. Log in as root user or celogin-. 2. At the command line, type diagmenu and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. Select Display Previous Diagnostic Results and press Enter. 5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. A Display Diagnostic Log display appears. This display contains a chronological list of events. 6. Look in the T column for the most recent S entry. Select this row in the table and press Enter. 7. Choose Commit. The details of this log entry are shown. 8. Record the location information and the SRN value shown near the end of the entry. 9. Exit to the command line. Use the location information for the failing part to activate the indicator light that identifies the failing part. For instructions, see “Identifying a part by using the Virtual I/O Server.” Identifying a part by using the Virtual I/O Server Use these instructions to turn on the indicator light to help you physically locate a part by using the Virtual I/O Server (VIOS). 1. Log in as root user. 2. At the command line, type diagmenu and press Enter. 3. From the Function Selection menu, select Task Selection. Press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators. Press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. 86 Power Systems: Serial-attached SCSI RAID enablement 6. Select Commit. This turns on the system attention and indicator light for the failing part. 7. Exit to the command line. Starting the system or logical partition Learn how to start a system or logical partition after performing a service action or system upgrade. Starting a system that is not managed by a Hardware Management Console or or a Systems Director Management Console You can use the power button or the Advanced System Management Interface to start a system that is not managed by a Hardware Management Console (HMC) or an IBM Systems Director Management Console (SDMC). To start a system that is not managed by a HMC or SDMC, follow these steps: 1. Open the front rack door, if necessary. 2. Before you press the power button on the control panel, ensure that power is connected to the system unit as follows: v All system power cables are connected to a power source. v The Power LED, as shown in the following figure, is slowly blinking. v The top of the display, as shown in the following figure, shows 01 V=F. 3. Press the power button (A), as shown in the following figure, on the control panel. v A: Power-on button v B: Power LED – A constant light indicates full system power to the unit. – A blinking light indicates standby power to the unit. Figure 54. Control panel Common procedures for installable features 87 Note: There is approximately a 30-second transition period from the time the power-on button is pressed to when the power LED goes from blinking to solid. During the transition period, the LED might blink faster. v C: Enclosure identify light – A constant light indicates the identify state for the enclosure or for a resource within the enclosure. – No light indicates that no resources in the enclosure are being identified. v D: Attention light – No light indicates that the system is operating normally. – A solid light indicates that the system requires attention. v E: USB port v F: Enclosure fault roll-up light – A constant light indicates a fault indicator active in the system. – No light indicates that the system is operating normally. v G: Function/Data display v H: Decrement button v I: Enter button v J: Increment button v K: Pinhole reset button 4. Observe the following after pressing the power button: v The power-on light begins to blink faster. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed. v Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. The power-on light on the control panel stops blinking and remains on, indicating that system power is on. Tip: If pressing the power button does not start the system, do the following steps to start the system using the Advanced System Management Interface (ASMI): 1. Access the ASMI. For instructions, see Accessing the ASMI. 2. Start the system using the ASMI. For instructions, see Powering the system on and off. Starting a system or logical partition by using the Hardware Management Console You can use the Hardware Management Console (HMC) user interface to start the system or logical partition after the required cables are installed and the power cables are connected to a power source. For instructions on working with the HMC, see Managing the Hardware Management Console. For instructions on starting a logical partition, see Logical partitioning. For instructions on starting the system, see Powering on the managed system. Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. When the power-on light on the control panel stops blinking and remains on, the system power is on. 88 Power Systems: Serial-attached SCSI RAID enablement Starting a system or virtual server with the Systems Director Management Console You can use the IBM Systems Director Management Console (SDMC) user interface to start the system or virtual server after the required cables are installed and the power cables are connected to a power source. For instructions on working with the SDMC, see Managing and configuring the SDMC. For instructions on starting a virtual server, see Managing virtual servers. For instructions on shutting down and restarting virtual servers, see Shutting down and restarting virtual servers. Progress indicators, also known as checkpoints, display on the control panel while the system is being started. When the power-on light on the control panel stops flashing and remains on, the system power is on. Stopping a system or logical partition Learn how to stop a system or logical partition as a part of a system upgrade or service action. Attention: Using either the power-on button on the control panel or entering commands at the Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data files. Also, the next time you start the system, it might take longer if all applications are not ended before stopping the system. To stop the system or logical partition, select the appropriate procedure. Stopping a system that is not managed by an HMC or an SDMC You might need to stop the system to perform another task. Use these instructions to stop the system using the power button or Advanced System Management Interface. Before you stop the system, follow these steps: 1. If an Integrated xSeries® Adapter (IXA) is present on the system, shut it down using IBM i options. 2. Ensure that all jobs are completed and end all applications. 3. Ensure that the operating system is stopped. Attention: Failure to do so can result in the loss of data. 4. If a Virtual I/O Server (VIOS) logical partition is running, ensure that all clients are shut down or that the clients have access to their devices using an alternate method. The following procedure describes how to stop a system that is not managed by a Hardware Management Console (HMC). 1. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down System) command. 2. At the command line, enter one of the following commands: v If your system is running the AIX operating system, type shutdown. v If your system is running the Linux operating system, type shutdown -h now. v If your system is running the IBM i operating system, type PWRDWNSYS. If your system is partitioned, use the PWRDWNSYS command to power down each of the secondary partitions. Then, use the PWRDWNSYS command to power down the primary partition. The command stops the operating system. The system power turns off, the power-on light begins to slowly flash, and the system goes into a standby state. 3. At the Linux command line, type shutdown -h now. Common procedures for installable features 89 The command stops the operating system. The system power turns off, the power-on light begins to slowly flash, and the system goes into a standby state. 4. At the Linux command line, type shutdown -h now. The command stops the operating system. The system power turns off, the power-on light begins to slowly flash, and the system goes into a standby state. 5. Record the IPL type and the IPL mode from the control panel display to help you return the system to this state when the installation or replacement procedure is completed. 6. Set the power switches of any devices connected to the system to off. 7. Unplug any power cables that are attached to the unit from electrical outlets. Ensure that you unplug power cables from peripheral devices, such as printers and expansion units. Important: The system is equipped with a second power supply. Before continuing with this procedure, ensure that all power sources to the system have been disconnected. (L003) or Stopping a system by using the Hardware Management Console You can use the Hardware Management Console (HMC) user interface to stop the system or a logical partition. By default, the managed system is set to power off automatically when you shut down the last running logical partition on the managed system. If you set the managed system properties on the HMC so that the managed system does not power off automatically, you must use this procedure to power off your managed system. 90 Power Systems: Serial-attached SCSI RAID enablement Attention: If possible, shut down the running logical partitions on the managed system before powering off the managed system. Powering off the managed system without shutting down the logical partitions first causes the logical partitions to shut down abnormally and can cause data loss. If you use a Virtual I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to their devices using an alternate method. To power off a managed system, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer Use the following steps to stop the system using the HMC. 1. In the Navigation area, expand the Systems Management folder. 2. Click the Servers icon. 3. In the Contents area, select the managed system. 4. Select Tasks, then Operations, and then Power Off 5. Select the appropriate power-off mode and click OK. Related information Shutting down and restarting logical partitions Stopping a system with the Systems Director Management Console You can use the IBM Systems Director Management Console (SDMC) user interface to stop the system or a virtual server. By default, the managed system is set to power off automatically when you shut down the last running virtual server on the managed system. If you set the managed system properties on the SDMC so that the managed system does not power off automatically, you must use this procedure to power off your managed system. Attention: If possible, shut down the running virtual servers on the managed system before powering off the managed system. Powering off the managed system without shutting down the virtual servers first causes the virtual servers to shut down abnormally and can cause data loss. If you use a Virtual I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to their devices with an alternate method. To power off a managed system, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer Use the following steps to stop the system using the SDMC. 1. In the Power Systems Resource area, select the managed system you want to power off. 2. From the Actions menu, select Operations ? Power Off. 3. Select the appropriate power-off mode and click OK. Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C Use these instructions to remove, replace, or install covers to access components or perform service. Common procedures for installable features 91 Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Remove the two shipping screws (if present) that secure the system to the rack. Note: Replacing the shipping screws is optional but should be done in geographic areas that are prone to seismic activity. 2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Open the front door (A) by swinging it towards the left as shown in the following figure. 2. Press latch (B) and pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Figure 55. Removing the front cover 92 Power Systems: Serial-attached SCSI RAID enablement Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to install the cover to access components or perform service. To install the front cover, follow these steps: 1. Push the cover onto the system. 2. Gently push the cover in until the four cover clips are seated in their respective mounting posts as shown in the following figure. The cover snaps into place and has an indentation where you can hold onto it more easily. Note: Replacing the shipping screws is optional but should be done in geographic areas that are prone to seismic activity. Figure 56. Removing the front cover Common procedures for installable features 93 Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to install the cover to access components or perform service. To install the front cover and front door, complete the following steps. 1. Align the cover (A) until the two cover tabs are seated into the slots on the base plate (B), as shown in the following figure. 2. Rotate the cover up and towards the system until the release latch is seated into its respective slot. The cover has an indentation where you can hold onto it more easily. Figure 57. Removing the front cover 94 Power Systems: Serial-attached SCSI RAID enablement 3. Hold the front door at an approximate 120 degree angle to the system as shown in the following figure. Engage the latches. Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to remove the service access cover. 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Figure 58. Installing the front cover Figure 59. Installing the front door Common procedures for installable features 95 Attention: When the service access cover is removed, the system powers down. Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to remove the service access cover. 1. Loosen the thumbscrew (A) located at the back of the outside cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Figure 60. Removing the service access cover 96 Power Systems: Serial-attached SCSI RAID enablement 3. Loosen the two thumbscrews (A) located at the back of the inside cover. 4. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Attention: When the service access cover is removed, the system powers down. Figure 61. Installing the outside service access cover Figure 62. Removing the inside service access cover Common procedures for installable features 97 Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to install the service access cover. 1. Place the service access cover on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (A) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to install the service access cover. 1. Place the inside service access cover (A) on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Figure 63. Installing the service access cover 98 Power Systems: Serial-attached SCSI RAID enablement 5. Place the outside service access cover (A) on top of the system. 6. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 7. Align the thumbscrew (B) located on the back of the service access cover with the hole on the back of the system chassis. 8. Tighten the thumbscrew to secure the service access cover. Figure 64. Installing the inside service access cover Common procedures for installable features 99 Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. Note: Replacing the shipping screws is optional but should be done in geographic areas that are prone to seismic activity. Figure 65. Installing the outside service access cover 100 Power Systems: Serial-attached SCSI RAID enablement 4. While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position Use this procedure to place the rack-mounted system into the operating position. Figure 66. Placing the system in the service position Common procedures for installable features 101 When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the blue rail safety latches (A) by lifting them upward. The rail safety latches are located near the middle of the system. 2. Push the system unit (B) back into the rack until both system-unit release latches have locked into position. 3. Replace and tighten the two screws (if present) (A) that secure the system unit (B) to the rack. Note: Replacing the two shipping screws is optional but should be done in geographic areas that are prone to seismic activity. 4. Close the front rack door of the unit you are servicing. Figure 67. Placing the system into the operating position 102 Power Systems: Serial-attached SCSI RAID enablement Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use these procedures to disconnect the power cords from the system. 1. For a rack-mounted system, open the back rack door on the unit you are servicing and identify the system unit you are servicing in the rack. 2. For both a rack-mounted system and a stand-alone system, disconnect all power cords from the unit you are servicing. This system might be equipped with two power supplies. Before continuing with any remove and replace procedures that require the power off, ensure that both power sources to the system have been completely disconnected. Figure 68. Placing the system into the operating position Common procedures for installable features 103 Figure 69. Removing the power cords for a rack-mounted system Figure 70. Removing the power cords for a stand-alone system 104 Power Systems: Serial-attached SCSI RAID enablement Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to connect the power cords to the system. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. Ensure that the cords are threaded through the handles. 3. Close the back rack door. Installing a feature by using the Hardware Management Console You can use the Hardware Management Console (HMC) to perform many service actions, including the installation of a new feature or part. To use the HMC user interface to install a feature or part into a system or expansion unit that is managed by an HMC, follow these steps: 1. In the navigation area, expand Systems Management ? Servers. 2. Select the managed system you will install the part in. 3. In the Tasks area expand Serviceability ? Hardware ? MES Tasks. 4. Select Add FRU (field replaceable unit). 5. In the Add/Install/Remove Hardware window select the system or enclosure into which you are installing the feature. 6. Select the type of feature you are installing from the menu and click Next 7. Select the location code for where you will install the feature, and click Add. 8. After the FRU is placed in the pending actions category click Launch Procedure and follow the instructions to install the feature. Figure 71. Connecting the power cords Common procedures for installable features 105 Note: The HMC might open external instructions for installing the feature. If so, follow those instructions to install the feature. Removing a part by using the Hardware Management Console You can use the Hardware Management Console (HMC) to perform many service actions, including the removal of a field replaceable unit (FRU) or part. To use the HMC user interface to remove a part in a system or expansion unit that is managed by an HMC, follow these steps: 1. In the navigation area, expand Systems Management ? Servers. 2. Select the managed system from which you are removing a part. 3. Select one of the following options. v If you are installing a new part as part of an upgrade, in the Tasks area expand Serviceability ? Hardware ? MES Tasks. v If you are replacing a part as part of a service action, in the Tasks area expand Serviceability ? Hardware ? Remove FRU. 4. To install a new part as part of an upgrade, select Add FRU. To replace a part as part of a service action, select Remove FRU. 5. In the Add/Install/Remove Hardware - Remove FRU, Select FRU Type window, select the system or enclosure from which you are removing the part. 6. Select the type of part you are removing from the menu and click Next. 7. Select the location of the part you are removing and click Add. 8. After the part is listed in the Pending Actions display click Launch Procedure and follow the instructions to remove the part. Note: The HMC might open the information center instructions for removing the part. If so, follow those instructions to remove the part. Replacing a part by using the Hardware Management Console You can use the Hardware Management Console (HMC) to perform many service actions, including exchanging a field replaceable unit (FRU) or part. If you are exchanging a part to repair a serviceable event follow those instructions. If you are exchanging a part as a part of any other procedure using HMC Version 7, or later, complete the following steps: 1. In the navigation area, expand Systems Management ? Servers. 2. Select the managed system in which you are exchanging a part. 3. Select one of the following options. v If you are installing a new part as part of an upgrade, in the Tasks area expand Serviceability ? Hardware ? MES Tasks. v If you are replacing a part as part of a service action, in the Tasks area expand Serviceability ? Hardware ? Exchange FRU. 4. To install a new part as part of an upgrade, select Add FRU. To replace a part as part of a service action, select Exchange FRU. 5. In the Replace Hardware - Replace FRU, Select FRU Type window select the system or enclosure in which you will exchange a part from the list. 6. Select the type of part you will exchange from the menu, and click Next. 7. Select the location code of the part you will exchange from the menu, and click Add. 8. After the FRU is placed in the pending actions category click Launch Procedure, and follow the instructions to exchange the feature. 106 Power Systems: Serial-attached SCSI RAID enablement Note: The HMC might open external instructions for replacing the part. If so, follow those instructions to replace the part. Verifying the installed part You can verify a newly installed or replaced part on your system, logical partition, or expansion unit using the operating system, stand-alone diagnostics, or the Hardware Management Console (HMC). Verifying an installed feature or replaced part in an AIX system or logical partition If you installed feature or replaced a part, you might want to use the tools in the AIX operating system to verify that the feature or part is recognized by the system or logical partition. To verify the operation of a newly installed feature or replacement part, select the appropriate procedure: v Verify the installed feature using AIX v Verifying the replaced part using AIX Verify the installed feature by using the AIX operating system: 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. Select Advanced Diagnostics Routines and press Enter. 4. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 5. When the Advanced Diagnostic Selection menu appears, do one of the following: v To test a single resource, select the resource that you just installed from the list of resources and press Enter. v To test all the resources available to the operating system, select All Resources and press Enter. 6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts that appear. 7. Did the diagnostics run to completion and display the message No trouble was found? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the installation procedures to ensure that the new feature is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system is running in logical partitioning (LPAR) mode, note the logical partition in which you installed the feature. Contact your service provider for assistance. v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to normal operations. Verify the replacement part by using the AIX operating system: To verify the operation of a newly installed feature or replacement part, follow these steps: 1. Did you use either the AIX operating system or the online diagnostics service aid concurrent (hot-swap) service to replace the part? No: Go to step 2. Yes: Go to step 5 on page 108. 2. Is the system powered off? No: Go to step 4 on page 108. Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see Performing a slow boot. 3. Start the system and wait until the AIX operating system login prompt is displayed or until apparent system activity on the operator panel or display has stopped. Common procedures for installable features 107 Did the AIX login prompt display? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the procedures for the part that you replaced to ensure that the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system does not start or you have no login prompt, see: Problems with loading and starting the operating system. If the system is partitioned, note the logical partition in which you replaced the part. Contact your service provider for assistance. v Yes: Go to step 4. 4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a command prompt, go to step 5. If the Diagnostic selection menu is shown with M appearing next to any resource, follow these steps: a. Select the resource and press Enter. b. Select Commit. c. Follow any instructions that are shown. d. If the Do you want to review the previously displayed error? message is shown, select Yes and press Enter. e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record the SRN and contact your service provider for assistance.. f. If no SRN is shown, go to step 5. 5. Test the part by doing the following steps: a. At the command line, type diag and press Enter. b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. c. From the Diagnostic Mode Selection menu, select System Verification and press Enter. d. Select All Resources, or select the diagnostics for the individual part to test only the part you replaced and any devices that are attached to the part you replaced and press Enter. Did the Resource Repair Action menu appear? No: Go to step 6. Yes: Go to step 7. 6. Did the Testing Complete, No trouble was found message appear? v No: There is still a problem. Contact your service provider. This ends the procedure. v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0 and press Enter. Tip: This action changes the indicator light for the part from the fault state to the normal state. Go to step 9 on page 109. 7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: On systems with an indicator light for the failing part, this action changes the indicator light to the normal state. 108 Power Systems: Serial-attached SCSI RAID enablement a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the resource list, select sysplanar0 and press Enter. b. Select Commit after you make your selections. Did another Resource Repair Action display appear? No: If the No Trouble Found display appears, go to step 9 Yes: Go to step 8. 8. Select the parent or child of the resource for the replaced part from the Resource Repair Action menu if necessary. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: This action changes the indicator light for the part from the fault state to the normal state. a. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was to reseat a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the resource list, select sysplanar0 and press Enter. b. Select Commit after you make your selections. c. If the No Trouble Found display appears, go to step 9. 9. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the values they had prior to servicing the system. 10. Did you do any hot-plug procedures before doing this procedure? No: Go to step 11. Yes: Go to step 12. 11. Start the operating system, with the system or logical partition in normal mode. Were you able to start the operating system? No: Contact your service provider. This ends the procedure. Yes: Go to step 12. 12. Are the indicator lights still on? v No. This ends the procedure. v Yes. Turn off the lights. See the following for instructions: Changing service indicators. Verifying the installed part in an IBM i system or logical partition If you have installed a new feature or part, verify that the system recognizes the feature or part by using the IBM i system service tools. To verify the installed part, follow these steps: 1. Deactivate the failing item indicator light. For instructions, see “Deactivating the failing-part indicator light” on page 84. 2. Sign on with at least service level authority. 3. On the command line of the IBM i session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service Focal Point Utilities to get to the Dedicated Service Tools (DST) display. 4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display and press Enter. Common procedures for installable features 109 Note: The service tools password is case-sensitive. 5. Select Start a service tool from the System Service Tools (SST) display and press Enter. 6. Select Hardware service manager from the Start a Service Tool display and press Enter. 7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager display and press Enter. This option allows you to display and work with logical resources. Logical hardware resources are the functional resources of the system used by the operating system. With the Logical Hardware Resources display, you can show logical hardware resource status or information, and associated packaging hardware resources. Use the online Help information to better understand specific functions, fields, or symbols. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. Select option 7 (Indicator off) to turn off the indicator light. 12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all problems have been resolved. 13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display. Verifying the installed part in a Linux system or logical partition If you have installed a new part, learn how to verify that the system recognizes the part. To verify the newly installed or replaced part, continue with “Verifying an installed part by using stand-alone diagnostics” on page 111. 110 Power Systems: Serial-attached SCSI RAID enablement Verifying an installed part by using stand-alone diagnostics If you have installed or replaced a part, verify that the system recognizes the new part. You can use stand-alone diagnostics to verify an installed part in an AIX or Linux system, expansion unit, or logical partition. v If this server is directly attached to another server or attached to a network, ensure communications with the other servers have stopped. v The stand-alone diagnostics require use of all of the logical partition resources. No other activity can be running on the logical partition. v The stand-alone diagnostics require access to the system console. You access these diagnostics from a CD-ROM or from the Network Installation Management (NIM) server. This procedure describes how to use the diagnostics from a CD-ROM. For information on running diagnostics from the Network Installation Management (NIM) server, see Running stand-alone diagnostics from a Network Installation Management server. To use stand-alone diagnostics, follow these steps: 1. Stop all jobs and applications and then stop the operating system on the system or logical partition. 2. Remove all tapes, diskettes, and CD-ROM. 3. Turn off the system unit power. The next step boots the server or logical partition from the stand-alone diagnostics CD-ROM. If the optical drive is not available as the boot device on the server or logical partition on which you are working, follow these steps: a. Access the ASMI. See Accessing the ASMI for information on using the ASMI. b. On the ASMI main menu, click on Power/Restart Control. c. Click Power On/Off System. d. Select the Service mode boot from default boot list option in the AIX or Linux logical partition mode boot drop-down menu. e. Click Save settings and power on. As soon as the optical drive has power, insert the standalone diagnostic CD-ROM. f. Go to step 5. 4. Turn on the system unit power and immediately insert the diagnostics CD-ROM into the optical drive. 5. After the keyboard POST indicator displays on the system console and before the last POST indicator (speaker) displays, press the numeric 5 key on the system console to indicate that a service mode boot should be initiated using the default-service mode boot list. 6. Enter any requested password. 7. At the Diagnostic Operating Instructions display, press Enter. Tip: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Note: If you received an SRN or any other reference code when you attempted to start the system, contact your service provider for assistance.. 8. If the terminal type is requested, select the Initialize Terminal option on the Function Selection menu to initialize the operating system. 9. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. 10. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 11. When the Advanced Diagnostic Selection menu appears, select All Resources, or test only the part you replaced, and any devices that are attached to the part you replaced, by selecting the diagnostics for the individual part and press Enter. 12. Did the Testing Complete, No trouble was found message appear? Common procedures for installable features 111 v No: There is still a problem. Contact your service provider. v Yes: Go to step 13. 13. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the value they had prior to servicing the system. 14. If the indicator lights are still on, follow these steps: a. Select Identify and Attention Indicators from the Task Selection menu to turn off the system attention and indicator lights and press Enter. b. Select Set System Attention Indicator to NORMAL and press Enter. c. Select Set All Identify Indicators to NORMAL and press Enter. d. Choose Commit. Note: This changes the system attention and identify indicators from the Fault state to the Normal state. e. Exit to the command line. Verifying the installed part by using Hardware Management Console If you have installed or replaced a part, use the Hardware Management Console (HMC) to update your HMC records after you have completed a service action on your server. If you have reference codes, symptoms, or location codes that you used during the service action, locate the records for use during this procedure. To verify the installed part, complete these steps: 1. At the HMC, examine the service action event log for any open service action events. See “Viewing serviceable events by using the HMC” on page 114 for details. 2. Are there any service action events that are open? No: If the system attention LED is still on, use the HMC to turn off the LED. See “Activating and deactivating LEDs by using the HMC” on page 113. This ends the procedure. Yes: Continue with the next step. 3. Record the list of open service action events. 4. Examine the details of the open service action event. Is the error code associated with this service action event the same as you gathered earlier. v No: Select one of the following options: – Review the other serviceable events, find one that does match, and continue with the next step. – If the log does not match what you had gathered earlier, contact your service provider. v Yes: Continue with the next step. 5. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 6. Click Close Event. 7. Add comments for the serviceable event. Include any unique additional information. Click OK. 8. Did you replace, add, or modify a field replaceable unit (FRU) of the open service action event? v No: Select the No FRU Replaced for this Serviceable Event option, and click OK to close the service action event. v Yes: Perform the following steps: a. From the FRU list, select a FRU that you need to update. b. Double-click the FRU and update the FRU information. c. Click OK to close the service action event. 9. If you continue to have problems, contact your service provider. 112 Power Systems: Serial-attached SCSI RAID enablement Activating and deactivating LEDs by using the HMC Use this procedure to activate or deactivate LEDs using Service Focal Point for the Hardware Management Console (HMC). Choose from the following: v “Deactivating a system attention LED or partition LED” v “Activating or deactivating an identify LED” Deactivating a system attention LED or partition LED: You can deactivate a system attention LED or a logical partition LED. For example, you might determine that a problem is not a high priority and decide to repair the problem at a later time. However, you want to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be activated again if another problem occurs. 1. In the navigation area, open Systems Management. 2. Open Servers and select the appropriate system. 3. In the content area, check the box for the appropriate Partition. 4. Select Tasks, then Operations, and then Manage Attention LED. 5. Select the appropriate Partition. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED. Activating or deactivating an identify LED: The system provides several LEDs that help identify various components, such as enclosures or field replaceable units (FRUs), in the system. For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. In the navigation area, open Systems Management. 2. Select Servers. 3. In the content area, check the box for the appropriate System. 4. Select Tasks, then Operations, then LED Status, and then Identify LED. Common procedures for installable features 113 5. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 6. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected ? List FRUs. 7. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Viewing serviceable events by using the HMC Use this procedure to view a serviceable event, including details, comments, and service history. To view serviceable events and other information about the events, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer v Viewer To view serviceable events, follow these steps: 1. In the navigation area, select Service Management. 2. Select Manage Serviceable Events. 3. Select the criteria for the serviceable events that you want to view, and click OK. The Serviceable Event Overview window opens. The list shows all serviceable events that match your selection criteria. You can use the menu options to perform actions on the serviceable events. 4. Select a line in the Serviceable Event Overview window, and select Selected ? View Details The Serviceable Event Details window opens, showing detailed information about the serviceable event. The upper table shows information, such as problem number and reference code. The lower table shows the field replaceable units (FRUs) associated with this event. 5. Select the error for which you want to view comments and history, and follow these steps: a. Select Actions ? View Comments. b. When you are finished viewing the comments, click Close. c. Select Actions ? View Service History. The Service History window opens, showing service history associated with the selected error. d. When you are finished viewing the service history, click Close. 6. When you are finished, click Cancel twice to close the Serviceable Event Details window and the Serviceable Event Overview window. Verifying the installed part by using Systems Director Management Console If you installed or replaced a part, use the IBM Systems Director Management Console (SDMC) to update your SDMC records after you have completed a service action on your server. If you have reference codes, symptoms, or location codes that you used during the service action, locate the records for use during this procedure. To verify the installed part, complete these steps: 1. From the SDMC, examine the service action event log for any open service action events. See “Viewing serviceable events by using the Systems Director Management Console” on page 116 for details. 2. Are there any service action events that are open? 114 Power Systems: Serial-attached SCSI RAID enablement No: If the system attention LED is still on, use the SDMC to turn off the LED. See “Activating and deactivating LEDs by using the Systems Director Management Console.” This ends the procedure. Yes: Continue with the next step. 3. Record the list of open service action events. 4. Examine the details of the open service action event. Is the error code associated with this service action event the same as you gathered earlier. v No: Select one of the following options: – Review the other serviceable events, find one that does match, and continue with the next step. – If the log does not match what you had gathered earlier, contact your service provider. v Yes: Continue with the next step. 5. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 6. Click Delete or Ignore. Note: These options are only available from the problem event log. Activating and deactivating LEDs by using the Systems Director Management Console Use this procedure to activate or deactivate LEDs by using the IBM Systems Director Management Console (SDMC). Choose from the following: v “Deactivating a system attention LED or partition LED” v “Activating or deactivating an identify LED by using the SDMC” Deactivating a system attention LED or partition LED: You can deactivate a system attention LED or a logical partition LED. For example, you might determine that a problem is not a high priority and decide to repair the problem at a later time. However, you want to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be activated again if another problem occurs. 1. On the Resources tab, select the appropriate host or virtual server. 2. Select Actions ? Service and Support ? Hardware ? System Attention LED. 3. Select Deactivate System Attention LED. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 4. Select one of the virtual servers, and select Deactivate System Attention LED. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the virtual server LED. Activating or deactivating an identify LED by using the SDMC: The system provides several LEDs that help identify various components, such as enclosures or field replaceable units (FRUs). For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: Common procedures for installable features 115 v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. On the Resources tab, select the appropriate host or virtual server. 2. Select Actions ? Service and Support ? Hardware ? Identify LED. 3. In the Identify LED, Select Enclosure window, select the system unit or enclosure. 4. To activate or deactivate an identify LED, click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 5. To activate or deactivate an identify LED for a FRU, select a system or enclosure from the table, and then select List FRUs.... 6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Viewing serviceable events by using the Systems Director Management Console Use this procedure to view a serviceable event, including details, comments, and service history. To view serviceable events, follow these steps: 1. On the Resources tab, select the appropriate host or virtual server. 2. Select Actions ? System Status and Health ? Event Log. 3. Optional: You can narrow the event criteria using the Event filter menu. 4. Select a line in the Events window, and select Actions ? Properties The Properties window opens, showing detailed information about the serviceable event. The table shows information, such as problem number, reference code, and the field replaceable units (FRUs) associated with this event. Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools If you installed feature or replaced a part, you might want to use the tools in Virtual I/O Server (VIOS) to verify that the feature or part is recognized by the system or logical partition. To verify the operation of a newly installed feature or replacement part, select the appropriate procedure: v Verify the installed feature using VIOS v Verifying the replaced part using VIOS Verify the installed feature by using VIOS: 1. Log in as root user. 2. At the command line, type diagmenu and press Enter. 3. Select Advanced Diagnostics Routines and press Enter. 4. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 5. When the Advanced Diagnostic Selection menu appears, do one of the following: v To test a single resource, select the resource that you just installed from the list of resources and press Enter. v To test all the resources available to the operating system, select All Resources and press Enter. 116 Power Systems: Serial-attached SCSI RAID enablement 6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts that appear. 7. Did the diagnostics run to completion and display the message No trouble was found? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the installation procedures to ensure that the new feature is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system is running in LPAR mode, note the logical partition in which you installed the feature. Contact your service provider for assistance. v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to normal operations. Verify the replacement part by using VIOS: To verify the operation of a newly installed feature or replacement part, follow these steps: 1. Did you replace the part by using either VIOS or the online diagnostics service aid's concurrent (hot-swap) service operation? No: Go to step 2. Yes: Go to step 5. 2. Is the system powered off? No: Go to step 4. Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see Performing a slow boot. 3. Start the system and wait until the VIOS operating system login prompt displays or until apparent system activity on the operator panel or display has stopped. Did the VIOS login prompt display? v No: If an SRN or other reference code is displayed, suspect a loose adapter or cable connection. Review the procedures for the part that you replaced to ensure that the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system does not start or you have no login prompt, see: Problems with loading and starting the operating system. If the system is partitioned, note the logical partition in which you replaced the part. Contact your service provider for assistance. v Yes: Go to step 4 4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a command prompt, go to step 5. If the Diagnostic selection menu is shown with M appearing next to any resource, follow these steps: a. Select the resource and press Enter. b. Select Commit. c. Follow any instructions that are shown. d. If a Do you want to review the previously displayed error? message is shown, select Yes and press Enter. e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record the SRN and contact your service provider for assistance.. f. If no SRN is shown, go to 5. 5. Test the part by doing the following: a. At the command line, type diagmenu and press Enter. b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. c. From the Diagnostic Mode Selection menu, select System Verification and press Enter. Common procedures for installable features 117 d. Select All Resources, or select the diagnostics for the individual part to test only the part you replaced, and any devices that are attached to the part you replaced and press Enter. Did the Resource Repair Action menu appear? No: Go to step 6. Yes: Go to step 7. 6. Did the Testing Complete, No trouble was found message appear? v No: There is still a problem. Contact your service provider. This ends the procedure. v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the Resource List, select sysplanar0 and press Enter. Tip: This action changes the indicator light for the part from the fault state to the normal state. Go to step 9 7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: On systems with a indicator light for the failing part, this changes the indicator light to the normal state. a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the Resource List, select sysplanar0. Press Enter. b. Select Commit after you make your selections. Did another Resource Repair Action display appear? No: If the No Trouble Found display appears, go to step 9 Yes: Go to step 8. 8. Select the parent or child of the resource for the replaced part from the Resource Repair Action menu if necessary. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: This changes the indicator light for the part from the fault state to the normal state. a. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was to reseat a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the Resource List, select sysplanar0. Press Enter. b. Select Commit after you make your selections. c. If the No Trouble Found display appears, go to step 9. 9. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the values they had prior to servicing the system. 10. Did you do any hot-plug procedures before doing this procedure? No: Go to step 11. Yes: Go to step 12 on page 119. 11. Start the operating system, with the system or logical partition in normal mode. Were you able to start the operating system? No: Contact your service provider. This ends the procedure. 118 Power Systems: Serial-attached SCSI RAID enablement Yes: Go to step 12. 12. Are the indicator lights still on? v No. This ends the procedure. v Yes. Turn off the lights. See one of the following for instructions: Changing service indicators Verifying a repair Use these procedures to verify hardware operation after making repairs to the system. Choose from the following options: v To verify the repair of a system that is currently powered off, go to step 1. v To verify the repair of a system that is currently powered on with no operating system loaded, go to step 3. v To verify the repair of a system that is currently powered on and that has an operating system loaded, go to step 5. 1. Power on the server and all attached I/O enclosures. Did all the enclosures power on? Yes: Go to step 3.? Go to step 3. No: Continue with the next step. 2. Choose from the following options: v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate and replace the next field-replaceable unit (FRU). v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that an enclosure would not power on and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 3. Load the operating system. Did the operating system load successfully? Yes: Go to step 5. No: Continue with the next step. 4. Choose from the following options: v If the original problem was a failing disk drive that contained the operating system software, go to step 5. v If the original problem was that the operating system would not load and you have another FRU to replace, go to your FRU locations section to locate the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that the operating system would not load and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that the operating system would not load and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 5. Choose from the following options: v “Verifying the repair in AIX” on page 120 v “Verifying the repair in Linux” on page 125 Common procedures for installable features 119 v “Verifying a repair using an IBM i system or logical partition” on page 123 Verifying the repair in AIX You can use this procedure to verify that a repair is complete using the AIX operating system. Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed. 1. Did you replace a disk drive in the root volume group? No Go to step 3. Yes Continue with the next step. 2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM) server. Did you encounter any problems? No Reinstall the operating system and continue with step 5. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 3. Did you replace a FRU with the power turned on and concurrently with system operations? No Go to step 5. Yes Continue with the next step. 4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? Yes Go to step 6. No Go to step 7 on page 121. Note: The AIX diagnostic service aid was used if a resource was removed using the Hot Plug task. 5. Note: If any FRUs have been removed that should be reinstalled, reinstall them now. Perform the following steps: 1. If the system is not powered on, power it on now. 2. Perform a slow boot. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. 4. Did you encounter any problems? No Continue at step 6. Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 6. If the Resource Repair Action menu is already displayed, go to step 9 on page 121; otherwise, perform the following steps: 120 Power Systems: Serial-attached SCSI RAID enablement 1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with the next step. 7. Perform the following steps: 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select System verification. 5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? No Continue with the next step. Yes Go to step 9. 8. Did the Testing Complete, no trouble was found menu (801010) display? Yes Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this action will set it back to the normal state. Go to step 11 on page 122. No If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on that resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: Common procedures for installable features 121 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? No If the No Trouble Found menu displays, go to step 11. Yes Continue with the next step. 10. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: 1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. 3. If the No Trouble Found menu displays, continue with the next step. 11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This information does not apply to the PCI-X RAID adapter or cache. No Go to the close of call procedure. Yes Continue with the next step. 12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the following steps: 122 Power Systems: Serial-attached SCSI RAID enablement 1. On the PCI SCSI Disk Array Manager display, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. If you see a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When the recovery is complete, exit the operating system. 9. Go to the close of call procedure. Verifying a repair using an IBM i system or logical partition Use this procedure to verify a repair using the IBM i operating system. 1. Was the system powered off during the repair? Yes: Continue with the next step. No: Continue with step 3. 2. Perform the following tasks: a. Verify that the power cable is plugged into the power outlet. b. Verify that power is available at the customer's power outlet. 3. Was the partition powered off during the repair? Yes: Continue with the next step. No: Continue with step 6. 4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type mode and speed options in the Service functions). 5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the system complete the IPL? Yes: Continue with the next step. No: This might be a new problem. Go to the Start of call procedure. This ends the procedure. 6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O adapter, or storage device replaced? Yes: Continue with step 10. No: Continue with the next step. 7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any reference codes that are related to this IPL? Yes: Continue with the next step. No: If the problem was related to removable media or communications, perform the verification procedures in the Service functions to verify that the problem is corrected. Then return the system to the customer and have the customer verify the system date and time. This ends the procedure. Common procedures for installable features 123 8. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure. This ends the procedure. 9. Are there any other failing items that remain to be replaced? Yes: Replace the next failing item listed for this reference code. This ends the procedure. No: Contact your next level of support for assistance. This ends the procedure. 10. Was concurrent maintenance performed on an optical storage unit? Yes: The product activity log and service action log, in most cases, contain a reference code for the optical storage unit when concurrent maintenance is performed. You can ignore this reference code. Perform the following: - Perform the verification procedures in the Service functions topic to verify that the problem is corrected. - Return the system to the customer and have the customer verify the system date and time. This ends the procedure. No: Continue with the next step. 11. Use the service action log to look for any new reference codes (see Using the Service Action Log). Are there any new reference codes? Yes: Continue with the next step. No: Go to step 14. 12. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure to determine the cause of the problem. This ends the procedure. 13. Are there any other failing items that need to be replaced? Yes: Replace the next failing item listed for the reference code. This ends the procedure.. No: Contact your next level of support for assistance. This ends the procedure. 14. Are you working with a tape device? Yes: Perform the verification procedures in the Service functions to verify that the problem is corrected. After the verification test is complete, the tape device description will be set to the failed state because a resource change was detected. Perform the following tasks: - Vary the tape device description off and then on. - Return the system to the customer and have the customer verify the system date and time. Then go to Verifying the repair from the HMC. This ends the procedure. No: Continue with the next step. 15. Are you working with an IOP or an IOA? Yes: Use the display hardware configuration service function to check for any missing or failed hardware: - On the command line, enter the STRSST (Start System Service Tools command). If you cannot get to SST, select DST. Do not IPL the system or partition to get to DST. - On the Start Service Tools Sign On display, enter the user ID with the service authority and password. - Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System bus resources. - Select the function key for Include nonreporting resources. - If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has not been fixed. Continue to the next failing item in the failing item list. This ends the procedure. 124 Power Systems: Serial-attached SCSI RAID enablement No: Perform the verification procedures in the Service functions topics to verify that the problem is corrected. Resources that usually vary on automatically during an IPL, or that were previously varied on manually, might need to be varied on again after the verification procedures are complete. Return the system to the customer and have the customer verify the system date and time. This ends the procedure. Verifying the repair in Linux You can use this procedure to verify that a repair is complete using the Linux operating system. 1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM) server. See Running the stand-alone diagnostics from CD-ROM. Did you encounter any problems? No Reboot the operating system and continue with the close of call procedure. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis and repair the new problem. Verifying the repair from the management console Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer using the management console. Follow this checklist before performing the procedures: v You return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the HMC online repair procedures, ensure that the original serviceable event is now closed. 1. Is a management console used to manage the server that you are servicing? v Yes: Continue with the next step. v No: Return to the “Verifying a repair” on page 119. This ends the procedure. 2. Are you closing a service event that was a repair on the management console personal computer? v Yes: Continue with the next step. v No: Go to step 4. 3. Power on the management console. Did the power-on process complete without errors? v Yes: Ensure that the management console can be used to perform server management tasks, and return the management console to normal operations. Go to “Closing a service call” on page 126. This ends the procedure. v No: Go to the HMC isolation procedures. This ends the procedure. 4. Log into the management console as the service representative. If invalid user or invalid password displays, get the correct login information from the system administer. Common procedures for installable features 125 1. If logged on the System Manager, select Exit from the Console, located on the System Manager window. 2. Login to the System Manager with the following: User identification service Password service mode 5. View Serviceable Event Details. 1. In the navigation area, select the Service Applications icon. 2. In the navigation area, select the Service Focal Point icon. 3. In the Contents area, select Manage Serviceable Events. 4. Designate the set of Serviceable Events you want to view. When you are finished, click OK. The Service Event Overview window opens. Note: Only the events that match all of the criteria you specify are shown. 6. Close open or delayed events. 1. Select the problem to close, on the Service Event Overview window. 2. Select the menu Selected, located on the menu bar. 3. Click Close Event. 4. Enter your comments in the Serviceable Event Comments window, and click Close Event. 5. Close all events associated with the problem on which you were working. 7. Did the Service Event Overview window contain the event or events on which you were working? v Yes: Return the HMC to normal operations. Go to “Closing a service call.” This ends the procedure. v No: Go to Detecting problems. This ends the procedure. Closing a service call Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable events might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and no problems require additional service actions. v If the repair was done using the management console online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. Is the server managed by a management console? v Yes: Continue with the next step. v No: Do one of the following steps: – If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using Integrated Virtualization Manager” on page 135. – If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call using AIX or Linux” on page 131. 126 Power Systems: Serial-attached SCSI RAID enablement 2. On the HMC, open Manage Serviceable Events and examine the service action event log for any open service action events. 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 139. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32 on page 129 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. Is the error code that is associated with this service action event the same as recorded in Step 1 on page 126? v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 126? v Yes: Go to step 11. v No: Continue with the next step. 10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 126 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 32 on page 129. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. Common procedures for installable features 127 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? v Yes: Continue with the next step. v No: Go to step 17. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 127. Go to step 16. 15. Add the partition list obtained in step 11 on page 127 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11 on page 127. If you are referred to the list of partitions obtained in step 11 on page 127 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. 20. Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU information. Go to step 23. 22. Select the No FRU Replaced for this Serviceable Event option. 23. Click OK to close the service action event. 128 Power Systems: Serial-attached SCSI RAID enablement 24. Is the list of all partitions you recorded in step 11 on page 127 empty? v Yes: Go to step 32. v No: Continue with the next step. 25. Does the list of all partitions you recorded in step 11 on page 127 contain more than one entry? v Yes: Continue with the next step. v No: Go to step 32. 26. Is the error class recorded in step 25 AIX? v Yes: Continue with the next step. v No: Go to step 32. 27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 127, except the partition you were using to debug the original problem. 28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 29. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 30. Exit from diagnostics in this partition and return to the AIX command prompt. 31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 127 been processed? v Yes: Continue with the next step. v No: Go to step 24 to process the next partition in the list you recorded in step 11 on page 127. 32. Have all the serviceable events recorded in step 4 on page 127 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 127 and process the next service action event in the list of serviceable events recorded in step 4 on page 127. Common procedures for installable features 129 33. While processing all service action events, were you directed to step 14 on page 128? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 139. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 34. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 128, except the partition you were using to debug the original problem. 35. From the management console virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 36. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 37. Exit from diagnostics in this partition and return to the AIX command prompt. 38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 128 been processed? v Yes: Continue with the next step. v No: Go to step 34 to process the next partition in the list you recorded in step 14 on page 128. 39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 139. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 130 Power Systems: Serial-attached SCSI RAID enablement Closing a service call using AIX or Linux If the server is not connected to an management console and not using Integrated Virtualization Manager (IVM), perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and that no problems require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? v Yes: Go to step 4 v No: Continue with the next step. 2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that were removed during problem analysis that you want to put back into the system? Note: If the system planar or battery has been replaced and you are loading diagnostics from a server over a network, it might be necessary for the customer to set the network boot information for this system before diagnostics can be loaded. The system time and date information should also be set when the repair is completed. v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3 v No: Continue with the next step. 3. Is the system or logical partition that you are performing a repair action on running the AIX operating system? v Yes: Continue with the next step. v No: Go to step 5. 4. Does the system or logical partition you are performing a repair action on have AIX installed? Note: Answer no to this question if you have just replaced a hard disk in the root volume group. v Yes: Go to step 7 on page 132. v No: Continue with the next step. 5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a Network Installation Management (NIM) server. Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to Running the stand-alone diagnostics from CD on a server without an HMC attached. Common procedures for installable features 131 For instructions on running stand-alone diagnostics from a NIM server, go to Running the stand-alone diagnostics from a Network Installation Management server. Did you encounter any problems? v Yes: Go to problem analysis. v No: Continue with the next step. 6. The system hardware is functioning correctly. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 139. This completes the repair. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. This might require you to reboot the operating system. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 7. Complete the following steps: 1. If the system supports slow boot (see Performing a slow boot), do a slow boot on the system. If the system does not support slow boot, do a normal boot. 2. Power on the system. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. Did the AIX Login Prompt display? v Yes: Continue with the next step. v No: Go to problem analysis. 8. If the Resource Repair Action menu is already displayed, go to 12 on page 133; otherwise, do the following steps: 1. Log into the operating system, either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an system request number (SRN) displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with 9. 9. Complete the following steps: 132 Power Systems: Serial-attached SCSI RAID enablement 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select Problem determination. 5. When the Advanced Diagnostic Selection menu displays, select the All Resources option, or test the FRUs you exchanged, and any devices that are attached to the FRUs you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? v Yes: Go to step 13. v No: Continue with the next step. 10. Did the TESTING COMPLETE, no trouble was found menu (801010) display? v Yes: Continue with the next step. v No: There is still a problem. Go to problem analysis. 11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this will set it back to the normal state. Go to step 14 on page 134. 12. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following steps: 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? v Yes: Continue with the next step. v No: If the No Trouble Found menu displays, go to step 14 on page 134. 13. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. Common procedures for installable features 133 When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following steps: 1. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. Did the No Trouble Found menu display? v Yes: Continue with the next step. v No: Go to problem analysis. 14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This does not refer to the PCI-X RAID adapter or cache. v Yes: Continue with the next step. v No: Go to step 16. 15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following steps: 1. On the PCI SCSI Disk Array Manager dialog, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next dialog, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating system. 9. Go to 16. 16. The system hardware is functioning correctly. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. 134 Power Systems: Serial-attached SCSI RAID enablement Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. Closing a service call using Integrated Virtualization Manager Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. 2. On the IVM, open Manage Serviceable Events and look at existing serviceable events. 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 139. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 138 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in step 1? Common procedures for installable features 135 v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 135? v Yes: Go to step 11. v No: Continue with the next step. 10. Is the FRU you replaced and recorded in step 1 on page 135 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 30 on page 138. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? v Yes: Continue with the next step. v No: Go to step 17. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16. 15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of partitions obtained in step 11 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Manage Serviceable Events window. 136 Power Systems: Serial-attached SCSI RAID enablement 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. 20. Add or update FRU information: Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. Click OK to close the service action event. 22. Is the list of all partitions you recorded in step 11 on page 136 empty? v Yes: Go to step 30 on page 138. v No: Continue with the next step. 23. Does the list of all partitions you recorded in step 11 on page 136 contain more than one entry? v Yes: Continue with the next step. v No: Go to step 30 on page 138. 24. Is the error class recorded in step 23? v Yes: Continue with the next step. v No: Go to step 30 on page 138. 25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 136, except the partition you were using to debug the original problem. 26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 27. When the diagnostic operating instructions are displayed, do the following steps: Common procedures for installable features 137 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 28. Exit from diagnostics in this partition and return to the AIX command prompt. 29. Have all the partitions in the list of all partitions you recorded in step 11 on page 136 been processed? v Yes: Continue with the next step. v No: Go to step 25 on page 137 to process the next partition in the list you recorded in step 11 on page 136. 30. Have all the serviceable events recorded in step 4 on page 135 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 135 and process the next service action event in the list of serviceable events recorded in step 4 on page 135. 31. While processing all service action events, were you directed to step 14 on page 136? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 139. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 32. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 136, except the partition you were using to debug the original problem. 33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 34. When the diagnostic operating instructions are displayed, do the following steps: 138 Power Systems: Serial-attached SCSI RAID enablement 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 35. Exit from diagnostics in this partition and return to the AIX command prompt. 36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 136 been processed? v Yes: Continue with the next step. v No: Go to step 32 on page 138 to process the next partition in the list you recorded in step 14 on page 136. 37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs.” This completes the repair. Return the system to the customer. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Activating and deactivating LEDs You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the management console or the Advanced System Management Interface (ASMI). For IBM Power Systems servers that contain the POWER7 processor, the LEDs can be used to identify or verify a part that you are servicing. The error and identify function (amber) LED indicates an error and corresponds to the location code in the system reference code (SRC). The LED is activated and deactivated automatically. Additionally, the following procedures can also be used to activate and deactivate LEDs. v “Deactivating a system attention LED or partition LED using the management console” v “Activating or deactivating an identify LED using the management console” on page 140 v “Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface” on page 141 v “Activating or deactivating an identify LED using the Advanced System Management Interface” on page 141 Deactivating a system attention LED or partition LED using the management console You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not a high priority and you decide to repair the problem at a later time. You can do this task from the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). Common procedures for installable features 139 If you want to be alerted if another problem occurs, you must deactivate the system attention LED so that it can be activated again if another problem occurs. To deactivate a system attention LED by using HMC, complete the following steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select View System Attention. The system attention LED window opens. The selected system and its LED state are displayed in the upper part of the window. The logical partition and its LED state are displayed in the lower part of the window. From the system attention LED window, you can deactivate both the system attention LED and the logical partition LED. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED. To deactivate a system attention LED by using SDMC, complete the following steps: 1. On the Welcome page, under the Resources tab, select the server. 2. Click Actions > Service and Support> Hardware> System Attention LED. 3. Click OK. Activating or deactivating an identify LED using the management console You can activate or deactivate an identify LED for components attached to the system from the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). The system provides several LEDs that help identify various components, such as enclosures or field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU by using the HMC, follow these steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on. 140 Power Systems: Serial-attached SCSI RAID enablement 3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens. 4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select Selected ? List FRUs. 6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. To activate or deactivate an identify LED for an enclosure or FRU by using the SDMC, complete these steps: 1. On the Welcome page, under the Resources tab, select the server. 2. Click Actions > Service and Support> Hardware> Identify LED. 3. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 4. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then click List FRUs. 5. Select one or more FRUs from the table, and then click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface You can deactivate a system attention LED or a logical partition LED using the Advanced System Management Interface (ASMI). The system attention indicator provides a visual signal that the system as a whole requires attention or service. Each system has a single system attention indicator. When an event occurs that either needs your intervention or that of service and support, the system attention indicator lights continuously. The system attention indicator is turned on when an entry is made in the service processor error log. The error entry is transmitted to the system error log and the operating system error log. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To turn off the system attention indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention Indicator. 3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error message is displayed. Activating or deactivating an identify LED using the Advanced System Management Interface You can activate or deactivate an identify LED using the Advanced System Management Interface (ASMI). You can specify the location code of any indicator to view or modify its current state. If you provide the wrong location code, the advanced system manager attempts to go to the next higher level of the location code. Common procedures for installable features 141 The next level is the base-level location code for that field replaceable unit (FRU). For example, a user types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no other level is available. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To change the current state of an indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location code. 3. In the right pane, enter the location code of the FRU and click Continue. 4. Select the preferred state from the list. 5. Click Save settings. Installing a part by using the Systems Director Management Console You can use the IBM Systems Director Management Console (SDMC) to perform many service actions, including the installation of a new field-replaceable unit (FRU) or part. To use the SDMC user interface to install a part into a system or expansion unit that is managed by an SDMC, follow these steps: 1. In the Power Systems Resources area, select the system on which you want to install a part. 2. From the Actions menu, expand Service and Support ? Hardware ? MES Tasks ? Add FRU. 3. On the ADD FRU tab, select the system or enclosure type. 4. Select the FRU type you are installing from the menu and click Next 5. Select the location code for where you want to install the part, and click Add. 6. After the FRU is placed in the pending actions category, click Launch Procedure and follow the instructions to install the part. Note: The SDMC might open external instructions for installing the feature. If so, follow those instructions to install the part. Replacing a part by using the Systems Director Management Console You can use the IBM Systems Director Management Console (SDMC) to perform many service actions, including exchanging a field replaceable unit (FRU) or part. If you are exchanging a part to repair a serviceable event follow those instructions. 1. In the Power Systems Resource area, select the managed system in which you are exchanging a part. 2. Select one of the following options: v If you are replacing a part that is not part of a serviceable event, from the Actions menu, expand Service and Support ? Hardware ? Exchange FRU. v If you are exchanging a part to repair a serviceable event, see Starting a repair action. 3. To replace a part as part of a service action, select Exchange FRU. 4. From the Exchange FRU tab, select the system or expansion unit in which you want to exchange a part. 5. Select the type of part you want to exchange from the menu and click Next. 142 Power Systems: Serial-attached SCSI RAID enablement 6. Select the location code of the part you want to exchange from the menu and click Add. 7. After the FRU is placed in the Pending actions window, click Launch Procedure and follow the instructions to exchange the part. Note: The SDMC might open external instructions for replacing the part. If so, follow those instructions to replace the part. Removing a part by using the Systems Director Management Console You can use the IBM Systems Director Management Console (SDMC) to perform many service actions, including the removal of a field replaceable unit (FRU) or part. To use the SDMC user interface to remove a part in a system or expansion unit that is managed by an SDMC, follow these steps: 1. In the Power Systems Resources area, select the managed system from which you are removing a part. 2. Select one of the following options: v If you are installing a new part as part of an upgrade, from the Actions menu, expand Service and Support ? Hardware ? MES Tasks. v If you are removing a part as part of a service action, from the Actions menu, expand Service and Support ? Hardware ? Exchange FRU. 3. To install a new part as part of an upgrade, select Add FRU. To replace a part as part of a service action, select Exchange FRU. 4. On the Add FRU tab or the Exchange FRU tab, select the system or expansion unit from which you are removing the part. 5. Select the type of part you are removing from the menu and click Next. 6. Select the location of the part you are removing and click Add. 7. After the part is listed in the Pending Actions window, click Launch Procedure and follow the instructions to remove the part. Note: The SDMC might open external instructions for removing the part. If so, follow those instructions to remove the part. Common procedures for installable features 143 144 Power Systems: Serial-attached SCSI RAID enablement Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer's representative for information on the products and services currently available in your area. Any reference to the manufacturer's product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not grant you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to websites not owned by the manufacturer are provided for convenience only and do not in any manner serve as an endorsement of those websites. The materials at those websites are not part of the materials for this product and use of those websites is at your own risk. The manufacturer may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment. Information concerning products not produced by this manufacturer was obtained from the suppliers of those products, their published announcements or other publicly available sources. This manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products not produced by this manufacturer. Questions on the capabilities of products not produced by this manufacturer should be addressed to the suppliers of those products. All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only. The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are subject to change without notice. Dealer prices may vary. © Copyright IBM Corp. 2010, 2011 145 This information is for planning purposes only. The information herein is subject to change before the products described become available. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. If you are viewing this information in softcopy, the photographs and color illustrations may not appear. The drawings and specifications contained herein shall not be reproduced in whole or in part without the written permission of the manufacturer. The manufacturer has prepared this information for use with the specific machines indicated. The manufacturer makes no representations that it is suitable for any other purpose. The manufacturer's computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check the manufacturer's support websites for updated information and fixes applicable to the system and related software. Ethernet connection usage restriction This product is not intended to be connected directly or indirectly by any means whatsoever to interfaces of public telecommunications networks. Trademarks IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Other product and service names might be trademarks of IBM or other companies. Electronic emission notices When attaching a monitor to the equipment, you must use the designated monitor cable and any interference suppression devices supplied with the monitor. Class A Notices The following Class A statements apply to the IBM servers that contain the POWER7 processor and its features unless designated as electromagnetic compatibility (EMC) Class B in the feature information. Federal Communications Commission (FCC) statement Note: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment 146 Power Systems: Serial-attached SCSI RAID enablement generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate the equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class A digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com Warning: This is a Class A product. In a domestic environment, this product may cause radio interference, in which case the user may be required to take adequate measures. VCCI Statement - Japan The following is a summary of the VCCI Japanese statement in the box above: Notices 147 This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a domestic environment, radio interference may occur, in which case, the user may be required to take corrective actions. Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) Electromagnetic Interference (EMI) Statement - People's Republic of China Declaration: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may need to perform practical action. Electromagnetic Interference (EMI) Statement - Taiwan The following is a summary of the EMI Taiwan statement above. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information: 148 Power Systems: Serial-attached SCSI RAID enablement Electromagnetic Interference (EMI) Statement - Korea Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Notices 149 Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Electromagnetic Interference (EMI) Statement - Russia Class B Notices The following Class B statements apply to features designated as electromagnetic compatibility (EMC) Class B in the feature installation information. Federal Communications Commission (FCC) statement This equipment has been tested and found to comply with the limits for a Class B digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference in a residential installation. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instructions, may cause harmful interference to radio communications. However, there is no guarantee that interference will not occur in a particular installation. If this equipment does cause harmful interference to radio or television reception, which can be determined by turning the equipment off and on, the user is encouraged to try to correct the interference by one or more of the following measures: v Reorient or relocate the receiving antenna. v Increase the separation between the equipment and receiver. v Connect the equipment into an outlet on a circuit different from that to which the receiver is connected. v Consult an IBM-authorized dealer or service representative for help. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for 150 Power Systems: Serial-attached SCSI RAID enablement any radio or television interference caused by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate this equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class B digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class B Information Technology Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for typical residential environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com VCCI Statement - Japan Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Notices 151 Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) IBM Taiwan Contact Information Electromagnetic Interference (EMI) Statement - Korea Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B 152 Power Systems: Serial-attached SCSI RAID enablement Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Terms and conditions Permissions for the use of these publications is granted subject to the following terms and conditions. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved. You may not distribute, display or make derivative works of these publications, or any portion thereof, without the express consent of the manufacturer. Commercial Use: You may reproduce, distribute and display these publications solely within your enterprise provided that all proprietary notices are preserved. You may not make derivative works of these publications, or reproduce, distribute or display these publications or any portion thereof outside your enterprise, without the express consent of the manufacturer. Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either express or implied, to the publications or any information, data, software or other intellectual property contained therein. The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer, the above instructions are not being properly followed. You may not download, export or re-export this information except in full compliance with all applicable laws and regulations, including all United States export laws and regulations. THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR PURPOSE. Notices 153 ESCALA Power7 Serial-Attached SCSI RAID Enablement in E2-700, E2-700T, E2-705, E2-705T, E4- 700, E4-700T or E4-705 REFERENCE 86 A1 90FF 03 ESCALA Power7 Serial-Attached SCSI RAID Enablement in E2-700, E2- 700T, E2-705, E2-705T, E4-700, E4-700T or E4-705 The ESCALA Power7 publications concern the following models: - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M6-705 (Power 770 / 9117-MMC) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala M7-705 (Power 780 / 9179-MHC) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E1-705 (Power 710 / 8231-E1C) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E3-705 (Power 730 / 8231-E2C) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) - Bull Escala E4-705 (Power 740 / 8205-E6C) References to Power 755 / 8236-E8C models are irrelevant. Hardware October 2011 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A1 90FF 03 The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull SAS 2011 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. Disk drives for the E1-700, E1-705, E3- 700 or E3-705 REFERENCE 86 A1 01FH 01 ESCALA Power7 Disk drives for the E1-700, E1-705, E3-700 or E3-705 The ESCALA Power7 publications concern the following models: - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M6-705 (Power 770 / 9117-MMC) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala M7-705 (Power 780 / 9179-MHC) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E1-705 (Power 710 / 8231-E1C) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E2-705 / E2-705T (Power 720 / 8202-E4C) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E3-705 (Power 730 / 8231-E2C) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) - Bull Escala E4-705 (Power 740 / 8205-E6C) References to Power 755 / 8236-E8C models are irrelevant. Hardware October 2011 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A1 01FH 01 The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull SAS 2011 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. The information in this document is subject to change without notice. Bull will not be liable for errors contained herein, or f or incidental or consequential damages in connection with the use of this material. Contents Safety notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Disk drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 What's new in Disk drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . 5 Solid-state drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 10 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 11 Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 12 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Linux system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 15 Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 16 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 17 Verifying that the new disk drive is operational . . . . . . . . . . . . . . . . . . . . . . 18 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Virtual I/O Server system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . 20 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 22 Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 22 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 23 Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 24 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 Preparing to install a disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . . 27 Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 28 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 29 Completing the disk drive installation by using the HSM . . . . . . . . . . . . . . . . . . . 30 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 33 Installing a disk drive in the 5887 disk-drive expansion unit . . . . . . . . . . . . . . . . . . 35 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 37 Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 38 Preparing to remove a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 39 Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 39 Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C using Linux kernel 2.6 with the Linux system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 © Copyright IBM Corp. 2010, 2011 iii Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 43 Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 46 Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C or an expansion unit controlled by IBM i . . . 47 Preparing the system for removing a disk drive . . . . . . . . . . . . . . . . . . . . . . 48 Preparing to remove the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 49 Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 51 Removing a disk drive using hot-spare in an IBM i system or logical partition from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 Preparing to remove a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 52 Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 54 Removing a SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 55 Preparing to remove the SCSI disk drive by using the HSM . . . . . . . . . . . . . . . . . . 55 Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 55 Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . 59 Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . 61 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 Preparing the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 63 Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 64 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 Replacing a disk drive using Linux kernel 2.6, with the Linux system or logical partition power turned on . . . 65 Preparing the system to replace a disk drive . . . . . . . . . . . . . . . . . . . . . . . 66 Preparing to replace a disk drive by using the HSM. . . . . . . . . . . . . . . . . . . . . 67 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 67 Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 68 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69 Preparing to replace the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 70 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 71 Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 72 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 Replacing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C using hot-spare in an IBM i system or logical partition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 Preparing to replace the disk drive by using the HSM . . . . . . . . . . . . . . . . . . . . 73 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 73 Completing the procedure by using the HSM . . . . . . . . . . . . . . . . . . . . . . . 74 Replacing a SCSI disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . 75 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 75 Completing the disk drive replace by using the HSM . . . . . . . . . . . . . . . . . . . . 76 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . . . . 78 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Installing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system. . . . 81 Removing the external SAS port from the 8231-E2B, 8231-E1C, or 8231-E2C system . . 85 Replacing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system . . . 87 Removing or installing a disk drive filler . . . . . . . . . . . . . . . . . . . . . 89 iv Disk drives Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system. . . . . . . . . . . . . 89 Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system . . . . . . . . . . . . . . 90 Removing or installing a disk drive bezel. . . . . . . . . . . . . . . . . . . . . 93 Removing a disk drive bezel from a disk drive or filler. . . . . . . . . . . . . . . . . . . . . 93 Installing a disk drive bezel in a disk drive or filler . . . . . . . . . . . . . . . . . . . . . . 94 Disk drive locations and service indicators . . . . . . . . . . . . . . . . . . . . 97 PCIe RAID and SSD SAS adapter locations. . . . . . . . . . . . . . . . . . . . . . . . . 97 Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system . . . . . . . . 97 Disk drive locations and service indicators for the 5886 disk-drive enclosure . . . . . . . . . . . . . 99 Disk-drive locations and service indicators for the 5887 disk-drive enclosure . . . . . . . . . . . . . 100 Disk drive locations and service indicators for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures 101 Disk-drive locations and service indicators for the 5802 expansion unit . . . . . . . . . . . . . . . 103 Related information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Preparing a disk drive for use in an AIX system or logical partition . . . . . . . . . . . . . . . . 105 Preparing to remove a disk drive from a system or expansion unit controlled by AIX . . . . . . . . . . 105 Rebuilding data on a replacement disk drive using AIX . . . . . . . . . . . . . . . . . . . . 105 Removing a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . . . . 105 Preparing to remove the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . 105 Removing the disk drive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 Replacing the disk drive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Completing the procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 Rebuilding data on a replacement disk drive using Linux . . . . . . . . . . . . . . . . . . . 114 Rebuild data on a disk drive using Linux kernel 2.6 . . . . . . . . . . . . . . . . . . . . 114 Preparing to remove a disk-drive from a system or logical partition controlled by IBM i . . . . . . . . . 116 Determining the IBM i disk-drive protection status. . . . . . . . . . . . . . . . . . . . . . 117 Configuring a disk drive in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 118 Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system. . . . . . 120 Rebuilding data on a replacement disk drive using IBM i . . . . . . . . . . . . . . . . . . . 121 Replacing a SAS conduit card in a 5802 expansion unit with the power turned off . . . . . . . . . . . 122 Removing a SAS conduit card from a 5802 expansion unit with the power turned off . . . . . . . . . . 129 Replacing a SAS expander card in a 5802 expansion unit with the power turned off. . . . . . . . . . . 138 Removing a SAS expander card from a 5802 expansion unit with power turned off . . . . . . . . . . . 138 Common procedures for installable features . . . . . . . . . . . . . . . . . 141 Before you begin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143 Identifying a part. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147 Control panel LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147 Identifying a failing part in an AIX system or logical partition . . . . . . . . . . . . . . . . . . 148 Locating a failing part in an AIX system or logical partition. . . . . . . . . . . . . . . . . . 148 Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 149 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 149 Identifying a failing part in an IBM i system or logical partition . . . . . . . . . . . . . . . . . 149 Activating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . . 149 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 150 Identifying a failing part in a Linux system or logical partition. . . . . . . . . . . . . . . . . . 151 Locating a failing part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 151 Finding the location code of a failing part in a Linux system or logical partition . . . . . . . . . . . 151 Activating the indicator light for the failing part. . . . . . . . . . . . . . . . . . . . . . 151 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 151 Locating a failing part in a Virtual I/O Server system or logical partition . . . . . . . . . . . . . . 152 Identifying a part by using the Virtual I/O Server . . . . . . . . . . . . . . . . . . . . . 152 Starting the system or logical partition . . . . . . . . . . . . . . . . . . . . . 153 Contents v Starting a system that is not managed by a Hardware Management Console or or a Systems Director Management Console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153 Starting a system or logical partition by using the Hardware Management Console . . . . . . . . . . . 154 Starting a system or virtual server with the Systems Director Management Console. . . . . . . . . . . 155 Stopping a system or logical partition. . . . . . . . . . . . . . . . . . . . . . 157 Stopping a system that is not managed by an HMC or an SDMC . . . . . . . . . . . . . . . . . 157 Stopping a system by using the Hardware Management Console . . . . . . . . . . . . . . . . . 158 Stopping a system with the Systems Director Management Console . . . . . . . . . . . . . . . . 159 Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 161 Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 161 Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 161 System covers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163 Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C . . . . . . . . . . . 163 Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 163 Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . . . 164 Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . 164 Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system. . . . . 165 Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 166 Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . 167 Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . 169 Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system . . . . . . . 169 Removing and replacing covers for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 171 Removing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . 171 Installing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . 172 Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . . . . . . . . . 173 Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . 173 Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 174 Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . 175 Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 176 Removing and replacing covers for the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC. . . . . . . . . 177 Removing the front cover from the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . 177 Removing the front cover with the cover removal tool. . . . . . . . . . . . . . . . . . . . 178 Installing the front cover on the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC . . . . . . . . . . 183 Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . . 185 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . . 186 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 Service and operating positions . . . . . . . . . . . . . . . . . . . . . . . . 193 Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position . . . . . 193 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position . . . 193 Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position . . 194 Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position . . . . . . . . 196 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position . . . . . . . 196 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position . . . . . . 197 Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . 199 Placing the 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . 199 Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . 201 vi Disk drives Power cords. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203 Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system . . . . . . . . 203 Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system. . . . . . . . . . 204 Disconnecting the power cords from the 8231-E2B, 8231-E1C, or 8231-E2C . . . . . . . . . . . . . . 205 Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C. . . . . . . . . . . . . . . . 206 Verifying the installed part. . . . . . . . . . . . . . . . . . . . . . . . . . . 209 Verifying an installed feature or replaced part in an AIX system or logical partition. . . . . . . . . . . 209 Verifying the installed part in an IBM i system or logical partition. . . . . . . . . . . . . . . . . 211 Deactivating the failing-part indicator light . . . . . . . . . . . . . . . . . . . . . . . 212 Verifying the installed part in a Linux system or logical partition . . . . . . . . . . . . . . . . . 212 Verifying an installed part by using stand-alone diagnostics. . . . . . . . . . . . . . . . . . . 213 Verifying the installed part by using Hardware Management Console . . . . . . . . . . . . . . . 214 Activating and deactivating LEDs by using the HMC . . . . . . . . . . . . . . . . . . . . 215 Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 215 Activating or deactivating an identify LED . . . . . . . . . . . . . . . . . . . . . . 215 Viewing serviceable events by using the HMC . . . . . . . . . . . . . . . . . . . . . . 216 Verifying the installed part by using Systems Director Management Console . . . . . . . . . . . . . 216 Activating and deactivating LEDs by using the Systems Director Management Console . . . . . . . . 217 Deactivating a system attention LED or partition LED. . . . . . . . . . . . . . . . . . . 217 Activating or deactivating an identify LED by using the SDMC . . . . . . . . . . . . . . . 217 Viewing serviceable events by using the Systems Director Management Console . . . . . . . . . . . 218 Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools 218 Verifying a repair. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223 Verifying the repair in AIX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224 Verifying a repair using an IBM i system or logical partition . . . . . . . . . . . . . . . . . . 227 Verifying the repair in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229 Verifying the repair from the management console . . . . . . . . . . . . . . . . . . . . . . 229 Closing a service call. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231 Closing a service call using AIX or Linux . . . . . . . . . . . . . . . . . . . . . . . . . 235 Closing a service call using Integrated Virtualization Manager . . . . . . . . . . . . . . . . . . 239 Activating and deactivating LEDs. . . . . . . . . . . . . . . . . . . . . . . . 245 Deactivating a system attention LED or partition LED using the management console . . . . . . . . . . 245 Activating or deactivating an identify LED using the management console. . . . . . . . . . . . . . 246 Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface 246 Activating or deactivating an identify LED using the Advanced System Management Interface . . . . . . . 247 Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249 Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250 Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250 Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250 Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 254 Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257 Contents vii viii Disk drives Safety notices Safety notices may be printed throughout this guide: v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to people. v CAUTION notices call attention to a situation that is potentially hazardous to people because of some existing condition. v Attention notices call attention to the possibility of damage to a program, device, system, or data. World Trade safety information Several countries require the safety information contained in product publications to be presented in their national languages. If this requirement applies to your country, a safety information booklet is included in the publications package shipped with the product. The booklet contains the safety information in your national language with references to the U.S. English source. Before using a U.S. English publication to install, operate, or service this product, you must first become familiar with the related safety information in the booklet. You should also refer to the booklet any time you do not clearly understand any safety information in the U.S. English publications. German safety information Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der Bildschirmarbeitsverordnung geeignet. Laser safety information IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs. Laser compliance IBM servers may be installed inside or outside of an IT equipment rack. © Copyright IBM Corp. 2010, 2011 ix DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER x Disk drives Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Safety notices xi CAUTION: Removing components from the upper positions in the rack cabinet improves rack stability during relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a room or building: v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you received it. If this configuration is not known, you must observe the following precautions: – Remove all devices in the 32U position and above. – Ensure that the heaviest devices are installed in the bottom of the rack cabinet. – Ensure that there are no empty U-levels between devices installed in the rack cabinet below the 32U level. v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from the suite. v Inspect the route that you plan to take to eliminate potential hazards. v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the documentation that comes with your rack cabinet for the weight of a loaded rack cabinet. v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.). v Ensure that all devices, shelves, drawers, doors, and cables are secure. v Ensure that the four leveling pads are raised to their highest position. v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement. v Do not use a ramp inclined at more than 10 degrees. v When the rack cabinet is in the new location, complete the following steps: – Lower the four leveling pads. – Install stabilizer brackets on the rack cabinet. – If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest position to the highest position. v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent. Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the pallet. (R002) (L001) (L002) xii Disk drives (L003) or All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive, DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information: v Do not remove the covers. Removing the covers of the laser product could result in exposure to hazardous laser radiation. There are no serviceable parts inside the device. v Use of the controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation exposure. (C026) Safety notices xiii CAUTION: Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. (C027) CAUTION: This product contains a Class 1M laser. Do not view directly with optical instruments. (C028) CAUTION: Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following information: laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. (C030) CAUTION: The battery contains lithium. To avoid possible explosion, do not burn or charge the battery. Do Not: v ___ Throw or immerse into water v ___ Heat to more than 100°C (212°F) v ___ Repair or disassemble Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local regulations. In the United States, IBM has a process for the collection of this battery. For information, call 1-800-426-4333. Have the IBM part number for the battery unit available when you call. (C003) Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: The equipment is suitable for installation in the following: v Network telecommunications facilities v Locations where the NEC (National Electrical Code) applies The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect these interfaces metallically to OSP wiring. Note: All Ethernet cables must be shielded and grounded at both ends. The ac-powered system does not require the use of an external surge protection device (SPD). The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal shall not be connected to the chassis or frame ground. xiv Disk drives Disk drives Learn about installing, removing, and replacing a disk drive for the 8231-E2B, 8231-E1C, or 8231-E2C. © Copyright IBM Corp. 2010, 2011 1 2 Disk drives What's new in Disk drives Read about new or changed information in Disk drives since the previous update of this topic collection. October 2011 The following updates were made to the content: v Added information for the 8231-E1C and 8231-E2C. May 2011 The following updates were made to the content: v Added information for installing a disk drive in the 5887 under the installing a disk drive section. v Added information for Disk drive locations and concurrent maintenance lights for the 5887 disk drive enclosure unit. September 2010 The following updates were made to the content: v Added information for Internal disk drive sharing on an 8202-E4B or 8205-E6B system. v Added information for Disk drive locations and concurrent maintenance lights for 8231-E2B. v Added information for Disk drive locations and concurrent maintenance lights for an 8202-E4B or 8205-E6B system. v Added information for PCIe RAID and SSD SAS adapter locations. v Added a procedure for Installing the external SAS port on an 8231-E2B system. v Added a procedure for Installing the external SAS port on an 8202-E4B or 8205-E6B system. March 2010 The following updates were made to the content: v Added information for Disk drive locations and concurrent maintenance lights for models 9117-MMB or 9179-MHB. v Added a procedure for Internal disk drive sharing in a 9117-MMB or 9179-MHB system. February 2010 v Added information for servers that contain the POWER7® processor. © Copyright IBM Corp. 2010, 2011 3 4 Disk drives Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C You can install a Small Computer System Interface (SCSI), serial-attached SCSI (SAS) disk drive, or solid-state drive (SSD) in the server or expansion unit. The system or logical partition can either be turned off or turned on when you install the disk drive. Identify the system for performing the installation or an upgrade. Turn on the System Identify (BLUE) LED. Solid-state drives To install and configure solid-state drives (SSD), you must review the supported machine type, adapter, and configuration details. SSDs, also known as flash drives, follow many of the same rules as a regular hard disk drive (HDD). For example, SSDs physically resemble, install in the same manner, and in most of the same slots as HDDs. However, some restrictions and configuration rules that apply to SSD are different from HDD. The following tables can help you determine which operating system, adapter, machine type model, and disk drive configuration rules apply to your system or expansion unit. After you have reviewed the configuration rules, you can install the drive as you would any other disk drive. Restriction: Solid-state drives must be part of a RAID array or System Mirroring (IBM i). Refer to the Solid-state drive rules for the IBM AIX®, IBM i, or Linux operating system for the server or partition where you want to install drives. Attention: Before installing SSD into your system, you must ensure that you have all the prerequisite updates for both the SSD and the adapters. To check for the prerequisites, go to the website: IBM Prerequisite. © Copyright IBM Corp. 2010, 2011 5 Table 1. Solid-state drive rules for the IBM AIX, IBM i, or Linux operating system Machine type and model Adapter Mixing rules 5887 v FC 5805: PCIe 380 MB Cache Dual - x4 3Gb SAS RAID Adapter v FC 5901: PCIe Dual - x4 SAS Adapter v FC 5908: PCI-X DDR 1.5 GB Cache SAS RAID Adapter (BSC) v A maximum of 24 drives are supported in this expansion unit. v The EXP24S can be configured as one set of 24 bays (mode 1), two sets of 12 bays (mode 2), or four sets of 6 bays (mode 4). v The EXP24S serial-attached SCSI (SAS) ports are attached to SAS controllers, which can be a SAS Peripheral Component Interconnect-X (PCI-X) adapter, PCI Express (PCIe) adapter, or a pair of adapters. The EXP24S can also be attached to an imbedded SAS controller in a server with an imbedded SAS port. v The mixing of the Solid-state drives and hard disk drives (HDD) is as follows: – SSDs and HDDs cannot be mixed in a logical group when configured as in mode 1. – SSDs and HDDs can be mixed when configured as in mode 2: one set can be SSDs and the other set can be HDDs. – SSDs and HDDs cannot be mixed when configured as in mode 4 because the SSDs are not supported by the adapter that is required to configure the 5887 for mode 4. 5886 v FC 5904: PCI-X DDR 1.5GB Cache SAS RAID Adapter v FC 5906 and FC 5908: PCI-X DDR 1.5 GB Cache SAS RAID Adapter (BSC) v FC 5903: PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSDs and hard disk drives can be mixed in this drive enclosure. v A maximum of 24 drives are supported in this expansion unit. v SSDs and HDDs can be attached to the SAS Controller PCI or PCI-X adapters. Also, attached to the imbedded SAS controller in a server with an imbedded SAS port. 8231-E2B, 8231-E1C, or 8231-E2C v FC 2053: 3 Gbps SAS RAID Controller with 4X SSD Sockets v FC 5278: SAS Controller v SSD and HDD can never be mixed in the same RAID array or mirroring group. v SSDs and HDDs cannot split the disks. v SSDs and HDDs can be connected to the 5886 from the external SAS port. 6 Disk drives Table 1. Solid-state drive rules for the IBM AIX, IBM i, or Linux operating system (continued) Machine type and model Adapter Mixing rules 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C v FC 5631: Split disks Enablement adapter v SSD and HDD can never be mixed in the same RAID array or mirroring group. v SSDs and HDDs can be connected to the 5886 from the external SAS port. v 5631 can be used to split disks 8233-E8B and 8236-E8C v FC 5679: SAS RAID Enablement: internal adapter v FC 5904: PCI-X DDR 1.5GB Cache SAS RAID Adapter v FC 5903: PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD can never be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk drive backplane if the backplane is not split. v SSD and HDD can both be present in a split backplane if each side of the split backplane has only SSD or HDD. Restriction: Split backplane for these models is not supported with dual SAS Raid adapters. Remember: SSD must be on one side of the split backplane and HDD must be on the other side of the split backplane. SSD and HDD drive types cannot be mixed within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC v FC 5662: Disk/Media Backplane v FC 5662: 175 MB Cache RAID - Dual IOA Enablement Card v FC 5903: PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same RAID array or mirroring group. v SSD and HDD can be mixed in the system disk-drive backplane. Remember: You cannot mix SSD and HDD drive types within a parity set. If you are mixing SSD and HDD in a system, ensure that you have enough of each drive type to manage your protection scheme. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 7 Table 1. Solid-state drive rules for the IBM AIX, IBM i, or Linux operating system (continued) Machine type and model Adapter Mixing rules 5802 FC 5903: PCIe 380 MB Cache Dual - x4 3 Gb SAS RAID Adapter v SSD and HDD cannot be mixed in the same system disk drive backplane. v SSD and HDD can both be present in a split backplane if each side of the split backplane has only SSD or HDD. SSD must be on one side of the split backplane and HDD must be on the other side of the split backplane. v SSD and HDD cannot be mixed on a pair of 5903 adapters. v A maximum of nine SSDs are supported on a pair of 5903 adapters. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on Learn how to install a disk-drive or solid-state drive in a system or expansion unit while the power of the AIX operating system or logical partition that controls the disk drive location is turned on. If you are servicing a failing part, see the service procedures for “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on” on page 61. This procedure is intended for installing a new or upgraded disk drive. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM prerequisite. If the required software is not installed, go to Fix Central to download it, and install it before continuing. If you are installing solid-state drives (SSD), review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. 8 Disk drives DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: This procedure is a complex set of steps that must be performed precisely as instructed to avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 9 Preparing the system Complete the following steps for preparing the system before installing a disk drive. 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack. 3. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 97. Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can choose a different disk drive placement depending on your data protection strategy. 4. Record the position where the new disk drive must be installed. For example, the next available disk drive slot might be P3-D4. Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C Complete the following steps to install a disk drive in the system or an expansion unit. 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the disk drive to the correct location. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. Record the serial number (use the last four digits) of the disk drive. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. If the disk drive you are installing has a bezel preinstalled, then go to step 8. 8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 1 on page 11. The service indicator on the disk drive turns on. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. Note: The light stops flashing and remains on when you have installed the disk drive properly. 10 Disk drives 11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk drives are installed. Installing a disk drive in the 5887 disk-drive expansion unit Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit. 1. With the handle in the unlocked position, see Figure 2 on page 12, support the bottom of the disk unit as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the handle. 2. Slide the disk unit into the expansion unit until the drive stops. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. 3. Rotate the handle to the locked position. Figure 1. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 11 Completing the disk drive installation by using the HSM Perform the following steps in the Hardware Service Manager (HSM): 1. Press Enter. 2. Exit to the command line. 3. If you are installing more than one disk drive, repeat this procedure until all disk drives are installed. 4. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu. 5. Select Configure Added/Replaced Devices. Completing the procedure 1. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Figure 2. Installing the disk drive in the 5887 expansion unit 12 Disk drives Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Linux system or logical partition power turned on Learn how to install a disk drive or solid-state drive in a system or expansion unit while the power to the Linux operating system or logical partition that controls the disk drive location is turned on. This procedure uses the Linux kernel 2.6. Attention: If you are servicing a failing part, see the service procedures for “Replacing a disk drive using Linux kernel 2.6, with the Linux system or logical partition power turned on” on page 65. This procedure is intended for installing a new or upgraded disk drive. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM prerequisite. If the required software is not installed, go to Fix Central to download it, and install it before continuing. If you are installing solid-state drives (SSD), review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. Attention: This procedure is a complex set of steps that must be performed precisely as instructed to avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 13 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Preparing the system Complete the following steps for preparing the system before installing a disk drive. 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack. 3. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 97. Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can choose a different disk drive placement depending on your data protection strategy. 4. Record the position where the new disk drive must be installed. For example, the next available disk drive slot might be P3-D4. 14 Disk drives Preparing to install a disk drive by using the HSM Perform the following steps to install a disk drive by using the Hardware Service Manager (HSM): 1. Log in as root user. 2. Type lsslot on the command line of the Linux session and then press Enter. A display like the following figure, is shown. 3. Note the Linux Name for the slot that contains the adapter under which you want to install the disk drive. 4. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 5. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press Enter. 6. Select Concurrent add device from the Work with Disk Unit Recovery display. Press Enter. A Concurrent Device Add display, like the following figure, is shown. x0537p1:~ # lsslot # Slot Description Linux Name Device(s) U787A.001.DNZ00XV-P1-C1 Logical I/O Slot 0000:00:02.0 ethernet U787A.001.DNZ00XV-P1-C2 Logical I/O Slot 0000:00:02.2 pci U787A.001.DNZ00XV-P1-C3 Logical I/O Slot 0001:00:02.2 pci1069,b166 U787A.001.DNZ00XV-P1-C4 Logical I/O Slot 0000:00:02.6 pci1069,b166 U787A.001.DNZ00XV-P1-C5 Logical I/O Slot 0001:00:02.6 pci1014,0180 U787A.001.DNZ00XV-P1-T5 Logical I/O Slot 0000:00:02.4 ethernet U787A.001.DNZ00XV-P1-T7 Logical I/O Slot 0000:00:02.3 usb U787A.001.DNZ00XV-P1-T10 Logical I/O Slot 0001:00:02.4 scsi U787A.001.DNZ00XV-P1-T12 Logical I/O Slot 0001:00:02.3 ide U9111.520.10D3C6C-V1-C0 Virtual I/O Slot 30000000 vty Figure 3. Find the Linux Name for the location for which you want to install the disk drive. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 15 7. Type option 1 next to the location into which you want to install the disk drive and press Enter. The Verify Device Concurrent Add display is shown. A service indicator indicates the slot where the disk drive can be installed. Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C Complete the following steps to install a disk drive in the system or an expansion unit. 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the disk drive to the correct location. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. Record the serial number (use the last four digits) of the disk drive. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. If the disk drive you are installing has a bezel preinstalled, then go to step 8. 8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 5 on page 17. The service indicator on the disk drive turns on. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. Note: The light stops flashing and remains on when you have installed the disk drive properly. Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 4. Example Concurrent Device Remove display 16 Disk drives 11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk drives are installed. Installing a disk drive in the 5887 disk-drive expansion unit Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit. 1. With the handle in the unlocked position, see Figure 6 on page 18, support the bottom of the disk unit as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the handle. 2. Slide the disk unit into the expansion unit until the drive stops. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. 3. Rotate the handle to the locked position. Figure 5. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 17 Verifying that the new disk drive is operational Do the following to confirm if you have installed the disk drive correctly: v Press Enter on the Verify Device Concurrent Add display to verify the location. v Press Enter. The service indicator stops flashing and remains on when you have installed the disk drive correctly. Perform the following steps to verify that the new disk drive is operational: 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Display hardware status. The Display Hardware Status display is shown, like the following figure. Figure 6. Installing the disk drive in the 5887 expansion unit 18 Disk drives 4. Verify that the disk drive that you installed is shown on this display. Completing the procedure 1. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 7. Example Display Hardware Status Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 19 v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Virtual I/O Server system or logical partition power turned on You can install a disk-drive or solid-state drive in a system or expansion unit while the power to the Virtual I/O Server operating system or logical partition that controls the disk-drive location is turned on. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM prerequisite. If the required software is not installed, go to Fix Central to download it, and install it before continuing. Attention: This procedure is a complex set of steps that must be performed precisely as instructed to avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31. 20 Disk drives DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Preparing the system Complete the following steps for preparing the system before installing a disk drive. 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack. 3. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 97. Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can choose a different disk drive placement depending on your data protection strategy. 4. Record the position where the new disk drive must be installed. For example, the next available disk drive slot might be P3-D4. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 21 Preparing to install a disk drive by using the HSM Complete the following steps by using the Hardware Service Manager (HSM) before installing a disk drive. 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. Press Enter to continue. 4. On the Function Selection display, select Task Selection. 5. On the Tasks Selection display, select Hot Plug Task. 6. Select SCSI and SCSI RAID Hot Plug Manager. 7. Select Attach a Device to a SCSI Hot Swap Enclosure Device. A list of empty slots in the SCSI hot swap enclosure device is shown. 8. Select the slot where you want to install the disk drive. 9. Do not press Enter yet, first complete installing the disk drive. Important: When you press Enter, after a one minute delay, the disk drive slot enters the Identify state and the service indicator comes on and begins to flash rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by pushing in the disk drive handle. You can then press Enter on the console. Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C Complete the following steps to install a disk drive in the system or an expansion unit. 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the disk drive to the correct location. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. Record the serial number (use the last four digits) of the disk drive. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. If the disk drive you are installing has a bezel preinstalled, then go to step 8. 8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 8 on page 23. The service indicator on the disk drive turns on. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. Note: The light stops flashing and remains on when you have installed the disk drive properly. 22 Disk drives 11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk drives are installed. Installing a disk drive in the 5887 disk-drive expansion unit Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit. 1. With the handle in the unlocked position, see Figure 9 on page 24, support the bottom of the disk unit as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the handle. 2. Slide the disk unit into the expansion unit until the drive stops. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. 3. Rotate the handle to the locked position. Figure 8. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 23 Completing the disk drive installation by using the HSM Perform the following steps in the Hardware Service Manager (HSM) after you have installed the disk drive: 1. Cancel to get to the SCSI and SCSI RAID Hot Plug Manager menu. 2. Select Configure Added/Replaced Devices. 3. Exit to the command line. Completing the procedure 1. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Figure 9. Installing the disk drive in the 5887 expansion unit 24 Disk drives Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on Learn how to install a disk drive or solid-state drive in a system or expansion unit while the power to the IBM i operating system or logical partition that controls the disk drive location is turned on. If you are servicing a failing part, see the service procedures for “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on” on page 69. This procedure is intended for installing a new or upgraded disk drive. Attention: This procedure is a complex set of steps that must be performed precisely as instructed to avoid potential data loss or system damage. Alternatively, you can use the procedure for installing a disk drive or solid-state drive with the power off, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM prerequisite. If the required software is not installed, go to Fix Central to download it, and install it before continuing. If you are installing solid-state drives (SSD), review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 25 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Preparing the system Complete the following steps for preparing the system before installing a disk drive. 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack. 3. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 97. Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can choose a different disk drive placement depending on your data protection strategy. 4. Record the position where the new disk drive must be installed. For example, the next available disk drive slot might be P3-D4. 26 Disk drives Preparing to install a disk drive by using the HSM Perform the following steps to install a disk drive by using the Hardware Service Manager (HSM): 1. Record the physical location where the new disk drive can be installed. a. The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position of the disk drive in the system unit or expansion unit. Find the U787A.001.AAAXXXX location information from the control panel display on the system unit or expansion unit. b. In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label, AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and P3–D4 is the slot where you are installing the disk drive as shown in “Disk drive locations and service indicators” on page 97. c. If you cannot find the location information from the control panel, find the feature code (FC) and sequence number (SN or SEQ) on the label that is shown on the front of the system unit or expansion unit, as shown in Figure 10. In the example, U5074.001.AAAXXXX-DB3-D32, you would install a disk drive in the expansion unit with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32. 2. Sign on to service level authority. 3. Type strsst on the command line of the IBM i session and press Enter. 4. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 5. Select Start a service tool from the System Service Tools (SST) display. Press Enter. 6. Select Hardware service manager from the Start a Service Tool display. Press Enter. 7. If the slot you want to use contains a disk drive filler, do not remove the disk drive filler yet. 8. Select Device Concurrent Maintenance from the Hardware Service Manager display, and then press Enter. The Device Concurrent Maintenance is displayed, as shown in the following figure. Figure 10. Example of a system or expansion unit label Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 27 9. In the Specify either physical location or resource name field, type the physical location that you had previously recorded in this procedure. 10. Select option 2 (Install device) for the Action to be performed. 11. Set the time delay for one minute - 01. 12. Do not press Enter yet, first complete installing the disk drive. Important: When you press Enter, after a one minute delay, the disk drive slot enters the Identify state and the service indicator comes on and begins to flash rapidly. You then have nine seconds to slide the disk drive all the way in and then lock it in place by pushing in the disk drive handle. You can then press Enter on the console. Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C Complete the following steps to install a disk drive in the system or an expansion unit. 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the disk drive to the correct location. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. Record the serial number (use the last four digits) of the disk drive. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. Device Concurrent Maintenance Type the choices, then press Enter. Specify either physical location or resource name. Physical location: U787A.001.1234567-P3-D4 or Device resource name: Specify action as 1=Remove device 2=Install device Action to be performed . . . . . . . .: 2 Enter a time value between 00 and 19. Time delay needed in minutes . . . . .: 01 F3=Exit F5=Refresh F12=Cancel Figure 11. Example Device Concurrent Maintenance display 28 Disk drives 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 8. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. If the disk drive you are installing has a bezel preinstalled, then go to step 8. 8. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 9. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 10. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 12. The service indicator on the disk drive turns on. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. Note: The light stops flashing and remains on when you have installed the disk drive properly. 11. If you are installing more than one disk drive, perform all the steps in this procedure until all disk drives are installed. Installing a disk drive in the 5887 disk-drive expansion unit Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit. 1. With the handle in the unlocked position, see Figure 13 on page 30, support the bottom of the disk unit as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the handle. 2. Slide the disk unit into the expansion unit until the drive stops. Figure 12. Installing the disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 29 Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. 3. Rotate the handle to the locked position. Completing the disk drive installation by using the HSM 1. Return to the console and wait until the Concurrent Maintenance Results display is shown, then press Enter. 2. When the Hardware Service Manager display is shown, press F12 twice and then press Enter. Completing the procedure 1. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Figure 13. Installing the disk drive in the 5887 expansion unit 30 Disk drives Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off Learn how to install a disk drive in a system or logical partition that controls the disk-drive location with the power turned off. If you are servicing a failing part, see the service procedures for “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 77. This procedure is intended for installing a new or upgraded disk drive. Before installing or replacing a feature, ensure that the software required to support the feature is installed on your system. For information about software prerequisites, see IBM prerequisite. If the required software is not installed, go to Fix Central to download it, and install it before continuing. If you are installing solid-state drives (SSD), review the configuration rules and then return here. For more information, see “Solid-state drives” on page 5. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 31 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Preparing the system Complete the following steps for preparing the system before installing a disk drive. 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. If applicable, remove the system unit door, remove the expansion unit door, or open the rack. 3. Determine the next available disk drive position in your system or expansion unit. For information, see “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 97. Note: When you have a system with an expansion unit, fill the slot positions in the system unit first. However, you can choose a different disk drive placement depending on your data protection strategy. 4. Record the position where the new disk drive must be installed. For example, the next available disk drive slot might be P3-D4. 5. Stop the system. For instructions, see Stopping the system or logical partition. 32 Disk drives 6. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position. 7. Remove the service access cover. For instructions, see Removing the service access cover from a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 8. Disconnect the power source from the system by unplugging the system. For instructions, see Disconnecting the power cords. Note: This system can be equipped with a second power supply. Disconnect it before continuing with this procedure. Ensure that the power source to the system has been disconnected. (L003) or 9. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. Installing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C Complete the following steps to install a disk drive in the system or an expansion unit. 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 33 2. Use the location information that you recorded while removing the disk drives to install or replace the internal disk drives to the correct locations. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. Record the serial number (use the last four digits) of the disk drive. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 14. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. 10. If you are installing more than one disk drive, perform all the steps in this procedure until all disk drives are installed. Figure 14. Installing the disk drive into the 8231-E2B, 8231-E1C, or 8231-E2C 34 Disk drives Installing a disk drive in the 5887 disk-drive expansion unit Perform the following steps to install a disk drive in the 5887 disk-drive expansion unit. 1. With the handle in the unlocked position, see Figure 15, support the bottom of the disk unit as you align it with the guide rails in the expansion unit. Do not hold the disk unit solely by the handle. 2. Slide the disk unit into the expansion unit until the drive stops. Important: When installing a disk drive ensure that the disk drive is fully seated and is all the way into the system. 3. Rotate the handle to the locked position. Completing the procedure 1. Replace the service access cover. For instructions, see Installing the service access cover on a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 2. Connect the power cords to the system. For instructions, see Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C. 3. Place the system in the operating position. For instructions, see Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position. 4. Start the system or logical partition. For instructions, see Starting the system or logical partition. 5. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 6. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. Figure 15. Installing the disk drive in the 5887 expansion unit Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 35 7. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. 36 Disk drives Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C Learn to remove a failed disk drive or solid-state drive from the server or an expansion unit with the system or logical partition power turned on or turned off. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on Learn to remove a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or an expansion unit while the power of the AIX operating system or logical partition that controls the disk drive location is turned on. If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on” on page 8. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. Attention: This procedure is a complex set of steps that must be performed exactly to avoid potential data loss or system damage. Alternatively, you can use the procedure, which is a simpler task that involves less risk of data loss or system damage. If the system or logical partition that controls the disk drive you are replacing is at the AIX command line, you can use this powered-on procedure. If the disk drive that you are removing is in the root volume group (rootvg) and it is not protected by either Redundant Array of Independent Disks (RAID) or mirroring, or to use the powered-off procedure, go to “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 56. To replace a Just a Bunch of Disks (JBOD) disk drive that is attached to any SCSI or SCSI RAID adapter, see the procedure in “Removing a SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C” on page 55. © Copyright IBM Corp. 2010, 2011 37 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Preparing the system for removing a disk drive 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. Prepare to remove the disk drive from the system or expansion unit. For instructions, see “Preparing to remove a disk drive from a system or expansion unit controlled by AIX” on page 105. 3. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position. 4. Remove the service access cover. For instructions, see Removing the service access cover from a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 5. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 38 Disk drives Preparing to remove a disk drive by using the HSM 1. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure. Then, return here and perform the following steps. Important: Ensure to follow the concurrent or nonconcurrent replacement procedures depending on the type of data protection enabled: v If the data protection is RAID, use the concurrent procedure. v If the data protection is mirrored (card to card) and the SSD is located in the 5802 or 5803 expansion unit, use the concurrent procedure. v If the data protection is mirrored (card to card) but the SSD is not located in the 5802 or 5803 expansion unit, use the nonconcurrent procedure. 2. Perform the following steps on the Hardware Service Manager (HSM) 3. Log in as root user. 4. At the command line, type diag and press Enter. 5. Press Enter to continue. 6. On the Function Selection display, select Task Selection. 7. Select Hot Plug Task. 8. Select SCSI and SCSI RAID Hot Plug Manager or RAID Hot Plug Devices, depending on the adapter that your drive is attached to. Tip: One or both of these options might be available, depending on the configuration of your system. If the drive you want to replace is not listed when you follow the steps for the SCSI and SCSI RAID Hot Plug Manager, go to step 9. v If you have a RAID disk drive attached to a 2498 PCI SCSI RAID adapter, continue with step 9. v If you have a JBOD disk drive attached to any SCSI or SCSI RAID adapter, go to “Removing a SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C” on page 55. v If you have RAID drives attached to any other adapter, see the PCI-X SCSI RAID Controller Reference Guide for AIX. This guide is available from the SCSI PCI Adapters web page. 9. Select RAID Hot Plug Devices. 10. On the Resource Selection List, select the adapter that is connected to the array that contains the disk drive you want to remove or replace. Press Enter. 11. Select Commit. 12. On the Identify and Remove Resources menu, select Remove. Press Enter. A list of the disk drives that you can remove is shown. If the disk drive you want to remove does not appear in the list, ask your system administrator to put the disk drive in the Failed state before continuing with this procedure. For information, see AIX System Management Guide: Operating System and Devices. This guide is available at the System p® and AIX Information Center . 13. Select the drive that you want to remove and press Enter. The service indicator for the slot flashes rapidly. Attention: If the light does not flash rapidly, do not remove the disk drive. To view the locations of the service indicators, go to “Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 97. Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 16 on page 40. Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 39 2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. Important: When you press Enter, after a one minute delay, the service indicator comes on and begins to flash rapidly. You then have nine seconds to remove the disk drive. Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on (solid) and does not flash during this procedure. 3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. Note: The service indicator stops flashing and remains off when the disk drive is removed. 4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93. Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel. v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure for replacing a disk drive for your operating system. v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the procedure. Figure 16. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 40 Disk drives Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C using Linux kernel 2.6 with the Linux system or logical partition power turned on Learn to remove a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or expansion unit while the power of the Linux operating system or logical partition that controls the disk drive location is turned on. This procedure uses the Linux kernel 2.6. If you are installing a new or upgraded disk drive, see “Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Linux system or logical partition power turned on” on page 13. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. Attention: Read these instructions carefully and understand the complexity of the powered-on procedure before you begin. If the system or logical partition that controls the disk drive you are replacing is at the Linux command line, you can use this powered-on procedure. Go to “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 77, if: v You are installing a disk drive that is to be connected with the 6203 or 6204 SCSI adapter v You do not feel comfortable using the power turned on procedure Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 41 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Preparing the system 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. Is the disk drive you are replacing in a redundant array of independent disk (RAID) Level 0 array or is the disk drive you are replacing in a failed RAID Level 5 or Level 10 array? v If yes, you must delete the disk array before removing or replacing the disk. For instructions, see PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. v If no, your drive is configured as Just a Bunch of Disks (JBOD) or is configured within a degraded array. Proceed to the next step. 3. If the disk drive you are replacing is in a redundant array of independent disk (RAID) Level 0 disk array or in a failed RAID Level 5 or RAID Level 10 disk array, you must delete the disk array before replacing the disk. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. 42 Disk drives 4. If applicable, remove the system unit door or expansion unit door or open the rack front door. Preparing to remove the disk drive by using the HSM Perform the following tasks to remove a disk drive by using the Hardware Service Manager (HSM): 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown. 4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter. 5. Find the entry in the log for the disk drive you want to replace. 6. Record the location information for the disk drive. Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0 is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit). 7. Return to the command line. 8. Type the following command: ls -ld /sys/class/scsi_host/host#/device Where # is the SCSI host number. Press Enter. 9. Record the PCI location information. Note: The PCI location information has the form of 61:01:0:2. Kernel Messages Log Select one of the following: 1. View most recent ipr error messages 2. View ipr error messages 3. View all kernel error messages 4. View iprconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. View ipr boot time messages Selection: e=Exit Figure 17. Kernel Messages Log Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 43 10. Type iprconfig on the command line and press Enter. The IBM Power RAID Configuration Utility display is shown. 11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter. The Display Hardware Status display is shown. 12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status. 13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. 14. Type option 1 (Display hardware resource information details) next to the disk drive you want to replace. Press Enter. The Disk Hardware Resource Information Details is displayed like the shown in the following sample. Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 18. Example Display Hardware Status 44 Disk drives 15. Record the physical location information. 16. Return to the IBM Power RAID Configuration Utility display. 17. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and perform that procedure. Then, return here and perform the following steps: Important: Ensure that you follow the concurrent or nonconcurrent replacement procedure depending on the type of data protection enabled. a. If the data protection is RAID, use the concurrent procedure. b. If the data protection is mirrored (card to card) and the SSD is located in the 5802 or 5803 expansion unit, use the concurrent procedure. c. If the data protection is mirrored (card to card) but the SSD is not located in the 5802 or 5803 expansion unit, use the nonconcurrent procedure. 18. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. Disk Unit Hardware Resource Information Details Manufacturer . . . . . . . . . . . . . . : IBM Product ID . . . . . . . . . . . . . . . : ST336607LC Firmware Version . . . . . . . . . . . . : 43353048 (C50H) Serial Number. . . . . . . . . . . . . . : 00006719 Capacity . . . . . . . . . . . . . . . . : 36.40 GB Resource Name. . . . . . . . . . . . . . : /dev/sdd Physical location PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0 SCSI Host Number . . . . . . . . . . . . : 2 SCSI Channel . . . . . . . . . . . . . . : 0 SCSI Id. . . . . . . . . . . . . . . . . : 4 SCSI Lun . . . . . . . . . . . . . . . . : 0 Extended Details FRU Number . . . . . . . . . . . . . . . : 00P2676 More... Press Enter to Continue e=Exit q=Cancel f=PageDn b=PageUp Figure 19. Example Disk Hardware Resource Information Details display Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 45 19. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A Concurrent Device Remove display is shown, like the following sample. 20. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter. 21. The Verify Device Concurrent Remove display is shown. The service indicator turns on for that disk drive slot. To view the locations of the service indicators, see “Disk drive locations and service indicators” on page 97. 22. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive you want to replace, then press Enter. The identify light turns on for the disk drive. Note: Ensure that the disk drive is not in use to prevent loss of data. 23. The Device Concurrent Remove display is shown. 24. Complete the removing of the disk drive as detailed in this procedure and then press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the disk drive. Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 21 on page 47. 2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. Important: When you press Enter, after a one minute delay, the service indicator comes on and begins to flash rapidly. You then have nine seconds to remove the disk drive. Concurrent Device Remove Choose a single location for remove operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- sdc 0000:41:01.0.1/0:8:0 Physical Disk Active 0000:41:01.0.1/0:5: Empty sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sda 0000:41:01.0.1/0:3:0 Physical Disk Active 0000:41:01.0.1/1:8: Empty sdf 0000:41:01.0.1/1:5:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active 0001:61:01.0.2/0:8: Empty sdh 0001:61:01.0.2/0:9:0 Physical Disk Active sdg 0001:61:01.0.2/0:3:0 Physical Disk Active Figure 20. Example Concurrent Device Remove display 46 Disk drives Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on (solid) and does not flash during this procedure. 3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. Note: The service indicator stops flashing and remains off when the disk drive is removed. 4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93. Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel. v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure for replacing a disk drive for your operating system. v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the procedure. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C or an expansion unit controlled by IBM i Learn how to remove a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive from a system or an expansion unit with the power turned on to the IBM i operating system or a logical partition that controls the disk drive location. Figure 21. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 47 If you are removing a disk drive in order to install a new or upgraded disk drive, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on” on page 25. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Preparing the system for removing a disk drive 1. See Preparing to remove a disk drive from a system or logical partition controlled by IBM i and follow the steps before you start the tasks in this procedure. 2. Before removing a disk drive, ensure to back up all the data from the disk drive or the array that the drive is a part of, and then move the data from the disk drive that you want to remove. For instructions, choose one of the following options: v Move and remove disk units v Recovering your system in the IBM i Information Center To access the IBM i Information Center website, go to http://www.ibm.com/systems/i/infocenter and select from the following options: 48 Disk drives – If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup and recovery > Recovering your system – If you select V5R4, expand i5/OS Information Center > Systems management > Backup and recovery > Backing up and recovering your system > Recovering your system 3. Locate the disk drive to be removed. See “Disk drive locations and service indicators” on page 97. 4. Stop the system. For instructions, see Stopping the system or logical partition. Preparing to remove the disk drive by using the HSM Perform the following steps to remove a disk drive by using the Hardware Service Manager (HSM): 1. If you are removing a solid-state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and perform that procedure. Then, return here and perform the following steps. Important: Ensure to follow the concurrent or nonconcurrent replacement procedures depending on the type of data protection enabled: v If the data protection is RAID, use the concurrent procedure. v If the data protection is mirrored (card to card) and the SSD is located in the 5802 or 5803 expansion unit, use the concurrent procedure. v If the data protection is mirrored (card to card) but the SSD is not located in the 5802 or 5803 expansion unit, use the nonconcurrent procedure. 2. Locate the disk drive to be replaced and record the location information, then use the location information to activate the identify light for that disk drive. 3. Determine if the disk drive to be removed is a non-configured unit by performing the following steps: a. Type strsst on the command line of the IBM i session and press Enter. b. Type your service tools user ID and service tools password and press Enter. c. Select Work with disk units and press Enter. d. Select Display Disk Configuration and press Enter. e. Select Display non-configured units and press Enter. Is the failed drive a non-configured unit? v No: Continue to 4 v Yes: Continue to next step. f. Determine if the disk drive to be removed was replaced by a hot spare when it failed. An xxxx9031 error logged close to the same time of the original disk failure indicates the automatic Rebuild of the data to a hot spare device. 4. Verify that the disk drive you are replacing has been suspended before continuing with this procedure. For instructions, see Preparing to remove a disk drive from a system or logical partition controlled by IBM i. 5. Did you find the service action log entry related to this problem? v No: Exit the service action log and go to step 7 v Yes: Continue with the next step. 6. Select the concurrent maintenance option. This option displays the disk drive location on the concurrent maintenance display. Go to step 9 on page 50 7. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter. The Device Concurrent Maintenance display is shown, as shown in the following figure. Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 49 8. If the physical location does not show in the Device Concurrent Maintenance display, type the physical location where you are replacing the disk drive as shown in the following example: The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location information from the control panel display on the system unit or expansion unit. If you cannot get the information from the control panel, find the feature code (FC) and sequence number (SN or SEQ) on the label that is on the front of the system unit or expansion unit as shown in the following figure. In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label, AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and service indicators” on page 97. In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32. 9. Select option 1 (Remove device) for the Action to be performed. Device Concurrent Maintenance Type the choices, then press Enter. Specify either physical location or resource name. Physical location: U787A.001.1234567-P3-D4 or Device resource name: Specify action as 1=Remove device 2=Install device Action to be performed . . . . . . . .: 1 Enter a time value between 00 and 19. Time delay needed in minutes . . . . .: 01 F3=Exit F5=Refresh F12=Cancel Figure 22. Example Device Concurrent Maintenance display Figure 23. Example system or expansion unit label 50 Disk drives 10. Set the time delay for one minute: 01. Important: Do not press Enter at this time. 11. Locate the service indicator that corresponds to the position of the disk drive that you are replacing. You can view the locations of the service indicators for the system you are replacing the disk drive shown in “Disk drive locations and service indicators” on page 97. Proceed to remove the disk drive. Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 24. 2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. Important: When you press Enter, after a one minute delay, the service indicator comes on and begins to flash rapidly. You then have nine seconds to remove the disk drive. Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on (solid) and does not flash during this procedure. 3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. Note: The service indicator stops flashing and remains off when the disk drive is removed. 4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93. Figure 24. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 51 Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel. v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure for replacing a disk drive for your operating system. v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the procedure. Removing a disk drive using hot-spare in an IBM i system or logical partition from the 8231-E2B, 8231-E1C, or 8231-E2C You can use the hot-spare function to replace a disk drive in a system or expansion unit while the IBM i operating system or logical partition that controls the disk drive location is powered on. Preparing the system 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. Remove the front cover and door, if you need to, to access the disk drives. For instructions on removing the cover, see Removing and replacing covers and doors. 3. Locate the disk drive to be replaced, and record the location information. Then, use the location information to activate the identify light for that disk drive. For instructions, see Identifying a failing part in an IBM i system or logical partition Preparing to remove a disk drive by using the HSM Perform the following tasks to remove a disk drive by using the Hardware Service Manager (HSM): 1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure, then return here and perform the following steps. 2. Type strsst on the command line of the IBM i session and press Enter. 3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display, and press Enter. 4. Select Work with disk units from the System Service Tools display, press Enter. 5. Select Display Disk Configuration from the Work with Disk Units display, and press Enter. 6. Select Display non-configured units from the Display Disk Configuration display, and press Enter. 7. Is the failed drive a non-configured unit? v No: Go to “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on” on page 69. v Yes: Determine if the disk drive to be removed was replaced by a hot spare when it failed. A xxxx9031 error logged close to the same time of the original disk failure indicates the automatic Rebuild of the data to a hot spare device, then go on to the next step. 8. Is the disk drive to be removed controlled by the load source adapter? v No: Go to step 11 on page 53. v Yes: Go to the next step. 9. Is the disk drive to be removed located in a valid load source position? v No: Go to Configuring a disk drive on a load source adapter for hot-spare using the IBM i operating system. v Yes: Go to step 11 on page 53. 52 Disk drives 10. Choose the concurrent maintenance option. This option fills in the disk drive location on the concurrent maintenance display. Go to step 13 on page 54. 11. Select Device Concurrent Maintenance from the Hardware Service Manager display. Press Enter. The Device Concurrent Maintenance display is shown, as shown in the following figure. 12. If the physical location does not appear in the Device Concurrent Maintenance display, type the physical location where you will be replacing the disk drive as shown in the preceding example, Figure 25. The physical location has the following form: U787A.001.AAAXXXX-P3-D4 where P3-D4 is the position of the disk drive in the system unit or expansion unit. You can get the U787A.001.AAAXXXX location information from the control panel display on the system unit or expansion unit. If you cannot get the information from the control panel, find the feature code (FC) and sequence number (SN or SEQ) on the label that appears on the front of the system unit or expansion unit as shown in the following figure. Device Concurrent Maintenance Type the choices, then press Enter. Specify either physical location or resource name. Physical location: U787A.001.1234567-P3-D4 or Device resource name: Specify action as 1=Remove device 2=Install device Action to be performed . . . . . . . .: 1 Enter a time value between 00 and 19. Time delay needed in minutes . . . . .: 01 F3=Exit F5=Refresh F12=Cancel Figure 25. Example Device Concurrent Maintenance display Figure 26. Example system or expansion unit label Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 53 In this example, U787A.001.AAAXXXX-P3-D4, U787A.001 comes from the feature code (FC) on the label, AAAXXXX comes from the last seven digits of the sequence number (SN or SEQ) on the label, and P3–D4 is the slot where you are replacing the disk drive as shown in “Disk drive locations and service indicators” on page 97. In the example, U5074.001.AAAXXXX-DB3-D32, you would replace a disk drive in the expansion unit with type 5074, model 001, sequence number AAAXXXX, and disk drive position DB3–D32. 13. Select option 1 (Remove device) for the Action to be performed. 14. Set the time delay for one minute: 01. Important: Do not press Enter at this time. Continue to remove the disk drive from your system of expansion unit. Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 27. 2. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. 3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93. Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel. v You can proceed to install a disk drive as a replacement to the failed disk drive. For instructions, see “Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61. Figure 27. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 54 Disk drives v You can proceed to install a disk drive filler into the slot from where you removed the disk drive. For instructions, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90. Removing a SCSI disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C Learn how to remove a Small Computer System Interface (SCSI) disk drive from a system or an expansion unit. Preparing to remove the SCSI disk drive by using the HSM Perform the following tasks by using the Hardware Service Manager (HSM): 1. Log in as root user on the Hardware Service Manager (HSM). 2. At the command line, type diag and press Enter. 3. Press Enter to continue. 4. On the Function Selection display, select Task Selection. 5. On the Task Selection display, select Hot Plug Task. 6. Select SCSI and SCSI RAID Hot Plug Manager. 7. Select Identify a Device Attached to a SCSI Hot Swap Enclosure Device. 8. Select the disk drive you want to replace and press Enter. The disk drive slot enters the Identify state, indicated by the flashing service indicator. To view the locations of the service indicator, see “Disk drive locations and service indicators” on page 97. 9. Exit to the SCSI and SCSI RAID Hot Plug Manager display. 10. On the SCSI and SCSI RAID Hot Plug Manager display, select Replace/Remove a Device Attached to a SCSI Hot Swap Enclosure Device. A list of all slots from which you can remove a disk drive is shown. 11. Select the slot that contains the disk drive you want to remove. The disk drive slot enters the Remove state and power is removed from the slot. The service indicator for the slot begins to flash rapidly. Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 28 on page 56. 2. Press Enter and when the service indicator starts flashing, then squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. Important: When you press Enter, after a one minute delay, the service indicator comes on and begins to flash rapidly. You then have nine seconds to remove the disk drive. Note: If you are removing a disk drive from the 5887 expansion unit, the service indicator is on (solid) and does not flash during this procedure. Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 55 3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. Note: The service indicator stops flashing and remains off when the disk drive is removed. 4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93. Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel. v You can proceed to install a disk drive as a replacement to the failed disk drive, see “Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61 and select the procedure for replacing a disk drive for your operating system. v You can proceed to install a disk drive filler into the slot from where you removed the disk drive, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90 for the procedure. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off Learn to remove a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or logical partition that controls the disk drive location with the power turned off. Attention: If you are installing a new or upgraded disk drive with the power turned off, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. Figure 28. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 56 Disk drives DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Preparing the system 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. If you cannot start the system or logical partition that controls the disk drive location, or if the system or logical partition is not powered on, go to step 6 on page 58. 3. Identify the disk drive to be removed and record the location information. For information, see Identifying a part. 4. Prepare to remove the disk drive from the system or expansion unit as follows: v If the system or logical partition that controls the disk drive location is running AIX, follow the steps in “Preparing to remove a disk drive from a system or expansion unit controlled by AIX” on page 105. v If the system or logical partition that controls the disk drive location is running IBM i, follow the steps in “Preparing to remove a disk-drive from a system or logical partition controlled by IBM i” on page 116. Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 57 v If the system or logical partition that controls the disk drive location is running Linux and the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. 5. Stop the system. For instructions, see Stopping the system or logical partition. 6. If you stopped the logical partition, do not disconnect the power source. 7. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position. 8. Remove the service access cover. For instructions, see Removing the service access cover from a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 9. Disconnect the power source from the system by unplugging the system. For instructions, see Disconnecting the power cords. Note: This system can be equipped with a second power supply. Disconnect it before continuing with this procedure. Ensure that the power source to the system has been disconnected. (L003) or 10. Attach the wrist strap. 58 Disk drives Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C 1. Unlock the disk drive handle by squeezing it and pulling it out toward you, as shown in Figure 29. 2. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive. If the handle is not all the way out, the disk drive cannot slide out of the system or expansion unit. 3. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 4. If the removed disk drive has a bezel attached to it, you can remove the disk drive bezel. For instructions, see “Removing a disk drive bezel from a disk drive or filler” on page 93. Important: If the disk drive bezel does not have a thumbscrew then the bezel is not removable and the disk drive is a SAS disk drive. The replacement SAS disk drive comes with an preinstalled bezel. v You can proceed to install a disk drive as a replacement to the failed disk drive. For instructions, see “Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C” on page 61. Figure 29. Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C Removing a disk drive or solid-state drive from the 8231-E2B, 8231-E1C, or 8231-E2C 59 v You can proceed to install a disk drive filler into the slot from where you removed the disk drive. For instructions, see “Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 90. 60 Disk drives Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C Learn how to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in the server or expansion unit. Note: These instructions can also be used if you are replacing an SAS disk drive. To remove and replace a disk drive in a different enclosure, use the information for that enclosure. Use this procedure to remove a failed disk drive and replace it with the same type of disk drive. If you plan to remove a disk drive and install a different type of disk drive, remove the existing disk drive. The system or logical partition can be turned off or turned on when you replace the disk drive. If the disk drive that is being replaced is in the AIX, IBM i or Linux root volume group (rootvg) and does not have disk protection, or if it is a PCIe RAID adapter with integrated solid-state disk drives, then replace the disk drive using the powered-off procedure. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on Learn to replace a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or an expansion unit while the power to the AIX operating system or logical partition that controls the disk drive location is turned on. To complete this procedure, you must have already completed the removal procedure. For the removal procedure instructions, see “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on” on page 37. If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the AIX system or logical partition power turned on” on page 8. If you are replacing a disk drive as part of a service procedure, continue to use the following procedure. © Copyright IBM Corp. 2010, 2011 61 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Attention: This procedure is a complex set of steps that must be performed exactly to avoid potential data loss or system damage. Alternatively, you can use the procedure, which is a simpler task that involves less risk of data loss or system damage. If the system or logical partition that controls the disk drive you are replacing is at the AIX command line, you can use this power turned on procedure. If the disk drive that you are replacing is in the root volume group (rootvg) and it is not protected by either Redundant Array of Independent Disks (RAID) or mirroring, or to use the powered-off procedure, go to “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 77. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Preparing the system 1. Perform prerequisite tasks as described in Before you begin. 2. Prepare to remove the disk drive from the system or expansion unit as described in “Preparing to remove a disk drive from a system or expansion unit controlled by AIX” on page 105. 62 Disk drives 3. Place the system into the service position, as described in Placing a rack-mounted model into the service position. 4. Remove the service access cover, as described in Removing the service access cover from a rack-mounted model. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the internal disk drives to the correct locations. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive halfway into the system or expansion unit. 10. Press Enter on the console. 11. When the service indicator starts flashing, slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 30 on page 64. Note: v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. v The light stops flashing and remains on when you have installed the disk drive properly. Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 63 Completing the disk drive replace by using the HSM Perform the following tasks by using the Hardware Service Manager (HSM): 1. Perform the following steps using the HSM after you have replaced the disk drive: 2. Press Enter. The disk drive slot exits the Remove state and enters the Normal state. 3. Exit to the RAID Hot-Plug Devices menu. Press F3 to return. 4. Exit to the Task selection display. 5. Select Log Repair Action. 6. Select the disk drive that you replaced and then press Enter. 7. Select Commit after you have made your selection and then press Enter. 8. Exit to the command line. Completing the procedure 1. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. Figure 30. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 64 Disk drives 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Replacing a disk drive using Linux kernel 2.6, with the Linux system or logical partition power turned on Learn to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or expansion unit while the power to the Linux operating system or logical partition that controls the disk drive location is turned on. This procedure uses the Linux kernel 2.6. You must have removed the failed disk drive before you attempt to replace it. For the procedure to remove a disk drive using Linux kernel 2.6, see “Removing the disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C using Linux kernel 2.6 with the Linux system or logical partition power turned on” on page 41. If you are installing a new or upgraded disk drive, see “Installing a disk-drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the Linux system or logical partition power turned on” on page 13. If you are replacing a disk drive as part of a service procedure, continue to use the following procedure. Attention: Read these instructions carefully and understand the complexity of the powered-on procedure before you begin. If the system or logical partition that controls the disk drive you are replacing is at the Linux command line, you can use this powered-on procedure. Go to “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 77 if: v You are installing a disk drive that is to be connected with the 6203 or 6204 SCSI adapter v You do not feel comfortable using the powered-on procedure Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 65 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Preparing the system to replace a disk drive 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. Is the disk drive you are replacing in a redundant array of independent disk (RAID) 0 array or is the disk drive you are replacing in a failed RAID level 5 or 10 array? v If yes, you must delete the disk array before replacing the disk. For instructions, see PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. v If no, your drive is configured as Just a Bunch of Disks (JBOD) or is configured within a degraded array. Proceed to the next step. 3. If the disk drive you are replacing is in a redundant array of independent disk (RAID) Level 0 disk array or in a failed RAID Level 5 or RAID Level 10 disk array, you must delete the disk array before replacing the disk. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. 66 Disk drives Preparing to replace a disk drive by using the HSM Perform the following steps to replace a disk drive by using the Hardware Service Manager (HSM): 1. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 2. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter. A Concurrent Device Add display like the following sample is shown. 3. Type option1 (Select) next to the location from which you removed the disk drive. The Verify Device Concurrent Add display is shown. 4. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add display is shown. Proceed to install the replacement disk drive. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the internal disk drives to the correct locations. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive halfway into the system or expansion unit. Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 31. Example Concurrent Device Add display Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 67 10. Press Enter on the console. 11. When the service indicator starts flashing, slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 32. Note: v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. v The light stops flashing and remains on when you have installed the disk drive properly. Completing the disk drive replace by using the HSM Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is installed. Completing the procedure 1. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. Figure 32. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 68 Disk drives 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on Learn how to replace a Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or expansion unit while the power of the IBM i operating system or logical partition that controls the disk drive location is turned on. To complete this procedure you must have removed the disk drive. For the procedure to remove the disk drive, see “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C or an expansion unit controlled by IBM i” on page 47. If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on” on page 25. If you are removing a disk drive as part of a service procedure, continue to use the following procedure. Attention: The powered-on procedure is a complex set of steps that must be performed exactly to avoid potential data loss or system damage. Alternatively, you can use the powered-off procedure, which is a simpler task that involves less risk of data loss or system damage. For instructions, see “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 77. Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 69 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Preparing to replace the disk drive 1. Return to the Hardware Service Manager (HSM) console and wait until the Concurrent Maintenance Results display is shown. 2. Press F12. 3. The physical locations you entered while removing the disk drive might still show on the display. If not, retype the physical location where you want to install the new disk drive. In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with type 9406, Model 520, sequence number 1234567, and disk drive position P3–D4. In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit with type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32. 4. Select option 2 (Install device) for the Action to be performed. 5. Set the time delay for one minute:01. 70 Disk drives Important: Do not press Enter at this time. Proceed to replace the disk drive. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the internal disk drives to the correct locations. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive halfway into the system or expansion unit. 10. Press Enter on the console. 11. When the service indicator starts flashing, slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 33 on page 72. Note: v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. v The light stops flashing and remains on when you have installed the disk drive properly. Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 71 Completing the disk drive replace by using the HSM 1. Return to the console and wait until the Concurrent Maintenance Results display is shown. Press Enter. 2. If you return to the Service Action Log display, exit the service action log. 3. When the Hardware Service Manager display is shown, press F3. Completing the procedure 1. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Figure 33. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 72 Disk drives Replacing a disk drive in an 8231-E2B, 8231-E1C, or 8231-E2C using hot-spare in an IBM i system or logical partition You can use the hot-spare function to replace a disk drive in a system or expansion unit while the IBM i operating system or logical partition that controls the disk drive location is powered on. To complete this procedure, you must have already completed the removal procedure. For the removal procedure, see “Removing a disk drive using hot-spare in an IBM i system or logical partition from the 8231-E2B, 8231-E1C, or 8231-E2C” on page 52. If you are replacing a disk drive as part of a service procedure, continue to use the following procedure. Preparing to replace the disk drive by using the HSM Perform the following tasks by using the Hardware Service Manager (HSM): 1. Continue with the replacing of the disk drive after you have removed the disk drive. 2. Return to the console and wait until the Concurrent Maintenance Results display is shown. 3. Press F12. 4. The physical locations you entered while removing the disk drive might still appear on the display. If not, retype the physical location where you are installing the disk drive. In the example, U787A.001.1234567-P3-D4, you would install a disk drive in the system unit with type 9406, Model 520, sequence number 1234567, and disk drive position P3–D4. In the example, U5074.001.1234567-DB3-D32, you would install a disk drive in the expansion unit with type 5074, model 001, sequence number 1234567, and disk drive position DB3–D32. 5. Select option 2 (Install device) for the Action to be performed. 6. Set the time delay for one minute:01. Important: Do not press Enter at this time. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the internal disk drives to the correct locations. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive halfway into the system or expansion unit. 10. Press Enter on the console. Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 73 11. When the service indicator starts flashing, slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 34. Note: v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. v The light stops flashing and remains on when you have installed the disk drive properly. Completing the procedure by using the HSM 1. Return to the HSM console and wait until the Concurrent Maintenance Results display is shown, and press Enter if you used the concurrent maintenance process above, otherwise continue with the next step. 2. If you return to the Hardware Service Manager (HSM), exit the HSM. 3. Select Work with disk units from the Start System Service Tools display. 4. Select Non-configured disk units from the Work with Disk Units display, and press Enter. 5. Is the new disk unit displayed with a status of non-configured? Note: It may take several minutes for the new disk drive to display. v No: Contact your next level of support. v Yes: Continue with the next step. 6. Press F3 to return to the Work with Disk Units display. 7. Select Work with disk unit recovery from the Work with Disk Units display, and press Enter. 8. Select Disk unit problem recovery procedure, and press Enter. 9. Select Initialize and format disk unit, and press Enter. Figure 34. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 74 Disk drives 10. Select the new disk and press Enter. Note: This will take several minutes to complete. 11. When the disk drive is initialized and formatted, press F3 to return to the Work with Disk Units display. 12. Select Start hot spare, and press Enter. 13. Select the IOA with the new disk, then press Enter and press Enter again. 14. Does the new hot spare disk drive have a smaller capacity than previous hot spare disk drive? Refer to the resource name identified in the xxxx9031 error which was logged close to the same time of the original disk failure. v No: The procedure is complete. Press F3 to return to the System service tools display. v Yes: Go to the next step. 15. Select Display Device Parity Status, and press Enter. If the new disk drive is not active and hot spare protected, do not continue. Call your next level of support. Restriction: You must complete the following steps to ensure your data is protected. The following steps will allow you to exchange the two disk drives so that the larger disk drive will become the hot spare, and the smaller disk drive will become a parity set member. 16. Fail the previous hot spare disk drive by unlocking the disk drive handle by squeezing it and pulling it out toward you. 17. Pull the drive out about one inch and wait ten seconds. 18. Push the disk drive back into the slot and close the handle. 19. Return to step 1 on page 74 and start hot spare on the larger drive. Replacing a SCSI disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C Learn how to replace a Small Computer System Interface (SCSI) disk drive in a system or expansion unit. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the internal disk drives to the correct locations. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive halfway into the system or expansion unit. 10. Press Enter on the console. Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 75 11. When the service indicator starts flashing, slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 35. Note: v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. v The light stops flashing and remains on when you have installed the disk drive properly. Completing the disk drive replace by using the HSM Perform the following steps using the Hardware Service Manager (HSM): 1. Press Enter. 2. Exit to the SCSI and SCSI RAID Hot-Plug Manager menu. 3. Select Configure Added/Replaced Devices. Press Enter. 4. Select the disk drive that you replaced and then press Enter. The disk drive is now configured. 5. Exit to the Task Selection display. 6. Select Log Repair Action. 7. Select the disk drive that you replaced and then press Enter. 8. Select Commit after you have made your selection and then press Enter. 9. Exit to the command line. Completing the procedure 1. Verify the installed part: Figure 35. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 76 Disk drives v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 2. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 3. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off Learn to replace a failed Small Computer System Interface (SCSI) or serial-attached SCSI (SAS) disk drive in a system or logical partition that controls the disk drive location with the power turned off. To complete this procedure, you must have already completed the removal procedure. For the removal procedure, see “Removing a disk drive from the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 56. If you are installing a new or upgraded disk drive, see “Installing a disk drive or solid-state drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the system or logical partition power turned off” on page 31. If you are replacing a disk drive as part of a service procedure, continue to use the following procedure. Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 77 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C 1. Use the service indicator light-emitting diodes (LEDs) to help identify the part. For information, see Identifying a part. 2. Use the location information that you recorded while removing the disk drives to install or replace the internal disk drives to the correct locations. 3. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. 4. Remove the disk drive from the static-protective package. 5. If the slot you want to use contains a disk drive filler, remove the disk drive filler from the slot. For instructions, see “Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 89. 6. If the disk drive you are installing does not have a bezel preinstalled, then perform the tasks to install a bezel on the disk drive, then continue with this procedure from step 7. For instructions to install a bezel, see “Installing a disk drive bezel in a disk drive or filler” on page 94. 7. Unlock the disk drive handle by squeezing it and pulling it out toward you. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 78 Disk drives 8. Hold the disk drive by the top and bottom edges as you position the drive, and insert it into the system or the expansion unit. Do not hold the disk drive by the handle. 9. Slide the disk drive halfway into the system or expansion unit. 10. Press Enter on the console. 11. When the service indicator starts flashing, slide the disk drive all the way into the system or expansion unit, and push the disk drive handle (A) in until it locks, as shown in Figure 36. Note: v It is important to ensure that when installing a disk drive, that the drive is fully seated and all the way into the system. v The light stops flashing and remains on when you have installed the disk drive properly. Completing the procedure 1. Replace the service access cover. For instructions, see Installing the service access cover on a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 2. Connect the power cords to the system. For instructions, see Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C. 3. Place the system in the operating position. For instructions, see Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position. 4. Start the system or logical partition. For instructions, see Starting the system or logical partition. 5. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. Figure 36. Installing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C Replacing a disk drive or solid state drive in the 8231-E2B, 8231-E1C, or 8231-E2C 79 6. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 7. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. 80 Disk drives Installing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system Learn to install an external SAS port as a part of upgrading your disk drive backplane or to enable internal disk sharing. If you want to replace a failed SAS port, see “Replacing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 87. Perform the tasks in this procedure to install an SAS port in the system. Perform the prerequisite tasks as described in Before you begin. Perform the following steps to connect the SAS cable to the external SAS connection port on the server system backplane: 1. Stop the system. For instructions, see Stopping the system or logical partition. 2. Disconnect the power source from the system by unplugging the system, as described in Disconnecting the power cords. (L003) or 3. Attach the wrist strap. © Copyright IBM Corp. 2010, 2011 81 Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 4. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position. 5. Remove the service access cover. For instructions, see Removing the service access cover from a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 6. Disconnect all PCI adapters that are attached to the system. For instructions, see Removing a PCI adapter. 7. If you are installing into a new system, remove the filler from the external SAS connection port on the disk drive backplane. 8. Connect the SAS shroud of the SAS cable to the external SAS connection port (C) at the rear of the system backplane until it snaps in place, see Figure 37. 9. Lay the cable (A) beside the side wall and slide it towards the SAS connection port on the disk drive backplane. 10. Push the SAS shroud (B) into the SAS connection port on the disk drive backplane until it snaps in place. 11. Connect all PCI adapters. For instructions, see Replacing a PCI adapter. 12. Place the system in the operating position. For instructions, see Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position. Figure 37. Installing the SAS cable 82 Disk drives 13. Replace the service access cover. For instructions, see Installing the service access cover on a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 14. Connect the power cords to the system. For instructions, see Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C. 15. Start the system or logical partition. For instructions, see Starting the system or logical partition. 16. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. You can now attach other supported SAS features to the system. Installing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system 83 84 Disk drives Removing the external SAS port from the 8231-E2B, 8231-E1C, or 8231-E2C system Learn to remove a failed SAS port from the disk drive backplane as part of a service procedure. Perform the prerequisite tasks as described in Before you begin. Perform the following steps to remove the SAS cable from the external SAS connection port on the server system backplane: 1. Stop the system. For instructions, see Stopping the system or logical partition. 2. Disconnect the power source from the system by unplugging the system. For instructions, see Disconnecting the power cords. Note: This system can be equipped with a second power supply. Disconnect it before continuing with this procedure. Ensure that the power source to the system has been disconnected. (L003) or 3. Attach the wrist strap. © Copyright IBM Corp. 2010, 2011 85 Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 4. Place the system into the service position. For instructions, see Placing a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position. 5. Remove the service access cover. For instructions, see Removing the service access cover from a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 6. Disconnect all PCI adapters. For instructions, see Removing a PCI adapter. 7. Press the side tab (B) and pull out the SAS shroud from the external SAS connection port on the disk drive backplane. See Figure 38. 8. Pull out the SAS cable (C) from the system. 9. Push the SAS connection port down (A) and then dislodge the SAS cable (C) out of the system backplane. v You can continue with the procedure to replace the SAS port. See “Replacing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system” on page 87. v If you are not replacing the SAS port, you can install a filler into the external SAS connection port on the disk drive backplane. Figure 38. Removing the external SAS port 86 Disk drives Replacing the external SAS port in the 8231-E2B, 8231-E1C, or 8231-E2C system Learn to replace an external SAS port as part of a service procedure. Perform the prerequisite tasks as described in Before you begin. Attention: You must have removed the failed external SAS port before you can replace it. Perform this procedure to replace the failed SAS port on the system backplane: 1. Attach the wrist strap. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 2. Connect the SAS shroud of the SAS cable to the external SAS connection port (C) at the rear of the system backplane until it snaps in place. See Figure 39 on page 88. 3. Lay the cable (A) beside the side wall and slide it toward the SAS connection port on the disk drive backplane. 4. Push the SAS shroud (B) into the SAS connection port on the disk drive backplane until it snaps in place. © Copyright IBM Corp. 2010, 2011 87 5. Connect all PCI adapters. For instructions, see Replacing a PCI adapter. 6. For a rack-mounted system, place the system into the operating position as described in Placing a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position. 7. Replace the service access cover. For instructions, see Installing the service access cover on a rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system. 8. Connect the power source to the system by plugging the power cords. For instructions, see Connecting the power cords. 9. Start the system or logical partition. For instructions, see Starting the system or logical partition. 10. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. You can now attach other supported SAS features to the system. Figure 39. Replacing the SAS cable 88 Disk drives Removing or installing a disk drive filler Learn how to remove or install a Small Computer System Interface (SCSI) disk drive filler in the server or expansion unit. Note: Refill the disk drive slots with either another disk drive or a disk drive filler. Filling the disk drive slot ensures proper air flow for cooling and maintains an optimal EMI compliance. Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C system Learn how to remove a Small Computer System Interface (SCSI) disk drive filler from the server or expansion unit. When you are installing a disk drive in a system or an expansion unit, the slot you want to use can contain a disk drive filler. To remove the filler before installing a disk drive in the slot, complete the following steps: 1. Remove the system unit door, expansion unit door, or open the rack front door. 2. Identify the slot that contains the disk drive filler. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 3. Hold the filler handle (A) and pull out the disk drive filler from the slot, as shown in Figure 40 on page 90. © Copyright IBM Corp. 2010, 2011 89 Return to the procedure that you brought you here. Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C system You might need to install a Small Computer System Interface (SCSI) disk drive filler in the server or expansion unit. When you remove a disk drive from a system or an expansion unit, you can install a disk drive filler in the slot if you are not installing a replacement disk drive. To install a disk drive filler in a system or an expansion unit, complete the following steps: 1. Hold the disk drive filler by the top and bottom edges as you position the filler, and insert it into the system or the expansion unit. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 2. Slide the disk drive filler all the way into the system or expansion unit, and push the disk drive filler handle (A) until it locks, as shown in Figure 41 on page 91. Figure 40. Removing a disk drive filler from the 8231-E2B, 8231-E1C, or 8231-E2C 90 Disk drives Return to the procedure that you brought you here. Figure 41. Installing a disk drive filler in the 8231-E2B, 8231-E1C, or 8231-E2C. Removing or installing a disk drive filler 91 92 Disk drives Removing or installing a disk drive bezel Learn how to remove or install the disk drive bezel from and into a disk drive. Removing a disk drive bezel from a disk drive or filler Learn how to remove a disk drive bezel from a disk drive or a disk drive filler. Learn to remove the disk drive bezel from either a failed disk drive that had a bezel installed in it or from the disk drive filler in the slot where you are replacing the failed disk drive. The removed disk drive bezel must be attached in the replacement disk drive or a disk drive filler before installing the disk drive or filler in the system or expansion unit. To install a disk drive filler, see “Installing a disk drive bezel in a disk drive or filler” on page 94. Perform the steps in this procedure only if the disk drive or the disk drive filler has a disk drive bezel attached to it. Important: If the disk drive bezel does not have a thumbscrew, it is not removable and is a SAS disk drive. Note: Some of the figures in this procedure might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. 1. From the removed disk drive or disk drive filler, gently pull off the disk drive bezel latch (A) from the front of the filler, as shown in Figure 42 on page 94. 2. Loosen and then pull out on the thumbscrew (C) to release the bezel (B) from the front edge of the filler. © Copyright IBM Corp. 2010, 2011 93 3. Retain the removed disk drive bezel safely and use the removed bezel to attach it to the replacement disk drive or a disk drive filler. Return to the procedure that brought you here. Installing a disk drive bezel in a disk drive or filler Learn how to install a disk drive bezel into a disk drive. You must have removed the disk drive bezel from either a failed disk drive that had a bezel installed in it or from the disk drive filler. To remove a disk drive filler, see “Removing a disk drive bezel from a disk drive or filler” on page 93. Perform the steps in this procedure only if the disk drive or the disk drive filler removed had a disk drive bezel attached to it. Note: If the disk drive bezel does not have a thumbscrew, it is not removable and is a SAS disk drive. The replacement SAS disk drive comes with an attached bezel. 1. Attach the disk drive bezel (A) to the disk drive or to a disk drive filler. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 2. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive or filler. Figure 42. Removing a disk drive bezel 94 Disk drives 3. Install the disk drive bezel latch (C) onto the front of the disk drive or filler, as shown in Figure 43. Return to the procedure that brought you here. Figure 43. Attaching a bezel to a disk drive filler Removing or installing a disk drive bezel 95 96 Disk drives Disk drive locations and service indicators Find out where the disk drives and service indicators are located on the system or an expansion unit. Learn about the disk drive locations and location of the service indicators on the system or expansion unit. PCIe RAID and SSD SAS adapter locations Learn about the PCIe RAID and SSD SAS adapter locations and the location of the service indicators on the PCIe RAID and SSD SAS adapter. The figure Figure 44 shows the PCIe RAID and SSD SAS adapter locations. Disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C system Learn about the disk drive locations and the location of the service indicators on the system. The following figures show the disk drive locations and service indicators for the 8231-E2B, 8231-E1C, or 8231-E2C. Service indicators are located above the latch handle on the disk drives. Figure 44. PCIe RAID and SSD SAS adapter locations © Copyright IBM Corp. 2010, 2011 97 Figure 45. Disk-drive locations for systems with six small-form factor (SFF) bays Figure 46. Disk-drive locations for systems with three SFF bays (optional) Figure 47. Service indicators locations for systems with six SFF bays 98 Disk drives Disk drive locations and service indicators for the 5886 disk-drive enclosure Learn about the disk drive locations and where the service indicators are located for your disk-drive enclosure. The following figure shows the disk drive locations for the 5886 disk-drive enclosure. Figure 48. Service indicators locations for systems with three SFF bays (optional) Figure 49. Disk drive locations for the 5886 disk-drive enclosure Disk drive locations and service indicators 99 The following figure shows the locations of the disk drive service indicators for the 5886 disk-drive enclosure. Disk-drive locations and service indicators for the 5887 disk-drive enclosure Learn about the disk-drive locations and where the service indicators are located for the 5887 disk-drive enclosure. The following figures show the front view and rear view of the 5887 disk-drive enclosure and the disk drive locations in the 5887 enclosure. Figure 50. Front view of disk-drive enclosure service indicators for the 5886 Figure 51. Front view of the 5887 disk-drive enclosure showing the disk-drive locations 100 Disk drives The following figure shows the locations of the disk-drive service indicators for the 5887 disk-drive enclosure. Disk drive locations and service indicators for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures Learn about the disk drive locations and where the service indicators are located for your disk-drive enclosure. The availability of disk drive locations depends on the number of SCSI interface cards installed in the SCSI disk-drive enclosure. If the next available disk drive position is the first slot in a disk-drive enclosure, such as D21, D26, or D46, look at the disk drive controller to verify that the enclosure is connected to a SCSI bus port. The controller end of the cable is clearly labeled with the disk-drive enclosure to which it connects. If the enclosure is not connected to a disk drive controller, you must either install the disk drive in a different disk-drive enclosure or connect the enclosure to a SCSI bus port on a disk drive controller. You might need to add a disk drive controller before you can add disk drives to the expansion unit. Figure 52. Rear view of the 5887 disk-drive enclosure Figure 53. Front view of the 5887 disk-drive enclosure showing the service indicators Disk drive locations and service indicators 101 The following figures show the disk drive locations for the front and rear of the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures. The following figures show the locations of the disk drive service indicators (A) for the 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures. Figure 54. Front view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures Figure 55. Rear view of 5786, 5787, 7031-D24, and 7031-T24 SCSI disk-drive enclosures 102 Disk drives Disk-drive locations and service indicators for the 5802 expansion unit Learn about the disk-drive locations and where the service indicators are located for your expansion unit. The following figure shows the disk-drive locations for the 5802. Service indicators are located at the bottom of the latch handle on each disk drive. Figure 56. Front view of service indicators for 5786, 5787, 7031-D24, and 7031-T24 Figure 57. Rear view of service indicators for 5786, 5787, 7031-D24, and 7031-T24 Disk drive locations and service indicators 103 Related information Cabling the 5802 disk subsystem Figure 58. Disk drive locations 104 Disk drives Related information Locate additional disk-drive related information and instructions you might need. Preparing a disk drive for use in an AIX system or logical partition Configure the newly installed disk drive for your environment. If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference Guide for AIX. If the disk drive is attached to another controller, see the AIX System Management Guide: Operating System and Devices. These guides are available at the System p and AIX Information Center. (http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp) Preparing to remove a disk drive from a system or expansion unit controlled by AIX Learn the actions you must take to properly remove a disk drive. Before removing a disk drive from a system or expansion unit that is controlled by AIX, ensure that all data on that disk drive or the array that contains the drive is backed up and removed from the disk drive and that the disk drive is in the Defined state. If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference Guide for AIX. If the disk drive is attached to another controller, see the information for that controller or the AIX System Management Guide: Operating System and Devices. These guides are available at the System p and AIX Information Center. Rebuilding data on a replacement disk drive using AIX To rebuild data on the replacement disk drive, refer to the information for the controller to which the disk drive is attached. v If the disk drive is attached to a PCI-X SCSI RAID controller, see the PCI-X SCSI RAID Controller Reference Guide for AIX. v If the disk drive is attached to a PCI SCSI RAID adapter (feature code 2498), see the PCI 4–Channel Ultra3 SCSI RAID Adapter Reference Guide. For more information, see the AIX System Management Guide: Operating System and Devices.These guides are available at the System p and AIX Information Center. Removing a disk drive using Linux kernel 2.6 Learn how to remove a disk drive in a location that is controlled by a system or logical partition that is running Linux kernel 2.6. Preparing to remove the disk drive 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown. © Copyright IBM Corp. 2010, 2011 105 4. Select View most recent ipr error messages from the Kernel Messages Log display. Press Enter. 5. Find the entry in the log for the disk drive you want to replace. 6. Record the location information for the disk drive. Note: The location information has the form of 2:0:8:0. In this example, 2 is the SCSI host number, 0 is the SCSI bus, 8 is the SCSI target ID, and 0 is the LUN (logical unit). 7. Return to the command line. 8. Type the following command: ls -ld /sys/class/scsi_host/host#/device Where # is the SCSI host number. Press Enter. 9. Record the PCI location information. Note: The PCI location information has the form of 61:01:0:2. 10. Type iprconfig on the command line and press Enter. The IBM Power RAID Configuration Utility display is shown. 11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter. The Display Hardware Status display is shown. Kernel Messages Log Select one of the following: 1. View most recent ipr error messages 2. View ipr error messages 3. View all kernel error messages 4. View iprconfig error messages 5. Set root kernel message log directory 6. Set default editor 7. Restore defaults 8. View ipr boot time messages Selection: e=Exit Figure 59. Kernel Messages Log 106 Disk drives 12. Look for the disk drive at the PCI location you recorded. The disk drive might have a Failed status. 13. If the disk drive you want to replace is unprotected or in use, move the data from the disk drive before continuing with this procedure. For information, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. 14. Type option 1 (Display hardware resource information details) next to the disk drive you want to replace. Press Enter. The Disk Hardware Resource Information Details is displayed like the shown in the following sample. Display Hardware Status Type option, press Enter. 1=Display hardware resource information details OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ---------------- 0000:01:01.0.0/ PCI-X SCSI Adapter Operational 0000:41:01.0.1/ PCI-X SCSI Adapter Operational sda 0000:41:01.0.1/0:3:0 Physical Disk Active sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sdc 0000:41:01.0.1/0:8:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdf 0000:41:01.0.1/1:5:0 Physical Disk Active 0001:61:01.0.2/ PCI-X SCSI RAID Adapter Operational sdg 0001:61:01.0.2/0:3:0 Physical Disk Active 0001:61:01.0.2/0:6:0 Advanced Function Disk Active sdi 0001:61:01.0.2/0:9:0 Physical Disk Active sdh 0001:61:01.0.2/255:0:0 RAID 10 Disk Array Failed 0001:61:01.0.2/0:4:0 RAID 10 Array Member Failed 0001:61:01.0.2/0:5:0 RAID 10 Array Member Failed e=Exit q=Cancel r=Refresh t=Toggle Figure 60. Example Display Hardware Status Related information 107 15. Record the physical location information. 16. Return to the IBM Power RAID Configuration Utility display. Removing the disk drive 1. If you are removing a solid state device located on a PCIe RAID and SSD SAS adapter, go to Removing and replacing the PCIe RAID and SSD SAS adapterRemoving and replacing the PCIe RAID and SSD SAS adapter and complete that procedure, then return here and perform the following steps. 2. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 3. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A Concurrent Device Remove display is shown, like the following sample. Disk Unit Hardware Resource Information Details Manufacturer . . . . . . . . . . . . . . : IBM Product ID . . . . . . . . . . . . . . . : ST336607LC Firmware Version . . . . . . . . . . . . : 43353048 (C50H) Serial Number. . . . . . . . . . . . . . : 00006719 Capacity . . . . . . . . . . . . . . . . : 36.40 GB Resource Name. . . . . . . . . . . . . . : /dev/sdd Physical location PCI Address. . . . . . . . . . . . . . . : 0001:50:01.0 SCSI Host Number . . . . . . . . . . . . : 2 SCSI Channel . . . . . . . . . . . . . . : 0 SCSI Id. . . . . . . . . . . . . . . . . : 4 SCSI Lun . . . . . . . . . . . . . . . . : 0 Extended Details FRU Number . . . . . . . . . . . . . . . : 00P2676 More... Press Enter to Continue e=Exit q=Cancel f=PageDn b=PageUp Figure 61. Example Disk Hardware Resource Information Details display 108 Disk drives 4. Type option 1 (Select) next to the location for the disk drive you want to replace. Press Enter. 5. The Verify Device Concurrent Remove display is shown. The service indicator turns on for that disk drive slot. . 6. On the Verify Device Concurrent Remove display, verify that the selected disk drive is the disk drive you want to replace, then press Enter. The identify light turns on for the disk drive. Note: Ensure that the disk drive is not in use to prevent loss of data. 7. The Device Concurrent Remove display is shown. Attention: v Attach a wrist strap to an unpainted metal surface of your hardware to prevent electrostatic discharge (ESD) from damaging your hardware. v When using a wrist strap, follow all electrical safety procedures. A wrist strap is for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment. v If you do not have a wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. 8. Squeeze and pull the handle of the disk drive out toward you before you remove the disk drive as shown in Figure 63 on page 110. If the handle is not all the way out, the disk drive does not slide out of the system or expansion unit. Concurrent Device Remove Choose a single location for remove operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- sdc 0000:41:01.0.1/0:8:0 Physical Disk Active 0000:41:01.0.1/0:5: Empty sdb 0000:41:01.0.1/0:4:0 Physical Disk Active sda 0000:41:01.0.1/0:3:0 Physical Disk Active 0000:41:01.0.1/1:8: Empty sdf 0000:41:01.0.1/1:5:0 Physical Disk Active sde 0000:41:01.0.1/1:4:0 Physical Disk Active sdd 0000:41:01.0.1/1:3:0 Physical Disk Active 0001:61:01.0.2/0:8: Empty sdh 0001:61:01.0.2/0:9:0 Physical Disk Active sdg 0001:61:01.0.2/0:3:0 Physical Disk Active Figure 62. Example Concurrent Device Remove display Related information 109 9. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle. 10. If your replacement disk drive does not have a bezel preinstalled, then perform the following step. Loosen and pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive, as shown in the following figure. Figure 63. Removing a disk drive from a model or 7311-D20 110 Disk drives 11. Press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the disk drive. Replacing the disk drive 1. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle them with care. 2. Remove the disk drive from its protective package. 3. Unlock the disk drive handle by squeezing and pulling it out toward you before you install the disk drive. If the handle is not all the way out, the disk drive does not slide into the system or expansion unit. 4. If your replacement disk drive does not have a bezel preinstalled, then follow these steps to install the disk-drive bezel on the replacement disk drive: a. Attach the bezel (A) to the disk drive, as shown in Figure 65 on page 112. b. Press in and then tighten the thumbscrew (B) to secure the bezel to the disk drive. Figure 64. Removing a disk-drive bezel Related information 111 5. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 6. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter. A Concurrent Device Add display like the following sample is shown. 7. Type option1 (Select) next to the location from which you removed the disk drive. The Verify Device Concurrent Add display is shown. Figure 65. Attaching a bezel to the disk drive Concurrent Device Add Choose a single location for add operations 1=Select OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ----------------- 0000:41:01.0.1/0:5: Empty 0000:41:01.0.1/1:8: Empty 0001:61:01.0.2/0:8: Empty Figure 66. Example Concurrent Device Add display 112 Disk drives 8. Press Enter on the Verify Device Concurrent Add display. The Complete Device Concurrent Add display is shown. 9. Support the bottom of the disk drive as you align it with the guide rails in the system or expansion unit. Do not hold the disk drive by the handle. 10. Slide the disk drive all the way into the system or expansion unit and then lock it in place by pushing in the disk drive handle, as shown in Figure 67. 11. Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is installed. Completing the procedure 1. Replace the front cover. 2. Install or close the system, expansion unit, or rack front door. 3. Verify the installed part: v If you replaced the part because of a failure, verify the installed part. For instructions, see Verifying a repair. v If you installed the part for any other reason, verify the installed part. For instructions, see Verifying the installed part. 4. To configure the newly installed disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Preparing a disk drive for use in an AIX system or logical partition. v Preparing a disk drive for use in a Linux system or logical partition v Configuring a disk drive in an IBM i system or logical partition. 5. To rebuild the data on the replacement disk drive, see the procedure for the operating system or logical partition that controls the disk drive: v Rebuilding data on a replacement drive using AIX. v Rebuilding data on a replacement drive using Linux. v Rebuilding data on a replacement drive using IBM i. Figure 67. Installing a disk drive in a model or 7311-D20 Related information 113 Rebuilding data on a replacement disk drive using Linux You can rebuild data on the replacement disk drive that was replaced because of a failed disk drive or a new installation. For an unprotected disk drive If the disk drive you are replacing is in a RAID Level 0 disk array or in a failed RAID Level 5 or RAID Level 10 disk array, perform these tasks: 1. Re-create the disk array. 2. Re-create the file systems on the disk array. 3. Copy the data back to the restored disk array from your backup media. For information on these tasks, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters web page. For a disk drive in a RAID Level 5 or RAID Level 10 disk array, select the appropriate procedure. Rebuild data on a disk drive using Linux kernel 2.6 Learn the steps to rebuild your data on a disk drive. If a hot-spare disk drive was available and protected the disk array throughout the disk drive failure, you can configure the newly installed disk drive as a hot spare. For instructions, see the PCI-X SCSI RAID Controller Reference Guide for Linux. This guide is available from the SCSI PCI Adapters Web page at http://publib.boulder.ibm.com/ infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.pseries.doc/hardware_docs/scsipciadapters.htm. If a hot-spare rebuild was not initiated by the controller, you need to initiate a rebuild on the newly installed disk drive. To initiate the rebuild, follow these steps: 1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Work with disk unit recovery from the IBM Power RAID Configuration Utility display. Press Enter. The Work with Disk Unit Recovery display is shown. 4. From the Work with Disk Unit Recovery display, select Rebuild disk unit data. A Rebuild Disk Unit Data display similar to the following is shown. 114 Disk drives 5. Type option 1 (Rebuild) next to the disk drive you want to rebuild and press Enter. Note: Performing a rebuild on a disk drive overwrites the data currently on the disk drive. A Confirm Rebuild Disk Unit Data display similar to the following is shown. Rebuild Disk Unit Data Select the disks to be rebuilt Type choice, press Enter. 1=Rebuild OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- ------------- 1 0000:58:01.0.0/0:4:0 RAID Array Member Failed e=Exit q=Cancel t=Toggle Figure 68. Example Rebuild Disk Unit Data display Related information 115 6. To confirm that you want to rebuild the data on the disk drive, press Enter. The message that the rebuild has begun is shown at the bottom of the Work with Disk Unit Recovery display. The rebuild process might take several minutes to complete. Preparing to remove a disk-drive from a system or logical partition controlled by IBM i Learn the actions you must take to properly remove a disk drive. Before removing a disk-drive from a system or expansion unit that is controlled by IBM i, follow these steps: 1. Determine the protection status of the disk-drive that you are replacing. For instructions, see “Determining the IBM i disk-drive protection status” on page 117. 2. For mirrored disk-drives on an IBM i system or logical partition, follow these steps to verify that the disk-drive you are replacing has been suspended: a. Sign on with service level authority. b. Type strsst on the command line of the IBM i session and press Enter. c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. d. Select Work with disk units from the System Service Tools (SST) display. Press Enter. e. Select Display Disk Configuration from the Work with Disk Units display. Press Enter. f. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter. Confirm Rebuild Disk Unit Data Rebuilding the disk unit data might take several minutes for each disk selected. Press Enter to confirm having the data rebuilt. q=Cancel to return and change your choice. OPT Name PCI/SCSI Location Description Status --- ------ -------------------------- ------------------------- --------------- 1 0000:58:01.0.0/0:4:0 RAID Array Member Failed q=Cancel t=Toggle Figure 69. Example Confirm Rebuild Disk Unit Data display 116 Disk drives g. Does the disk-drive that is mirrored to the disk-drive that you are replacing (same drive number) have a status of Active? v No: The replacement must be performed by a service provider. v Yes: Does the disk-drive that you are replacing have a status of Suspended? – Yes: Continue with the next step. – No: Follow these steps to suspend mirrored protection on the disk-drive that you are replacing: 1) Press F3 from the Display Disk Configuration display to return to the Work with Disk Units display. 2) Select Work with Disk Unit Recovery from the Work with Disk Units display. Press Enter. 3) Select Suspend mirrored protection from the Work with Disk Unit Recovery display. Press Enter. 4) Select the option to suspend the disk-drive that you are replacing from the Suspend Mirrored Protection display. Press Enter. h. Press F3 repeatedly to exit System Service Tools and return to the main menu. Determining the IBM i disk-drive protection status To determine the protection status of the disk-drives in your IBM i system or logical partition, follow these steps from the System Service Tools (SST) display: Tip: If you do not know the protection status of the disk-drives and you cannot get to the System Service Tools display, contact your next level of support. 1. Sign on with at least service level authority. 2. Type strsst on the command line of the IBM i session and press Enter. 3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case-sensitive. 4. Select Work with disk units from the System Service Tools (SST) display. Press Enter. 5. Select Display Disk Configuration from the Work with Disk Units display. Press Enter. 6. Select Display Disk Configuration Status from the Display Disk Configuration display. Press Enter. A list of each auxiliary storage pool (ASP) displays, showing the disk-drives that are members of the ASP. The Status column shows one of the following protection statuses for the ASP: v Mirrored. These ASPs are configured for mirrored protection. If the failing disk-drive has mirrored protection, record the status of both disk-drives in the mirrored pair. You need this information in the recovery procedure. v Unprotected. An ASP with a status of Unprotected might contain disk-drives that are device parity protected. Record the status of the failing disk-drive as it is shown in the Status column of the Display Disk Configuration Status display. A disk-drive with one of the following statuses has device parity protection only if all other disk-drives in the array are operational: – DPY/Active – DPY/Failed – DPY/HDW Failure – DPY/Degraded – DPY/Power Loss – DPY/Not Ready Related information 117 – DPY/Unknown A disk-drive with any other status (such as DPY/Rebuild or DPY/Unprotected) does not have device parity protection. Examples: – Suppose one disk-drive has a status of DPY/Failed while all other disk-drives have a status of DPY/Unprotected. In this case, only the disk-drive with the DPY/Failed status is protected. Servicing the failed disk-drive does not result in data loss. Removing any of the unprotected disk-drives might result in data loss. – All disk-drives have a status of DPY/Active or DPY/Degraded, therefore, all disk-drives are protected. In this case, exchanging any disk-drive does not result in data loss. Tip: For more information about the meaning of a status for any disk-drive, use the Help key on the display. Configuring a disk drive in an IBM i system or logical partition To configure a newly installed disk drive into an existing device parity set, follow these steps: If you want to start a new device parity set or change the protection on your disks, see Manage device parity protection or Start device parity protection in the Disk Management section of the IBM i Information Center web site at http://www.ibm.com/systems/i/infocenter . 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Restriction: The service tools password is case sensitive. 3. Select Work with Disk Units on the System Service Tools display. Press Enter. 4. Select Display disk configuration on the Work with Disk Units display. Press Enter. 5. Select Display non-configured units on the Display disk configuration display. Press Enter. 6. The disk drive that you installed is listed. The serial number must match the serial number you recorded (the last four digits). Note: The new disk drive might take up to five minutes to appear in the list. If the disk drive is not listed, ensure that the disk drive was properly installed. 7. Press F12 twice. 8. Select Work with disk configuration on the Work with Disk Units display. Press Enter. 9. Are you adding a disk drive to an existing device parity set? v Yes: Continue with the next step. v No: Go to step 23 on page 119 for mirroring or no protection, go to 1 on page 120 for hot spare. Tip: Select No for mirroring, hot spare or no protection. 10. Select Work with device parity protection on the Work with Disk Configuration display. Press Enter. 11. Select Include unit in device parity protection on the Work with Device Parity Protection display. Press Enter. 12. Did the Include Device Parity Protection Failed display appear? v Yes: Continue with the next step. v No: Go to step 15 on page 119. 13. The following conditions must be met before you can start device parity protection. 118 Disk drives v If there are enough disk drives available to create a new parity set, the drives are eligible for Start Device Parity Protection operation and not for the Include Device Parity Protection operation. v All disk drives in a parity set must be the same capacity with a minimum number of three or four disk drives and maximum of 10 or 18 disk drives in the resulting parity set, depending on the disk drive type. v Not all disk drives attached to an advanced function input/output processor have reported to the system. Repeat the operation. v The type or model of the disk drives cannot be used for the requested operation. When these conditions are met, press F12 to return to the Work with Disk Configuration display. 14. Select Include unit in device parity protection on the Work with disk configuration display. Press Enter. 15. On the Include Disk Units in Device Parity Protection display, select the disk drives to be included in Device Parity Protection. Type a 1 in the Option column. Press Enter. 16. Press Enter to confirm your choice. The configuration starts and might take 30 - 60 minutes to complete. 17. When the device parity protection is complete, the message “Selected drives have been included successfully” is shown on the Work with device parity protection display. Press F12. 18. Select Display disk configuration on the Work with disk configuration display. Press Enter. 19. Select Display disk configuration status on the Display disk configuration display. 20. If you have more than one ASP, determine the auxiliary storage pool (ASP) to which you want to add the disk drive. 21. Press F12 twice to get to the Work with Disk Units display. 22. Select Work with disk configuration on the Work with Disk Units display. Press Enter. 23. Select the option to add units to ASPs and balance data on the Work with disk configuration display. Press Enter. 24. Specify the number of the ASP to which you want to add the disk drive. The system ASP is ASP 1. Press Enter. Notes: a. If ASP is mirror protected, disk drives must be added in pairs of the same size. b. When one or more pairs of disk drives are added to a mirrored ASP, they automatically become mirror protected. c. To start mirrored protection on an ASP, refer to Recovering your system in the IBM i Information Center. To access the IBM i Information Center website, go to http://www.ibm.com/systems/i/ infocenter and select from the following options: v If you select V6R1, expand IBM i 6.1 Information Center > Systems management > Backup and recovery > Recovering your system v If you select V5R4, expand i5/OS Information Center > Systems management > Backup and recovery > Backing up and recovering your system > Recovering your system 25. The Confirm Add Units displays showing the configuration your system must have when the add operation completes. Note: If you have selected the wrong ASP, press F12 to change your options. Press the Enter key to continue. The add process takes several minutes to complete. 26. When the message Selected units have been added successfully is shown, press F3 three times and press Enter to return to the Main menu display. 27. If you have a printer, print the configuration list. A service provider might refer to the configuration list in the future. a. You must sign on with at least service level authority Related information 119 b. Type strsst on the Main menu command line. Press Enter. c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. d. Select Start a service tool on the System Service Tools (SST) display. Press Enter. e. Select Hardware service manager on the Start a Service Tool display. Press Enter. f. Press F6 (print the configuration) from the Hardware Service Manager display. g. To return to the Main Menu display, press F3 (Exit) twice and then press Enter. h. Place the configuration list where you can refer to it in the future. 28. If you are a service provider performing this task, return to the procedure that sent you here. 1. Select Work with hot spare protection from the Work with disk configuration menu. 2. Select Start device parity protection - RAID 5 with hot spare or Start device parity protection - RAID 6 with hot spare depending on the level of parity protection that is wanted. 3. Return to the procedure that sent you here. Configuring a disk drive on a load source adapter for hot spare using the IBM i operating system To start hot spare protection on your load source adapter you must determine the status of the disk drive in the load source slot and configure or exchange that disk drive, which might require you to turn off the power to the system. 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case sensitive. 3. Determine the protection status of all disks controlled by the load source adapter. Do all disk drives that are a member of a parity sets or mirror sets have a status of active? v No: Replace the disk using the following procedure in, “Replacing a disk drive in the 8231-E2B, 8231-E1C, or 8231-E2C with the IBM i system or logical partition power turned on” on page 69. v Yes: Continue to the next step. Restriction: There must be a disk drive of the same or larger capacity located in a load source capable slot to act as a hot spare for the active load source disk drive. The next steps will have you change the disk drive in the desired slot if needed. Attention: If the disk drive acting as hot spare for the load source is of smaller capacity you can suffer data loss. 4. Determine whether a non-configured disk drive of the same or larger capacity is located in a load source capable slot but is not the active load source. Only parallel SCSI places a restriction on the location of the load source. Any SAS, Fibre Channel, or virtual disk drive does not have this restriction. v No non-configured drive is located in a load source capable slot. Continue with the next step v Yes a non-configured drive is located in a load source capable slot. Return to the procedure that sent you here, and continue to the next step of that procedure. 5. Choose the disk drive to be installed or exchanged with the drive in the load source slot by doing the following: a. Select a disk drive in a parity set located in a load source capable slot, but is not the active load source drive, and record the location of this disk drive. 120 Disk drives b. Select a non-configured disk drive of the same or larger capacity from either a non-load source slot or a failed drive you are replacing with a non-configured disk drive. 6. Stop the system if it is running. For instructions on stopping the system, see Stopping the system or logical partition. 7. Exchange the disk drives from the two locations you recorded in step 5 on page 120 by removing both disk drives and placing them into the other disk drives slot. 8. Start the system. For instructions, see Starting the system or logical partition 9. Return to the procedure that sent you here and continue with the next step in that procedure. Rebuilding data on a replacement disk drive using IBM i Learn how to rebuild data on replacement disks. To rebuild data on the replacement disk drive, complete the following steps: 1. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter. 2. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display. Press Enter. Note: The service tools password is case sensitive. 3. Select Work with disk units from the Start System Service Tools (SST) display. Press Enter. 4. Select Work with disk unit recovery from the Work with Disk Units display. Press Enter. 5. If you determined that the disk drive you replaced had device parity protection (RAID), follow these steps to restore device parity protection (RAID) on the disk drive: a. Select Rebuild disk unit data on the Work with Disk unit recovery display. Press Enter. b. Select 1 to rebuild the disk drive displayed (the disk drive displayed is the disk drive that you removed) on the Rebuild Disk Unit Data display. Press Enter. c. Press Enter on the Confirm Rebuild Disk Unit Data display. The rebuild process might take several minutes to complete. d. Press F5 to refresh the display until the Percent complete shows 5%. e. When the display shows at least 5% complete, you can either continue to monitor this display to completion, or press F3 (Exit) to return to the Work with disk units display. f. Press F3 (Exit) to return to the System service tools display. g. Press F3 (Exit) to return to the Exit SST display and press Enter. h. Return to the procedure that sent you here. 6. If the disk drive that you replaced had mirrored protection, follow these steps: a. Select Replace configured unit on the Work with Disk unit recovery display. Press Enter. b. Select the configured disk drive that you are exchanging (suspended drive) on the Select Configured Unit to Replace display. Press Enter. c. Select the disk drive that you just installed on the Select Replacement Unit display. This drive has a non-configured status. Note: In some cases, it might take several minutes for a new disk drive to display. Repeat these steps until the new drive is shown. Press Enter. d. Press Enter on the Confirm Replace of Configured Unit display to confirm your choice for replacement. The replacement process might take several minutes to complete. When the process is complete, the Work with Disk unit recovery display is shown. e. Press F3 (Exit) to return to the Work with disk units display. Related information 121 f. Select Display disk configuration on the Work with disk units display. g. Select Display disk configuration status on the Display Disk Configuration display. Mirrored status shows Resuming. When complete, the mirrored status shows Active. This process might take several minutes to complete. You can either monitor this display to completion, or press F3 (Exit) three times, and then press Enter to return to the IBM i main menu. Replacing a SAS conduit card in a 5802 expansion unit with the power turned off Learn how to replace a SAS conduit card in the 5802 expansion unit with the power turned off. Perform the prerequisite tasks as described in Before you begin. 1. Stop the system if it is running. For instructions, see Stopping the system or logical partition. 2. Disconnect the power source from the system by unplugging the system. Attention: This system might be equipped with a second power supply. You must disconnect the power source from the system by disconnecting all power cords to prevent system damage during this procedure. (L003) or 3. Perform the following steps to replace the midplane and SAS conduit card into the system: a. Place the card connector (C) on the midplane connection (D) using the guide track (B) to ensure a good connection. b. Lift the handle (A) to seat and secure the SAS conduit card to the midplane as shown in Figure 70 on page 123. 122 Disk drives 4. Perform the following steps to replace the midplane and SAS conduit card into the system: a. Align the midplane with the track and ensure that the slot (A) on the midplane attaches to the post (B) in the system. b. Secure the thumbscrews (C) and (D) on the SAS conduit card. c. Secure the thumbscrews (E) on the sides of the midplane. Figure 70. Installing the SAS conduit card into the midplane Related information 123 5. Attach the power cord cover and secure the thumbscrew (A) to the system. Figure 71. Installing the SAS conduit card and the midplane Figure 72. Installing the power cord plate 124 Disk drives 6. Carefully slide the enclosure management controller into the system and close the latch (A) as shown in Figure 73. 7. Carefully lift and align the I/O system board with the slot at the rear of the expansion unit. 8. Insert the I/O system board firmly into the server, as shown in Figure 74 on page 126. 9. Secure the I/O system board with the locking tabs (A), as shown in Figure 74 on page 126. Figure 73. Replacing the enclosure management controller Related information 125 10. To replace the power supply, with the locking handle (A) in the open position, push the power supply into the expansion unit as shown in Figure 75 on page 127. 11. Close the locking handle (A) until the power supply locks into position. Figure 74. Install the I/O system board on a rack mounted model 126 Disk drives 12. To replace the disk drive backplane, lift the backplane along the two sides and align the backplane with the guide rails in the expansion unit. CAUTION: The backplane can be heavy to lift. Ensure that you can safely perform this task before you begin. 13. Using your hand to support the bottom of the backplane, slide it into the expansion unit as shown in Figure 76 on page 128. 14. Secure the backplane in place by moving the locking tabs from the open position (B) to the locked position (A). Figure 75. Replacing a power supply Related information 127 15. Replace all disk drive units and fillers, if you had removed them earlier. a. Support the bottom of the disk drive unit as you align it with the guide rails in the backplane. b. Slide the disk drive unit all the way into the backplane. c. Push the handle (A) toward the disk drive unit to lock it into place. Figure 76. Installing the backplane on a rack-mounted model 128 Disk drives 16. Replace all expander cards and fillers, if you had removed them earlier. a. Supporting the bottom of the expander card with your hand, align it with the guide rails in the backplane. b. Slide the expander card all the way into the backplane. c. Push the handle (A) toward the card to lock it into place. 17. Start the system. For instructions, see Starting the system or logical partition. Continue with any other service actions you are required to perform. v For information about connecting the system, see Connecting the expansion units. v For information about installing the PCI adapters into the system board, see Model 5802 and 5877 expansion units, PCI adapters, and cassettes. Removing a SAS conduit card from a 5802 expansion unit with the power turned off Learn how to remove a SAS conduit card from a 5802 rack-mounted expansion unit if you must replace a failing part with the power turned off. Perform the prerequisite tasks as described in Before you begin. 1. Identify the card you want to remove. For instructions on identifying a failing part, see Identifying a failing part. 2. Stop the system if it is running. For instructions on stopping the system, see Stopping the system or logical partition. 3. Remove the disk drives and fillers as shown in Figure 77 on page 130. a. Squeeze the latch of the disk drive and pull the handle (A) toward you to release the disk drive. b. Support the bottom of the disk drive with your hand and slide the disk drive out of the backplane. Related information 129 4. Remove the port cards and fillers as shown in Figure 78 on page 131. a. Hold the end of the latching handle (A) and pull it upward to unlock the port card. b. Using your hand to support the bottom of the port card, slide the port card out of the backplane. c. Place the port card on an electrostatic discharge (ESD) surface. Figure 77. Removing the disk drives from the backplane 130 Disk drives 5. Remove the backplane as shown in Figure 79 on page 132. CAUTION: The backplane might be heavy. Ensure that you can safely perform the procedure. a. Squeeze the blue latches (A) to unlock the handles. b. Pull the handles (B) outward in the direction shown to release the backplane. c. Slide the backplane out of the enclosure and use your hand to support the bottom of the backplane. Figure 78. Removing the port card from the backplane Related information 131 6. Disconnect both the offline converter assembly (OCA) power cords by performing the following steps: a. Slide the power cord holders (A) to the left, located on the front of both the OCAs. b. Label and disconnect both the OCA power cords (B) from each OCA. Figure 79. Removing the backplane from the enclosure 132 Disk drives 7. Remove the offline converter assemblies by performing the following steps: a. Pull the OCA handle (A) all the way down in the direction shown to unseat the OCA. b. Hold both sides of the OCA and pull it out of the enclosure. Note: Use both the hands to support the OCA as you pull it out of the enclosure. c. Repeat these steps to remove both OCAs. Figure 80. Disconnecting the OCA power cords Related information 133 8. Remove the enclosure management controller by performing the following steps: a. Label and disconnect any cables connected to the enclosure management controller card. b. Pinch the latch (A) to unlock the handle (B). c. Pull the handle outward in the direction shown to unseat the EMC card. d. Slide the enclosure management controller card straight out of the enclosure. Figure 81. Remove the offline converter assemblies 134 Disk drives 9. Remove the power cord plate by turning the thumbscrew (A) and pulling the plate away from the system, as shown in Figure 83. 10. Disconnect the cables from the back of the I/O system board. Figure 82. Remove the enclosure management controller Figure 83. Removing the power cord plate Related information 135 11. Squeeze the release latches (A) and pull the levers (B) out away from the system, as shown in Figure 84. 12. Slide the system board out of the system. 13. Remove the SAS conduit card and midplane from the enclosure by doing the following steps: a. Pull the spring plungers (A) up and turn them one quarter to the left. b. Turn the captive screws (B) and (C) counterclockwise until they are completely loosened. c. Pull the SAS conduit and midplane component out of the enclosure. Figure 84. Removing the I/O system board 136 Disk drives 14. Push the handles (A) on the conduit card down to disengage the card from the midplane, and remove the card. Continue with replacing the SAS expander card or return to the procedure that sent you here. Figure 85. Removing the SAS conduit card and midplane Figure 86. Removing the SAS conduit card from the midplane Related information 137 Replacing a SAS expander card in a 5802 expansion unit with the power turned off Learn how to replace a SAS expander card in a 5802 expansion unit with the power turned off. Review the requirements in Before you begin. 1. Before installing or reinstalling a SAS expander card, inspect the signal connector pins to ensure that the pins are not bent or damaged. Attention: Bent pins damage the backplane receptacle. Be careful not to touch or damage the signal connector pins. 2. Slide the card completely into the slot and push the lever (A) down to secure the card as shown in the following figure. 3. Start the system. For instructions see, Starting the system or logical partition. 4. Verify the presence of the newly installed card. For instructions see, Verifying the installed part. Continue with any other service actions you need to perform. Removing a SAS expander card from a 5802 expansion unit with power turned off Learn how to remove a SAS expander card from a 5802 (rack-mounted) expansion unit if you need to replace a failing part with power turned off. 1. Perform prerequisite tasks. For instructions, see Before you begin. 2. Identify the card you are going to remove. For information, see Identifying a part. 3. Lift the handle (A) and pull the card out of the system, as shown in the following figure. Figure 87. Installing the SAS expander card 138 Disk drives Continue with replacing the SAS expander card or return to the procedure that sent you here. Figure 88. Removing the SAS expander card. Related information 139 140 Disk drives Common procedures for installable features This section contains all the common procedures that are related to installing, removing, and replacing features. © Copyright IBM Corp. 2010, 2011 141 142 Disk drives Before you begin Observe these precautions when you are installing, removing, or replacing features and parts. These precautions are intended to create a safe environment to service your system and do not provide steps for servicing your system. The installation, removal, and replacement procedures provide the step-by-step processes required to service your system. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER © Copyright IBM Corp. 2010, 2011 143 Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Before you begin a replacement or installation procedure, perform these tasks: 1. If you are installing a new feature, ensure that you have the software required to support the new feature. To do this, go to the following Web site: IBM Prerequisite 2. If you are performing an installation or replacement procedure that might put your data at risk, ensure, wherever possible, that you have a current backup of your system or logical partition (including operating systems, licensed programs, and data). 3. Review the installation or replacement procedure for the feature or part. 4. Note the significance of color on your system. 144 Disk drives Blue or terra-cotta on a part of the hardware indicates a touch point where you can grip the hardware to remove it from or install it in the system, open or close a latch, and so on. Terra-cotta might also indicate that the part can be removed and replaced with the system or logical partition power on. 5. Ensure that you have access to a medium flat-blade screwdriver, a Phillips screwdriver, and a pair of scissors. 6. If parts are incorrect, missing, or visibly damaged, do the following: v If you are replacing a part, contact the provider of your parts or next level of support. v If you are installing a feature, contact one of the following service organizations: – The provider of your parts or next level of support. – In the United States, the IBM Rochester Manufacturing Automated Information Line (R–MAIL) at 1–800–300–8751. In countries and regions outside of the United States, use the following Web site to locate your service and support telephone numbers: http://www.ibm.com/planetwide 7. If you encounter difficulties during the installation, contact your service provider, your IBM reseller, or your next level of support. 8. If you are installing new hardware in a logical partition, you need to understand and plan for the implications of partitioning your system. For information, see Logical Partitioning. Before you begin 145 146 Disk drives Identifying a part Use these instructions to learn how to identify the location of a failed part, the location of a part to be removed, or the location to install a new part on your system or expansion unit using the appropriate method for your system. For IBM Power Systems™ servers that contain the POWER7 processor, the light-emitting diodes (LEDs) can be used to identify or verify the location of a part that you are removing, servicing, or installing. The combination identify and fault LED (amber color) shows the location of a field replaceable unit (FRU). When removing a FRU, first verify whether you are working on the correct FRU by using the identify function in the management console or other user interface. When removing a FRU under the direction of the hardware management console, the identify function is activated and deactivated automatically at the correct times. The identify function causes the amber LED to blink. When you turn off the identify function, the LED returns to the state it was previously. For parts that have a blue service button, the identify function sets LED information for the service button so that when the button is pressed, the correct LEDs on that part blink. If you need to use the identify function, use the following procedures. Control panel LEDs Use this information as a guide to the control panel LEDs and buttons. The control panel has LEDs that indicate various system status. Figure 89. Control panel © Copyright IBM Corp. 2010, 2011 147 v A: Power-on button v B: Power LED – A constant light indicates full system power to the unit. – A blinking light indicates standby power to the unit. Note: There is approximately a 30-second transition period from the time the power-on button is pressed to when the power LED goes from blinking to solid. During the transition period, the LED might blink faster. v C: Enclosure identify light – A constant light indicates the identify state, which is used to identify a part. – No light indicates that the system is operating normally. v D: System information light – No light indicates that the system is operating normally. – Light on indicates that the system requires attention. v E: USB port v F: Enclosure fault roll-up light – A constant light indicates a fault in the system unit. – No light indicates that the system is operating normally. v G: Function/Data display v H: Decrement button v I: Enter button v J: Increment button v K: Pinhole reset button Related concepts Identifying a failing part Use these instructions to learn how to locate and identify a failing part on your system or expansion unit using the appropriate method for your system. Identifying a failing part in an AIX system or logical partition Use these instructions to learn how to locate a failing part, and then activate the indicator light for that part on a system or logical partition running the AIX operating system. Locating a failing part in an AIX system or logical partition You might need to use AIX tools, before activating the indicator light, to locate a part that is failing. 1. Log in as root user or celogin-. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. Select Display Previous Diagnostic Results and press Enter. 5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. The Display Diagnostic Log display shows a chronological list of events. 6. Look in the T column for the most recent S entry. Select this row in the table and press Enter. 7. Select Commit. The details of this log entry are shown. 8. Record the location information and the SRN value shown near the end of the entry. 9. Exit to the command line. Use the location information for the failing part to activate the indicator light that identifies the failing part. See “Activating the indicator light for the failing part” on page 149. 148 Disk drives Activating the indicator light for the failing part Use these instructions to help physically identify the location of a part you are servicing. 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators and press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. 6. Select Commit. This turns on the system attention and indicator light for the failing part. 7. Exit to the command line. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators and press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. When a light is activated for a failing part, an I character precedes the location code. 6. Select Commit. This turns off the system attention and indicator light for the failing part. 7. Exit to the command line. Identifying a failing part in an IBM i system or logical partition You can activate or deactivate the indicator light by using IBM i to assist in locating a failing part. Activating the failing-part indicator light You can search the service action log for an entry that matches the time, reference code, or resource of a problem, and then activate the indicator light for a failing part. 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point™ utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource Identifying a part 149 v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. If location information is available, select option 6 (Indicator on) to turn on the failing part's indicator light. Tip: If the failing part does not contain a physical indicator light, a higher-level indicator light is activated. For example, the indicator light for the backplane or unit that contains the failing part might be lit. In this case, use the location information to locate the actual failing part. 12. Look for the enclosure indicator light to locate the enclosure that contains the failing part. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. Select option 7 (Indicator off) to turn off the indicator light. 12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all problems have been resolved. 13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display. 150 Disk drives Identifying a failing part in a Linux system or logical partition If the service aids have been installed on a system or logical partition, you can activate or deactivate the indicator lights to locate a part or compete a service action. Locating a failing part in a Linux system or logical partition If the service aids have been installed on a system or logical partition, you need to activate the indicator lights to locate a part. To activate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter. 3. Look for the system attention light to identify the enclosure that contains the failing part. Related information Service and productivity tools for Linux on POWER systems IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. Finding the location code of a failing part in a Linux system or logical partition To retrieve the location code of the failing part, if you do not know the location code, use the procedure in this topic. To locate the failing part in a system or logical partition follow these steps: 1. Log in as root user. 2. At the command line, type grep diagela /var/log/platform and press Enter. 3. Look for the most recent entry that contains a system reference code (SRC). 4. Record the location information. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power Systems servers Activating the indicator light for the failing part If you know the location code of the failing part, activate the indicator light to help you locate which part to replace. To activate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s identify -l and press Enter. 3. Look for the system attention light to identify the enclosure that contains the failing part. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power Systems servers Deactivating the failing-part indicator light After you complete a removal and replacement procedure, you must deactivate the failing-part indicator light. Identifying a part 151 To deactivate the indicator light, follow these steps: 1. Log in as root user. 2. At the command line, type /usr/sbin/usysident -s normal -l and press Enter. Note: IBM provides hardware diagnostic aids and productivity tools, as well as installation aids for Linux operating systems on IBM Power systems. See Service and productivity tools for Linux on Power Systems servers Locating a failing part in a Virtual I/O Server system or logical partition You can use Virtual I/O Server (VIOS) tools, before activating the indicator light, to locate a part that is failing. 1. Log in as root user or celogin-. 2. At the command line, type diagmenu and press Enter. 3. From the Function Selection menu, select Task Selection and press Enter. 4. Select Display Previous Diagnostic Results and press Enter. 5. From the Display Previous Diagnostic Results display, select Display Diagnostic Log Summary. A Display Diagnostic Log display appears. This display contains a chronological list of events. 6. Look in the T column for the most recent S entry. Select this row in the table and press Enter. 7. Choose Commit. The details of this log entry are shown. 8. Record the location information and the SRN value shown near the end of the entry. 9. Exit to the command line. Use the location information for the failing part to activate the indicator light that identifies the failing part. For instructions, see “Identifying a part by using the Virtual I/O Server.” Identifying a part by using the Virtual I/O Server Use these instructions to turn on the indicator light to help you physically locate a part by using the Virtual I/O Server (VIOS). 1. Log in as root user. 2. At the command line, type diagmenu and press Enter. 3. From the Function Selection menu, select Task Selection. Press Enter. 4. From the Task Selection menu, select Identify and Attention Indicators. Press Enter. 5. From the list of lights, select the location code for the failing part and press Enter. 6. Select Commit. This turns on the system attention and indicator light for the failing part. 7. Exit to the command line. 152 Disk drives Starting the system or logical partition Learn how to start a system or logical partition after performing a service action or system upgrade. Starting a system that is not managed by a Hardware Management Console or or a Systems Director Management Console You can use the power button or the Advanced System Management Interface to start a system that is not managed by a Hardware Management Console (HMC) or an IBM Systems Director Management Console (SDMC). To start a system that is not managed by a HMC or SDMC, follow these steps: 1. Open the front rack door, if necessary. 2. Before you press the power button on the control panel, ensure that power is connected to the system unit as follows: v All system power cables are connected to a power source. v The Power LED, as shown in the following figure, is slowly blinking. v The top of the display, as shown in the following figure, shows 01 V=F. 3. Press the power button (A), as shown in the following figure, on the control panel. v A: Power-on button v B: Power LED – A constant light indicates full system power to the unit. – A blinking light indicates standby power to the unit. Figure 90. Control panel © Copyright IBM Corp. 2010, 2011 153 Note: There is approximately a 30-second transition period from the time the power-on button is pressed to when the power LED goes from blinking to solid. During the transition period, the LED might blink faster. v C: Enclosure identify light – A constant light indicates the identify state for the enclosure or for a resource within the enclosure. – No light indicates that no resources in the enclosure are being identified. v D: Attention light – No light indicates that the system is operating normally. – A solid light indicates that the system requires attention. v E: USB port v F: Enclosure fault roll-up light – A constant light indicates a fault indicator active in the system. – No light indicates that the system is operating normally. v G: Function/Data display v H: Decrement button v I: Enter button v J: Increment button v K: Pinhole reset button 4. Observe the following after pressing the power button: v The power-on light begins to blink faster. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed. v Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. The power-on light on the control panel stops blinking and remains on, indicating that system power is on. Tip: If pressing the power button does not start the system, do the following steps to start the system using the Advanced System Management Interface (ASMI): 1. Access the ASMI. For instructions, see Accessing the ASMI. 2. Start the system using the ASMI. For instructions, see Powering the system on and off. Starting a system or logical partition by using the Hardware Management Console You can use the Hardware Management Console (HMC) user interface to start the system or logical partition after the required cables are installed and the power cables are connected to a power source. For instructions on working with the HMC, see Managing the Hardware Management Console. For instructions on starting a logical partition, see Logical partitioning. For instructions on starting the system, see Powering on the managed system. Progress indicators, also referred to as checkpoints, appear on the control panel display while the system is being started. When the power-on light on the control panel stops blinking and remains on, the system power is on. 154 Disk drives Starting a system or virtual server with the Systems Director Management Console You can use the IBM Systems Director Management Console (SDMC) user interface to start the system or virtual server after the required cables are installed and the power cables are connected to a power source. For instructions on working with the SDMC, see Managing and configuring the SDMC. For instructions on starting a virtual server, see Managing virtual servers. For instructions on shutting down and restarting virtual servers, see Shutting down and restarting virtual servers. Progress indicators, also known as checkpoints, display on the control panel while the system is being started. When the power-on light on the control panel stops flashing and remains on, the system power is on. Starting the system 155 156 Disk drives Stopping a system or logical partition Learn how to stop a system or logical partition as a part of a system upgrade or service action. Attention: Using either the power-on button on the control panel or entering commands at the Hardware Management Console (HMC) to stop the system can cause unpredictable results in the data files. Also, the next time you start the system, it might take longer if all applications are not ended before stopping the system. To stop the system or logical partition, select the appropriate procedure. Stopping a system that is not managed by an HMC or an SDMC You might need to stop the system to perform another task. Use these instructions to stop the system using the power button or Advanced System Management Interface. Before you stop the system, follow these steps: 1. If an Integrated xSeries® Adapter (IXA) is present on the system, shut it down using IBM i options. 2. Ensure that all jobs are completed and end all applications. 3. Ensure that the operating system is stopped. Attention: Failure to do so can result in the loss of data. 4. If a Virtual I/O Server (VIOS) logical partition is running, ensure that all clients are shut down or that the clients have access to their devices using an alternate method. The following procedure describes how to stop a system that is not managed by a Hardware Management Console (HMC). 1. Log in to the system as a user with the authority to run the shutdown or pwrdwnsys (Power Down System) command. 2. At the command line, enter one of the following commands: v If your system is running the AIX operating system, type shutdown. v If your system is running the Linux operating system, type shutdown -h now. v If your system is running the IBM i operating system, type PWRDWNSYS. If your system is partitioned, use the PWRDWNSYS command to power down each of the secondary partitions. Then, use the PWRDWNSYS command to power down the primary partition. The command stops the operating system. The system power turns off, the power-on light begins to slowly flash, and the system goes into a standby state. 3. At the Linux command line, type shutdown -h now. The command stops the operating system. The system power turns off, the power-on light begins to slowly flash, and the system goes into a standby state. 4. At the Linux command line, type shutdown -h now. The command stops the operating system. The system power turns off, the power-on light begins to slowly flash, and the system goes into a standby state. 5. Record the IPL type and the IPL mode from the control panel display to help you return the system to this state when the installation or replacement procedure is completed. 6. Set the power switches of any devices connected to the system to off. 7. Unplug any power cables that are attached to the unit from electrical outlets. Ensure that you unplug power cables from peripheral devices, such as printers and expansion units. © Copyright IBM Corp. 2010, 2011 157 Important: The system is equipped with a second power supply. Before continuing with this procedure, ensure that all power sources to the system have been disconnected. (L003) or Stopping a system by using the Hardware Management Console You can use the Hardware Management Console (HMC) user interface to stop the system or a logical partition. By default, the managed system is set to power off automatically when you shut down the last running logical partition on the managed system. If you set the managed system properties on the HMC so that the managed system does not power off automatically, you must use this procedure to power off your managed system. Attention: If possible, shut down the running logical partitions on the managed system before powering off the managed system. Powering off the managed system without shutting down the logical partitions first causes the logical partitions to shut down abnormally and can cause data loss. If you use a Virtual I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to their devices using an alternate method. To power off a managed system, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer Use the following steps to stop the system using the HMC. 1. In the Navigation area, expand the Systems Management folder. 158 Disk drives 2. Click the Servers icon. 3. In the Contents area, select the managed system. 4. Select Tasks, then Operations, and then Power Off 5. Select the appropriate power-off mode and click OK. Related information Shutting down and restarting logical partitions Stopping a system with the Systems Director Management Console You can use the IBM Systems Director Management Console (SDMC) user interface to stop the system or a virtual server. By default, the managed system is set to power off automatically when you shut down the last running virtual server on the managed system. If you set the managed system properties on the SDMC so that the managed system does not power off automatically, you must use this procedure to power off your managed system. Attention: If possible, shut down the running virtual servers on the managed system before powering off the managed system. Powering off the managed system without shutting down the virtual servers first causes the virtual servers to shut down abnormally and can cause data loss. If you use a Virtual I/O Server (VIOS) logical partition, ensure that all clients are shut down or that the clients have access to their devices with an alternate method. To power off a managed system, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer Use the following steps to stop the system using the SDMC. 1. In the Power Systems Resource area, select the managed system you want to power off. 2. From the Actions menu, select Operations ? Power Off. 3. Select the appropriate power-off mode and click OK. Stopping a system or logical partition 159 160 Disk drives Removing and replacing the expansion unit cover or door You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Removing the front cover on the 7314-G30 or 5796 You might need to remove the front cover to perform service to the system. To remove the front cover, follow these steps: 1. Remove the two thumbscrews (B) located on the left and right of the cover. 2. Press in on the two latches (C) located left and right of the cover to release the cover. 3. Lift out the cover (A) and away from the chassis. Tip: If an airflow block is present on an unpopulated side of the chassis do not remove it unless you are populating that side of the chassis. Installing the front cover on the 7314-G30 or 5796 Use this procedure to install the front cover after installing the system or performing a service action. If you only have one side of the chassis populated, ensure that the airflow block is present on the unpopulated side. 1. Insert the posts on the bottom of the cover (A) into the hooks on the chassis of the enclosure. Figure 91. Remove the front cover for model 7314-G30 or 5796 © Copyright IBM Corp. 2010, 2011 161 2. Press in on the two latches (C) on the right and left of the front cover. 3. Press the cover onto the chassis and release the latches. The cover securely snaps into place. 4. Replace the two thumbscrews (B) into the slots on the left and right of the front cover. Figure 92. Front cover on a model 7314-G30 or 5796 162 Disk drives System covers Use these procedures to remove and replace the covers on IBM Power Systems servers that contain the POWER7 processor. Removing and replacing covers for the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C Use these instructions to remove, replace, or install covers to access components or perform service. Removing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Remove the two shipping screws (if present) that secure the system to the rack. Note: Replacing the shipping screws is optional but should be done in geographic areas that are prone to seismic activity. 2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Figure 93. Removing the front cover © Copyright IBM Corp. 2010, 2011 163 Removing the front cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Open the front door (A) by swinging it towards the left as shown in the following figure. 2. Press latch (B) and pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Installing the front cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to install the cover to access components or perform service. To install the front cover, follow these steps: 1. Push the cover onto the system. 2. Gently push the cover in until the four cover clips are seated in their respective mounting posts as shown in the following figure. The cover snaps into place and has an indentation where you can hold onto it more easily. Note: Replacing the shipping screws is optional but should be done in geographic areas that are prone to seismic activity. Figure 94. Removing the front cover 164 Disk drives Installing the front cover and front door on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to install the cover to access components or perform service. To install the front cover and front door, complete the following steps. 1. Align the cover (A) until the two cover tabs are seated into the slots on the base plate (B), as shown in the following figure. 2. Rotate the cover up and towards the system until the release latch is seated into its respective slot. The cover has an indentation where you can hold onto it more easily. Figure 95. Removing the front cover System covers 165 3. Hold the front door at an approximate 120 degree angle to the system as shown in the following figure. Engage the latches. Removing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to remove the service access cover. 1. Loosen the two thumbscrews (A) located at the back of the cover. 2. Slide the cover toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Figure 96. Installing the front cover Figure 97. Installing the front door 166 Disk drives Attention: When the service access cover is removed, the system powers down. Removing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to remove the service access cover. 1. Loosen the thumbscrew (A) located at the back of the outside cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Figure 98. Removing the service access cover System covers 167 3. Loosen the two thumbscrews (A) located at the back of the inside cover. 4. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Attention: When the service access cover is removed, the system powers down. Figure 99. Installing the outside service access cover Figure 100. Removing the inside service access cover 168 Disk drives Installing the service access cover on a rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to install the service access cover. 1. Place the service access cover on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (A) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Installing the service access cover on a stand-alone 8202-E4B, 8202-E4C, or 8205-E6B system Use this procedure to install the service access cover. 1. Place the inside service access cover (A) on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the two thumbscrews (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Figure 101. Installing the service access cover System covers 169 5. Place the outside service access cover (A) on top of the system. 6. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 7. Align the thumbscrew (B) located on the back of the service access cover with the hole on the back of the system chassis. 8. Tighten the thumbscrew to secure the service access cover. Figure 102. Installing the inside service access cover 170 Disk drives Removing and replacing covers for the 8231-E2B, 8231-E1C, or 8231-E2C Use these instructions to remove, replace, or install covers to access components or perform service. Removing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C Use this procedure to remove the service access cover. 1. Loosen the thumbscrew (A) located at the back of the cover. 2. Slide the cover (B) toward the back of the system unit. When the front of the service access cover has cleared the upper frame ledge, lift the cover up and off the system unit. Attention: When the service access cover is removed, the system powers down. Figure 103. Installing the outside service access cover System covers 171 Installing the service access cover for the 8231-E2B, 8231-E1C, or 8231-E2C Use this procedure to install the service access cover. 1. Place the service access cover (A) on top of the system, about 25 mm (1 in.) from the upper chassis ledge. 2. Hold the service access cover against the system unit and slide it toward the front of the system. The tabs of the service access cover slide beneath the upper chassis ledge. 3. Align the thumbscrew (B) located on the back of the service access cover with the two holes on the back of the system chassis. 4. Tighten the thumbscrews to secure the service access cover. Figure 104. Removing the service access cover 172 Disk drives Removing and replacing covers on the 8233-E8B or 8236-E8C system Use these instructions to remove, replace, or install covers to access components or perform service. Removing the front cover from an 8233-E8B or 8236-E8C system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Remove the two screws (A) that secure the system to the rack (B) as shown in the following figure. 2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Figure 105. Installing the service access cover System covers 173 Installing the front cover on an 8233-E8B or 8236-E8C system Use this procedure to install the cover after accessing components or performing service. To install the front cover, follow these steps: 1. Align the cover with the system. 2. Gently push the cover in until the four cover clips are seated in their respective mounting posts (B) on the system. 3. Replace the two screws (C) that secure the system to the rack (A). Figure 106. Removing the front cover 174 Disk drives Removing the service access cover from an 8233-E8B or 8236-E8C system Use this procedure to remove the service access cover to perform service or to gain access to internal components. To remove the service access cover, follow these steps: 1. Place the system into the service position. For instructions, see “Placing the 8233-E8B or 8236-E8C in the service position” on page 199. 2. Loosen the two thumbscrews (A) located at the back of the cover. 3. Slide the cover (B) toward the back of the system unit. When the front of the service access cover clears the upper frame ledge, lift the cover up and off the system unit. Attention: For proper cooling and airflow, install the cover before starting the system. Operating the system without the cover for more than 30 minutes could damage the system components. Figure 107. Installing the front cover System covers 175 Installing the service access cover on an 8233-E8B or 8236-E8C system Use this procedure to install the service access cover after performing service or accessing internal components. To install the service access cover, follow these steps: 1. Place the service access cover (A) on the top of the system unit, approximately 25 mm (1 in.) from the front of the system unit. 2. Hold the service access cover against the system unit, and slide it toward the front of the system. The tabs on the service access cover slide beneath the upper chassis ledge, and the two screws align with the screw holes at the back of the system unit. Important: Ensure that the fan LED cables do not get caught on the front edge of the service access cover as you move it forward. 3. Tighten the screws (B) located at the back of the cover. Figure 108. Removing the service access cover 176 Disk drives Removing and replacing covers for the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC Use these instructions to remove, replace, or install covers to access components or perform service. Removing the front cover from the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC Use this procedure to remove the cover to access components or perform service. This procedure applies to covers that are shipped after 1 March 2011. For covers that are shipped prior to 1 March 2011, use the Removing the front cover with the cover removal tool procedure. Note: The older covers shipped before 1 March 2011 can be identified by a single blue stripe on the cover assembly latches. Newer covers shipped after 1 March 2011 have two blue stripes on the cover assembly latches. To remove the front cover, complete the following steps. 1. If necessary, open the front rack door. 2. Pull out the latches (A) located on both sides of the cover as shown in the following figure. Figure 109. Installing the service access cover System covers 177 3. Pull off the cover (B) to remove it from the system unit. Removing the front cover with the cover removal tool Removing the front cover on the 9117-MMB or 9179-MHB systems shipped before 1 March 2011 requires the use of a cover removal tool. Systems shipped before 1 March 2011 require the use of a cover removal tool to prevent the front fans from being inadvertently removed and causing an unexpected outage. Note: The older covers shipped before 1 March 2011 can be identified by a single blue stripe (A) on the cover assembly latches. Newer covers shipped after 1 March 2011 have two blue stripes (B) on the cover assembly latches. See Figure 111 on page 179. Figure 110. Removing the front cover 178 Disk drives Important: Do not remove the cover to view the enclosure serial number. If you need to check the enclosure serial number, use one of the following methods: v From the Advanced System Management Interface (ASMI), complete the following steps. This procedure can be performed in FSP standby mode. 1. Expand System Information. 2. Click Vital Product Data in left column. 3. From the Vital Product Data panel, click Enclosure VPD (EV) for each enclosure configured. 4. Click Details to view results. v From the Hardware Management Console (HMC), complete the following steps. The server must be powered on to partition standby mode. 1. Expand Hardware Information. 2. Select View Hardware Topology to view the Enclosure ID for each enclosure. To remove the front cover with the cover removal tool, complete the following steps: 1. Remove the cover removal tool (PN 74Y9219) from the shipping box, if needed. 2. If necessary, open the front rack door. 3. Align the cover removal tool (A) with the front cover of the system as shown in the following figure. Figure 111. Identifying the older and newer covers System covers 179 Note: The top pins of the cover removal tool align with the second hole down from the top of the cover. Pins must be inserted through the hole until the removal tool pins contact the system and cannot be pushed forward any further. There is a gap between the system and the cover removal tool. 4. Place hands on the cover removal tool as shown in the following figure while pressing the tool against the system. 5. While keeping forward pressure on the cover removal tool, pull the cover assembly latches to disengage them as shown in the following figure, which releases the retention mechanism. Figure 112. Aligning the cover removal tool Figure 113. Cover removal tool use 180 Disk drives 6. If the front cover is difficult to remove, the cover might be adhering to the fan assemblies at the top of the system. Apply forward pressure against the cover removal tool and place fingers underneath the edge of the cover near the top corners. Then, pull the cover towards the cover removal tool as shown in the following figure. Figure 114. Releasing the cover latches System covers 181 7. After the cover assembly is unlatched and lying against the cover removal tool, remove both from the front of the system as shown in the following figure. 8. Repeat these steps for all system covers that must be removed. Figure 115. Removing a cover that adheres to the fan assemblies Figure 116. Removing the cover 182 Disk drives Installing the front cover on the 9117-MMB, 9117-MMC, 9179-MHB, or 9179-MHC Use this procedure to install the cover after accessing components or performing service. To install the front cover, follow these steps: 1. Position the cover (A) on the front of the system unit so that the four pins on the system match the four holes on the back of the cover. 2. Press the tabs (B) to snap the cover into position. 3. Close the front rack door. Figure 117. Installing the front cover System covers 183 184 Disk drives Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. While holding the system unit release latches down on both the left and right sides, pull the system unit out from the rack until the rails are fully extended and locked. Figure 118. Placing the system in the service position © Copyright IBM Corp. 2010, 2011 185 Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position Use this procedure to place the rack-mounted system into the operating position. When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the blue rail safety latches (A) by lifting them upwards. 2. Push the system unit back into the rack until both system-unit release latches have locked into position. Figure 119. Unlocking the safety rail latches 186 Disk drives 3. Close the front rack door of the unit you are servicing. Figure 120. Placing the system into the operating position Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position 187 188 Disk drives Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. Note: Replacing the shipping screws is optional but should be done in geographic areas that are prone to seismic activity. 4. While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Figure 121. Placing the system in the service position © Copyright IBM Corp. 2010, 2011 189 190 Disk drives Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position Use this procedure to place the rack-mounted system into the operating position. When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the blue rail safety latches (A) by lifting them upward. The rail safety latches are located near the middle of the system. 2. Push the system unit (B) back into the rack until both system-unit release latches have locked into position. 3. Replace and tighten the two screws (if present) (A) that secure the system unit (B) to the rack. Note: Replacing the two shipping screws is optional but should be done in geographic areas that are prone to seismic activity. 4. Close the front rack door of the unit you are servicing. Figure 122. Placing the system into the operating position © Copyright IBM Corp. 2010, 2011 191 Figure 123. Placing the system into the operating position 192 Disk drives Service and operating positions Use these procedures to place IBM Power Systems servers into the service or operating position. Placing the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. Remove the two screws (A) (if present) that secure the system unit (B) to the rack. Note: Replacing the shipping screws is optional but should be done in geographic areas that are prone to seismic activity. © Copyright IBM Corp. 2010, 2011 193 4. While holding the system unit release latches (A) down on both the left and right sides, pull the system unit (B) out from the rack until the rails are fully extended and locked. Placing the rack-mounted 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system into the operating position Use this procedure to place the rack-mounted system into the operating position. Figure 124. Placing the system in the service position 194 Disk drives When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the blue rail safety latches (A) by lifting them upward. The rail safety latches are located near the middle of the system. 2. Push the system unit (B) back into the rack until both system-unit release latches have locked into position. 3. Replace and tighten the two screws (if present) (A) that secure the system unit (B) to the rack. Note: Replacing the two shipping screws is optional but should be done in geographic areas that are prone to seismic activity. 4. Close the front rack door of the unit you are servicing. Figure 125. Placing the system into the operating position Service and operating positions 195 Placing the 8231-E2B, 8231-E1C, or 8231-E2C system into the service or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the service position Use this procedure to place the rack-mounted system into the service position. Notes: v When placing the system into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system unit is in the service position at a time. v Ensure that the cables at the back of the system unit do not catch or bind as you pull the unit forward in the rack. v When the system unit rails are fully extended, the rail safety latches lock into place. This action prevents the system from being pulled out too far. 1. Open the front rack door. 2. Identify the system unit you are servicing in the rack. 3. While holding the system unit release latches down on both the left and right sides, pull the system unit out from the rack until the rails are fully extended and locked. Figure 126. Placing the system into the operating position 196 Disk drives Placing the rack-mounted 8231-E2B, 8231-E1C, or 8231-E2C system into the operating position Use this procedure to place the rack-mounted system into the operating position. When placing the system into operating position, ensure that the cables at the back of the system do not catch or bind as you push the system unit back into the rack. 1. Unlock the blue rail safety latches (A) by lifting them upwards. Figure 127. Placing the system in the service position Service and operating positions 197 2. Push the system unit back into the rack until both system-unit release latches have locked into position. Figure 128. Unlocking the safety rail latches Figure 129. Placing the system into the operating position 198 Disk drives 3. Close the front rack door of the unit you are servicing. Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing the 8233-E8B or 8236-E8C in the service position Use this procedure to perform service or gain access to internal components by placing the rack-mounted system or expansion unit in the service position. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER Service and operating positions 199 Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) To place the rack-mounted system or expansion unit into the service position, follow these steps: 1. If necessary, open the front rack door. 2. Remove the two screws (A) that secure the system unit to the rack as shown in the following figure. 3. Release the rack latches (B) on both the left and right sides as shown in the following figure. 200 Disk drives 4. If required, unplug the cables from the back of the system or expansion unit before you pull the unit out from the rack. Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you pull the unit out from the rack. 5. Slowly pull the system or expansion unit out from the rack until the rails are fully extended and locked. Ensure that the rails are fully extended. When the rails are fully extended, the rail safety latches lock into place. This action prevents the system or expansion unit from being pulled out too far. Placing the 8233-E8B or 8236-E8C in the operating position Use this procedure to place the system or expansion unit in the operating position to make the unit available for use. To place the system into the operating position follow these steps: Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. 1. Simultaneously release the blue rail safety latches (B), located near the front of each rail, and push the system or expansion unit into the rack as shown in the following figure. Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you push the unit back into the rack. Figure 130. Releasing the rack latches Service and operating positions 201 2. Replace and tighten the two thumbscrews (C) that secure the system or expansion unit (A) to the rack as shown in the following figure. 3. Close the front rack door. Figure 131. Releasing the rail safety latches Figure 132. Replacing the thumbscrews 202 Disk drives Power cords Use these procedures to disconnect and connect the power cords on IBM Power Systems servers that contain the POWER7 processor. Disconnecting the power cords from the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use these procedures to disconnect the power cords from the system. 1. For a rack-mounted system, open the back rack door on the unit you are servicing and identify the system unit you are servicing in the rack. 2. For both a rack-mounted system and a stand-alone system, disconnect all power cords from the unit you are servicing. This system might be equipped with two power supplies. Before continuing with any remove and replace procedures that require the power off, ensure that both power sources to the system have been completely disconnected. Figure 133. Removing the power cords for a rack-mounted system © Copyright IBM Corp. 2010, 2011 203 Connecting the power cords to the 8202-E4B, 8202-E4C, 8205-E6B, or 8205-E6C system Use this procedure to connect the power cords to the system. 1. Identify the system unit you are servicing in the rack. 2. Connect all power cords to the unit you are servicing. Ensure that the cords are threaded through the handles. Figure 134. Removing the power cords for a stand-alone system 204 Disk drives 3. Close the back rack door. Disconnecting the power cords from the 8231-E2B, 8231-E1C, or 8231-E2C Use these procedures to disconnect the power cords from the system. 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. This system might be equipped with two power supplies. Before continuing with any remove and replace procedures that require the power off, ensure that both power sources to the system have been completely disconnected. Figure 135. Connecting the power cords Power cords 205 Connecting the power cords to the 8231-E2B, 8231-E1C, or 8231-E2C Use these procedures to connect the power cords to the system. 1. Open the back rack door on the unit you are servicing. 2. Identify the system unit you are servicing in the rack. 3. Disconnect all power cords from the unit you are servicing. Ensure that the cords are threaded through the handles. Figure 136. Removing the power cords 206 Disk drives Figure 137. Connecting the power cords Power cords 207 208 Disk drives Verifying the installed part You can verify a newly installed or replaced part on your system, logical partition, or expansion unit using the operating system, stand-alone diagnostics, or the Hardware Management Console (HMC). Verifying an installed feature or replaced part in an AIX system or logical partition If you installed feature or replaced a part, you might want to use the tools in the AIX operating system to verify that the feature or part is recognized by the system or logical partition. To verify the operation of a newly installed feature or replacement part, select the appropriate procedure: v Verify the installed feature using AIX v Verifying the replaced part using AIX Verify the installed feature by using the AIX operating system: 1. Log in as root user. 2. At the command line, type diag and press Enter. 3. Select Advanced Diagnostics Routines and press Enter. 4. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 5. When the Advanced Diagnostic Selection menu appears, do one of the following: v To test a single resource, select the resource that you just installed from the list of resources and press Enter. v To test all the resources available to the operating system, select All Resources and press Enter. 6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts that appear. 7. Did the diagnostics run to completion and display the message No trouble was found? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the installation procedures to ensure that the new feature is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system is running in logical partitioning (LPAR) mode, note the logical partition in which you installed the feature. Contact your service provider for assistance. v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to normal operations. Verify the replacement part by using the AIX operating system: To verify the operation of a newly installed feature or replacement part, follow these steps: 1. Did you use either the AIX operating system or the online diagnostics service aid concurrent (hot-swap) service to replace the part? No: Go to step 2. Yes: Go to step 5 on page 210. 2. Is the system powered off? No: Go to step 4 on page 210. Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see Performing a slow boot. © Copyright IBM Corp. 2010, 2011 209 3. Start the system and wait until the AIX operating system login prompt is displayed or until apparent system activity on the operator panel or display has stopped. Did the AIX login prompt display? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the procedures for the part that you replaced to ensure that the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system does not start or you have no login prompt, see: Problems with loading and starting the operating system. If the system is partitioned, note the logical partition in which you replaced the part. Contact your service provider for assistance. v Yes: Go to step 4. 4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a command prompt, go to step 5. If the Diagnostic selection menu is shown with M appearing next to any resource, follow these steps: a. Select the resource and press Enter. b. Select Commit. c. Follow any instructions that are shown. d. If the Do you want to review the previously displayed error? message is shown, select Yes and press Enter. e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record the SRN and contact your service provider for assistance.. f. If no SRN is shown, go to step 5. 5. Test the part by doing the following steps: a. At the command line, type diag and press Enter. b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. c. From the Diagnostic Mode Selection menu, select System Verification and press Enter. d. Select All Resources, or select the diagnostics for the individual part to test only the part you replaced and any devices that are attached to the part you replaced and press Enter. Did the Resource Repair Action menu appear? No: Go to step 6. Yes: Go to step 7. 6. Did the Testing Complete, No trouble was found message appear? v No: There is still a problem. Contact your service provider. This ends the procedure. v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0 and press Enter. Tip: This action changes the indicator light for the part from the fault state to the normal state. Go to step 9 on page 211. 7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: On systems with an indicator light for the failing part, this action changes the indicator light to the normal state. 210 Disk drives a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the resource list, select sysplanar0 and press Enter. b. Select Commit after you make your selections. Did another Resource Repair Action display appear? No: If the No Trouble Found display appears, go to step 9 Yes: Go to step 8. 8. Select the parent or child of the resource for the replaced part from the Resource Repair Action menu if necessary. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: This action changes the indicator light for the part from the fault state to the normal state. a. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was to reseat a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the resource list, select sysplanar0 and press Enter. b. Select Commit after you make your selections. c. If the No Trouble Found display appears, go to step 9. 9. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the values they had prior to servicing the system. 10. Did you do any hot-plug procedures before doing this procedure? No: Go to step 11. Yes: Go to step 12. 11. Start the operating system, with the system or logical partition in normal mode. Were you able to start the operating system? No: Contact your service provider. This ends the procedure. Yes: Go to step 12. 12. Are the indicator lights still on? v No. This ends the procedure. v Yes. Turn off the lights. See the following for instructions: Changing service indicators. Verifying the installed part in an IBM i system or logical partition If you have installed a new feature or part, verify that the system recognizes the feature or part by using the IBM i system service tools. To verify the installed part, follow these steps: 1. Deactivate the failing item indicator light. For instructions, see “Deactivating the failing-part indicator light” on page 150. 2. Sign on with at least service level authority. 3. On the command line of the IBM i session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by Hardware Management Console (HMC), use the Service Focal Point Utilities to get to the Dedicated Service Tools (DST) display. 4. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display and press Enter. Verifying the installed part 211 Note: The service tools password is case-sensitive. 5. Select Start a service tool from the System Service Tools (SST) display and press Enter. 6. Select Hardware service manager from the Start a Service Tool display and press Enter. 7. Select Logical hardware resources (buses, IOPs, controllers) from the Hardware Service Manager display and press Enter. This option allows you to display and work with logical resources. Logical hardware resources are the functional resources of the system used by the operating system. With the Logical Hardware Resources display, you can show logical hardware resource status or information, and associated packaging hardware resources. Use the online Help information to better understand specific functions, fields, or symbols. Deactivating the failing-part indicator light Use this procedure to turn off any indicator light that you turned on as a part of a service action. To deactivate the indicator light, follow these steps: 1. Sign on to an IBM i session, with at least service level authority. 2. On the command line of the session, type strsst and press Enter. Note: If you cannot get to the System Service Tools display, use function 21 from the control panel. Alternatively, if the system is managed by a Hardware Management Console (HMC), use the Service Focal Point utilities to get to the Dedicated Service Tools (DST) display. 3. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display. Press Enter. Remember: The service tools password is case-sensitive. 4. Select Start a service tool from the System Service Tools (SST) display and press Enter. 5. Select Hardware service manager from the Start a Service Tool display and press Enter. 6. Select Work with service action log from the Hardware Service Manager display and press Enter. 7. On the Select Timeframe display, change the From: Date and Time field to a date and time prior to when the problem occurred. 8. Search for an entry that matches one or more conditions of the problem: v System Reference code v Resource v Date and time v Failing item list 9. Select option 2 (Display failing item information) to display the service action log entry. 10. Select option 2 (Display details) to display location information for the failing part to be replaced. The information displayed in the date and time fields is the date and time for the first occurrence of the specific System reference code for the resource displayed during the time range selected. 11. Select option 7 (Indicator off) to turn off the indicator light. 12. Select the Acknowledge all errors function at the bottom of the Service Action Log display, if all problems have been resolved. 13. Close the log entry by selecting option 8 (Close new entry) on the Service Action Log Report display. Verifying the installed part in a Linux system or logical partition If you have installed a new part, learn how to verify that the system recognizes the part. To verify the newly installed or replaced part, continue with “Verifying an installed part by using stand-alone diagnostics” on page 213. 212 Disk drives Verifying an installed part by using stand-alone diagnostics If you have installed or replaced a part, verify that the system recognizes the new part. You can use stand-alone diagnostics to verify an installed part in an AIX or Linux system, expansion unit, or logical partition. v If this server is directly attached to another server or attached to a network, ensure communications with the other servers have stopped. v The stand-alone diagnostics require use of all of the logical partition resources. No other activity can be running on the logical partition. v The stand-alone diagnostics require access to the system console. You access these diagnostics from a CD-ROM or from the Network Installation Management (NIM) server. This procedure describes how to use the diagnostics from a CD-ROM. For information on running diagnostics from the Network Installation Management (NIM) server, see Running stand-alone diagnostics from a Network Installation Management server. To use stand-alone diagnostics, follow these steps: 1. Stop all jobs and applications and then stop the operating system on the system or logical partition. 2. Remove all tapes, diskettes, and CD-ROM. 3. Turn off the system unit power. The next step boots the server or logical partition from the stand-alone diagnostics CD-ROM. If the optical drive is not available as the boot device on the server or logical partition on which you are working, follow these steps: a. Access the ASMI. See Accessing the ASMI for information on using the ASMI. b. On the ASMI main menu, click on Power/Restart Control. c. Click Power On/Off System. d. Select the Service mode boot from default boot list option in the AIX or Linux logical partition mode boot drop-down menu. e. Click Save settings and power on. As soon as the optical drive has power, insert the standalone diagnostic CD-ROM. f. Go to step 5. 4. Turn on the system unit power and immediately insert the diagnostics CD-ROM into the optical drive. 5. After the keyboard POST indicator displays on the system console and before the last POST indicator (speaker) displays, press the numeric 5 key on the system console to indicate that a service mode boot should be initiated using the default-service mode boot list. 6. Enter any requested password. 7. At the Diagnostic Operating Instructions display, press Enter. Tip: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Note: If you received an SRN or any other reference code when you attempted to start the system, contact your service provider for assistance.. 8. If the terminal type is requested, select the Initialize Terminal option on the Function Selection menu to initialize the operating system. 9. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. 10. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 11. When the Advanced Diagnostic Selection menu appears, select All Resources, or test only the part you replaced, and any devices that are attached to the part you replaced, by selecting the diagnostics for the individual part and press Enter. 12. Did the Testing Complete, No trouble was found message appear? Verifying the installed part 213 v No: There is still a problem. Contact your service provider. v Yes: Go to step 13. 13. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the value they had prior to servicing the system. 14. If the indicator lights are still on, follow these steps: a. Select Identify and Attention Indicators from the Task Selection menu to turn off the system attention and indicator lights and press Enter. b. Select Set System Attention Indicator to NORMAL and press Enter. c. Select Set All Identify Indicators to NORMAL and press Enter. d. Choose Commit. Note: This changes the system attention and identify indicators from the Fault state to the Normal state. e. Exit to the command line. Verifying the installed part by using Hardware Management Console If you have installed or replaced a part, use the Hardware Management Console (HMC) to update your HMC records after you have completed a service action on your server. If you have reference codes, symptoms, or location codes that you used during the service action, locate the records for use during this procedure. To verify the installed part, complete these steps: 1. At the HMC, examine the service action event log for any open service action events. See “Viewing serviceable events by using the HMC” on page 216 for details. 2. Are there any service action events that are open? No: If the system attention LED is still on, use the HMC to turn off the LED. See “Activating and deactivating LEDs by using the HMC” on page 215. This ends the procedure. Yes: Continue with the next step. 3. Record the list of open service action events. 4. Examine the details of the open service action event. Is the error code associated with this service action event the same as you gathered earlier. v No: Select one of the following options: – Review the other serviceable events, find one that does match, and continue with the next step. – If the log does not match what you had gathered earlier, contact your service provider. v Yes: Continue with the next step. 5. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 6. Click Close Event. 7. Add comments for the serviceable event. Include any unique additional information. Click OK. 8. Did you replace, add, or modify a field replaceable unit (FRU) of the open service action event? v No: Select the No FRU Replaced for this Serviceable Event option, and click OK to close the service action event. v Yes: Perform the following steps: a. From the FRU list, select a FRU that you need to update. b. Double-click the FRU and update the FRU information. c. Click OK to close the service action event. 9. If you continue to have problems, contact your service provider. 214 Disk drives Activating and deactivating LEDs by using the HMC Use this procedure to activate or deactivate LEDs using Service Focal Point for the Hardware Management Console (HMC). Choose from the following: v “Deactivating a system attention LED or partition LED” v “Activating or deactivating an identify LED” Deactivating a system attention LED or partition LED You can deactivate a system attention LED or a logical partition LED. For example, you might determine that a problem is not a high priority and decide to repair the problem at a later time. However, you want to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be activated again if another problem occurs. 1. In the navigation area, open Systems Management. 2. Open Servers and select the appropriate system. 3. In the content area, check the box for the appropriate Partition. 4. Select Tasks, then Operations, and then Manage Attention LED. 5. Select the appropriate Partition. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED. Activating or deactivating an identify LED The system provides several LEDs that help identify various components, such as enclosures or field replaceable units (FRUs), in the system. For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. In the navigation area, open Systems Management. 2. Select Servers. 3. In the content area, check the box for the appropriate System. 4. Select Tasks, then Operations, then LED Status, and then Identify LED. Verifying the installed part 215 5. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 6. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, select Selected ? List FRUs. 7. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Viewing serviceable events by using the HMC Use this procedure to view a serviceable event, including details, comments, and service history. To view serviceable events and other information about the events, you must be a member of one of the following roles: v Super administrator v Service representative v Operator v Product engineer v Viewer To view serviceable events, follow these steps: 1. In the navigation area, select Service Management. 2. Select Manage Serviceable Events. 3. Select the criteria for the serviceable events that you want to view, and click OK. The Serviceable Event Overview window opens. The list shows all serviceable events that match your selection criteria. You can use the menu options to perform actions on the serviceable events. 4. Select a line in the Serviceable Event Overview window, and select Selected ? View Details The Serviceable Event Details window opens, showing detailed information about the serviceable event. The upper table shows information, such as problem number and reference code. The lower table shows the field replaceable units (FRUs) associated with this event. 5. Select the error for which you want to view comments and history, and follow these steps: a. Select Actions ? View Comments. b. When you are finished viewing the comments, click Close. c. Select Actions ? View Service History. The Service History window opens, showing service history associated with the selected error. d. When you are finished viewing the service history, click Close. 6. When you are finished, click Cancel twice to close the Serviceable Event Details window and the Serviceable Event Overview window. Verifying the installed part by using Systems Director Management Console If you installed or replaced a part, use the IBM Systems Director Management Console (SDMC) to update your SDMC records after you have completed a service action on your server. If you have reference codes, symptoms, or location codes that you used during the service action, locate the records for use during this procedure. To verify the installed part, complete these steps: 1. From the SDMC, examine the service action event log for any open service action events. See “Viewing serviceable events by using the Systems Director Management Console” on page 218 for details. 2. Are there any service action events that are open? 216 Disk drives No: If the system attention LED is still on, use the SDMC to turn off the LED. See “Activating and deactivating LEDs by using the Systems Director Management Console.” This ends the procedure. Yes: Continue with the next step. 3. Record the list of open service action events. 4. Examine the details of the open service action event. Is the error code associated with this service action event the same as you gathered earlier. v No: Select one of the following options: – Review the other serviceable events, find one that does match, and continue with the next step. – If the log does not match what you had gathered earlier, contact your service provider. v Yes: Continue with the next step. 5. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 6. Click Delete or Ignore. Note: These options are only available from the problem event log. Activating and deactivating LEDs by using the Systems Director Management Console Use this procedure to activate or deactivate LEDs by using the IBM Systems Director Management Console (SDMC). Choose from the following: v “Deactivating a system attention LED or partition LED” v “Activating or deactivating an identify LED by using the SDMC” Deactivating a system attention LED or partition LED You can deactivate a system attention LED or a logical partition LED. For example, you might determine that a problem is not a high priority and decide to repair the problem at a later time. However, you want to be alerted if another problem occurs, so you must deactivate the system attention LED so that it can be activated again if another problem occurs. 1. On the Resources tab, select the appropriate host or virtual server. 2. Select Actions ? Service and Support ? Hardware ? System Attention LED. 3. Select Deactivate System Attention LED. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 4. Select one of the virtual servers, and select Deactivate System Attention LED. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the virtual server LED. Activating or deactivating an identify LED by using the SDMC The system provides several LEDs that help identify various components, such as enclosures or field replaceable units (FRUs). For this reason, they are called identify LEDs. Verifying the installed part 217 You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU, follow these steps: 1. On the Resources tab, select the appropriate host or virtual server. 2. Select Actions ? Service and Support ? Hardware ? Identify LED. 3. In the Identify LED, Select Enclosure window, select the system unit or enclosure. 4. To activate or deactivate an identify LED, click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 5. To activate or deactivate an identify LED for a FRU, select a system or enclosure from the table, and then select List FRUs.... 6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Viewing serviceable events by using the Systems Director Management Console Use this procedure to view a serviceable event, including details, comments, and service history. To view serviceable events, follow these steps: 1. On the Resources tab, select the appropriate host or virtual server. 2. Select Actions ? System Status and Health ? Event Log. 3. Optional: You can narrow the event criteria using the Event filter menu. 4. Select a line in the Events window, and select Actions ? Properties The Properties window opens, showing detailed information about the serviceable event. The table shows information, such as problem number, reference code, and the field replaceable units (FRUs) associated with this event. Verifying an installed feature or replaced part on a system or logical partition by using Virtual I/O Server tools If you installed feature or replaced a part, you might want to use the tools in Virtual I/O Server (VIOS) to verify that the feature or part is recognized by the system or logical partition. To verify the operation of a newly installed feature or replacement part, select the appropriate procedure: v Verify the installed feature using VIOS v Verifying the replaced part using VIOS Verify the installed feature by using VIOS: 1. Log in as root user. 2. At the command line, type diagmenu and press Enter. 3. Select Advanced Diagnostics Routines and press Enter. 4. From the Diagnostic Mode Selection menu, select System Verification and press Enter. 5. When the Advanced Diagnostic Selection menu appears, do one of the following: 218 Disk drives v To test a single resource, select the resource that you just installed from the list of resources and press Enter. v To test all the resources available to the operating system, select All Resources and press Enter. 6. Select Commit, and wait until the diagnostic programs run to completion, responding to any prompts that appear. 7. Did the diagnostics run to completion and display the message No trouble was found? v No: If a service request number (SRN) or other reference code is displayed, suspect a loose adapter or cable connection. Review the installation procedures to ensure that the new feature is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system is running in LPAR mode, note the logical partition in which you installed the feature. Contact your service provider for assistance. v Yes: The new device is installed correctly. Exit the diagnostic programs and return the system to normal operations. Verify the replacement part by using VIOS: To verify the operation of a newly installed feature or replacement part, follow these steps: 1. Did you replace the part by using either VIOS or the online diagnostics service aid's concurrent (hot-swap) service operation? No: Go to step 2. Yes: Go to step 5. 2. Is the system powered off? No: Go to step 4. Yes: If the system supports slow boot, set the system to perform a slow boot. For information, see Performing a slow boot. 3. Start the system and wait until the VIOS operating system login prompt displays or until apparent system activity on the operator panel or display has stopped. Did the VIOS login prompt display? v No: If an SRN or other reference code is displayed, suspect a loose adapter or cable connection. Review the procedures for the part that you replaced to ensure that the new part is installed correctly. If you cannot correct the problem, collect all SRNs or any other reference code information that you see. If the system does not start or you have no login prompt, see: Problems with loading and starting the operating system. If the system is partitioned, note the logical partition in which you replaced the part. Contact your service provider for assistance. v Yes: Go to step 4 4. At the command prompt, type diag —a and press Enter to check for missing resources. If you see a command prompt, go to step 5. If the Diagnostic selection menu is shown with M appearing next to any resource, follow these steps: a. Select the resource and press Enter. b. Select Commit. c. Follow any instructions that are shown. d. If a Do you want to review the previously displayed error? message is shown, select Yes and press Enter. e. If an SRN is shown, suspect a loose card or connection. If no obvious problem is shown, record the SRN and contact your service provider for assistance.. f. If no SRN is shown, go to 5. 5. Test the part by doing the following: Verifying the installed part 219 a. At the command line, type diagmenu and press Enter. b. From the Function Selection menu, select Advanced Diagnostics Routines and press Enter. c. From the Diagnostic Mode Selection menu, select System Verification and press Enter. d. Select All Resources, or select the diagnostics for the individual part to test only the part you replaced, and any devices that are attached to the part you replaced and press Enter. Did the Resource Repair Action menu appear? No: Go to step 6. Yes: Go to step 7. 6. Did the Testing Complete, No trouble was found message appear? v No: There is still a problem. Contact your service provider. This ends the procedure. v Yes: Select Log Repair Action, if not previously logged, from the Task Selection menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the Resource List, select sysplanar0 and press Enter. Tip: This action changes the indicator light for the part from the fault state to the normal state. Go to step 9 7. Select the resource for the replaced part from the Resource Repair Action menu. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: On systems with a indicator light for the failing part, this changes the indicator light to the normal state. a. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the Resource List, select sysplanar0. Press Enter. b. Select Commit after you make your selections. Did another Resource Repair Action display appear? No: If the No Trouble Found display appears, go to step 9 Yes: Go to step 8. 8. Select the parent or child of the resource for the replaced part from the Resource Repair Action menu if necessary. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu appears. Complete the following steps to update the AIX error log to indicate that a system-detectable part has been replaced. Note: This changes the indicator light for the part from the fault state to the normal state. a. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was to reseat a cable or adapter, select the resource associated with that repair action. If the resource associated with your action does not appear on the Resource List, select sysplanar0. Press Enter. b. Select Commit after you make your selections. c. If the No Trouble Found display appears, go to step 9. 9. If you changed the service processor or network settings, as instructed in previous procedures, restore the settings to the values they had prior to servicing the system. 10. Did you do any hot-plug procedures before doing this procedure? No: Go to step 11 on page 221. 220 Disk drives Yes: Go to step 12. 11. Start the operating system, with the system or logical partition in normal mode. Were you able to start the operating system? No: Contact your service provider. This ends the procedure. Yes: Go to step 12. 12. Are the indicator lights still on? v No. This ends the procedure. v Yes. Turn off the lights. See one of the following for instructions: Changing service indicators Verifying the installed part 221 222 Disk drives Verifying a repair Use these procedures to verify hardware operation after making repairs to the system. Choose from the following options: v To verify the repair of a system that is currently powered off, go to step 1. v To verify the repair of a system that is currently powered on with no operating system loaded, go to step 3. v To verify the repair of a system that is currently powered on and that has an operating system loaded, go to step 5. 1. Power on the server and all attached I/O enclosures. Did all the enclosures power on? Yes: Go to step 3.? Go to step 3. No: Continue with the next step. 2. Choose from the following options: v If the original problem was that an enclosure would not power on and you have another FRU to replace, locate and replace the next field-replaceable unit (FRU). v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that an enclosure would not power on and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that an enclosure would not power on and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 3. Load the operating system. Did the operating system load successfully? Yes: Go to step 5. No: Continue with the next step. 4. Choose from the following options: v If the original problem was a failing disk drive that contained the operating system software, go to step 5. v If the original problem was that the operating system would not load and you have another FRU to replace, go to your FRU locations section to locate the next FRU. v If the next FRU in the FRU list is an isolation procedure, perform the isolation procedure. v If the original problem was that the operating system would not load and you have an isolation procedure to complete, perform the isolation procedure. v If the original problem was that the operating system would not load and there are no more FRUs or isolation procedures in the FRU list, contact your next level of support. v If you have a new problem, perform problem analysis and repair the new problem. 5. Choose from the following options: v “Verifying the repair in AIX” on page 224 v “Verifying the repair in Linux” on page 229 v “Verifying a repair using an IBM i system or logical partition” on page 227 © Copyright IBM Corp. 2010, 2011 223 Verifying the repair in AIX You can use this procedure to verify that a repair is complete using the AIX operating system. Use this maintenance analysis procedure (MAP) to check out the server after a repair is completed. 1. Did you replace a disk drive in the root volume group? No Go to step 3. Yes Continue with the next step. 2. Run stand-alone diagnostics either from a CD or from a Network Installation Management (NIM) server. Did you encounter any problems? No Reinstall the operating system and continue with step 5. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 3. Did you replace a FRU with the power turned on and concurrently with system operations? No Go to step 5. Yes Continue with the next step. 4. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? Yes Go to step 6. No Go to step 7 on page 225. Note: The AIX diagnostic service aid was used if a resource was removed using the Hot Plug task. 5. Note: If any FRUs have been removed that should be reinstalled, reinstall them now. Perform the following steps: 1. If the system is not powered on, power it on now. 2. Perform a slow boot. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. 4. Did you encounter any problems? No Continue at step 6. Yes If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 6. If the Resource Repair Action menu is already displayed, go to step 9 on page 225; otherwise, perform the following steps: 224 Disk drives 1. Log into the operating system either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an SRN displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with the next step. 7. Perform the following steps: 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select System verification. 5. When the Diagnostic Selection menu displays, select the All Resources option or test the FRUs you exchanged, and any devices that are attached to the FRU(s) you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? No Continue with the next step. Yes Go to step 9. 8. Did the Testing Complete, no trouble was found menu (801010) display? Yes Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this action will set it back to the normal state. Go to step 11 on page 226. No If the original problem still exists, replace the FRU or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis. 9. When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on that resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: Verifying a repair 225 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? No If the No Trouble Found menu displays, go to step 11. Yes Continue with the next step. 10. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Perform the following steps: 1. From the RESOURCE REPAIR ACTION menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. 3. If the No Trouble Found menu displays, continue with the next step. 11. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This information does not apply to the PCI-X RAID adapter or cache. No Go to the close of call procedure. Yes Continue with the next step. 12. Use the Recover Options selection to resolve the RAID configuration. To do this, perform the following steps: 226 Disk drives 1. On the PCI SCSI Disk Array Manager display, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options screen, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration screen, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next screen, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. If you see a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When the recovery is complete, exit the operating system. 9. Go to the close of call procedure. Verifying a repair using an IBM i system or logical partition Use this procedure to verify a repair using the IBM i operating system. 1. Was the system powered off during the repair? Yes: Continue with the next step. No: Continue with step 3. 2. Perform the following tasks: a. Verify that the power cable is plugged into the power outlet. b. Verify that power is available at the customer's power outlet. 3. Was the partition powered off during the repair? Yes: Continue with the next step. No: Continue with step 6. 4. Select the IPL type and mode for the system or logical partition that the customer uses (see IPL type mode and speed options in the Service functions). 5. Start an IPL by powering on the system or partition (see Powering on and powering off). Did the system complete the IPL? Yes: Continue with the next step. No: This might be a new problem. Go to the Start of call procedure. This ends the procedure. 6. Did the system or partition remain running throughout the repair, and was the I/O processor, I/O adapter, or storage device replaced? Yes: Continue with step 10. No: Continue with the next step. 7. Use the service action log or serviceable event view (if the system is managed by an HMC) to look for any reference codes that are related to this IPL (see Using the Service Action Log). Are there any reference codes that are related to this IPL? Yes: Continue with the next step. No: If the problem was related to removable media or communications, perform the verification procedures in the Service functions to verify that the problem is corrected. Then return the system to the customer and have the customer verify the system date and time. This ends the procedure. Verifying a repair 227 8. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure. This ends the procedure. 9. Are there any other failing items that remain to be replaced? Yes: Replace the next failing item listed for this reference code. This ends the procedure. No: Contact your next level of support for assistance. This ends the procedure. 10. Was concurrent maintenance performed on an optical storage unit? Yes: The product activity log and service action log, in most cases, contain a reference code for the optical storage unit when concurrent maintenance is performed. You can ignore this reference code. Perform the following: - Perform the verification procedures in the Service functions topic to verify that the problem is corrected. - Return the system to the customer and have the customer verify the system date and time. This ends the procedure. No: Continue with the next step. 11. Use the service action log to look for any new reference codes (see Using the Service Action Log). Are there any new reference codes? Yes: Continue with the next step. No: Go to step 14. 12. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure to determine the cause of the problem. This ends the procedure. 13. Are there any other failing items that need to be replaced? Yes: Replace the next failing item listed for the reference code. This ends the procedure.. No: Contact your next level of support for assistance. This ends the procedure. 14. Are you working with a tape device? Yes: Perform the verification procedures in the Service functions to verify that the problem is corrected. After the verification test is complete, the tape device description will be set to the failed state because a resource change was detected. Perform the following tasks: - Vary the tape device description off and then on. - Return the system to the customer and have the customer verify the system date and time. Then go to Verifying the repair from the HMC. This ends the procedure. No: Continue with the next step. 15. Are you working with an IOP or an IOA? Yes: Use the display hardware configuration service function to check for any missing or failed hardware: - On the command line, enter the STRSST (Start System Service Tools command). If you cannot get to SST, select DST. Do not IPL the system or partition to get to DST. - On the Start Service Tools Sign On display, enter the user ID with the service authority and password. - Select Start a service tool ? Hardware service manager ? Logical hardware resources ? System bus resources. - Select the function key for Include nonreporting resources. - If the IOP and IOA that you just replaced is a failed or non-reporting resource, the problem has not been fixed. Continue to the next failing item in the failing item list. This ends the procedure. 228 Disk drives No: Perform the verification procedures in the Service functions topics to verify that the problem is corrected. Resources that usually vary on automatically during an IPL, or that were previously varied on manually, might need to be varied on again after the verification procedures are complete. Return the system to the customer and have the customer verify the system date and time. This ends the procedure. Verifying the repair in Linux You can use this procedure to verify that a repair is complete using the Linux operating system. 1. Run stand-alone diagnostics from either a CD or from a Network Installation Management (NIM) server. See Running the stand-alone diagnostics from CD-ROM. Did you encounter any problems? No Reboot the operating system and continue with the close of call procedure. Yes If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support. If a new problem has occurred, go to Beginning problem analysis and repair the new problem. Verifying the repair from the management console Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer using the management console. Follow this checklist before performing the procedures: v You return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the HMC online repair procedures, ensure that the original serviceable event is now closed. 1. Is a management console used to manage the server that you are servicing? v Yes: Continue with the next step. v No: Return to the “Verifying a repair” on page 223. This ends the procedure. 2. Are you closing a service event that was a repair on the management console personal computer? v Yes: Continue with the next step. v No: Go to step 4 on page 230. 3. Power on the management console. Did the power-on process complete without errors? v Yes: Ensure that the management console can be used to perform server management tasks, and return the management console to normal operations. Go to “Closing a service call” on page 231. This ends the procedure. v No: Go to the HMC isolation procedures. This ends the procedure. Verifying a repair 229 4. Log into the management console as the service representative. If invalid user or invalid password displays, get the correct login information from the system administer. 1. If logged on the System Manager, select Exit from the Console, located on the System Manager window. 2. Login to the System Manager with the following: User identification service Password service mode 5. View Serviceable Event Details. 1. In the navigation area, select the Service Applications icon. 2. In the navigation area, select the Service Focal Point icon. 3. In the Contents area, select Manage Serviceable Events. 4. Designate the set of Serviceable Events you want to view. When you are finished, click OK. The Service Event Overview window opens. Note: Only the events that match all of the criteria you specify are shown. 6. Close open or delayed events. 1. Select the problem to close, on the Service Event Overview window. 2. Select the menu Selected, located on the menu bar. 3. Click Close Event. 4. Enter your comments in the Serviceable Event Comments window, and click Close Event. 5. Close all events associated with the problem on which you were working. 7. Did the Service Event Overview window contain the event or events on which you were working? v Yes: Return the HMC to normal operations. Go to “Closing a service call” on page 231. This ends the procedure. v No: Go to Detecting problems. This ends the procedure. 230 Disk drives Closing a service call Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable events might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and no problems require additional service actions. v If the repair was done using the management console online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. Is the server managed by a management console? v Yes: Continue with the next step. v No: Do one of the following steps: – If the server is managed by Integrated Virtualization Manager (IVM), go to “Closing a service call using Integrated Virtualization Manager” on page 239. – If the server is not partitioned and is running the AIX or Linux operating system, go to “Closing a service call using AIX or Linux” on page 235. 2. On the HMC, open Manage Serviceable Events and examine the service action event log for any open service action events. 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 245. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform the following step 6 through step 32 on page 234 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. © Copyright IBM Corp. 2010, 2011 231 Is the error code that is associated with this service action event the same as recorded in Step 1 on page 231? v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 231? v Yes: Go to step 11. v No: Continue with the next step. 10. The FRU list is different. Is the FRU you replaced and recorded in step 1 on page 231 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 32 on page 234. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? v Yes: Continue with the next step. v No: Go to step 17 on page 233. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11. Go to step 16. 15. Add the partition list obtained in step 11 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11. If you are referred to the list of partitions obtained in step 11 in future steps, the list is empty. Go to step 17 on page 233. 232 Disk drives 17. Select and highlight the service action event from the Error Associated With This Serviceable Event window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. The following steps will add or update FRU information. 20. Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. From the FRU list, select a FRU that you need to update. Double-click the FRU, and update the FRU information. Go to step 23. 22. Select the No FRU Replaced for this Serviceable Event option. 23. Click OK to close the service action event. 24. Is the list of all partitions you recorded in step 11 on page 232 empty? v Yes: Go to step 32 on page 234. v No: Continue with the next step. 25. Does the list of all partitions you recorded in step 11 on page 232 contain more than one entry? v Yes: Continue with the next step. v No: Go to step 32 on page 234. 26. Is the error class recorded in step 25 AIX? v Yes: Continue with the next step. v No: Go to step 32 on page 234. 27. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 232, except the partition you were using to debug the original problem. 28. From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. Closing a service call 233 29. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 30. Exit from diagnostics in this partition and return to the AIX command prompt. 31. Have all the partitions in the list of all the partitions you recorded in step 11 on page 232 been processed? v Yes: Continue with the next step. v No: Go to step 24 on page 233 to process the next partition in the list you recorded in step 11 on page 232. 32. Have all the serviceable events recorded in step 4 on page 231 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 231 and process the next service action event in the list of serviceable events recorded in step 4 on page 231. 33. While processing all service action events, were you directed to step 14 on page 232? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 245. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 34. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 232, except the partition you were using to debug the original problem. 35. From the management console virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 36. When the diagnostic operating instructions are displayed, do the following steps: 234 Disk drives 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 37. Exit from diagnostics in this partition and return to the AIX command prompt. 38. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 232 been processed? v Yes: Continue with the next step. v No: Go to step 34 on page 234 to process the next partition in the list you recorded in step 14 on page 232. 39. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 245. This completes the repair. Return the system to the customer. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Closing a service call using AIX or Linux If the server is not connected to an management console and not using Integrated Virtualization Manager (IVM), perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and that no problems require additional service actions. v If the repair was done using the IVM online repair procedures, ensure that the original serviceable event is now closed. 1. Did you use an AIX diagnostics service aid hot-swap operation to change the FRU? v Yes: Go to step 4 on page 236 v No: Continue with the next step. Closing a service call 235 2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that were removed during problem analysis that you want to put back into the system? Note: If the system planar or battery has been replaced and you are loading diagnostics from a server over a network, it might be necessary for the customer to set the network boot information for this system before diagnostics can be loaded. The system time and date information should also be set when the repair is completed. v Yes: Reinstall all of the FRUs that were removed during problem analysis. Go to step 3 v No: Continue with the next step. 3. Is the system or logical partition that you are performing a repair action on running the AIX operating system? v Yes: Continue with the next step. v No: Go to step 5. 4. Does the system or logical partition you are performing a repair action on have AIX installed? Note: Answer no to this question if you have just replaced a hard disk in the root volume group. v Yes: Go to step 7. v No: Continue with the next step. 5. Run stand-alone diagnostics in problem determination mode from either a CD-ROM or from a Network Installation Management (NIM) server. Note: For instructions on running stand-alone diagnostics from a CD and not using an HMC, go to Running the stand-alone diagnostics from CD on a server without an HMC attached. For instructions on running stand-alone diagnostics from a NIM server, go to Running the stand-alone diagnostics from a Network Installation Management server. Did you encounter any problems? v Yes: Go to problem analysis. v No: Continue with the next step. 6. The system hardware is functioning correctly. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 245. This completes the repair. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. This might require you to reboot the operating system. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. 7. Complete the following steps: 236 Disk drives 1. If the system supports slow boot (see Performing a slow boot), do a slow boot on the system. If the system does not support slow boot, do a normal boot. 2. Power on the system. 3. Wait until the AIX operating system login prompt displays or until system activity on the operator panel or display apparently has stopped. Did the AIX Login Prompt display? v Yes: Continue with the next step. v No: Go to problem analysis. 8. If the Resource Repair Action menu is already displayed, go to 12 on page 238; otherwise, do the following steps: 1. Log into the operating system, either with root authority (if needed, ask the customer to enter the password) or use the CE login. 2. Enter the diag -a command and check for missing resources. Follow any instructions that display. If an system request number (SRN) displays, suspect a loose card or connection. If no instructions display, no resources were detected as missing. Continue with 9. 9. Complete the following steps: 1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu displays, select Problem determination. 5. When the Advanced Diagnostic Selection menu displays, select the All Resources option, or test the FRUs you exchanged, and any devices that are attached to the FRUs you exchanged, by selecting the diagnostics for the individual FRU. Did the Resource Repair Action menu (801015) display? v Yes: Go to step 13 on page 238. v No: Continue with the next step. 10. Did the TESTING COMPLETE, no trouble was found menu (801010) display? v Yes: Continue with the next step. v No: There is still a problem. Go to problem analysis. 11. Use the Log Repair Action option, if not previously logged, in the TASK SELECTION menu to update the AIX error log. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. Closing a service call 237 If the resource associated with your action is not displayed on the resource list, select sysplanar0. Note: If the system attention indicator is on, this will set it back to the normal state. Go to step 14. 12. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing a FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following steps: 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press Commit after you make your selections. Did another Resource Repair Action (801015) display? v Yes: Continue with the next step. v No: If the No Trouble Found menu displays, go to step 14. 13. The parent or child of the resource you just replaced might also require that you run the Resource Repair Action service aid on it. When a test is run on a resource in system verification mode, and that resource has an entry in the AIX error log, if the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu. This updates the AIX error log to indicate that a system-detectable FRU has been replaced. Note: If the system attention indicator is on, this action will set it back to the normal state. Do the following steps: 1. From the Resource Repair Action menu, select the parent or child of the resource that has been replaced. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the resource list, select sysplanar0. 2. Press COMMIT after you make your selections. Did the No Trouble Found menu display? v Yes: Continue with the next step. v No: Go to problem analysis. 14. If you changed the service processor or network settings, as instructed in previous MAPs, restore the settings to the value they had prior to servicing the system. If you ran stand-alone diagnostics from CD-ROM, remove the stand-alone diagnostics CD-ROM from the system. 238 Disk drives Did you perform service on a RAID subsystem involving changing of the PCI RAID adapter cache card or changing the configuration? Note: This does not refer to the PCI-X RAID adapter or cache. v Yes: Continue with the next step. v No: Go to step 16. 15. Use the Recover Options selection to resolve the RAID configuration. To do this, do the following steps: 1. On the PCI SCSI Disk Array Manager dialog, select Recovery options. 2. If a previous configuration exists on the replacement adapter, this must be cleared. Select Clear PCI SCSI Adapter Configuration and press F3. 3. On the Recovery Options dialog, select Resolve PCI SCSI RAID Adapter Configuration. 4. On the Resolve PCI SCSI RAID Adapter Configuration dialog, select Accept Configuration on Drives. 5. On the PCI SCSI RAID Adapter selections menu, select the adapter that you changed. 6. On the next dialog, press Enter. 7. When you see the Are You Sure selection menu, press Enter to continue. 8. You should see an OK status message when the recover is complete. If you get a Failed status message, verify that you selected the correct adapter, and then repeat this procedure. When recover is complete, exit the operating system. 9. Go to 16. 16. The system hardware is functioning correctly. Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. Closing a service call using Integrated Virtualization Manager Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer. Follow this checklist before performing the procedures: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours. v While you were performing the problem analysis on the original serviceable event, other serviceable-event numbers might have been opened. Close all serviceable events that were opened as a result of your service activity. v Ensure that server verification has been performed and there are no problems that require additional service actions. v If the repair was done using the Integrated Virtualization Manager (IVM) online repair procedures, ensure that the original serviceable event is now closed. 1. Record the system reference code (SRC) or symptom and the location code of the field-replaceable unit (FRU) you replaced, for future reference. Closing a service call 239 2. On the IVM, open Manage Serviceable Events and look at existing serviceable events. 3. Are there any service action events that are open? v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 245. Return the system to the customer. This completes the repair. 4. Record the list of open service action events. 5. From the list of serviceable events recorded in step 4, perform step 6 through step 30 on page 242 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7. Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in step 1 on page 239? v Yes: Go to step 11. v No: Continue with the next step. 8. Examine the FRU list of the service action event. Are any FRUs listed for the service action event? v Yes: Continue with the next step. v No: Go to step 11. 9. Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 239? v Yes: Go to step 11. v No: Continue with the next step. 10. Is the FRU you replaced and recorded in step 1 on page 239 in the list of FRUs for this service action event? v Yes: Continue with the next step. v No: Go to step 30 on page 242. Note: There are service action events that will remain open when you leave this MAP. Further service actions might be required to complete the repair. 11. Examine the details of this service action event, and record the partitions involved in this service action event for use in a later step. 240 Disk drives 12. Is the error code associated with this service action event of the form A11-xxx or A01-xxx? v Yes: Continue with the next step. v No: Go to step 17. 13. Have you begun a list of Axx partitions from prior service action events that you processed in this maintenance analysis procedure (MAP)? v Yes: Go to step 15. v No: Continue with the next step. 14. Begin a new list of Axx partitions by copying the list of partitions obtained in step 11 on page 240. Go to step 16. 15. Add the partition list obtained in step 11 on page 240 to the existing list of Axx partitions obtained from processing previous service action events in this MAP. 16. Remove all entries in the list of all partitions you recorded in step 11 on page 240. If you are referred to the list of partitions obtained in step 11 on page 240 in future steps, the list is empty. Go to step 17. 17. Select and highlight the service action event from the Manage Serviceable Events window. 18. Click Close Event. 19. Add comments for the serviceable event. Include any unique additional information. Click OK. 20. Add or update FRU information: Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. Click OK to close the service action event. 22. Is the list of all partitions you recorded in step 11 on page 240 empty? v Yes: Go to step 30 on page 242. v No: Continue with the next step. Closing a service call 241 23. Does the list of all partitions you recorded in step 11 on page 240 contain more than one entry? v Yes: Continue with the next step. v No: Go to step 30. 24. Is the error class recorded in step 23? v Yes: Continue with the next step. v No: Go to step 30. 25. Perform the following steps for each entry in the list of all partitions you recorded in step 11 on page 240, except the partition you were using to debug the original problem. 26. From the IVM virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. 27. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 28. Exit from diagnostics in this partition and return to the AIX command prompt. 29. Have all the partitions in the list of all partitions you recorded in step 11 on page 240 been processed? v Yes: Continue with the next step. v No: Go to step 25 to process the next partition in the list you recorded in step 11 on page 240. 30. Have all the serviceable events recorded in step 4 on page 240 been processed? v Yes: Continue with the next step. v No: Go to step 5 on page 240 and process the next service action event in the list of serviceable events recorded in step 4 on page 240. 31. While processing all service action events, were you directed to step 14 on page 241? 242 Disk drives v Yes: Continue with the next step. v No: If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 245. Return the system to the customer. This completes the repair. Note: If during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. 32. Perform the following steps for each entry in the list of Axx partitions you began recording in step 14 on page 241, except the partition you were using to debug the original problem. 33. From the IVM virtual terminal window of a partition in the list of Axx partitions, type diag at the AIX command prompt. 34. When the diagnostic operating instructions are displayed, do the following steps: 1. Press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click Commit after you have made your selection. 35. Exit from diagnostics in this partition and return to the AIX command prompt. 36. Have all the partitions in the list of Axx partitions you began recording in step 14 on page 241 been processed? v Yes: Continue with the next step. v No: Go to step 32 to process the next partition in the list you recorded in step 14 on page 241. 37. If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 245. This completes the repair. Return the system to the customer. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Closing a service call 243 244 Disk drives Activating and deactivating LEDs You can use these procedures to activate or deactivate light-emitting diodes (LEDs) using the management console or the Advanced System Management Interface (ASMI). For IBM Power Systems servers that contain the POWER7 processor, the LEDs can be used to identify or verify a part that you are servicing. The error and identify function (amber) LED indicates an error and corresponds to the location code in the system reference code (SRC). The LED is activated and deactivated automatically. Additionally, the following procedures can also be used to activate and deactivate LEDs. v “Deactivating a system attention LED or partition LED using the management console” v “Activating or deactivating an identify LED using the management console” on page 246 v “Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface” on page 246 v “Activating or deactivating an identify LED using the Advanced System Management Interface” on page 247 Deactivating a system attention LED or partition LED using the management console You can deactivate a system attention LED or a logical partition LED if you decide that a problem is not a high priority and you decide to repair the problem at a later time. You can do this task from the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). If you want to be alerted if another problem occurs, you must deactivate the system attention LED so that it can be activated again if another problem occurs. To deactivate a system attention LED by using HMC, complete the following steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on by checking the box next to its name. 3. Open Operations. 4. Open LED Status. 5. Select View System Attention. The system attention LED window opens. The selected system and its LED state are displayed in the upper part of the window. The logical partition and its LED state are displayed in the lower part of the window. From the system attention LED window, you can deactivate both the system attention LED and the logical partition LED. 6. Select Deactivate System Attention LED from the Action menu. A confirmation window is displayed that provides the following information: v A verification that the system attention LED was deactivated. v An indication that there still might be open problems within the system. v An indication that you cannot activate the system attention LED. 7. Select one of the logical partitions in the lower table, and select Deactivate partition LED from the Partition Operations menu. A confirmation window is displayed that provides the following information: v A verification that the logical partition LED was deactivated. v An indication that there still might be open problems within the logical partition. v An indication that you cannot activate the logical partition LED. © Copyright IBM Corp. 2010, 2011 245 To deactivate a system attention LED by using SDMC, complete the following steps: 1. On the Welcome page, under the Resources tab, select the server. 2. Click Actions > Service and Support> Hardware> System Attention LED. 3. Click OK. Activating or deactivating an identify LED using the management console You can activate or deactivate an identify LED for components attached to the system from the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). The system provides several LEDs that help identify various components, such as enclosures or field-replaceable units (FRUs), in the system. For this reason, they are called identify LEDs. You can activate or deactivate the following types of identify LEDs: v Identify LED for an enclosure If you want to add an adapter to a specific drawer (enclosure), you need to know the machine type, model, and serial number (MTMS) of the drawer. To determine whether you have the correct MTMS for the drawer that needs the new adapter, you can activate the LED for a drawer and verify that the MTMS corresponds to the drawer that requires the new adapter. v Identify LED for a FRU associated with a specified enclosure If you want to hook up a cable to a specific I/O adapter, you can activate the LED for the adapter, which is a field replaceable unit (FRU), and then physically check to see where you should hook up the cable. This is especially useful when you have several adapters with open ports. To activate or deactivate an identify LED for an enclosure or FRU by using the HMC, follow these steps: 1. In the navigation area, open Systems management. 2. Select the server you are working on. 3. Click Operations ? LED Status ? Identify LED. The Identify LED, Select Enclosure window opens. 4. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 5. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then select Selected ? List FRUs. 6. Select one or more FRUs from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. To activate or deactivate an identify LED for an enclosure or FRU by using the SDMC, complete these steps: 1. On the Welcome page, under the Resources tab, select the server. 2. Click Actions > Service and Support> Hardware> Identify LED. 3. To activate or deactivate an identify LED for an enclosure, select an enclosure from the table, and click either Activate LED or Deactivate LED. The associated LED is either turned on or off. 4. To activate or deactivate an identify LED for a FRU, select an enclosure from the table, and then click List FRUs. 5. Select one or more FRUs from the table, and then click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Deactivating a system attention LED or logical partition LED using the Advanced System Management Interface You can deactivate a system attention LED or a logical partition LED using the Advanced System Management Interface (ASMI). 246 Disk drives The system attention indicator provides a visual signal that the system as a whole requires attention or service. Each system has a single system attention indicator. When an event occurs that either needs your intervention or that of service and support, the system attention indicator lights continuously. The system attention indicator is turned on when an entry is made in the service processor error log. The error entry is transmitted to the system error log and the operating system error log. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To turn off the system attention indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Service Indicators ? System Attention Indicator. 3. In the right pane, click Turn off system attention indicator. If the attempt is unsuccessful, an error message is displayed. Activating or deactivating an identify LED using the Advanced System Management Interface You can activate or deactivate an identify LED using the Advanced System Management Interface (ASMI). You can specify the location code of any indicator to view or modify its current state. If you provide the wrong location code, the advanced system manager attempts to go to the next higher level of the location code. The next level is the base-level location code for that field replaceable unit (FRU). For example, a user types the location code for the FRU located on the second I/O slot of the third enclosure in the system. If the location code for the second I/O slot is incorrect (the FRU does not exist at this location), an attempt to set the indicator for the third enclosure is initiated. This process continues until a FRU is located or no other level is available. To perform this operation, your authority level must be one of the following levels: v Administrator v Authorized service provider To change the current state of an indicator, do the following steps: 1. On the ASMI Welcome pane, specify your user ID and password, and click Log In. 2. In the navigation area, expand System Configuration ? Service Indicators ? Indicators by Location code. 3. In the right pane, enter the location code of the FRU and click Continue. 4. Select the preferred state from the list. 5. Click Save settings. Activating and deactivating LEDs 247 248 Disk drives Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer's representative for information on the products and services currently available in your area. Any reference to the manufacturer's product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not grant you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to websites not owned by the manufacturer are provided for convenience only and do not in any manner serve as an endorsement of those websites. The materials at those websites are not part of the materials for this product and use of those websites is at your own risk. The manufacturer may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment. Information concerning products not produced by this manufacturer was obtained from the suppliers of those products, their published announcements or other publicly available sources. This manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products not produced by this manufacturer. Questions on the capabilities of products not produced by this manufacturer should be addressed to the suppliers of those products. All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only. The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are subject to change without notice. Dealer prices may vary. © Copyright IBM Corp. 2010, 2011 249 This information is for planning purposes only. The information herein is subject to change before the products described become available. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. If you are viewing this information in softcopy, the photographs and color illustrations may not appear. The drawings and specifications contained herein shall not be reproduced in whole or in part without the written permission of the manufacturer. The manufacturer has prepared this information for use with the specific machines indicated. The manufacturer makes no representations that it is suitable for any other purpose. The manufacturer's computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check the manufacturer's support websites for updated information and fixes applicable to the system and related software. Ethernet connection usage restriction This product is not intended to be connected directly or indirectly by any means whatsoever to interfaces of public telecommunications networks. Trademarks IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Other product and service names might be trademarks of IBM or other companies. Electronic emission notices When attaching a monitor to the equipment, you must use the designated monitor cable and any interference suppression devices supplied with the monitor. Class A Notices The following Class A statements apply to the IBM servers that contain the POWER7 processor and its features unless designated as electromagnetic compatibility (EMC) Class B in the feature information. Federal Communications Commission (FCC) statement Note: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment 250 Disk drives generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate the equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class A digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com Warning: This is a Class A product. In a domestic environment, this product may cause radio interference, in which case the user may be required to take adequate measures. VCCI Statement - Japan The following is a summary of the VCCI Japanese statement in the box above: Notices 251 This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a domestic environment, radio interference may occur, in which case, the user may be required to take corrective actions. Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) Electromagnetic Interference (EMI) Statement - People's Republic of China Declaration: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may need to perform practical action. Electromagnetic Interference (EMI) Statement - Taiwan The following is a summary of the EMI Taiwan statement above. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information: 252 Disk drives Electromagnetic Interference (EMI) Statement - Korea Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Notices 253 Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Electromagnetic Interference (EMI) Statement - Russia Class B Notices The following Class B statements apply to features designated as electromagnetic compatibility (EMC) Class B in the feature installation information. Federal Communications Commission (FCC) statement This equipment has been tested and found to comply with the limits for a Class B digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference in a residential installation. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instructions, may cause harmful interference to radio communications. However, there is no guarantee that interference will not occur in a particular installation. If this equipment does cause harmful interference to radio or television reception, which can be determined by turning the equipment off and on, the user is encouraged to try to correct the interference by one or more of the following measures: v Reorient or relocate the receiving antenna. v Increase the separation between the equipment and receiver. v Connect the equipment into an outlet on a circuit different from that to which the receiver is connected. v Consult an IBM-authorized dealer or service representative for help. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for 254 Disk drives any radio or television interference caused by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate this equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class B digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class B Information Technology Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for typical residential environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 email: tjahn@de.ibm.com VCCI Statement - Japan Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Notices 255 Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) IBM Taiwan Contact Information Electromagnetic Interference (EMI) Statement - Korea Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B 256 Disk drives Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 email: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Terms and conditions Permissions for the use of these publications is granted subject to the following terms and conditions. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved. You may not distribute, display or make derivative works of these publications, or any portion thereof, without the express consent of the manufacturer. Commercial Use: You may reproduce, distribute and display these publications solely within your enterprise provided that all proprietary notices are preserved. You may not make derivative works of these publications, or reproduce, distribute or display these publications or any portion thereof outside your enterprise, without the express consent of the manufacturer. Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either express or implied, to the publications or any information, data, software or other intellectual property contained therein. The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer, the above instructions are not being properly followed. You may not download, export or re-export this information except in full compliance with all applicable laws and regulations, including all United States export laws and regulations. THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR PURPOSE. Armoires,dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension pour E1-700, E3-700, E2-700, E2-700T, E4-700 ou E4-700T REFERENCE 86 F1 29FG 01 ESCALA Power7 Armoires,dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension pour E1-700, E3-700, E2-700, E2-700T, E4-700 ou E4-700T Cette publication concerne les modèles suivants : - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) Les références au modèle Power 755 / 8236-E8C sont sans objet. Hardware Septembre 2010 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 F1 29FG 01 L’avis juridique de copyright ci–après place le présent document sous la protection des lois de Copyright qui prohibent, sans s’y limiter, des actions comme la copie, la distribution, la modification et la création de produits dérivés. Copyright Bull SAS 2007-2010 Imprimé en France Marques déposées Toutes les marques citées dans ce manuel sont la propriété de leurs détenteurs respectifs. Tous les noms de marques ainsi que les noms de produits matériels et/ou logiciels sont régis par le droit des marques et/ou des brevets. La citation des noms de marques et de produits est purement informative et ne constitue pas une violation du droit des marques et/ou des brevets. Des corrections ou des modifications au contenu de ce document peuvent intervenir sans préavis. Bull SAS ne pourra pas être tenu pour responsable des éventuelles erreurs qui pourraient être contenues dans ce manuel, ni pour tout dommage pouvant résulter de son application. Table des matières Avis aux lecteurs canadiens . . . . . . . . . . . . . . . . . . . . . . . . . . vii Consignes de sécurité . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension - Nouveautés . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Installation de l'armoire . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Installation de l'armoire 14T/00 ou 14T/42 . . . . . . . . . . . . . . . . . . . . . . . . . 5 Inventaire des composants. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Positionnement de l'armoire . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Mise à niveau de l'armoire. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 Fixation des équerres de stabilisation . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Fixation de l'armoire à un plancher en béton . . . . . . . . . . . . . . . . . . . . . . . 8 Fixation de l'armoire à un plancher en béton sous un faux plancher . . . . . . . . . . . . . . . 13 Raccordement du système de distribution de l'alimentation . . . . . . . . . . . . . . . . . . 18 Vérification des prises de courant alternatif. . . . . . . . . . . . . . . . . . . . . . . . 18 Fixation de la prise de courant alternatif avant ou arrière . . . . . . . . . . . . . . . . . . . 19 Installation des plaques de montage pour l'alimentation CA avec prises CA. . . . . . . . . . . . 19 Installation des plaques de montage pour l'alimentation CA sans prises CA. . . . . . . . . . . . 22 Connexion d'une source d'alimentation CC. . . . . . . . . . . . . . . . . . . . . . . . 23 Installation en armoire d'une unité centrale ou d'une unité d'extension . . . . . . . . 29 Installation du modèle 02E/4B ou 05F/6B en armoire . . . . . . . . . . . . . . . . . . . . . 29 Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 Fixation du support de montage 02E/4B ou 05F/6B dans l'armoire. . . . . . . . . . . . . . . . 32 Installation du bras de routage des câbles . . . . . . . . . . . . . . . . . . . . . . . . 35 Raccordement de câbles d'alimentation au système . . . . . . . . . . . . . . . . . . . . . 38 Installation du modèle 31E/2B en armoire . . . . . . . . . . . . . . . . . . . . . . . . . 38 Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Fixation du matériel de montage 31E/2B à l'armoire. . . . . . . . . . . . . . . . . . . . . 42 Installation du bras de routage des câbles . . . . . . . . . . . . . . . . . . . . . . . . 44 Raccordement de câbles d'alimentation au système . . . . . . . . . . . . . . . . . . . . . 47 Installation du modèle 33E/8B ou 36E/8C en armoire . . . . . . . . . . . . . . . . . . . . . 48 Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 Fixation du support de montage 33E/8B ou 36E/8C dans l'armoire . . . . . . . . . . . . . . . 52 Installation du bras de routage des câbles . . . . . . . . . . . . . . . . . . . . . . . . 59 Raccordement de câbles d'alimentation au système . . . . . . . . . . . . . . . . . . . . . 60 Installation en armoire du modèle 58/02 ou 58/77 . . . . . . . . . . . . . . . . . . . . . . 61 Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 Fixation du support de montage dans l'armoire . . . . . . . . . . . . . . . . . . . . . . 64 Installation du modèle 58/02 ou 58/77 sur les glissières . . . . . . . . . . . . . . . . . . . 67 Installation du modèle 58/86 en armoire . . . . . . . . . . . . . . . . . . . . . . . . . 70 Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 Fixation du support de montage dans l'armoire . . . . . . . . . . . . . . . . . . . . . 73 Installation du modèle 58/86 sur les glissières. . . . . . . . . . . . . . . . . . . . . . 75 iii Installation en armoire de l'unité d'extension ou 14G/30 . . . . . . . . . . . . . . . . . . . . 78 Mise hors tension du système et coupure de l'alimentation . . . . . . . . . . . . . . . . . . 78 Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 Mise en place des glissières extensibles et du support d'extension dans l'armoire . . . . . . . . . . . 80 Installation du boîtier . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 Installation de l'unité d'extension . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 Fixation des câbles d'alimentation et du réseau de contrôle de l'alimentation système (SPCN) . . . . . . . 85 Mise sous tension du système . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85 Installation en armoire du modèle 57/86 ou 31D/24. . . . . . . . . . . . . . . . . . . . . . 85 Inventaire des composants . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86 Détermination de l'emplacement . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Marquage de l'emplacement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88 Fixation du support de montage dans l'armoire . . . . . . . . . . . . . . . . . . . . . . 89 Installation du modèle 57/86 ou 31D/24 sur les glissières. . . . . . . . . . . . . . . . . . . 91 Fixation du bras de routage des câbles aux glissières standard . . . . . . . . . . . . . . . . . 94 Retrait et remise en place de panneaux latéraux 14T/00 ou 14T/42 . . . . . . . . . . 97 Remise en place d'un panneau latéral 14T/00 ou 14T/42 . . . . . . . . . . . . . . . . . . . . 98 Retrait et remise en place de panneaux d'habillage 14T/00 ou 14T/42. . . . . . . . . 99 Remise en place de panneaux d'habillage 14T/00 ou 14T/42 . . . . . . . . . . . . . . . . . . . 99 Fixation des volets de l'armoire . . . . . . . . . . . . . . . . . . . . . . . . 101 Fixation d'un volet avant de type High-perforation. . . . . . . . . . . . . . . . . . . . . . 101 Consignes de sécurité relatives aux armoires . . . . . . . . . . . . . . . . . . . . . . . . 102 Installation du kit de sécurité de l'armoire . . . . . . . . . . . . . . . . . . . . 105 Kit de renforcement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109 Libération de l'équerre renforcée . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 Libération du panneau latéral à l'aide d'un kit de renforcement. . . . . . . . . . . . . . . . . . 110 Connexion de plusieurs armoires à l'aide d'un kit de connexion multiarmoire . . . . 113 Alarme de statut d'armoire. . . . . . . . . . . . . . . . . . . . . . . . . . . 115 Installation de l'alarme de statut d'armoire . . . . . . . . . . . . . . . . . . . . . . . . 115 Connexion de plusieurs boîtes de jonction. . . . . . . . . . . . . . . . . . . . . . . . 117 Installation ou retrait d'un taquet de verrouillage d'unité centrale montée en armoire 119 Retrait et remise en place du capot supérieur sur l'armoire . . . . . . . . . . . . 121 Retrait du capot supérieur de l'armoire. . . . . . . . . . . . . . . . . . . . . . . . . . 121 Remise en place du capot supérieur sur l'armoire . . . . . . . . . . . . . . . . . . . . . . 123 Unité d'alimentation plus . . . . . . . . . . . . . . . . . . . . . . . . . . . 127 Installation de l'unité PDU+ sur le côté d'une armoire. . . . . . . . . . . . . . . . . . . . . 127 Installation horizontale en armoire de la PDU+ . . . . . . . . . . . . . . . . . . . . . . 129 Configuration du contrôle de l'alimentation via l'unité PDU+ . . . . . . . . . . . . . . . . . . 133 Utilisation de l'utilitaire de configuration . . . . . . . . . . . . . . . . . . . . . . . . 133 Connexion de la console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134 Utilisation d'HyperTerminal . . . . . . . . . . . . . . . . . . . . . . . . . . . 134 Options du menu de l'utilitaire de configuration . . . . . . . . . . . . . . . . . . . . 134 Configuration des adresses IP. . . . . . . . . . . . . . . . . . . . . . . . . . . 135 Utilisation de l'interface Web . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135 Démarrage de l'interface Web . . . . . . . . . . . . . . . . . . . . . . . . . . . 135 Modification des paramètres de base . . . . . . . . . . . . . . . . . . . . . . . . 136 Modification du nom de superutilisateur et du mot de passe . . . . . . . . . . . . . . . 136 Identification de la PDU+ et de la carte Web/SNMP . . . . . . . . . . . . . . . . . . 136 iv Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Ajout d'utilisateurs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 Modification de la date et de l'heure. . . . . . . . . . . . . . . . . . . . . . . . 137 Modification des alertes événement . . . . . . . . . . . . . . . . . . . . . . . . 137 Modification des informations réseau . . . . . . . . . . . . . . . . . . . . . . . . 138 Modification de la configuration de réseau . . . . . . . . . . . . . . . . . . . . . 138 Récapitulatifs du journal de l'historique et des événements . . . . . . . . . . . . . . . . . 138 Affichage du journal de l'historique . . . . . . . . . . . . . . . . . . . . . . . . 138 Affichage du journal des événements . . . . . . . . . . . . . . . . . . . . . . . 139 Retrait et remplacement du capot ou du volet de l'unité d'extension . . . . . . . . 141 Retrait du capot avant des modèles 14G/30 . . . . . . . . . . . . . . . . . . . . . . . . 141 Installation du capot avant des modèles 14G/30. . . . . . . . . . . . . . . . . . . . . . . 142 Retrait et installation des carters sur le système 33E/8B ou 36E/8C . . . . . . . . . 143 Retrait du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . . 143 Installation du capot d'accès des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . 144 Retrait du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . . . 145 Installation du carter avant des modèles 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . . 146 Retrait et remise en place du capot avant sur les modèles 17M/MB et 79M/HB . . . . . . . . . . . . 147 Retrait du carter avant d'un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . . . . . 147 Installation du carter avant sur un modèle 17M/MB ou 79M/HB . . . . . . . . . . . . . . . . . 148 Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151 Mise en position de maintenance de l'unité centrale ou de l'unité d'extension . . . . . . . . . . . . . 151 Mise en position fonctionnement de l'unité centrale ou de l'unité d'extension . . . . . . . . . . . . . 155 Mise en position de maintenance de l'unité 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . . 157 Mise en position de fonctionnement de l'unité 33E/8B ou 36E/8C. . . . . . . . . . . . . . . . . 160 Remarques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163 Marques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164 Bruits radioélectriques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164 Remarques sur la classe A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164 Remarques sur la classe B . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168 Dispositions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170 Table des matières v vi Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Avis aux lecteurs canadiens Le présent document a été traduit en France. Voici les principales différences et particularités dont vous devez tenir compte. Illustrations Les illustrations sont fournies à titre d'exemple. Certaines peuvent contenir des données propres à la France. Terminologie La terminologie des titres IBM peut différer d'un pays à l'autre. Reportez-vous au tableau ci-dessous, au besoin. IBM France IBM Canada ingénieur commercial représentant agence commerciale succursale ingénieur technico-commercial informaticien inspecteur technicien du matériel Claviers Les lettres sont disposées différemment : le clavier français est de type AZERTY, et le clavier français-canadien, de type QWERTY. OS/2 - Paramètres canadiens Au Canada, on utilise : v les pages de codes 850 (multilingue) et 863 (français-canadien), v le code pays 002, v le code clavier CF. Nomenclature Les touches présentées dans le tableau d'équivalence suivant sont libellées différemment selon qu'il s'agit du clavier de la France, du clavier du Canada ou du clavier des États-Unis. Reportez-vous à ce tableau pour faire correspondre les touches françaises figurant dans le présent document aux touches de votre clavier. vii Recommandations à l'utilisateur Ce matériel utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio et télévision s'il n'est pas installé et utilisé conformément aux instructions du constructeur (instructions d'utilisation, manuels de référence et manuels d'entretien). Si cet équipement provoque des interférences dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. Il est possible de corriger cet état de fait par une ou plusieurs des mesures suivantes : v Réorienter l'antenne réceptrice ; v Déplacer l'équipement par rapport au récepteur ; v Éloigner l'équipement du récepteur ; v Brancher l'équipement sur une prise différente de celle du récepteur pour que ces unités fonctionnent sur des circuits distincts ; v S'assurer que les vis de fixation des cartes et des connecteurs ainsi que les fils de masse sont bien serrés ; v Vérifier la mise en place des obturateurs sur les connecteurs libres. Si vous utilisez des périphériques non IBM avec cet équipement, nous vous recommandons d'utiliser des câbles blindés mis à la terre, à travers des filtres si nécessaire. En cas de besoin, adressez-vous à votre détaillant. Le fabricant n'est pas responsable des interférences radio ou télévision qui pourraient se produire si des modifications non autorisées ont été effectuées sur l'équipement. viii Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension L'obligation de corriger de telles interférences incombe à l'utilisateur. Au besoin, l'utilisateur devrait consulter le détaillant ou un technicien qualifié pour obtenir de plus amples renseignements. Brevets Il est possible qu'IBM détienne des brevets ou qu'elle ait déposé des demandes de brevets portant sur certains sujets abordés dans ce document. Le fait qu'IBM vous fournisse le présent document ne signifie pas qu'elle vous accorde un permis d'utilisation de ces brevets. Vous pouvez envoyer, par écrit, vos demandes de renseignements relatives aux permis d'utilisation au directeur général des relations commerciales d'IBM, 3600 Steeles Avenue East, Markham, Ontario, L3R 9Z7. Assistance téléphonique Si vous avez besoin d'assistance ou si vous voulez commander du matériel, des logiciels et des publications IBM, contactez IBM direct au 1 800 465-1234. Avis aux lecteurs canadiens ix x Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Consignes de sécurité Les consignes de sécurité peuvent être imprimées tout au long de ce guide. v DANGER - Consignes attirant votre attention sur un risque de blessures graves, voire mortelles. v ATTENTION - Consignes attirant votre attention sur un risque de blessures graves, en raison de certaines circonstances réunies. v Avertissement - Consignes attirant votre attention sur un risque de dommages sur un programme, une unité, un système ou des données. Consignes de sécurité relatives au commerce international Plusieurs pays nécessitent la présentation des consignes de sécurité indiquées dans les publications du produit dans leur langue nationale. Si votre pays en fait partie, un livret de consignes de sécurité est inclus dans l'ensemble des publications livré avec le produit. Ce livret contient les consignes de sécurité dans votre langue en faisant référence à la source en anglais (Etats-Unis). Avant d'utiliser une publication en version originale anglaise pour installer, faire fonctionner ou dépanner ce produit, vous devez vous familiariser avec les consignes de sécurité figurant dans ce livret. Vous devez également consulter ce livret chaque fois que les consignes de sécurité des publications en anglais (Etats-Unis) ne sont pas assez claires pour vous. Informations sur les appareils à laser Les serveurs peuvent comprendre des cartes d'E-S ou des composants à fibres optiques, utilisant des lasers ou des diodes électroluminescentes (LED). Conformité aux normes relatives aux appareils à laser Les serveurs peuvent être installés à l'intérieur ou à l'extérieur d'une armoire d'équipement informatique. xi DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) DANGER xii Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Consignes de sécurité xiii ATTENTION : Le retrait des composants des parties supérieures de l'armoire améliore sa stabilité au cours du déplacement. Pour déplacer une armoire remplie de composants dans une pièce ou dans un bâtiment, procédez comme suit : v Pour réduire le poids de l'armoire, retirez les équipements, à commencer par celui situé en haut. Si possible, restaurez la configuration d'origine de l'armoire. Si vous ne connaissez pas cette configuration, procédez comme suit : – Retirez toutes les unités de la position 32U et plus. – Assurez-vous que les unités les plus lourdes sont installées dans la partie inférieure de l'armoire. – Assurez-vous qu'il ne reste aucun niveau U vide entre les unités installées dans l'armoire, en dessous du niveau 32U. v Si l'armoire déplacée fait partie d'un groupe d'armoires, séparez-la de ce dernier. v Vérifiez l'itinéraire envisagé pour éliminer tout risque. v Vérifiez que l'armoire une fois chargée n'est pas trop lourde pour l'itinéraire choisi. Pour plus d'informations sur le poids d'une armoire chargée, consultez la documentation fournie avec votre armoire. v Vérifiez que toutes les ouvertures mesurent au moins 760 x 230 mm. v Vérifiez que toutes les unités, toutes les étagères, tous les tiroirs, toutes les portes et tous les câbles sont bien fixés. v Vérifiez que les vérins de mise à niveau sont à leur position la plus haute. v Vérifiez qu'aucune équerre de stabilisation n'est installée sur l'armoire pendant le déplacement. v N'utilisez pas de rampe inclinée à plus de dix degrés. v Dès que l'armoire est à son nouvel emplacement, procédez comme suit : – Abaissez les quatre vérins de mise à niveau. – Installez des équerres de stabilisation sur l'armoire. – Si vous avez retiré des unités de l'armoire, remettez-les à leur place, en remontant de la partie inférieure à la partie supérieure de l'armoire. v Si un déplacement important est nécessaire, restaurez la configuration d'origine de l'armoire. Mettez l'armoire dans son emballage d'origine ou dans un autre emballage équivalent. De plus, abaissez les vérins de mise à niveau pour que les roulettes ne soient plus au contact de la palette et fixez l'armoire à celle-ci. (R002) (L001) (L002) xiv Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension (L003) ou Aux Etats-Unis, tous les appareils à laser sont certifiés conformes aux normes indiquées dans le sous-chapitre J du DHHS 21 CFR relatif aux produits à laser de classe 1. Dans les autres pays, ils sont certifiés être des produits à laser de classe 1 conformes aux normes CEI 60825. Consultez les étiquettes sur chaque pièce du laser pour les numéros d'accréditation et les informations de conformité. ATTENTION : Ce produit peut contenir des produits à laser de classe 1 : lecteur de CD-ROM, DVD-ROM, DVD-RAM ou module à laser. Notez les informations suivantes : v Ne retirez pas les carters. En ouvrant le produit à laser, vous vous exposez au rayonnement dangereux du laser. Vous ne pouvez effectuer aucune opération de maintenance à l'intérieur. v Pour éviter tout risque d'exposition au rayon laser, respectez les consignes de réglage et d'utilisation des commandes, ainsi que les procédures décrites dans le présent manuel. (C026) Consignes de sécurité xv ATTENTION : Les installations informatiques peuvent comprendre des modules à laser fonctionnant à des niveaux de rayonnement excédant les limites de la classe 1. Il est donc recommandé de ne jamais examiner à l'oeil nu la section d'un cordon optique ni une prise de fibres optiques ouverte. (C027) ATTENTION : Ce produit contient un laser de classe 1M. Ne l'observez pas à l'aide d'instruments optiques. (C028) ATTENTION : Certains produits à laser contiennent une diode à laser intégrée de classe 3A ou 3B. Prenez connaissance des informations suivantes. Rayonnement laser lorsque le carter est ouvert. Evitez toute exposition directe au rayon laser. Evitez de regarder fixement le faisceau ou de l'observer à l'aide d'instruments optiques. (C030) Informations sur l'alimentation électrique et sur le câblage relatives au document GR-1089-CORE du NEBS (Network Equipment-Building System) Les commentaires suivants s'appliquent aux serveurs qui ont été déclarés conformes au document GR-1089-CORE du NEBS (Network Equipment-Building System) : Cet équipement peut être installé : v dans des infrastructures de télécommunications réseau v aux endroits préconisés dans les directives NEC (National Electrical Code). Les ports de ce matériel qui se trouvent à l'intérieur du bâtiment peuvent être connectés à des câbles internes ou non exposés uniquement. Ils ne doivent pas être connectés par leur partie métallique aux interfaces connectées au réseau extérieur ou à son câblage. Ces interfaces sont conçues pour être exclusivement utilisées à l'intérieur d'un bâtiment (ports de type 2 ou 4 décrits dans le document GR-1089-CORE) ; elles doivent être isolées du câblage à découvert du réseau extérieur. L'ajout de dispositifs de protection primaires n'est pas suffisant pour pouvoir connecter ces interfaces par leur partie métallique au câblage du réseau extérieur. Remarque : Tous les câbles Ethernet doivent être blindés et mis à la terre aux deux extrémités. Dans le cas d'un système alimenté en courant alternatif, il n'est pas nécessaire d'installer un dispositif externe de protection contre les surtensions (SPD). Un système alimenté en courant continu fait appel à un dispositif de retour du continu (DC-I). La borne de retour de la batterie en courant continu ne doit pas être connectée à la masse. xvi Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Informations sur les procédures d'installation en armoire. Procédures détaillées concernant l'installation de dispositifs, d'unités centrales et d'extension dans une armoire. Vous pouvez exécuter ces tâches vous-même ou faire appel à un fournisseur de services. Il vous imputera probablement les frais correspondants. 1 2 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension - Nouveautés Prenez connaissance des nouveautés et modifications significatives apportées à la rubrique Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension depuis la dernière mise à jour de cette collection. Septembre 2010 Mises à jour du contenu : v Procédures supplémentaire pour «Installation du modèle 02E/4B ou 05F/6B en armoire», à la page 29. v Procédures supplémentaire pour «Installation du modèle 31E/2B en armoire», à la page 38. v Procédures supplémentaire pour «Retrait et remise en place de panneaux d'habillage 14T/00 ou 14T/42», à la page 99. 3 4 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Installation de l'armoire Cette section décrit l'ensemble des tâches requises pour l'installation d'armoires (modèle 7014-T00 ou 7014-T42), depuis l'inventaire des composants jusqu'au branchement sur une source d'alimentation. Pourquoi et quand exécuter cette tâche Installation de l'armoire 14T/00 ou 14T/42 Cette procédure permet d'installer les armoires de type 14T/00 et 14T/42. Pourquoi et quand exécuter cette tâche Pour installer le kit de sécurité de l'armoire, voir «Installation du kit de sécurité de l'armoire», à la page 105 après l'installation de l'armoire. Avant d'installer une armoire, voir «Consignes de sécurité relatives aux armoires», à la page 102. Inventaire des composants Il est préférable d'effectuer un inventaire des composants avant de commencer l'installation de l'armoire. Cette section décrit la procédure à suivre. Pourquoi et quand exécuter cette tâche Si vous ne l'avez pas encore fait, effectuez un inventaire des pièces avant d'installer l'unité centrale dans l'armoire : Procédure 1. Localisez le rapport des kits dans une boîte d'accessoires. 2. Vérifiez que vous avez reçu tous les éléments commandés et toutes les pièces indiquées dans le kit. Résultats En cas de composants manquants, endommagés ou ne correspondant pas à la commande, vous disposez des contacts suivants : v Revendeur Positionnement de l'armoire La conformité aux normes de réglementation et de sécurité exige un positionnement correct de l'armoire. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Une fois l'armoire installée à sa place sur le sol, bloquez chaque roulette en serrant la vis de verrouillage. Pour connaître l'emplacement de la vis de verrouillage (2), voir la figure suivante. Retirez toutes les bandes de protection et les emballages de l'armoire. 5 1 Roulette 2 Vis de verrouillage Pour passer à l'étape suivante, procédez comme suit : v Pour fixer l'armoire sur un plancher en béton, voir «Fixation de l'armoire à un plancher en béton», à la page 8. v Pour fixer l'armoire sur un plancher en béton recouvert d'un faux plancher, voir «Fixation de l'armoire à un plancher en béton sous un faux plancher», à la page 13. v Pour fixer l'armoire au plancher, voir «Mise à niveau de l'armoire». Mise à niveau de l'armoire S'il est nécessaire de niveler l'armoire, procédez comme décrit dans cette section. Pourquoi et quand exécuter cette tâche Pour niveler l'armoire, procédez comme suit : Procédure 1. Desserrez le contre-écrou sur chaque pied de nivellement. 2. Faites pivoter chaque pied de nivellement vers le bas jusqu'à ce qu'il soit en contact avec la surface sur laquelle est placée l'armoire. 3. Réglez les pieds de nivellement vers le bas jusqu'à ce que l'armoire soit à niveau. Une fois l'armoire à niveau, serrez les contre-écrous sur la base. Figure 1. Serrage de la vis de verrouillage 6 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 1 Avant de l'armoire (base) 2 Pied de nivellement (quantité 4) 3 Contre-écrou (x4) Résultats Fixation des équerres de stabilisation Il se peut que vous deviez fixer les équerres de stabilisation à l'armoire. Cette section vous aide à déterminer si des équerres de stabilisation sont nécessaires et décrit comment les fixer. Pourquoi et quand exécuter cette tâche Si vous devez installer des prises électriques en courant alternatif avant et arrière dans l'armoire, vous ne pouvez pas fixer d'équerre de stabilisation. Vous devez fixer l'armoire au sol. Vous utilisez des équerres de stabilisation uniquement lorsque vous ne fixez pas l'armoire au plancher. Pour fixer l'armoire au plancher, voir «Fixation de l'armoire à un plancher en béton», à la page 8. Pour fixer les équerres de stabilisation au bas de l'armoire, procédez comme suit : Remarque : Avant d'installer les équerres de stabilisation, consultez «Fixation de la prise de courant alternatif avant ou arrière», à la page 19 pour obtenir des instructions sur l'installation des plaques de montage des prises de courant alternatif. Procédure 1. Alignez les emplacements de l'une des équerres de stabilisation sur les trous de vis, au bas de l'avant de l'armoire. 2. Installez les deux vis de fixation. 3. Vérifiez que la base de l'équerre de stabilisation repose fermement sur le sol. A l'aide de la clé Allen fournie avec l'armoire, serrez les vis de fixation, en passant de l'une à l'autre jusqu'à vissage complet. Figure 2. Réglage des pieds de nivellement Installation de l'armoire 7 1 Avant de l'armoire (base) 3 Equerre de stabilisation 2 Vis de fixation du stabilisateur 4 Arrière de l'armoire (base) 4. Pour installer la seconde équerre de stabilisation à l'arrière de l'armoire, répétez les étapes 1 à 3. Résultats Fixation de l'armoire à un plancher en béton Si vous devez installer des prises électriques en courant alternatif avant et arrière dans l'armoire, vous devez fixer l'armoire au sol. Cette section explique comment effectuer cette tâche sur un plancher en béton. Pourquoi et quand exécuter cette tâche Faites appel à un entrepreneur en mécanique pour fixer les plaques de montage de l'armoire au plancher en béton. Celui-ci doit déterminer que le matériel utilisé pour fixer les plaques de montage de l'armoire au plancher en béton respecte les exigences de l'installation. Pour fixer l'armoire à un plancher en béton, procédez comme suit : Procédure 1. Installez l'armoire sur son emplacement prédéfini et serrez les vis de verrouillage sur les roulettes. 2. S'ils sont en place, retirez les panneaux d'habillage haut, gauche et droit. Les panneaux d'habillage sont maintenus en place par des pinces de fixation Reportez-vous à l'illustration suivante Figure 3. Fixation des équerres de stabilisation 8 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 1 Boîtier d'armoire 4 Panneau d'habillage droit 2 Panneau d'habillage haut 5 Pince de fixation 3 Panneau d'habillage gauche 3. S'ils sont en place, retirez les volets avant et arrière. Une fois les volets de l'armoire retirés, passez à l'étape suivante. Pour enlever un volet d'armoire : a. Déverrouillez et ouvrez le volet. b. Saisissez le volet fermement des deux mains et extrayez-le de la charnière en le tirant vers l'extérieur. 4. Localisez le kit de montage du matériel et les deux plaques de montage. Lors de l'inventaire du contenu du kit de montage du matériel, voir la figure suivante. Le kit de montage du matériel contient les éléments suivants : v 4 boulons de montage d'armoire v 4 rondelles plates v 8 bagues d'isolation en plastique v 4 rondelles épaisses v 4 entretoises 5. Si vous installez une armoire alimentée en courant alternatif, placez provisoirement les bagues d'isolation en plastique inférieures afin de repérer les emplacements de montage de l'équerre de Figure 4. Retrait des panneaux d'habillage Installation de l'armoire 9 stabilisation. Une fois l'équerre de stabilisation correctement mise en place, retirez les bagues d'isolation en plastique inférieures. 1 Boîtier d'armoire 7 Contre-écrou 2 Boulon de montage d'armoire 8 Pied de nivellement 3 Rondelle plate 9 Bague d'isolation en plastique inférieure (utilisée uniquement sur les systèmes alimentés en courant continu) 4 Bague d'isolation en plastique supérieure CA Installation standard des pieds de nivellement sur une armoire alimentée en courant alternatif 5 Rondelle épaisse CC Installation standard des pieds de nivellement sur une armoire alimentée en courant continu 6 Entretoise 6. Placez les deux plaques de montage dans la position de montage approximative, sous l'armoire. 7. Créez un assemblage de boulon de montage d'armoire en ajoutant les pièces suivantes, dans l'ordre indiqué, à chaque boulon de montage d'armoire : a. Rondelle plate b. Bague d'isolation en plastique supérieure c. Rondelle épaisse d. Entretoise 8. Insérez un assemblage de boulon de montage d'armoire sur chaque pied de nivellement. 9. Repositionnez les plaques de montage d'armoire sous les quatre boulons de montage d'armoire afin que ces derniers soient centrés directement sur les trous taraudés. Figure 5. Installation de plaques de montages CA 10 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 10. Exercez quatre tours complets sur les boulons de montage d'armoire dans les trous taraudés de la plaque de montage. 1 Boulon de montage d'armoire 7 Pied de nivellement 2 Rondelle plate 8 Bague d'isolation en plastique inférieure (utilisée uniquement sur les systèmes alimentés en courant continu) 3 Bague d'isolation en plastique supérieure 9 Plaque de montage 4 Rondelle épaisse 10 Trou taraudé (permet de fixer l'armoire à l'équerre de stabilisation) 5 Entretoise 11 Trou de boulon d'ancrage 6 Contre-écrou 12 Schéma tracé (schéma à tracer sur le sol, en utilisant l'équerre de stabilisation comme gabarit) 11. Faites des repères sur le sol reproduisant les contours des deux équerres de stabilisation. 12. Marquez les trous de boulon inférieurs de la plaque qui sont accessibles par l'ouverture, à l'arrière de l'armoire. 13. Retirez les assemblages de boulon de montage d'armoire. 14. Si vous installez une armoire alimentée en courant alternatif, retirez la bague d'isolation inférieure sur chaque pied de nivellement. 15. Retirez les équerres de stabilisation des emplacements marqués. 16. Desserrez les vis de verrouillage sur les roulettes. 17. Eloignez l'armoire des deux zones identifiées au sol pour les emplacements des équerres de stabilisation. 18. Repositionnez les équerres de stabilisation sur les zones marquées. Figure 6. Fixation de l'armoire au plancher Installation de l'armoire 11 19. Faites un repère au sol, au centre de tous les trous des deux équerres de stabilisation. 20. Retirez les deux plaques de montage d'armoire des emplacements repérés. 21. Sur les repères des trous de boulon de montage d'armoire taraudés, percez quatre trous de passage dans le plancher en béton. Chacun d'eux doit avoir environ 2,5 cm de profondeur. Cette profondeur permet aux boulons de montage de traverser l'épaisseur des équerres de stabilisation. Remarque : Vous devez utiliser au minimum deux boulons d'ancrage sur chaque plaque de montage pour fixer la plaque au plancher en béton. Il est possible que certains trous d'une plaque de montage s'alignent sur des armatures en béton du plancher ; ils peuvent donc ne pas être utilisables. 22. Sélectionnez au moins deux emplacements de trous appropriés pour chaque boulon d'équerre de stabilisation. Les emplacements sélectionnés doivent être aussi proches que possible des trous de boulon taraudés. Vérifiez que les trous choisis à l'arrière de l'armoire sont accessibles. Percez des trous sur les emplacements sélectionnés du plancher en béton. 23. Positionnez les équerres de stabilisation sur les ancrages en béton. 24. Serrez les boulons de l'équerre de stabilisation avant sur le plancher en béton. 25. Positionnez l'équerre de stabilisation sur les ancrages en béton. 26. Serrez les boulons de l'équerre de stabilisation arrière sur le plancher en béton. Remarque : La taille des boulons d'ancrage et des ancrages en béton doit être déterminée par l'entrepreneur en mécanique qui effectue l'installation des plaques de montage. 27. Positionnez l'armoire sur l'équerre de stabilisation. 28. Insérez chaque boulon d'équerre de stabilisation dans une rondelle plate, une bague d'isolation en plastique et une rondelle épaisse et faites passer le tout par un pied de nivellement. 29. Alignez les quatre boulons d'équerre de stabilisation sur les quatre trous taraudés des deux plaques de montage, puis effectuez trois à quatre tours. 30. Serrez les vis de verrouillage sur chaque roulette. 31. Réglez les pieds de nivellement vers le bas jusqu'à ce que l'armoire soit à niveau. Une fois l'armoire à niveau, serrez les contre-écrous sur la base de l'armoire. 1 Avant de l'armoire (base) 2 Pied de nivellement (x4) 3 Contre-écrou (quantité 4) 12 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 32. Si plusieurs armoires sont reliées en série (fixées l'une à l'autre), voir «Connexion de plusieurs armoires à l'aide d'un kit de connexion multiarmoire», à la page 113. Sinon, vissez les quatre boulons à 54-67 nm. 33. Si vous n'installez pas de volets sur l'armoire, replacez les panneaux d'habillage haut, gauche et droit. 34. Connectez le système de distribution de l'alimentation conformément à la description (voir «Raccordement du système de distribution de l'alimentation», à la page 18). 35. Lorsque toutes les armoires sont fixées au plancher, passez à la section «Fixation de la prise de courant alternatif avant ou arrière», à la page 19. 36. Si vous ne fixez pas de prise électrique avant et installez des volets d'armoire, voir «Fixation des volets de l'armoire», à la page 101. Résultats Fixation de l'armoire à un plancher en béton sous un faux plancher Si vous prévoyez d'installer des prises électriques en courant alternatif avant et arrière dans l'armoire, vous devez fixer l'armoire au sol. Lorsque vous fixez l'armoire à un plancher en béton recouvert d'un faux plancher, procédez comme il est expliqué dans cette section. Pourquoi et quand exécuter cette tâche Faites appel à un entrepreneur en mécanique pour fixer les plaques de montage de l'armoire au plancher en béton. Il doit déterminer que le matériel utilisé pour fixer les plaques de montage de l'armoire au plancher en béton respecte les exigences de l'installation. Pour fixer l'armoire à un plancher en béton recouvert d'un faux plancher, procédez comme suit : Procédure 1. Installez l'armoire sur son emplacement prédéfini et serrez les vis de verrouillage sur les roulettes. 2. Le cas échéant, retirez les panneaux d'habillage haut, gauche et droit. Les panneaux d'habillage sont maintenus en place par des pinces de fixation. Reportez-vous à l'illustration suivante Figure 7. Réglage des pieds de nivellement Installation de l'armoire 13 1 Boîtier d'armoire 4 Panneau d'habillage droit 2 Panneau d'habillage haut 5 Pince de fixation 3 Panneau d'habillage gauche 3. Le cas échéant, retirez les volets avant et arrière. Pour retirer un volet d'armoire, voir . Une fois les volets de l'armoire retirés, passez à l'étape suivante. 4. Localisez le kit de montage du matériel et les deux plaques de montage. Lors de l'inventaire du contenu du kit de montage du matériel, voir la figure suivante. Le kit de montage du matériel contient les éléments suivants : v Quatre boulons de montage d'armoire v Quatre rondelles plates v Huit bagues d'isolation en plastique v Quatre rondelles épaisses v Quatre entretoises 5. Si vous installez une armoire alimentée en courant alternatif, placez provisoirement les bagues d'isolation en plastique inférieures afin de mettre en place la plaque de montage de l'armoire. Une fois la plaque de montage correctement mise en place, retirez les bagues d'isolation en plastique inférieures. Figure 8. Retrait des panneaux d'habillage 14 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 1 Boîtier d'armoire 7 Contre-écrou 2 Boulon de montage d'armoire 8 Pied de nivellement 3 Rondelle plate 9 Bague d'isolation en plastique inférieure (utilisée uniquement sur les systèmes alimentés en courant continu) 4 Bague d'isolation en plastique supérieure CA Installation standard des pieds de nivellement sur une armoire alimentée en courant alternatif 5 Rondelle épaisse CC Installation standard des pieds de nivellement sur une armoire alimentée en courant continu 6 Entretoise 6. Placez les deux plaques de montage dans la position de montage approximative, sous l'armoire. 7. Créez un assemblage de boulon de montage d'armoire en ajoutant les pièces suivantes, dans l'ordre indiqué, à chaque boulon de montage d'armoire : a. Rondelle plate b. Bague d'isolation en plastique supérieure c. Rondelle épaisse d. Entretoise 8. Insérez un assemblage de boulon de montage d'armoire sur chaque pied de nivellement. 9. Repositionnez les plaques de montage d'armoire sous les quatre boulons de montage d'armoire afin que ces derniers soient centrés directement sur les trous taraudés. 10. Exercez quatre tours complets sur les boulons de montage d'armoire dans les trous taraudés de la plaque de montage. 11. Faites des repères sur le faux plancher autour des plaques de montage avant et arrière. Figure 9. Installation des plaques de montage de l'unité d'alimentation Installation de l'armoire 15 12. Marquez les trous de boulon inférieurs de la plaque qui sont accessibles par l'ouverture, à l'arrière de l'armoire. 13. Retirez les assemblages de boulon de montage d'armoire. 14. Si vous installez une armoire alimentée en courant alternatif, retirez la bague d'isolation inférieure sur chaque pied de nivellement. 15. Retirez les plaques de montage d'armoire des emplacements repérés. 16. Desserrez les vis de verrouillage sur les roulettes. 17. Eloignez l'armoire des deux zones identifiées au sol pour les emplacements des plaques de montage. 18. Repositionnez les plaques de montage sur les zones repérées. 19. Faites des repères sur le faux plancher au centre de chaque trou des plaques de montage (y compris des trous taraudés). 20. Retirez les deux plaques de montage d'armoire des emplacements repérés sur le faux plancher. 21. Percez deux trous de passage à chaque extrémité de chaque plaque de montage. Chacun d'eux doit avoir environ 2,5 cm de profondeur. Cette profondeur permettra le passage des boulons de montage dans la plaque de montage lors de la fixation de cette dernière à l'armoire. 22. Sélectionnez au moins deux emplacements de trous appropriés pour chaque plaque de montage. Sélectionnez les emplacements des trous aussi proches que possible des trous taraudés. Vérifiez que les emplacements des trous choisis à l'arrière de l'armoire sont accessibles. 23. Percez des trous de passage dans le faux plancher. Ces trous permettent d'insérer les boulons d'ancrage dans la plaque de montage et de traverser le faux plancher jusqu'au plancher en béton. Remarque : Vous devez utiliser au minimum deux boulons d'ancrage sur chaque plaque de montage pour fixer la plaque au plancher en béton via le faux plancher. Il est possible que certains trous d'une plaque de montage s'alignent sur des armatures en béton du plancher ; ils peuvent donc ne pas être utilisables. 24. Transférez les repères des trous des boulons d'ancrage (à l'exception de ceux des trous de passage percés pour les boulons de montage de l'armoire) du faux plancher au plancher en béton et marquez l'emplacement des trous sur le plancher en béton. 25. Percez des trous dans le plancher en béton pour fixer les boulons d'ancrage. 26. Remettez le faux plancher en position, au-dessus des trous des boulons d'ancrage. 27. Positionnez l'équerre de stabilisation avant sur la zone repérée du faux plancher. 28. A l'aide des boulons d'ancrage, fixez l'équerre de stabilisation avant sur le plancher en béton, en traversant le faux plancher. 29. Positionnez l'équerre de stabilisation arrière sur la zone repérée du faux plancher. 16 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 1 Boulon de montage d'armoire 7 Pied de nivellement 2 Rondelle plate 8 Bague d'isolation en plastique inférieure (utilisée uniquement sur les systèmes alimentés en courant continu) 3 Bague d'isolation en plastique supérieure 9 Equerres de stabilisation 4 Rondelle épaisse 10 Trou taraudé (permet de fixer l'armoire à la plaque de montage) 5 Entretoise 11 Trou de boulon d'ancrage 6 Contre-écrou 12 Schéma tracé (schéma à tracer sur le sol, en utilisant la plaque de montage comme gabarit) 30. A l'aide des boulons d'ancrage, fixez l'équerre de stabilisation arrière sur le plancher en béton, en traversant le faux plancher. 31. Replacez toutes les dalles de faux plancher qui ont été retirées lors de l'alignement et de la fixation des boulons d'ancrage sur le plancher en béton. 32. Alignez l'armoire sur les équerres de stabilisation avant et arrière. 33. Insérez chaque assemblage de boulon dans un pied de nivellement. 34. Alignez les boulons de montage de l'armoire sur les trous taraudés de chaque équerre de stabilisation. Appliquez trois ou quatre tours sur chaque boulon. 35. Serrez les vis de verrouillage sur chaque roulette. 36. Réglez les pieds de nivellement vers le bas jusqu'à ce que l'armoire soit à niveau. Une fois l'armoire à niveau, serrez les contre-écrous sur la base de l'armoire. Figure 10. Fixation de l'armoire au plancher Installation de l'armoire 17 37. Si plusieurs armoires sont reliées en série (fixées l'une à l'autre), voir «Connexion de plusieurs armoires à l'aide d'un kit de connexion multiarmoire», à la page 113. Sinon, vissez les quatre boulons à 54-67 nm. 38. Si vous n'installez pas de volets sur l'armoire, replacez les panneaux d'habillage haut, gauche et droit. 39. Connectez le système de distribution de l'alimentation conformément à la description (voir «Raccordement du système de distribution de l'alimentation»). 40. Une fois l'armoire fixée au sol, passez à la section «Fixation de la prise de courant alternatif avant ou arrière», à la page 19 pour fixer une prise électrique avant. 41. Si vous ne fixez pas de prise électrique avant et installez des volets d'armoire, voir «Fixation des volets de l'armoire», à la page 101. Résultats Raccordement du système de distribution de l'alimentation Vous pouvez utiliser un système de distribution de l'alimentation pour contrôler les charges électriques des unités qui y sont branchés. La procédure de cette section permet de connecter ce système. Pourquoi et quand exécuter cette tâche Pour raccorder une unité d'alimentation, voir «Unité d'alimentation plus», à la page 127. Vérification des prises de courant alternatif Pour garantir une opération sûre et fiable, vous devrez vérifier les prises de courant alternatif. La procédure de cette section permet d'effectuer cette tâche. Avant de commencer Avant de commencer, veillez à vous munir d'un multimètre pour vérifier la tension, et d'un testeur d'impédance du sol homologué pour tester les résistances de mise à la terre. Remarque : Utilisez uniquement un testeur d'impédance du sol homologué pour tester les résistances de mise à la terre. N'utilisez pas de multimètre pour mesurer la résistance de mise à la terre. Pourquoi et quand exécuter cette tâche Avant de brancher l'armoire sur l'unité d'alimentation en courant alternatif, contrôlez les points suivants sur l'unité d'alimentation en courant alternatif : Procédure 1. Mettez hors tension le disjoncteur de la prise d'alimentation en courant alternatif à laquelle va être raccordée l'armoire. Sur le commutateur du disjoncteur, placez l'étiquette S229-0237 demandant de ne pas manipuler le disjoncteur. Remarque : Lorsque vous faites les tests, le cache de la prise doit être installé dans sa position habituelle. 2. Certaines prises se trouvent dans des logements métalliques. Pour ce type de prise, procédez comme suit : a. A l'aide d'un multimètre, vérifiez qu'il y a une tension inférieure à 1 volt entre le boîtier de la prise et tout élément métallique du bâtiment relié à la terre, par exemple, la structure métallique d'un faux plancher, une tuyauterie, un élément métallique de la structure du bâtiment, ou une structure similaire. 18 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension b. A l'aide d'un multimètre, vérifiez qu'il y a une tension inférieure à 1 volt entre la borne terre de la prise et un point relié à la terre dans le bâtiment. Remarque : Si le boîtier ou le cache de la prise est peint, assurez-vous que la sonde de test traverse la peinture et entre en contact avec le métal. c. A l'aide d'un multimètre, testez la résistance entre la borne terre et le boîtier de la prise. Testez la résistance entre la borne terre de la prise et la mise à la terre du bâtiment. Vous devez obtenir un résultat inférieur à 1 ohm, ce qui indique la présence d'un conducteur de terre continu. 3. Si l'un des trois tests effectués à l'étape 2, à la page 18 n'est pas concluant, coupez le courant du circuit de dérivation et modifiez les branchements. Ensuite, testez à nouveau la prise, une fois les branchements corrigés. 4. A l'aide d'un testeur d'impédance du sol, testez la résistance infinie entre la borne terre de la prise et chaque borne de phase. Cela permet de détecter un court-circuit à la terre ou une inversion de branchement. 5. A l'aide d'un testeur d'impédance du sol, testez la résistance infinie entre les bornes de phase. Ce test permet de vérifier l'absence de court-circuit. 6. Remettez le disjoncteur en position sous tension. 7. A l'aide d'un multimètre, vérifiez que la tension entre les phases est correcte. S'il n'y a pas de tension au niveau du boîtier de la prise ou de la borne terre, vous pouvez toucher la prise en toute sécurité. 8. A l'aide d'un multimètre, vérifiez que la tension au niveau de la prise de courant alternatif est correcte. Résultats Fixation de la prise de courant alternatif avant ou arrière Utilisez cette procédure pour fixer une prise de courant alternatif. Avertissement : Les plaques de montage de prise de courant alternatif avant et arrière sont fixées sur les trous de montage permettant déjà de fixer les équerres de stabilisation au boîtier de l'armoire. Par conséquent, vous devez retirer les équerres si vous fixez l'armoire au sol. Installez les plaques de montage de la prise de courant alternatif uniquement une fois que l'armoire a été fixée au sol et que les équerres de stabilisation ont été retirées. Les éléments suivants sont installés sur le site du client : v Plaques de montage de la prise de courant alternatif pour l'installation des prises électriques en courant alternatif fournies par le client, à l'avant ou à l'arrière de l'armoire. La plaque de montage de la prise de courant alternatif constitue l'emplacement de montage d'une prise électrique ca. v Cosse de mise à la terre en laiton pour un raccordement de décharge d'électricité statique (ESD). Remarque : Il appartient au client de fournir les prises et les cordons d'alimentation pour le raccordement à l'unité d'alimentation. Il est également de sa responsabilité de brancher correctement la prise de courant alternatif. Ces composants ne font pas partie des unités remplaçables sur site (FRU). Installation des plaques de montage pour l'alimentation CA avec prises CA Si vous choisissez d'installer des plaques de montage, vous pouvez suivre la procédure détaillée dans cette section pour effectuer cette tâche. Cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche Si vous ne souhaitez pas installer de prises de courant alternatif sur l'armoire, voir «Installation des plaques de montage pour l'alimentation CA sans prises CA», à la page 22. Installation de l'armoire 19 Pour installer des prises de courant alternatif sur la plaque de montage de prise de courant alternatif avant ou arrière, procédez comme suit : Procédure 1. Déterminez le nombre de prises de courant alternatif que vous devez installer. 2. Vérifiez auprès de votre spécialiste que le nombre et l'emplacement des prises de courant alternatif à installer sont corrects. 3. Retirez les obturateurs des plaques de montage des prises de courant alternatif correspondant au nombre de prises de courant alternatif à installer. 4. Installez les prises de courant alternatif sur la plaque de montage de prise de courant alternatif. 5. Installez la cosse de mise à la terre sur la plaque de montage de prise de courant alternatif à l'aide d'un seul écrou (voir la figure suivante). 6. Serrez l'écrou sur la cosse de mise à la terre. 7. Localisez le conducteur de terre en fourni avec la plaque de montage. Remarque : Pour installer des prises de courant alternatif à l'avant ou à l'arrière de l'armoire, procédez comme suit. 8. Placez la rondelle de blocage sur la cosse de mise à la terre de la plaque de montage de prise de courant alternatif avant. 9. Placez la cosse située sur la plus longue extrémité du conducteur de terre sur la cosse de mise à la terre. 10. Placez un écrou sur la cosse de mise à la terre et vissez-le fermement. 11. Placez la plaque de montage de prise de courant alternatif avant sur le cadre de l'armoire et insérez complètement la cosse de mise à la terre dans les trous de montage de l'armoire. 12. Faites passer le câble sous l'armoire. 13. Placez la rondelle de blocage sur la cosse de mise à la terre de la plaque de montage de prise de courant alternatif arrière. 14. Placez la cosse située sur la plus courte extrémité du conducteur de terre sur la cosse de mise à la terre. 15. Placez un écrou sur la cosse de mise à la terre et vissez-le fermement. 20 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 1 Cosse de conducteur de terre 7 Cosse de mise à la terre 2 Rondelle de blocage 8 Conducteur de terre (extrémité courte du conducteur de terre) 3 Avant de l'armoire 9 Extrémité en Y du conducteur de terre 4 Cordon d'alimentation de l'unité d'alimentation 10 Cosse de mise à la terre (x2) 5 Plaque de montage 11 Extrémité longue du conducteur de terre 6 Extrémité longue du conducteur de terre 12 Cosse de mise à la terre (x2) 16. Placez la plaque de montage de prise de courant alternatif arrière sur le cadre de l'armoire et insérez complètement la cosse de mise à la terre dans les trous de montage de l'armoire. 17. Placez les vis de la plaque de montage de prise de courant alternatif avant (vis de fixation des stabilisateurs) dans la plaque de montage, en les insérant dans les trous de montage de l'armoire. Serrez les vis fermement. 1 Avant ou arrière de l'armoire (selon le cas) 4 Plaque de montage 2 Cordon d'alimentation de l'unité d'alimentation 5 Clé Allen 3 Vis à tête demi-ronde 6 Extrémité longue du conducteur de terre 18. Raccordez l'extrémité en Y du conducteur de terre au cadre de l'armoire, soit au centre du panneau arrière de l'armoire, soit à la barre omnibus de mise à la terre, à l'arrière de l'armoire. 19. Placez les vis de la plaque de montage de prise de courant alternatif arrière (vis de fixation des stabilisateurs) dans la plaque de montage, en les insérant dans les trous de montage de l'armoire. Serrez les vis fermement. Figure 11. Installation de la cosse de mise à la terre Figure 12. Installation de la plaque de montage avant Installation de l'armoire 21 Remarque : La barre omnibus peut se trouver en haut ou en bas de l'armoire. 1 Plaque de montage à barre omnibus 3 Vis à tête hexagonale (M5 x 20) (quantité 2) 2 Rondelle de fixation (x2) 4 Barre omnibus de mise à la terre Installation des plaques de montage pour l'alimentation CA sans prises CA La procédure décrite dans cette section permet d'installer une plaque de montage sans prises. Si vous ne souhaitez pas installer de prise de courant alternatif sur la plaque de montage de prise de courant alternatif avant ou arrière, effectuez uniquement les étapes des sections «Fixation de l'armoire à un plancher en béton sous un faux plancher», à la page 13 et «Raccordement du système de distribution de l'alimentation», à la page 18. Pour les plaques de montage de prise de courant alternatif avant ou arrière, voir «Installation des plaques de montage pour l'alimentation CA avec prises CA», à la page 19. Figure 13. Installation de la plaque de montage arrière 22 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Connexion d'une source d'alimentation CC Certains modèles d'armoire (tel que le 14T/00) peuvent prendre en charge une configuration d'alimentation en courant continu pour les systèmes qui le nécessitent. Utilisez la procédure décrite dans cette section pour connecter une source d'alimentation en courant continu. Cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche Remarque : Le client est responsable de la fourniture et du raccordement de la source d'alimentation -48 V cc et des câbles de retour électriques -48 V cc à partir de la source du client -48 V cc jusqu'aux barres omnibus du panneau d'alimentation. Il est également responsable de la connexion du conducteur de terre au cadre de l'armoire. Cette procédure fournit des informations sur l'accès au panneau d'alimentation. Procédure 1. Retirez les six vis de fixation du capot supérieur du panneau d'alimentation en courant continu et déposez le capot supérieur. 2. Si elles sont en place, retirez les quatre vis du capot du chemin de câble. 3. Retirez le capot du chemin de câble. 1 Vis de fixation du capot du chemin de câble 5 Boîtier de protection 2 Capot du chemin de câble 6 Panneau d'alimentation 3 Vis de fixation du capot supérieur du panneau d'alimentation 4 Capot supérieur du panneau d'alimentation 4. Retirez le boîtier de protection de la barre omnibus -48 V cc du panneau d'alimentation. Figure 14. Retrait du capot du chemin de câble Installation de l'armoire 23 Avertissement : Le boîtier de protection de la barre omnibus doit être correctement réinstallé sur les barres omnibus de retour -48 V cc afin d'empêcher d'éventuelles blessures pendant la maintenance du panneau d'alimentation. 5. Vérifiez que les étapes suivantes sont exécutées pendant le branchement de la source d'alimentation en courant continu. a. A la source d'alimentation -48 V cc, mettez hors tension toutes les sources d'alimentation -48 V cc qui seront raccordées au panneau d'alimentation. b. Une fois les sources d'alimentation -48 V cc hors tension, assurez-vous qu'une étiquette, sur les commutateurs ou les fusibles de la source d'alimentation (dispositif de blocage), indique que la source d'alimentation a été mise hors tension volontairement. Remarque : Vérifiez que les barres omnibus en cuivre ne subissent aucune oxydation. c. S'il s'agit d'une installation sur un faux plancher et que vous travaillez à l'arrière de l'armoire, faites passer les câbles le long du côté droit de l'armoire. d. Vérifiez que le cordon d'alimentation -48 V cc externe est raccordé correctement à la barre omnibus -48 V cc. e. Vérifiez que le câble de retour -48 V cc externe est raccordé correctement et qu'il est installé sur la barre omnibus de retour. 1 Cordon d'alimentation et câble de retour -48 V cc 2 Panneau d'alimentation 3 Avant de l'armoire 4 Cordon d'alimentation et câble de retour -48 V cc Figure 15. Routage des cordons d'alimentation 24 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 1 Avant du panneau d'alimentation 6 (B) Retour (-) Cordon d'alimentation 2 (A) -48 V cc (-) Barre omnibus 7 (B) -48 V cc (-) Cordon d'alimentation 3 (A) -48 V cc (-) Cordon d'alimentation 8 (B) Retour (-) Barre omnibus 4 (A) Retour (-) Barre omnibus 9 (B) -48 V cc (-) Barre omnibus 5 (A) Retour (-) Cordon d'alimentation f. Pour installer une alarme d'état de l'alimentation, raccordez le câble d'alarme à la plaque à bornes sur le capot arrière du panneau d'alimentation en courant continu. Remarque : Veillez à éliminer l'oxydation sur les barres omnibus en cuivre. g. Vérifiez que le conducteur de terre de l'alimentation est correctement placé et raccordez-le à la barre de cuivre, au centre du panneau arrière inférieur ou supérieur de l'armoire. h. Si l'armoire est posée sur un faux plancher, fixez les câbles de l'unité d'alimentation -48 V cc à l'arrière de l'armoire à l'aide de sangles de maintien du câblage. Figure 16. Barre omnibus de retour Installation de l'armoire 25 1 Vue arrière d'une armoire (cc) 2 Cordon d'alimentation, câble de retour et conducteur de terre 3 Conducteur de terre (installation en haut ou en bas de l'armoire) 6. Réinstallez le boîtier de protection de la barre omnibus -48 V cc. 7. Réinstallez le capot supérieur sur le panneau d'alimentation en courant continu. 8. Réinstallez le capot du chemin de câble. 1 Capot du chemin de câble Figure 17. Emplacements de câble 26 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 2 Plaque à bornes (deux côtés) 3 Panneau d'alimentation 4 Avant de l'armoire Figure 18. Réinstallation du capot du chemin de câble Installation de l'armoire 27 28 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Installation en armoire d'une unité centrale ou d'une unité d'extension Il se peut que vous deviez installer une unité centrale ou une unité d'extension dans une armoire. Outre les informations de sécurité, cette section fournit les procédures à suivre pour effectuer ces tâches. Cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Les procédures suivantes expliquent comment installer des unités centrales ou des unités d'extension dans une armoire. Vous pouvez exécuter cette tâche vous-même ou faire appel à un fournisseur de services. Il vous imputera probablement les frais correspondants. Installation du modèle 02E/4B ou 05F/6B en armoire Il se peut que vous deviez installer l'unité centrale dans une armoire. La procédure de cette section permet d'effectuer cette tâche. Cette opération incombe au client. Vous pouvez exécuter vous-même cette tâche ou faire appel à un fournisseur de services. Ce dernier vous imputera probablement les frais correspondants. Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer le guide à l'armoire, une devant l'armoire, et l'autre à l'arrière. Trois personnes sont nécessaires pour soulever l'unité centrale dans l'armoire. Cette procédure suppose que l'installation de l'unité centrale s'effectue dans une armoire déjà montée et mise en place. Si l'armoire n'est pas installée, consultez les instructions du «Installation de l'armoire», à la page 5, puis revenez à cette procédure pour obtenir des instructions sur l'installation de l'unité d'extension dans l'armoire. Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas d'installer un modèle autonome dans une armoire. Pour installer le modèle 02E/4B ou 05F/6B dans une armoire, procédez comme suit : ATTENTION : L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Faites l'inventaire des composants. La figure ci-dessous présente les éléments dont vous avez besoin pour installer le serveur dans l'armoire. 29 Si des pièces sont manquantes, endommagées, ou ne correspondent pas à la commande, veuillez contacter : v Revendeur 3. Repérez le kit de montage en armoire (A) et les guides (B) fournis avec l'unité centrale, comme indiqué dans figure 19. Les guides (B) sont dans le sens avant en arrière et gauche à droite. Les glissières portent des étiquettes indiquant si elles doivent être placées à gauche ou à droite par rapport à l'avant de l'armoire. L'arrière de chaque guide possède deux grands assemblages de loquets. Ces assemblages de loquets doivent être placés à l'arrière de l'armoire. 4. Choisissez l'emplacement d'installation de l'unité centrale dans l'armoire. Voir «Détermination de l'emplacement». 5. Fixez le matériel de montage à l'armoire. Pour plus d'informations, voir Fixation du support de montage à l'armoire. Détermination de l'emplacement Il se peut que vous deviez déterminer l'emplacement d'installation du système dans l'armoire. Cette procédure permet d'y parvenir. Pourquoi et quand exécuter cette tâche Avant d'installer l'unité d'extension dans une armoire, procédez comme suit : Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire. Ce système fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure 44,45 mm de haut. L'armoire contient trois trous de montage pour chaque unité EIA sur la hauteur. Cette unité centrale mesure donc 177,8 mm (7 po) de haut et couvre 12 orifices de fixation de l'armoire. Figure 19. Inventaire des composants 30 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 3. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de l'armoire. 4. Le cas échéant, retirez les volets avant et arrière de l'armoire. Résultats Marquage de l'emplacement Il se peut que vous deviez marquer l'emplacement d'installation. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez comme suit : Figure 20. Retrait des panneaux obturateurs Installation en armoire d'une unité centrale ou d'une unité d'extension 31 Procédure 1. Déterminez l'emplacement du système dans l'armoire. Installez les unités en commençant par le bas. Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire. Mémorisez l'emplacement EIA. Ce système fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure 44,45 mm de haut. L'armoire comporte trois orifices de montage pour chaque unité EIA de hauteur. Ce système mesure donc 177,8 mm (7 po) de haut et couvre 12 orifices de fixation de l'armoire. 2. Face à l'avant de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera utilisée par votre unité d'extension. Mémorisez l'emplacement EIA. Utilisez du ruban adhésif, un marqueur ou un crayon pour marquer l'orifice inférieur de cette unité EIA (A). Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. 3. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une seconde marque (B). Marquez l'armoire de sorte que la marque soit également visible depuis l'arrière de l'armoire (voir figure 21). 4. Répétez les deux étapes précédentes sur le côté gauche de l'armoire. 5. Face à l'arrière de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera utilisée par votre unité d'extension. Faites une marque sur l'armoire au niveau de l'orifice inférieur de cette unité EIA (A). 6. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une deuxième marque (B) (voir figure 21). 7. Commencez par l'orifice identifié par votre marque (B). Comptez un orifice et placez une troisième marque (C) (voir figure 21). 8. Répétez les deux étapes précédentes sur le côté gauche de l'armoire. Résultats Fixation du support de montage 02E/4B ou 05F/6B dans l'armoire Vous devez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer cette tâche. Figure 21. Marquage des orifices à l'avant et à l'arrière du châssis de l'armoire 32 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Pourquoi et quand exécuter cette tâche Avertissement : Pour éviter d'entraîner une défaillance des guides, de vous blesser ou d'endommager l'unité, vérifiez que vous disposez des guides et raccords adaptés à votre armoire. Si votre armoire a des trous à rebord pour support carrés ou des trous à rebord pour support de filetage de vis, veillez à ce que les guides et raccords correspondent aux trous à rebord pour support utilisés sur votre armoire. N'installez pas de matériel non conforme à l'aide de rondelles ou de cales d'espacement. Si vous ne disposez pas des guides et raccords adéquats pour votre armoire, veuillez contacter votre revendeur. De même, pour installer correctement les guides, effectuez chaque tâche dans l'ordre suivant. Pour installer le support de montage dans l'armoire, procédez comme suit : Procédure 1. Chaque glissière est marquée d'un R (right, pour droite) ou d'un L (left, pour gauche). Sélectionnez l'une des glissières et poussez vers le haut la languette mobile avant (1). Retirez le taquet avant (2) pour faire glisser la glissière avant. Si une vis moletée est installée dans la glissière (3), retirez-la. Remarque : Assurez-vous que le taquet mobile reste étendu et ne s'enclenche pas de nouveau. 2. Alignez les trois broches à l'arrière de la glissière avec les trois orifices dans l'unité EIA sélectionnée à l'arrière de l'armoire. Poussez les glissières afin que les broches rentrent dans les orifices (1), et lâchez la glissière (2) pour qu'elle s'enclenche. Figure 22. Glissière et taquet mobile Figure 23. Alignez les broches avec les orifices à l'arrière de l'armoire Installation en armoire d'une unité centrale ou d'une unité d'extension 33 3. Tirez la glissière vers l'avant et insérez les deux broches (1) à l'avant de la glissière dans les deux orifices inférieurs dans le U à l'avant de l'armoire. Faites descendre la glissière en place jusqu'à ce qu'elle s'enclenche. Enfoncez le taquet avant (2) jusqu'au fond. 4. Répétez les trois étapes précédentes pour installer l'autre glissière dans l'armoire. Vérifiez que chaque taquet avant est bien en place. 5. Tirez les glissières vers l'avant (1) jusqu'à ce qu'elles s'enclenchent, avec deux clics. Soulevez avec précaution le serveur et inclinez-le pour le mettre en position au-dessus des glissières de sorte que les têtes de clou arrière (2) sur le serveur s'alignent avec les logements arrière (3) sur les glissières. Faites glisser le serveur vers le bas jusqu'à ce que les têtes de clou arrière s'enclenchent dans les deux emplacements arrière, puis faites lentement descendre l'avant du serveur (4) jusqu'à ce que les autres têtes de clou s'enclenchent dans les autres emplacements sur les glissières. Veillez à ce que le taquet avant (5) passe au-dessus des têtes de clou. 6. Soulevez les taquets de déverrouillage bleus (1) sur les glissières et poussez le serveur (2) jusqu'au fond de l'armoire jusqu'à ce qu'il s'enclenche. Figure 24. Glissière avant et broches de l'armoire Figure 25. Glissières étendues et têtes de clou du serveur alignées sur les emplacements des rails 34 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 7. Installez le bras de routage des câbles. Pour plus d'informations, voir «Installation du bras de routage des câbles». Installation du bras de routage des câbles Il se peut que vous deviez installer le bras de routage des câbles. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour installer le bras de routage des câbles, procédez comme suit : Procédure 1. Installez le bras de routage des câbles sur le côté droit du serveur, lorsque vous êtes face à l'arrière. Connectez une extrémité du bras de support (1) à la glissière sur laquelle vous comptez fixer le bras de routage des câbles afin de pouvoir balancer l'autre extrémité du bras (2) vers l'armoire. 2. Installez le support d'arrêt de routage des câbles en forme de L (1) sur l'extrémité libre du bras de support. Tournez le support (2) afin de le fixer au bras de support. Figure 26. Taquets de déverrouillage et serveur Figure 27. Connexion du bras de support Installation en armoire d'une unité centrale ou d'une unité d'extension 35 3. Pour fixer l'autre côté du bras de support à l'arrière de la glissière, sortez la broche de (1), puis faites glisser le support (2) dans la glissière. 4. Retirez la broche du support de fixation (1) et faites glisser le support de fixation (2) dans la glissière sur laquelle vous installez le bras de routage des câbles. Poussez le support dans la glissière jusqu'à ce que la broche s'enclenche. Figure 28. Support d'arrêt de routage des câbles fixé au bras de support Figure 29. Broche étendue et support installé dans la glissière 36 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 5. Placez le bras de routage des câbles sur le bras de support. Retirez la broche du bras de routage des câbles (1), puis faites glisser la languette du bras de routage des câbles (2) dans l'encoche située à l'intérieur de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche. Retirez l'autre broche du bras de routage des câbles (3), et faites glisser la languette de ce bras de routage des câbles dans l'encoche (4) de la partie extérieure de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche. 6. Faites passer les cordons d'alimentation et autres câbles (notamment clavier, écran et souris, si nécessaire) sur le bras de routage des câbles (1). Branchez tous les câbles à l'arrière du serveur, sauf le cordon d'alimentation. Fixez le cordon d'alimentation et les câbles à l'aide des attache-câble ou des attaches velcro. Remarque : Laissez du mou à tous les câbles afin d'éviter tensions dans les câbles lorsque le bras de routage des câbles bouge. Figure 30. Broche du support de fixation étendue et support de fixation installé dans la glissière Figure 31. Connexion du bras de routage des câbles Installation en armoire d'une unité centrale ou d'une unité d'extension 37 7. Faites glisser le serveur dans l'armoire jusqu'à ce qu'il s'enclenche. Résultats Raccordement de câbles d'alimentation au système Il se peut que vous deviez raccorder des câbles d'alimentation au système. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour raccorder des câbles d'alimentation au système, procédez comme suit : Procédure 1. En vous tenant face à l'arrière de l'unité centrale, insérez le cordon d'alimentation du système dans le passe-câbles horizontale. 2. Branchez le cordon d'alimentation dans l'unité centrale. Remarque : Si le système est équipé deux alimentations, chacune doit se voir brancher son propre cordon d'alimentation. 3. Branchez le cordon d'alimentation du système et les cordons d'alimentation des éventuelles imprimantes ou périphériques d'affichage sur le secteur en CA (courant alternatif). 4. Mettez sous tension l'unité centrale ainsi que tout périphérique raccordé dessus. Résultats Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois l'installation terminée, consultez Boîtiers et unités d'extension. Installation du modèle 31E/2B en armoire Il se peut que vous deviez installer l'unité centrale dans une armoire. La procédure de cette section permet d'effectuer cette tâche. Cette opération incombe au client. Vous pouvez exécuter vous-même cette tâche ou faire appel à un fournisseur de services. Ce dernier vous imputera probablement les frais correspondants. Figure 32. Fixation et routage du cordon d'alimentation 38 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer le guide à l'armoire, une devant l'armoire, et l'autre à l'arrière. Trois personnes sont nécessaires pour soulever l'unité centrale dans l'armoire. Cette procédure suppose que l'installation de l'unité centrale s'effectue dans une armoire déjà montée et mise en place. Si l'armoire n'est pas installée, consultez les instructions du «Installation de l'armoire», à la page 5, puis revenez à cette procédure pour obtenir des instructions sur l'installation de l'unité d'extension dans l'armoire. Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas d'installer un modèle autonome dans une armoire. Pour installer le modèle 31E/2B en armoire, procédez comme suit : ATTENTION : L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Faites l'inventaire des composants. La figure ci-dessous présente les éléments dont vous avez besoin pour installer le serveur dans l'armoire. Si des pièces sont manquantes, endommagées, ou ne correspondent pas à la commande, veuillez contacter : v Revendeur 3. Repérez le kit de montage en armoire (A) et les guides (B) fournis avec l'unité centrale, comme indiqué dans figure 33. Les guides (B) sont dans le sens avant en arrière et gauche à droite. Les glissières portent des étiquettes indiquant si elles doivent être placées à gauche ou à droite par rapport à l'avant de l'armoire. L'arrière de chaque guide possède deux grands assemblages de loquets. Ces assemblages de loquets doivent être placés à l'arrière de l'armoire. Figure 33. Inventaire des composants Installation en armoire d'une unité centrale ou d'une unité d'extension 39 4. Choisissez l'emplacement d'installation de l'unité centrale dans l'armoire. Voir «Détermination de l'emplacement». 5. Fixez le matériel de montage à l'armoire. Pour plus d'informations, voir Fixation du support de montage à l'armoire. Détermination de l'emplacement Il se peut que vous deviez déterminer l'emplacement d'installation du système dans l'armoire. Cette procédure permet d'y parvenir. Pourquoi et quand exécuter cette tâche Avant d'installer l'unité d'extension dans une armoire, procédez comme suit : Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire. Ce système fait deux unités EIA (Electronic Industries Alliance) de haut. Une unité EIA mesure 44,45 mm de haut. L'armoire contient trois trous de montage pour chaque unité EIA sur la hauteur. Cette unité centrale mesure donc 88,9 mm (3,5 po) de haut et couvre six orifices de fixation de l'armoire. 3. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de l'armoire. 40 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 4. Le cas échéant, retirez les volets avant et arrière de l'armoire. Résultats Marquage de l'emplacement Il se peut que vous deviez marquer l'emplacement d'installation. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez comme suit : Procédure 1. Déterminez l'emplacement du système dans l'armoire. Installez les unités en commençant par le bas. Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire. Figure 34. Retrait des panneaux obturateurs Installation en armoire d'une unité centrale ou d'une unité d'extension 41 Mémorisez l'emplacement EIA. Ce système fait deux unités EIA (Electronic Industries Alliance) de haut. Une unité EIA mesure 44,45 mm de haut. L'armoire comporte trois orifices de montage pour chaque unité EIA de hauteur. Ce système mesure donc 88,9 mm (3,5 po) de haut et couvre six orifices de fixation de l'armoire. 2. Face à l'avant de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera utilisée par votre unité d'extension. Mémorisez l'emplacement EIA. Utilisez du ruban adhésif, un marqueur ou un crayon pour marquer l'orifice inférieur de cette unité EIA (A). Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. 3. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une seconde marque (B). Marquez l'armoire de sorte que la marque soit également visible depuis l'arrière de l'armoire (voir figure 35). 4. Répétez les deux étapes précédentes sur le côté gauche de l'armoire. 5. Face à l'arrière de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera utilisée par votre unité d'extension. Faites une marque sur l'armoire au niveau de l'orifice inférieur de cette unité EIA (A). 6. Commencez par l'orifice identifié par votre marque (A). Comptez un orifice et placez une seconde marque (B) (voir figure 35). 7. Commencez par l'orifice identifié par votre marque (B). Comptez un orifice et placez une troisième marque (C) (voir figure 35). 8. Répétez les deux étapes précédentes sur le côté gauche de l'armoire. Résultats Fixation du matériel de montage 31E/2B à l'armoire Vous devez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Avertissement : Pour éviter d'entraîner une défaillance des guides, de vous blesser ou d'endommager l'unité, vérifiez que vous disposez des guides et raccords adaptés à votre armoire. Si votre armoire a des trous à rebord pour support carrés ou des trous à rebord pour support de filetage de vis, veillez à ce que les guides et raccords correspondent aux trous à rebord pour support utilisés sur votre armoire. N'installez pas de matériel non conforme à l'aide de rondelles ou de cales d'espacement. Si vous ne disposez pas des guides et raccords adéquats pour votre armoire, veuillez contacter votre revendeur. De même, pour installer correctement les guides, effectuez chaque tâche dans l'ordre suivant. Pour installer le support de montage dans l'armoire, procédez comme suit : Procédure 1. Chaque glissière est marquée d'un R (right, pour droite) ou d'un L (left, pour gauche). Sélectionnez l'une des glissières et poussez vers le haut la languette mobile avant (1). Retirez le taquet avant (2) Figure 35. Marquage des orifices à l'avant et à l'arrière du châssis de l'armoire 42 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension pour faire glisser la glissière avant. Si une vis moletée est installée dans la glissière (3), retirez-la. Remarque : Assurez-vous que le taquet mobile reste étendu et ne s'enclenche pas de nouveau. 2. Alignez les trois broches à l'arrière de la glissière avec les trois orifices dans l'unité EIA sélectionnée à l'arrière de l'armoire. Poussez les glissières afin que les broches rentrent dans les orifices (1), et lâchez la glissière (2) pour qu'elle s'enclenche. 3. Tirez la glissière vers l'avant et insérez les deux broches (1) à l'avant de la glissière dans les deux orifices inférieurs dans le U à l'avant de l'armoire. Faites descendre la glissière en place jusqu'à ce qu'elle s'enclenche. Enfoncez le taquet avant (2) jusqu'au fond. Figure 36. Glissière et taquet mobile Figure 37. Alignez les broches avec les orifices à l'arrière de l'armoire Figure 38. Glissière avant et broches de l'armoire Installation en armoire d'une unité centrale ou d'une unité d'extension 43 4. Répétez les trois étapes précédentes pour installer l'autre glissière dans l'armoire. Vérifiez que chaque taquet avant est bien en place. 5. Tirez les glissières vers l'avant (1) jusqu'à ce qu'elles s'enclenchent, avec deux clics. Soulevez avec précaution le serveur et inclinez-le pour le mettre en position au-dessus des glissières de sorte que les têtes de clou arrière (2) sur le serveur s'alignent avec les logements arrière (3) sur les glissières. Faites glisser le serveur vers le bas jusqu'à ce que les têtes de clou arrière s'enclenchent dans les deux emplacements arrière, puis faites lentement descendre l'avant du serveur (4) jusqu'à ce que les autres têtes de clou s'enclenchent dans les autres emplacements sur les glissières. Veillez à ce que le taquet avant (5) passe au-dessus des têtes de clou. 6. Soulevez les taquets de déverrouillage bleus (1) sur les glissières et poussez le serveur (2) jusqu'au fond de l'armoire jusqu'à ce qu'il s'enclenche. 7. Installez le bras de routage des câbles. Pour plus d'informations, voir «Installation du bras de routage des câbles». Installation du bras de routage des câbles Il se peut que vous deviez installer le bras de routage des câbles. Cette procédure permet d'effectuer cette tâche. Figure 39. Glissières étendues et têtes de clou du serveur alignées sur les emplacements des rails Figure 40. Taquets de déverrouillage et serveur 44 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Pourquoi et quand exécuter cette tâche Pour installer le bras de routage des câbles, procédez comme suit : Procédure 1. Installez le bras de routage des câbles sur le côté droit du serveur, lorsque vous êtes face à l'arrière. Connectez une extrémité du bras de support (1) à la glissière sur laquelle vous comptez fixer le bras de routage des câbles afin de pouvoir balancer l'autre extrémité du bras (2) vers l'armoire. 2. Installez le support d'arrêt de routage des câbles en forme de L (1) sur l'extrémité libre du bras de support. Tournez le support (2) afin de le fixer au bras de support. 3. Pour fixer l'autre côté du bras de support à l'arrière de la glissière, sortez la broche de (1), puis faites glisser le support (2) dans la glissière. Figure 41. Connexion du bras de support Figure 42. Support d'arrêt de routage des câbles fixé au bras de support Installation en armoire d'une unité centrale ou d'une unité d'extension 45 4. Retirez la broche du support de fixation (1) et faites glisser le support de fixation (2) dans la glissière sur laquelle vous installez le bras de routage des câbles. Poussez le support dans la glissière jusqu'à ce que la broche s'enclenche. 5. Placez le bras de routage des câbles sur le bras de support. Retirez la broche du bras de routage des câbles (1), puis faites glisser la languette du bras de routage des câbles (2) dans l'encoche située à l'intérieur de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche. Retirez l'autre broche du bras de routage des câbles (3), et faites glisser la languette de ce bras de routage des câbles dans l'encoche (4) de la partie extérieure de la glissière. Enfoncez la languette jusqu'à ce qu'elle s'enclenche. Figure 43. Broche étendue et support installé dans la glissière Figure 44. Broche du support de fixation étendue et support de fixation installé dans la glissière 46 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 6. Faites passer les cordons d'alimentation et autres câbles (notamment clavier, écran et souris, si nécessaire) sur le bras de routage des câbles (1). Branchez tous les câbles à l'arrière du serveur, sauf le cordon d'alimentation. Fixez le cordon d'alimentation et les câbles à l'aide des attache-câble ou des attaches velcro. Remarque : Laissez du mou à tous les câbles afin d'éviter tensions dans les câbles lorsque le bras de routage des câbles bouge. 7. Faites glisser le serveur dans l'armoire jusqu'à ce qu'il s'enclenche. Résultats Raccordement de câbles d'alimentation au système Il se peut que vous deviez raccorder des câbles d'alimentation au système. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour raccorder des câbles d'alimentation au système, procédez comme suit : Figure 45. Connexion du bras de routage des câbles Figure 46. Fixation et routage du cordon d'alimentation Installation en armoire d'une unité centrale ou d'une unité d'extension 47 Procédure 1. En vous tenant face à l'arrière de l'unité centrale, insérez le cordon d'alimentation du système dans le passe-câbles horizontale. 2. Branchez le cordon d'alimentation dans l'unité centrale. Remarque : Si le système est équipé deux alimentations, chacune doit se voir brancher son propre cordon d'alimentation. 3. Branchez le cordon d'alimentation du système et les cordons d'alimentation des éventuelles imprimantes ou périphériques d'affichage sur le secteur en CA (courant alternatif). 4. Mettez sous tension l'unité centrale ainsi que tout périphérique raccordé dessus. Résultats Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois l'installation terminée, consultez Boîtiers et unités d'extension. Installation du modèle 33E/8B ou 36E/8C en armoire Il se peut que vous deviez installer l'unité centrale dans une armoire. La procédure de la présente section vous permet d'effectuer cette tâche. Outre les informations de sécurité permettant d'effectuer la procédure de façon fiable et sans danger, cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Cette opération incombe au client. Vous pouvez exécuter vous-même cette tâche ou faire appel à un fournisseur de services. Ce dernier vous imputera probablement les frais correspondants. Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer le guide à l'armoire, une devant l'armoire, et l'autre à l'arrière. Trois personnes seront nécessaires pour soulever l'unité centrale dans l'armoire. Cette procédure suppose que l'installation de l'unité centrale s'effectue dans une armoire déjà montée et mise en place. Si l'armoire n'est pas installée, consultez les instructions de la rubrique «Installation de l'armoire», à la page 5, puis revenez à cette procédure pour obtenir des instructions sur l'installation de l'unité d'extension dans l'armoire. Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas d'installer un modèle autonome dans une armoire. Pour installer le modèle 33E/8B ou 36E/8C dans une armoire, procédez comme suit : ATTENTION : L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Si ce n'est déjà fait, reportez-vous à la figure 47, à la page 49 et terminez l'inventaire des composants. 48 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension v A Kit de montage en armoire – Quatre grandes vis de fixation – Deux vis de taille moyenne – Deux écrous clip – Huit petites vis de fixation – Deux étriers de fixation de glissière – Deux passe-câbles v B glissières d'armoire droite et gauche avec des supports d'armoire v C Fixations de bras de routage des câbles v D Deux équerres de bras de routage des câbles (une pour l'installation à gauche, une pour l'installation à droite) v E Bras de routage des câbles v F Deux taquets d'armoire Si des pièces sont manquantes, endommagées, ou ne correspondent pas à la commande, veuillez contacter : v Revendeur 3. Repérez le kit de montage en armoire (A), et les guides (B) fournis avec l'unité centrale, comme indiqué dans la figure 47. Figure 47. Kit de montage en armoire Installation en armoire d'une unité centrale ou d'une unité d'extension 49 Les guides (B) sont dans le sens avant en arrière et gauche à droite. Les glissières portent des étiquettes indiquant si elles doivent être placées à gauche ou à droite par rapport à l'avant de l'armoire. L'arrière de chaque guide possède deux grands assemblages de loquets. Ces assemblages de loquets doivent être placés à l'arrière de l'armoire. 4. Choisissez l'emplacement d'installation de l'unité centrale dans l'armoire. Pour plus d'informations, voir «Détermination de l'emplacement». 5. Ensuite, vous devez fixer le support de montage à l'armoire. Pour plus d'informations, voir Fixation du support de montage à l'armoire. Détermination de l'emplacement Il se peut que vous deviez déterminer l'emplacement d'installation du système dans l'armoire. Cette procédure permet d'y parvenir. Pourquoi et quand exécuter cette tâche Avant d'installer l'unité d'extension dans une armoire, procédez comme suit : Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire. Ce système fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure 44,45 mm de hauteur. L'armoire contient trois trous de montage pour chaque unité EIA sur la hauteur. Cette unité centrale mesure donc 177,8 mm de hauteur et correspond à 12 trous de vis sur l'armoire. 3. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de l'armoire. 50 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 4. Le cas échéant, retirez les volets avant et arrière de l'armoire. Résultats Marquage de l'emplacement Il se peut que vous deviez marquer l'emplacement d'installation. Cette procédure permet d'y parvenir. Pourquoi et quand exécuter cette tâche Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez comme suit : Procédure 1. Déterminez l'emplacement du système dans l'armoire. Installez les unités en commençant par le bas. Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire. Mémorisez l'emplacement EIA. Ce système fait quatre unités EIA (Electronic Industries Alliance) de Figure 48. Retrait des panneaux obturateurs Installation en armoire d'une unité centrale ou d'une unité d'extension 51 hauteur. Une unité EIA mesure 44,45 mm de hauteur. L'armoire contient trois trous de montage par unité EIA sur la hauteur. Cette unité fait donc 177,8 mm de haut et correspond à 12 trous de vis sur l'armoire. 2. Mettez-vous face à l'avant de l'armoire et opérez à partir du côté droit. Utilisez du ruban adhésif, un marqueur ou un crayon pour marquer les trous. Marquez le trou médian A de l'unité EIA se trouvant le plus bas parmi les quatre utilisés pour cette unité centrale. Vous devez identifier le trou marqué depuis l'avant et l'arrière de l'armoire. 3. Marquez le trou du milieu de l'unité EIA inférieure, sur le côté gauche de l'armoire (A) comme indiqué sur la figure 49. 4. Placez-vous à l'arrière de l'armoire. Sur le côté droit, localisez l'unité EIA qui correspond à l'unité EIA inférieure indiquée sur le devant de l'armoire. 5. Marquez l'orifice du milieu de cette unité EIA inférieure (B) comme indiqué sur la figure 49. 6. Marquez le trou inférieur de l'unité EIA suivante (située plus haut), (C) comme indiqué sur la figure 49. 7. Marquez les trous correspondants sur le côté gauche de l'armoire. Résultats Fixation du support de montage 33E/8B ou 36E/8C dans l'armoire Vous devez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer cette tâche. Cette section fournit des informations permettant d'effectuer les opérations de manière fiable et sans danger. Cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche Avertissement : Pour éviter un mauvais montage des glissières, ainsi que de vous blesser ou d'endommager l'unité, vérifiez que vous disposez des glissières et raccords adaptés à votre armoire. Si votre armoire a des trous à rebord pour support carrés ou des trous à rebord pour support de filetage de vis, veillez à ce que les guides et raccords correspondent aux trous à rebord pour support utilisés sur votre armoire. N'installez pas de matériel non conforme à l'aide de rondelles ou de cales d'espacement. Si vous ne disposez pas des guides et raccords adéquats pour votre armoire, veuillez contacter votre revendeur. De même, pour installer correctement les guides, effectuez chaque tâche dans l'ordre suivant. Pour installer le support de montage dans l'armoire, procédez comme suit : Figure 49. Marquage des trous sur l'avant et l'arrière du cadre de l'armoire 52 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Procédure 1. Avec la bonne glissière, tournez la patte de déverrouillage du bloc loquet (A) dans la figure 50. Faites glisser la patte de déverrouillage (B) en position escamotée et verrouillez le bloc loquet. Les goupilles d'alignement arrière doivent être entièrement escamotées. 2. Une fois les goupilles d'alignement escamotées, introduisez la goupille d'alignement avant de la glissière de droite dans l'orifice du rebord avant de l'armoire, comme indiqué sur la figure 50. Cet orifice est identifié par la marque précédemment apposée. Demandez à une deuxième personne de maintenir fermement la glissière en position. 3. Alignez les goupilles d'alignement arrière de la glissière sur les orifices à l'arrière de l'armoire. Les orifices sont identifiés par les marques situées à l'arrière de l'armoire. Veillez à ce que les guides soient au même niveau. 4. Faites coulisser la patte de déverrouillage, (B), pour étendre les deux goupilles d'alignement arrière dans l'arrière de l'armoire. Veillez à ce que les goupilles soient passées dans les trous correspondants à l'arrière du cadre de l'armoire. 5. Placé à l'arrière de l'armoire, comme illustré sur figure 51, à la page 54, installez l'étrier de fixation de la glissière sur les goupilles d'alignement. Puis introduisez une grande vis de retenue de glissière dans l'orifice situé entre les deux goupilles d'alignement arrière. Serrez partiellement la vis. Ne serrez pas complètement à cette étape. Remarque : Les étriers sont étiquetés LEFT (GAUCHE) et RIGHT (DROIT). Figure 50. Broche d'alignement, vis de fixation et taquet de verrouillage de la glissière avant Installation en armoire d'une unité centrale ou d'une unité d'extension 53 6. Répétez les étapes 1, à la page 53 à 5, à la page 53 pour le guide gauche. 7. Localisez les taquets de verrouillage (F) sur figure 47, à la page 49. Pour installer les taquets de verrouillage de l'armoire, procédez comme suit : a. Faites pivoter le haut du support vers l'extérieur (voir la figure 52, à la page 55). Figure 51. Installation des étriers de fixation de glissière 54 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension b. Alignez le support avec le logement sur le côté du serveur (voir la figure 53, à la page 56). Figure 52. Rotation du haut du support vers l'extérieur Installation en armoire d'une unité centrale ou d'une unité d'extension 55 c. Déplacez le support vers le haut afin d'engager le taquet (voir la figure 54). d. Faites pivoter le haut du support vers l'arrière de façon à positionner la languette supérieure sur le haut du système. Puis, installez les vis comme indiqué sur figure 55, à la page 57. Figure 53. Alignez le support avec son logement sur le côté de l'unité. Figure 54. Déplacement du support vers le haut afin d'engager le taquet 56 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 8. Etirez les glissières intérieures en les tirant vers l'extérieur. Elles doivent être tirées hors du cadre comme les glissières illustrées dans figure 56. 9. Avec l'aide de trois personnes, saisissez les deux poignées situées de chaque côté du système et placez-le sur la glissière intérieure. Alignez la languette A sur la fente B comme indiqué par la ligne verticale tiretée sur figure 56. 10. Appuyez simultanément sur les taquets de sécurité et enfoncez le système dans l'armoire jusqu'à ce qu'il soit solidement en place, comme indiqué dans la figure 57, à la page 58. Figure 55. Faites pivoter le haut du support vers l'arrière de façon à positionner la languette supérieure sur le haut du système Figure 56. Placez le système sur les glissières. Installation en armoire d'une unité centrale ou d'une unité d'extension 57 11. Installez et positionnez les écrous clip avant d'insérer les vis. Utilisez les vis de fixation pour fixer le système à l'armoire, comme indiqué dans la figure 58, à la page 59. Figure 57. Guide intérieur étendu 58 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 12. Une fois les deux glissières installées, vérifiez que toutes les vis de retenue des glissières ne sont serrées qu'à la main. Les guides doivent être au même niveau d'arrière en avant et de gauche à droite. 13. Ensuite, vous devez fixer le bras de routage des câbles. Pour plus d'informations, voir «Installation du bras de routage des câbles». Installation du bras de routage des câbles Il se peut que vous deviez installer le bras de routage des câbles. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour installer le bras de routage des câbles, procédez comme suit : Procédure 1. Déterminez le côté de l'armoire sur lequel vous voulez installer le bras de routage des câbles. 2. Placez le support de bras approprié (le gauche ou le droit) au bras de routage des câbles. 3. Utilisez l'ergot (F) pour fixer le bras de routage des câbles (E) au châssis de l'armoire (D) comme illustré sur figure 59, à la page 60. Figure 58. Fixez fermement le système à l'armoire par le biais des taquets de l'armoire Installation en armoire d'une unité centrale ou d'une unité d'extension 59 Conseil : Si l'accès à l'arrière de l'armoire est rendu difficile par le grand nombre de câbles présents, il peut être plus aisé d'enlever la petite charnière d'articulation du bras de routage des câbles et de la fixer en premier. Vous pouvez ensuite fixer la section restante du bras de routage des câbles à la charnière de connexion. 4. Utilisez le deuxième ergot F pour fixer l'autre extrémité du bras de routage des câbles au rebord C qui est fixé sur la partie coulissante du guide gauche du système B (voir figure 59). Résultats Raccordement de câbles d'alimentation au système Il se peut que vous deviez raccorder des câbles d'alimentation au système. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour raccorder des câbles d'alimentation au système, procédez comme suit : Procédure 1. En vous tenant face à l'arrière de l'unité centrale, notez l'emplacement de la languette inférieure du passe-câbles. Tournez le passe-câbles dans le sens inverse des aiguilles d'une montre pour le mettre en position horizontale et insérez sa languette inférieure dans l'orifice inférieur de la glissière. Puis, tournez le passe-câbles dans le sens des aiguilles d'une montre jusqu'à le remettre en position verticale, comme indiqué dans figure 60, à la page 61. Faites passer le cordon d'alimentation du système dans le passe-câbles comme indiqué dans cette figure. Puis appuyez sur le dessus du passe-câbles et insérez la languette supérieure dans l'orifice supérieur. Remarque : La glissière comporte deux séries d'orifices dans lesquels le passe-câbles peut être installé. Ces séries d'orifices sont séparés d'environ 25,4 mm. Installez le passe-câbles dans la série d'orifices placée le plus en arrière. Figure 59. Fixation du bras de routage des câbles 60 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 2. Branchez le cordon d'alimentation dans l'unité centrale. Remarque : Si le système est équipé deux alimentations, chacune doit se voir brancher son propre cordon d'alimentation. 3. Brancher le cordon d'alimentation du système et les cordons d'alimentation des éventuels imprimantes ou périphériques d'affichage sur le secteur en CA (courant alternatif). 4. Mettez sous tension l'unité centrale ainsi que tout périphérique raccordé dessus. Résultats Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois l'installation terminée, consultez Boîtiers et unités d'extension. Installation en armoire du modèle 58/02 ou 58/77 Cette procédure peut être utilisée pour installer le boîtier d'unité de disque 58/02 ou 58/77 dans une armoire. La procédure inclut des informations destinées à promouvoir une opération sécurisée et fiable, et fournit des illustrations sur la façon dont les composants matériels sont liés les uns aux autres. v Cette procédure suppose que vous installez l'unité d'extension ou 14G/30 dans une armoire existante. Si l'armoire n'est pas installée, vous devez effectuer la procédure de «Installation de l'armoire», à la page 5 avant d'effectuer cette procédure pour installer l'unité d'extension dans l'armoire. Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas d'installer un modèle autonome dans une armoire. v Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. v Pour des instructions concernant l'installation, le retrait et le remplacement d'une unité de disque, d'un fond de pânier d'unité de disque ou dun boîtier, voir Unités de disque. Figure 60. Routage du cordon d'alimentation de l'unité centrale Installation en armoire d'une unité centrale ou d'une unité d'extension 61 Pour installer le modèle 58/02ou 58/77 dans une armoire, procédez comme suit : ATTENTION : L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte des guides, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. Avertissement : L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité d'extension. Notez également que pour 58/02 et 58/77, trois personnes sont nécessaires pour soulever le système en toute sécurité. Le non-respect de cette recommandation peut provoquer des blessures. 1. Reportez-vous à la liste des éléments du kit fournie dans le kit d'installation en armoire, et effectuez un inventaire des composants, si ce n'est déjà fait. 2. Repérez le kit matériel de montage en armoire et les guides fournis avec l'unité d'extension. 3. Déterminez l'emplacement d'installation de l'unité d'extension dans l'armoire. Pour plus d'informations, voir «Détermination de l'emplacement». Cette unité d'extension mesure quatre unités EIA (Electronic Industries Alliance) de haut. Une unité EIA mesure 4,45 mm de hauteur. L'armoire comporte trois orifices de montage pour chaque unité EIA de hauteur. 4. Marquez l'emplacement. Pour plus d'informations, voir «Marquage de l'emplacement», à la page 63. Détermination de l'emplacement Il se peut que vous deviez déterminer l'emplacement de l'unité d'extension dans l'armoire. Cette procédure permet d'y parvenir. Pourquoi et quand exécuter cette tâche Avant d'installer l'unité d'extension dans une armoire, procédez comme suit : 1. Déterminez l'emplacement de l'unité d'extension dans l'armoire par rapport au reste du matériel du système. Lors de la préparation, organisez les parties les plus grandes et les plus lourdes dans la partie inférieure de l'armoire et prévoyez d'installer les éléments de la partie inférieure de l'armoire en premier. Enregistrez les emplacements EIA dans votre plan. Remarque : Ces unités d'extension font quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA équivaut à 44,50 mm (1,75 po) de haut. L'armoire contient trois trous de montage par unité EIA sur la hauteur. Cette unité centrale mesure donc 177,8 mm (7 po) de haut et couvre 12 orifices de fixation de l'armoire. 2. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de l'armoire où vous allez placer l'unité d'extension. 62 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 3. Le cas échéant, retirez les volets avant et arrière de l'armoire. 4. Marquez l'emplacement où vous prévoyez d'installer le boîtier d'unité de disque. a. Si vous possédez un gabarit de montage, utilisez-le pour marquer l'emplacement. b. Si vous ne possédez pas de gabarit de montage, voir «Marquage de l'emplacement». Marquage de l'emplacement Il se peut que vous deviez marquer l'emplacement d'installation. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour marquer l'emplacement de l'installation et la position de mise en place des écrous clip sur une armoire, procédez comme suit : 1. Déterminez l'emplacement de l'unité d'extension dans l'armoire par rapport au reste du matériel du système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit : a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire. Figure 61. Retrait des panneaux obturateurs Installation en armoire d'une unité centrale ou d'une unité d'extension 63 b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire. c. Enregistrez les emplacements EIA dans votre plan. Remarque : Ces unités d'extension font quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour chaque unité EIA de hauteur. Cette unité centrale mesure donc 7 po (177.8 mm) de haut et couvre 12 orifices de fixation de l'armoire. 2. Mettez-vous face à l'avant de l'armoire et travillez du côté gauche. Notez l'unité EIA la plus élevée et la plus basse qui vous serviront pour l'unité d'extension. Utilisez du ruban adhésif, un marqueur ou un crayon pour marquer le trou de vis inférieur (A)de la plus haute unité EIA. Puis marquez le trou de vis supérieur (B) de l'unité EIA la plus basse. Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. 3. Comptez deux orifices de vis depuis l'endroit où vous avez placé la marque de l'orifice de vis (B) à la dernière étape et placez une autre marque à côté de cet orifice de vis (B). Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. Vous avez maintenant deux marques (B) sur l'armoire, avec un trou de montage entre ces marques. 4. Répétez les deux dernières étapes pour placer trois marques sur les orifices de montage correspondants sur la partie avant droite de l'armoire. 5. Placez-vous à l'arrière de l'armoire. Sur le côté gauche, localisez l'unité EIA qui correspond à l'unité EIA inférieure indiquée sur le devant de l'armoire. 6. Placez une marque au niveau de l'orifice supérieur (C) de l'unité EIA la plus basse. Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. 7. Comptez deux orifices de vis depuis l'endroit où vous avez placé la marque à la dernière étape et placez une autre marque à côté de cet orifice de vis (C). 8. Répétez les deux dernières étapes pour placer deux marques sur les orifices de montage correspondants sur la partie arrière droite de l'armoire. Que faire ensuite Fixation du support de montage dans l'armoire Il se peut que vous deviez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer cette tâche. Cette section intègre également des informations de sécurité permettant d'effectuer les opérations de façon fiable et sans danger, ainsi que des illustrations des composants matériels. Figure 62. Marquage des emplacements d'installation 64 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Pourquoi et quand exécuter cette tâche ATTENTION : L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. Pour installer le support de montage dans l'armoire, procédez comme suit : 1. Faites pivoter vers le bas les fixations avant et arrière (C) des glissières gauche et droite. 2. Sur chaque glissière, ôtez la bande maintenant le ressort (B) sur la glissière. Enroulez l'extrémité du ressort autour de la vis de butée de la glissière. 3. Commencez par l'avant de l'armoire. Tirez la glissière droite et introduisez les ergots (D) dans les emplacements marqués de l'armoire de sorte que la glissière soit placée recto vers le haut. Faites pivoter vers le haut la fixation avant (C) pour bloquer la glissière. Remarques : v Si votre armoire possède des trous de vis carrés, retirez les ergots (D) de la glissière. Installez les ergots de plus grande taille fournis dans le kit d'installation de l'armoire. v Bien que l'installation de la glissière puisse être effectuée par une seule personne, la procédure sera facilitée si une personne est positionnée à l'avant de l'armoire et une autre à l'arrière. 4. Tirez la glissière de gauche et introduisez les ergots (D) dans les emplacements marqués. Faites pivoter vers le haut la fixation avant (C) pour bloquer la glissière. 5. A l'avant de l'armoire, étendez la glissière de droite et insérez ses ergots (D) dans les encoches indiquées.Faites pivoter vers le haut la fixation arrière (C) pour bloquer la glissière. 6. Tirez la glissière de gauche et introduisez les ergots (D) dans les emplacements marqués. Faites pivoter vers le haut la fixation arrière (C) pour bloquer la glissière. 7. Fixez la glissière de gauche à l'arrière de l'armoire à l'aide de deux vis M5 (E), une au-dessous de chaque ergot de la glissière (D). Figure 63. Installation des glissières Installation en armoire d'une unité centrale ou d'une unité d'extension 65 8. A l'aide de deux vis M5 (E), une en-dessous de chaque ergot de glissière (D), fixez la glissière de droite à l'arrière de l'armoire. 9. Placez-vous à l'avant de l'armoire. A l'aide d'une vis M5 (E), en-dessous de la ergot de la glissière supérieure, fixez la glissière de gauche à l'avant de l'armoire. 10. A l'aide d'une vis M5 (E), en-dessous de l'ergot de la glissière supérieure (D), fixez la glissière de droite à l'avant de l'armoire. 11. Repérez les marques précédemment effectuées sur les bandes EIA (Electronic Industries Alliance) avant de l'armoire. Chaque côté comporte une marque. Insérez un écrou clip (F) à la marque du côté droit de l'armoire (voir figure 65, à la page 67). 12. Insérez un écrou clip (F) à la marque du côté gauche de l'armoire (voir figure 65, à la page 67). 13. Sur l'arrière de l'armoire, repérez les marques faites précédemment sur les bandes EIA (Electronic Industries Alliance) arrière. Chaque côté comporte deux marques. Insérez un écrou clip (F) à chaque marque du côté droit de l'armoire (voir figure 65, à la page 67). Figure 64. Fixation des glissières 66 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 14. A l'aide de deux vis M5 (H), fixez le support (G) sur le côté droit de l'armoire. 15. Insérez un écrou clip (F) à chaque marque du côté gauche de l'armoire (voir figure 65). 16. A l'aide de deux vis M5 (H), fixez le support (G) sur le côté gauche de l'armoire. Résultats Installation du modèle 58/02 ou 58/77 sur les glissières Il se peut que vous deviez installer l'unité d'extension sur les glissières. La procédure de cette section permet d'effectuer cette tâche. Outre les informations de sécurité, cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Avant de commencer 1. Avant d'installer l'unité d'extension sur les glissières, assurez-vous que les stabilisateurs sont étendus et que la plaque de soutien est fixée au bas de la face avant de l'armoire pour éviter que cette dernière ne bascule vers l'avant lorsque les glissières sont étendues hors de l'armoire. Pour plus d'informations, voir «Consignes de sécurité relatives aux armoires», à la page 102. Pourquoi et quand exécuter cette tâche Pour installer l'unité d'extension sur ses glissières, procédez comme suit : 1. Pour ôter le capot de transport situé à l'arrière de l'unité d'extension, retirez les quatre vis moletées servant à l'assujettir. Retirez le capot. Remarque : Deux vis de serrage à main sont situées dans la partie supérieure du capot et une sur chacun des côtés. 2. Si le panneau frontal est monté à l'avant du système, enlevez-le en procédant comme suit : a. Retirez les deux vis de fixations M5 (A) comme indiqué sur la figure 66, à la page 68. Figure 65. Installation du support de fixation Installation en armoire d'une unité centrale ou d'une unité d'extension 67 b. Retirez le support des informations de maintenance de son emplacement situé dans le panneau frontal. c. A l'aide des deux points tactiles bleus situés de chaque côté de l'unité d'extension, tirez sur le panneau frontal pour le retirer de l'avant de l'unité, en le maintenant bien droit. d. Retirez les deux écrous clip servant à assujettir le panneau frontal à l'unité d'extension par le rebord EIA. 3. Avec l'aide de trois autres personnes, saisissez les quatre poignées (I), soulevez l'unité d'extension et positionnez-la sur l'avant des glissières (voir la figure suivante). Avertissement : Trois personnes sont nécessaires pour effectuer cette opération en toute sécurité. Le non-respect de cette recommandation peut provoquer des blessures. Figure 66. Retrait du support des informations de maintenance et du panneau frontal. Figure 67. Installation en armoire de l'unité d'extension 68 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 4. Faites glisser l'unité d'extension dans l'armoire. 5. Installez les deux vis de fixation M5 (J) en les faisant passer à travers l'unité d'extension et les insérant dans les écrous clip sur les glissières gauche et droite. 6. Installez le panneau frontal (L) sur l'unité d'extension. Maintenez le panneau en place en appuyant dessus et fixez-le au moyen de deux vis M5 (M) sur les glissières gauche et droite. 7. Insérez la plaquette d'informations de maintenance (N) dans son emplacement situé dans le panneau central. 8. A l'arrière de l'unité, installez les quatre vis de fixation (K) en les faisant passer à travers l'unité d'extension et les supports de fixation situés sur les glissières gauche et droite. Figure 68. Installation du panneau frontal et du support des informations de maintenance. Figure 69. Installation de quatre vis à l'arrière de l'unité Installation en armoire d'une unité centrale ou d'une unité d'extension 69 Résultats Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois l'installation terminée, consultez Boîtiers et unités d'extension. Installation du modèle 58/86 en armoire Cette procédure peut être utilisée pour installer le boîtier d'unité de disque 58/86 dans une armoire. La procédure inclut des informations destinées à promouvoir une opération sécurisée et fiable, et fournit des illustrations sur la façon dont les composants matériels sont liés les uns aux autres. Avant de commencer v Cette procédure suppose que vous installez le boîtier d'unité de disque 58/86 dans une armoire existante. Si l'armoire n'est pas installée, vous devez effectuer la procédure de «Installation de l'armoire», à la page 5 avant d'effectuer cette procédure pour installer le boîtier d'unité de disque dans l'armoire. Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas d'installer un modèle autonome dans une armoire. v Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. Pourquoi et quand exécuter cette tâche L'installation du boîtier d'unité de disque dans une armoire est une tâche client. Vous pouvez exécuter vous-même cette tâche ou faire appel à un fournisseur de services. Il vous imputera probablement les frais correspondants. Important : Pour effectuer cette procédure, deux personnes sont nécessaires pour fixer les glissières à l'armoire, l'une devant l'armoire et l'autre à l'arrière. Pour installer le modèle 58/86 en armoire, procédez comme suit : ATTENTION : L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. 1. Faites l'inventaire des composants si ce n'est déjà fait. Remarque : Pour cette installation, vous aurez besoin d'un tournevis à la lame plate de 5 mm (3/16 po). Un tournevis cruciforme de taille 2 et une clé de 8 mm sont facultatifs. Voici la liste des éléments dont vous avez besoin pour installer l'unité dans l'armoire : v Glissière de droite (1) v Glissière de gauche (1) v Vis M5 (8) v Cales d'espacement de petit diamètre (8) (déjà installées, 4 par glissière) v Cales d'espacement de grand diamètre (8) Remarque : Ces cales ne sont utilisées qu'avec les armoires à support carré. v Vis à tête tronconique M4 (2) v Rondelles de 15 mm (0,60 po) de diamètre (6) Remarque : Ces rondelles ne sont utilisées qu'avec les armoires à support carré. 70 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 2. Repérez le gabarit de montage du matériel de montage et les glissières fournis avec votre boîtier d'unité de disque. Le sens des glissières et d'avant en arrière et de gauche à droite. 3. Déterminez l'emplacement d'installation du boîtier d'unité de disque dans l'armoire. Pour plus d'informations, voir «Détermination de l'emplacement». Résultats Détermination de l'emplacement Vous pouvez avoir besoin de déterminer l'emplacement d'installation du boîtier d'unité de disque dans l'armoire. Cette procédure permet d'effectuer cette tâche. Avant de commencer Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. Pourquoi et quand exécuter cette tâche Avant d'installer le boîtier d'unité de disque dans une armoire, suivez les étapes ci-dessous : 1. Déterminez l'emplacement du boîtier d'unité de disque dans l'armoire par rapport au reste du matériel du système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit : a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire. b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire. c. Enregistrez les emplacements EIA dans votre plan. Remarque : Le boîtier d'unité de disque 58/86 est de deux unités EIA (Electronic Industries Alliance). Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour chaque unité EIA de hauteur. Cette unité centrale mesure donc 3,5 po (89 mm) de haut et couvre six orifices de fixation de l'armoire. 2. Si nécessaire, retirez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de l'armoire où vous envisagez de placer le boîtier d'unité de disque. Installation en armoire d'une unité centrale ou d'une unité d'extension 71 3. Le cas échéant, retirez les volets avant et arrière de l'armoire. 4. Marquez l'emplacement où vous prévoyez d'installer le boîtier d'unité de disque. a. Si vous possédez un gabarit de montage, utilisez-le pour marquer l'emplacement. b. Si vous ne possédez pas de gabarit de montage, voir «Marquage de l'emplacement». Marquage de l'emplacement Il se peut que vous deviez marquer l'emplacement d'installation de l'armoire. Cette procédure permet d'y parvenir. Pourquoi et quand exécuter cette tâche Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez comme suit : 1. Déterminez l'emplacement du boîtier d'unité de disque dans l'armoire par rapport au reste du matériel du système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit : a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire. Figure 70. Retrait des panneaux obturateurs 72 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire. c. Enregistrez les emplacements EIA dans votre plan. Remarque : Le boîtier d'unité de disque 58/86 est de deux unités EIA (Electronic Industries Alliance). Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour chaque unité EIA de hauteur. Cette unité centrale mesure donc 3,5 po (89 mm) de haut et couvre six orifices de fixation de l'armoire. 2. Mettez-vous face à l'avant de l'armoire et travaillez sur son côté droit. Marquez l'orifice du milieu de l'unité EIA inférieure (A) comme illustré dans la figure suivante. Repérez le trou marqué depuis l'avant et l'arrière de l'armoire. 3. Marquez le trou du milieu de l'unité EIA inférieure, sur le côté gauche de l'armoire. 4. Placez-vous à l'arrière de l'armoire. Sur le côté droit, localisez l'unité EIA qui correspond à l'unité EIA inférieure indiquée sur le devant de l'armoire. Remarque : La figure suivante illustre un exemple de disposition des unités EIA, il se peut que vous ayez une configuration ou un placement différent. 5. Marquez l'orifice du milieu de l'unité EIA inférieure B. 6. Marquez l'orifice inférieur de l'unité EIA suivante (supérieure) C. 7. Marquez les trous correspondants sur le côté gauche de l'armoire. Résultats Fixation du support de montage dans l'armoire Il se peut que vous deviez fixer le support de montage dans l'armoire. Cette procédure permet d'y parvenir. Outre les informations de sécurité permettant d'effectuer la procédure de façon fiable et sans danger, cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche ATTENTION : L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. Pour installer le support de montage dans l'armoire, procédez comme suit : Figure 71. Marquage des orifices à l'avant et à l'arrière du châssis de l'armoire Installation en armoire d'une unité centrale ou d'une unité d'extension 73 1. Desserrez les deux vis de réglage de glissière sur l'une des glissières, juste assez pour permettre aux glissières de glisser. 2. Repérez les marques précédemment faites sur les bandes EIA (Electronic Industries Alliance) à l'avant et à l'arrière de l'armoire. Utilisez les lettres L ou R imprimées sur le support avant de votre première glissière pour l'aligner avec ces marques du côté correspondant de l'armoire. La partie inférieure du bord du support de glissière doit se situer légèrement au-dessus de la marque U de la bride de l'armoire. La glissière est fournie avec deux cales d'espacement dans le support avant et deux cales d'espacement dans le support arrière. Si l'armoire comporte des orifices circulaires, les cales d'espacement des supports de glissière s'insèrent dans les orifices circulaires des brides de l'armoire. Si l'armoire comporte des orifices carrés, remplacez les petites cales à l'arrière et à l'avant de la glissière rail par les cales plus grosses qui sont incluses dans le kit de glissières. 3. Fixez la glissière à l'avant de l'armoire avec l'une des vis M5, dans l'orifice supérieur du support. Pour les armoires à orifices carrés, ajoutez une rondelle entre la vis M5 et le support de glissière. Figure 72. Extension des glissières Figure 73. Mise en place des cales 74 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 4. Fixez la glissière à l'arrière de l'armoire, avec deux vis M5 dans les orifices inférieur et supérieur du support de glissière arrière. Pour les armoires à orifices carrés, ajoutez une rondelle entre chaque vis M5 et le support de glissière. 5. Serrez les vis de réglage de glissière. 6. Répétez l'opération de l'étape 1, à la page 74 à l'étape 5 pour l'autre glissière. Résultats Installation du modèle 58/86 sur les glissières Il se peut que vous deviez installer l'unité d'extension sur les glissières. Cette procédure permet d'y parvenir. Outre les informations de sécurité, les illustrations des composants matériel associées sont fournies. Avant de commencer 1. Avant d'installer l'unité d'extension sur les glissières, assurez-vous que les stabilisateurs sont étendus et que la plaque de soutien est fixée au bas de la face avant de l'armoire pour éviter que cette dernière ne bascule vers l'avant lorsque les glissières sont étendues hors de l'armoire. Pour plus d'informations, voir «Consignes de sécurité relatives aux armoires», à la page 102. Pourquoi et quand exécuter cette tâche Pour installer le boîtier d'unité de disque sur les glissières, procédez comme suit : 1. Pour réduire le poids de l'unité et en faciliter l'installation dans l'armoire, retirez les blocs d'alimentation électrique. Pour retirer un bloc d'alimentation électrique, procédez comme suit : a. Sur le côté gauche de l'alimentation électrique, appuyez sur la patte de déverrouillage orange vers la droite, juste assez pour libérer la poignée (pas plus de 6,4 mm (0,24 po)) tout en faisant pivoter la poignée vers le bas. Figure 74. Fixation des glissières Installation en armoire d'une unité centrale ou d'une unité d'extension 75 b. Retirez avec soin l'alimentation électrique de l'unité. c. Répétez cette étape pour l'autre alimentation électrique, puis mettez les deux unités de côté pour une installation ultérieure. 2. A l'aide de deux personnes, soulevez le boîtier d'unité de disque et positionnez-le sur l'avant des glissières. Avertissement : Deux personnes sont nécessaires pour soulever le boîtier d'unité de disque. Réaliser cette opération avec moins de deux personnes peut entraîner des blessures. 3. Faites glisser le boîtier d'unité de disque dans l'armoire, en vous assurant qu'il s'insère dans les pattes situées à l'arrière de la glissière. Figure 75. Retrait d'alimentations électriques Figure 76. Installation du boîtier sur les glissières 76 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 4. Fixez l'avant du boîtier de l'unité de disque à l'armoire avec une vis M5 dans l'orifice ouvert de chaque support (n'pas pas besoin de rondelle). 5. Fixez l'arrière du boîtier de l'unité de disque à chaque glissière avec une vis à tête tronconique M4 dans l'orifice rainuré sur l'extérieur de la paroi du boîtier de stockage. 6. Réinstallez les alimentations électriques et les unités de disque dur. 7. Connectez un cordon d'alimentation à chaque alimentation électrique. Utilisez le guide pour cordon d'alimentation pour empêcher le cordon d'alimentation de se déconnecter accidentellement. 8. Installez le panneau frontal gauche, doté de voyants d'alimentation, puis le panneau frontal droit. Placez la découpe de la partie supérieure du panneau frontal sur la patte du rebord du boîtier. Faites ensuite pivoter le panneau frontal vers le bas jusqu'à ce qu'il s'emboîte. Assurez-vous que la surface intérieure du panneau frontal est contre le boîtier. Résultats Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension après son installation dans l'armoire, voir Boîtiers et unités d'extension. Figure 77. Installation des vis de fixation Figure 78. Fixation des panneaux latéraux Installation en armoire d'une unité centrale ou d'une unité d'extension 77 Installation en armoire de l'unité d'extension ou 14G/30 Cette rubrique explique comment installer l'unité d'extension ou 14G/30 dans une armoire. La procédure inclut des informations destinées à promouvoir une opération sécurisée et fiable, et fournit des illustrations sur la façon dont les composants matériels sont liés les uns aux autres. v Cette procédure suppose que vous installez l'unité d'extension ou 14G/30 dans une armoire existante. Si l'armoire n'est pas installée, vous devez effectuer la procédure de «Installation de l'armoire», à la page 5 avant d'effectuer cette procédure pour installer l'unité d'extension dans l'armoire. Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire, ne tentez pas d'installer un modèle autonome dans une armoire. v Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. Pour installer le modèle ou 14G/30 dans une armoire, procédez comme suit : ATTENTION : L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. 1. Faites l'inventaire des composants. v Localisez le rapport des kits dans une boîte d'accessoires. v Vérifiez que vous avez reçu tous les composants commandés. 2. Vérifiez qu'un emplacement est disponible dans l'unité existante ou que l'espace dans l'armoire est suffisant pour installer une nouvelle unité. Mise hors tension du système et coupure de l'alimentation Il se peut que vous deviez mettre hors tension le système et couper l'alimentation. La procédure de cette section permet d'effectuer cette tâche. 1. Vérifiez que le système auquel vous effectuez la connexion exécute le dernier niveau de microprogramme. Pour plus d'informations sur la vérification ou la mise à niveau du microprogramme ou de votre système, consultez la section des mises à jour de microcode dans les informations d'installation de votre système. 2. Après avoir vérifié que votre microprogramme est au dernier niveau, arrêtez et mettez hors tension le système auquel vous effectuez la connexion. Pour plus d'informations sur l'arrêt et la mise hors tension du système, consultez la documentation fournie avec l'unité d'extension. 3. Pour installer une unité d'extension dans une coque existante, voir «Installation de l'unité d'extension» , à la page 83. Sinon, consultez la rubrique «Marquage de l'emplacement». Marquage de l'emplacement Il se peut que vous deviez marquer l'emplacement. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez comme suit : 1. Déterminez l'emplacement de l'unité d'extension dans l'armoire par rapport au reste du matériel du système. Lorsque vous planifiez, gardez les éléments suivants à l'esprit : a. Organisez les unités les plus grandes et les plus lourdes dans la partie inférieure de l'armoire. b. Planifiez l'installation en premier des unités dans la partie inférieure de l'armoire. c. Enregistrez les emplacements EIA dans votre plan. 78 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Remarque : Ces unités d'extension font quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure 44,50 mm de hauteur. L'armoire comporte trois orifices de montage pour chaque unité EIA de hauteur. Cette unité centrale mesure donc 7 po (177.8 mm) de haut et couvre 12 orifices de fixation de l'armoire. La figure suivante montre une unité EIA 1 et une unité EIA 4. Selon le constructeur de l'armoire, le séparateur des unités EIA peut être symbolisé par une ligne ou par une couleur. L'espacement entre les trous de la bande EIA est irrégulier. En l'absence d'élément visuel de séparation (couleur ou ligne) des unités EIA dans l'armoire, chaque unité EIA commence à l'endroit où les trous sont les plus rapprochés. A Marque de positionnement B Fixe-écrou 2. Face à l'avant de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera utilisée par votre unité d'extension. Notez l'emplacement de l'EIA. Utilisez du ruban adhésif, un marqueur ou un crayon pour marquer l'armoire au niveau de l'orifice supérieur de cette unité EIA (A). Vérifiez que la marque est visible depuis l'avant et l'arrière de l'armoire. 3. Commencez par l'orifice identifié par votre marque pour (A). Comptez deux orifices et faites une marque (B) en regard du deuxième orifice. Vérifiez que la marque est visible depuis l'avant et l'arrière de l'armoire. 4. Commencez par l'orifice identifié par votre marque pour (B). Comptez deux orifices et placez un écrou clip. L'écrou clip (C) permet de fixer l'unité d'extension à l'armoire. 5. Commencez par l'orifice identifié par votre marque pour (B). Comptez six orifices et placez un écrou clip. L'écrou clip (D) permet de fixer l'unité d'extension à l'armoire. 6. Répétez les quatre dernières étapes pour faire des marques et placer des écrous clips sur le côté gauche de l'armoire. 7. Face à l'arrière de l'armoire, et en travaillant depuis le côté droit, repérez l'unité EIA inférieure qui sera utilisée par votre unité d'extension. Faites une marque sur l'armoire au niveau de l'orifice supérieur de cette unité EIA (E). 8. Commencez par l'orifice identifié par votre marque pour (E). Comptez deux orifices et faites une marque (F) en regard du deuxième orifice. 9. Commencez par l'orifice identifié par votre marque pour (F). Comptez quatre orifices et faites une marque (G) en regard du quatrième orifice. 10. Répétez les trois dernières étapes pour faire des marques sur le côté gauche de l'armoire. Figure 79. Unités EIA avec glissières extensibles pour l'unité d'extension 5796 Installation en armoire d'une unité centrale ou d'une unité d'extension 79 Résultats Mise en place des glissières extensibles et du support d'extension dans l'armoire Il se peut que vous deviez mettre en place des glissières dans l'armoire. La procédure de la présente section vous permet d'effectuer cette tâche. Outre les informations de sécurité, cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche Avertissement : L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. Pour installer les glissières extensibles dans l'armoire, procédez comme suit : 1. Desserrez les deux vis (B) situées au milieu des deux glissières (A) pour les étendre. 2. A l'arrière de l'armoire, étendez la glissière de gauche.Insérez ses ergots (C) dans les encoches indiquées afin de l'orienter vers le haut (en forme de L). 3. A l'arrière de l'armoire, étendez la glissière de droite et insérez ses ergots (C) dans les encoches indiquées. 4. A l'avant de l'armoire, étendez la glissière de gauche et insérez ses ergots (C) dans les encoches indiquées. 5. Etendez la glissière de droite et insérez les ergots (C) dans les encoches indiquées. 6. A l'aide d'une vis M5 (D), fixez la glissière de gauche à l'avant de l'armoire dans l'emplacement inférieur (E) Figure 80. Extension des glissières 80 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Important : Ne placez pas de vis dans l'emplacement supérieur pour l'instant. Sinon, le boîtier sera fixé à l'assemblage. . 7. A l'aide d'une vis M5 (D), fixez la glissière de droite à l'avant de l'armoire dans l'emplacement inférieur (E). 8. Placez-vous à l'arrière de l'armoire et repérez les marques faites sur la glissière arrière gauche. A l'aide de trois vis M5 (G), fixez le bras et le support (F) de chaque glissière à l'armoire à ces emplacements. Remarque : La glissière doit être orientée vers l'intérieur. Figure 81. Fixation des glissières à l'avant de l'armoire. Installation en armoire d'une unité centrale ou d'une unité d'extension 81 9. Repérez les marques qui ont été faites sur la glissière arrière droite de l'armoire. A l'aide de trois vis M5 (G), fixez le bras et le support (F) de chaque glissière à l'armoire à ces emplacements. 10. Serrez les deux vis (B) situées au milieu de la glissière de gauche et les deux vis (B) au milieu de la glissière de droite. Voir figure 80, à la page 80. Résultats Installation du boîtier Il se peut que vous deviez installer le boîtier. La procédure de cette section permet d'effectuer cette tâche. Cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche Remarque : Avant d'installer le boîtier, vérifiez que les glissières sont alignées horizontalement et verticalement. Pour installer la coque, procédez comme suit : 1. A l'avant de l'armoire, placez le boîtier (A) sur les glissières et faites-le glisser dans l'armoire. Figure 82. Fixation des glissières à l'arrière de l'armoire 82 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension A Boîtier B Vis de fixation (quatre) 2. Mettez en place et serrez les quatre vis de fixation (B) à l'avant du boîtier (A) afin de fixer ce dernier à l'armoire. Résultats Installation de l'unité d'extension Il se peut que vous deviez installer l'unité d'extension dans une armoire. La procédure de cette section permet d'effectuer cette tâche. Cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche Pour installer l'unité d'extension dans une armoire, procédez comme suit : 1. Retirez les capots de transport avant et arrière du modèle 57/96 ou 14G/30. Chacun de ces capots de transport est assujetti au moyen de quatre vis à serrage à main. Deux vis sont situées au-dessus du capot et deux sur les côtés du capot. 2. Si vous installez une unité d'extension (H) dans un boîtier existant (G) qui dispose d'un panneau obturateur (F), retirez ce panneau en appuyant sur les pattes supérieures puis en le tirant vers vous. Figure 83. Installation du boîtier dans l'armoire sur les glissières extensibles Installation en armoire d'une unité centrale ou d'une unité d'extension 83 F Panneau obturateur G Boîtier H Tiroir I Vis moletée J Support de câble 3. Faites glisser l'unité d'extension (H) dans le boîtier. 4. Si vous installez un 5796, installez le support de câble (J) à l'aide d'une vis moletée (I). 5. Si vous avez deux unités d'extension à installer, répétez les quatre dernières étapes pour installer la deuxième unité d'extension. 6. Si une seule unité d'extension est installée dans le boîtier, installez le panneau obturateur (N) dans l'emplacement vide du boîtier en faisant glisser le panneau obturateur sous la languette supérieure du boîtier. Appuyez doucement sur le panneau obturateur jusqu'à ce qu'il s'emboîte dans son logement. Figure 84. Retrait du panneau obturateur Figure 85. Retrait du panneau obturateur 84 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension N Panneau obturateur 7. Fixez le carter avant. Voir Installation du carter avant sur un modèle 7314-G30 ou 5796. Résultats Fixation des câbles d'alimentation et du réseau de contrôle de l'alimentation système (SPCN) Il se peut que vous deviez fixer à l'unité d'extension les câbles d'alimentation et du réseau de contrôle de l'alimentation système (System Power Control Network, ou SPCN). La procédure de cette section permet d'effectuer cette tâche. Pour plus d'instructions sur le câblage et la connexion de vos unités d'extension, voir Boîtiers et unités d'extension. Mise sous tension du système Il se peut que vous deviez mettre l'unité d'extension sous tension. Pour plus d'informations sur la mise sous tension de l'unité d'extension, voir Démarrage du système ou de la partition logique. Installation en armoire du modèle 57/86 ou 31D/24 Il se peut que vous deviez installer une unité d'extension dans une armoire. La présente section explique comment effectuer ces tâches. Cette procédure suppose que l'installation de l'unité d'extension s'effectue dans une armoire existante. Si l'armoire n'est pas installée, suivez les instructions du «Installation de l'armoire», à la page 5, puis revenez à la présente section pour obtenir des instructions sur l'installation de l'unité d'extension dans l'armoire. Une fois que vous avez installé l'unité d'extension dans l'armoire, vous devez suivre les instructions indiquées à la section Unités de disque pour connecter et configurer le modèle 31D/24 ou 57/86. Figure 86. Installation du panneau obturateur Installation en armoire d'une unité centrale ou d'une unité d'extension 85 Remarque : Cette procédure ne concerne que les modèles conçus pour une armoire. Pour installer l'unité d'extension dans une armoire, procédez comme suit : Avertissement : L'installation des glissières dans l'armoire est une opération complexe. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité d'extension. 1. Consultez la rubrique «Consignes de sécurité relatives aux armoires», à la page 102. Le non-respect de ces instructions peut endommager le matériel ou provoquer des lésions corporelles. 2. Faites l'inventaire des composants (voir «Inventaire des composants»). 3. Identifiez le kit de montage en armoire (A) et les guides du système (B) fournis avec votre unité d'extension. Voir la figure 87 pour connaître les emplacements relatifs de ces éléments. Les guides sont identiques. Remarque : Le tournevis à douille 8 mm et le tournevis à lame plate représentés dans le diagramme suivant ne figurent pas dans l'inventaire, ni dans les pièces livrées, mais ils vous permettront d'effectuer l'installation. 4. Choisissez l'emplacement d'installation de l'unité d'extension dans l'armoire Pour plus d'informations, voir «Détermination de l'emplacement», à la page 87. Cette unité d'extension fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA mesure 4,45 mm de hauteur. L'armoire contient trois trous de montage pour chaque unité EIA sur la hauteur. 5. Pour le marquage de l'emplacement, voir «Marquage de l'emplacement», à la page 88. Inventaire des composants Il se peut que vous deviez remplir l'inventaire des composants. La procédure de cette section permet d'effectuer cette tâche. Figure 87. Inventaire des composants 86 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Pourquoi et quand exécuter cette tâche Si vous ne l'avez pas encore fait, effectuez un inventaire des composants avant de poursuivre l'installation : Procédure 1. Localisez le rapport des kits dans une boîte d'accessoires. 2. Vérifiez que vous avez reçu tous les composants commandés. Résultats En cas de composants manquants, endommagés ou ne correspondant pas à la commande, vous disposez des contacts suivants : v Revendeur Détermination de l'emplacement Il se peut que vous deviez déterminer l'emplacement de l'unité d'extension dans l'armoire. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Avant d'installer l'unité d'extension dans une armoire, procédez comme suit : Procédure 1. Consultez la rubrique «Consignes de sécurité relatives aux armoires», à la page 102. 2. Choisissez l'emplacement d'installation des unités. Placez les unités les plus encombrantes et les plus lourdes dans la partie inférieure de l'armoire. Ces unités d'extension font quatre unités EIA (Electronic Industries Alliance) de hauteur. L'armoire contient trois trous de montage par unité EIA sur la hauteur. Chacune de ces unités d'extension mesure donc 177,8 mm de hauteur et couvre 12 trous de vis sur l'armoire. 3. Enlevez les panneaux obturateurs pour permettre l'accès à l'intérieur du boîtier de l'armoire. Installation en armoire d'une unité centrale ou d'une unité d'extension 87 4. Retirez les volets avant et arrière si nécessaire. 5. Pour le marquage de l'emplacement, voir «Marquage de l'emplacement». Marquage de l'emplacement Il se peut que vous deviez marquer l'emplacement d'installation de l'armoire. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour marquer l'emplacement de l'installation et installer les écrous clip dans une armoire, procédez comme suit : Procédure 1. Déterminez l'emplacement de l'unité dans l'armoire et notez les numéros d'emplacement. Cette unité fait quatre unités EIA (Electronic Industries Alliance) de hauteur. Une unité EIA correspond à 1,75 po Figure 88. Retrait des panneaux obturateurs 88 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension (44,45 mm) de hauteur. L'armoire contient trois trous de montage par unité EIA sur la hauteur. Cette unité fait donc 7 po (177,8 mm) de haut et correspond à 12 orifices de fixation sur l'armoire. 2. Face à l'avant de l'armoire, et en travaillant depuis le côté gauche, repérez l'unité EIA inférieure qui sera utilisée par votre unité. Notez l'emplacement de l'EIA. Utilisez du ruban adhésif, un marqueur ou un crayon pour marquer le trou de vis inférieure (B) de cette unité EIA. Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. 3. Sur la glissière gauche, comptez deux trous de vis depuis l'endroit où vous avez placé la marque à l'étape 2 et placez une autre marque à côté de ce trou de vis (B). Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. Vous avez maintenant deux marques sur l'armoire, avec un trou de vis entre ces marques. 4. Répétez les étapes 2 et 3 pour placer deux marques sur les trous de vis correspondants sur la glissière avant droite. 5. Placez-vous à l'arrière de l'armoire. Sur le côté gauche, localisez l'unité EIA qui correspond à l'unité EIA inférieure indiquée sur le devant de l'armoire. 6. Placez une marque au niveau de l'orifice supérieur (C) de l'unité EIA la plus basse. Marquez l'armoire de sorte que la marque reste visible au dos de l'armoire. 7. Comptez deux trous de vis depuis l'endroit où vous avez placé la la marque à l'étape 6 et placez une autre marque à côté de ce trou de vis (C). 8. Répétez les étapes 6 et 7 pour placer deux marques sur les trous de vis correspondants sur la glissière arrière droite. Que faire ensuite Fixation du support de montage dans l'armoire Il se peut que vous deviez fixer le support de montage dans l'armoire. La procédure de cette section permet d'effectuer cette tâche. Outre les informations de sécurité permettant d'effectuer la procédure de façon fiable et sans danger, cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche ATTENTION : L'installation des glissières est une procédure délicate. Pour une installation correcte des glissières, respectez l'ordre des tâches ci-dessous. Si vous ne le respectez pas, vous risquez de vous blesser ou d'endommager l'unité centrale. Figure 89. Marquage des emplacements d'installation Installation en armoire d'une unité centrale ou d'une unité d'extension 89 Pour installer le support de montage dans l'armoire, procédez comme suit : Procédure 1. Desserrez les deux vis (B) situées au milieu des deux glissières (A) pour les étendre. 2. A l'arrière de l'armoire, étendez la glissière gauche et insérez ses ergots (C) dans les encoches indiquées afin de l'orienter vers le haut. Remarque : Bien que l'installation de la glissière puisse être effectuée par une seule personne, la procédure sera facilitée si une personne est positionnée à l'avant de l'armoire et une autre à l'arrière. 3. A l'arrière de l'armoire, étendez la glissière de droite et insérez ses ergots (C) dans les encoches indiquées. 4. A l'avant de l'armoire, étendez la glissière de gauche et insérez ses ergots (C) dans les encoches indiquées. 5. Etendez la glissière de droite et insérez les ergots (C) dans les encoches indiquées. 6. A l'aide des deux vis M5 (D), fixez la glissière de gauche à l'avant de l'armoire sous chaque ergot de glissière (C). Figure 90. Extension des glissières 90 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 7. A l'aide de deux vis M5 (D), fixez la glissière de droite à l'avant de l'armoire. 8. Placez-vous à l'arrière de l'armoire et fixez la glissière de gauche à l'arrière de l'armoire à l'aide des deux vis M5 (D). 9. Fixez la glissière de droite à l'arrière de l'armoire en utilisant les deux vis M5 (D). 10. Serrez les deux vis sur la partie médiane de la glissière gauche et de la glissière droite. Pour plus d'informations, voir la figure 91. 11. Insérez deux écrous clip dans l'emplacement indiqué sur la glissière avant gauche. 12. Insérez deux écrous clip dans les emplacements indiqués sur la glissière avant droite. Résultats Installation du modèle 57/86 ou 31D/24 sur les glissières Il se peut que vous deviez installer l'unité d'extension sur les glissières. La procédure de cette section permet d'effectuer cette tâche. Outre les informations de sécurité, cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pourquoi et quand exécuter cette tâche Avant d'installer l'unité d'extension sur les glissières, assurez-vous que les stabilisateurs sont étendus et que la plaque de soutien est fixée au bas de la face avant de l'armoire pour éviter que cette dernière ne bascule vers l'avant lorsque les glissières sont étendues hors de l'armoire. Pour plus d'informations, voir «Consignes de sécurité relatives aux armoires», à la page 102. Pour installer l'unité d'extension sur ses glissières, procédez comme suit : Figure 91. Fixation des glissières Installation en armoire d'une unité centrale ou d'une unité d'extension 91 Procédure 1. Si le panneau frontal est installé à l'avant de l'unité d'extension, retirez-le en appuyant sur les deux boutons de déverrouillage. 2. En vous faisant aider par trois autres personnes, soulevez l'unité d'extension et positionnez-la sur l'avant des glissières. Avertissement : Trois personnes sont nécessaires pour effectuer cette opération en toute sécurité. Le non-respect de cette recommandation peut provoquer des blessures. 3. Poussez l'unité d'extension dans l'armoire jusqu'à ce qu'elle soit complètement calée. 4. Vérifiez si les plaques de fixation sont déjà installées à l'arrière de l'unité d'extension. Si ce n'est pas le cas, fixez-les à droite et à gauche à l'arrière de l'unité. Les plaques de fixation sont illustrées à la figure 94, à la page 93. 5. Installez les deux vis de fixation M5 A sur le châssis et dans les écrous clip (B) sur les glissières droite et gauche. Figure 92. Retrait du panneau frontal 92 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 6. A l'arrière de l'armoire, installez les deux vis de fixation M5 (C) à l'arrière des deux plaques de fixation droite et gauche et insérez-les sur les glissières. 7. Installation du panneau frontal sur l'unité d'extension. Figure 93. Installation des vis de fixation Figure 94. Fixation de l'unité d'extension à l'arrière de l'armoire Installation en armoire d'une unité centrale ou d'une unité d'extension 93 Résultats Pour obtenir des instructions sur la manière de connecter et de configurer l'unité d'extension une fois que vous avez terminé l'installation de 57/86 ou de 31D/24 sur l'armoire, voir Boîtiers et unités d'extension. Fixation du bras de routage des câbles aux glissières standard Il se peut que vous deviez fixer le bras de routage des câbles. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Si l'unité d'extension est fournie avec les glissières standard (longueur fixe) et que le bras de routage des câbles n'y est pas fixé, procédez comme suit. Sinon, passez à la rubrique «Détermination de l'emplacement», à la page 87. Procédure 1. Insérez la languette de la glissière du bras de routage des câbles (B) dans la glissière. 2. Faites pivoter le bras de routage des câbles (A) (voir la figure suivante). Figure 95. Installation du panneau frontal 94 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 3. Fixez le bras de routage des câbles (A) à l'aide de la vis de fixation (C). 4. Répétez les étapes 1 à 3 pour l'autre glissière. Résultats Figure 96. Rotation du bras de routage des câbles Installation en armoire d'une unité centrale ou d'une unité d'extension 95 96 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Retrait et remise en place de panneaux latéraux 14T/00 ou 14T/42 Il se peut que vous deviez retirer et remettre en place un panneau latéral sur une armoire. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Il se peut que l'armoire soit dotée de panneaux latéraux facultatifs. Pour retirer et remettre en place un panneau latéral, procédez comme suit : Procédure 1. Déverrouillez les panneaux latéraux en appuyant sur les taquets de verrouillage pour les débloquer. Remarque : Si votre armoire utilise le kit de renforcement, retirez la vis de fixation pour que chaque panneau latéral puisse être extrait. Pour plus d'informations, voir «Libération du panneau latéral à l'aide d'un kit de renforcement», à la page 110. 2. Inclinez légèrement la partie supérieure du panneau latéral vers vous. 3. Dégagez le panneau latéral de la rainure située au bas de l'armoire en le soulevant. 4. Répétez cette procédure pour l'autre panneau latéral. Figure 97. Retrait du panneau latéral 97 Remise en place d'un panneau latéral 14T/00 ou 14T/42 Il se peut que vous deviez remettre en place un panneau latéral sur l'armoire. La procédure de la présente section vous permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour remettre en place un panneau latéral, procédez comme suit : Procédure 1. Inclinez légèrement la partie supérieure du panneau latéral vers vous. 2. Placez la partie inférieure du panneau latéral sur la rainure située au bas de l'armoire. 3. Mettez la partie supérieure du panneau latéral en place en la faisant glisser et fermez les taquets de verrouillage. Remarque : Si votre armoire utilise le kit de renforcement, installez une vis de fixation dans chaque panneau latéral installé. Pour plus d'informations, voir le «Kit de renforcement», à la page 109. Figure 98. Remise en place du panneau latéral 98 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Retrait et remise en place de panneaux d'habillage 14T/00 ou 14T/42 Les boîtiers installés avec des systèmes à tiroir de processeurs multiples peuvent utiliser les panneaux d'habillage avant au lieu de portes. Pour les armoires qui utilisent les panneaux d'habillage, un type de panneau à interférences réduites doit être installé lorsque certains modèles d'unité d'extension sont présents. Utilisez la procédure décrite dans cette section pour retirer les panneaux d'habillage existants de l'armoire et les remplacer par le type de panneaux d'interférences réduites. Pourquoi et quand exécuter cette tâche Pour retirer les panneaux d'habillage armoire existants, exécutez la procédure suivante: Procédure 1. Placez les deux mains au centre du panneau d'habillage latéral droit. 2. Appuyez fermement vers l'intérieur pour libérer les pinces qui maintiennent le panneau en place. 3. Faites pivoter vos mains légèrement jusqu'à ce que le panneau soi dégagé. 4. Soulevez le panneau, puis mettez-le de côté. 5. Répétez cette procédure pour retirer le panneau d'habillage latéral gauche. Remise en place de panneaux d'habillage 14T/00 ou 14T/42 Pour les armoires qui utilisent des panneaux d'habillage au lieu de portes, un type de panneau à interférences réduites doit être installé lorsque certains modèles d'unité d'extension sont présents. La procédure de cette section permet d'effectuer cette tâche. Figure 99. Retrait du panneau d'habillage de l'armoire 99 Pourquoi et quand exécuter cette tâche Pour installer les panneaux d'habillage, procédez comme suit : Procédure 1. Alignez la plaque inférieure (A) du panneau d'habillage latéral droit avec le bas de l'armoire 2. Alignez le haut du panneau d'habillage (B) et serrez légèrement avec vos doigts. 3. Une fois le panneau d'habillage à l'emplacement approprié, relâchez la pression pour permettre aux pinces de fixation de maintenir le panneau en place. 4. Répétez cette procédure pour installer le panneau d'habillage latéral gauche. Figure 100. Installation du panneau d'habillage de l'armoire 100 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Fixation des volets de l'armoire Il se peut que vous deviez fixer les volets de l'armoire. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Le volet avant est facultatif pour certains modèles d'armoire. Si votre système est déjà équipé d'un volet avant ou si vous ne souhaitez pas installer de volet avant, passez directement à l'étape suivante. Fixation d'un volet avant de type High-perforation Il se peut que vous deviez fixer un volet avant à l'armoire. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Pour installer le volet avant de type High-perforation, procédez comme suit : Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Retirez les panneaux d'habillage haut, gauche et droit. Pour plus d'informations sur le retrait des panneaux d'habillage gauche et droit d'une armoire 14T/00 ou 14T/42, voir «Retrait et remise en place de panneaux d'habillage 14T/00 ou 14T/42», à la page 99. Figure 101. Fixation du volet de l'armoire 101 1 Boîtier d'armoire 4 Panneau d'habillage droit 2 Panneau d'habillage haut 5 Pince de fixation 3 Panneau d'habillage gauche 3. Installez le loquet du volet sur la droite et les charnières sur la gauche. 4. Pour un volet avant High Perforation, alignez le volet sur la charnière de l'armoire, puis placez l'ergot au-dessus et descendez-le sur la charnière. 5. Réglez le loquet afin que le volet se ferme correctement. Consignes de sécurité relatives aux armoires Vous devez lire les consignes de sécurité relatives aux armoires avant d'installer votre équipement. Pourquoi et quand exécuter cette tâche Avant d'installer une armoire, des fonctionnalités d'armoire ou une unité centrale ou d'extension, lisez les consignes de sécurité suivantes. Avertissement : Si vous installez le matériel dans une armoire , elle doit être conforme à la spécification EIA 310D. L'installation du matériel dans l'armoire sans glissière de montage conçue à cet effet risque d'endommager le matériel ou de provoquer des lésions corporelles. Figure 102. Retrait des panneaux d'habillage 102 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension DANGER Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Précautions à prendre pour soulever un dispositif : Fixation des volets de l'armoire 103 104 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Installation du kit de sécurité de l'armoire Il se peut que vous deviez installer le kit de sécurité de l'armoire. La procédure de la présente section vous permet d'effectuer cette tâche. Cette section comporte également des illustrations des composants matériel concernés et montre comment ces composants sont liés les uns aux autres. Pour installer un kit de sécurité de l'armoire comprenant un verrou et des glissières de sécurité, procédez comme suit : 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Effectuez l'inventaire du kit de sécurité. v 1 Deux kits de verrouillage. Chaque kit contient : - Verrou d'armoire - Support - Vis - Deux clés 2 Deux glissières de sécurité 3 Deux autocollants Verrouillé/Déverrouillé 3. Retirez le loquet du volet. Figure 103. Inventaire du kit de sécurité de l'armoire 105 a. Ouvrez le volet avant de l'armoire. b. Sur la partie intérieure du volet, retirez la vis (4) qui fixe le verrou sur le volet de l'armoire (voir la figure 104). c. Retirez le taquet (5). d. Retirez le loquet (6) situé sur la partie extérieure du volet. Remarque : Si l'armoire est équipée d'un kit de renforcement, retirez le contre-écrou et l'écrou six pans du loquet du volet et réinstallez-les sur le nouveau taquet de verrouillage. 4. Installez le verrou. a. Sur la partie avant du volet, insérez le verrou à clé de l'armoire dans le logement du loquet (6) (voir la figure 104). b. Installez le verrou en fixant le support (5) à l'aide de la vis (4) sur la partie intérieure du volet. 5. Répétez les étapes 3, à la page 105 et 4 pour installer le second verrou sur le volet arrière de l'armoire. 6. Installez une glissière de sécurité sur le côté droit de l'armoire. Figure 104. Retrait du loquet du volet 106 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Remarque : Les glissières possèdent chacune deux longs taquets situés sur la partie inférieure. Elles sont identiques et peuvent s'installer sur le panneau latéral gauche ou droit. a. Déverrouillez le panneau latéral droit afin de pouvoir accéder à la partie supérieure du panneau. b. Placez la partie plate de la glissière (7) face à la partie intérieure du panneau (8) (voir la figure 105) et insérez les deux taquets (9) de la glissière dans les deux canaux de support verticaux (10) situés sur le panneau latéral. Remarque : Lorsque la glissière est correctement installée, elle doit coulisser de l'avant vers l'arrière. c. Réinstallez le panneau latéral sur l'armoire. d. Verrouillez les panneaux latéraux en plaçant les glissières à l'avant de l'armoire. e. Placez un autocollant Verrouillé/Déverrouillé sur la partie intérieure du panneau afin que le taquet se trouve sur le symbole Verrouillé lorsque la glissière est en position de verrouillage (11) (voir la figure 106, à la page 108) et sur le symbole Déverrouillé (12), lorsque la glissière est déverrouillée. Figure 105. Installation d'une glissière de sécurité Installation du kit de sécurité de l'armoire 107 f. Répétez cette procédure pour le côté gauche de l'armoire. Figure 106. Positionnement de l'autocollant Verrouillé/Déverrouillé sur le panneau 108 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Kit de renforcement Il se peut que vous deviez retirer ou remplacer un composant du kit de renforcement. La présente section explique comment effectuer ces tâches. Le kit de renforcement doit être commandé en même temps que l'armoire. Les équerres du kit de renforcement sont installées par le fabricant. Remarque : Si vous installez une armoire à l'aide du kit de renforcement et que vous avez besoin de fixer l'armoire au sol, voir «Installation de l'armoire», à la page 5. L'illustration suivante décrit le contenu du kit et l'emplacement approximatif des supports et des charnières pour le cas où vous auriez besoin de désinstaller ou de réinstaller une pièce. 1 Armoire 9 Charnières de l'équerre 2 Entretoise 10 Equerre 3 Charnière de l'équerre 11 Monture de vis de fixation du volet latéral 4 Goujons de pivot de la charnière 12 Entretoise 5 Taquet de blocage de l'équerre 13 Vis 6 Entretoise 14 Rondelle 7 Vis 15 Vis de fixation du volet latéral 8 Vis moletée de l'équerre Figure 107. Composants du kit de renforcement 109 Libération de l'équerre renforcée Il se peut que vous deviez libérer l'équerre renforcée. La procédure de cette section permet d'effectuer cette tâche. Pour accéder à la partie arrière d'un système installé dans une armoire équipée d'un kit de renforcement, effectuez les étapes suivantes pour libérer l'équerre renforcée : 1. Enlevez la vis moletée de l'équerre (8). 2. Sortez l'équerre (10) hors de l'armoire. 3. Accédez au système, puis ré-installez l'équerre. Libération du panneau latéral à l'aide d'un kit de renforcement Il se peut que vous deviez libérer le panneau latéral sur l'armoire. Cette procédure permet d'effectuer cette tâche. Le kit de renforcement contient des vis qui permettent la fixation des panneaux latéraux à l'armoire. Pour retirer une vis de serrage, procédez comme suit : 1. Si nécessaire, ouvrez ou retirez le volet arrière de l'armoire. 2. Repérez la monture de vis de fixation (11) du volet latéral à retirer. Figure 108. Desserrement de l'équerre pour accéder aux systèmes 110 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 3. A l'aide d'un tournevis, retirez la vis de fixation (15) et la rondelle (14) du panneau latéral. Pour retirer le panneau latéral, voir «Retrait et remise en place de panneaux latéraux 14T/00 ou 14T/42», à la page 97. Figure 109. Retrait des vis de serrage sur le panneau latéral Kit de renforcement 111 112 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Connexion de plusieurs armoires à l'aide d'un kit de connexion multiarmoire Il se peut que vous deviez relier plusieurs armoires. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche La présente section explique comment connecter plusieurs armoires à l'aide d'un kit de connexion multiarmoire. Pour ce faire, munissez-vous du kit de connexion multiarmoire. Pour connecter plusieurs armoires à l'aide d'un kit de connexion multiarmoire, procédez comme suit : Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. S'ils sont en place, retirez les panneaux latéraux de chaque armoire. Ne retirez les panneaux latéraux que des côtés qui seront reliés l'un à l'autre. Pour ce faire : a. Soulevez les deux pattes de déverrouillage du panneau. b. Retirez le panneau du boîtier de l'armoire en le soulevant afin de le dégager des deux pattes inférieures J. c. Placez à l'écart les panneaux latéraux. 3. Retirez les deux pattes Z et les deux pattes J. Ces pattes servent à accrocher les panneaux latéraux. 4. Installez les deux premières vis de butée dans les angles supérieur gauche et inférieur droit de la première armoire (voir la figure 110, à la page 114). 5. Installez les deux autres vis de butée dans les angles supérieur gauche et inférieur droit de la deuxième armoire (voir la figure 110, à la page 114). 6. Fixez la mousse comme illustré sur la figure 110, à la page 114. Sur les modèles d'armoire T42, placez la bande courte de mousse protectrice dans le prolongement de la bande longue et placez l'ensemble sur la longueur du cadre de l'armoire. 7. Positionnez les armoires ensemble. 8. Alignez les trous des vis de butée. Au besoin, réglez les pieds de nivellement. 9. Placez une vis et une rondelle dans les quatre emplacements correspondants, sans les serrer. 113 10. Une fois toutes les armoires reliées les unes aux autres, ajustez leur niveau. 11. Serrez toutes les vis. 12. Enclipsez les couvre-joints entre les armoires avant et arrière. 13. Enclipsez les couvre-joints supérieurs entre les armoires. 14. A l'aide des panneaux obturateurs, masquez les espaces vides situés à l'avant des armoires. Tous les espaces situés à l'avant de l'armoire doivent également être bouchés, y compris les espaces entre le matériel. Cette étape permet de garantir une bonne ventilation dans l'armoire. 15. Connectez les câbles séparant les armoires. 16. Pour installer des plaques de soutien, passez à l'étape 5 de «Installation de l'armoire», à la page 5. Figure 110. Retrait des panneaux latéraux et des pattes Z et J, mise en place des vis de butée et de la longue bande de mousse pour relier plusieurs armoires Figure 111. Emplacement des bandes de mousse protectrices (vue de dessus) 114 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Alarme de statut d'armoire L'alarme de statut d'armoire se place au sommet d'une armoire et peut être reliée à plusieurs unités centrales montées dans l'armoire. Le voyant de l'alarme de statut d'armoire est activé par le microprogramme du serveur en cas de problème de statut, ce qui permet d'identifier le problème dans les meilleurs délais. Cette opération incombe au client. Vous pouvez exécuter cette tâche vous-même ou faire appel à un fournisseur de services. Il vous imputera probablement les frais correspondants. Ces procédures supposent que les unités centrales ont été installées dans l'armoire et reliées au bloc d'alimentation. Si tel n'est pas le cas, reportez-vous aux instructions d'installation relatives à l'unité centrale. Installation de l'alarme de statut d'armoire Il se peut que vous deviez installer l'alarme de statut d'armoire. La procédure de cette section permet d'effectuer cette tâche. Avant de commencer Pour installer l'alarme de statut d'armoire, vous devez disposer des éléments suivants : v Une alarme de statut d'armoire v Au moins deux câbles d'alarme de statut d'armoire ; un pour connecter l'alarme à la boîte de jonction et un autre pour chaque unité centrale que vous souhaitez connecter à l'alarme v Un ou deux cordons d'alimentation v Une ou plusieurs boîtes de jonction pour l'alarme de statut d'armoire. v Un ou plusieurs câbles de boîte de jonction. Pourquoi et quand exécuter cette tâche Pour installer l'alarme de statut d'armoire, procédez comme suit : Procédure 1. Placez l'alarme de statut d'armoire (A) sur l'armoire, en dirigeant le voyant vers l'avant de l'armoire. 115 2. Connectez l'une des extrémités du câble de l'alarme de statut d'armoire (1) au port situé à l'arrière de l'alarme. 3. Faites passer le câble de l'alarme de statut d'armoire par le capot d'accès des câbles de l'armoire (B). 4. Connectez l'autre extrémité du câble de l'alarme de statut d'armoire au port de sortie de la boîte de jonction (C). 5. Pour relier la boîte de jonction à l'unité centrale, connectez l'une des extrémités d'un câble de l'alarme de statut d'armoire (1) à un port d'entrée de la boîte de jonction. Figure 112. Connexion de l'alarme de statut d'armoire à l'unité centrale via la boîte de jonction Figure 113. Connexion de l'alarme de statut d'armoire et de l'unité centrale à la boîte de jonction 116 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 6. Connectez l'autre extrémité du second câble de l'alarme de statut d'armoire au port d'alarme de l'unité centrale. Ce port est indiqué par un symbole représentant une alarme. Remarque : L'orientation du port peut être différente sur certaines unités centrales. 7. Pour connecter d'autres unités centrales à l'alarme de statut d'armoire, répétez les étapes 4 et 5 pour chaque unité centrale supplémentaire. Vous pouvez connecter plusieurs boîtes de jonction pour ajouter des unités centrales. Pour plus d'informations, voir «Connexion de plusieurs boîtes de jonction». 8. Connectez les cordons d'alimentation à l'alarme de statut d'armoire. Un seul cordon d'alimentation est requis, mais vous pouvez en utiliser deux par sécurité. 9. Branchez l'autre extrémité des cordons d'alimentation au bloc d'alimentation de l'armoire. Résultats Si vous installez l'alarme de statut d'armoire lors de la configuration initiale du système, les informations d'identification de l'armoire seront transmises au démarrage du système. En revanche, si vous installez l'alarme de statut d'armoire sur des unités centrales en cours d'exécution, vous devrez redémarrer le système. Connexion de plusieurs boîtes de jonction Il se peut que vous deviez connecter plusieurs boîtes de jonction. La présente section explique comment effectuer ces tâches. Pourquoi et quand exécuter cette tâche Vous pouvez connecter plusieurs boîtes de jonction pour pouvoir ajouter d'autres unités centrales à l'alarme de statut d'armoire. Pour connecter plusieurs boîtes de jonction, vous avez besoin du matériel supplémentaire suivant : v Une ou plusieurs boîtes de jonction d'alarme de statut d'armoire v Un ou plusieurs câbles de boîte de jonction. Vous avez besoin d'un câble de boîte de jonction par boîte de jonction à ajouter. Procédure 1. Connectez une extrémité du câble de boîte de jonction (2) dans un port d'entrée de la première boîte de jonction. Figure 114. Port d'alarme d'état de l'armoire Alarme de statut d'armoire 117 2. Connectez un câble de boîte de jonction dans le port de sortie de la boîte de jonction supplémentaire. 3. Raccordez des unités centrales supplémentaires à cette boîte de jonction comme indiqué dans «Installation de l'alarme de statut d'armoire», à la page 115. 4. Répétez les étapes 1 - 3 pour chaque boîte de jonction supplémentaire. Figure 115. Connexion de l'alarme de statut d'armoire et de l'unité centrale à la boîte de jonction 118 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Installation ou retrait d'un taquet de verrouillage d'unité centrale montée en armoire Il se peut que vous deviez installer ou retirer un taquet de verrouillage. La procédure de cette section permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Ces procédures décrivent l'installation ou le retrait d'un taquet de verrouillage de l'unité centrale montée en armoire. Vous pouvez exécuter vous-même cette tâche ou faire appel à un fournisseur de services. Il vous imputera probablement les frais correspondants. Pour installer ou enlever un taquet de verrouillage, effectuez les opérations suivantes : Remarque : Les illustrations de ces procédures peuvent présenter des différences par rapport à votre type et modèle de machine. Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Ouvrez le volet avant de l'armoire. 3. Poussez le taquet de verrouillage (A) ou (B) pour le mettre en place sur le côté de l'unité centrale. 4. Fixez le taquet de verrouillage à l'aide des deux vis de fixation (C) sur le côté du taquet. 5. Réinstallez le carter avant. Voir «Retrait et installation des carters sur le système 33E/8B ou 36E/8C», à la page 143. 6. Fermez le volet avant de l'armoire. Figure 116. Réinstallation des taquets de verrouillage 119 Résultats Pour retirer un taquet de verrouillage, procédez comme suit : 1. Retirez le capot situé à l'avant du système. Voir «Retrait et installation des carters sur le système 33E/8B ou 36E/8C», à la page 143. 2. Placez le système en position de maintenance. Voir les informations de maintenance de votre modèle. 3. Ôtez les vis de fixation de verrouillage (C) situés sur le côté du taquet de verrouillage (A) ou (B). 4. Retirez le taquet de verrouillage de l'unité centrale. Figure 117. Retrait des vis de fixation 120 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Retrait et remise en place du capot supérieur sur l'armoire Il se peut que vous deviez retirer ou remettre en place le capot supérieur de l'armoire. Les procédures de cette section permettent d'effectuer ces tâches. Retrait du capot supérieur de l'armoire Il se peut que vous deviez retirer le capot supérieur de l'armoire. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Remarque : Une clé à douille de 10 mm avec rallonge est nécessaire pour retirer les vis du capot. D'autres types d'outils risqueraient d'arrondir la tête de vis qui serait alors indémontable. Pour ôter le capot de l'armoire, procédez comme suit : Procédure 1. Retirez le volet avant. Remarque : Si l'armoire est verrouillée, déverrouillez les volets. Puis débloquez les panneaux latéraux en ôtant les barres de verrouillage situées de chaque côté de la partie supérieure de l'arrière de l'armoire. Faites glisser les barres de verrouillage hors de l'armoire. Les barres de verrouillage peuvent être ôtées depuis l'avant de l'armoire si l'espace dans la zone des barres est suffisant et que les barres sont facilement accessibles. 2. Retirez les panneaux d'habillage haut, gauche et droit. 3. Retirez les panneaux latéraux.Pour plus d'informations, voir «Retrait et remise en place de panneaux latéraux 14T/00 ou 14T/42», à la page 97. 4. Ôtez une vis du côté droit et une vis du côté gauche du capot supérieur. Appliquez la même procédure à l'avant et à l'arrière. 5. Identifiez les renforts avant (A) et arrière (C) de l'armoire, fournis dans le conteneur de transport. Fixez chacun des renforts en haut à l'avant et à l'arrière de l'armoire, juste au-dessous du capot supérieur. 6. Utilisez les quatre vis (B) qui avaient été ôtées du capot supérieur pour fixer chacun des renforts à l'armoire (voir figure 118, à la page 122). 121 A Renfort avant de l'armoire B Vis de fixation (2 vis par renfort) C Renfort arrière de l'armoire 7. Ôtez les six vis restantes des côtés droit et gauche du capot supérieur (D). Figure 118. Fixation des renforts 122 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension D Capot supérieur E Capot d'accès aux câbles F Panneau latéral (quantité : 2) G étiquette EIA 8. Soulevez le capot supérieur. Remise en place du capot supérieur sur l'armoire Il se peut que vous deviez remettre en place le capot supérieur de l'armoire. Cette procédure permet d'effectuer cette tâche. Pourquoi et quand exécuter cette tâche Remarque : Une clé à douille de 10 mm avec rallonge est nécessaire pour remettre en place les vis du capot. D'autres types d'outils risqueraient d'arrondir la tête de vis qui serait alors indémontable par la suite. Pour remettre en place le capot sur l'armoire, procédez comme suit : Procédure 1. Positionnez le capot supérieur (D) sur l'armoire. 2. Insérez les six vis des côtés droit et gauche du capot supérieur. Figure 119. Retrait du capot supérieur Retrait et remise en place du capot supérieur sur l'armoire 123 D Capot supérieur E Capot d'accès aux câbles F Panneau latéral (quantité : 2) G étiquette EIA Remarque : Une clé à douille de 10 mm avec rallonge est nécessaire pour remettre en place les vis du capot. D'autres types d'outils risqueraient d'arrondir la tête de vis qui serait alors indémontable par la suite. 3. Retirez les renforts (A) et (C) en les desserrant dans la partie supérieure de l'armoire, juste au-dessous du capot supérieur. Ôtez les renforts à l'avant et à l'arrière de l'armoire. Figure 120. Retrait du capot supérieur 124 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension A Renfort avant de l'armoire B Vis de fixation (quantité : 2 vis par renfort) C Renfort arrière de l'armoire 4. Insérez les vis (B) que vous avez ôtées des renforts à droite et à gauche du capot supérieur. Insérez les vis à l'avant et à l'arrière de l'armoire. 5. A l'aide des panneaux obturateurs, masquez les espaces vides situés à l'avant de l'armoire. Comblez tous les espaces à l'avant de l'armoire, y compris les espaces entre les éléments de l'équipement.Cette étape permet de garantir une bonne ventilation dans l'armoire. Figure 121. Retrait du capot supérieur Retrait et remise en place du capot supérieur sur l'armoire 125 126 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Unité d'alimentation plus L'unité d'alimentation plus (PDU+) peut être installée dans les armoires 14T/00 et 14T/42. Elle permet de surveiller les charges électriques des unités qui y sont raccordées. Installation de l'unité PDU+ sur le côté d'une armoire Apprenez à installer l'unité d'alimentation plus (PDU+) sur le côté d'une armoire. Avant de commencer Conseil : Le retrait des volets et des panneaux latéraux de l'armoire peut faciliter l'installation. Important : Vous devez utiliser des écrous clip pour installer les étriers de fixation. Les écrous clip sont fournis avec la PDU+ et s'installent sur les poignées de montage. Pourquoi et quand exécuter cette tâche Pour installer la PDU+ dans l'espace de montage 1U sur le côté d'une armoire, procédez comme suit : Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Alignez les étriers de fixation verticaux sur l'avant de la PDU+. Veillez à fixer les crochets de manière à ce que les prises électriques soient face à l'arrière de l'armoire. 3. Fixation des crochets sur la PDU+ à l'aide de 5 vis M3 par étrier. Utilisez les vis qui ont été fournies avec la PDU+. 4. Alignez la PDU+ sur l'ouverture latérale de l'armoire. Puis, en maintenant la PDU+ en place, fixez les crochets sur les poignées de montage avec quatre écrous à clip et quatre vis M5, fournis avec la PDU+. Figure 122. Alignement des étriers de fixation verticaux sur l'avant de la PDU+. 127 Avertissement : Vous devez déconnecter l'alimentation principale avant de brancher ou de débrancher le cordon d'alimentation de la PDU+. 5. Si la PDU+ a été fournie avec un cordon d'alimentation détaché, branchez maintenant le cordon d'alimentation. Alignez le connecteur du cordon d'alimentation fourni avec la PDU+ sur le connecteur situé à l'avant de la PDU+, en tournant si nécessaire, afin que les clés soient alignées. Puis, tournez le verrou rotatif sur le connecteur dans le sens des aiguilles d'une montre, jusqu'à ce qu'il soit fixé en place. Figure 123. Alignement de la PDU+ sur l'ouverture latérale de l'armoire Figure 124. Alignement du connecteur du cordon d'alimentation sur la PDU+ 128 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 6. Guidez le cordon d'alimentation de la PDU+ vers les crochets latéraux de l'armoire. Puis, faites passer le cordon d'alimentation le long d'un crochet latéral, vers l'arrière de l'armoire et fixez le cordon d'alimentation avec les brides de câblage fournies avec la PDU+. 7. Connectez le cordon d'alimentation à une source électrique dédiée. Utilisez les brides de câblage fournies pour fixer le cordon d'alimentation sur toute sa longueur. Utilisez les ouvertures de l'armoire, si le cordon d'alimentation doit sortir de l'armoire pour se brancher sur une alimentation électrique. Avertissement : Pour éviter tout dommage à un appareil électrique et autres appareils connectés, connectez toujours l'appareil électrique à une alimentation électrique autorisée pour cet appareil. 8. Connectez le cordon électrique à une alimentation électrique dédiée correctement câblée et reliée à la terre. Ensuite, vous pouvez connecter des PDU de serveurs ou d'armoire dans l'armoire aux prises électriques de la PDU+. 9. Acheminez proprement tous les autres câbles électriques et fixez-les avec des brides de câblage. Concepts associés «Configuration du contrôle de l'alimentation via l'unité PDU+», à la page 133 Vous pouvez contrôler le statut de l'alimentation d'une unité connectée à l'unité d'alimentation plus (PDU+), que ce soit manuellement ou à distance, via l'interface Web PDU+. Vous pouvez utiliser l'utilitaire de configuration pour configurer initialement l'unité PDU+, ainsi que les paramètres PDU+ tels que les propriétés du réseau, la table de contrôle d'accès, et la table des récepteurs d'alerte. Installation horizontale en armoire de la PDU+ Il se peut que vous deviez installer horizontalement la PDU. Suivez alors les instructions de cette section pour effectuer cette tâche. Avant de commencer Conseil : Le retrait des volets et des panneaux latéraux de l'armoire peut faciliter l'installation. Important : Utilisez des écrous captifs pour les armoires dotées de trous carrés, et utilisez des écrous à clip pour les armoires dotées de trous ronds. Si votre armoire nécessite des écrous captifs, utilisez un outil d'installation des écrous captifs ou un tournevis à lame plate pour les installer. Pourquoi et quand exécuter cette tâche Pour installer la PDU+ sur l'armoire, procédez comme suit : Procédure 1. Lisez les «Consignes de sécurité relatives aux armoires», à la page 102. 2. Alignez les étriers de fixation verticaux sur l'avant de la PDU+. Veillez à fixer les crochets de manière à ce que les prises électriques soient face à l'arrière de l'armoire. Unité d'alimentation plus 129 3. Alignez les supports de fixation sur les trous à l'arrière de la PDU+ et fixez les supports sur la PDU+ à l'aide des deux vis à tête tronconique M3 avec les rondelles de verrou captif par support. Utilisez les vis fournies avec le kit de montage en armoire. 4. Maintenez-la PDU+ en formant un léger angle, et insérez-la doucement dans l'espace de montage 1-U dans l'armoire. Le fait de pousser légèrement sur les deux étriers de fixation long aidera à effacer les étriers des brides de soutien de l'armoire. Figure 125. Alignement des étriers de fixation verticaux sur l'avant de la PDU+. 130 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Avertissement : Vous devez déconnecter l'alimentation principale avant de brancher ou de débrancher le cordon d'alimentation de la PDU+. 5. Fixez en premier l'extrémité de la PDU+ qui a les étriers de fixation courts à l'armoire. Veillez à ce que les étriers de fixation courts soient alignés sur l'extérieure des brides de soutien de l'armoire. Fixez les étriers aux brides de soutien avec deux vis M6 et deux écrous captifs ou écrous à clip par étrier. Utilisez les écrous captifs ou les écrous à clip ainsi que les vis fournis avec le kit de montage en armoire. 6. Fixez les étriers de fixation longs et le panneau obturateur blanc 1-U sur l'armoire en procédant comme suit : Figure 126. Alignement de la PDU+ sur l'ouverture de l'armoire. Unité d'alimentation plus 131 a. Ajustez les étriers de fixation longs de manière à ce qu'ils s'ajustent à la profondeur de l'armoire. b. Veillez à ce que les étriers de fixation longs soient alignés sur l'intérieur des brides de soutien de l'armoire. c. Alignez le panneau obturateur blanc sur l'extérieur des brides de soutien. d. Fixez le panneau obturateur aux brides de soutien de l'armoire puis à l'étrier de fixation long à l'aide d'une vis M6 par étrier. e. Serrez les vis à tête tronconique M3 qui fixent les étriers de fixation longs à la PDU. 7. Fixez en premier l'extrémité de la PDU+ qui a les étriers de fixation courts à l'armoire. Veillez à ce que les étriers de fixation courts soient alignés sur l'extérieure des brides de soutien de l'armoire. Fixez les étriers aux brides de soutien avec deux vis M6 et deux écrous captifs ou écrous à clip par étrier. Utilisez les écrous captifs ou les écrous à clip ainsi que les vis fournis avec le kit de montage en armoire. 8. Si la PDU+ a été fournie avec un cordon d'alimentation détaché, branchez maintenant le cordon d'alimentation. Alignez le connecteur du cordon d'alimentation fourni avec la PDU+ sur le connecteur situé à l'avant de la PDU+, en tournant si nécessaire, afin que les clés soient alignées. Puis, tournez le verrou rotatif sur le connecteur dans le sens des aiguilles d'une montre, jusqu'à ce qu'il soit fixé en place. 132 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 9. Guidez le cordon d'alimentation de la PDU+ vers les crochets latéraux de l'armoire. Puis, faites passer le cordon d'alimentation le long d'un crochet latéral, vers l'arrière de l'armoire et fixez le cordon d'alimentation avec les brides de câblage fournies avec la PDU+. 10. Connectez le cordon d'alimentation à une source électrique dédiée. Utilisez les brides de câblage fournies pour fixer le cordon d'alimentation sur toute sa longueur. Utilisez les ouvertures de l'armoire, si le cordon d'alimentation doit sortir de l'armoire pour se brancher sur une alimentation électrique. Avertissement : Pour éviter tout dommage à un appareil électrique et autres appareils connectés, connectez toujours l'appareil électrique à une alimentation électrique autorisée pour cet appareil. 11. Connectez le cordon électrique à une alimentation électrique dédiée correctement câblée et reliée à la terre. Ensuite, vous pouvez connecter des PDU de serveurs ou d'armoire dans l'armoire aux prises électriques de la PDU+. 12. Acheminez proprement tous les autres câbles électriques et fixez-les avec des brides de câblage. Configuration du contrôle de l'alimentation via l'unité PDU+ Vous pouvez contrôler le statut de l'alimentation d'une unité connectée à l'unité d'alimentation plus (PDU+), que ce soit manuellement ou à distance, via l'interface Web PDU+. Vous pouvez utiliser l'utilitaire de configuration pour configurer initialement l'unité PDU+, ainsi que les paramètres PDU+ tels que les propriétés du réseau, la table de contrôle d'accès, et la table des récepteurs d'alerte. Remarque : Toutes les options de configuration de l'utilitaire de configuration sont disponibles via l'interface Web une fois l'unité PDU+ configurée sur le réseau local. Tâches associées «Installation de l'unité PDU+ sur le côté d'une armoire», à la page 127 Apprenez à installer l'unité d'alimentation plus (PDU+) sur le côté d'une armoire. Utilisation de l'utilitaire de configuration Apprenez à utiliser l'utilitaire de configuration pour configurer les paramètres d'unité d'alimentation plus (PDU+), tels que l'adresse IP, les propriétés du réseau, la table de contrôle d'accès et la table des récepteurs d'alerte. Figure 127. Alignement du connecteur du cordon d'alimentation sur la PDU+ Unité d'alimentation plus 133 Pourquoi et quand exécuter cette tâche Connexion de la console Pourquoi et quand exécuter cette tâche Vous pouvez configurer la PDU+ à l'aide d'un poste de travail ou d'un ordinateur bloc-notes connecté à la PDU+. Connectez le câble DB9-to-RJ-45 fourni avec la PDU+ au connecteur de la console RJ-45 sur la PDU+, et à un connecteur de série RS-232 (COM) sur un poste de travail ou ordinateur bloc-notes. Utilisation d'HyperTerminal Pourquoi et quand exécuter cette tâche HyperTerminal est un programme de terminal dans un système d'exploitation Microsoft® Windows® qui vous permet de configurer ou de commander un appareil à l'aide de paramètres de ligne de commande. Vous pouvez configurer les paramètres PDU+ et ses prises à l'aide de commandes numériques à partir d'un clavier. Vous pouvez également utiliser Telnet ou tout autre programme de terminal pour configurer la PDU+ une fois l'adresse IP paramétrée. Pour démarrer l'HyperTerminal et communiquer avec la PDU+, procédez comme suit : Procédure 1. Cliquez sur Démarrer ? Programmes ? Accessoires ? Communications ? HyperTerminal. La fenêtre Description de la connexion s'affiche. 2. Saisissez le nom de la connexion dans la zone Nom et sélectionnez une icône pour la connexion. 3. Cliquez sur OK. La fenêtre Connecter à s'affiche. 4. Dans la zone Connecter à l'aide de, sélectionnez le port COM connecté à la PDU+. 5. Cliquez sur OK. La fenêtre Propriétés s'affiche. 6. Cliquez sur Restaurer les paramètres par défaut pour utiliser les paramètres par défaut. Veillez à ce que la zone Bits par seconde soit réglée sur 9600 et à ce que la zone Contrôle de flux soit définie sur Aucun. 7. Cliquez sur OK. 8. Appuyez sur n'importe quelle touche. Le menu principal de l'utilitaire de configuration s'affiche. 9. Saisissez le mot de passe par défaut, passw0rd (tout en lettres minuscules avec un zéro et non un O), et appuyez sur Entrée. 10. Saisissez l'option de menu que vous souhaitez. Pour obtenir la description des options, voir «Options du menu de l'utilitaire de configuration». Options du menu de l'utilitaire de configuration Pourquoi et quand exécuter cette tâche Les options suivantes se trouvent dans le menu principal de l'utilitaire de configuration : Paramètres de Lorsque vous sélectionnez Paramètres , le fenêtre de l'utilitaire de configuration s'affiche avec les options suivantes : Configurez l'adresse IP, l'adresse de la passerelle et le Groupe de systèmes MIB Affichez et modifiez l'adresse IP, la date, l'heure et les informations du système MIB. Définissez le Groupe de commandes Configurez le nom d'utilisateur de l'administrateur, le mot de passe, et les protocoles d'accès. Configurez les gestionnaires d'accès en écriture Configurez une liste des utilisateurs pouvant accéder à et contrôler la PDU+. 134 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Configurez les récepteurs d'alarme Configurez les serveurs de gestion du réseau à distance (NMS) pour recevoir des alarmes. Définissez la date et l'heure Réglez les informations de la date et de l'heure pour la PDU+. Configurez le nom de superutilisateur et mot de passe Configurez le nom d'utilisateur et le mot de passe de l'administrateur qui utilisera un navigateur Web pour configurer la PDU+. Notification par e-mail Configurez une liste des utilisateurs qui seront alertés par des messages d'événements si un événement inhabituel est déclenché sur le système PDU+. Configurez les multi-utilisateurs Configurez les autres connexions utilisateurs et mots de passe ainsi que les niveaux d'accès en lecture et écriture. Configurez les informations d' Configurez les zones intervalle de connexion, fréquence de régénération, et nom personnalisé des groupes de charge PDU+. Récapitulatif des paramètres et du journal des événements Affichez tous les paramètres de configuration de la PDU+. Réinitialisez la configuration aux paramètres par défaut Réinitialisez tous les paramètres systèmes aux valeurs par défaut en sortie d'usine. Redémarrez le HD-PDU Redémarrez la PDU+. Configuration des adresses IP Pourquoi et quand exécuter cette tâche Important : Vous devez configurer l'adresse IP avant de pouvoir utiliser l'interface Web ou d'accéder à la PDU+ dans un réseau IP (LAN/WAN). Si vous ne connaissez pas l'adresse IP, contactez votre administrateur système. Pour configurer l'adresse IP, procédez comme suit : Procédure 1. Dans le menu principal de l'utilitaire de configuration, saisissez l'option de menu pour les Paramètres. 2. Saisissez l'option de menu pour Configurer l'adresse IP, l'adresse de la passerelle et le groupe système MIB. Utilisation de l'interface Web Apprenez à utiliser l'interface Web pour configurer et surveiller à distance l'unité d'alimentation plus (PDU+). La PDU+ fournit une interface graphique que vous pouvez afficher à partir d'un navigateur Web. A l'aide d'un navigateur Web, vous pouvez accéder à et surveiller les prises électriques d'alimentation des PDU+ à distance depuis un poste de travail ou un ordinateur bloc-notes. Pourquoi et quand exécuter cette tâche Démarrage de l'interface Web Pourquoi et quand exécuter cette tâche Pour démarrer l'interface Web, procédez comme suit : Unité d'alimentation plus 135 Procédure 1. Démarrez un navigateur Web à partir d'un poste de travail ou d'un ordinateur bloc-notes, et entrez l'adresse IP de la PDU+q dans la zone d'adresse. La fenêtre de connexion s'affiche. Remarque : Pour plus d'informations sur le paramétrage de l'adresse IP du système voir «Configuration des adresses IP», à la page 135. 2. Dans la zone du nom d'utilisateur, entrez USERID (tout en majuscules). 3. Dans la zone du mot de passe, entrez passw0rd (tout en minuscules avec un zéro à la place du O). 4. Cliquez sur OK. La page de statut principale s'affiche. Résultats La page de statut principale affiche une représentation graphique des prises électriques de la PDU+, ainsi que le statut d'entrée : v Le panneau à gauche affiche les menus et les sous-menus de la PDU+. Cliquez sur un menu pour en afficher les options, développer les éléments de menu, et modifier les options de menu selon vos besoins. v Le graphique affiché dans le panneau de droite montre le statut des prises électriques, le voltage d'entrée, le voltage de sortie, la fréquence, la consommation de courant, électricité, en watt/heure, et la consommation électrique cumulée en watt/heure. Si vous connectez une sonde environnementale surveillée facultative, les conditions de température et d'humidité de l'environnement s'affichent. Chaque page de menu fournit une aide en ligne pour vous aider dans la configuration de la PDU+. Cliquez sur l'icône d'aide en haut de chaque page pour afficher l'aide. Modification des paramètres de base Utilisez le menu Système pour configurer les paramètres système de la PDU+ tels que le nom de superutilisateur, le mot de passe, l'adresse IP, la date et l'heure. Pourquoi et quand exécuter cette tâche Modification du nom de superutilisateur et du mot de passe : Vous pouvez configurer le nom d'utilisateur et le mot de passe de l'administrateur qui utilisera un navigateur Web pour configurer la PDU+ sur la page Utilitaire de configuration. Pourquoi et quand exécuter cette tâche Pour modifier le nom de superutilisateur et le mot de passe, procédez comme suit : Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Système. 2. Cliquez sur Configuration pour afficher et modifier la configuration système et le nom de superutilisateur et le mot de passe. Identification de la PDU+ et de la carte Web/SNMP : Vous pouvez afficher les informations concernant la PDU+ et la carte Web/SNMP sur la page Identification de la gestion de l'alimentation. 136 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Pourquoi et quand exécuter cette tâche Pour afficher les informations de gestion de l'alimentation de la PDU+ et de la carte Web/SNMP, procédez comme suit : Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Système. 2. Cliquez sur Identification pour afficher les informations de la PDU+ et de la carte Web/SNMP. Ajout d'utilisateurs : Sur la page Configuration Multi-utilisateurs, vous pouvez ajouter des utilisateurs qui peuvent accéder à et surveiller la PDU+. Pourquoi et quand exécuter cette tâche Pour créer une liste des utilisateurs qui peuvent accéder à et surveiller la PDU+, procédez comme suit : Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Système. 2. Cliquez sur Multi-Utilisateurs pour ajouter des utilisateurs qui peuvent uniquement afficher le statut de la PDU+ ou des utilisateurs qui peuvent modifier les paramètres de la PDU+. Modification de la date et de l'heure : Vous pouvez modifier la date et l'heure de la PDU+ sur la page Date et Heure. Pourquoi et quand exécuter cette tâche Important : La modification de la date et heure de la PDU+ affecte les autres paramètres système, tels que les e-mails, les interruptions et journaux. Pour modifier la date et l'heure, procédez comme suit : Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Système. 2. Cliquez sur Date et Heure pour afficher et modifier la date et l'heure du système. Vous pouvez paramétrer manuellement la date et l'heure, les synchroniser avec l'heure de l'ordinateur, ou les synchroniser avec un serveur NTP. Modification des alertes événement : Vous pouvez modifier les alertes événement sur la page des récepteurs d'alerte SNMP. Pourquoi et quand exécuter cette tâche Pour configurer la PDU+ afin d'envoyer des alertes e-mail ou alarmes SNMP à des utilisateurs spécifiés lors de la survenance d'événements spécifiques, procédez comme suit : Unité d'alimentation plus 137 Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Système. 2. Cliquez sur Récepteurs d'alerte pour créer une liste des utilisateurs ou des postes de travail qui seront alertés par un message d'alerte SNMP. Vous pouvez spécifier les adresses IP pour un maximum de huit récepteurs d'alarmes, ainsi que les informations de la communauté, le type d'alarme, la gravité de l'alarme, et la description des événements qui déclenchent les alarmes. 3. Cliquez sur Notification par e-mail sous Système pour créer une liste d'un maximum de quatre utilisateurs qui seront alertés par un e-mail. Utilisez ce menu pour spécifier le serveur de mail, le compte utilisateur, le système de nom de domaine (DNS) et les autres informations nécessaires pour configurer un serveur de mails pour l'envoi d'alertes mail. Utilisez la table Récepteurs d'e-mail pour ajouter les adresses e-mail. Modification des informations réseau Utilisez le menu Réseau pour modifier les informations réseau pour la PDU+, par exemple, l'adresse IP. Pourquoi et quand exécuter cette tâche Modification de la configuration de réseau : Vous pouvez afficher ou modifier la configuration réseau sur la page Configuration réseau. Pourquoi et quand exécuter cette tâche Pour afficher ou modifier la configuration réseau de la PDU+, procédez comme suit : Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Réseau. 2. Cliquez sur Configuration pour paramétrer l'adresse IP, l'adresse de la passerelle, le masque de sous-réseau, et l'adresse de système de nom de domaine de PDU+. 3. Cliquez sur Commande pour configurer les paramètres TCP/IP. 4. Cliquez sur Commande d'accès pour paramétrer la commande d'accès afin d'empêcher les utilisateurs non autorisés d'accéder à la PDU+. Récapitulatifs du journal de l'historique et des événements Le menu Journaux fournit une description détaillée de tous les événements ainsi qu'un enregistrement du statut de la PDU+. Les administrateurs système peuvent utiliser cette page pour analyser les incidents avec l'équipement réseau. Pourquoi et quand exécuter cette tâche Affichage du journal de l'historique : Vous pouvez afficher l'historique complet des entrées et sorties de PDU+ sur la page du Journal de l'historique. Pourquoi et quand exécuter cette tâche Pour afficher l'historique de la PDU+, procédez comme suit : Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Journaux. 138 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 2. Cliquez sur Historique. Chaque fichier du journal des événements affiche l'heure, la date et la description de tous les événements survenant sur la PDU+. Affichage du journal des événements : Vous pouvez afficher l'enregistrement complet des événements de la PDU+ sur la page Journal des événements. Pourquoi et quand exécuter cette tâche Pour afficher l'enregistrement complet des événements PDU+, procédez comme suit : Procédure 1. A partir de la page de statut principale, dans le panneau de navigation de gauche, cliquez sur Journaux. 2. Cliquez sur Evénements. Chaque fichier journal affiche un enregistrement de l'alimentation d'entrée et de sortie de chaque prise. Unité d'alimentation plus 139 140 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Retrait et remplacement du capot ou du volet de l'unité d'extension Il se peut que vous deviez retirer, remplacer ou installer des capots ou des portes sur une unité d'extension pour accéder à des composants ou effectuer une opération de maintenance. Retrait du capot avant des modèles 14G/30 Il se peut que vous deviez retirer le capot pour effectuer des opérations de maintenance du système. Pourquoi et quand exécuter cette tâche Pour retirer le capot avant, procédez comme suit : Procédure 1. Retirez les deux vis moletées (B) figurant sur la partie gauche et à l'arrière du panneau. 2. Appuyez sur les loquets (C) situés de part et d'autre du panneau pour libérer celui-ci. 3. Soulevez le panneau (A) et retirez-le du châssis. Conseil : Si un bloc de ventilation est présent dans une partie vide du châssis, ne le retirez que si vous installez des composants dans cette partie du châssis. Figure 128. Retrait du capot avant des modèles 14G/30 141 Installation du capot avant des modèles 14G/30 Cette procédure permet d'installer le capot avant après avoir installé le système ou effectué une opération de maintenance. Avant de commencer Si des composants ne sont installés que d'un seul côté du châssis, assurez-vous que le bloc de ventilation est présent du côté vide. Procédure 1. Insérez les colonnes situées à l'arrière du panneau (A) dans les crochets du châssis. 2. Appuyez sur les loquets (C) situés de part et d'autre du panneau avant. 3. Insérez le panneau dans le châssis et libérez les loquets. Le panneau s'enclenche dans son logement. 4. Remettez en place les deux vis moletées (B) dans les encoches sur la gauche et la droite du capot avant. Figure 129. Capot avant d'un modèle 14G/30 142 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Retrait et installation des carters sur le système 33E/8B ou 36E/8C Ces instructions permettent de retirer, de remettre en place ou d'installer des carters pour accéder à des composants ou effectuer une opération de maintenance. Retrait du capot d'accès des modèles 33E/8B ou 36E/8C Cette procédure permet de retirer le capot d'accès pour effectuer des opérations de maintenance ou accéder à des composants internes. Pourquoi et quand exécuter cette tâche Pour retirer le capot d'accès, procédez comme suit : Procédure 1. Placez le système en position de maintenance. Pour plus d'informations, voir «Mise en position de maintenance de l'unité 33E/8B ou 36E/8C», à la page 157. 2. Desserrez les deux vis de serrage à main (A) situées à l'arrière du capot. 3. Placez le capot (B) à l'arrière de l'unité centrale. Une fois l'avant du capot d'accès dégagé du bord supérieur, soulevez et retirez le capot. Résultats Avertissement : Pour garantir un niveau de refroidissement et de ventilation approprié, installez le capot avant de démarrer le système. Vous risquez d'endommager les composants système si vous utilisez le système sans son capot pendant plus de 30 minutes. 143 Installation du capot d'accès des modèles 33E/8B ou 36E/8C. Cette procédure permet d'installer le capot d'accès après avoir effectué une opération de maintenance ou accédé aux composants internes. Pourquoi et quand exécuter cette tâche Pour installer le capot d'accès, procédez comme suit : Procédure 1. Placez le capot d'accès (A) sur le haut de l'unité centrale, à environ 25 mm du carter avant. 2. Tout en maintenant le capot d'accès contre l'unité centrale, faites glisser le capot vers l'avant de l'unité centrale. Les taquets du capot d'accès doivent se trouver en-dessous du bord supérieur et les deux vis doivent être alignées avec les trous situés à l'arrière de l'unité centrale. Important : Assurez-vous que les câble LED du ventilateur ne sont pas coincés par l'arête avant du capot d'accès lorsque vous le déplacez vers l'avant. 3. Serrez les deux vis (B) situées à l'arrière du capot. Figure 130. Retrait du capot d'accès 144 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Retrait du carter avant des modèles 33E/8B ou 36E/8C. Cette procédure permet de retirer le carter pour accéder aux composants ou effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour retirer le carter avant, procédez comme suit : Procédure 1. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (B) (voir figure suivante). 2. Retirez le carter du système. Le carter est doté d'un renfoncement permettant de le tenir plus facilement. Figure 131. Installation du capot d'accès Retrait et installation des carters sur le système 33E/8B ou 36E/8C 145 Installation du carter avant des modèles 33E/8B ou 36E/8C Cette procédure permet de remettre en place le carter après avoir accédé à des composants ou effectué une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le carter avant, procédez comme suit : Procédure 1. Alignez le carter face au système. 2. Poussez doucement le carter jusqu'à ce que les quatre taquets de déverrouillage soient positionnés dans leurs montants de fixation respectifs (B) sur le système. 3. Remettez en place les deux vis (C) qui maintiennent le système fixé à l'armoire (A). Figure 132. Retrait du carter avant 146 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Retrait et remise en place du capot avant sur les modèles 17M/MB et 79M/HB Ces procédures permettent de retirer et de remettre en place le capot pour accéder à des composants ou effectuer une opération de maintenance. Retrait du carter avant d'un modèle 17M/MB ou 79M/HB Cette procédure permet de retirer le capot pour accéder aux composants ou effectuer une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour retirer le carter avant, procédez comme suit : Procédure 1. Si nécessaire, ouvrez le volet avant de l'armoire. 2. Tirez les taquets (A) situés de chaque côté du capot (voir figure suivante). Figure 133. Installation du carter avant Retrait et installation des carters sur le système 33E/8B ou 36E/8C 147 3. Tirez le panneau (B) pour l'ôter de l'unité centrale. Installation du carter avant sur un modèle 17M/MB ou 79M/HB Cette procédure permet de remettre en place le carter après avoir accédé à des composants ou effectué une opération de maintenance. Pourquoi et quand exécuter cette tâche Pour installer le carter avant, procédez comme suit : Procédure 1. Placez le carter (A) en face du système en alignant les quatre broches avec les quatre trous à l'arrière du carter. Figure 134. Retrait du capot avant 148 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 2. Enfoncez les ergots (B) pour emboîter le carter dans son emplacement. 3. Fermez le volet avant de l'armoire. Figure 135. Installation du carter avant Retrait et installation des carters sur le système 33E/8B ou 36E/8C 149 150 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C Ces procédures décrivent comment mettre un système en maintenance et en fonctionnement dans le cadre d'une opération de maintenance ou pour accéder à des composants internes. Mise en position de maintenance de l'unité centrale ou de l'unité d'extension Cette procédure permet d'effectuer des opérations de maintenance ou d'accéder à des composants internes en plaçant l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance. Pourquoi et quand exécuter cette tâche Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. 151 DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) DANGER 152 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Pour mettre une unité centrale ou une unité d'extension montée en armoire en position de maintenance, procédez comme suit : Procédure 1. Si nécessaire, ouvrez le volet avant de l'armoire. 2. Retirez les deux vis (A) qui maintiennent l'unité centrale ou l'unité d'extension (B) fixée à l'armoire (voir figure suivante). Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 153 3. Débloquez les taquets de déverrouillage (A) situés sur les deux côtés de l'unité (voir la figure suivante). Figure 136. Retrait des vis du système et de l'armoire 154 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 4. Lisez la remarque suivante, puis tirez doucement l'unité centrale ou l'unité d'extension hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et bloquées. A faire : v Si vous devez débrancher des câbles figurant à l'arrière de l'unité centrale ou de l'unité d'extension pour exécuter la procédure, faites-le avant de sortir l'unité de l'armoire. v Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de l'unité d'extension en tirant l'unité hors de l'armoire. v Vérifiez que les glissières sont complètement étendues. Une fois les glissières complètement étendues, les taquets de sécurité s'enclenchent. Ils évitent de sortir l'unité centrale ou l'unité d'extension de manière excessive. Mise en position fonctionnement de l'unité centrale ou de l'unité d'extension Cette procédure permet de mettre l'unité centrale ou l'unité d'extension montée en armoire en position de fonctionnement pour pouvoir l'utiliser. Pourquoi et quand exécuter cette tâche Conseil : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Pour mettre l'unité centrale ou l'unité d'extension montée en armoire en position de fonctionnement, procédez comme suit : Figure 137. Déblocage des taquets de sécurité Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 155 Procédure 1. Relâchez simultanément les taquets de sécurité bleus (A) situés à proximité de l'avant de chaque glissière, puis poussez l'unité centrale ou l'unité d'extension dans l'armoire, comme indiqué dans la figure suivante. Remarque : Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de l'unité d'extension en poussant l'unité dans l'armoire. 2. Remettez en place et serrez les deux vis (C) qui maintiennent l'unité centrale ou l'unité d'extension (A) fixée à l'armoire (voir la figure suivante). Figure 138. Déblocage des taquets de sécurité de la glissière 156 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 3. Fermez le volet avant de l'armoire. Mise en position de maintenance de l'unité 33E/8B ou 36E/8C Cette procédure permet d'effectuer des opérations de maintenance ou d'accéder à des composants internes en plaçant l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance. Pourquoi et quand exécuter cette tâche Remarque : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Figure 139. Mise en place de l'unité centrale dans l'armoire et fixation des vis Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 157 DANGER Lorsque vous utilisez le système ou travaillez à proximité de ce dernier, observez les consignes suivantes : Le courant électrique provenant de l'alimentation, du téléphone et des câbles de transmission peut présenter un danger. Pour éviter tout risque de choc électrique : v Branchez cette unité uniquement avec le cordon d'alimentation fourni par . N'utilisez pas ce dernier avec un autre produit. v N'ouvrez pas et n'entretenez pas le bloc d'alimentation électrique. v Ne manipulez aucun câble et n'effectuez aucune opération d'installation, d'entretien ou de reconfiguration de ce produit au cours d'un orage. v Le produit peut être équipé de plusieurs cordons d'alimentation. Pour supprimer tout risque de danger électrique, débranchez tous les cordons d'alimentation. v Branchez tous les cordons d'alimentation sur un socle de prise de courant correctement câblé et mis à la terre. Vérifiez que la tension et l'ordre des phases des prises de courant correspondent aux informations de la plaque d'alimentation électrique du système. v Branchez sur des socles de prise de courant correctement câblés tout équipement connecté à ce produit. v Lorsque cela est possible, n'utilisez qu'une seule main pour connecter ou déconnecter les câbles d'interface. v Ne mettez jamais un équipement sous tension en cas d'incendie ou d'inondation, ou en présence de dommages matériels. v Avant de retirer les carters de l'unité, mettez celle-ci hors tension et déconnectez ses cordons d'alimentation, ainsi que les câbles qui la relient aux réseaux, aux systèmes de télécommunication et aux modems (sauf mention contraire dans les procédures d'installation et de configuration). v Lorsque vous installez, que vous déplacez, ou que vous manipulez le présent produit ou des périphériques qui lui sont raccordés, reportez-vous aux instructions ci-dessous pour connecter et déconnecter les différents cordons. Pour déconnecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Débranchez les cordons d'alimentation des prises. 3. Débranchez les câbles d'interface des connecteurs. 4. Débranchez tous les câbles des unités. Pour connecter les cordons : 1. Mettez toutes les unités hors tension (sauf mention contraire). 2. Branchez tous les cordons sur les unités. 3. Branchez les câbles d'interface sur des connecteurs. 4. Branchez les cordons d'alimentation aux prises. 5. Mettez l'unité sous tension. (D005a) DANGER 158 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Observez les consignes suivantes lors de l'utilisation du système en armoire ou lorsque vous travaillez à proximité de ce dernier : v Un mauvais maniement de l'équipement lourd peut engendrer blessures et dommages matériels. v Abaissez toujours les vérins de mise à niveau de l'armoire. v Installez toujours des équerres de stabilisation sur l'armoire. v Pour prévenir tout danger lié à une mauvaise répartition de la charge, installez toujours les unités les plus lourdes dans la partie inférieure de l'armoire. Installez toujours les serveurs et les unités en option en commençant par le bas de l'armoire. v Un serveur monté en armoire n'est pas une étagère ou un espace de travail. Ne posez pas d'objet sur un serveur monté en armoire. v Chaque armoire peut être équipée de plusieurs cordons d'alimentation. Avant de manipuler l'armoire, vous devez débrancher l'ensemble des cordons d'alimentation. v Reliez toutes les unités installées dans l'armoire aux dispositifs d'alimentation installés dans la même armoire. Vous ne devez pas brancher le cordon d'alimentation d'une unité installée dans une armoire au dispositif d'alimentation installé dans une autre armoire. v Un mauvais câblage du socle de prise de courant peut provoquer une mise sous tension dangereuse des parties métalliques du système ou des unités qui lui sont raccordées. Il appartient au client de s'assurer que le socle de prise de courant est correctement câblé et mis à la terre afin d'éviter tout risque de choc électrique. ATTENTION v N'installez pas d'unité dans une armoire dont la température ambiante interne dépasse la température ambiante que le fabricant recommande pour toutes les unités montées en armoire. v N'installez pas d'unité dans une armoire où la ventilation n'est pas assurée. Vérifiez que les côtés, l'avant et l'arrière de l'unité sont correctement ventilés. v Le matériel doit être correctement raccordé au circuit d'alimentation pour éviter qu'une surcharge des circuits n'entrave le câblage des dispositifs d'alimentation ou de protection contre les surintensités. Pour choisir des connexions d'alimentation à l'armoire adaptées, consultez les étiquettes de puissance nominale situées sur le matériel dans l'armoire afin de déterminer l'alimentation totale requise par le circuit d'alimentation. v Armoires dotées de tiroirs coulissants : Si l'armoire n'est pas équipée d'équerres de stabilisation, ne sortez et n'installez pas de tiroir ou de dispositif. Ne retirez pas plusieurs tiroirs à la fois. Si vous retirez plusieurs tiroirs simultanément, l'armoire risque de devenir instable. v Armoires dotées de tiroirs fixes : Sauf indication du fabricant, les tiroirs fixes ne doivent pas être retirés à des fins de maintenance. Si vous tentez de retirer une partie ou l'ensemble du tiroir, l'armoire risque de devenir instable et le tiroir risque de tomber. (R001) Pour mettre l'unité centrale ou l'unité d'extension montée en armoire en position de maintenance, procédez comme suit : Procédure 1. Si nécessaire, ouvrez le volet avant de l'armoire. 2. Retirez les deux vis (A) qui maintiennent l'unité centrale à l'armoire (voir figure suivante). 3. Débloquez les taquets de sécurité (B) situés sur les deux côtés de l'unité (voir figure suivante). Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 159 4. Si vous devez sortir l'unité centrale ou l'unité d'extension de l'armoire, veillez à débrancher auparavant les câbles situés à l'arrière de l'unité. Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de l'unité d'extension en tirant l'unité hors de l'armoire. 5. Tirez doucement l'unité centrale ou l'unité d'extension hors de l'armoire jusqu'à ce que les glissières soient complètement étendues et bloquées. Vérifiez que les glissières sont complètement étendues. Une fois les glissières complètement étendues, les taquets de sécurité s'enclenchent. Ils évitent de sortir l'unité centrale ou l'unité d'extension de manière excessive. Mise en position de fonctionnement de l'unité 33E/8B ou 36E/8C Cette procédure permet de mettre l'unité centrale ou l'unité d'extension en position de fonctionnement pour pouvoir l'utiliser. Pourquoi et quand exécuter cette tâche Pour mettre l'unité centrale en position de fonctionnement, procédez comme suit : Conseil : Certaines figures accompagnant ces procédures peuvent ne pas correspondre exactement à votre unité centrale ou votre unité d'extension. Néanmoins, les procédures demeurent identiques pour l'exécution de chaque tâche. Procédure 1. Libérez simultanément les taquets de sécurité bleus (B) situés vers l'avant de chaque glissière, puis poussez l'unité centrale ou l'unité d'extension dans l'armoire (voir la figure suivante). Remarque : Veillez à ne pas bloquer ou plier les câbles situés à l'arrière de l'unité centrale ou de l'unité d'extension en poussant l'unité dans l'armoire. Figure 140. Déblocage des taquets de sécurité 160 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension 2. Remettez en place et vissez les deux vis de serrage à main (C) qui maintiennent l'unité centrale ou l'unité d'extension (A) fixé(e) à l'armoire (voir la figure suivante). 3. Fermez le volet avant de l'armoire. Figure 141. Déblocage des taquets de sécurité de la glissière Figure 142. Remise en place des vis de serrage à main Mise en position de maintenance ou de fonctionnement du système 33E/8B ou 36E/8C 161 162 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Remarques Le présent document peut contenir des informations ou des références concernant certains produits, logiciels ou services non annoncés dans ce pays. Pour plus d'informations, référez-vous aux documents d'annonce disponibles dans votre pays ou adressez-vous au partenaire commercial du fabricant. Toute référence à un produit, logiciel ou service du fabricant n'implique pas que seul ce produit, logiciel ou service puisse convenir. Tout autre élément fonctionnellement équivalent peut être utilisé, s'il n'enfreint aucun droit du fabricant. Il est toutefois de la responsabilité de l'utilisateur d'évaluer et de vérifier lui-même le fonctionnement de tout produit, programme ou service. Le fabricant peut détenir des brevets ou des demandes de brevet couvrant les produits mentionnés dans le présent document. La remise de ce document ne vous octroie aucun droit de licence sur ces brevets ou demandes de brevet. Si vous désirez recevoir des informations concernant l'acquisition de licences, veuillez en faire la demande par écrit au fabricant. Le paragraphe suivant ne s'applique ni au Royaume-Uni, ni dans aucun pays dans lequel il serait contraire aux lois locales. LE PRESENT DOCUMENT EST LIVRE «EN L'ETAT» SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. Certaines juridictions n'autorisent pas l'exclusion des garanties implicites, auquel cas l'exclusion ci-dessus ne vous sera pas applicable. Le présent document peut contenir des inexactitudes ou des coquilles. Il est mis à jour périodiquement. Chaque nouvelle édition inclut les mises à jour. IBM peut modifier sans préavis les produits et logiciels décrits dans ce document. Les références à des sites Web non référencés par le fabricant sont fournies à titre d'information uniquement et n'impliquent en aucun cas une adhésion aux données qu'ils contiennent. Les éléments figurant sur ces sites Web ne font pas partie des éléments du présent produit et l'utilisation de ces sites relève de votre seule responsabilité. Le fabricant pourra utiliser ou diffuser, de toute manière qu'il jugera appropriée et sans aucune obligation de sa part, tout ou partie des informations qui lui seront fournies. Les données de performance indiquées dans ce document ont été déterminées dans un environnement contrôlé. Par conséquent, les résultats peuvent varier de manière significative selon l'environnement d'exploitation utilisé. Certaines mesures évaluées sur des systèmes en cours de développement ne sont pas garanties sur tous les systèmes disponibles. En outre, elles peuvent résulter d'extrapolations. Les résultats peuvent donc varier. Il incombe aux utilisateurs de ce document de vérifier si ces données sont applicables à leur environnement d'exploitation. Les informations concernant les produits de fabricants tiers ont été obtenues auprès des fournisseurs de ces produits, par l'intermédiaire d'annonces publiques ou via d'autres sources disponibles. Ce fabricant n'a pas testé ces produits et ne peut confirmer l'exactitude de leurs performances ni leur compatibilité. Il ne peut recevoir aucune réclamation concernant des produits de fabricants tiers. Toute question concernant les performances de produits de fabricants tiers doit être adressée aux fournisseurs de ces produits. Toute instruction relative aux intentions du fabricant pour ses opérations à venir est susceptible d'être modifiée ou annulée sans préavis, et doit être considérée uniquement comme un objectif. 163 Tous les tarifs indiqués sont les prix de vente actuels suggérés et sont susceptibles d'être modifiés sans préavis. Les tarifs appliqués peuvent varier selon les revendeurs. Ces informations sont fournies uniquement à titre de planification. Elles sont susceptibles d'être modifiées avant la mise à disposition des produits décrits. Le présent document peut contenir des exemples de données et de rapports utilisés couramment dans l'environnement professionnel. Ces exemples mentionnent des noms fictifs de personnes, de sociétés, de marques ou de produits à des fins illustratives ou explicatives uniquement. Tous ces noms sont fictifs et toute ressemblance avec des noms et adresses utilisés par une entreprise réelle serait purement fortuite. Si vous visualisez ces informations en ligne, il se peut que les photographies et illustrations en couleur n'apparaissent pas à l'écran. Les figures et les spécifications contenues dans le présent document ne doivent pas être reproduites, même partiellement, sans l'autorisation écrite d'IBM. Le fabricant a conçu le présent document pour expliquer comment utiliser les machines indiquées. Il n'est exploitable dans aucun autre but. Les ordinateurs du fabricant contiennent des mécanismes conçus pour réduire les risques d'altération ou de perte de données. Ces risques, cependant, ne peuvent pas être éliminés. En cas de rupture de tension, de défaillances système, de fluctuations ou de rupture de l'alimentation ou d'incidents au niveau des composants, l'utilisateur doit s'assurer de l'exécution rigoureuse des opérations, et que les données ont été sauvegardées ou transmises par le système au moment de la rupture de tension ou de l'incident (ou peu de temps avant ou après). De plus, ces utilisateurs doivent établir des procédures garantissant la vérification indépendante des données, afin de permettre une utilisation fiable de ces dernières dans le cadre d'opérations stratégiques. Ces utilisateurs doivent enfin consulter régulièrement sur les sites Web de support IBM les mises à jour et les correctifs applicables au système et aux logiciels associés. Marques IBM, le logo IBM et ibm.com sont des marques d'International Business Machines aux Etats-Unis et/ou dans certains autres pays. D'autres sociétés sont propriétaires d'autres marques, noms de produits ou logos. Une liste à jour des marques IBM est disponible sur Internet : Copyright and trademark information, à l'adresse www.ibm.com/legal/copytrade.shtml. Microsoft et Windows sont des marques de Microsoft Corporation aux Etats-Unis et/ou dans certains autres pays. Les autres noms de sociétés, de produits et de services peuvent appartenir à des tiers. Bruits radioélectriques Remarques sur la classe A Les avis de conformité de classe A suivants s'appliquent aux serveurs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Remarque : Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies pour la classe A, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de cette classe offre une garantie acceptable contre les perturbations électromagnétiques dans les zones commerciales. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du 164 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension constructeur. L'exploitation faite en zone résidentielle peut entraîner le brouillage des réceptions radio et télé, ce qui obligerait le propriétaire à prendre les dispositions nécessaires pour en éliminer les causes. Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. IBM® ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de l'utilisation de câbles et connecteurs inadaptés ou de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultat d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe A de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe A offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones commerciales et industrielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Voici un résumé de la recommandation du VCCI japonais figurant dans l'encadré ci-dessus : Ce produit de la classe A respecte les limites des caractéristiques d'immunité définies par le VCCI (Voluntary Control Council for Interference) japonais. Si ce matériel est utilisé dans une zone résidentielle, Remarques 165 il peut créer des perturbations électromagnétiques. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - République populaire de Chine Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. L'utilisateur devra alors prendre les mesures nécessaires pour en éliminer les causes. Avis d'interférences électromagnétiques (EMI) - Taïwan Voici un résumé de l'avis EMI de Taïwan figurant ci-dessus. Avertissement : Ce matériel appartient à la classe A. Il est susceptible d'émettre des ondes radioélectriques risquant de perturber les réceptions radio. Son emploi dans une zone résidentielle peut créer des interférences. L'utilisateur devra alors prendre les mesures nécessaires pour les supprimer. 166 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Avis d'interférences électromagnétiques (EMI) - Corée Cet équipement a obtenu l'agrément EMC en vue de son utilisation commerciale. S'il a été vendu ou acheté par erreur, veuillez l'échanger avec du matériel conforme à un usage domestique. Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Remarques 167 Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Avis d'interférences électromagnétiques (EMI) - Russie Remarques sur la classe B Les avis de conformité de classe B suivants s'appliquent aux dispositifs déclarés comme relevant de la compatibilité électromagnétique (EMC) de classe B dans les informations d'installation des dispositifs. Recommandation de la Federal Communications Commission (FCC) [Etats-Unis] Cet appareil respecte les limites des caractéristiques d'immunité des appareils numériques définies par la classe B, conformément au chapitre 15 de la réglementation de la FCC. La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations électromagnétiques dans les zones résidentielles. Ce matériel génère, utilise et peut émettre de l'énergie radiofréquence. Il risque de parasiter les communications radio s'il n'est pas installé conformément aux instructions du constructeur. Toutefois, il n'est pas garanti que des perturbations n'interviendront pas pour une installation particulière. Si cet appareil provoque des perturbations gênantes dans les communications radio ou télévision, mettez-le hors tension puis sous tension pour vous en assurer. L'utilisateur peut tenter de remédier à cet incident en appliquant une ou plusieurs des mesures suivantes : v Réorienter ou repositionner l'antenne de réception. v Eloigner l'appareil du récepteur. v Brancher l'appareil sur une prise différente de celle du récepteur, sur un circuit distinct. v Prendre contact avec un distributeur agréé IBM ou un représentant commercial IBM pour obtenir de l'aide. Utilisez des câbles et connecteurs correctement blindés et mis à la terre afin de respecter les limites de rayonnement définies par la réglementation de la FCC. Ces câbles et connecteurs sont disponibles chez votre distributeur agréé IBM. IBM ne peut pas être tenue pour responsable du brouillage des réceptions radio ou télévision résultant de modifications non autorisées apportées à cet appareil. Toute modification non autorisée pourra annuler le droit d'utilisation de cet appareil. Cet appareil est conforme aux restrictions définies dans le chapitre 15 de la réglementation de la FCC. Son utilisation est soumise aux deux conditions suivantes : (1) il ne peut pas causer de perturbations électromagnétiques gênantes et (2) il doit accepter toutes les perturbations reçues, y compris celles susceptibles d'occasionner un fonctionnement indésirable. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. 168 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Avis de conformité aux exigences de l'Union européenne Le présent produit satisfait aux exigences de protection énoncées dans la directive 2004/108/CEE du Conseil concernant le rapprochement des législations des Etats membres relatives à la compatibilité électromagnétique. IBM décline toute responsabilité en cas de non-respect de cette directive résultant d'une modification non recommandée du produit, y compris l'ajout de cartes en option non IBM. Ce produit respecte les limites des caractéristiques d'immunité des appareils de traitement de l'information définies par la classe B de la norme européenne EN 55022 (CISPR 22). La conformité aux spécifications de la classe B offre une garantie acceptable contre les perturbations avec les appareils de communication agréés, dans les zones résidentielles. Dans l'Union européenne, contactez : IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Avis de conformité aux exigences du Voluntary Control Council for Interference (VCCI) - Japon Directive relative aux harmoniques confirmée par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits inférieurs ou égaux à 20 A par phase) Directive relative aux harmoniques confirmée avec modifications par l'association japonaise JEITA (Japanese Electronics and Information Technology Industries Association) (produits supérieurs 20 A par phase) Avis d'interférences électromagnétiques (EMI) - Corée Remarques 169 Avis de conformité pour l'Allemagne Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Dispositions Les droits d'utilisation relatifs à ces publications sont soumis aux dispositions suivantes. Usage personnel : Vous pouvez reproduire ces publications pour votre usage personnel, non commercial, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez distribuer ou publier tout ou partie de ces publications ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. Usage commercial : Vous pouvez reproduire, distribuer et afficher ces publications uniquement au sein de votre entreprise, sous réserve que toutes les mentions de propriété soient conservées. Vous ne pouvez reproduire, distribuer, afficher ou publier tout ou partie de ces publications en dehors de votre entreprise, ou en faire des oeuvres dérivées, sans le consentement exprès du fabricant. 170 Armoires, dispositifs d'armoire et installation en armoire de systèmes ou d'unités d'extension Excepté les droits d'utilisation expressément accordés dans ce document, aucun autre droit, licence ou autorisation, implicite ou explicite, n'est accordé pour ces publications ou autres informations, données, logiciels ou droits de propriété intellectuelle contenus dans ces publications. Le fabricant se réserve le droit de retirer les autorisations accordées dans le présent document si l'utilisation des publications s'avère préjudiciable à ses intérêts ou que, selon son appréciation, les instructions susmentionnées n'ont pas été respectées. Vous ne pouvez télécharger, exporter ou réexporter ces informations qu'en total accord avec toutes les lois et règlements applicables dans votre pays, y compris les lois et règlements américains relatifs à l'exportation. LE FABRICANT NE DONNE AUCUNE GARANTIE SUR LE CONTENU DE CES PUBLICATIONS. CES PUBLICATIONS SONT LIVREES EN L'ETAT SANS AUCUNE GARANTIE EXPLICITE OU IMPLICITE. LE FABRICANT DECLINE NOTAMMENT TOUTE RESPONSABILITE RELATIVE A CES INFORMATIONS EN CAS DE CONTREFAÇON AINSI QU'EN CAS DE DEFAUT D'APTITUDE A L'EXECUTION D'UN TRAVAIL DONNE. E1-700, E3-700, E2-700, E2-700T, E4-700 or E4-700T Rack features, and installing systems or expansion units into a rack REFERENCE 86 A1 29FG 01 ESCALA Power7 E1-700, E3-700, E2-700, E2-700T, E4-700 or E4-700T Rack features, and installing systems or expansion units into a rack This publication concerns the following models: - Bull Escala E5-700 (Power 750 / 8233-E8B) - Bull Escala M6-700 (Power 770 / 9117-MMB) - Bull Escala M7-700 (Power 780 / 9179-MHB) - Bull Escala E1-700 (Power 710 / 8231-E2B) - Bull Escala E2-700 / E2-700T (Power 720 / 8202-E4B) - Bull Escala E3-700 (Power 730 / 8231-E2B) - Bull Escala E4-700 / E4-700T (Power 740 / 8205-E6B) References to Power 755 / 8236-E8C models are irrelevant. Hardware September 2010 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A1 29FG 01 The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull SAS 2007-2010 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. The information in this document is subject to change without notice. Bull will not be liable for errors ontained herein, or for incidental or consequential damages in connection with the use of this material. c Contents Safety notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii Racks, rack features, and installing systems or expansion units into a rack . . . . . . 1 What's new in Racks, rack features, and installing systems or expansion units into a rack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Installing the rack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Installing the 7014-T00 or 7014-T42 racks . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Completing a parts inventory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Positioning the rack . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Leveling the rack. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 Attaching the stabilizer brackets. . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Attaching the rack to a concrete floor . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Attaching the rack to the concrete floor beneath a raised floor . . . . . . . . . . . . . . . . . 13 Connecting the power distribution system . . . . . . . . . . . . . . . . . . . . . . . . 17 Checking the ac outlets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 Attaching the front or back ac electrical outlet. . . . . . . . . . . . . . . . . . . . . . . 18 Installing the ac outlet-mounting plates with ac outlets. . . . . . . . . . . . . . . . . . . 18 Installing the ac outlet-mounting plate without ac outlets . . . . . . . . . . . . . . . . . . 20 Connecting a dc power source . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Installing a system or expansion unit into a rack . . . . . . . . . . . . . . . . . 27 Installing the model 8202-E4B or 8205-E6B into a rack . . . . . . . . . . . . . . . . . . . . . 27 Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 Attaching the 8202-E4B or 8205-E6B mounting hardware to the rack . . . . . . . . . . . . . . . 30 Installing the cable-management arm. . . . . . . . . . . . . . . . . . . . . . . . . . 32 Connecting the power cables to the system. . . . . . . . . . . . . . . . . . . . . . . . 35 Installing the model 8231-E2B into a rack . . . . . . . . . . . . . . . . . . . . . . . . . 35 Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36 Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Attaching the 8231-E2B mounting hardware to the rack . . . . . . . . . . . . . . . . . . . 38 Installing the cable-management arm. . . . . . . . . . . . . . . . . . . . . . . . . . 40 Connecting the power cables to the system. . . . . . . . . . . . . . . . . . . . . . . . 43 Installing the model 8233-E8B or 8236-E8C into a rack . . . . . . . . . . . . . . . . . . . . . 43 Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Attaching the 8233-E8B or 8236-E8C mounting hardware to the rack . . . . . . . . . . . . . . . 47 Installing the cable-management arm. . . . . . . . . . . . . . . . . . . . . . . . . . 54 Connecting the power cables to the system. . . . . . . . . . . . . . . . . . . . . . . . 55 Installing the model 5802 or 5877 into a rack . . . . . . . . . . . . . . . . . . . . . . . . 56 Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58 Attaching the mounting hardware to the rack . . . . . . . . . . . . . . . . . . . . . . . 59 Installing the 5802 or 5877 on the rail assembly . . . . . . . . . . . . . . . . . . . . . . 62 Installing the model 5886 into a rack . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65 Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 Attaching the mounting hardware to the rack . . . . . . . . . . . . . . . . . . . . . . 67 Installing the 5886 on the rail assembly . . . . . . . . . . . . . . . . . . . . . . . . 69 Installing the 5796 or 7314-G30 expansion unit in a rack . . . . . . . . . . . . . . . . . . . . 71 Powering off the system and removing power. . . . . . . . . . . . . . . . . . . . . . . 72 Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 © Copyright IBM Corp. 2010 iii Installing the expandable rails and extension bracket into the rack . . . . . . . . . . . . . . . . 73 Installing the shell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76 Installing the expansion unit. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 Attaching the power controller (SPCN), and power cables. . . . . . . . . . . . . . . . . . . 79 Powering on the system . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 Installing the model 5786 or 7031-D24 into a rack. . . . . . . . . . . . . . . . . . . . . . . 79 Completing a parts inventory . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 Determining the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Marking the location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 Attaching the mounting hardware to the rack . . . . . . . . . . . . . . . . . . . . . . . 82 Installing the 5786 or 7031-D24 on the rail assembly. . . . . . . . . . . . . . . . . . . . . 84 Attaching the cable-management arm to the standard rails . . . . . . . . . . . . . . . . . . 87 Removing and replacing 7014-T00 or 7014-T42 side panels . . . . . . . . . . . . . 89 Replacing a 7014-T00 or 7014-T42 side panel . . . . . . . . . . . . . . . . . . . . . . . . 89 Removing and replacing 7014-T00 or 7014-T42 trim panels . . . . . . . . . . . . . 91 Replacing the 7014-T00 or 7014-T42 trim panels . . . . . . . . . . . . . . . . . . . . . . . 91 Attaching the rack doors . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 Attaching a high-perforation front door . . . . . . . . . . . . . . . . . . . . . . . . . . 93 Rack safety notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94 Installing the rack security kit. . . . . . . . . . . . . . . . . . . . . . . . . . 97 Ruggedized kit. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 Releasing the ruggedized brace . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Releasing the side panel with a ruggedized kit . . . . . . . . . . . . . . . . . . . . . . . 102 Connecting multiple racks with rack-to-rack attachment kit . . . . . . . . . . . . 105 Rack status beacon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109 Installing the rack status beacon . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109 Connecting multiple junction boxes . . . . . . . . . . . . . . . . . . . . . . . . . . 111 Installing or removing a rack-mounted system-unit latch bracket . . . . . . . . . . 113 Removing and replacing the rack top cover . . . . . . . . . . . . . . . . . . . 115 Removing the rack top cover . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115 Replacing the rack top cover . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 Power distribution unit plus . . . . . . . . . . . . . . . . . . . . . . . . . . 121 Installing the PDU+ in the side of a rack . . . . . . . . . . . . . . . . . . . . . . . . . 121 Installing the PDU+ horizontally in a rack. . . . . . . . . . . . . . . . . . . . . . . . 123 Setting up power monitoring using the PDU+ . . . . . . . . . . . . . . . . . . . . . . . 127 Using the IBM DPI Configuration Utility . . . . . . . . . . . . . . . . . . . . . . . . 127 Connecting the console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128 Using HyperTerminal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128 Configuration Utility menu options . . . . . . . . . . . . . . . . . . . . . . . . . 128 Setting the IP address . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 Using the Web interface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 Starting the Web interface . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 Modifying the basic settings . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 Changing the superuser name and password. . . . . . . . . . . . . . . . . . . . . 130 Identifying the PDU+ and Web/SNMP card . . . . . . . . . . . . . . . . . . . . . 130 Adding users . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 Changing the date and time . . . . . . . . . . . . . . . . . . . . . . . . . . 130 Changing event alerts . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 iv Installing into a rack Changing the network information . . . . . . . . . . . . . . . . . . . . . . . . . 131 Changing the network configuration . . . . . . . . . . . . . . . . . . . . . . . 131 History and event log summaries . . . . . . . . . . . . . . . . . . . . . . . . . 131 Viewing the history log . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131 Viewing the event log . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131 Removing and replacing the expansion unit cover or door. . . . . . . . . . . . . 133 Removing the front cover on the 7314-G30 or 5796. . . . . . . . . . . . . . . . . . . . . . 133 Installing the front cover on the 7314-G30 or 5796 . . . . . . . . . . . . . . . . . . . . . . 133 Removing and replacing covers on the 8233-E8B or 8236-E8C system. . . . . . . . 135 Removing the service access cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . 135 Installing the service access cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . 136 Removing the front cover from an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . 136 Installing the front cover on an 8233-E8B or 8236-E8C system . . . . . . . . . . . . . . . . . . 137 Removing and replacing the front cover on a 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . 138 Removing the front cover from the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . . 138 Installing the front cover on the 9117-MMB or 9179-MHB . . . . . . . . . . . . . . . . . . . 139 Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141 Placing the rack-mounted system or expansion unit in the service position. . . . . . . . . . . . . . 141 Placing the rack-mounted system or expansion unit in the operating position . . . . . . . . . . . . . 144 Placing a 8233-E8B or 8236-E8C in the service position . . . . . . . . . . . . . . . . . . . . 145 Placing the 8233-E8B or 8236-E8C in the operating position . . . . . . . . . . . . . . . . . . . 148 Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151 Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152 Electronic emission notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152 Class A Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152 Class B Notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156 Terms and conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159 Contents v vi Installing into a rack Safety notices Safety notices may be printed throughout this guide: v DANGER notices call attention to a situation that is potentially lethal or extremely hazardous to people. v CAUTION notices call attention to a situation that is potentially hazardous to people because of some existing condition. v Attention notices call attention to the possibility of damage to a program, device, system, or data. World Trade safety information Several countries require the safety information contained in product publications to be presented in their national languages. If this requirement applies to your country, a safety information booklet is included in the publications package shipped with the product. The booklet contains the safety information in your national language with references to the U.S. English source. Before using a U.S. English publication to install, operate, or service this product, you must first become familiar with the related safety information in the booklet. You should also refer to the booklet any time you do not clearly understand any safety information in the U.S. English publications. German safety information Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne § 2 der Bildschirmarbeitsverordnung geeignet. Laser safety information IBM® servers can use I/O cards or features that are fiber-optic based and that utilize lasers or LEDs. Laser compliance IBM servers may be installed inside or outside of an IT equipment rack. © Copyright IBM Corp. 2010 vii DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER viii Installing into a rack Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Safety notices ix CAUTION: Removing components from the upper positions in the rack cabinet improves rack stability during relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a room or building: v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack cabinet. When possible, restore the rack cabinet to the configuration of the rack cabinet as you received it. If this configuration is not known, you must observe the following precautions: – Remove all devices in the 32U position and above. – Ensure that the heaviest devices are installed in the bottom of the rack cabinet. – Ensure that there are no empty U-levels between devices installed in the rack cabinet below the 32U level. v If the rack cabinet you are relocating is part of a suite of rack cabinets, detach the rack cabinet from the suite. v Inspect the route that you plan to take to eliminate potential hazards. v Verify that the route that you choose can support the weight of the loaded rack cabinet. Refer to the documentation that comes with your rack cabinet for the weight of a loaded rack cabinet. v Verify that all door openings are at least 760 x 230 mm (30 x 80 in.). v Ensure that all devices, shelves, drawers, doors, and cables are secure. v Ensure that the four leveling pads are raised to their highest position. v Ensure that there is no stabilizer bracket installed on the rack cabinet during movement. v Do not use a ramp inclined at more than 10 degrees. v When the rack cabinet is in the new location, complete the following steps: – Lower the four leveling pads. – Install stabilizer brackets on the rack cabinet. – If you removed any devices from the rack cabinet, repopulate the rack cabinet from the lowest position to the highest position. v If a long-distance relocation is required, restore the rack cabinet to the configuration of the rack cabinet as you received it. Pack the rack cabinet in the original packaging material, or equivalent. Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the pallet. (R002) (L001) (L002) x Installing into a rack (L003) or All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: This product might contain one or more of the following devices: CD-ROM drive, DVD-ROM drive, DVD-RAM drive, or laser module, which are Class 1 laser products. Note the following information: v Do not remove the covers. Removing the covers of the laser product could result in exposure to hazardous laser radiation. There are no serviceable parts inside the device. v Use of the controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation exposure. (C026) Safety notices xi CAUTION: Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than Class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. (C027) CAUTION: This product contains a Class 1M laser. Do not view directly with optical instruments. (C028) CAUTION: Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following information: laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. (C030) Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: The equipment is suitable for installation in the following: v Network telecommunications facilities v Locations where the NEC (National Electrical Code) applies The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the interfaces that connect to the OSP (outside plant) or its wiring. These interfaces are designed for use as intrabuilding interfaces only (Type 2 or Type 4 ports as described in GR-1089-CORE) and require isolation from the exposed OSP cabling. The addition of primary protectors is not sufficient protection to connect these interfaces metallically to OSP wiring. Note: All Ethernet cables must be shielded and grounded at both ends. The ac-powered system does not require the use of an external surge protection device (SPD). The dc-powered system employs an isolated DC return (DC-I) design. The DC battery return terminal shall not be connected to the chassis or frame ground. xii Installing into a rack Racks, rack features, and installing systems or expansion units into a rack Learn about the procedures used to install racks. Also detailed procedures are provided for installing rack features, systems and expansion units into a rack. You can perform these tasks or contact a service provider to perform the tasks for you. You might be charged a fee by the service provider for this service. © Copyright IBM Corp. 2010 1 2 Installing into a rack What's new in Racks, rack features, and installing systems or expansion units into a rack Read about new or significantly changed information in Racks, rack features, and installing systems or expansion units into a rack since the previous update of this topic collection. September 2010 Content updates include the following: v Added procedures for “Installing the model 8202-E4B or 8205-E6B into a rack” on page 27. v Added procedures for “Installing the model 8231-E2B into a rack” on page 35. v Added procedures for “Removing and replacing 7014-T00 or 7014-T42 trim panels” on page 91. © Copyright IBM Corp. 2010 3 4 Installing into a rack Installing the rack This section describes all the tasks required to install 7014-T00 or 7014-T42 racks. The full spectrum of related tasks is described, from completing a parts inventory to finally connecting to a dc power source. Installing the 7014-T00 or 7014-T42 racks Use this information to install the 7014-T00 or 7014-T42 racks. If you are installing a rack security kit in this rack, see “Installing the rack security kit” on page 97 after you have installed the rack. Before installing a rack, read the “Rack safety notices” on page 94. Completing a parts inventory Before beginning the rack installation it is a good idea to do a parts inventory. This section guides you in performing this task. If you have not done so, complete a parts inventory before installing the unit in the rack: 1. Locate the kitting report in an accessory box. 2. Ensure that you received all of the features that you ordered and all of the parts on the kitting report. If there are incorrect, missing, or damaged parts, contact: v Your IBM reseller v IBM support (see Directory of worldwide contacts website at http://www.ibm.com/planetwide for contact information for your country) v IBM Rochester Manufacturing Automated Information Line at 1–800–300–8751 (United States only) Positioning the rack Proper rack positioning is needed to comply with safety and regulatory requirements. Use the procedure in this section to perform this task. After the rack has been placed into its location on the floor, lock each caster by tightening the locking screw. See the following illustration for the locking screw location (2). Remove all the tape and packing materials from the rack. © Copyright IBM Corp. 2010 5 1 Caster 2 Locking screw Use the following to determine the next step: v To bolt the rack to a concrete floor, go to “Attaching the rack to a concrete floor” on page 8. v To bolt the rack to a concrete floor beneath a raised floor, go to “Attaching the rack to the concrete floor beneath a raised floor” on page 13. v To attach the rack to the floor, go to “Leveling the rack.” Leveling the rack If you need to level the rack, use the procedure described in this section. To level the rack, complete the following steps: 1. Loosen the jam nut on each leveling foot. 2. Rotate each leveling foot downward until it contacts the surface on which the rack is placed. 3. Adjust the leveling feet downward as needed until the rack is level. When the rack is level, tighten the jam nuts against the base. Figure 1. Tightening the locking screw 6 Installing into a rack 1 Rack Front (base) 2 Leveling Foot (quantity 4) 3 Jam Nut (quantity 4) Attaching the stabilizer brackets You might need to attach the stabilizer brackets to the rack. This section helps you determine whether stabilizer brackets are necessary and describes how to attach them if needed. If the front or back ac electrical outlets are going to be installed in the rack, you cannot attach the stabilizer brackets. The rack must be bolted to the floor. Stabilizer brackets are used only if you will not be bolting the rack to the floor. If you are going to bolt the rack to the floor, go to “Attaching the rack to a concrete floor” on page 8. To attach the stabilizer brackets to the bottom of the rack, do the following: Note: Before installing the stabilizer brackets, see “Attaching the front or back ac electrical outlet” on page 18 for instruction on installing the ac outlet-mounting plates. 1. Align the slots of one of the stabilizer brackets with the mounting holes at the bottom front of the rack. 2. Install the two mounting screws. 3. Ensure that the base of the stabilizer bracket rests firmly on the floor. Use the Allen wrench that was supplied with the rack to tighten the mounting screws alternately until they are tight. Figure 2. Adjusting the leveling feet Installing the rack 7 1 Rack front (base) 3 Stabilizer bracket 2 Stabilizer mounting screws 4 Rack rear (base) 4. To install the second stabilizer bracket on the back of the rack, repeat steps 1 through 3. Attaching the rack to a concrete floor If you plan to install the front or back ac electrical outlets in the rack, the rack must be bolted to the floor. This section describes how to perform this task for a concrete floor. Obtain the services of a mechanical contractor to attach the rack-mounting plates to the concrete floor. The mechanical contractor must determine that the hardware being used to secure the rack-mounting plates to the concrete floor is sufficient to meet the requirements for the installation. To attach the rack to a concrete floor, do the following: 1. Put the rack in its predetermined location, and tighten the locking screws on the casters. 2. If they are installed, remove the top, left, and right trim panels. The trim panels are held in place with spring clips. See the following illustration. Figure 3. Attaching the stabilizer brackets 8 Installing into a rack 1 Rack chassis 4 Right-side trim panel 2 Top trim panel 5 Spring clip 3 Left-side trim panel 3. If they are installed, remove the front and back doors. After the rack doors have been removed, go to the next substep. To remove a rack door: a. Unlock and open the door. b. Grasp the door firmly with both hands and pull it away from the hinges. 4. Locate the hardware mounting kit and the two mounting plates. Refer to the following illustration when reviewing the contents of the hardware mounting kit. The hardware mounting kit contains the following: v 4 Rack-mounting bolts v 4 Thin washers v 8 Plastic isolator bushings v 4 Thick washers v 4 Spacers 5. If you are installing an ac-powered rack, temporarily install the lower plastic isolator bushings to help you locate the mounting locations for the stabilizer bracket. After the stabilizer bracket has been Figure 4. Removing the trim panels Installing the rack 9 correctly located, remove the lower plastic isolator bushings. 1 Rack chassis 7 Jam nut 2 Rack-mounting bolt 8 Leveling foot 3 Thin washer 9 Lower plastic isolator bushing (used only on dc powered systems) 4 Top plastic isolator bushing AC Typical leveling foot installation for an ac-powered rack 5 Thick washer DC Typical leveling foot installation for an dc-powered rack 6 Spacer 6. Position the two mounting plates in the approximate mounting location under the rack. 7. Create a rack-mounting bolt assembly by adding the following items, in the order listed, to each rack-mounting bolt. a. Thin washer b. Top plastic isolator bushing c. Thick flat washer d. Spacer 8. Insert a rack-mounting bolt assembly through each of the leveling feet. 9. Reposition the rack-mounting plates under the four rack-mounting bolts so that the mounting bolts are centered directly over the threaded bolt holes. 10. Turn the rack-mounting bolts four complete turns into the mounting plate's threaded bolt holes. Figure 5. Installing ac-power mounting plates 10 Installing into a rack 1 Rack-mounting bolt 7 Leveling foot 2 Thin washer 8 Lower plastic isolator bushing (Used only on dc powered systems) 3 Top plastic isolator bushing 9 Mounting plate 4 Thick washer 10 Threaded hole (Used to secure the rack to stabilizer bracket.) 5 Spacer 11 Anchor bolt hole 6 Jam nut 12 Traced pattern (Pattern to be traced onto the floor using the stabilizer bracket as a template) 11. Mark the floor around the edges of both stabilizer brackets. 12. Mark the plate bolt-down holes that are accessible through the opening in the rear of the rack. 13. Remove the rack-mounting bolt assemblies. 14. If you are installing an ac-powered rack, remove the bottom isolator bushing from each of the leveling feet. 15. Remove the stabilizer brackets from the marked locations. 16. Loosen each of the locking screws on the casters. 17. Move the rack so that it is clear of both areas that were marked on the floor for the stabilizer bracket locations. 18. Reposition the stabilizer brackets within the marked areas. 19. Mark the floor at the center of all holes in both stabilizer brackets. 20. Remove the two rack-mounting plates from the marked areas. 21. At the marked location of the threaded rack-mounting bolt holes, drill four clearance holes into the concrete floor. Each clearance hole should be approximately 1-inch deep. This depth allows the rack-mounting bolts enough room to protrude past the thickness of the stabilizer brackets. Figure 6. Securing the rack to the floor Installing the rack 11 Note: You must use a minimum of two anchor bolts for each rack-mounting plate to securely attach the plate to the concrete floor. Because some of the holes in each rack-mounting plate might align with concrete reinforcement rods embedded in the concrete, some of the rack-mounting plate holes might not be usable. 22. Select at least two suitable hole locations for each stabilizer bracket bolt. The selected locations should be as close to the threaded bolt holes as possible. Be sure that the holes selected at the back of the rack are accessible. Drill holes at the selected locations into the concrete floor. 23. Position the stabilizer brackets over the concrete anchors. 24. Securely bolt the front stabilizer bracket to the concrete floor. 25. Position the stabilizer bracket over the concrete anchors. 26. Securely bolt the back stabilizer bracket to the concrete floor. Note: The size of the anchor bolts and concrete anchors must be determined by the mechanical contractor who will be installing the rack-mounting plate. 27. Position the rack over the stabilizer bracket. 28. Insert each of the stabilizer bracket bolts through a flat washer, a plastic isolator bushing and a thick washer, and through a leveling foot. 29. Align the four stabilizer brackets bolts with the four tapped holes in the two mounting plates and turn three to four rotations. 30. Tighten the locking screw on each caster. 31. Adjust the leveling feet downward as needed until the rack is level. When the rack is level, tighten the jam nuts against the base of the rack. 1 Rack front (base) 2 Leveling foot (quantity 4) 3 Jam nut (quantity 4) 32. If you have multiple racks that are connected in a suite (bolted to each other), go to “Connecting multiple racks with rack-to-rack attachment kit” on page 105. Otherwise, torque the four bolts to 40-50 ft-lbs (54-67 nm). Figure 7. Adjusting the leveling feet 12 Installing into a rack 33. If you are not installing doors on your rack, install the top, left, and right trim panel. 34. Connect the power distribution system as described in “Connecting the power distribution system” on page 17. 35. After all racks are bolted down, go to “Attaching the front or back ac electrical outlet” on page 18. 36. If you are not going to attach a front electrical outlet and you are installing rack doors, go to “Attaching the rack doors” on page 93 Attaching the rack to the concrete floor beneath a raised floor If you plan to install front or back ac electrical outlets in the rack, the rack must be bolted to the floor. When you attach the rack to a concrete floor, which lays beneath a raised floor, follow the procedure described in this section. Obtain the services of a mechanical contractor to attach the rack-mounting plates to the concrete floor. The mechanical contractor needs to determine that the hardware being used to secure the rack-mounting plates to the concrete floor is sufficient to meet the requirements for the installation. To attach the rack to a concrete floor beneath a raised floor, do the following: 1. Put your rack in its predetermined location and tighten the locking screws on the casters. 2. If installed, remove the top, left and right trim panels. The trim panels are held in place with spring clips. See the following illustration. 1 Rack chassis 4 Right-side trim panel 2 Top trim panel 5 Spring clip 3 Left-side trim panel Installing the rack 13 3. If installed, remove the front and rear doors. To remove a rack door, go to . After the rack doors have been removed, go to the next substep. 4. Locate the hardware mounting kit and the two mounting plates. Refer to the following illustration when reviewing the contents of the hardware mounting kit. The hardware mounting kit contains the following: v Four rack-mounting bolts v Four thin washers v Eight plastic isolator bushings v Four thick washers v Four spacers 5. If you are installing an ac-powered rack, temporarily install the lower plastic isolator bushings to help you locate the rack-mounting plate. After the mounting plate has been correctly located, remove the lower plastic isolator bushings. 1 Rack chassis 7 Jam nut 2 Rack-mounting bolt 8 Leveling foot 3 Thin washer 9 Lower plastic isolator bushing (used only on dc powered systems) 4 Top plastic isolator bushing AC Typical leveling foot installation for an ac-powered rack 5 Thick washer DC Typical leveling foot installation for an dc-powered rack 6 Spacer Figure 8. Removing the trim panels Figure 9. Installing the ac power-mounting plates 14 Installing into a rack 6. Position the two mounting plates in the approximate mounting location under the rack. 7. Create a rack-mounting bolt assembly by adding the following items, in the order listed, to each rack-mounting bolt. a. Thin washer b. Top plastic isolator bushing c. Thick flat washer d. Spacer 8. Insert a rack-mounting bolt assembly through each of the leveling feet. 9. Reposition the rack-mounting plates under the four rack-mounting bolts so that the mounting bolts are centered directly over the threaded bolt holes. 10. Turn the rack-mounting bolts four complete turns into the mounting plate's threaded bolt holes. 11. Mark the raised-floor panel around the edges of front and back rack-mounting plates. 12. Mark the plate bolt-down holes that are accessible through the opening in the back of the rack. 13. Remove the rack-mounting bolt assemblies. 14. If you are installing an ac-powered rack, remove the bottom isolator bushing from each of the leveling feet. 15. Remove the rack-mounting plates from the marked locations. 16. Loosen each of the locking screws on the casters. 17. Move the rack so that it is clear of both areas that were marked on the floor for the rack-mounting plate locations. 18. Reposition the mounting plates within the marked areas. 19. Mark the raised-floor panel at the center of each hole in the rack-mounting plates (including the tapped holes). 20. Remove the two rack-mounting plates from the marked locations on the raised floor panel. 21. Drill two clearance holes on each end of each rack-mounting plate. The drilled holes should be approximately 1-inch deep. This depth will accommodate any rack-mounting bolt extending past the rack-mounting plate when securing the rack to the rack-mounting plate. 22. For each rack-mounting plate, select at least two suitable hole locations. Select the hole locations as close to the threaded hole areas as possible. Be sure the hole locations selected at the back of the rack are accessible. 23. Drill pass-through holes in the raised-floor panel. The pass-through holes allow the anchor bolts to be inserted into the rack-mounting plate and pass through the raised floor panel to the concrete floor. Note: You must use a minimum of two anchor bolts for each rack-mounting plate to securely attach the rack-mounting plate through the raised-floor panel to the concrete floor. Because some of the holes in each rack-mounting plate may align with concrete reinforcement rods imbedded in the concrete, some of the rack-mounting plate holes may not be usable. 24. Transfer the locations of the anchor bolt holes (exclude the clearance holes drilled for the rack-mounting bolts ) from the raised-floor panel to the concrete floor directly beneath, and mark the hole locations on the concrete floor. 25. Drill holes in the concrete floor to secure the anchor bolts. 26. Position the raised-floor panel back into position over the anchor bolt holes. 27. Position the front stabilizer bracket within the marked area on the raised-floor panel. 28. Using your anchor bolts, secure the front stabilizer brackets on top of the raised floor and through to the concrete floor. 29. Position the rear stabilizer brackets within the marked area on the raised-floor panel. Installing the rack 15 1 Rack-mounting bolt 7 Leveling foot 2 Thin washer 8 Lower plastic isolator bushing (used only on dc-powered systems) 3 Top plastic isolator bushing 9 Stabilizer brackets 4 Thick washer 10 Threaded hole (used to secure the rack to mounting plate.) 5 Spacer 11 Anchor bolt hole 6 Jam nut 12 Traced pattern (pattern to be traced onto the floor using the mounting plate as a template) 30. Using your anchor bolts, secure the back stabilizer bracket on top of the raised floor and through to the concrete floor. 31. Replace all raised-floor panels that may have been removed when aligning and securing the anchor bolts to the concrete floor. 32. Align the rack over the front and back stabilizer brackets. 33. Insert each of the bolt assemblies through a leveling foot. 34. Align the rack-mounting bolts with the threaded holes in each stabilizer bracket. Turn each bolt three to four rotations. 35. Tighten the locking screw on each caster. 36. Adjust the leveling feet downward as needed until the rack is level. When the rack is level, tighten the jam nuts against the base of the rack. 37. If you have multiple racks that are connected as a suite (bolted to each other), go to “Connecting multiple racks with rack-to-rack attachment kit” on page 105. Otherwise, torque the four bolts to 40-50 ft-lbs (54-67 nm). 38. If you are not installing doors on your rack, install the top, left, and right trim panel. Figure 10. Securing the rack to the floor 16 Installing into a rack 39. Connect the power distribution system as described in “Connecting the power distribution system.” 40. After the rack is bolted down and you are going to attach a front electrical outlet, go to “Attaching the front or back ac electrical outlet” on page 18. 41. If you are not going to attach a front electrical outlet and you are installing rack doors, go to “Attaching the rack doors” on page 93. Connecting the power distribution system You can use a power distribution system to monitor the individual power loads of the devices that are plugged into it. Use the procedure in this section to connect this system. To connect a Power Distribution Unit, see “Power distribution unit plus” on page 121. Checking the ac outlets To help ensure safety and reliable operation, you should check the ac outlets. Use the procedure in this section to perform this task. Before you begin, ensure that you have a multimeter to check voltages and an appropriately approved ground-impedance tester to test the grounding resistances. Note: Use only an appropriately approved ground-impedance tester to test the grounding resistances. Do not use a multimeter to measure grounding resistance. Before plugging the rack into the ac power source, complete the following checks on the ac power source: 1. Turn off the branch circuit breaker for the ac power outlet that the rack will plug into. To the circuit breaker switch, attach tag S229-0237, which reads "Do Not Operate." Note: All measurements are made with the receptacle faceplate in the usual installed position. 2. Some receptacles are enclosed in metal housings. For this type of receptacle, do the following steps: a. Using a multimeter, check for less than 1 volt from the receptacle case to any grounded metal structure in the building, such as a raised-floor metal structure, water pipe, building steel, or similar structure. b. Using a multimeter, check for less than 1 volt from the receptacle ground pin to a grounded point in the building. Note: If the receptacle case or faceplate is painted, be sure the probe tip penetrates the paint and makes good electrical contact with the metal. c. Using a multimeter, check the resistance from the receptacle ground pin to the receptacle case. Check resistance from the ground pin to the building ground. The readings should be less than 1.0 ohm, which indicates the presence of a continuous grounding conductor. 3. If any of the checks made in step 2 are not correct, remove the power from the branch circuit and make the wiring corrections. Recheck the receptacle after the wiring is corrected. 4. Using a ground-impedance tester, check for infinite resistance between the ground pin of the receptacle and each of the phase pins. This is a check for a wiring short to ground or a wiring reversal. 5. Using a ground-impedance tester, check for infinite resistance between the phase pins. This is a check for a wiring short. 6. Turn on the branch circuit breaker. 7. Using a multimeter, measure for the appropriate voltages between phases. If no voltage is present on the receptacle case or grounded pin, the receptacle is safe to touch. 8. Using a multimeter, verify that the voltage at the ac outlet is correct. Installing the rack 17 Attaching the front or back ac electrical outlet If you need to attach an ac outlet, you can use the procedure in this section to perform this task. Attention: The front and back ac outlet-mounting plates mount through the same mounting holes in that secure the stabilizer brackets to the rack chassis. Therefore, if the rack must be bolted to the floor, the stabilizer brackets must be removed. Install the ac outlet-mounting plates only after the rack has been bolted to the floor and the stabilizer brackets have been removed. The following items are installed at the customer's site: v The ac outlet-mounting plates for installing customer-supplied ac electrical outlets on the front or back of the rack. The ac outlet-mounting place provides the mounting location for an ac electrical outlet. v The brass ground lug for an electrostatic discharge (ESD) connection. Note: The customer is responsible for providing both the outlets and the power cables that attach to the power source. The customer is also responsible for connecting the ac outlet correctly. These items are not field-replaceable units (FRUs). Installing the ac outlet-mounting plates with ac outlets If you choose to install ac mounting plates, you can follow the procedure detailed in this section to perform this task. This section also includes illustrations of the related hardware components and shows how these components relate to each other. If you do not want ac outlets installed on the rack, go to “Installing the ac outlet-mounting plate without ac outlets” on page 20. If you want ac outlets installed on the front or back ac outlet-mounting plate, do the following: 1. Determine the number of ac outlets that you are installing. 2. Confirm with your contractor that the number and location of ac outlets to be installed are correct. 3. Remove the blank filler plates from the ac outlet-mounting plates for the number of ac outlets being installed. 4. Install the ac outlets on the ac outlet-mounting plate. 5. Install the ground lug in the ac outlet-mounting plate using only one nut, as shown in the following illustration. 6. Securely tighten the one nut on the ground lug. 7. Locate the "Y"-shaped ground cable supplied with the mounting plate. Note: The remaining steps can be used to install ac outlets on the front or the back of the rack. 8. Place the star washer onto the ground lug of the front ac outlet-mounting plate. 9. Place the lug on the long end of the ground cable onto the ground lug. 10. Place a ground lug nut onto the ground lug and securely tighten it. 11. Position the front ac outlet-mounting plate onto the rack frame with the ground lug fully inserted through the mounting holes in the rack. 12. Route the cable under the rack. 13. Place the star washer onto the ground lug of the back ac outlet-mounting plate. 14. Place the lug on the short end of the ground cable onto the ground lug. 15. Place a ground lug nut onto the ground lug and securely tighten it. 18 Installing into a rack 1 Ground cable lug 7 Ground lug 2 Star washer 8 Ground connector (short end of ground cable) 3 Front of rack 9 "Y" End of ground cable 4 Power cable from the power source 10 Ground lug nut (quantity 2) 5 Mounting plate 11 Long end of ground cable 6 Long end of ground cable 12 Ground lug nut (quantity 2) 16. Position the back ac outlet-mounting plate onto the rack frame with the ground lug fully inserted through the mounting holes in the rack. 17. Install the front ac outlet-mounting plate screws (stabilizer mounting screws) into the mounting plate and through the mounting holes in the rack. Securely tighten the screws. 1 Front or back of rack (as applicable) 4 Mounting plate 2 Power cable from power source 5 Allen wrench Figure 11. Installing the ground lug Installing the rack 19 3 Button-head screw 6 Long end of ground cable 18. Connect the "Y"-shaped end of the ground cable to the rack frame, either near the center in the back of the rack or to the ground bus bar at the back of the rack. 19. Install the back ac outlet-mounting plate screws (stabilizer mounting screws) into the mounting plate and through the mounting holes in the rack. Securely tighten the screws. Note: The bus bar might be located at either the top or bottom of the rack. 1 Bus bar mounting plate 3 Hex screws (M5 x 20) (quantity 2) 2 Lock washer (quantity 2) 4 Ground bus bar Installing the ac outlet-mounting plate without ac outlets If you choose to install an outlet plate without the outlets, use the procedure in this section to perform this task. Figure 12. Installing the front mounting plate Figure 13. Installing the back mounting plate 20 Installing into a rack If you do not want any ac outlets installed on the front or rear ac outlet-mounting plate, perform only “Attaching the rack to the concrete floor beneath a raised floor” on page 13 through “Connecting the power distribution system” on page 17. For front or rear ac outlet-mounting plates, refer to “Installing the ac outlet-mounting plates with ac outlets” on page 18. Connecting a dc power source Some rack models (such as the 7014-T00) can support a dc power configuration for systems that require dc power. If you decide to connect a dc power source to the rack, you can use the procedure in this section to perform this task. This section also includes illustrations of the related hardware components and shows how these components relate to each other. Note: The customer is responsible for providing and connecting the -48 V dc power source and -48 V dc power return cables from the customer's source -48 V dc to the bus bars in the power distribution panel. The customer is also responsible for connecting the ground cable to the rack frame. This procedure provides information about accessing the power distribution panel. 1. Remove the six mounting screws from the top cover of the dc power distribution panel and remove the top cover. 2. If they are installed, remove the four screws from the cable channel cover. 3. Remove the cable channel cover. 1 Cable channel cover retaining screw 5 Shield 2 Cable channel cover 6 Power distribution panel 3 Power distribution panel top cover retaining screws 4 Power distribution panel top cover 4. Remove the -48 V dc bus bar shield from the power distribution panel. Figure 14. Removing the cable channel cover Installing the rack 21 Attention: The bus bar shield must be correctly reinstalled over the -48 V dc return bus bars to protect against injury while servicing the power distribution panel. 5. Ensure that the following steps are performed when connecting the dc power source. a. At -48 V dc power source, turn off any -48 V dc power sources that will be connected to the power distribution panel. b. After the -48 V dc power sources are turned off, be sure there is a tag or label over the power source switches or fuses (lock-out/tag-out) to indicate that the power source is turned off intentionally. Note: Ensure that any oxidation on the copper bus bars is removed. c. If this is a raised-floor installation and you are working at the back of the rack, route the power cables up the rack's right side. d. Ensure that the external -48 V dc power cable is connected correctly to the -48 V dc bus bar. e. Ensure that the external -48 V dc return cable is routed correctly and installed on the return bus bar. 1 -48 V dc power cable and return power cable 2 Power distribution panel 3 Front of rack 4 -48 V dc power cable and return power cable Figure 15. Routing the power cables 22 Installing into a rack 1 Front of power distribution panel 6 (B) Return (-) power cable 2 (A) -48 V dc (-) bus bar 7 (B) -48 V dc (-) power cable 3 (A) -48 V dc (-) power cable 8 (B) Return (-) bus bar 4 (A) Return (-) bus bar 9 (B) -48 V dc (-) bus bar 5 (A) Return (-) power cable f. If you want to install a power status alarm, connect the alarm cable to the terminal board on the back cover of the dc power distribution panel. Note: Ensure that you remove the oxidation on the copper bus bars. g. Ensure that the power-source ground cable is routed correctly and connects the power-source ground cable to the copper bar at the lower-back or upper-back center of the rack. h. If the rack is on a raised floor, attach the -48 V dc power source cables to the back of the rack with cable-restraint straps. Figure 16. Return bus bar Installing the rack 23 1 Back view of rack (dc) 2 Power cable, power return cable, and ground 3 Ground cable (Install at either top or bottom of the rack) 6. Reinstall the -48 V dc bus bar shield. 7. Reinstall the top cover on the dc power distribution panel. 8. Reinstall the cable channel cover. 1 Cable channel cover Figure 17. Cable locations 24 Installing into a rack 2 Terminal block (both sides) 3 Power distribution panel 4 Front of rack Figure 18. Reinstalling the cable channel cover Installing the rack 25 26 Installing into a rack Installing a system or expansion unit into a rack You might need to install a system or expansion unit into a rack. In addition to important related safety information, this section provides procedures that you can follow to perform these tasks. This section also includes illustrations of the related hardware components and shows how these components relate to each other. The following procedures describe how to install system units or expansion units into a rack. You can perform this task or contact a service provider to perform the task for you. You might be charged a fee by the service provider for this service. Installing the model 8202-E4B or 8205-E6B into a rack You might need to install the system into a rack. Use the procedure in this section to perform this task. This is a customer task. You can perform this task yourself, or contact a service provider to perform the task for you. You might be charged a fee by the service provider for this service. Important: To complete this procedure, use two people when attaching the rail assembly to the rack, one in front of the rack and one at the rear of the rack. Use three people when lifting the system unit into the rack. This procedure assumes that you are installing the system into an existing rack. If the rack is not installed, go to the instructions for “Installing the rack” on page 5 and then return to this procedure for instructions on installing the system unit in a rack. Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not attempt to install a stand-alone model in a rack. To install the model 8202-E4B or 8205-E6B into a rack, complete the following steps: CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. 1. Read the “Rack safety notices” on page 94. 2. Complete a parts inventory. The following figure shows the items that you need to install the server in the rack cabinet. © Copyright IBM Corp. 2010 27 If there are incorrect, missing, or damaged parts, contact: v Your IBM reseller v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only) v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your service and support telephone numbers.) 3. Locate the rack-mounting hardware kit (A) and the rack rails (B) that were included with your system unit as shown in Figure 19. The system rails (B) are front-to-back and left-to-right side dependent. The rails are labeled left and right to indicate their placement when you face the front of the rack. The back of each rail has two large latch assemblies. These latch assemblies go in the back of the rack. 4. Determine where you will locate the system unit in the rack. See “Determining the location.” 5. Attach mounting hardware to the rack. For more information, see Attach the mounting hardware to the rack. Determining the location You might need to determine where to install the system in the rack. Use this procedure to perform this task. Before installing the system unit into a rack, complete the following steps: 1. Read the “Rack safety notices” on page 94. 2. Plan where to place the units. Place the larger and heavier units in the lower part of the rack. This system unit is four Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75 in.) in height. The rack contains three mounting holes for each EIA unit of height. This system unit therefore is 177.8 mm (7 in.) high and covers 12 mounting holes in the rack. 3. If necessary, remove the filler panels to provide access to the inside of the rack enclosure where you plan to place the unit. Figure 19. Parts inventory 28 Installing into a rack 4. If necessary, remove the front and rear rack doors. Marking the location You might need to mark the installation location. Use this procedure to perform this task. To mark the installation location and install the nut clips into a rack, complete the following steps: 1. Determine where in the rack to place the system. Install units in the lower part of the rack first. Place larger and heavier units in the lower part of the rack. Record the EIA location. The system is four Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75 in.) in height. The rack contains three mounting holes for each EIA unit of height. This system therefore is 177.8 mm (7 in.) high and covers 12 mounting holes in the rack. 2. Facing the front of the rack and working from the right side, locate the bottom EIA unit that your expansion unit uses. Record the EIA location. Use tape, a marker, or a pencil to mark the bottom hole of this EIA unit (A). Mark the rack so that the mark can also be seen from the rear of the rack. Figure 20. Removing the filler panels Installing a system or expansion unit into a rack 29 3. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B). Mark the rack so that the mark can also be seen from the rear of the rack, as shown in Figure 21. 4. Repeat the previous two steps on the left side of the rack. 5. Facing the rear of the rack and working from the right side, locate the bottom EIA unit that your expansion unit uses. Make a mark next to the bottom hole of this EIA unit (A). 6. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B), as shown in Figure 21. 7. Begin with the hole identified by the mark (B). Count up one hole and place a third mark (C), as shown in Figure 21. 8. Repeat the previous two steps on the left side of the rack. Attaching the 8202-E4B or 8205-E6B mounting hardware to the rack You might need to attach the mounting hardware to the rack. Use the procedure in this section to perform this task. Attention: To avoid rail failure and potential danger to yourself and to the unit, ensure that you have the correct rails and fittings for your rack. If your rack has square support flange holes or screw-thread support flange holes, ensure that the rails and fittings match the support flange holes used on your rack. Do not install mismatched hardware using washers or spacers. If you do not have the correct rails and fittings for your rack, contact your IBM reseller. Also, to install the rails correctly, perform each task in the following order. To install the rack-mounting hardware into the rack, complete the following steps: 1. Each slide rail is marked with either an R (right) or an L (left). Select one of the slide rails and push up on the front movable tab (1). Pull out the front latch (2) to slide out the front side rail. If a thumbscrew is installed in the slide rail (3), remove it. Figure 21. Marking holes on the front and rear of the rack frame 30 Installing into a rack Note: Make sure that the movable tab remains extended and does not click back into place. 2. Align the three pins on the rear of the slide rail with the three holes in the selected EIA unit on the rear of the rack. Push the rails so that the pins go into the holes (1), and drop the slide rail down (2) until it latches into place. 3. Pull the slide rail forward and insert the two pins (1) on the front of the rail into the two lower holes in the U on the front of the rack. Drop the rail into place until it clicks into place. Push the front latch (2) in all the way. 4. Repeat the previous three steps to install the other rail into the rack. Make sure that each front latch is fully seated. Figure 22. Slide rail and movable tab Figure 23. Align the pins with the holes in the rear of the rack Figure 24. Rack front rail and pins Installing a system or expansion unit into a rack 31 5. Pull the slide rails forward (1) until they click, twice, into place. Carefully lift the server and tilt it into position over the slide rails so that the rear nail heads (2) on the server line up with the rear slots (3) on the slide rails. Slide the server down until the rear nail heads slip into the two rear slots, and then slowly lower the front of the server (4) until the other nail heads slip into the other slots on the slide rails. Make sure that the front latch (5) slides over the nail heads. 6. Lift the blue release latches (1) on the slide rails and push the server (2) all the way into the rack until it clicks into place. 7. Install the cable-management arm. For more information, see “Installing the cable-management arm.” Installing the cable-management arm You might need to install the cable-management arm. Use this procedure to perform this task. To install the cable-management arm, complete the following steps: 1. Install the cable-management arm on the right side of the server, when you are facing it from the rear. Connect one end of the support arm (1) to the same slide rail to which you plan to attach the cable-management arm so that you can swing the other end of the support arm (2) toward the rack. Figure 25. Slide rails extended and server nail heads aligned with slots in rail Figure 26. Release latches and server 32 Installing into a rack 2. Install the L-shaped cable-management stop bracket (1) on the unattached end of the support arm. Turn the bracket (2) to secure it to the support arm. 3. To attach the other side of the support arm to the back side of the slide rail, pull the pin out (1), and then slide the bracket (2) into the slide rail. 4. Pull out the mounting bracket pin (1) and slide the mounting bracket (2) into the slide rail onto which you are installing the cable-management arm. Push the bracket into the slide rail until the Figure 27. Support arm connection Figure 28. Cable-management stop bracket secured to the support arm Figure 29. Pin extended and bracket installed into slide rail Installing a system or expansion unit into a rack 33 spring-loaded pin snaps into place. 5. Place the cable-management arm on the support arm. Pull out the cable-management arm pin (1), and then slide the cable-management arm tab (2) into the slot on the inside of the slide rail. Push the tab until it snaps into place. Pull out the other cable-management arm pin (3), and then slide that cable management arm tab into the slot (4) on the outside of the slide rail. Push the tab until it snaps into place. 6. Route the power cords and other cables (including keyboard, monitor, and mouse cables, if required) on the cable-management arm (1). Attach all cables to the rear of the server except for the power cord. Secure the cord and cables with cable ties or hook-and-loop fasteners. Note: Allow slack in all cables to avoid tension in the cables while the cable-management arm moves. Figure 30. Mounting bracket pin extended and mounting bracket installed into slide rail Figure 31. Cable-management arm connection 34 Installing into a rack 7. Slide the server into the rack until it snaps into place. Connecting the power cables to the system You might need to connect power cables to the system. Use this procedure to perform this task. To connect power cables to the system, follow these steps: 1. While facing the rear of the system unit, insert the system power cord up through the horizontal cable-restraint bracket. 2. Plug the power cord into the system unit. Note: If the system is equipped with two power supplies, each must have its own power cord plugged in. 3. Plug the system power cord and the power cords for any other attached devices into the alternating current (AC) power source. 4. Power on the system unit and any devices connected to the system unit. For instructions about how to connect and configure the expansion unit after installation, see Enclosures and expansion units. Installing the model 8231-E2B into a rack You might need to install the system into a rack. Use the procedure in this section to perform this task. This is a customer task. You can perform this task yourself, or contact a service provider to perform the task for you. You might be charged a fee by the service provider for this service. Important: To complete this procedure, use two people when attaching the rail assembly to the rack, one in front of the rack and one at the rear of the rack. Use three people when lifting the system unit into the rack. This procedure assumes that you are installing the system into an existing rack. If the rack is not installed, go to the instructions for “Installing the rack” on page 5 and then return to this procedure for instructions on installing the system unit in a rack. Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not attempt to install a stand-alone model in a rack. To install the model 8231-E2B into a rack, complete the following steps: Figure 32. Power cord attachment and routing Installing a system or expansion unit into a rack 35 CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. 1. Read the “Rack safety notices” on page 94. 2. Complete a parts inventory. The following figure shows the items that you need to install the server in the rack cabinet. If there are incorrect, missing, or damaged parts, contact: v Your IBM reseller v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only) v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your service and support telephone numbers.) 3. Locate the rack-mounting hardware kit (A) and the rack rails (B) that were included with your system unit as shown in Figure 33. The system rails (B) are front-to-back and left-to-right side dependent. The rails are labeled left and right to indicate their placement when you face the front of the rack. The back of each rail has two large latch assemblies. These latch assemblies go in the back of the rack. 4. Determine where you will locate the system unit in the rack. See “Determining the location.” 5. Attach mounting hardware to the rack. For more information, see Attach the mounting hardware to the rack. Determining the location You might need to determine where to install the system in the rack. Use this procedure to perform this task. Before installing the system unit into a rack, complete the following steps: 1. Read the “Rack safety notices” on page 94. 2. Plan where to place the units. Place the larger and heavier units in the lower part of the rack. Figure 33. Parts inventory 36 Installing into a rack This system unit is two Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75 in.) in height. The rack contains three mounting holes for each EIA unit of height. This system unit therefore is 88.9 mm (3.5 in.) high and covers six mounting holes in the rack. 3. If necessary, remove the filler panels to provide access to the inside of the rack enclosure where you plan to place the unit. 4. If necessary, remove the front and rear rack doors. Marking the location You might need to mark the installation location. Use this procedure to perform this task. To mark the installation location and install the nut clips into a rack, complete the following steps: 1. Determine where in the rack to place the system. Install units in the lower part of the rack first. Place larger and heavier units in the lower part of the rack. Record the EIA location. The system is two Electronic Industries Alliance (EIA) units high. An EIA unit is 44.45 mm (1.75 in.) in height. The rack contains three mounting holes for each EIA unit of height. This system therefore is 88.9 mm (3.5 in.) high and covers six mounting holes in the rack. Figure 34. Removing the filler panels Installing a system or expansion unit into a rack 37 2. Facing the front of the rack and working from the right side, locate the bottom EIA unit that your expansion unit uses. Record the EIA location. Use tape, a marker, or a pencil to mark the bottom hole of this EIA unit (A). Mark the rack so that the mark can also be seen from the rear of the rack. 3. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B). Mark the rack so that the mark can also be seen from the rear of the rack, as shown in Figure 35. 4. Repeat the previous two steps on the left side of the rack. 5. Facing the rear of the rack and working from the right side, locate the bottom EIA unit that your expansion unit uses. Make a mark next to the bottom hole of this EIA unit (A). 6. Begin with the hole identified by the mark (A). Count up one hole and place a second mark (B), as shown in Figure 35. 7. Begin with the hole identified by the mark (B). Count up one hole and place a third mark (C), as shown in Figure 35. 8. Repeat the previous two steps on the left side of the rack. Attaching the 8231-E2B mounting hardware to the rack You might need to attach the mounting hardware to the rack. Use the procedure in this section to perform this task. Attention: To avoid rail failure and potential danger to yourself and to the unit, ensure that you have the correct rails and fittings for your rack. If your rack has square support flange holes or screw-thread support flange holes, ensure that the rails and fittings match the support flange holes used on your rack. Do not install mismatched hardware using washers or spacers. If you do not have the correct rails and fittings for your rack, contact your IBM reseller. Also, to install the rails correctly, perform each task in the following order. To install the rack-mounting hardware into the rack, complete the following steps: 1. Each slide rail is marked with either an R (right) or an L (left). Select one of the slide rails and push up on the front movable tab (1). Pull out the front latch (2) to slide out the front side rail. If a thumbscrew is installed in the slide rail (3), remove it. Figure 35. Marking holes on the front and rear of the rack frame 38 Installing into a rack Note: Make sure that the movable tab remains extended and does not click back into place. 2. Align the three pins on the rear of the slide rail with the three holes in the selected EIA unit on the rear of the rack. Push the rails so that the pins go into the holes (1), and drop the slide rail down (2) until it latches into place. 3. Pull the slide rail forward and insert the two pins (1) on the front of the rail into the two lower holes in the U on the front of the rack. Drop the rail into place until it clicks into place. Push the front latch (2) in all the way. 4. Repeat the previous three steps to install the other rail into the rack. Make sure that each front latch is fully seated. Figure 36. Slide rail and movable tab Figure 37. Align the pins with the holes in the rear of the rack Figure 38. Rack front rail and pins Installing a system or expansion unit into a rack 39 5. Pull the slide rails forward (1) until they click, twice, into place. Carefully lift the server and tilt it into position over the slide rails so that the rear nail heads (2) on the server line up with the rear slots (3) on the slide rails. Slide the server down until the rear nail heads slip into the two rear slots, and then slowly lower the front of the server (4) until the other nail heads slip into the other slots on the slide rails. Make sure that the front latch (5) slides over the nail heads. 6. Lift the blue release latches (1) on the slide rails and push the server (2) all the way into the rack until it clicks into place. 7. Install the cable-management arm. For more information, see “Installing the cable-management arm.” Installing the cable-management arm You might need to install the cable-management arm. Use this procedure to perform this task. To install the cable-management arm, complete the following steps: 1. Install the cable-management arm on the right side of the server, when you are facing it from the rear. Connect one end of the support arm (1) to the same slide rail to which you plan to attach the cable-management arm so that you can swing the other end of the support arm (2) toward the rack. Figure 39. Slide rails extended and server nail heads aligned with slots in rail Figure 40. Release latches and server 40 Installing into a rack 2. Install the L-shaped cable-management stop bracket (1) on the unattached end of the support arm. Turn the bracket (2) to secure it to the support arm. 3. To attach the other side of the support arm to the back side of the slide rail, pull the pin out (1), and then slide the bracket (2) into the slide rail. 4. Pull out the mounting bracket pin (1) and slide the mounting bracket (2) into the slide rail onto which you are installing the cable-management arm. Push the bracket into the slide rail until the Figure 41. Support arm connection Figure 42. Cable-management stop bracket secured to the support arm Figure 43. Pin extended and bracket installed into slide rail Installing a system or expansion unit into a rack 41 spring-loaded pin snaps into place. 5. Place the cable-management arm on the support arm. Pull out the cable-management arm pin (1), and then slide the cable-management arm tab (2) into the slot on the inside of the slide rail. Push the tab until it snaps into place. Pull out the other cable-management arm pin (3), and then slide that cable management arm tab into the slot (4) on the outside of the slide rail. Push the tab until it snaps into place. 6. Route the power cords and other cables (including keyboard, monitor, and mouse cables, if required) on the cable-management arm (1). Attach all cables to the rear of the server except for the power cord. Secure the cord and cables with cable ties or hook-and-loop fasteners. Note: Allow slack in all cables to avoid tension in the cables while the cable-management arm moves. Figure 44. Mounting bracket pin extended and mounting bracket installed into slide rail Figure 45. Cable-management arm connection 42 Installing into a rack 7. Slide the server into the rack until it snaps into place. Connecting the power cables to the system You might need to connect power cables to the system. Use this procedure to perform this task. To connect power cables to the system, follow these steps: 1. While facing the rear of the system unit, insert the system power cord up through the horizontal cable-restraint bracket. 2. Plug the power cord into the system unit. Note: If the system is equipped with two power supplies, each must have its own power cord plugged in. 3. Plug the system power cord and the power cords for any other attached devices into the alternating current (AC) power source. 4. Power on the system unit and any devices connected to the system unit. For instructions about how to connect and configure the expansion unit after installation, see Enclosures and expansion units. Installing the model 8233-E8B or 8236-E8C into a rack You might need to install the system into a rack. Use the procedure in this section to perform this task. In addition to information intended to promote safety and reliable operation, this section also includes illustrations of the related hardware components and shows how these components relate to each other. This is a customer task. You can perform this task yourself, or contact a service provider to perform the task for you. You might be charged a fee by the service provider for this service. Important: To complete this procedure, it is suggested that you use two people to attach the rail assembly to the rack, one in front of the rack and one at the back of the rack. You will need three people to lift the system unit onto the rack. This procedure assumes that you are installing the system into an existing rack. If the rack is not installed, go to the instructions for “Installing the rack” on page 5 and then return to this procedure for instructions on installing the system unit in a rack. Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not attempt to install a stand-alone model in a rack. Figure 46. Power cord attachment and routing Installing a system or expansion unit into a rack 43 To install the model 8233-E8B or 8236-E8C into a rack, complete the following steps: CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. 1. Read the “Rack safety notices” on page 94. 2. If you have not already done so, refer to Figure 47 and complete a parts inventory. v A Rack-mounting hardware kit – Four large retaining screws – Two medium-sized screws – Two nut clips – Eight small retaining screws – Two rail support brackets – Two cable-restraint brackets v B Left and right rack rails with rack brackets v C Cable-management arm pins v D Two cable-management arm brackets (one for left-side installation, one for right-side installation) v E Cable-management arm Figure 47. Rack-mounting kit 44 Installing into a rack v F Two rack latches If there are incorrect, missing, or damaged parts, contact: v Your IBM reseller v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only) v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your service and support telephone numbers.) 3. Locate the rack-mounting hardware kit (A), and the rack rails (B) that were included with your system unit as shown in Figure 47 on page 44. The system rails (B) are front-to-back and left-to-right side dependent. The rails are labeled left and right to indicate their placement when you face the front of the rack. The back of each rail has two large latch assemblies. These latch assemblies go in the back of the rack. 4. Determine where you will locate the system unit in the rack. See “Determining the location.” 5. Next, you must attach mounting hardware to the rack. For more information, see Attach the mounting hardware to the rack. Determining the location You might need to determine where to install the system in the rack. Use this procedure to perform this task. Before installing the system unit into a rack, complete the following steps: 1. Read the “Rack safety notices” on page 94. 2. Plan where to place the units. Place the larger and heavier units in the lower part of the rack. This system unit is four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.45 mm) in height. The rack contains three mounting holes for each EIA unit of height. This system unit therefore is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack. 3. If necessary, remove the filler panels to allow access to the inside of the rack enclosure where you plan to place the unit. Installing a system or expansion unit into a rack 45 4. If necessary, remove the front and back rack doors. Marking the location You might need to mark the installation location. Use this procedure to perform this task. To mark the installation location and install the nut clips into a rack, complete the following steps: 1. Determine where in the rack to place the system. Install units in the lower part of the rack first. Place larger and heavier units in the lower part of the rack. Record the EIA location. The system is four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.45 mm) in height. The rack contains three mounting holes for each EIA unit of height. This system therefore is 7 in (177.8 mm) high and covers 12 mounting holes in the rack. 2. Face the front of the rack and work from the right side. Use tape, a marker, or a pencil to mark the holes. Mark the middle hole A of the bottom EIA unit of the four EIA units that you are using for this system unit. You need to identify the marked hole from both the front and back of the rack. Figure 48. Removing the filler panels 46 Installing into a rack 3. Mark the middle hole of the bottom EIA unit on the left side of the rack (A) as shown in Figure 49. 4. Go to the back of the rack. On the right side, find the EIA unit that corresponds to the bottom EIA unit marked on the front of the rack. 5. Mark the middle hole of the bottom EIA unit (B) as shown in Figure 49. 6. Mark the bottom hole of the next (higher) EIA unit (C) as shown in Figure 49. 7. Mark the corresponding holes on the left side of the rack. Attaching the 8233-E8B or 8236-E8C mounting hardware to the rack You might need to attach the mounting hardware to the rack. Use the procedure in this section to perform this task. The information provided is intended to promote safety and reliable operation. This section also includes illustrations of the related hardware components and shows how these components relate to each other. Attention: To avoid rail failure and potential danger to yourself and to the unit, ensure that you have the correct rails and fittings for your rack. If your rack has square support flange holes or screw-thread support flange holes, ensure that the rails and fittings match the support flange holes used on your rack. Do not install mismatched hardware using washers or spacers. If you do not have the correct rails and fittings for your rack, contact your IBM reseller. Also, to install the rails correctly, perform each task in the following order. To install the rack-mounting hardware into the rack, complete the following steps: 1. With the right rail, twist the latch assembly release tab (A) in Figure 50 on page 48. Slide release tab (B) back to the retracted position and lock the latch assembly. The back-alignment pins must be fully retracted. 2. After the alignment pins are retracted, insert the right side rail front-alignment pin, as shown in Figure 50 on page 48, into the rack front flange hole. This hole is identified by the mark that you previously made. Have a second person hold the rail securely in the front hole. Figure 49. Marking holes on the front and back of the rack frame Installing a system or expansion unit into a rack 47 3. Align the back-alignment pins of the rail with the holes at the back of the rack. The holes are identified by the marks on the back of the rack. Ensure that the rails are level. 4. Slide the release tab (B) to extend the two back-alignment pins into the back of the rack. Ensure that the pins have passed through the correct holes in the rack frame. 5. From the back of the rack, as shown in Figure 51 on page 49, install the rail support bracket on the alignment pins. Then insert a large rail-retaining screw into the hole located between the two back alignment pins. Partially tighten the screw. Do not tighten completely at this time. Note: The brackets are labeled LEFT and RIGHT. Figure 50. Front slide rail alignment pin, retaining screws, and latch bracket 48 Installing into a rack 6. Repeat steps 1 on page 47 through 5 on page 48 for the left side rail. 7. Locate the two latch brackets (F) in Figure 47 on page 44. To install the rack latches, use the following procedure: a. Rotate the top of the bracket out as shown in Figure 52 on page 50. Figure 51. Installing the rail support brackets Installing a system or expansion unit into a rack 49 b. Align the bracket with the slot on the side of the system as shown in Figure 53 on page 51. Figure 52. Rotate out the top of the bracket 50 Installing into a rack c. Move the bracket up to engage the tab as shown in Figure 54. d. Rotate the top of the bracket back so that the top tab is on top of the system. Then install the screws as shown in Figure 55 on page 52. Figure 53. Align the bracket with the slot on the side of the system Figure 54. Move the bracket up to engage the tab Installing a system or expansion unit into a rack 51 8. Extend the inner rails by pulling out the rails. They must be extended from the frame like the rails shown in Figure 56. 9. Using three people, grasp the two handles located on each side of the system, and place the system onto the inner rail. Align tab A with slot B as indicated by the vertical dashed line shown in Figure 56. 10. Simultaneously press the safety latches and push the system into the rack until it locks as shown in Figure 57 on page 53. Figure 55. Rotate the top of the bracket back so that the top tab is on top of the system Figure 56. Place system onto the rails 52 Installing into a rack 11. Install and position the nut clips before inserting the screws. Use the retaining screws to attach the system to the rack as shown in Figure 58 on page 54. Figure 57. Inner rail extended Installing a system or expansion unit into a rack 53 12. After both rails have been installed, ensure that none of the rail retaining screws are more than finger tight. The rails must be level from front to back and from left to right. 13. Next, you must install the cable-management arm. For more information, see “Installing the cable-management arm.” Installing the cable-management arm You might need to install the cable-management arm. Use this procedure to perform this task. To install the cable-management arm, complete the following steps: 1. Determine on which side of the rack you want to install the cable-management arm. 2. Place the correct arm bracket (Left or Right) with the cable-management arm. 3. Use the pin (F) to pin the cable-management arm (E) to the rack frame (D) as shown in Figure 59 on page 55. Figure 58. Secure system to rack through rack latches 54 Installing into a rack Tip: If access to the back of the rack is obscured by many existing cables, it might be easier to remove the small connecting hinge from the cable-management arm and attach it first. Then, you can attach the remaining section of the cable-management arm to the connecting hinge. 4. Use the second pin (F) to pin the other end of the cable-management arm to the flange (C) that is attached to the sliding portion of the left system rail assembly (B) as shown in Figure 59. Connecting the power cables to the system You might need to connect power cables to the system. Use this procedure to perform this task. To connect power cables to the system, follow these steps: 1. While facing the rear of the system unit, note the bottom tab of the cable-restraint bracket. Turn the cable-restraint bracket counter-clockwise to a horizontal position, and insert the bottom tab of the cable-restraint bracket into the bottom hole in the rail. Then rotate the cable-restraint bracket clockwise to the vertical position as shown in Figure 60 on page 56. Insert the system power cord through the cable-restraint bracket as shown in this figure. Then press down on the top of the bracket and insert the top tab into the top hole. Note: There are two sets of holes on the rail where the cable-restraint bracket can be installed. These sets of holes are about 25.4 mm (1 inch) apart. Install the bracket in the rear most set of holes. Figure 59. Attaching the cable-management arm. Installing a system or expansion unit into a rack 55 2. Plug the power cord into the system unit. Note: If the system is equipped with two power supplies, each must have its own power cord plugged in. 3. Plug the system power cord and the power cords for any printers or displays into the alternating current (AC) power source. 4. Power on the system unit and any devices connected to the system unit. For instructions on how to connect and configure the expansion unit after installation, see Enclosures and expansion units. Installing the model 5802 or 5877 into a rack This procedure can be used to install the 5802 or 5877 disk drive enclosure into a rack. The procedure includes information intended to promote safety and reliable operation, and provides illustrations on how hardware components relate to each other. v This procedure assumes that you are installing the 5796 or 7314-G30 expansion unit into an existing rack. If the rack is not installed, you must perform the procedure for “Installing the rack” on page 5 before performing this procedure to install the expansion unit into the rack. Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not attempt to install a stand-alone model in a rack. v Read the “Rack safety notices” on page 94. v For instructions related to installing, removing, and replacing a disk drive, disk-drive backplane, or enclosure, see Disk drives. To install the model 5802 or 5877 into a rack, complete these steps: Figure 60. Routing system power cord 56 Installing into a rack CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. Attention: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in order. Failure to do so might cause rail failure and potential danger to yourself and the expansion unit. Also, note that for the 5802 and 5877, three people are required to safely lift the system. Using fewer than three people to lift the system can result in injury. 1. Refer to the kitting list provided with the rack installation kit, and complete a parts inventory, if you have not done so. 2. Locate the rack-mounting hardware kit and the system rail assemblies that were included with your expansion unit. 3. Determine where to locate the expansion unit in the rack. See “Determining the location.” This expansion unit is four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in (44.45 mm) in height. The rack contains three mounting holes for each EIA unit of height. 4. Mark the location. See “Marking the location” on page 58. Determining the location You might need to determine where to install an expansion unit in the rack. Use this procedure to perform this task. Before installing the expansion unit into a rack, complete the following steps: 1. Determine where in the rack to place the expansion unit in relation to other system hardware. As you prepare, organize larger and heavier units into the lower part of the rack, and plan to install units into the lower part of the rack first. Record the EIA locations in your plan. Note: These expansion units are four Electronic Industries Alliance (EIA) units high. An EIA unit is 44.50 mm (1.75 in.) in height. The rack contains three mounting holes for each EIA unit of height. This system unit, therefore, is 177.8 mm (7 in.) high and covers 12 mounting holes in the rack. 2. If necessary, remove the filler panels to allow access to the inside of the rack enclosure where you plan to place the expansion unit. Installing a system or expansion unit into a rack 57 3. If necessary, remove the front and back rack doors. 4. Mark the location where you intend to install the disk drive enclosure. a. If you have a rack-mounting template, use it to mark the location. b. If you do not have a rack-mounting template, see “Marking the location.” Marking the location You might need to mark the installation location. Use the procedure in this section to perform this task. To mark the installation location and the position for installing the nut clips into a rack, complete these steps: 1. Determine where in the rack to place the expansion unit in relation to other system hardware. As you plan, keep the following in mind: a. Organize larger and heavier units into the lower part of the rack. b. Plan to install units into the lower part of the rack first. c. Record the EIA locations in your plan. Figure 61. Removing the filler panels 58 Installing into a rack Note: These expansion units are four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of height. This system unit, therefore, is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack. 2. Face the front of the rack and work from the left side. Make a note of both the lowest and highest EIA units to be used for the expansion unit. Use tape, a marker, or pencil to mark the bottom mounting hole (A) of the highest EIA unit. Then mark the top mounting hole (B) of the lowest EIA unit. Mark the rack so that the mark can also be seen from the rear of the rack. 3. Count up two mounting holes from where you placed the mark for mounting hole (B) in the last step, and place another mark next to that mounting hole (B). Mark the rack so that the mark can also be seen from the rear of the rack. You now have two (B) marks on the rack, with one mounting hole between the marks. 4. Repeat the last two steps to place three marks on the corresponding mounting holes on the front-right side of the rack. 5. Go to the back of the rack. On the left side, find the EIA unit that corresponds to the lowest EIA unit marked on the front of the rack. 6. Place a mark at the top hole (C) of the lowest EIA unit. Mark the rack so that the mark can be seen from the rear of the rack. 7. Count up two mounting holes from where you placed the mark in the last step, and place another mark next to that mounting hole (C). 8. Repeat the last two steps to place two marks on the corresponding mounting holes on the rear-right side of the rack. Attaching the mounting hardware to the rack You might need to attach mounting hardware to the rack. Use the procedure in this section to perform this task. This section also includes information intended to promote safety and reliable operation, and illustrations of the hardware components. CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. To install the mounting hardware into the rack, follow these steps: 1. Rotate down the front and back clamps (C) on the left and right rails. 2. On each rail, remove the tape holding the spring (B) to the rail. Hook the end of the spring around the circular standoff on the rail. Figure 62. Marking the installation locations Installing a system or expansion unit into a rack 59 3. Start at the front of the rack. Extend the right rail and insert the rail pins (D) into the marked locations in the rack so that the rail is facing upward. Rotate up the front clamp (C) to secure the rail in place. Notes: v If your rack has the square mounting holes, remove the rail pins (D) from the rail. Install the larger rail pins provided with the rack installation kit. v The rail installation can be completed by one person. However, the installation is easier if one person is positioned at the front of the rack and one person is positioned at the back of the rack. 4. Extend the left rail and insert the rail pins (D) into the marked locations. Rotate up the front clamp (C) to secure the rail in place. 5. Move to the back of the rack, extend the right rail, and insert the rail pins (D) into the marked locations. Rotate up the back clamp (C) to secure the rail in place. 6. Extend the left rail and insert the rail pins (D) into the marked locations. Rotate up the back clamp (C) to secure the rail in place. 7. At the back of the rack, use two M5 screws (E), one below each rail pin (D), to secure the left rail to the back of the rack. 8. Use two M5 screws (E), one below each rail pin (D), to secure the right rail to the back of the rack. 9. Move to the front of the rack. Use one M5 screw (E) below the top rail pin to secure the left rail to the front of the rack. Figure 63. Installing the rails 60 Installing into a rack 10. Use one M5 screw (E), below the top rail pin (D), to secure the right rail to the front of the rack. 11. Locate the marks previously made on the front Electronic Industries Alliance (EIA) strips of the rack cabinet. Each side contains one mark. Insert one nut clip (F) at the mark on the right side of the rack as shown in Figure 65 on page 62. 12. Insert a nut clip (F) at the mark on the left side of the rack as shown in Figure 65 on page 62. 13. From the back of the rack cabinet, locate the marks previously made on the rear Electronic Industries Alliance (EIA) strips. Each side contains two marks. Insert a nut clip (F) at each of the marks on the right side of the rack as shown in Figure 65 on page 62. Figure 64. Attaching the rails Installing a system or expansion unit into a rack 61 14. Use two M5 screws (H) to attach the retention bracket (G) to the rack on the right side. 15. Insert a nut clip (F) at each of the marks on the left side of the rack as shown in Figure 65. 16. Use two M5 screws (H) to attach the retention bracket (G) to the rack on the left side. Installing the 5802 or 5877 on the rail assembly You might need to install the expansion unit on the rail assembly. Use the procedure in this section to perform this task. In addition to related safety information, this section also includes illustrations of the related hardware components and shows how these components relate to each other. 1. Before installing the expansion unit on the rail assembly, ensure that the stabilizers are extended and the tip plate is attached to the bottom front of the rack to prevent the rack from falling forward when the rails are pulled out of the rack. Refer to “Rack safety notices” on page 94. To install the expansion unit on the rail assembly, follow these steps: 1. To remove the shipped cover from the back of the expansion unit, remove the four thumb screws that are used to secure it. Pull off the cover. Note: There are two thumb screws at the top of the cover and one on each side. 2. If the bezel is installed on the front of the system, remove it by completing the followingsteps: a. Remove the two M5 securing screws (A) as shown in Figure 66 on page 63. b. Remove the service information holder from the slot in the bezel. c. Using the two blue touch points on each side of the expansion unit, pull the bezel straight out to remove the bezel from the front of the expansion unit. d. Remove the two nut clips that were used to secure the bezel to the expansion unit from the EIA flange. Figure 65. Installing the retention bracket 62 Installing into a rack 3. Using three people, hold onto the four handles (I), as shown in the following figure, to lift the expansion unit and position it over the front of the rails. Attention: Three people are required to safely lift the expansion unit. Using fewer than three people to lift the expansion unit can result in injury. 4. Slide the expansion unit into the rack. 5. Install the two M5 securing screws (J) through the expansion unit and into the nut clips on the left and right rails. 6. Install the bezel (L) onto the expansion unit. Press the bezel into place and secure it with two M5 screws (M) on the left and right rails. Figure 66. Removing the service information holder and bezel Figure 67. Installing the expansion unit into a rack Installing a system or expansion unit into a rack 63 7. Insert the service information holder (N) into the slot in the bezel. 8. At the rear of the system, install the four securing screws (K) through the expansion unit and retention brackets on the left and right rails. For instructions on how to connect and configure the expansion unit after installation, see Enclosures and expansion units. Installing the model 5886 into a rack This procedure can be used to install the 5886 disk drive enclosure into a rack. The procedure includes information intended to promote safety and reliable operation, and provides illustrations on how hardware components relate to each other. Figure 68. Installing the bezel and service information holder Figure 69. Installing four screws at back of system 64 Installing into a rack v This procedure assumes that you are installing the 5886 disk drive enclosure into an existing rack. If the rack is not installed, you must perform the procedure for “Installing the rack” on page 5 before performing this procedure to install the disk drive enclosure into the rack. Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not attempt to install a stand-alone model in a rack. v Read the “Rack safety notices” on page 94. Installing the disk drive enclosure into a rack is a customer task. You can perform this task yourself, or contact a service provider to perform the task for you. You might be charged a fee by the service provider for this service. Important: To complete this procedure, it is suggested that you use two people to attach the rail assembly to the rack. Position one person in front of the rack and one person at the back of the rack. To install the model 5886 into a rack, complete these steps: CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. 1. Complete a parts inventory, if you have not done so. Note: For this installation you will need a 5 mm (3/16-in.) flat-blade screwdriver during the installation. A #2 Phillips screwdriver and an 8 mm wrench are optional. Here is a list of items that you need to install the unit in the rack cabinet: v Rail assembly, right (1) v Rail assembly, left (1) v M5 screws (8) v Small diameter spacers (8) (these come installed, four in each rail) v Large diameter spacers (8) Note: These are used with square mount racks only. v M4 pan-head screws (2) v 15 mm (0.60 in.) diameter washers (6) Note: These are used with square mount racks only. 2. Locate the rack-mounting hardware kit and the system rail assemblies that were included with your disk drive enclosure. The rails are front-to-back and left-to-right side dependent. 3. Determine where you will locate the disk drive enclosure in the rack. See “Determining the location.” Determining the location You might need to determine where to install the disk drive enclosure into the rack. Follow this procedure to perform this task. Read the “Rack safety notices” on page 94. Before installing the disk drive enclosure into a rack, complete the following steps: 1. Determine where in the rack to place the disk drive enclosure in relation to other system hardware. As you plan, keep the following in mind: a. Organize larger and heavier units into the lower part of the rack. Installing a system or expansion unit into a rack 65 b. Plan to install units into the lower part of the rack first. c. Record the EIA locations in your plan. Note: The 5886 disk drive enclosure is two Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of height. This system unit, therefore, is 3.5 in. (89 mm) high and covers six mounting holes in the rack. 2. If necessary, remove the filler panels to allow access to the inside of the rack enclosure where you plan to place the disk drive enclosure. 3. If necessary, remove the front and back rack doors. 4. Mark the location where you intend to install the disk drive enclosure. a. If you have a rack-mounting template, use it to mark the location. b. If you do not have a rack-mounting template, see “Marking the location” on page 67. Figure 70. Removing the filler panels 66 Installing into a rack Marking the location You might need to mark the location for rack installation. Follow these steps to perform this task. To mark the installation location and install the nut clips into a rack, complete the following steps: 1. Determine where in the rack to place the disk drive enclosure in relation to other system hardware. As you plan, keep the following in mind: a. Organize larger and heavier units into the lower part of the rack. b. Plan to install units into the lower part of the rack first. c. Record the EIA locations in your plan. Note: The 5886 disk drive enclosure is two Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of height. This system unit, therefore, is 3.5 in. (89 mm) high and covers six mounting holes in the rack. 2. Face the front of the rack and work from the right side of the rack. Mark the middle hole of the bottom EIA unit (A) as shown in the following figure. Identify the marked hole from both the front and back of the rack. 3. Mark the middle hole of the bottom EIA unit on the left side of the rack. 4. Go to the back of the rack. On the right side, find the EIA unit that corresponds to the bottom EIA unit marked on the front of the rack. Note: The following figure is an example of how the EIA units appear, you might have a different configuration or placement. 5. Mark the middle hole of the bottom EIA unit B. 6. Mark the bottom hole of the next (higher) EIA unit C. 7. Mark the corresponding holes on the left side of the rack. Attaching the mounting hardware to the rack You might need to attach mounting hardware to the rack. Follow these steps to perform this task. In addition to information intended to promote safety and reliable operation, this section also includes illustrations of the related hardware components and shows how these components relate to each other. CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. Figure 71. Marking holes on the front and back of the rack frame Installing a system or expansion unit into a rack 67 To install the mounting hardware into the rack, follow these steps: 1. Loosen the two rail-adjustment screws on one rail just enough to enable the rails to slide. 2. Locate the marks previously made on the Electronic Industries Alliance (EIA) strips at the front and back of the rack cabinet. Use the L or R imprinted on the front bracket of your first rail to align it with these marks within the matching side of the rack cabinet. The bottom of the rail support ledge should be slightly above the U mark on the rack flange. The rail comes with two spacers in the front bracket and two spacers in the rear bracket. If the rack has round holes, the spacers on the rail brackets fit into the round holes in the rack flanges. If the rack has square holes, replace the set of small spacers on the front and rear of the rail with the larger spacers that are included with the rail kit. 3. Secure the rail to the front of the rack cabinet with one M5 screw in the top hole of the bracket. For racks with square holes, add one washer between the M5 screw and the rail bracket. Figure 72. Extending the rails Figure 73. Setting the spacers in place 68 Installing into a rack 4. Secure the rail to the rear of the rack cabinet with two M5 screws in the top and bottom holes in the rear rail bracket. For racks with square holes, add one washer between each M5 screw and the rail bracket. 5. Tighten the rail-adjustment screws. 6. Repeat step 1 on page 68 through step 5for the other rail. Installing the 5886 on the rail assembly You might need to install the expansion unit on the rail assembly. Use the steps detailed in this procedure to perform this task. In addition to related safety information, illustrations of the related hardware components are provided. 1. Before installing the expansion unit on the rail assembly, ensure that the stabilizers are extended and the tip plate is attached to the bottom front of the rack to prevent the rack from falling forward when the rails are pulled out of the rack. Refer to “Rack safety notices” on page 94. To install the disk drive enclosure on the rail assembly, follow these steps: 1. To reduce the weight of the unit for easier installation in the rack cabinet, remove the power supplies. To remove a power supply, complete the following steps: a. On the left side of a power supply, press the orange release tab to the right just enough to release the handle (no more than 6.4 mm (0.24 in.)) as you rotate the handle downward. Figure 74. Attaching the rails Installing a system or expansion unit into a rack 69 b. Carefully slide the power supply out of the unit. c. Repeat this step for the other power supply, and then, set both units aside for later installation. 2. Using two people, lift the disk drive enclosure and position it over the front of the rails. Attention: Two people are required to safely lift the disk drive enclosure. Using fewer than two people to lift the disk drive enclosure can result in injury. 3. Slide the disk drive enclosure into the rack cabinet, making sure that it slides into the tabs on the rear of the rail. 4. Secure the front of the disk drive enclosure to the rack cabinet with one M5 screw in the open hole in each bracket (no washer is required). Figure 75. Removing power supplies Figure 76. Installing the enclosure on the rails 70 Installing into a rack 5. Secure the rear of the disk drive enclosure to each rail assembly with one M4 pan-head screw in the slotted hole on the outside of the storage enclosure chassis wall. 6. Reinstall the power supplies and hard disk drives. 7. Connect a power cord to each power supply. Use the power-cord guide for strain relief to prevent the power cord from disconnecting accidentally. 8. Install the left bezel that has the LEDs, and then install the right bezel. Fit the cutout on the top of the bezel over the tab on the chassis flange. Rotate the bezel down until it snaps into place. Make sure that the inside surface of the bezel is flush with the chassis. For instructions on how to connect and configure the expansion unit when you have completed installing the expansion unit into the rack, see Enclosures and expansion units. Installing the 5796 or 7314-G30 expansion unit in a rack This procedure can be used to install the 5796 or 7314-G30 expansion unit into a rack. The procedure includes information intended to promote safety and reliable operation, and provides illustrations on how hardware components relate to each other. Figure 77. Installing the securing screws Figure 78. Attaching the side bezels Installing a system or expansion unit into a rack 71 v This procedure assumes that you are installing the 5796 or 7314-G30 expansion unit into an existing rack. If the rack is not installed, you must perform the procedure for “Installing the rack” on page 5 before performing this procedure to install the expansion unit into the rack. Note: This procedure applies only to the models that are designed to be mounted in a rack. Do not attempt to install a stand-alone model in a rack. v Read the “Rack safety notices” on page 94. To install the model 5796 or 7314-G30 into a rack, complete these steps: CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. 1. Complete a parts inventory. v Locate the kitting report in an accessory box. v Ensure that you received all the parts that you ordered. 2. Ensure that there is an empty slot in an existing shell or that there is enough room in the rack to install a new shell. Powering off the system and removing power You might need to power off the system and remove power. Use the procedure in this section to perform this task. 1. Ensure that the system you are attaching to is running the latest level of firmware. For information on checking or upgrading the firmware level or your system, refer to the firmware updates section of your system's installation information. 2. After ensuring that your firmware is at the latest level, shut down and power off the system that you are attaching to. For information about shutting down and powering off the system, see the documentation provided with your expansion unit. 3. If you are installing an expansion unit into an existing shell, continue with “Installing the expansion unit” on page 77. Otherwise, refer to “Marking the location.” Marking the location You might need to mark the location. Use the procedure in this section to perform this task. To mark the installation location and install the nut clips into a rack, complete the following steps: 1. Determine where in the rack to place the expansion unit in relation to other system hardware. As you plan, keep the following in mind: a. Organize larger and heavier units into the lower part of the rack. b. Plan to install units into the lower part of the rack first. c. Record the EIA locations in your plan. Note: These expansion units are four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.50 mm) in height. The rack contains three mounting holes for each EIA unit of height. This system unit, therefore, is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack. The following illustration shows a 1-EIA unit and a 4-EIA unit. Depending on the rack manufacturer, the EIA units might be separated either by color or by a line. The holes along the EIA strip are not evenly spaced. If your rack has no color or line separation between EIAs, assume that each EIA section begins where the hole spacing is closest together. 72 Installing into a rack A Placement mark B Nut clip 2. Facing the front of the rack and working from the right side, locate the bottom EIA unit that your expansion unit uses. Make a note of the EIA location. Use tape, a marker, or a pencil to mark the rack next to the top hole of this EIA unit (A). Assure that the mark can be seen from both the front and rear of the rack. 3. Begin with the hole identified by your mark for (A). Count up two holes and make a mark (B) next to the second hole. Assure that the mark can be seen from both the front and rear of the rack. 4. Begin with the hole identified by your mark for (B). Count up two holes and place a nut clip. The nut clip (C) aids in securing your expansion unit to the rack. 5. Begin with the hole identified by your mark for (B). Count up six holes and place a nut clip. The nut clip (D) aids in securing your expansion unit to the rack. 6. Repeat the last four steps to make marks and place nut clips on the left side of the rack. 7. Facing the back of the rack and working from the right side, locate the bottom EIA unit that your expansion unit uses. Mark the rack next to the top hole of this EIA unit (E). 8. Begin with the hole identified by your mark for (E). Count up two holes and make a mark (F) next to the second hole. 9. Begin with the hole identified by your mark for (F). Count up four holes and make a mark (G) next to the fourth hole. 10. Repeat the last three steps to make marks on the left side of the rack. Installing the expandable rails and extension bracket into the rack You might need to install rails into the rack. Use the procedure in this section to perform this task. In addition to related safety information, this section also includes illustrations of the related hardware components and shows how these components relate to each other. Attention: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. To install the expandable rails into the rack, complete the following steps: 1. Loosen the two screws (B) in the middle of both rails (A) to allow each rail to extend. Figure 79. EIA units with expandable rails for the 5796 expansion unit Installing a system or expansion unit into a rack 73 2. Starting at the back of the rack, extend the left rail. Insert the rail pins (C) into the marked locations on the rack so that it is facing upward (L shape). 3. From the back of the rack, extend the right rail and insert the rail pins (C) into the marked locations. 4. Move to the front of the rack, extend the left rail and insert the rail pins (C) into the marked locations. 5. Extend the right rail and insert the rail pins (C) into the marked locations. 6. Using one M5 screw (D), secure the left rail to the front of rack into the lower location (E) Important: Do not install a screw in the top location at this time. It will secure the shell to the assembly. Figure 80. Extending the rails 74 Installing into a rack . 7. Using one M5 screw (D), secure the right rail to the front of the rack into the lower location (E). 8. Move to the back of the rack and locate the marks that were previously made on the back-left rail. Use three M5 screws (G) to secure the rail arm and bracket (F) to the rack at these locations. Note: The bracket will face inward. Figure 81. Attaching the rails to the front of the rack Installing a system or expansion unit into a rack 75 9. Locate the the marks that were previously made on the back-right rail of the rack. Use three M5 screws (G) to secure the rail arm and bracket (F) to the rack at these locations. 10. Tighten the two screws (B) in the middle of the left rail and the two screws (B) in the middle of the right rail. Refer to Figure 80 on page 74. Installing the shell You might need to install the shell. Use the procedure in this section to perform this task. This section also includes illustrations of the related hardware components and shows how these components relate to each other. Note: Before installing the shell, ensure that the rails are level from side to side and from front to back. To install the shell, complete the following steps: 1. From the front of the rack, place the shell (A) on the rails and slide it into the rack. Figure 82. Attaching the rails to the back of the rack 76 Installing into a rack A Shell B Retaining screws (four) 2. Install and tighten the four retaining screws (B) into the front of the shell, securing the shell (A) to the rack. Installing the expansion unit You might need to install the expansion unit into the rack. Use the procedure in this section to perform this task. This section also includes illustrations of the related hardware components and shows how these components relate to each other. To install the expansion unit into a rack, complete the following steps: 1. Remove the front and back shipping covers of the 5796 or 7314-G30. These shipping covers are secured by four thumb screws each. Two screws are located on the top of the cover and two are located on the sides of the cover. 2. If you are installing a expansion unit (H) into an existing shell (G) that has a filler plate (F), remove the filler plate by pressing down the top tabs and pulling it out. Figure 83. Installing the shell into the rack on the expandable rails Installing a system or expansion unit into a rack 77 F Filler plate G Shell H Drawer I Thumb screw J Cable retention bracket 3. Slide the expansion unit (H) into the shell. 4. If you are installing a 5796, use a thumb screw (I) to install the cable retention bracket (J). 5. If you have two expansion units to install, repeat the last four steps to install the second expansion unit. 6. If there is only one expansion unit installed in the shell, install the filler plate (N) in the blank slot in the shell by sliding the filler plate up under the top tab of the shell. Gently press the filler plate into place. Figure 84. Removing the filler plate Figure 85. Removing the filler plate 78 Installing into a rack N Filler plate 7. Attach the front cover. See http://publib.boulder.ibm.com/infocenter/systems/scope/hw/topic/ p7haj/frontcoverin7314g30.htm (http://publib.boulder.ibm.com/infocenter/systems/scope/hw/topic/ p7haj/frontcoverin7314g30.htm). Attaching the power controller (SPCN), and power cables You might need to attach the System Power Control Network (SPCN) and power cables to the expansion unit. Use the procedure in this section to perform this task. For instructions on cabling and connecting your expansion units, see Enclosures and expansion units. Powering on the system You might need to power on the expansion unit. For information about powering on the expansion unit, refer to Start the system or logical partition. Installing the model 5786 or 7031-D24 into a rack You might need to install the expansion unit into a rack. This section includes procedures so that you can perform these tasks. This procedure assumes that you are installing the expansion unit into an existing rack. If the rack is not installed, follow the instructions in “Installing the rack” on page 5. Return to this procedure for instructions on installing the expansion unit into the rack. When you have installed the expansion unit into the rack, you will need the instructions in Disk drives, to connect and configure the 7031-D24 or 5786. Note: This procedure applies only to the models that are designed to be mounted into a rack. To install the expansion unit into a rack, perform the following steps: Figure 86. Installing the filler plate Installing a system or expansion unit into a rack 79 Attention: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in order. Failure to do so might cause rail failure and potential danger to yourself and the expansion unit. 1. Read the “Rack safety notices” on page 94. Failure to read these notices could cause damage to the equipment or injury to the installer. 2. Complete a parts inventory (See “Completing a parts inventory”). 3. Locate the rack-mounting hardware kit (A) and the system rail assemblies (B) that were included with your expansion unit. See Figure 87 for the relative locations of these items. The system rails are identical. Note: An 8 mm nut driver and a flat-blade screwdriver, represented in the following diagram, are not listed in the inventory or included in the parts shipped. These tools assist you in completing the installation. 4. Determine where you will locate the expansion unit in the rack. See “Determining the location” on page 81. This expansion unit is 4 Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in (44.45 mm) in height. The rack contains three mounting holes for each EIA unit of height. 5. To mark the location, see “Marking the location” on page 82. Completing a parts inventory You might need to complete a parts inventory. Use the procedure in this section to perform this task. If you have not done so, complete a parts inventory before proceeding with the installation: 1. Locate the kitting report in an accessory box. 2. Ensure that you received all the parts that were ordered. If there are incorrect, missing or damaged parts, contact: v Your IBM reseller Figure 87. Parts inventory 80 Installing into a rack v IBM Rochester Manufacturing Automated Information Line at 1-800-300-8751 (United States only) v Directory of worldwide contacts website at http://www.ibm.com/planetwide (Locate your service and support telephone numbers.) Determining the location You might need to determine where to install an expansion unit in the rack. Use this procedure to perform this task. Before installing the expansion unit into a rack, complete the following steps: 1. Read the “Rack safety notices” on page 94. 2. Plan where you will place the units. Place the larger and heavier units in the lower part of the rack. These expansion units are four Electronic Industries Alliance (EIA) units high. The rack contains three mounting holes for each EIA unit of height. Each of these units therefore is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack. 3. Remove the filler panels to allow access to the inside of the rack enclosure where you plan to place the unit. Installing a system or expansion unit into a rack 81 4. Remove the front and back rack doors if necessary. 5. To mark the location, see “Marking the location.” Marking the location You might need to mark the location for the rack installation. Use the procedure in this section to perform this task. To mark the installation location and install the nut clips into a rack, complete these steps: 1. Determine where in the rack to place the unit and make note of the location numbers. This unit is four Electronic Industries Alliance (EIA) units high. An EIA unit is 1.75 in. (44.45 mm) in height. The rack contains three mounting holes for each EIA unit of height. This unit therefore is 7 in. (177.8 mm) high and covers 12 mounting holes in the rack. 2. Facing the front of the rack and working from the left side, locate the lowest EIA unit that your unit uses. Make a note of the EIA location. Use tape, a marker, or a pencil to mark the top mounting hole (B) of this EIA unit. Mark the rack so that the mark can be seen from the rear of the rack. 3. On the left rail, count up two mounting holes from where you placed the mark in step 2 and place another mark next to that mounting hole (B). Mark the rack so that the mark can be seen from the rear of the rack. You now have two marks on the rack, with one mounting hole between the marks. 4. Repeat steps 2 and 3 to place two marks on the corresponding mounting holes on the front-right rail. 5. Go to the back of the rack. On the left side, find the EIA unit that corresponds to the lowest EIA unit marked on the front of the rack. 6. Place a mark at the top hole (C) of the lowest EIA unit. Mark the rack so that the mark can be seen from the rear of the rack. 7. Count up two mounting holes from where you placed the mark in step 6 and place another mark next to that mounting hole (C). 8. Repeat steps 6 and 7 to place two marks on the corresponding mounting holes on the rear-right rail. Attaching the mounting hardware to the rack You might need to attach mounting hardware to the rack. Use the procedure in this section to perform this task. In addition to information intended to promote safety and reliable operation, this section also includes illustrations of the related hardware components and shows how these components relate to each other. Figure 88. Removing the filler panels Figure 89. Marking the installation locations 82 Installing into a rack CAUTION: Installing the rails in the rack is a complex procedure. To install the rails correctly, you must perform each task in the following order. Failure to do so might cause rail failure and potential danger to yourself and the system unit. To install the mounting hardware into the rack, follow these steps: 1. Loosen the two screws (B) in the middle of both rails (A) to allow each rail to extend. 2. Starting at the back of the rack, extend the left rail and insert the rail pins (C) into the marked locations the rack so that the rail is facing upward. Note: Although the rail installation can be completed by one person, the installation will be easier if one person is positioned at the front of the rack and one person is positioned at the back of the rack. 3. From the back of the rack, extend the right rail and insert the rail pins (C) into the marked locations. 4. Moving to the front of the rack, extend the left rail and insert the rail pins (C) into the marked locations. 5. Extend the right rail and insert the rail pins (C) into the marked locations. 6. Using two M5 screws (D), secure the left rail to the front of rack below each rail pin (C). Figure 90. Extending the rails Installing a system or expansion unit into a rack 83 7. Using two M5 screws (D), secure the right rail to the front of the rack. 8. Moving to the back of the rack and using two M5 screws (D), secure the left rail to the back of the rack. 9. Using two M5 screws (D), secure the right rail to the back of the rack. 10. Tighten the two screws in the middle of the left rail and the two screws in the middle of the right rail. Refer to Figure 91. 11. Insert two nut clips at the marked location on the front-left rail. 12. Insert two nut clips at the marked locations on the front-right rail. Installing the 5786 or 7031-D24 on the rail assembly You might need to install the expansion unit on the rail assembly. Use the procedure in this section to perform this task. In addition to related safety information, this section also includes illustrations of the related hardware components and shows how these components relate to each other. Before installing the expansion unit on the rail assembly, ensure that the stabilizers are extended and the tip plate is attached to the bottom front of the rack to prevent the rack from falling forward when the rails are pulled out of the rack. Refer to “Rack safety notices” on page 94. To install the expansion unit on the rail assembly, follow these steps: 1. If the bezel is installed on the front of the expansion unit, remove it by pressing in on the two release buttons. Figure 91. Attaching the system rails 84 Installing into a rack 2. Using three people, lift the expansion unit and position it over the front of the rails. Attention: Three people are required to safely lift the expansion unit. Using fewer than three people to lift the expansion unit can result in injury. 3. Push the expansion unit into the rack until it is completely installed into the rack. 4. Determine whether the securing plates are preinstalled on the back of the expansion unit. If not, attach the plates to the back left and back right of the expansion unit. The securing plates are illustrated in Figure 94 on page 86. 5. Install the two M5 securing screws A through the chassis and into the nut clips (B) on the left and right rail. Figure 92. Removing the bezel Figure 93. Installing the securing screws Installing a system or expansion unit into a rack 85 6. At the back of the rack, install two M5 securing screws (C) through the back of both the left and right securing plates and into the rack rail. 7. Install the bezel onto the expansion unit. For instructions on how to connect and configure the expansion unit when you have completed installing the 5786 or 7031-D24 into the rack, see Enclosures and expansion units. Figure 94. Securing the expansion unit to the back of the rack Figure 95. Installing the bezel 86 Installing into a rack Attaching the cable-management arm to the standard rails You might need to attach the cable-management arm. Use the procedure in this section to perform this task. If standard (fixed length) rails were included with the expansion unit, and the cable-management arm is not attached to the rails, complete the following steps. Otherwise, go to “Determining the location” on page 81. 1. Insert the cable-management arm rail tab (B) into the rail. 2. Rotate the cable-management arm (A) as shown in the following illustration. 3. Secure the cable-management arm (A) with the retaining screw (C). 4. Repeat steps 1 through 3 for the other rail. Figure 96. Rotating the cable-management arm Installing a system or expansion unit into a rack 87 88 Installing into a rack Removing and replacing 7014-T00 or 7014-T42 side panels You might need to remove and replace a side panel on a rack. Use the procedure in this section to perform this task. The rack might have optional side panels. To remove and replace a side panel, complete the following procedure: 1. Unlock the side panels by pressing down on both locking latches to release the latches. Note: If your rack uses a ruggedized kit, you must remove the securing screw to allow each side panel to be removed. Refer to “Releasing the side panel with a ruggedized kit” on page 102. 2. Tilt the top of the side panel slightly toward you. 3. Lift the side panel away from the ridge on the bottom of the rack. 4. Repeat this procedure for the other side panel. Replacing a 7014-T00 or 7014-T42 side panel You might need to replace a side panel on a rack. Use the procedure in this section to perform this task. To replace a side panel, complete the following procedure: 1. Tilt the top of the side panel slightly toward you. Figure 97. Removing the side panel © Copyright IBM Corp. 2010 89 2. Place the bottom of the side panel onto the ridge on the bottom of the rack. 3. Slide the top of the side panel into place and close the locking latches. Note: If your rack uses ruggedized kit, you must install a securing screw into each side panel that was installed. Refer to “Ruggedized kit” on page 101. Figure 98. Replacing the side panel 90 Installing into a rack Removing and replacing 7014-T00 or 7014-T42 trim panels Racks that are installed with multiple processor drawer systems can use front trim panels instead of doors. For racks that use trim panels, a reduced-interference panel type must be installed when certain expansion unit models are present. Use the procedure in this section to remove the existing trim panels from the rack and replace them with the reduced-interference panel type. To remove the existing rack trim panels, complete the following procedure: 1. Place both hands on the center of the right side trim panel. 2. Squeeze inward firmly with your fingertips to release the spring clips that hold the panel in place. 3. Rotate your hands in slightly until the panel is disengaged. 4. Lift the panel out and set it aside. 5. Repeat this procedure to remove the left side trim panel. Replacing the 7014-T00 or 7014-T42 trim panels For racks that use trim panels instead of doors, a reduced-interference panel type must be installed when certain expansion unit models are present. Use the procedure in this section to perform this task. To install the trim panels, complete the following procedure: Figure 99. Removing the rack trim panel © Copyright IBM Corp. 2010 91 1. Align the bottom plate (A) of the right side trim panel to the bottom of the rack 2. Align the top of the trim panel (B) and squeeze slightly with your fingertips. 3. After the trim panel is in the correct location, release the pressure to allow the spring clips to hold the panel in place. 4. Repeat this procedure to install the left side trim panel. Figure 100. Installing the rack trim panel 92 Installing into a rack Attaching the rack doors You might need to attach the rack doors. Use this procedure to perform this task. Depending on the model of the rack, the front door of a rack might be an optional feature. If your system already has the front door installed, or does not have a front door to install, skip this step. Attaching a high-perforation front door You might need to attach a front door to the rack. Use the procedure in this section to perform this task. To install the high-perforation front door, do the following: 1. Read the “Rack safety notices” on page 94. 2. Remove the top, left, and right trim panels. For details on removing the left and right trim panels from a7014-T00 or 7014-T42 rack, see “Removing and replacing 7014-T00 or 7014-T42 trim panels” on page 91 Figure 101. Attaching the rack door © Copyright IBM Corp. 2010 93 page 91. 1 Rack chassis 4 Right-side trim panel 2 Top trim panel 5 Spring clip 3 Left-side trim panel 3. Install the door latch on the right and the door hinges on the left. 4. For a high-perforation front door, align the door over the rack hinge, then move up the hinge pin on the door, and lower the hinge pin into the hinge. 5. Adjust the latch so the door latches securely. Rack safety notices You need to read the rack safety notices before installing equipment. Before installing a rack, rack features, or a system or expansion unit into a rack, read the following safety information. Attention: If you are installing equipment into a non-IBM rack, the rack must comply with the Electronics Industries Association (EIA) 310D specifications. If you do not have a rail kit designed for the equipment in the non-IBM rack, do not install the equipment into the rack as damage to the equipment or personal injury could occur. Figure 102. Removing the trim panels 94 Installing into a rack DANGER Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) Lift precautions: Attaching the rack doors 95 96 Installing into a rack Installing the rack security kit You might need to install the rack security kit. Use the procedure in this section to perform this task. This section also includes illustrations of the related hardware components and shows how these components relate to each other. To install a rack security kit (feature 6580) that consists of the security lock and security slide bars, complete the following procedure: 1. Read the “Rack safety notices” on page 94. 2. Verify the inventory in the rack security kit. v 1 Two lock hardware kits. Each kit contains: - Rack lock - Bracket - Screw - Two keys 2 Two security slide bars 3 Two locked/unlocked stickers 3. Remove the existing door latch. Figure 103. Rack security kit inventory © Copyright IBM Corp. 2010 97 a. Open the front rack door. b. On the inside of the door, remove the screw (4) in Figure 104, that secures the lock to the rack door. c. Remove the bracket (5). d. From the outside of the door, remove the door latch (6). Note: If the rack is equipped with the ruggedized kit, remove the jam nut and hex nut from the existing door latch and reinstall both nuts on the new door lock latch. 4. Install the locking latch. a. Insert the keyed rack lock into the latch slot on the front of the door (6) in Figure 104. b. Secure the lock by attaching the lock bracket (5) with the screw (4), on the inside of the door. 5. Repeat steps 3 on page 97 and 4 to install the second lock on the back rack door. 6. Install a security slide bar on the right side of the rack. Figure 104. Removing the existing door latch 98 Installing into a rack Note: Each slide bar rail has two long tabs on the bottom of the rail. The slide bar rails are identical and can be installed on either the right or left side cover panel. a. Unlatch right-side cover panel and lean the panel back so that you can access the top of the panel. b. With the flat side of the slide bar rail (7) in Figure 105, facing the inside of the cover panel (8), insert the two tabs (9), on the slide bar rail into the two vertical support channels (10) on the side cover panel. Note: When installed correctly, the slide rail moves from front to back. c. Reinstall the side panel cover on to the rack. d. Lock the side panel covers by sliding the bars to the front of the rack. e. Place a locked/unlocked sticker on the inside of the cover panel so that when the slide bar is in the locked position, the tab is over the locked symbol (11), as shown in Figure 106 on page 100, and over the unlocked symbol (12), when the slide bar is unlocked. Figure 105. Installing a security slide bar Installing the rack security kit 99 f. Repeat the procedure for the left side of the rack. Figure 106. Placing the locked/unlocked sticker on the cover panel 100 Installing into a rack Ruggedized kit You might need to remove or replace a part in the ruggedized kit. This section includes procedures so that you can perform these tasks. The ruggedized kit, feature code 6080, should be ordered at the same time the rack is ordered. The ruggedized kit brackets are installed at the manufacturer. Note: If you are installing a rack with the ruggedized kit and need to secure the rack to the floor, refer to “Installing the rack” on page 5. The following illustration highlights the contents of the kit and the approximate location of each bracket and hinges in the event that you need to uninstall and reinstall a part. 1 Rack 9 Brace hinges 2 Spacer 10 Brace 3 Brace hinge 11 Side-door securing screw mount 4 Hinge pivot studs 12 Spacer 5 Brace latch bracket 13 Screw 6 Spacer 14 Washer 7 Screw 15 Side door securing screw 8 Brace thumbscrew Figure 107. Ruggedized kit parts © Copyright IBM Corp. 2010 101 Releasing the ruggedized brace You might need to release the ruggedized brace. Use the procedure in this section to perform this task. To access the back of a system that is installed in a rack with a ruggedized kit, complete the following steps to release the ruggedized brace: 1. Remove the brace thumbscrew (8). 2. Swing the brace (10) out of the rack. 3. Access the system and reinstall the brace. Releasing the side panel with a ruggedized kit You might need to release the side panel on the rack. Use this procedure to perform this task. The ruggedized kit contains securing screws that secure the side panels to the rack. To remove a securing screw, complete the following steps: 1. If necessary, open or remove the back rack door. 2. Locate the securing screw mount (11) for the side door that will be removed. Figure 108. Releasing the brace to access systems 102 Installing into a rack 3. Using a screwdriver, remove the securing screw (15) and washer (14) from the side panel. To remove the side panel, see “Removing and replacing 7014-T00 or 7014-T42 side panels” on page 89. Figure 109. Removing the securing screws on the side panel Ruggedized kit 103 104 Installing into a rack Connecting multiple racks with rack-to-rack attachment kit You might need to connect multiple racks together. Use the procedure in this section to perform this task. This topic describes how to connect multiple racks to each other using a rack-to-rack attachment kit. To do this, you will need the rack-to-rack attachment kit (feature 7840). To connect multiple racks with the rack-to-rack attachment kit follow this procedure: 1. Read the “Rack safety notices” on page 94. 2. If they are installed, remove the side panels from each rack. Remove side panels only from the sides that will be attached to each other. To do this: a. Lift up the two panel-release tabs. b. Pull the panel up and away from the rack chassis. This motion will release the panel from the two lower J brackets. c. Store the side panels. 3. Remove the two Z brackets and the two J brackets. These brackets are used to hang the side panels. 4. Install the first two standoffs in the upper-left and lower-right corners of the first rack as shown in Figure 110 on page 106. 5. Install the second two standoffs in the upper-left and lower-right corners of the second rack as shown in Figure 110 on page 106. 6. Attach the long foam as shown in Figure 110 on page 106. For a model T42 rack, join the short foam to the end of the long foam, and adhere it to the frame length of the rack. 7. Position the racks together. 8. Align the standoff holes. You might need to adjust the leveling feet to do this. 9. Install a screw and washer into all four positions, but do not tighten. © Copyright IBM Corp. 2010 105 10. After all racks are bolted together, level the racks. 11. Tighten all four screws. 12. Snap on the trim pieces that go between the front and back racks. 13. Snap on the trim piece that goes on top and between the racks. 14. Install rack filler panels to cover the open areas at the front of the racks. All the gaps in the front of the rack must also be sealed, including the gaps between equipment. This step ensures that proper airflow within the rack is maintained. 15. Connect the cables that go between the racks. Figure 110. Removing the side panels, Z and J brackets, and installing standoffs and long foam to connect multiple racks Figure 111. Location of foam strips (top view) 106 Installing into a rack 16. If you are installing tip plates, go to step 5 in “Installing the rack” on page 5. Connecting multiple racks with rack-to-rack attachment kit 107 108 Installing into a rack Rack status beacon The rack status beacon is placed on top of a rack and cabled to multiple system units inside the rack. The server firmware illuminates the rack status beacon in response to any detected problems in the status. This helps you locate the problem more quickly. This is a customer task. You can perform this task yourself, or contact an service provider to perform the task for you. The service provider might charge you for this service. These procedures assume that the system units have been installed into the rack and have been connected to the power supply. If the system units have not been installed, refer to the installation instructions for your system unit. Installing the rack status beacon You might need to install the rack status beacon. Use the procedure in this section to perform this task. To install the rack status beacon, you need the following: v Rack status beacon assembly. v Two or more rack status beacon cables. You need one cable to connect the beacon to the junction box and one for each system unit that you want to connect to the beacon. v One or two power cords. v One or more junction boxes for the rack status beacon. v One or more junction box cables. To install the rack status beacon, follow these steps: 1. Place the rack status beacon (A), on top of the rack so that the light is pointing to the front of the rack. © Copyright IBM Corp. 2010 109 2. Plug one end of the rack status beacon cable (1) into the port on the back of the rack status beacon assembly. 3. Thread the rack status beacon cable through the rack's cable access cover, (B). 4. Plug the other end of the rack status beacon cable into the output port on the junction box, (C). 5. To connect the junction box to the system unit, plug one end of a rack status beacon cable (1), into an input port on the junction box. Figure 112. Connecting the rack status beacon to the system unit through the junction box Figure 113. Connecting the rack status beacon and system unit to the junction box 110 Installing into a rack 6. Plug the other end of the second rack status beacon cable into the beacon port on the system unit. This port is labeled with a beacon symbol. Note: On some system units, the orientation of the port is rotated. 7. To add additional system units to this rack status beacon, repeat steps 4 and 5 for each additional system unit. You can connect multiple junction boxes to add additional system units. Refer to “Connecting multiple junction boxes.” 8. Plug the power cords into the rack status beacon assembly. Only one power cord is required, but you can use two power cords for backup. 9. Plug the other end of the power cords into the rack power supply. If you install the rack status beacon during initial system setup, the initial program load (IPL) will read the rack identification information. If you install the rack status beacon on system units that are running, you will need to start the system again. Connecting multiple junction boxes You might need to connect multiple junction boxes. This section includes procedures so that you can perform these tasks. You can connect multiple junction boxes so that additional system units can be added to the rack status beacon. To connect multiple junction boxes, you need the following additional equipment: v One or more rack status beacon junction boxes. v One or more junction box cables. You need a junction box cable for each junction box that you want to add. 1. Plug the one end of the junction box cable (2), into an input port on the first junction box. Figure 114. Rack status beacon port Rack status beacon 111 2. Plug a junction box cable into the output port of the additional junction box. 3. Connect additional system units to the additional junction box as described in “Installing the rack status beacon” on page 109. 4. Repeat steps 1 - 3 for each additional junction box. Figure 115. Connect the rack status beacon and system unit to the junction box 112 Installing into a rack Installing or removing a rack-mounted system-unit latch bracket You might need to install or remove a latch bracket. Use the procedure in this section to perform this task These procedures describe how to install or remove the rack-mounted system-unit latch bracket. You can perform this task yourself, or contact a service provider to perform the task for you. The service provider might charge you for this service. To install or remove a latch bracket, complete the following steps: Note: The illustrations in these procedures may differ from your machine type and model. 1. Read the “Rack safety notices” on page 94. 2. Open the front rack door. 3. Push the latch bracket (A) or (B) into place on the side of the system unit. 4. Secure the latch bracket with two retaining screws (C) on the side of the latch bracket. 5. Replace the front cover. See “Removing and replacing covers on the 8233-E8B or 8236-E8C system” on page 135. 6. Close the front rack door. To remove a latch bracket, complete the following steps: 1. Remove the cover from the front of the system. See “Removing and replacing covers on the 8233-E8B or 8236-E8C system” on page 135. 2. Place the system in the service position. See the service information for your model. Figure 116. Reinstalling the latch brackets © Copyright IBM Corp. 2010 113 3. Remove the retaining screws (C) located on the side of the latch bracket (A) or (B). 4. Remove the latch bracket from the system unit. Figure 117. Removing the retaining screws 114 Installing into a rack Removing and replacing the rack top cover You might need to remove or replace the rack top cover. User these procedures to perform these tasks. Removing the rack top cover You might need to remove the rack top cover. Use this procedure to perform this task. Note: A 10 mm box socket with an extension bar is required to remove the screws from the top cover. Other tools might cause the screw heads to become rounded and unremovable. To remove the rack cover from the rack, do the following steps: 1. Remove the front door. Note: If the rack is locked, unlock the doors. Then unlock the side panels by removing the locking bars from both sides of the top rear of the rack. Slide the locking bars out from the rack at the top of each side of the rack from the rear. The locking bars can be removed from each side of the front of the rack if the rack space in the locking bars area is open and the bars are easily accessible. 2. Remove the top, left, and right trim panels. 3. Remove the side panels. For more information, see “Removing and replacing 7014-T00 or 7014-T42 side panels” on page 89. 4. Remove one screw from the right side of the top cover and one screw from the left side of the top cover. Do for both front and rear. 5. Locate the front (A) and back (C) rack braces that were provided in the shipping container. Fasten each rack brace at the top of the front and rear of the rack, just below the top cover. 6. Use the four screws (B) that were removed from the top cover to secure each rack brace to the rack, as shown in Figure 118 on page 116. © Copyright IBM Corp. 2010 115 A Front rack brace B Retaining screws (2 screws for each brace) C Back rack brace 7. Remove the remaining six screws from the left and right sides of the top cover (D). Figure 118. Securing the rack brace 116 Installing into a rack D Top cover E Cable access cover F Side cover (quantity 2) G EIA label 8. Lift off the top cover. Replacing the rack top cover You might need to replace the rack top cover. Use this procedure to perform this task. Note: A 10 mm box socket with an extension bar is required to replace the screws into the rack top cover. Other tools might cause the screw heads to become rounded and unable to be removed again. To replace the rack top cover onto the rack, do the following: 1. Position the rack top cover (D) on the rack. 2. Install the six screws at the left and right sides of the rack top cover. Figure 119. Removing the top cover Removing and replacing the rack top cover 117 D Top cover E Cable access cover F Side cover (quantity 2) G EIA label Note: A 10 mm box socket with an extension bar is required to replace the screws into the rack top cover. Other tools might cause the screw heads to become rounded and unable to be removed again. 3. Remove the rack braces (A) and (C) by unfastening them at the top of the rack, just below the top cover. Remove the braces at the front and rear of the rack. Figure 120. Removing the top cover 118 Installing into a rack A Front rack brace B Retaining screws (quantity 2 for each brace) C Back rack brace 4. Install the screws (B) you removed from the rack braces at the right and left side of the top cover. Install the screws for both the front and rear of the rack. 5. Install rack filler panels to cover open areas at the front of the rack. Seal all gaps in the front of the rack, including the gaps between pieces of equipment. This step ensures that proper airflow within the rack is maintained. Figure 121. Removing the top cover Removing and replacing the rack top cover 119 120 Installing into a rack Power distribution unit plus The power distribution unit plus (PDU+) can be installed in the 7014-T00 and 7014-T42 racks. It allows you to monitor the individual power loads of the devices that are plugged into it. Installing the PDU+ in the side of a rack Learn how to install the power distribution unit plus (PDU+) in the side of a rack. Tip: Removing the rack doors and side panels might make installation easier. Important: You must use clip nuts to install the mounting brackets. Clip nuts are provided with the PDU+ and install on the rack-mounting flanges. To install the PDU+ in the 1U mounting space in the side of a rack, complete the following steps 1. Read the “Rack safety notices” on page 94. 2. Align the vertical-mounting brackets to the front of the PDU+. Make sure that you attach the brackets so that the power outlets face the rear of the rack. 3. Attach the brackets to the PDU+ with two M3x5 screws per bracket. Use the screws that were provided with the PDU+. 4. Align the PDU+ with the opening in the side of the rack. Then, while holding the PDU+ in place, attach the brackets to the rack-mounting flanges with four clip nuts and four M5 screws that were provided with the PDU+. Figure 122. Aligning the vertical-mounting brackets to the front of the PDU+ © Copyright IBM Corp. 2010 121 Attention: You must disconnect the main input power before connecting or disconnecting the input power cord from the PDU+. 5. If the PDU+ was provided with a detached power cord, connect the power cord now. Align the connector on the power cord that was provided with the PDU+ with the connector on the front of the PDU+, turning as necessary for key alignment. Then, turn the twist-lock on the connector clockwise until it locks into place. Figure 123. Aligning the PDU+ with the opening in the side of the rack Figure 124. Aligning the connector on the power cord with the PDU+ 122 Installing into a rack 6. Route the power cord from the PDU+ toward the rack side braces. Then, route the power cord along a side brace toward the back of the rack and secure the power cord with the cable straps that are provided with the PDU+. 7. Route the power cord toward a dedicated power source. Use the provided cable straps to secure the power cord along the way. Use the openings in the rack, if the power cord must exit the rack to connect to a power source. Attention: To prevent damage to a power device and other connected devices, always connect the power device to an authorized power source for that device. 8. Connect the power cord to a properly wired and grounded dedicated power source. Then, you can connect servers or rack PDUs in the rack to the power outlets on the PDU+. 9. Route all of the other power cables neatly and secure the power cables with cable straps. Related concepts “Setting up power monitoring using the PDU+” on page 127 You can monitor the power status for any device that is connected to the power distribution unit plus (PDU+), either manually or remotely, through the PDU+ Web interface. You can use the IBM DPI® Configuration Utility to initially set up the PDU+ and to configure PDU+ settings such as network parameters, access control table, and trap receivers table. Installing the PDU+ horizontally in a rack You might need to install the PDU horizontally, use the instructions in this section to complete this task. Tip: Removing the rack doors and side panels might make installation easier. Important: Use cage nuts for rack cabinets with square holes, and use clip nuts for rack cabinets with round holes. If your rack cabinet requires cage nuts, use a cage-nut-insertion tool or a flat-blade screwdriver to install them.. To install the PDU+ in the rack, complete the following steps 1. Read the “Rack safety notices” on page 94. 2. Align the vertical-mounting brackets to the front of the PDU+. Make sure that you attach the brackets so that the power outlets face the rear of the rack. 3. Align the long mounting-brackets with the holes in the rear of thePDU+ and attach the brackets to the PDU+ with two M3 pan-head screws with captive lock washers per bracket. Use the screws that Figure 125. Aligning the vertical-mounting brackets to the front of the PDU+. Power distribution unit plus 123 come with the rack-mounting kit. 4. Hold the PDU+ at a slight angle and carefully insert it into the 1-U mounting space in the rack cabinet. Pushing in slightly on both of the long mounting-brackets helps clear the brackets from the rack flanges. 124 Installing into a rack Attention: You must disconnect the main input power before connecting or disconnecting the input power cord from the PDU+. 5. Secure the end of the PDU+ that has the short mounting-brackets to the rack cabinet first. Make sure that the short mounting-brackets are aligned with the outside of the rack flanges. Attach the brackets to the rack flanges with two M6 screws and two cage nuts or clip nuts per bracket. Use the cage nuts or clip nuts and the screws that come with the rack-mounting kit. 6. Secure the long mounting-brackets and the 1-U blank filler panel to the rack cabinet by doing the following steps: Figure 126. Aligning the PDU+ with the opening in the rack. Power distribution unit plus 125 a. Adjust the long mounting-brackets to fit the depth of the rack cabinet. b. Make sure that the long mounting-brackets are aligned with the inside of the rack flanges. c. Align the blank filler panel on the outside of the rack flanges. d. Attach the filler panel to the rack flanges and then to the long mounting-bracket with one M6 screw per bracket. e. Tighten the M3 pan-head screws that secure the long mounting-brackets to the PDU. 7. Secure the end of the PDU+ that has the short mounting-brackets to the rack cabinet first. Make sure that the short mounting-brackets are aligned with the outside of the rack flanges. Attach the brackets to the rack flanges with two M6 screws and two cage nuts or clip nuts per bracket. Use the cage nuts or clip nuts and the screws that come with the rack-mounting kit. 8. If the PDU+ was provided with a detached power cord, connect the power cord now. Align the connector on the power cord that was provided with the PDU+ with the connector on the front of the PDU+, turning as necessary for key alignment. Then, turn the twist-lock on the connector clockwise until it locks into place. 126 Installing into a rack 9. Route the power cord from the PDU+ toward the rack side braces. Then, route the power cord along a side brace toward the back of the rack and secure the power cord with the cable straps that are provided with the PDU+. 10. Route the power cord toward a dedicated power source. Use the provided cable straps to secure the power cord along the way. Use the openings in the rack, if the power cord must exit the rack to connect to a power source. Attention: To prevent damage to a power device and other connected devices, always connect the power device to an authorized power source for that device. 11. Connect the power cord to a properly wired and grounded dedicated power source. Then, you can connect servers or rack PDUs in the rack to the power outlets on the PDU+. 12. Route all of the other power cables neatly and secure the power cables with cable straps. Setting up power monitoring using the PDU+ You can monitor the power status for any device that is connected to the power distribution unit plus (PDU+), either manually or remotely, through the PDU+ Web interface. You can use the IBM DPI Configuration Utility to initially set up the PDU+ and to configure PDU+ settings such as network parameters, access control table, and trap receivers table. Note: All of the Configuration Utility configuration options are available through the Web interface after the PDU+ is set up on the local network. Related tasks “Installing the PDU+ in the side of a rack” on page 121 Learn how to install the power distribution unit plus (PDU+) in the side of a rack. Using the IBM DPI Configuration Utility Learn how to use the IBM DPI Configuration Utility to configure the power distribution unit plus (PDU+) settings, such as the IP address, network parameters, access control table, and trap receivers table. Figure 127. Aligning the connector on the power cord with the PDU+ Power distribution unit plus 127 Connecting the console You can configure the PDU+ using a workstation or notebook computer that is connected to the PDU+. Connect the DB9-to-RJ-45 cable that is shipped with the PDU+ to the RJ-45 console connector on the PDU+, and to a RS-232 serial (COM) connector on a workstation or notebook computer. Using HyperTerminal HyperTerminal is a terminal program in a Microsoft® Windows® operating system that enables you to configure or control a device using command line parameters. You can configure the PDU+ parameters and its outlets using numeric commands from a keyboard. You can also use Telnet or any other terminal program to configure the PDU+ after the IP address is set. To start HyperTerminal and communicate with the PDU+, complete the following steps: 1. Click Start ? Programs ? Accessories ? Communications ? HyperTerminal. The Connection Description window is displayed. 2. Type the name for the connection in the Name field and select an icon for the connection. 3. Click OK. The Connect To window is displayed. 4. In the Connect using field, select the COM port that is connected to the PDU+. 5. Click OK. The Properties window is displayed. 6. Click Restore Defaults to use the default settings. Make sure that the Bits per second field is 9600 and that the Flow control field is None. 7. Click OK. 8. Press any key. The Configuration Utility main menu is displayed. 9. Type the default password, passw0rd (all lowercase letters with a zero, not O), and press Enter. 10. Enter the menu option that you want. For descriptions of the options, see “Configuration Utility menu options.” Configuration Utility menu options The following options are on the Configuration Utility main menu: IBM DPI Settings When you select IBM DPI Settings, the IBM DPI Configuration Utility window is displayed with the following options: Set the IP Address, Gateway Address and MIB System Group View and change the IP address, date, time, and MIB system information. Set IBM DPI Control Group Set the administrator user name, password, and access protocols. Set Write Access Managers Set up a list of users who can access and control the PDU+. Set Trap Receivers Configure remote network management system (NMS) servers to receive traps. Set Date and Time Adjust the date and time information for the PDU+. Set Superuser Name and Password Set the user name and password of the administrator who will use a Web browser to configure the PDU+. E-mail Notification Set up a list of users who will be alerted with event messages if an unusual event is triggered on the PDU+ system. 128 Installing into a rack Set Multi-Users Configure other user and password logins and the read and write access levels. Set IBM DPI Information Configure the PDU+ logging interval, refresh rate, and custom name fields for the load groups. Settings and Event Log Summary View all PDU+ configuration settings. Reset Configuration to Default Reset all system settings to their factory default values. Restart HD-PDU Restart the PDU+. Setting the IP address Important: You must set the IP address before you can use the Web interface or access the PDU+ in an IP network (LAN/WAN). Contact your system administrator if you do not know the IP address. To set the IP address, complete the following steps: 1. In the Configuration Utility main menu, enter the menu option for IBM DPI Settings. 2. Enter the menu option for Set the IP Address, Gateway Address and MIB System Group. Using the Web interface Learn how to use the Web interface to configure and monitor the power distribution unit plus (PDU+) remotely. The PDU+ provides a graphical user interface that you can view from a Web browser. Using a Web browser, you can access and monitor the PDU+ power outlets and output devices remotely from a workstation or notebook computer. Starting the Web interface To start the Web interface, complete the following steps: 1. Start a Web browser from a workstation or notebook computer, and enter the IP address of the PDU+ in the address field. The Connect to window is displayed. Note: For more information about setting the IP address of the system, see “Setting the IP address.” 2. In the User name field, type USERID (all uppercase letters). 3. In the Password field, type passw0rd (all lowercase letters with a zero, not O). 4. Click OK. The main status page is displayed. The main status page displays a graphical representation of the PDU+ power outlets and input status: v The left pane displays the menus and submenus for the PDU+. Click a menu to display the menu options, expand the menu items, and modify the menu options as required. v The graphic displayed in the right pane shows the status of the outlets, input voltage, output voltage, frequency, current and power, watt-hour consumption, and cumulative kilowatt hour power consumption. If you connect an optional environmental monitored probe, the temperature and humidity environment conditions are displayed. Each menu page provides online help to assist you with configuring the PDU+. Click the Help icon at the top of each page to view the help. Modifying the basic settings Use the System menu to configure the PDU+ system parameters such as the superuser name, password, IP address, date, and time. Power distribution unit plus 129 Changing the superuser name and password: You can set the user name and password of the administrator who will use a Web browser to configure the PDU+ on the Configuration Utility page. To change the superuser name and password, complete the following steps: 1. From the main status page, in the left navigation pane, click System. 2. Click Configuration to view and modify the system configuration and superuser user name and password. Identifying the PDU+ and Web/SNMP card: You can view the PDU+ and Web/SNMP card information on the Identification of Power Management page. To view the power management information of the PDU+ and Web/SNMP card, complete the following steps: 1. From the main status page, in the left navigation pane, click System. 2. Click Identification to view the PDU+ and Web/SNMP card information. Adding users: You can add users who can access and control the PDU+ on the Multi-User Configuration page. To create a list of users who can access and control the PDU+, complete the following steps: 1. From the main status page, in the left navigation pane, click System. 2. Click Multi-User to add users who can only view the PDU+ status or users who can change the PDU+ settings. Changing the date and time: You can change the date and time of the PDU+ on the Date and Time page. Important: Changing the PDU+ date and time affects other system settings such as e-mail, traps, and logs. To change the date and time, complete the following steps: 1. From the main status page, in the left navigation pane, click System. 2. Click Date and Time to view and modify the system date and time. You can set the date and time manually, synchronize it with the computer time, or synchronize it with an NTP server. Changing event alerts: You can change event alerts on the SNMP Trap Receivers page. To configure the PDU+ to send e-mail or SNMP trap alerts to specified users when specific events occur, complete the following steps: 1. From the main status page, in the left navigation pane, click System. 2. Click Trap Receivers to create a list of users or workstations who will be alerted with an SNMP trap message. You can specify the IP addresses of up to eight trap receivers, the community information, type of trap, severity of trap, and description of the events that cause the traps. 130 Installing into a rack 3. Click Email Notification under System to create a list of up to four users who will be alerted with an e-mail. Use this menu to specify the mail server, user account, DNS, and other information necessary to set up a mail server for sending mail alerts. Use the Email Receivers Table to add the e-mail addresses. Changing the network information Use the Network menu to change the network information for the PDU+, for example, the IP address. Changing the network configuration: You can view or change the network configuration on the Network Configuration page. To view or change the network configuration of the PDU+, complete the following steps: 1. From the main status page, in the left navigation pane, click Network. 2. Click Configuration to set the PDU+ IP address, gateway address, subnet mask, and Domain Name System (DNS) address. 3. Click Control to configure TCP/IP settings. 4. Click Access Control to set access control to prevent unauthorized users from accessing the PDU+. History and event log summaries The Logs menu provides a detailed description of all events and a record of the PDU+ status. System administrators can use this page to analyze problems with network equipment. Viewing the history log: You can view the complete history of the PDU+ inputs and outputs on the History Log page. To view the history of the PDU+, complete the following steps: 1. From the main status page, in the left navigation pane, click Logs. 2. Click History. Each event log file shows the time, date, and description of all the events occurring on the PDU+. Viewing the event log: You can view the complete record of the PDU+ events on the Event Log page. To view the complete record of the PDU+ events, complete the following steps: 1. From the main status page, in the left navigation pane, click Logs. 2. Click Events. Each log file shows a record of the input power and output power of each outlet. Power distribution unit plus 131 132 Installing into a rack Removing and replacing the expansion unit cover or door You might need to remove, replace, or install covers or doors on an expansion unit as a part of accessing components or performing a service action. Removing the front cover on the 7314-G30 or 5796 You might need to remove the front cover to perform service to the system. To remove the front cover, follow these steps: 1. Remove the two thumbscrews (B) located on the left and right of the cover. 2. Press in on the two latches (C) located left and right of the cover to release the cover. 3. Lift out the cover (A) and away from the chassis. Tip: If an airflow block is present on an unpopulated side of the chassis do not remove it unless you are populating that side of the chassis. Installing the front cover on the 7314-G30 or 5796 Use this procedure to install the front cover after installing the system or performing a service action. If you only have one side of the chassis populated, ensure that the airflow block is present on the unpopulated side. 1. Insert the posts on the bottom of the cover (A) into the hooks on the chassis of the enclosure. Figure 128. Remove the front cover for model 7314-G30 or 5796 © Copyright IBM Corp. 2010 133 2. Press in on the two latches (C) on the right and left of the front cover. 3. Press the cover onto the chassis and release the latches. The cover securely snaps into place. 4. Replace the two thumbscrews (B) into the slots on the left and right of the front cover. Figure 129. Front cover on a model 7314-G30 or 5796 134 Installing into a rack Removing and replacing covers on the 8233-E8B or 8236-E8C system Use these instructions to remove, replace, or install covers to access components or perform service. Removing the service access cover from an 8233-E8B or 8236-E8C system Use this procedure to remove the service access cover to perform service or to gain access to internal components. To remove the service access cover, follow these steps: 1. Place the system into the service position. For instructions, see “Placing a 8233-E8B or 8236-E8C in the service position” on page 145. 2. Loosen the two thumbscrews (A) located at the back of the cover. 3. Slide the cover (B) toward the back of the system unit. When the front of the service access cover clears the upper frame ledge, lift the cover up and off the system unit. Attention: For proper cooling and airflow, install the cover before starting the system. Operating the system without the cover for more than 30 minutes could damage the system components. Figure 130. Removing the service access cover © Copyright IBM Corp. 2010 135 Installing the service access cover on an 8233-E8B or 8236-E8C system Use this procedure to install the service access cover after performing service or accessing internal components. To install the service access cover, follow these steps: 1. Place the service access cover (A) on the top of the system unit, approximately 25 mm (1 in.) from the front of the system unit. 2. Hold the service access cover against the system unit, and slide it toward the front of the system. The tabs on the service access cover slide beneath the upper chassis ledge, and the two screws align with the screw holes at the back of the system unit. Important: Ensure that the fan LED cables do not get caught on the front edge of the service access cover as you move it forward. 3. Tighten the screws (B) located at the back of the cover. Removing the front cover from an 8233-E8B or 8236-E8C system Use this procedure to remove the cover to access components or perform service. To remove the front cover, follow these steps: 1. Remove the two screws (A) that secure the system to the rack (B) as shown in the following figure. 2. Pull the cover away from the system. The cover has an indentation where you can hold onto it more easily. Figure 131. Installing the service access cover 136 Installing into a rack Installing the front cover on an 8233-E8B or 8236-E8C system Use this procedure to install the cover after accessing components or performing service. To install the front cover, follow these steps: 1. Align the cover with the system. 2. Gently push the cover in until the four cover clips are seated in their respective mounting posts (B) on the system. 3. Replace the two screws (C) that secure the system to the rack (A). Figure 132. Removing the front cover Removing and replacing covers on the 8233-E8B or 8236-E8C system 137 Removing and replacing the front cover on a 9117-MMB or 9179-MHB Use these procedures to remove and replace the cover to access components or perform service. Removing the front cover from the 9117-MMB or 9179-MHB Use this procedure to remove the cover to access components or perform service. To remove the front cover follow these steps: 1. If necessary, open the front rack door. 2. Pull out the latches (A) located on both sides of the cover as shown in the following figure. Figure 133. Installing the front cover 138 Installing into a rack 3. Pull off the cover (B) to remove it from the system unit. Installing the front cover on the 9117-MMB or 9179-MHB Use this procedure to install the cover after accessing components or performing service. To install the front cover, follow these steps: 1. Position the cover (A) on the front of the system unit so that the four pins on the system match the four holes on the back of the cover. Figure 134. Removing the front cover Removing and replacing covers on the 8233-E8B or 8236-E8C system 139 2. Press the tabs (B) to snap the cover into position. 3. Close the front rack door. Figure 135. Installing the front cover 140 Installing into a rack Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position Use these procedures to place a system into the service position or operating position to perform service or to gain access to internal components. Placing the rack-mounted system or expansion unit in the service position Use this procedure to perform service or gain access to internal components by placing the rack-mounted system or expansion unit in the service position. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) © Copyright IBM Corp. 2010 141 DANGER Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) To place a rack-mounted system or expansion unit into the service position, follow these steps: 1. If necessary, open the front rack door. 2. Remove the two screws (A) that secure the system or expansion unit (B) to the rack as shown in the following figure. 142 Installing into a rack 3. Release the rack latches (A) on both the left and right sides as shown in the following figure. Figure 136. Removing the screws from the system and rack Figure 137. Releasing the rack latches Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 143 4. Read the following note, and then slowly pull the system or expansion unit out from the rack until the rails are fully extended and locked. Remember: v If the procedure you are performing requires you to unplug cables from the back of the system or expansion unit, do so before you pull the unit out from the rack. v Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you pull the unit out from the rack. v Ensure the rails are fully extended. When the rails are fully extended, the rail safety latches lock into place. This action prevents the system or expansion unit from being pulled out too far. Placing the rack-mounted system or expansion unit in the operating position Use this procedure to place the rack-mounted system or expansion unit in the operating position to make the unit available for use. Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. To place the rack-mounted system or expansion unit into the operating position, follow these steps: 1. Simultaneously release the blue rail safety latches (A), located near the front of each rail, and push the system or expansion unit into the rack as shown in the following figure. Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you push the unit back into the rack. Figure 138. Releasing the rail safety latches 144 Installing into a rack 2. Replace and tighten the two screws (C) that secure the system or expansion unit (A) to the rack as shown in the following figure. 3. Close the front rack door. Placing a 8233-E8B or 8236-E8C in the service position Use this procedure to perform service or gain access to internal components by placing the rack-mounted system or expansion unit in the service position. Note: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. Figure 139. Pushing the system into the rack and attaching the screws Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 145 DANGER When working on or around the system, observe the following precautions: Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product. v Do not open or service any power supply assembly. v Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm. v The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. v Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. v Connect any equipment that will be attached to this product to properly wired outlets. v When possible, use one hand only to connect or disconnect signal cables. v Never turn on any equipment when there is evidence of fire, water, or structural damage. v Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. v Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices. To Disconnect: 1. Turn off everything (unless instructed otherwise). 2. Remove the power cords from the outlets. 3. Remove the signal cables from the connectors. 4. Remove all cables from the devices To Connect: 1. Turn off everything (unless instructed otherwise). 2. Attach all cables to the devices. 3. Attach the signal cables to the connectors. 4. Attach the power cords to the outlets. 5. Turn on the devices. (D005) DANGER 146 Installing into a rack Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet. Always install servers and optional devices starting from the bottom of the rack cabinet. v Rack-mounted devices are not to be used as shelves or work spaces. Do not place objects on top of rack-mounted devices. v Each rack cabinet might have more than one power cord. Be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. v Connect all devices installed in a rack cabinet to power devices installed in the same rack cabinet. Do not plug a power cord from a device installed in one rack cabinet into a power device installed in a different rack cabinet. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. CAUTION v Do not install a unit in a rack where the internal rack ambient temperatures will exceed the manufacturer's recommended ambient temperature for all your rack-mounted devices. v Do not install a unit in a rack where the air flow is compromised. Ensure that air flow is not blocked or reduced on any side, front, or back of a unit used for air flow through the unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of the circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to a rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement of the supply circuit. v (For sliding drawers.) Do not pull out or install any drawer or feature if the rack stabilizer brackets are not attached to the rack. Do not pull out more than one drawer at a time. The rack might become unstable if you pull out more than one drawer at a time. v (For fixed drawers.) This drawer is a fixed drawer and must not be moved for servicing unless specified by the manufacturer. Attempting to move the drawer partially or completely out of the rack might cause the rack to become unstable or cause the drawer to fall out of the rack. (R001) To place the rack-mounted system or expansion unit into the service position, follow these steps: 1. If necessary, open the front rack door. 2. Remove the two screws (A) that secure the system unit to the rack as shown in the following figure. 3. Release the rack latches (B) on both the left and right sides as shown in the following figure. Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 147 4. If required, unplug the cables from the back of the system or expansion unit before you pull the unit out from the rack. Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you pull the unit out from the rack. 5. Slowly pull the system or expansion unit out from the rack until the rails are fully extended and locked. Ensure that the rails are fully extended. When the rails are fully extended, the rail safety latches lock into place. This action prevents the system or expansion unit from being pulled out too far. Placing the 8233-E8B or 8236-E8C in the operating position Use this procedure to place the system or expansion unit in the operating position to make the unit available for use. To place the system into the operating position follow these steps: Tip: Some of the figures in these procedures might not look exactly like the system or expansion unit that you have. However, the steps to perform the task are the same. 1. Simultaneously release the blue rail safety latches (B), located near the front of each rail, and push the system or expansion unit into the rack as shown in the following figure. Note: Ensure that the cables at the rear of the system or expansion unit do not catch or bind as you push the unit back into the rack. Figure 140. Releasing the rack latches 148 Installing into a rack 2. Replace and tighten the two thumbscrews (C) that secure the system or expansion unit (A) to the rack as shown in the following figure. 3. Close the front rack door. Figure 141. Releasing the rail safety latches Figure 142. Replacing the thumbscrews Placing the rack-mounted 8233-E8B or 8236-E8C system into the service position or operating position 149 150 Installing into a rack Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer's representative for information on the products and services currently available in your area. Any reference to the manufacturer's product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not grant you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: THIS INFORMATION IS PROVIDED “AS IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to Web sites not owned by the manufacturer are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this product and use of those Web sites is at your own risk. The manufacturer may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment. Information concerning products not produced by this manufacturer was obtained from the suppliers of those products, their published announcements or other publicly available sources. This manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products not produced by this manufacturer. Questions on the capabilities of products not produced by this manufacturer should be addressed to the suppliers of those products. All statements regarding the manufacturer's future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only. © Copyright IBM Corp. 2010 151 The manufacturer's prices shown are the manufacturer's suggested retail prices, are current and are subject to change without notice. Dealer prices may vary. This information is for planning purposes only. The information herein is subject to change before the products described become available. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. If you are viewing this information in softcopy, the photographs and color illustrations may not appear. The drawings and specifications contained herein shall not be reproduced in whole or in part without the written permission of the manufacturer. The manufacturer has prepared this information for use with the specific machines indicated. The manufacturer makes no representations that it is suitable for any other purpose. The manufacturer's computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check the manufacturer's support websites for updated information and fixes applicable to the system and related software. Trademarks IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml. Microsoft and Windows are trademarks of Microsoft Corporation in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others. Electronic emission notices Class A Notices The following Class A statements apply to the IBM servers that contain the POWER7 processor and its features unless designated as electromagnetic compatibility (EMC) Class B in the feature information. Federal Communications Commission (FCC) statement Note: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case the user will be required to correct the interference at his own expense. 152 Installing into a rack Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate the equipment. This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class A digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe A est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to European Standard EN 55022. The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Warning: This is a Class A product. In a domestic environment, this product may cause radio interference, in which case the user may be required to take adequate measures. VCCI Statement - Japan The following is a summary of the VCCI Japanese statement in the box above: This is a Class A product based on the standard of the VCCI Council. If this equipment is used in a domestic environment, radio interference may occur, in which case, the user may be required to take corrective actions. Notices 153 Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) Electromagnetic Interference (EMI) Statement - People's Republic of China Declaration: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may need to perform practical action. Electromagnetic Interference (EMI) Statement - Taiwan The following is a summary of the EMI Taiwan statement above. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information: 154 Installing into a rack Electromagnetic Interference (EMI) Statement - Korea Please note that this equipment has obtained EMC registration for commercial use. In the event that it has been mistakenly sold or purchased, please exchange it for equipment certified for home use. Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse A EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. EN 55022 Klasse A Geräte müssen mit folgendem Warnhinweis versehen werden: "Warnung: Dieses ist eine Einrichtung der Klasse A. Diese Einrichtung kann im Wohnbereich Funk-Störungen verursachen; in diesem Fall kann vom Betreiber verlangt werden, angemessene Maßnahmen zu ergreifen und dafür aufzukommen." Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. Notices 155 New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse A. Electromagnetic Interference (EMI) Statement - Russia Class B Notices The following Class B statements apply to features designated as electromagnetic compatibility (EMC) Class B in the feature installation information. Federal Communications Commission (FCC) statement This equipment has been tested and found to comply with the limits for a Class B digital device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable protection against harmful interference in a residential installation. This equipment generates, uses, and can radiate radio frequency energy and, if not installed and used in accordance with the instructions, may cause harmful interference to radio communications. However, there is no guarantee that interference will not occur in a particular installation. If this equipment does cause harmful interference to radio or television reception, which can be determined by turning the equipment off and on, the user is encouraged to try to correct the interference by one or more of the following measures: v Reorient or relocate the receiving antenna. v Increase the separation between the equipment and receiver. v Connect the equipment into an outlet on a circuit different from that to which the receiver is connected. v Consult an IBM-authorized dealer or service representative for help. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for any radio or television interference caused by unauthorized changes or modifications to this equipment. Unauthorized changes or modifications could void the user's authority to operate this equipment. 156 Installing into a rack This device complies with Part 15 of the FCC rules. Operation is subject to the following two conditions: (1) this device may not cause harmful interference, and (2) this device must accept any interference received, including interference that may cause undesired operation. Industry Canada Compliance Statement This Class B digital apparatus complies with Canadian ICES-003. Avis de conformité à la réglementation d'Industrie Canada Cet appareil numérique de la classe B est conforme à la norme NMB-003 du Canada. European Community Compliance Statement This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards. This product has been tested and found to comply with the limits for Class B Information Technology Equipment according to European Standard EN 55022. The limits for Class B equipment were derived for typical residential environments to provide reasonable protection against interference with licensed communication equipment. European Community contact: IBM Deutschland GmbH Technical Regulations, Department M456 IBM-Allee 1, 71139 Ehningen, Germany Tele: +49 7032 15-2937 e-mail: tjahn@de.ibm.com VCCI Statement - Japan Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline (products less than or equal to 20 A per phase) Japanese Electronics and Information Technology Industries Association (JEITA) Confirmed Harmonics Guideline with Modifications (products greater than 20 A per phase) Notices 157 IBM Taiwan Contact Information Electromagnetic Interference (EMI) Statement - Korea Germany Compliance Statement Deutschsprachiger EU Hinweis: Hinweis für Geräte der Klasse B EU-Richtlinie zur Elektromagnetischen Verträglichkeit Dieses Produkt entspricht den Schutzanforderungen der EU-Richtlinie 2004/108/EG zur Angleichung der Rechtsvorschriften über die elektromagnetische Verträglichkeit in den EU-Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse B ein. Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw. wenn Erweiterungskomponenten von Fremdherstellern ohne Empfehlung von IBM gesteckt/eingebaut werden. Deutschland: Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem “Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG)“. Dies ist die Umsetzung der EU-Richtlinie 2004/108/EG in der Bundesrepublik Deutschland. Zulassungsbescheinigung laut dem Deutschen Gesetz über die elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse B Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das EG-Konformitätszeichen - CE - zu führen. Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller: International Business Machines Corp. New Orchard Road Armonk, New York 10504 Tel: 914-499-1900 Der verantwortliche Ansprechpartner des Herstellers in der EU ist: IBM Deutschland GmbH Technical Regulations, Abteilung M456 158 Installing into a rack IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 7032 15-2937 e-mail: tjahn@de.ibm.com Generelle Informationen: Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse B. Terms and conditions Permissions for the use of these publications is granted subject to the following terms and conditions. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved. You may not distribute, display or make derivative works of these publications, or any portion thereof, without the express consent of the manufacturer. Commercial Use: You may reproduce, distribute and display these publications solely within your enterprise provided that all proprietary notices are preserved. You may not make derivative works of these publications, or reproduce, distribute or display these publications or any portion thereof outside your enterprise, without the express consent of the manufacturer. Except as expressly granted in this permission, no other permissions, licenses or rights are granted, either express or implied, to the publications or any information, data, software or other intellectual property contained therein. The manufacturer reserves the right to withdraw the permissions granted herein whenever, in its discretion, the use of the publications is detrimental to its interest or, as determined by the manufacturer, the above instructions are not being properly followed. You may not download, export or re-export this information except in full compliance with all applicable laws and regulations, including all United States export laws and regulations. THE MANUFACTURER MAKES NO GUARANTEE ABOUT THE CONTENT OF THESE PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR PURPOSE. Bull D10 and D20 I/O Drawers Service Guide 86 A1 38EG 01 ORDER REFERENCE Bull D10 and D20 I/O Drawers Service Guide Hardware June 2003 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 38EG 01 ORDER REFERENCE The following copyright notice protects this book under the Copyright laws of the United States of America and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull S.A. 1992, 2003 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. AIX is a registered trademark of International Business Machines Corporation, and is being used under licence. UNIX is a registered trademark in the United States of America and other countries licensed exclusively through the Open Group. Linux is a registered trademark of Linus Torvalds. The information in this document is subject to change without notice. Groupe Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material. Contents Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii Laser Safety Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Laser Compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Data Integrity and Verification . . . . . . . . . . . . . . . . . . . . . . . . . . . xi About This Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii Highlighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii Related Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiv Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiv Chapter 1. Reference Information . . . . . . . . . . . . . . . . . . . . . . . . . . 1 D10 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 D10 I/O Subsystem Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 I/O Subsystem PCI-X Slot LED Definitions. . . . . . . . . . . . . . . . . . . . . . . 2 D10 Power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Subsystem Positioning and Cabling . . . . . . . . . . . . . . . . . . . . . . . . . 6 D10 I/O Subsystem Locations. . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 D10 I/O Subsystem Front Locations. . . . . . . . . . . . . . . . . . . . . . . . . 10 D10 I/O Subsystem Rear Locations. . . . . . . . . . . . . . . . . . . . . . . . . 11 D10 I/O Backplane Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 D10 I/O Subsystem PHB. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 D10 Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 AIX and Physical Location Code Table. . . . . . . . . . . . . . . . . . . . . . . . 14 D10 I/O Subsystem Location Code Table. . . . . . . . . . . . . . . . . . . . . . . 15 D20 Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 D20 I/O Subsystem Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 D20 PCI-X Slots. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 D20 Power. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 D20 Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 AIX and Physical Location Code Table. . . . . . . . . . . . . . . . . . . . . . . . 31 D20 Location Code Table . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 Powering the System On and Off . . . . . . . . . . . . . . . . . . . . . . . . . . 34 Power-On Self-Test. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 D10 and D20 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 D10 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 D20 Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36 External AC Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Service Inspection Guide. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 Chapter 2. Diagnostics and Maintenance Procedures . . . . . . . . . . . . . . . . . . 39 Power and Attention LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Resetting the LEDs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 FRU Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . . . . . . 40 Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 Error Code to FRU Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 Using the Service Processor . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 iii Using System Management Services . . . . . . . . . . . . . . . . . . . . . . . . . 40 Chapter 3. Removal and Replacement Procedures . . . . . . . . . . . . . . . . . . . 41 Safety Considerations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Handling Static-Sensitive Devices . . . . . . . . . . . . . . . . . . . . . . . . . . 42 Stopping and Starting the System . . . . . . . . . . . . . . . . . . . . . . . . . . 42 Removal and Replacement Procedures for the D10 I/O Subsystem . . . . . . . . . . . . . . 42 D10 FRU Replacement Procedure List . . . . . . . . . . . . . . . . . . . . . . . 42 D10 Service Position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 D10 Operating Position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 D10 Front Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 D10 Covers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 D10 PCI Adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 PCI Hot-Plug Manager Access . . . . . . . . . . . . . . . . . . . . . . . . . . 53 Removing and Replacing a PCI Adapter Cassette . . . . . . . . . . . . . . . . . . . 55 PCI Adapter or Blank Filler Removal from a Cassette Assembly . . . . . . . . . . . . . . 59 Replacing an Adapter in a PCI Adapter Cassette . . . . . . . . . . . . . . . . . . . . 70 Short Adapter or Blank Filler Installation . . . . . . . . . . . . . . . . . . . . . . . 74 Long Adapter Installation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 D10Fan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 D10 RIO Cable. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 D10 RIO Bus Adapter Assembly . . . . . . . . . . . . . . . . . . . . . . . . . 105 D10 I/O Backplane Assembly . . . . . . . . . . . . . . . . . . . . . . . . . . 106 D10 Power Supply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Removal and Replacement Procedures for the D20 I/O Subsystem . . . . . . . . . . . . . 108 D20 FRU Replacement Procedure List . . . . . . . . . . . . . . . . . . . . . . . 108 D20 Service Position. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109 D20 Operating Position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 D20 Service Access Cover. . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 D20 Front Bezel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112 D20 Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 D20 Blowers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115 D20 Cooling Blower Cable. . . . . . . . . . . . . . . . . . . . . . . . . . . . 118 D20 RIO Bus Adapter Assembly . . . . . . . . . . . . . . . . . . . . . . . . . 120 D20 PCI Adapters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123 D20 Hot-Plug Disk Drives . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 D20 Disk Drive Cage. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142 D20 Disk Drive Backplane. . . . . . . . . . . . . . . . . . . . . . . . . . . . 144 D20 Power Supplies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146 D20 Power Supply Bulkhead. . . . . . . . . . . . . . . . . . . . . . . . . . . 148 D20 I/O Backplane Assembly . . . . . . . . . . . . . . . . . . . . . . . . . . 151 Chapter 4. Parts Information . . . . . . . . . . . . . . . . . . . . . . . . . . . 157 D10 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157 D10 I/O Subsystems Parts . . . . . . . . . . . . . . . . . . . . . . . . . . . 157 D10 I/O Subsystem Parts (continued) . . . . . . . . . . . . . . . . . . . . . . . 159 D10 SPCN, RIO, and Rack Beacon Cables (These parts are not shown.) . . . . . . . . . . 161 D10 Rack Mounting Enclosure (These parts are not shown.) . . . . . . . . . . . . . . . 161 D20 I/O Subsystem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162 D20 I/O Backplane and Cabling. . . . . . . . . . . . . . . . . . . . . . . . . . 162 D20 PCI Slots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163 D20 Power, RIO Adapter, Cabling . . . . . . . . . . . . . . . . . . . . . . . . . 164 D20 Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165 D20 Fans and Disk Drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . 166 D20 Covers and Brackets . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167 iv D10 and D20 Service Guide D20 RIO, SPCN, and Rack Beacon Cables . . . . . . . . . . . . . . . . . . . . . 167 Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168 Chapter 5. Fault and Attention LEDs . . . . . . . . . . . . . . . . . . . . . . . . 169 Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169 Resetting the LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171 Appendix A. Environmental Notices . . . . . . . . . . . . . . . . . . . . . . . . 173 Product Recycling and Disposal. . . . . . . . . . . . . . . . . . . . . . . . . . . 173 Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174 D10 Declared Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . 174 D20 Declared Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . 174 Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175 Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177 Contents v vi D10 and D20 Service Guide Safety Notices A danger notice indicates the presence of a hazard that has the potential of causing death or serious personal injury. Danger notices appear on the following pages: v viii v 41 v 146 A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor personal injury. Caution notices appear on the following pages: v viii v viii v 41 v 41 v ix Note: For a translation of these notices, see System Unit Safety Information, order number SA23-2652. Rack Safety Instructions v Do not install this unit in a rack where the internal rack ambient temperatures will exceed 35 degrees C. v Do not install this unit in a rack where the air flow is compromised. Any side, front or back of the unit used for air flow through the unit must not be in direct contact with the rack. v Care should be taken to ensure that a hazardous condition is not created due to uneven mechanical loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before installing or removing this unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to the rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement for the supply circuit. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. vii Electrical Safety Observe the following safety instructions any time you are connecting or disconnecting devices attached to the workstation. In the system you are about to setup or service: v The ac power interface connector is considered the main power disconnect device. v This system has redundant power supply capabilities, meaning that it has the ability to have two power supplies running simultaneously in the same system unit. When instructed to disconnect the power source, ensure that all power cables have been unplugged. DANGER An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Before installing or removing signal cables, ensure that the power cables for the system unit and all attached devices are unplugged. When adding or removing any additional devices to or from the system, ensure that the power cables for those devices are unplugged before the signal cables are connected. If possible, disconnect all power cables from the existing system before you add a device. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines. D05 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock. C01 CAUTION: This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect two power supply cords before servicing. C21 viii D10 and D20 Service Guide Laser Safety Information CAUTION: This product may contain a CD-ROM, DVD-ROM, or laser module on a PCI card, which are class 1 laser products. C30 Laser Compliance All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with the IEC 825 (first edition 1984) as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: All laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation, user maintenance, or prescribed service conditions. Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. Only trained service personnel should perform the inspection or repair of optical fiber cable assemblies and receptacles. C25, C26 Safety Notices ix x D10 and D20 Service Guide Data Integrity and Verification These computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check our support websites for updated information and fixes applicable to the system and related software. xi xii D10 and D20 Service Guide About This Book This book provides maintenance information that is specific to the D10 and D20 I/O subsystems, as well as adapters and attached devices that do not have their own service information. In this book, the I/O subsystems are referred to as the D10 I/O subsystem or the D20 I/O subsystem. MAPs that are common to all systems are contained in the Diagnostic Information for Multiple Bus Systems. This book is used by the service representative to repair system failures. This book assumes that the service representative has had training on the processor subsystem to which an I/O subsystem is attached, in addition to training on the I/O subsystem drawer. ISO 9000 ISO 9000 registered quality systems were used in the development and manufacturing of this product. Highlighting The following highlighting conventions are used in this book: Bold Identifies commands, subroutines, keywords, files, structures, directories, and other items whose names are predefined by the system. Also identifies graphical objects such as buttons, labels, and icons that the user selects. Italics Identifies parameters whose actual names or values are to be supplied by the user. Monospace Identifies examples of specific data values, examples of text similar to what you might see displayed, examples of portions of program code similar to what you might write as a programmer, messages from the system, or information you should actually type. xiii Related Publications The following publications provide additional information about your system: v The D10 I/O Drawer Installation Guide, order number 86 A1 32EG, contains information on how to install the D10 I/O subsystem. v The Installation Guide, order number 86 A1 39EG, contains information on how to install the D20 I/O subsystem. v The PL420T and PL420R User’s Guide, order number 86 A1 41EG, contains information to help users use the system, use the service aids, and solve minor problems. v The Diagnostic Information for Multiple Bus Systems, order number 86 A1 26HX, contains diagnostic information, service request numbers (SRNs), and failing function codes (FFCs). v The Adapters Information for Multiple Bus Systems, order number 86 A1 27HX, contains information about adapterss for your system. This manual is intended to supplement the service information found in the Diagnostic Information for Multiple Bus Systems. v The Site Preparation for Rack Systems Guide, order number 86 A1 30PX, contains information to help you plan your installation. v The System Unit Safety Information, order number 86 X1 11WD, contains translations of safety information used throughout this book. xiv D10 and D20 Service Guide Chapter 1. Reference Information This chapter provides reference information for the D10 and D20 I/O subsystems. D10 Overview The D10 is a 19-inch, rack-mountable I/O subsystem that is attached to a processor subsystem drawer to extend the system’s capacity for I/O adapters. The I/O drawer includes redundant concurrently maintainable power and cooling. Five PCI-X slots and one PCI slot are available for PCI adapters. PCI adapters are installed through the rear of the I/O subsystem using PCI adapter cassettes. Because the PCI slots support hot-pluggable adapters, the I/O subsystem can have adapters installed without turning off power or removing covers. The D10 is 4 EIA units high. Two D10s can be installed side-by-side in a 19-inch rack enclosure. D10 I/O Subsystem Features The following figure shows the front view of I/O subsystem. 2 3 1 5 4 1 Cooling Fan 4 Power Supply 1 2 Power cord channel 5 Power cord receptacles 3 Power Supply 2 1 The following figure shows the rear view of the I/O subsystem. 1 2 3 4 5 6 1 2 3 4 5 6 8 7 9 11 10 12 1 PCI adapter slot 1 7 RIO bus adapter card Upper connector 0 Lower connector 1 2 PCI-X adapter slot 2 8 SPCN connector card 3 PCI-X adapter slot 3 9 Lower SPCN connector J16 4 PCI-X adapter slot 4 10 Upper SPCN connector J15 5 PCI-X adapter slot 5 11 Rack beacon connector 6 PCI-X adapter slot 6 12 Power cord channel D10 Operator Indicators LEDs are used on the D10 as operator indicators. Tables found in “I/O Subsystem PCI-X Slot LED Definitions” and “D10 Power” on page 4 describes the LED states and their definitions. For more information on LEDs and their definitions, see Chapter 5, “Fault and Attention LEDs”, on page 169. LEDs can be found on or near the following FRUs: v LED indicators visible on each PCI adapter cassette v LED indicators on the I/O subsystem backplane v Attention/Identify LEDs for power supplies and fans D10 PCI-X Slots The D10 has five PCI-X slots and one PCI slot. Adapters are installed and removed using a PCI adapter cassette, which allows adapters to be installed without turning off the power or opening the I/O subsystem covers. PCI adapters are installed using an adapter cassette, each adapter cassette shows two LEDs for each adapter. There is a green power indicator LED (upper) and an amber fault/identify LED (lower). The slots are numbered on the rear of the chassis from left to right 1 through 6. Slot 1 is a 5V PCI slot. Slots 2 through 6 are 3.3V PCI-X. I/O Subsystem PCI-X Slot LED Definitions The green and amber LEDs (viewed on the rear of the PCI adapter cassette when installed) are used to indicate the state of the PCI slot during removal and replacement of an adapter. The following table 2 D10 and D20 Service Guide describes the LED states and their definitions. 3 2 1 1 PCI Adapter Cassette 2 Power LED (Green ) 3 Fault/Identify LED (Amber) Green PCI Slot LED Indication Amber PCI Slot LED PCI Slot Power/Status Definition On Off On/No action The PCI slot power is on and the slot is operational. Do not remove or install adapters. On Flashing On/Identify The PCI slot power is on, and the slot has been identified by the software or firmware. Do not remove or install adapters. Off Flashing Off/Action The PCI slot power is off, and the slot has been identified by the software or firmware. If directed by the PCI hot-plug procedures, perform PCI adapter removal, replacement, or installation. Off Off Off/No power The PCI slot power is off. If the system power is off, perform non-hot-plug PCI adapter removal, replacement, or installation procedures. Chapter 1. Reference Information 3 D10 Power The following figure shows the front view of the pocessor subsystem and two I/O subsystems. Each unit has two power supplies and two power cords for redundancy. Connect the power cords from one side of the system to one power distribution bus in the rack. Connect the power cords from the other side of the system to a different power distribution bus in the rack. To ensure that the systems in a rack keep running if power is interrupted, a separate branch power line for each side of the rack must be used. 1 1 2 1 3 1 Power supplies, power receptacles 2 Two D10 I/O subsystems mounted side-by-side. 3 One 386/50 processor subsystem The D10 I/O subsystem has two power supplies. Either power supply is capable of providing the necessary voltages and currents, independent of the other power supply. Each power supply provides 5V dc, 3.3V dc, -12V dc, and 5V dc standby. The power supplies are hot-pluggable and may be changed one at a time while the system is operational. 4 D10 and D20 Service Guide The following table describes power-supply LED indicators that are available on the D10. LED Status LED Definition Off Power source not connected Blinking green LED System power source connected, but power is not turned on Blinking green LED, visibly begins to blink faster after the power button has been pressed. System power source connected, the power-on button has been pressed and power-on initiated Solid (not blinking) green LED, (There is approximately a 30-second transition period from the time the power-on button is pressed to the time the power LED is on solid.) System power source connected and turned on The following figure shows the rear view of the processor subsystem and two I/O subsystems. 1 D10 I/O subsystem 3 Processor subsystem power supplies, power receptacles 2 386/50 processor subsystem 4 I/O backplane power (green) and fault/identify (amber) LEDs D10 Cooling Fans mounted inside each I/O subsystem power supply and an additional fan mounted on the front of the subsystem provide cooling. The power supplies and the fan can be removed and replaced with the power turned on, as long as only one fan is removed from the I/O subsystem at a time. D10 Input/Output Ports The connector ports on the rear of the D10 I/O subsystem are used to connect the RIO-2 cables, the SPCN cables, and the rack-beacon LED. Chapter 1. Reference Information 5 Subsystem Positioning and Cabling I/O subsystems can be installed in a standard 19-inch EIA rack in any location. The cables that connect the subsystems allow some flexibility in drawer placement, but the I/O subsystems should be located above the processor subsystem to which they are connected in the same rack. Up to eight I/O subsystems can be connected to a processor subsystem. Use the following guidelines: v Each I/O subsystem connects to the processor subsystem using one system power control network (SPCN) cable loop. v Each I/O subsystem connects to the processor subsystem using one or more remote I/O (RIO) cable loops. v Lowest cost configurations allow up to four I/O subsystems connected to the processor subsystem using a single RIO loop. v Optimum performance is achieved when each I/O drawer is connected to the processor subsystem through its own RIO loop. SPCN Cabling I/O subsystem SPCN cables are connected to the processor subsystem unit in a loop configuration. The SPCN cables are connected in a loop so that the system has two paths to each I/O subsystem. Up to eight I/O subsystems are supported on one SPCN loop. One I/O Subsystem, Two I/O Subsystems: The illustrations on the left show one I/O subsystem connected to the processor subsystem. The illustrations on the right show two I/O subsystems connected to the processor subsystem using one SPCN loop. Processor Subsystem Connected to One or Two D10 I/O Subsystems: 1 Processor Subsystem 2 I/O Subsystem 6 D10 and D20 Service Guide Four I/O Subsystems, Eight I/O Subsystems: The illustration on the left shows four I/O subsystems connected to the processor subsystem using one SPCN loop. The illustration on the right shows eight I/O subsystems connected to the processor subsystem using one SPCN loop. Processor Subsystem Connected to Four or Eight D10 I/O Subsystems: 1 Processor Subsystem 2 I/O Subsystem RIO Cabling I/O subsystems are connected to the processor subsystem through remote I/O (RIO) cable loops. The cable loops are connected to ports that are on the rear of the processor subsystem. The RIO cables are connected in loops so that the system has two paths to each I/O subsystem when more than one I/O drawer is in a loop. If only one I/O subsystem is in a RIO loop and both the processor subsystem and the I/O subsystem are RIO-2 capable, the RIO-2 cables are used as one two-cable path to the I/O subsystem. In this cable configuration, the data rate is increased by a factor of 2. v A maximum of eight D10 drawers are supported on one processor subsystem. v If both D10 I/O drawers are connected to the same processor subsystem, the total number of I/O subsystems supported is eight. v The maximum number of I/O subsystems supported on one RIO loop is four. v The D10 I/O drawers must be on separate RIO loops. v The D10 with RIO cabling and D10 with RIO-2 cabling can share one RIO loop. The system can have up to four RIO loops. A total of eight I/O drawers can be connected to the processor subsystem in a variety of cabling configurations. For optimum performance, connect the RIO loops in a configuration that distributes multiple I/O drawers among as many RIO loops as possible. A dedicated RIO loop for each I/O drawer can provide optimum performance. Some examples of valid cabling examples follow. Chapter 1. Reference Information 7 One I/O Subsystem or Two I/O Subsystems: The illustration on the left shows one I/O subsystem drawer connected to the processor subsystem. The illustration on the right shows two I/O subsystem drawers connected to the system unit using one RIO loop. Processor Subsystem Connected to One or Two D10 I/O Subsystems: A0 A0 A1 A1 0 1 1 0 0 1 1 2 1 Processor Subsystem 2 I/O Subsystem Three I/O Subsystems or Four I/O Subsystems: The illustration on the left shows three I/O subsystem drawers connected to the system unit using one RIO loop. The illustration on the right shows four I/O subsystem drawers connected to the system unit using one RIO loop. Processor Subsystem Connected to Three or Four D10 I/O Subsystems: 1 Processor Subsystem 2 I/O Subsystem Eight I/O Subsystems with Two RIO Loops: The following illustrations show eight I/O subsystem drawers connected to the processor subsystem unit using two RIO loops. 8 D10 and D20 Service Guide Processor Subsystem Connected to Eight D10 I/O Subsystems: 1 Processor Subsystem 2 I/O Subsystem Chapter 1. Reference Information 9 D10 I/O Subsystem Locations This system uses physical location codes to provide mapping of the failing field replaceable units. The location codes are produced by the processor subsystem’s firmware and AIX. For information about how to read a location code, see the service guide for the processor subsystem to which your I/O subsystem is connected. D10 I/O Subsystem Front Locations 1 2 4 3 1 Cooling fan 3 Power Supply 2 U0.dd-V2 2 Power cord channel 4 Power Supply 1 U0.dd-V1 10 D10 and D20 Service Guide D10 I/O Subsystem Rear Locations 1 2 3 4 5 6 1 2 3 4 5 6 8 7 9 11 10 12 1 PCI adapter slot 1 U0.dd-P1-I1 7 RIO bus adapter card U0.dd-P1.1 2 PCI-X adapter slot 2 U0.dd-P1-I2 8 SPCN connector card U0.dd-P1 (part of the I/O backplane FRU) 3 PCI-X adapter slot 3 U0.dd-P1-I3 9 SPCN Connector 2/J16 U0.ddP1/Q5 4 PCI-X adapter slot 4 U0.dd-P1-I4 10 SPCN Connector 1/J15 U0.ddP1/Q4 5 PCI-X adapter slot 5 U0.dd-P1-I5 11 Rack beacon connector 6 PCI-X adapter slot 6 U0.dd-P1-I6 12 Power cord channel Note: In the preceding table, dd is equal to the number assigned to each I/O subsystem by the connected processor subsystem. Chapter 1. Reference Information 11 D10 I/O Backplane Locations The following illustration of the I/O backplane identifies the primary connectors used in your subsystem. 6 5 4 3 2 1 7 8 9 10 11 12 1 PCI adapter connector 1 7 SPCN connector 2 PCI-X adapter connector 2 8 VPD module 3 PCI-X adapter connector 3 9 VPD module pin 1 orientation 4 PCI-X adapter connector 4 10 RIO bus adapter connector 5 PCI-X adapter connector 5 11 Fan connector 6 PCI-X Adapter connector 6 12 Power supply connectors 12 D10 and D20 Service Guide D10 I/O Subsystem PHB Each D10 I/O Subsystem has two PCI host bridges (PHBs) through which the PCI slots are connected. 1 2 3 4 5 6 Slot PHB Slot Characteristics 1 1 64-bit 5V, 33 MHz 2 1 64-bit 3.3V, 131.5 MHz 3 1 64-bit 3.3V, 131.5 MHz 4 2 64-bit 3.3V, 131.5 MHz 5 2 64-bit 3.3V, 131.5 MHz 6 2 64-bit 3.3V, 131.5 MHz Chapter 1. Reference Information 13 D10 Location Codes This system (processor subsystem and attached I/O subsystems) use physical location codes in conjunction with AIX location codes to provide mapping of failing field replaceable units (FRUs). The location codes are produced by the processor subsystem’s firmware and AIX. For information about how to read a location code, see the service guide for the processor subsystem to which your I/O subsystem is connected. AIX and Physical Location Code Table This section covers the AIX and Physical Location Code tables for the D10. The tables in this section contain the location codes for I/O subsystems when they are attached to the system unit. In the tables, the location code for the I/O subsystem number is represented by dd. The first time that an installed system is powered on, the I/O subsystems are numbered. For example, if a system is first powered on with eight I/O subsystems connected, the dd value for the subsystems should be numbered from 2 through 9. Note: If the system was powered on with I/O subsystems connected before delivery to the customer, the I/O subsystem location codes for the connected I/O subsystems are permanently set. If at a later time, an I/O subsystem is removed from the system and a different I/O subsystem is substituted, the substitute is assigned the next higher number available for its value of dd, which in the earlier example, is 10. If the original I/O subsystem is reinstalled in the system configuration, the system uses the original dd value for the I/O subsystem. To keep the system from renumbering the I/O drawer when an I/O backplane is replaced, the VPD module from the old I/O backplane must be moved over to the new I/O backplane. The following diagram defines each part of a location code. Ux.dd-xx-yy | | | | | | | | | | | yy This code is used for the next component (yy, yy, yy, ...) | | xx This code is used for the next component (Px, Fx, Vx, ...) | dd This code is the location code for the I/O drawer (2 to 63) Ux This code normally identifies the rack in which a drawer is installed. The x will always be zero (0) for this system. 14 D10 and D20 Service Guide D10 I/O Subsystem Location Code Table The following table lists the location codes for a D10 I/O subsystem. The location code for the I/O subsystem connected to a system unit is U0.dd (where dd can be any number from 2 through 63). FRU Name Physical Location Code D10 I/O Drawer U0.dd (dd = any number from 2 through 63) I/O subsystem 1 drawer U0.dd I/O backplane U0.dd-P1 RIO connector riser card U0.dd-P1.1 RIO port 0 (upper connector) U0.dd-P1.1/Q1 RIO port 0 cable U0.dd-P1.1/Q1# RIO port 1 (lower connector) U0.dd-P1.1/Q2 RIO port 1 cable U0.dd-P1.1/Q2# PHB1 U0.dd-P1 EADS_X under PHB1 U0.dd-P1 PCI slot 1 U0.dd-P1/I1 PCI slot 1 adapter U0.dd-P1-I1 PCI slot 2 U0.dd-P1/I2 PCI slot 2 adapter U0.dd-P1-I2 PCI slot 3 U0.dd-P1/I3 PCI slot 3 adapter U0.dd-P1-I3 PHB2 U0.dd-P1 EADS_X under PHB2 U0.dd-P1 PCI slot 4 U0.dd-P1/I4 PCI slot 4 adapter U0.dd-P1-I4 PCI slot 5 U0.dd-P1/I5 PCI slot 5 adapter U0.dd-P1-I5 PCI slot 6 U0.dd-P1/I6 PCI slot 6 adapter U0.dd-P1-I6 Power supply 1 (with 2 fans) U0.dd-V1 Power supply 2 (with 2 fans) U0.dd-V2 Cooling fan 5 U0.dd-F5 Rack indicator connector (4-pins) U0.dd-P1/Q3 SPCN connector 1/J15 U0.dd-P1/Q4 SPCN connector 2/J16 U0.dd-P1/Q5 Subsystem VPD U0.dd-P1 Chapter 1. Reference Information 15 D20 Overview The D20 is a 19-inch, rack-mountable I/O subsystem that is attached to a processor subsystem to extend the system’s capacity for I/O adapters and disk drives. The I/O subsystem includes redundant concurrently maintainable power and cooling. Seven PCI slots are available for PCI adapters. Because the PCI slots support hot-pluggable adapters, the I/O subsystem can have adapters installed without turning off power. The D20 is 4 EIA units high and is compatible with 19-inch EIA rack enclosures. D20 I/O Subsystem Features The features of the D20 enable this I/O subsystem to extend the input and output capabilities of the processor subsystem. Features such as PCI-X adapters and internal SCSI disk drives are supported. D20 Operator Indicators LEDs are used on the D20 as operator indicators. Tables found in “D20 PCI-X Slots” and “D20 Power” on page 18 describes the LED states and their definitions. For more information on LEDs and their definitions, see Chapter 5, “Fault and Attention LEDs”, on page 169. LEDs can be found on or near the following FRUs: v LED indicators visible on each PCI adapter light pipe v LEDs for power on, attention, SCSI activity v Attention/Identify LEDs v Rack Beacon D20 PCI-X Slots Seven PCI-X slots are available. The slots are 64-bit capable at up to 131.5 Mhz, 3.3 volts. The slots are numbered on the rear of the chassis from left to right 1 through 7. The I/O backplane and each I/O slot have green power indicator LEDs and amber identify LEDs. The following illustration shows the LED locations when viewing from the rear of the I/O drawer. 1 PCI-X Expansion Slot 1 6 PCI-X Expansion Slot 6 2 PCI-X Expansion Slot 2 7 PCI-X Expansion Slot 7 3 PCI-X Expansion Slot 3 A Green Power LEDs 4 PCI-X Expansion Slot 4 B Amber Identify LEDs 5 PCI-X Expansion Slot 5 16 D10 and D20 Service Guide The green LEDs for the PCI-X adapter slots (viewed on the rear of the I/O subsystem) indicate the state of the PCI slot during removal and replacement of an adapter. PCI LED (Green) Indication PCI Slot Status Definition Off Off Slot power is Off. It is safe to remove or replace adapters. On (not flashing) On Slot power is On. Do not remove or replace adapters. Flashing slowly (one flash per second) Slot has been identified by the software. Do not remove or replace adapters at this time. Flashing rapidly (six to eight flashes per second) Slot is ready removing or replacing an adapter. D20 Disk Drives There are 12 hot-plug disk-drive bays provided. The bays are located behind the D20 front bezel. The D20 has from 18.2 GB to 1.7 terabytes of disk storage capacity. The following disk drive sizes and speeds are available: v 18.2 GB Ultra3 10K RPM 1 inch v 36.4 GB Ultra3 10K RPM 1 inch v 73.4 GB Ultra3 10K RPM 1 inch v 146.8 GB Ultra3 10K RPM 1 inch For an updated listing of hot-plug disk-drive sizes, contact your sales representative. Chapter 1. Reference Information 17 D20 Power The D20 is standard with one power supply. If the customer wants redundant power in their I/O subsystem, an optional power supply can be ordered. Either power supply, in a D20 that is configured with two power supplies, can provide the necessary voltages and currents, independent of the other power supply. The left and right power-supply output voltages are connected and monitored by the power distribution board contained in the I/O subsystem. The left and right power supplies are hot-pluggable and may be changed one at a time while the system is operational. Each power supply is capable of converting available 110/220 V ac to the proper internal voltages used by the system components. The internal voltage range is 5 volts system standby, 3.3 volts, 5 volts, 12 volts and -12 volts. When operating correctly, each supply will share the current load when powered on. When in a fault state, the operating power supply will supply standby current to the faulted supply for the DEVROS module. The following table describes power-related LED indicators that are available on the D20. The operator panel LED referred to in the following table is on the front of the processor subsystem to which the I/O subsystem is attached. Status of LED Operator Panel LED (processor subsystem) Off Power Source not connected Blinking green LED System power source connected, but power is not turned on Blinking green LED, visibly begins to blink faster after the power-button has been pressed. System power source connected, the power-on button has been pressed and power-on initiated Solid (not blinking) green LED, (There is approximately a 30-second transition period from the time the power on button is pressed to the time the power LED is on solid.) System power source connected and turned on D20 Cooling Four blowers are mounted on the top of the I/O subsystem to provide cooling. Attention: The four cooling blowers mounted on top of the D20 are hot-plug. The one exception is when your subsystem has only one power supply either installed or functioning, in which case you must shut down the system and subsystem. Failure to do so will automatically shut down the system. D20 Input/Output Ports The connector ports on the rear of the D20 I/O subsystem are used to connect the RIO-2 cables, the SPCN cables, and the rack-beacon LED. 18 D10 and D20 Service Guide D20 I/O Subsystem Locations The system uses physical location codes to provide mapping of the failing field replaceable units. The location codes are produced by the processor subsystem’s firmware and AIX. For information about how to read a location code, see the service guide for the processor subsystem to which your I/O subsystem is connected. D20 Front View 1 Serial Number 3 D20 Front Bezel 2 Operator Panel 4 Disk Drives (maximum quantity 12 ) D20 Rear View 1 PCI-X Expansion Slot 1 9 Primary Power Supply 2 PCI-X Expansion Slot 2 10 D20 I/O Drawer 3 PCI-X Expansion Slot 3 11 Connector J11 (Not Used) 4 PCI-X Expansion Slot 4 12 Connector J14 (Not Used) 5 PCI-X Expansion Slot 5 13 Rear Serial Number Label 6 PCI-X Expansion Slot 6 14 Connector J15 (SPCN 3) 7 PCI-X Expansion Slot 7 15 Connector J16 (SPCN 4) 8 Redundant Power Supply Receptacle 16 RIO Connectors Upper Connector = 1 Lower Connector = 0 Chapter 1. Reference Information 19 D20 Power Supply Locations 1 2 3 4 6 5 1 Power Supplies 4 Green AC Power LED 2 Amber Fault/Identify LED 5 Power Supply Filler 3 Green DC Good LED 6 D20 I/O Drawer 20 D10 and D20 Service Guide D20 Blower Locations The following illustration identifies the I/O subsystem cooling blowers. Each blower has a green power LED and an amber identify LED located on the front blower housing. The green LED indicates that a blower has power and is operating correctly. The amber LED is in the identify state when blinking. 1 Green LED 4 Blower Connector 2 Amber LED 5 Hot-Plug Blower Assembly 3 Snap Button 6 Docking Connector Chapter 1. Reference Information 21 D20 I/O Backplane Locations The following illustration of the I/O backplane identifies the primary connectors used in your subsystem. Note: Before replacing the I/O backplane, note the position of pin 1 on the VPD module. 1 PCI-X Adapter Connector 1 11 Disk Drive Power Connector 2 PCI-X Adapter Connector 2 12 Cooling Blower Connector 3 PCI-X Adapter Connector 3 13 VPD Module 4 PCI-X Adapter Connector 4 14 Redundant Power Supply Connector 5 PCI-X Adapter Connector 5 15 Primary Power Supply Connector 6 PCI-X Adapter Connector 6 16 Connector J11 (Not Used) 7 PCI-X Adapter Connector 7 17 Connector J14 (Not Used) 8 Operator Panel Connector 18 Connector J15 (SPCN 3) 9 RIO Bus Adapter Connector 19 Connector J16 (SPCN 4) 10 Disk Drive System Power Control Network (SPCN) Connector 20 4-pin Connector, Rack Beacon Connector 22 D10 and D20 Service Guide D20 Reading the Operator Panel and I/O Backplane LEDs The following illustration shows the component location on the operator panel 1 Power-On LED 2 Attention LED 3 Operator Panel 4 Snap Buttons The following table defines each LEDs function on the operator panel. Number Component Name Component Description 1 Green Power LED Standby State - Blinking LED (System connected to power source but not powered on.) Powered On - LED is turned on solid (not blinking). 2 Amber Attention LED Normal State - LED is off. Chapter 1. Reference Information 23 D20 SCSI IDs and Bay Locations Index Bay Location Drive Name SCSI Bus ID 8 DB1 D01 Hot-Plug Disk Drive 8 - DB1 9 DB1 D02 Hot-Plug Disk Drive 9 - DB1 A DB1 D03 Hot-Plug Disk Drive A - DB1 B DB1 D04 Hot-Plug Disk Drive B - DB1 C DB1 D05 Hot-Plug Disk Drive C - DB1 D DB1 D06 Hot-Plug Disk Drive D - DB1 8 DB2 D07 Hot-Plug Disk Drive 8 - DB2 9 DB2 D08 Hot-Plug Disk Drive 9 - DB2 A DB2 D09 Hot-Plug Disk Drive A - DB2 B DB2 D10 Hot-Plug Disk Drive B - DB2 C DB2 D11 Hot-Plug Disk Drive C - DB2 D DB2 D12 Hot-Plug Disk Drive D - DB2 Note: The SCSI bus IDs are the recommended values and indicate how the IDs are set when the system is shipped from the factory. Field installations might not comply with these recommendations. 24 D10 and D20 Service Guide Cabling the D20 to a 286/C4 The D20 subsystem can be installed in any location in a standard 19-inch EIA rack. The cables that connect the subsystems allow some flexibility in drawer placement. Up to two D20s can be connected to a 286/C4. Each I/O subsystem is connected to the processor subsystem using a system power control network (SPCN) cable loop and a remote I/O (RIO) cable loop. One SPCN cable loop is needed to connect the I/O subsystems to one processor subsystem. Before connecting the RIO-2 and SPCN cables to the D20, do the following: 1. Ensure that your 286/C4 is running the latest level firmware. Refer to the ²Firmware Updates² section of the ²Using the Service Processor² chapter in the 286/C4 and 286/E4 Service Guide, order number SA23-1277. For information about checking or upgrading the firmware level of your 286/C4. 2. After ensuring that your firmware is at the latest level, turn off the power and disconnect the 286/C4 power cables from the power source. 3. Attach the RIO-2 and SPCN cables. For cabling diagrams, see “RIO-2 (Remote Input Output) Cabling” on page 26. 4. After connecting the RIO-2 and SPCN cables from the D20 to your system, reconnect the system’s power cables to the power source and refer to “Stopping and Starting the System” on page 42. 5. Run system verification. Refer to the ²Verifying the Hardware² chapter in the 286/C4 and 286/E4 Installation Guide, order number SA23-1277. Your 286/C4 is expandable when connected up to two D20 I/O drawers. Each D20 can contain up to 7 I/O PCI adapter slots, and two 6-pack disk drive (DASD) bays. Chapter 1. Reference Information 25 RIO-2 (Remote Input Output) Cabling The following rules apply to RIO-2 cable connections: Notes: 1. To connect to a D20, you will be using the RIO-2 (Remote Input Output) and SPCN (System Power Control Network) connectors and cables. The RIO-2 and SPCN cables provide two functions, remote data bus connection, and power control. 2. The I/O drawers must be connected in a loop for both the RIO-2 and SPCN cables. The loop connection provides redundant paths so that if a failure occurs in part of a cable, the system will continue to operate. If a failure does occur, a message is displayed on the system console; however, the system does continue to operate. See the following illustrations for possible cabling configurations. To connect the RIO-2 cables to the rear of the 286/C4, do the following: 1. Locate the RIO-2 cable-support bracket. 2. Using two M3 x 6 retaining screws, secure the RIO-2 cable-support bracket to the rear of the 286/C4 chassis. See the following illustration. 1 286/C4 2 RIO-2 Cable-Support Bracket 3 M3 x 6 Retaining Screws (Quantity 2) 3. Pull the sliding portion of the RIO-2 cable-support bracket back from the 286/C4 chassis. The sliding portion of the cable support bracket will stop before being fully separated. 4. Align the RIO-2 cable connector with the bottom of the sliding portion of the RIO-2 cable support bracket. 26 D10 and D20 Service Guide 5. Lift the RIO-2 cable and cable connector into the bottom of the RIO-2 cable-support bracket. 1 286/C4 2 RIO-2 Cable-Support Bracket 3 Slide Portion of RIO-2 Cable-Support Bracket 4 RIO-2 Connector Support Lever 6. Pull the RIO-2 connector back until the connector is seated against the support hook. 7. Push the RIO-2 connector forward, ensuring that the RIO-2 connector lever slides over the top of the connector hooks. 8. Slowly push the assembly toward the chassis. 9. Secure the RIO-2 cable-support bracket with the captive thumbscrew, the RIO-2 connector and slide portion of the RIO-2 support bracket to the back of the 286/C4 chassis. 1 Captive Thumbscrews 2 RIO-2 Cable and Connector 3 RIO-2 Connector Support Lever Chapter 1. Reference Information 27 286/C4 Attached to One I/O Drawer: 1 286/C4 2 D20 3 Cable from 286/C4 Connector RIO-2 0 to D20 Connector RIO-2 0 4 Cable from 286/C4 Connector RIO-2 1 to D20 Connector RIO-2 1 286/C4 Attached to Two I/O Drawers: 1 286/C4 2 D20 I/O Drawer 3 D20 I/O Drawer 4 Cable from 286/C4 Connector RIO-2 0 to D20 Connector RIO-2 0 5 Cable from 286/C4 Connector RIO-2 1 to D20 Connector RIO-2 1 6 Cable from D20 Connector RIO-2 1 to D20 Connector RIO-2 0 28 D10 and D20 Service Guide SPCN (System Power Control Network) Cabling The following figures provide cabling examples for all valid cabling configurations. Match your configuration to the correct figure and connect your SPCN cables as shown. 286/C4 Attached to One I/O Drawer: 1 286/C4 2 D20 3 Cable from 286/C4 Connector SPCN 2 to D20 Connector SPCN 4 (J16) 4 Cable from 286/C4 Connector SPCN 1 to D20 Connector SPCN 3 (J15) 286/C4 Attached to Two I/O Drawers: 1 286/C4 2 D20 I/O Drawer 3 D20 I/O Drawer 4 Cable from 286/C4 Connector SPCN 1 to D20 Connector SPCN 3 (J15) 5 Cable from 286/C4 Connector SPCN 2 to D20 Connector SPCN 4 (J16) 6 Cable from D20 Connector SPCN 4 (J16) to D20 Connector SPCN 3 (J15) Chapter 1. Reference Information 29 D20 Internal Cabling 1 Cooling Blowers (Quantity 4) 6 Disk Drive Power Connector 2 Operator Panel 7 Disk Drive SPCN Connector 3 Disk Drive Backplanes 8 Disk Drive Power/Signal Cable 53P0416 4 Cooling Blower Cable 53P0419 9 Operator Panel Connector 5 Cooling Blower Connector 10 Operator Panel Cable 53P0414 30 D10 and D20 Service Guide D20 Location Codes This system (processor subsystem and attached I/O subsystems) uses physical location codes in conjunction with AIX location codes to provide mapping of a failing field replaceable units (FRUs). The location codes are produced by the processor subsystem’s firmware and AIX. For information about how to read a location code, see the service guide for the processor subsystem to which your I/O subsystem is connected. AIX and Physical Location Code Table This section covers the AIX and Physical Location Code tables for the D20 I/O subsystem. The tables in this section contain the location codes for I/O subsystems when they are attached to the system unit. In the tables, the location code for the I/O subsystem number is represented by dd. The first time that an installed system is powered on, the I/O subsystems are numbered. For example, if a system is first powered on with two I/O subsystems connected, then the dd value for the connected I/O subsystems should be numbered from 2 and 3. Note: If the system was powered on with I/O subsystems connected before delivery to the customer, the I/O subsystem location codes for the drawers connected at that time are already assigned. If at a later time, an I/O subsystem is removed from the system and a different I/O subsystem is substituted, the substitute is assigned the next higher number available for its value of dd, which, in the earlier example, is 4. If the original I/O subsystem is reinstalled in the system configuration, the system uses the original dd value for the I/O subsystem. To keep the system from renumbering the I/O drawer when an I/O backplane is replaced, the VPD module from the old I/O backplane must be moved over to the new I/O backplane. The following diagram defines each part of a location code. Ux.dd-xx-yy | | | | | | | | | | | yy This code is used for the next component (yy, yy, yy, ...) | | xx This code is used for the next component (Px, Fx, Vx, ...) | dd This code is the location code for the I/O drawer (2 to 63) Ux This code normally identifies the rack in which a drawer is installed. The x will always be zero (0) for this system. Chapter 1. Reference Information 31 D20 Location Code Table The following table lists the location codes for a D20 I/O subsystem. The location code for the I/O subsystem connected to a system unit is U0.dd (where dd can be any number from 2 through 63). FRU Name Physical Location Code D20 I/O Drawer U0.dd (dd = any number from 2 through 63) Rack frame U0 I/O drawer backplane U0.dd-P1 I/O subsystem drawer U0.dd RIO Bus Adapter RIO bus adapter U0.dd-P1.1 RIO port 1 upper connector U0.dd-P1.1/Q1 RIO port 1 cable U0.dd-P1.1/Q1# RIO port 0 lower connector U0.dd-P1.1/Q2 RIO port 0 cable U0.dd-P1.1/Q2# PCI Bus Controller PCI bus controller PHB0 U0.dd-P1 PCI to PCI bridge EADS-X U0.dd-P1 PCI Bus Controller PHB2 PCI bus controller PHB2 U0.dd-P1 PCI to PCI bridge EADS-X U0.dd-P1 Pluggable Adapter Slots PCI slot 1 U0.dd-P1/I1 PCI card in slot 1 U0.dd-P1-I1 PCI slot 2 U0.dd-P1/I2 PCI card in slot 2 U0.dd-P1-I2 PCI slot 3 U0.dd-P1/I3 PCI card in slot 3 U0.dd-P1-I3 PCI slot 4 U0.dd-P1/I4 PCI card in slot 4 U0.dd-P1-I4 PCI slot 5 U0.dd-P1/I5 PCI card in slot 5 U0.dd-P1/I5 PCI slot 6 U0.dd-P1/I6 PCI card in slot 6 U0.dd-P1/I6 PCI slot 7 U0.dd-P1/I7 PCI card in slot 7 U0.dd-P1/I7 Power Supply Power supply 1 (no internal blower) U0.dd-V1 Power supply 2 (no internal blower) U0.dd-V2 Cooling Blowers Cooling blower 1 U0.dd-F1 Cooling blower 2 U0.dd-F2 Cooling blower 3 U0.dd-F3 32 D10 and D20 Service Guide FRU Name Physical Location Code Cooling blower 4 U0.dd-F4 SPCN Connectors SPCN connector 1 U0.dd-P1/Q3 SPCN connector 2 U0.dd-P1/Q4 SPCN connector 3 U0.dd-P1/Q5 SPCN connector 4 U0.dd-P1/Q6 Rack Indicator Rack beacon (4–pin) U0.dd-P1/Q7 Operator Panel Operator panel U0.dd-L1 Temperature Sensor Temperature sensor (on operator panel) U0.dd-L1 Disk Drives Backplanes Disk drives backplane 1 U0.dd-P3 Disk drives backplane 2 U0.dd-P4 Disk Drive Locations Hot-swap disk drive bay 1 U0.dd-P1-Ix/Zn-A8 Hot-swap disk drive bay 2 U0.dd-P1-Ix/Zn-A9 Hot-swap disk drive bay 3 U0.dd-P1-Ix/Zn-AA Hot-swap disk drive bay 4 U0.dd-P1-Ix/Zn-AB Hot-swap disk drive bay 5 U0.dd-P1-Ix/Zn-AC Hot-swap disk drive bay 6 U0.dd-P1-Ix/Zn-AD Hot-swap disk drive bay 7 U0.dd-P1-Iy/Zn-A8 Hot-swap disk drive bay 8 U0.dd-P1-Iy/Zn-A9 Hot-swap disk drive bay 9 U0.dd-P1-Iy/Zn-AA Hot-swap disk drive bay 10 U0.dd-P1-Iy/Zn-AB Hot-swap disk drive bay 11 U0.dd-P1-Iy/Zn-AC Hot-swap disk drive bay 12 U0.dd-P1-Iy/Zn-AD SCSI SES0 U0.dd-P1-Ix/Zn-AF SES1 U0.dd-P1-Iy/Zn-AF Chapter 1. Reference Information 33 Powering the System On and Off Power for the I/O subsystems is controlled by the attached processor subsystem and the System Power Control Network (SPCN). For information about procedures to power the system on and off, refer to the service guide of the processor subsystem to which the I/O subsystem is attached. Power-On Self-Test After power is turned on and before the operating system is loaded, the system does a power-on self-test (POST). The RIO loop that connects the I/O subsystems to the system unit is tested. Tests are also performed on the installed adapters and devices in the I/O subsystems. If an error occurs during the POST, an error code is displayed on the system operator panel LCD that indicates which part is failing and which subsystem contains the error. 34 D10 and D20 Service Guide D10 and D20 Specifications This section contains system specifications for both the D10 and D20 I/O subsystems. D10 Specifications Dimensions 7311-D10 Two 7311-D10s with Enclosure Height 170 mm (6.6 in) 178 mm (7.0 in) Width 220 mm (8.7 in) 445 mm (17.5 in) Depth 711 mm (28.0 in) 711 mm (28.0 in) Weight 16.8 kg (37 lbs) 39.1 kg (86 lbs) Electrical Power source loading for two 7311-D10 (max.) 0.21 kVA Voltage range 200 to 240 V ac, (dc not supported) Frequency 50 or 60 Hz Thermal output (typical) 461 Btu/hr Thermal output (max.) 683 Btu/hr Power requirements (typical) 135 watts Power requirements for two 7311-D10 (max.) 200 watts Power factor 0.91 Inrush currentæ 64 amps Maximum altitude3, 4 3048 m (10000 ft.) Temperature Requirementsç Operating 10 to 38°C 50 to 100°F) Non-Operating 1 to 60°C (34 to 140°F) Storage 1 to 60°C (34 to 140°F) Humidity Requirements4 Operating Non-Operating Storage (Noncondensing) 8 to 80% 8 to 80% 8 to 80% Wet Bulb 23°C (73°F) 27°C (81°F) 29°C (84°F) Noise Emissions1, 4 Operating Idle LWAd, one 7311-D10 LWAd, two 7311-D10 LWAd, four 7311-D10 5.6 bels 5.9 bels 6.2 bels 5.6 bels 5.9 bels 6.2 bels m, one 7311-D10 m, two 7311-D10 m, four 7311-D10 40 dBA 43 dBA 46 dBA 40 dBA 43 dBA 46 dBA Install/Air Flow: Maintaining service clearance allows proper air flow. Service Clearances: See 14T/0 and 14T/4 Installation and Service Guide, order number SA23-2544 for T00 or T42 rack service clearances. 1. See page 174 for definitions of noise emissions positions. See noise emissions note 4. 2. Inrush currents occur only at initial application of power, no inrush occurs during normal power off-on cycle. 3. The upper limit of the dry bulb temperature must be derated 1°C per 137 m (450 ft.) above 915 m (3000 ft.). 4. The upper limit of the wet bulb temperature must be derated 1°C per 274 m (900 ft. ) above 305 m (1000 ft.). Chapter 1. Reference Information 35 D20 Specifications Dimensions Height 178 mm (7.0 in) Width 445 mm (17.5 in) Depth 610 mm (24.0 in) Maximum Weight 45.9kg (101 lbs) Electrical Power source loading (max.) 0.358 kVA Voltage range 100 to 240 V ac, V dc not supported Frequency 50 or 60 Hz Thermal output (typical) 774 Btu/hr Thermal output (max.) 1161 Btu/hr Power requirements (typical) 227 watts Power requirements for two 7311-D20 (max.) 340 watts Power factor 0.91 Inrush currentæ 60 amps Maximum altitude3, 4 3048 m (10,000 ft.) Temperature Requirementsç Operating 5 to 35°C 41 to 95°F) Non-Operating 1 to 43°C (34 to 109°F) Storage 1 to 60°C (34 to 140°F) Humidity Requirements4 Operating Non-Operating Storage (Noncondensing) 8 to 80% 8 to 80% 5 to 80% Wet Bulb 23°C (73°F) 27°C (81°F) 29°C (84°F) Noise Emissions1, 5 Operating Idle LWAd 6.1 bels 6.0 bels m 44 dBA 43 dBA Install/Air Flow Maintaining service clearance allows proper air flow. Service Clearances: See 14T/0 and 14T/4 Installation and Service Guide, order number SA23-2544 for T00 or T42 rack service clearances. 1. See 174 for definitions of noise emissions positions. See noise emissions note 4. 2. Inrush currents occur only at initial application of power, no inrush occurs during normal power off-on cycle. 3. The upper limit of the dry bulb temperature must be derated 1 degree C per 137 m (450 ft.) above 915 m (3000 ft.). 4. The upper limit of the wet bulb temperature must be derated 1 degree C per 274 m (900 ft. ) above 305 m (1000 ft.). 36 D10 and D20 Service Guide External AC Power Cables To avoid electrical shock, a power cable with a grounded attachment plug is provided. Use only properly grounded outlets. Power cables used in the United States and Canada are listed by Underwriter’s Laboratories (UL) and certified by the Canadian Standards Association (CSA). These power cords consist of the following: v Electrical cables, Type SVT or SJT. v Attachment plugs complying with National Electrical Manufacturers Association (NEMA) 5-15P, that is: ²For 115 V operation (D20 only), use a UL listed cable set consisting of a minimum 18 AWG, Type SVT or SJT three-conductor cord a maximum of 15 feet in length and a parallel blade, grounding type attachment plug rated at 15 A, 125 V.² ²For 230 V operation in the United States use a UL listed cable set consisting of a minimum 18 AWG, Type SVT or SJT three-conductor cable a maximum of 15 feet in length, and a tandem blade, grounding type attachment plug rated at 15 A, 250 V.² v Appliance couplers complying with International Electrotechnical Commission (IEC) Standard 320, Sheet C13. Power cables used in other countries consist of the following: v Electrical cables, Type HD21. v Attachment plugs approved by the appropriate testing organization for the specific countries where they are used. ²For units set at 230 V (outside of U.S.): use a cable set consisting of a minimum 18 AWG cable and grounding type attachment plug rated 15 A, 250 V. The cable set should have the appropriate safety approvals for the country in which the equipment will be installed and should be marked `HAR’.² For information about the power cables that are available, refer to Chapter 4, “Parts Information”, on page 157. Chapter 1. Reference Information 37 Service Inspection Guide Perform a service inspection on the system (processor subsystem and connected I/O subsystems) when: v The system is inspected for a maintenance agreement. v Service is requested and service has not recently been performed. v An alterations and attachments review is performed. v Changes have been made to the equipment that may affect the safe operation of the equipment. v External devices with their own power cables have those cables attached. If the inspection indicates an unacceptable safety condition, the condition must be corrected before anyone can service the machine. Note: The owner of the system is responsible to correct any unsafe conditions. Perform the following checks: 1. Check the covers for sharp edges and for damage or alterations that expose the internal parts of the system. 2. Check the covers for proper fit to the system. They should be in place and secure. 3. Gently rock the system from side to side to determine if it is steady. 4. Set the power button of the system to Off. 5. Remove the covers. 6. Check for alterations or attachments. If there are any, check for obvious safety hazards, such as broken wires, sharp edges, or broken insulation. 7. Check the internal cables for damage. 8. Check for dirt, water, and any other contamination within the system. 9. Check the voltage label on the back of the system to ensure that it matches the voltage at the outlet. 10. Check the external power cable for damage. 11. With the external power cable connected to the system, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame. 12. Perform the following checks on each device that has its own power cables: a. Check for damage to the power cable. b. Check for the correctly grounded power cable. c. With the external power cable connected to the device, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame of the device. 13. Install the covers. 38 D10 and D20 Service Guide Chapter 2. Diagnostics and Maintenance Procedures The system uses an integrated set of software diagnostic procedures to help isolate failing components and system maintenance. This book, along with the Diagnostic Information for Multiple Bus Systems, is the basis of the diagnostic procedures. Refer to the system service guide when running diagnostics on your I/O subsystem. Power and Attention LEDs The Power and Attention LEDs provide a means to identify failing components in your subsystem. When a failing component is detected in your system, the system’s Attention LED is turned on. Component LEDs To further help you identify the failing component, all system components have individual LEDs that indicate a failure when lit. The LEDs are either on the component itself or on the carrier of the component (memory card, fan, memory module, or CPU). The LEDs are either green or amber in color. A lit green LED indicates that the system or component is receiving power. A lit amber LED identifies a system or component on the system. The amber LED can also indicate when a system or component on a system has a problem or fault. For more information on LEDs and their definitions, see Chapter 5, “Fault and Attention LEDs”, on page 169. Resetting the LEDs To reset the LEDs, do the following: 1. Replace the failing component with the new component. 2. Log in as root user. 3. At the command line, type diag. 4. Select Task Selection. 5. Select Log Repair Action. 6. Select the device that was repaired. (If the device is not listed, select sysplanar0.) Checkpoints The system uses various types of checkpoints, error codes, and SRNs, which are referred to throughout your system’s service guide. These codes can appear in the service processor boot progress log, the AIX error log, and the operator panel display. Understanding the definition and relationships of these codes is important to the service personnel who are installing or maintaining the system. Service Request Numbers (SRNs) are listed in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. FRU Isolation For a list of error codes and recommended actions for each code, see your system’s service guide. These actions can refer to Chapter 4, “Parts Information”, on page 157, “Maintenance Analysis Procedures (MAPs)” on page 40, or provide informational message and directions. If a replacement part is indicated, direct reference is made to the part name. The respective AIX and physical location codes are listed for each occurrence as required. For a list of locations codes, see your system’s service guide. 39 Maintenance Analysis Procedures (MAPs) The maintenance analysis procedures (MAPs) provide the service representative a step-by-step procedure to analyze a problem with the system and I/O subsystem hardware. Be prepared to record code numbers and other data while using the MAPs. Because the D10 and D20 are subsystems, refer to your system’s service guide. The system’s service guide contains all of the MAPs needed for servicing the D10 and D20 I/O subsystems. Checkpoints Checkpoints display on the system operator panel and the virtual terminal while the system unit is powering on and going through the initial program load (IPL). See the system’s service guide for descriptive information and a complete listing of all checkpoints used by the D10 and D20 I/O subsystems. Error Code to FRU Index The Error Code to FRU Index lists fault symptoms and possible causes. The most likely cause is listed first. Use this index to help you decide which FRUs to replace when servicing the D10 or D20 I/O subsystem. See your system’s service guide for more information and a complete listing of all error codes and FRUs used by the D10 and D20 I/O subsystem. Using the Service Processor The service processor is in the processor subsystem to which your I/O subsystem is connected and runs on its own power boundary. The service processor continually monitors hardware attributes and the environmental conditions within the system and connected I/O subsystems. The service processor is controlled by firmware and does not require the operating system to be operational to perform its tasks. The service processor menus allow you to configure service processor options, as well as enable and disable functions. Service processor menus are available using an ASCII terminal or an HMC virtual terminal window when OK is displayed on the operator panel or when the service processor has detected a system problem (such as a surveillance failure). For more information about the service processor for the system to which your I/O subsystem is connected, see the system’s service guide. Using System Management Services Use the system management services menus to view information about the processor subsystem to which the I/O subsystem is connected. For more information about the service processor, see the service guide for the attached system. 40 D10 and D20 Service Guide Chapter 3. Removal and Replacement Procedures Before performing any of the removal or replacement procedures in this chapter, read the following danger and caution notices. Safety Considerations Observe the following safety precautions anytime you work with these I/O subsystems. For the I/O subsystem you are about to setup or service: v The ac power interface connector is considered the main power disconnect device. v The I/O subsystems have independent redundant power supply capabilities, meaning that each unit might be configured to have two power supplies running simultaneously in the same I/O subsystem. When instructed to disconnect the power source, ensure that all power cables that run to each serviced subsystem are disconnected from the power distribution bus. DANGER An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Before installing or removing signal cables, ensure that the power cables for the system unit and all attached devices are unplugged. When adding or removing any additional devices to or from the system, ensure that the power cables for those devices are unplugged before the signal cables are connected. If possible, disconnect all power cables from the existing system before you add a device. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines. D05 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock. C01 CAUTION: This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect two power supply cords before servicing. C21 41 Handling Static-Sensitive Devices Attention: Electronic boards and disk drives are sensitive to static electricity discharge. These devices are wrapped in antistatic bags to prevent this damage. Take the following precautions: v If you have an antistatic wrist strap available, use it while handling the device. v Do not remove the device from the antistatic bag until you are ready to install the device in the system. v With the device still in its antistatic bag, touch it to a metal frame of the system. v Grasp cards and boards by the edges. Hold drives by the frame. Avoid touching the solder joints or pins. v If you need to lay the device down while it is out of the antistatic bag, lay it on the antistatic bag. Before picking it up again, touch the antistatic bag and the metal frame of the system at the same time. v Handle the devices carefully to prevent permanent damage. Stopping and Starting the System For procedures to power the system on and off, refer to the service guide of the processor subsystem to which the I/O subsystem is attached. Removal and Replacement Procedures for the D10 I/O Subsystem The following procedures cover the removal and replacement of the D10 I/O Subsystem FRUs. Note: Before performing any of the removal or replacement procedures in this chapter, read the danger and caution notices on “Safety Considerations” on page 41. D10 FRU Replacement Procedure List Field Replaceable Unit (FRU) FRU Procedure Name and Page Location Adapters “D10 PCI Adapters” on page 45 Bezel “D10 Front Bezel” on page 43 Cover “D10 Covers” on page 44 Blower “D10 Fan” on page 104 I/O Backplane “D10 I/O Backplane Assembly” on page 106 Operating Position “D10 Operating Position” on page 43 Power Supplies “D10 Power Supply” on page 107 RIO Bus Adapter “D10 RIO Bus Adapter Assembly” on page 105 Service Position “D10 Service Position” on page 43 Static-Sensitive Devices “Handling Static-Sensitive Devices” Stopping and Starting the System “Stopping and Starting the System” 42 D10 and D20 Service Guide D10 Service Position To perform a removal or replacement procedure that requires access to the inside of the I/O subsystem, the subsystem must be removed from the rack and placed on a stable work surface. To put the drawer into the service position, do the following: 1. Shut down the system unit to which the I/O subsystem is connected. 2. From the rear of the rack, disconnect the I/O subsystem’s power cables from the power distribution bus. 3. Remove the retaining screws located on the rear of the I/O subsystem drawer. 4. Label and disconnect all of the cables connected to the rear of the I/O subsystem. 5. From the rear of the rack, pull the I/O subsystem straight back until the subsystem stops. 6. To allow the subsystem to move further to the rear, press the stop latch on the side of the enclosure. 7. Support the I/O subsystem as you pull it toward the rear of the rack. 8. When the I/O subsystem is out of the rack, place it on a stable work surface. D10 Operating Position To return the subsystem to the operating position, do the following: 1. From the rear of the rack, insert the I/O subsystem into the position from which it was removed. The end of the I/O subsystem that has the power supplies goes toward the front of the rack. 2. Support the I/O subsystem as you push it toward the front of the rack. 3. Install the retaining screws in the rear of the I/O subsystem drawer. 4. Reconnect the cables to the rear of the I/O subsystem. 5. Reconnect the power cables. 6. Restart the system. D10 Front Bezel Before performing the following procedure, read the “Safety Notices” on page vii. Removal To remove the front bezel, do the following: 1. Open the front rack door. 2. Simultaneously press in both bezel-release tabs. 3. Pivoting the bezel from the top, swing the top forward. 4. Pull the bottom of the bezel up, then away from the subsystem chassis. This action releases the two tab hooks located on the bottom of the I/O subsystem chassis. Front Bezel Replacement To replace the front bezel, do the following: 1. Open the front rack door. 2. Insert the two tabs located on the bottom edge of the bezel into their locking hooks, located on the chassis. 3. Pivot the front bezel up toward the top of the chassis. 4. Align the release tabs to the matching slots located on the front of the subsystem chassis. 5. Gently push the tabs into the slots until the bezel seats against the front of the subsystem. 6. If the subsystem is not completely in the operating position, push the subsystem back into the operating position as described in “D10 Operating Position”. 7. Close the rack door. Chapter 3. Removal and Replacement Procedures 43 D10 Covers Before performing the following procedure, read the “Safety Considerations” on page 41. Service Access Cover Removal To remove the service access cover, do the following: 1. Put the I/O subsystem into the service position as described in “D10 Service Position” on page 43. 2. Loosen the two captive thumbscrews located on the rear of the cover. 3. From the rear of the I/O subsystem, lift the cover and slide it backwards until the front disengages. Lift the cover off the I/O subsystem drawer. 1 2 1 Service access cover 2 Thumbscrews Service Access Cover Replacement To replace the service access cover, do the following: 1. Position the cover over the rear of the I/O subsystem. 2. Align the service access cover with the I/O subsystem so that the front portion of the cover engages with the front part of the I/O subsystem chassis. The flanges on the left and right sides of the cover should be on the outside of the I/O subsystem chassis. 3. Hold the service access cover down and slide it forward toward the front of the I/O subsystem. The front edge of the service access cover engages the front portion of the I/O subsystem. 4. Push in to engage and then tighten the thumbscrews located on the rear of the cover. 44 D10 and D20 Service Guide D10 PCI Adapters Most PCI adapters can be removed and replaced, or installed in the system without turning off the power to the entire system. These adapters are referred to as hot-pluggable PCI adapters. Some adapters are not hot-pluggable and power must be removed from the system for adapter removal, replacement, or installation. Though some adapters are not hot-pluggable, they must still be installed in the cassette. Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly before it is reinstalled in a system unit or an I/O drawer. Before you remove or replace an adapter, determine if the PCI adapter you are working with is hot-pluggable. See the PCI Adapter Placement Reference, order number SA23-2504. If you are removing a PCI adapter that is: v Not hot-pluggable, go to “Removing a Non-Hot-Pluggable PCI Adapter”. v Hot-pluggable, go to “Hot-Pluggable PCI Adapter” on page 46. Removing a Non-Hot-Pluggable PCI Adapter To remove a non-hot-pluggable adapter, do the following: 1. Turn off power and remove the power from the system as described in “Stopping and Starting the System” on page 42. 2. Determine the slot from which you are removing the adapter. 3. Disconnect any cables that are connected to the adapter being removed. 4. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 5. When the handle is completely lowered, push the gray locking cross bar. 6. Carefully pull the PCI card cassette straight out from the I/O subsystem as shown in the following illustration. Take care not to pull EMC gaskets from neighboring cassettes. 1 I/O Subsystem 2 PCI adapter Cassette 7. If you are installing another adapter in this slot, follow the instructions given in “Replacing a Non-Hot-Pluggable PCI Adapter” on page 46. 8. If you are not installing another adapter in this slot, install a blank PCI adapter cassette into the slot. 9. Connect power to the system and turn the power on. 10. Return the system to normal operations. Chapter 3. Removal and Replacement Procedures 45 Replacing a Non-Hot-Pluggable PCI Adapter Replace the adapter using the following procedure: Note: Because the adapter you are installing is not hot-pluggable, shut down the system and remove power before performing this procedure. 1. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 2. When the handle is completely lowered, push the gray locking cross bar. 3. Hold the assembly straight on and level with the slot. 4. Align the bottom edge of the PCI cassette cover with the PCI card guide rail on the I/O backplane. Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover with the ²tick² in the notch of the neighboring cassette. 5. Slide the cassette partially into the guide. 6. Ensure the dovetail on the top track aligns with its mating component(s) on both sides. 7. When the cassette is fully inserted, prepare to activate the handle by lowering the gray locking bar. Lift the handle all the way up until you hear a click. The PCI card should be completely seated. Note: To insert the adapter correctly, some minor forward or backward movement of the PCI adapter cassette might be necessary. 8. Set the color slide to blue. 9. Connect power to the system, and turn on the power as described in “Stopping and Starting the System” on page 42. 10. Ensure that the adapter is configured when the system completes the boot process. Hot-Pluggable PCI Adapter Before performing the following procedure, read “Safety Notices” on page vii. Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. The following hot-plug procedures take you through removing and replacing hot-plug PCI adapters using software that presents procedures on your display. The LEDs on a PCI adapter cassette are described in “I/O Subsystem PCI-X Slot LED Definitions” on page 2. 46 D10 and D20 Service Guide Removing and Replacing a Hot-Pluggable PCI Adapter Note: Use this procedure only when you are replacing an adapter with an identical adapter. If you are replacing an adapter with an adapter that is not identical to the adapter removed, go to “Removing a Hot-Pluggable PCI Adapter” on page 49, and then to “Installing a Hot-Pluggable PCI Adapter” on page 51. To replace an adapter, perform the following steps: 1. Determine the slot from which you are removing the adapter. 2. Ensure that any processes or applications that might use the adapter are stopped. Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. 3. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 4. At the command line, type smitty. 5. Select Devices. 6. Select PCI Hot Plug Manager. 7. Select Unconfigure a Device and press Enter. 8. Press F4 to display the Device Names menu. 9. From the menu, select the adapter you are removing. 10. Answer YES to Keep Definition. Press Enter. 11. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfigure is indicated by the OK message displayed next to the Command field at the top of the screen. 12. Press F3 to return to the PCI Hot-Plug Manager menu. 13. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI Hot-Plug Adapter menu displays. 14. Move the cursor to select the adapter that you are removing and press Enter. 15. Press the Tab key until the entry field displays the replace operation and then press the Enter key. Follow the instructions that display on the screen until you are instructed to remove the adapter. 16. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed. 17. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 18. When the handle is completely lowered, push the gray locking cross bar. Chapter 3. Removal and Replacement Procedures 47 19. Carefully pull the PCI card cassette straight out from the rear of the subsystem. Take care not to pull EMC gaskets from neighboring cassettes. 1 I/O Subsystem 2 PCI Adapter Cassette 20. Install the replacement adapter in the PCI adapter cassette, see “PCI Adapter or Blank Filler Removal from a Cassette Assembly” on page 59. 21. When the adapter is installed in the PCI adapter cassette, install the adapter in the adapter slot, as follows: a. Lower the black handle completely. b. Push the gray locking cross bar until you hear a clicking sound. c. Hold the assembly straight on and level with the slot. 1 I/O Subsystem 2 PCI Adapter Cassette d. Align the bottom edge of the PCI cassette cover with the PCI adapter guide rail on the I/O backplane. Note: If there is a cassette to the left of the one that you are installing, align the ridge on the cover with the tick in the notch of the neighboring cassette. e. Slide the cassette partially into the guide. f. Ensure that the dovetail on the top track aligns with its mating component(s) on both sides. g. When the cassette is fully inserted, prepare to activate the handle, lower the gray locking bar. Lift the handle up completely until you hear a click. The PCI adapter should be completely seated. Note: To enable proper insertion of the adapter, some minor forward or backward movement of the PCI adapter cassette might be necessary. h. Set the color slide to the orange color indicating that the adapter in the cassette is hot-pluggable. 22. Connect appropriate cables and devices to the adapter. 48 D10 and D20 Service Guide 23. Continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 24. Press the F3 key to return to the PCI Hot-Plug Manager menu. 25. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on the screen. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 26. If you do not have other adapters to replace, continue with the next step. OR If you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 13 on page 47. 27. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. 28. Install the covers that you removed earlier and return the drawer to the operating position. Removing a Hot-Pluggable PCI Adapter If you are permanently removing an adapter from a system, this procedure should be used. Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Note: If you are removing an adapter and replacing it with an adapter that is identical to the adapter that was removed, use the “Removing and Replacing a Hot-Pluggable PCI Adapter” on page 47 procedure. To remove an adapter, perform the following steps: 1. Open the door of the rack. 2. Determine the slot from which you are removing the adapter. 3. Ensure that any processes or applications that might use the adapter are stopped. Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must also be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For more information on taking the PCI adapter online, see the AIX operating system documentation. Documentation for the AIX operating system is available from the IBM Eserver pSeries Information Center at http://publib16.boulder.ibm.com/pseries/en_US/infocenter/base. Select AIX documentation. The AIX Documentation CD contains the base set of publications for the operating system, including system-management and end-user documentation. Chapter 3. Removal and Replacement Procedures 49 4. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 5. At the command line, type smitty. 6. Select Devices. 7. Select PCI Hot Plug Manager. 8. Select Unconfigure a Device and press Enter. 9. Press F4 to display the Device Names menu. 10. Select the adapter you are removing from the menu. 11. Use the Tab key to answer NO to Keep Definition. Press Enter. 12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfigure is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press F3 to return to the PCI Hot-Plug Manager menu. 14. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI Hot-Plug Adapter menu displays. 15. Move the cursor to select the adapter that you are removing and press Enter. (The description entry displays as unknown). 16. Press the Tab key until the entry field displays the remove operation and then press the Enter key. Follow the instructions that display on the screen until you are instructed to remove the adapter. 17. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed. 18. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 19. When the handle is completely lowered, push the gray locking cross bar. 20. Carefully pull the PCI adapter cassette straight out from the I/O subsystem, as shown in the following illustration. Take care not to pull EMC gaskets from neighboring cassettes. 1 I/O Subsystem 2 PCI adapter Cassette 21. If you are not installing another adapter in this slot, install a blank PCI adapter cassette into the slot. 22. Continue to follow the screen instructions until you receive a message that the adapter removal is successful. Successful removal is indicated by the OK message displayed next to the Command field at the top of the screen. 23. If you do not have other adapters to remove, continue with the next step. OR If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 14. 24. Press F10 to exit the Hot-Plug Manager. 25. Install any covers that you removed earlier, and return the drawer to the normal operations. 50 D10 and D20 Service Guide Installing a Hot-Pluggable PCI Adapter Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Note: Before installing a new adapter, ensure the device driver for the adapter is installed. Check with the system administrator if the device driver is not already installed. To install an adapter, perform the following steps: 1. Install the new adapter into a PCI adapter cassette, see “PCI Adapter or Blank Filler Removal from a Cassette Assembly” on page 59. 2. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 3. At the command line, type smitty. 4. Select Devices. 5. Select PCI Hot Plug Manager. 6. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The Add a Hot-Plug Adapter window displays. 7. See the PCI Adapter Placement Reference, order number SA23-2504, for adapter placement information. Select an empty PCI slot for the adapter. 8. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter. 9. Follow the instructions on the screen to install the adapter until the visual indicator (LED) for the specified PCI slot is set to the Action state. 10. Install the new adapter in the adapter slot by performing the following: a. Lower the black handle completely. b. Push the gray locking cross bar until you hear a clicking sound. c. Hold the assembly straight on and level with the slot. d. Align the bottom edge of the PCI cassette cover with the PCI card guide rail on the I/O backplane. Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover with the ²tick² in the notch of the neighboring cassette. Chapter 3. Removal and Replacement Procedures 51 e. Slide the cassette partially into the guide. f. Ensure that the dovetail on the top track aligns with its mating component(s) on both sides. 1 I/O Subsystem 2 PCI adapter Cassette g. When the cassette is fully inserted, prepare to activate the handle, lower the gray locking bar. Lift the handle all the way up until you hear a click. The PCI Card should be completely seated. h. Set the color slide to the orange color indicating that the adapter in the cassette is hot-pluggable. 11. Connect appropriate cables and devices to the adapter. 12. Continue to follow the screen instructions until you receive a message that the installation is successful. Successful installation is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press the F3 key to return to the PCI Hot-Plug Manager menu. 14. Select Install/Configure Devices Added After IPL and press Enter. Then follow the instructions on the screen. Successful installation is indicated by the OK message displayed next to the Command field at the top of the screen. 15. If you do not have other adapters to install, continue with the next step. OR If you have other adapters to install, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 6 on page 51. 16. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. 17. Install any covers that you removed earlier, and return the drawer to normal operations. 52 D10 and D20 Service Guide PCI Hot-Plug Manager Access Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is appropriate to perform them. Note: A PCI adapter is only hot-pluggable if the PCI adapter supports hot-plug applications. See the PCI Adapter Placement Reference, order number SA23-2504. Accessing Hot-Plug Management Functions Note: Removing or installing a Hot-Pluggable PCI adapter requires the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. To access the hot-plug menus, do the following: 1. Log in as root user. If the system is a partitioned system, log in as root user on the partition that has the adapter assigned to it. 2. At the command line, type smitty. 3. Select Devices. 4. Select PCI Hot Plug Manager and press Enter. 5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. For a description of the menu options, see below. PCI Hot-Plug Manager Menu The following options are available from the PCI Hot Plug Manager menu: Note: For information about the PCI slot LED states, see “I/O Subsystem PCI-X Slot LED Definitions” on page 2. Chapter 3. Removal and Replacement Procedures 53 List PCI Hot-Plug Slots Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the adapter in that slot. Add a PCI Hot-Plug Adapter Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot will go into the Action state and finally into the On state. Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure Devices Added After IPL option to configure the adapter. Replace/Remove a PCI Hot-Plug Adapter Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For this option to work, the adapter must be in the Defined state (see “Unconfigure a Device” option below). You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the Action state. Identify a PCI Hot-Plug Slot Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See “I/O Subsystem PCI-X Slot LED Definitions” on page 2. Unconfigure a Device Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use. This step must be completed successfully before starting any removal or replacement operation. If this step fails, the customer must take action to release the device. Configure a Defined Device Allows a new PCI adapter to be configured into the system if software support is already available for the adapter. The selected PCI slot will go into the On state. 54 D10 and D20 Service Guide Install/Configure Devices Added After IPL The system attempts to configure any new devices and tries to find and install any required software from a user-selected source. The add, remove, and replace functions return information to the user indicating whether the operation was successful. If additional instructions are provided on the screen, complete the recommended actions. If the instructions do not resolve the problem, see the following: v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to configure the adapter. v If you receive a warning indicating that needed device packages are not installed, the system administrator must install the specified packages before you can configure or diagnose the adapter. v If you receive a failure message indicating a hardware error, the problem might be either the adapter or the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different adapter in the slot. If you determine that you have failing hardware, call your service representative. Removing and Replacing a PCI Adapter Cassette Perform these procedures only when you are directed to by the “Removing a Non-Hot-Pluggable PCI Adapter” on page 45 or “Hot-Pluggable PCI Adapter” on page 46. Removing a PCI Adapter Cassette Attention: Perform this procedure only when you are instructed to by the removal and replacement procedures for a “Removing a Non-Hot-Pluggable PCI Adapter” on page 45 or “Hot-Pluggable PCI Adapter” on page 46. Note: An adapter or a blank filler must be installed into the PCI adapter cassette assembly before it is reinstalled in a processor subsystem or an I/O subsystem. The following illustration shows the PCI adapter cassette with the handle lowered. 1 2 3 4 1 dove-tail 3 Handle in the lowered position 2 Gray locking crossbar 4 Center of handle 1. Determine the slot from which you are removing the PCI adapter cassette. 2. Disconnect any cables that are connected to the adapter cassette being removed. Chapter 3. Removal and Replacement Procedures 55 3. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 1 2 3 1 Center of handle 3 Gray locking crossbar 2 Handle in the lowered position 4. When the handle is completely lowered, push the gray locking cross bar. This action prevents the handle from pulling up while the adapter is being removed from the unit. 5. Carefully pull the PCI card cassette straight out from the subsystem as shown in the following illustration. Take care not to pull EMC gaskets from neighboring cassettes. Attention: Do not remove more than one adapter cassette at at time. 1 Processor subsystem 2 PCI adapter cassette Replacing a PCI Adapter Cassette Attention: Perform this procedure only when instructed to by “Removing a Non-Hot-Pluggable PCI Adapter” on page 45 or “Hot-Pluggable PCI Adapter” on page 46. Check the following: v An adapter or blank filler must be installed in each PCI adapter cassette and the top cover for the unit must be installed before installing PCI adapter cassette in the system. v Check the adapter in the cassette that you are about to install and make sure that the adapter is held firmly by the retaining arms and rachet clips on all four corners. v Review the procedures beginning with “PCI Adapter or Blank Filler Removal from a Cassette Assembly” on page 59 to make sure that the adapter is correctly installed in the cassette. Replace the PCI adapter cassette in the unit using the following procedure: 56 D10 and D20 Service Guide 1. Press the center of the handle of the PCI card cassette to release the latch, then lower the handle completely. 1 2 3 4 1 Dove-tail 3 Handle in the lowered position 2 Gray locking crossbar 4 Center of handle 2. Study the PCI adapter slot opening carefully and note the location the dove-tail channels and the cassette guide slot. Use a flashlight to better see the dove-tail channels and the cassette guide slot 1 Dove-tail channels 2 PCI cassette guide slot Chapter 3. Removal and Replacement Procedures 57 3. Hold the assembly straight on and level with the slot. 1 Processor subsystem 2 PCI adapter cassette 4. Align the bottom edge of the PCI cassette cover with the PCI cassette guide rail on the of the system or I/O unit. Note: If there is a cassette to the left of the one you are installing, align the ridge on the cover with the ²tick² in the notch of the neighboring cassette. 5. Slide the cassette partially into the guide. Attention: If the cassette does not slide in smoothly, the cover might not have engaged PCI cassette guide rail. Carefully pull the adapter back out and start again. Also, the cassette might bind if it hits the top cover as it is inserted. 6. As you slide the cassette into the slot, ensure the dove-tails on the top of the cassette align with the mating components on both the left and right sides. 7. Push the cassette in until its bezel is even with the next cassettes (to the left or right). Note: Viewed from the rear, the left and right dove tail channels must be properly engaged. Check the following on the cassette after it is completely inserted into the unit. v There should be no freedom of movement of the cassette when pressure is applied to the handle in a back and fourth motion. v The PCI cassette bezel should be even with the other cassettes that are already installed. v Visually check to ensure that the cassette is parallel with the cassettes to the left or right. If the cassette is not parallel with the neighboring cassettes, the cassette dove-tail channels might not have engaged, pull the cassette out a few inches, reinsert it, and check the alignment again. You might need to apply light pressure to the left or right neighboring cassettes against the new cassette in order to engage both of the dove-tails correctly. 8. When the cassette is fully inserted, ensure that the gray locking bar is in the unlocked position (pulled out). Attention: Do not force the handle into the raised postion. To enable proper insertion of the adapter, press down on the adapter tailstock (metal tab just below the handle hinge) at the rear of the adapter cassette when lifting the handle. Some minor forward or backward movement of the PCI adapter cassette might be necessary. 9. Carfully lift the handle all the way up until you hear and feel a click. The PCI adapter is pushed into the PCI adapter slot when the handle is raised. 58 D10 and D20 Service Guide PCI Adapter or Blank Filler Removal from a Cassette Assembly Use this procedure when you are preparing to install or replace an adapter or an adapter blank filler in the system unit or an I/O drawer. Note: An adapter or an adapter blank filler must be installed into the PCI adapter cassette assembly before it is reinstalled in a system unit or an I/O drawer. Determine the slot of the system or I/O drawer in which you plan to install or remove an adapter, and refer to the procedures for removing an adapter from the system. Remove the PCI adapter cassette assembly from the system unit or I/O drawer before beginning this procedure. Note: It may take approximately 30 to 40 minutes to perform this procedure the first time. This time includes using the instructions in this guide and performing the steps. Thereafter, performing this procedure usually takes approximately 10 minutes. Before performing the following steps, familiarize yourself with the entire procedure. To remove a PCI adapter or blank filler from a cassette assembly, do the following: 1. Place the PCI adapter cassette assembly on a flat work surface with the cover facing up, and the top of the adapter facing you. See the following illustration. 1 Cover 3 Top of Cover 2 Handle Chapter 3. Removal and Replacement Procedures 59 2. Using two fingers, remove the bushing-lock pin from the bushing. The pin can be removed by pulling it out of the bushing with your fingernails. 1 Bushing 2 Bushing-Lock Pin 1 Bushing-Lock Pin 60 D10 and D20 Service Guide 3. Remove the bushing. The bushing can be removed by pulling it out of the PCI adapter cassette assembly with your fingernails. 1 Bushing 4. Turn over the PCI adapter cassette assembly so that the top is facing away from you. 1 Top of Adapter 2 Handle Chapter 3. Removal and Replacement Procedures 61 5. Remove the bezel, as follows: a. Locate the plastic latch fingers in the top part of the cassette. 1 Top of Cassette 2 Bezel 3 Plastic Latch Fingers b. Using one hand, pinch the plastic latch fingers, and with your other hand, carefully lift the top part of the bezel extension out until the tabs clear the slots in the PCI adapter cassette assembly. 1 Slots 2 Tab 3 Bezel Extension 62 D10 and D20 Service Guide c. While holding the bezel extension out, push the plastic cover latch out of the bezel hook, as shown in the following illustration. This action allows the bezel to be removed. 1 Bezel Extension 2 Plastic Cover Latch in Bezel Hook d. On the opposite side of the cassette (cover side), push the cover latch to release the bezel. 1 Cover Side 2 Cover Latch Chapter 3. Removal and Replacement Procedures 63 e. While holding the bezel extension out, carefully slide the bezel off. You might have to work from both sides to loosen the bezel assembly from the cassette assembly. When the bezel is free, slide it completely off the cassette assembly, and set it aside. 1 Bezel Extension 2 Bezel 1 Bezel 64 D10 and D20 Service Guide 6. Turn over the cassette so that the cover is facing up, as shown in the following illustration. Remove the cover from the cassette as follows: a. Slide the cover until it releases from the cassette assembly. Attention: The cover might be tight and difficult to slide. If you grasp the left end (handle end) of the cassette and the right end of the cover, you can use enough force to pull the cover off the PCI adapter cassette assembly. 1 Left End of the Cassette 2 Cassette Cover b. Lift the cover off the assembly, and set it aside. c. By pulling on both sides of the gray plastic locking bar, which is located on the handle, ensure that the handle is pulled into the unlocked position. Raise the handle on the cassette linkage assembly until it locks into the up position (the blank filler or adapter moves downward). 1 Handle 2 Gray Plastic Locking Bar Chapter 3. Removal and Replacement Procedures 65 7. Remove the metal EMC shield from the top of the tailstock. 1 Metal EMC Shield 8. Remove the blank filler or adapter that is installed in the cassette linkage assembly. Note: If there is a blank filler in the cassette linkage assembly, as shipped from the manufacturer, there are two adapter arms. One adapter arm is used with short adapters (short adapter arm) and the other adapter arm is used with long adapters (long adapter arm). If you are removing a short adapter or blank filler, see “Short Adapter or Blank Filler Removal” on page 67. If you are removing a long adapter, see “Long Adapter Removal” on page 69. 66 D10 and D20 Service Guide Short Adapter or Blank Filler Removal To remove a short adapter or blank filler, do the following: 1. Slide the long and short adapter arms away from the adapter or blank filler by doing the following: a. Each adapter arm has a release tab that allows the arm to be moved away from the adapter or blank filler in the cassette assembly. Use your fingernail to lift the tab, to allow each arm to be moved away from the adapter or blank filler. Lift the release tab on the short adapter arm, and push on the slotted tab to release the end of the blank filler. Note: If you plan to install a short adapter, leave the long adapter arm on the cassette linkage assembly. If you plan to install a long adapter, remove both the long and short adapter arms from the cassette linkage assembly in the next step. b. Slide the long and short adapter arms away from the blank filler or adapter. 1 Long Adapter Arm 4 Short Adapter Arm 2 Release Tab 5 Slotted Tab 3 Release Tab Chapter 3. Removal and Replacement Procedures 67 2. Remove the adapter or blank filler from the cassette linkage assembly by rotating the bottom of the tailstock out, as shown in the following illustration. Store the adapter or blank filler in a safe place. 1 Cassette Linkage Assembly 3 Handle 2 Bottom of Tailstock 3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or blank filler in the cassette, go to “Installing a Hot-Pluggable PCI Adapter” on page 51. 68 D10 and D20 Service Guide Long Adapter Removal To remove a long adapter, do the following: 1. Each adapter arm has a release tab that allows the arm to be moved away from the adapter in the cassette assembly. Use your fingernail to lift the tab, to allow the arm to be moved away from the adapter. Lift the release tab on the long adapter arm, and slide it off the cassette linkage assembly. 1 Release Tab 2. Remove the adapter from the cassette linkage assembly by rotating the bottom of the tailstock out, as shown in the following illustration. Store the adapter in a safe place. 1 Cassette Linkage Assembly 3 Bottom of Tailstock 2 Adapter 4 Handle 3. The removal procedure for the PCI adapter cassette assembly is complete. To install a new adapter or blank filler in the cassette, go to “Installing a Hot-Pluggable PCI Adapter” on page 51. Chapter 3. Removal and Replacement Procedures 69 Replacing an Adapter in a PCI Adapter Cassette This procedure is performed when preparing to install a new adapter or a blank filler in the processor subsystem or an I/O drawer. Familiarize yourself with the entire procedure before performing the following steps. 1. Place the empty PCI adapter cassette linkage on a flat work surface in front of you. Position the cassette so that the handle is in the raised (up position), and on the left, with the top of the linkage facing away from you. Note: Your PCI adapter cassette linkage might not have the short or long adapter retaining arms installed if you removed them during the remove procedure. In the following figure, both arms are shown. Note: If you are installing an adapter that is in the following list, use the additional brackets listed to help when you seat the adapter during installation. There are unique brackets (1) for each adapter type (2): 1 2 53P5450 Bracket for Gigabit Ethernet PCI-X Adapter, FC 5700 or 10/100/1000 Base-TX Ethernet PCI-X Adapter, FC 5701 44P2661 Bracket for Dual Channel Ultra3 SCSI PCI Adapter, FC 6203 Type 4Y 44P0321 Bracket for 10/100 BaseT Ethernet PCI Adapter, FC 4962 Type AF 44P2675 Bracket for 2 Gigabit Fibre Channel Adapter, FC 6228 Type 4W 44P2676 Bracket for Gigabit Ethernet - SX PCI Adapter, FC 2969 Type 9U, or 10/100/1000 Base-T Ethernet PCI Adapter, FC 2975 Type A-A 70 D10 and D20 Service Guide 2. Use the following procedure to determine if you are installing a long adapter or a short adapter. If you know the length of your adapter, skip the following steps and proceed to step 3 on page 73. a. Remove the adapter from its protective packaging and, if present, remove the plastic extension handle from the end of the adapter. b. Place the adapter or a blank filler over the cassette with the upper left corner aligned into the top adapter-retaining clip. 5 4 2 3 1 1 Top Adapter-Retaining Clip 3 Adapter 2 Cassette Linkage Assembly 4 Bottom of Tailstock 5 Handle Chapter 3. Removal and Replacement Procedures 71 c. The cassette linkage is marked to show long or short adapters similar to the adapter shown in the following figure. Check the right end of the adapter to determine if it is long enough to fall into the long area of the cassette linkage. Ensure that the corner on the left end of the adapter is still aligned into the top adapter-retaining clip and determine the length of the adapter. LONG ARM ARM SHORT 1 1 Top Adapter-Retaining Clip 72 D10 and D20 Service Guide 3. Remove the adapter or blank filler and ensure that the handle is in the up position. If you need to move the handle, ensure that the gray locking bar is pulled into the unlocked position, and then rotate the adapter handle until the handle is in the up position (the cassette linkage will extend down beyond the cassette top). If you are installing a short adapter or blank filler, proceed to “Short Adapter or Blank Filler Installation” on page 74. If you are installing a long adapter, proceed to “Long Adapter Installation” on page 89. 2 1 1 Handle 2 Lock Chapter 3. Removal and Replacement Procedures 73 Short Adapter or Blank Filler Installation Use the following procedure to install a short adapter or blank filler. 1. Install the adapter or blank filler by doing the following: a. Place the adapter or blank filler into the cassette so that the upper-left corner of the adapter engages the adjustable top adapter-retaining clip as shown in the following figure. 4 3 1 2 1 Cassette Linkage Assembly 3 Retaining Clip 2 Bottom of Tailstock 4 Handle 74 D10 and D20 Service Guide b. Rotate the adapter so that the adapter engages the slot in the bottom adapter-retaining clip and the top corner of the adapter is seated into the adjustable top adapter-retaining clip. Note: If the adapter is not a full-height adapter, you must slide the adjustable top adapter-retaining clip downward until the lower edge of the adapter is seated into the slot on the bottom adapter-retaining clip. 3 2 1 1 Adapter 3 Bottom Retaining Clip 2 Slot Chapter 3. Removal and Replacement Procedures 75 c. Slide the short-adapter retaining arm toward the adapter or blank filler on the cassette linkage rail. 1) If the short-adapter retaining arm has been removed from the cassette linkage rail, install the short-adapter retaining arm and then slide the arm until it contacts the adapter or blank filler as shown in the following figure. d. Ensure that the adjustable top retainer clip catches the corner of the adapter as shown in the following figure: 1 1 Short Adapter Top Retaining Clip 76 D10 and D20 Service Guide e. Use the lower short adapter-retaining clip to engage and hold the bottom of the adapter. Note: It might be necessary to apply pressure to engage and hold the bottom of the adapter. 1 1 Lower Short Adapter Arm Clip f. To hold the top of the adapter, slide down the adjustable top adapter-retaining clip on the retaining arm. Ensure that the bottom edge of the adapter is held by the lower part of the adapter retaining arm. 1 1 Short Adapter Adjustable Retaining Clip Chapter 3. Removal and Replacement Procedures 77 g. Press the lock on the handle, and rotate the adapter handle until it is in the down position (adapter or blank filler moves up into the cassette assembly). 2 1 1 Handle 2 Lock Ensure the right end of the cassette linkage rail moves up into the cassette linkage assembly as shown in the following figure: 1 1 Right End of Cassette Linkage 78 D10 and D20 Service Guide h. Position the adapter and cassette assembly with the handle on the left (in the down position) and the top facing away from you. 2 1 1 Top of Cassette 2 Handle Chapter 3. Removal and Replacement Procedures 79 i. Install the cover on the cassette assembly: 1) Place the cassette cover on the cassette assembly as shown below. Slide the cover toward the handle until the hole in the cover aligns with the hole in the cassette assembly. 1 2 1 Handle 2 Holes Aligned 80 D10 and D20 Service Guide j. Install the bezel assembly using the following procedure. 1) Carefully slide the bezel onto the cassette assembly. 1 1 Bezel 2) Align and insert the cover arm latch in the hooked notch in the bezel. Chapter 3. Removal and Replacement Procedures 81 3) Align the top of the bezel assembly into the grooves on the top of the cassette assembly. Push the bezel onto the cassette linkage until the tab on the top of the bezel is seated in the recess of the cassette assembly. 1 2 3 4 1 Recess 3 Tab 2 Groove 4 Bezel Assembly 4) Insert the two tabs on the bezel extension into the two slots on the cassette assembly. 1 1 Tabs 82 D10 and D20 Service Guide k. Check for the following: 1) Ensure that the extension arm engages the pins on the cassette. You should be able to see the three pins in the holes in the arm. The bezel tab should be seated as shown in the following figure. 1 1 1 1 Pin 2) Ensure the cover arm latch is completely pressed into the hooked slot on the bezel as shown. Chapter 3. Removal and Replacement Procedures 83 3) Turn the cassette assembly over so the cover is up. Check the cover latch to ensure it is holding the bezel to the cover as shown. 1 1 Cover Latch l. Ensure the holes are aligned, and insert the bushing as shown. 1 1 Bushing 84 D10 and D20 Service Guide m. Insert the bushing lock pin into the hole in the bushing and push it in until it seats. 1 1 Bushing Lock Pin n. By pulling on both sides, ensure the gray plastic locking bar on the handle is pulled into the unlocked position. Raise the handle on the cassette linkage until it locks into the up position (the blank filler or adapter moves downward). 1 Handle 2 Gray Plastic Locking Bar Chapter 3. Removal and Replacement Procedures 85 o. To install the metal EMC shield on the adapter bracket, do the following: 1) Ensure that the shield slides up inside the top of the cassette. 1 Metal EMC Shield 86 D10 and D20 Service Guide 2) The metal shield has clips that slide over the top of the tailstock. Ensure that these clips are holding the EMC shield to the tailstock. 1 1 Top of Tailstock p. Press the lock on the handle and rotate the adapter handle until the handle is in the down position (adapter or blank filler moves up into the cassette assembly). 1 1 Handle Chapter 3. Removal and Replacement Procedures 87 q. Using the system documentation, determine if the adapter you are installing is hot-swappable. If the adapter is hot-swappable, move the slider on the color indicator to allow the orange to be visible. If the adapter is not hot-swappable, the blue color is visible. 1 2 1 Orange for Hot-Swappable 2 Blue for Not Hot-Swappable r. The adapter is ready to be installed into a system or an I/O drawer. 88 D10 and D20 Service Guide Long Adapter Installation Use the following procedure to install a long adapter. Note: Ensure that both the retaining arms for the long and short adapters are removed from the cassette linkage. Before continuing, see step 1a on page 67 to remove the arms. 1. Install the long adapter by doing the following: a. Place the adapter into the cassette so that the upper left corner of the adapter engages the top adjustable adapter-retaining clip as shown in the following figure: 5 4 2 3 1 1 Top Adapter-Retaining Clip 3 Adapter 2 Cassette Linkage Assembly 4 Bottom of Tailstock 5 Handle Chapter 3. Removal and Replacement Procedures 89 b. Rotate the adapter so that the adapter engages the slot in the bottom retaining clip and the top corner of the adapter is seated into the top adjustable adapter-retaining clip. Note: If the adapter is not a full-height adapter, slide the top adjustable adapter-retaining clip downward until the lower edge of the adapter is seated into the slot on the bottom retaining clip. 2 1 3 1 Adapter 3 Bottom Retaining Clip 2 Slot 90 D10 and D20 Service Guide c. Install the long adapter-retaining arm onto the cassette linkage rail, as shown in the following figure, and slide the arm toward the adapter. 1 2 3 1 Cassette Linkage Assembly 3 Top Adapter Retaining Clip 2 Adapter Retaining Arm Chapter 3. Removal and Replacement Procedures 91 d. Ensure that the top adjustable adapter-retaining clip catches the corner of the adapter as shown in the following figures. Long Arm Adjustable Retaining Clip 1 Long Arm Adjustable Retaining Clip e. Ensure that the bottom edge of the adapter is held by the groove in the lower part of the adapter-retaining arm. 1 2 1 Long Adapter Arm Adjustable Retaining Clip 2 Long Adapter Retaining Arm Slot 92 D10 and D20 Service Guide f. To hold the top of the adapter, slide down the top adjustable adapter-retaining clip on the retaining arm. 1 1 Long Adapter Retaining Arm g. Press the lock on the handle and rotate the adapter handle until it is in the down position (adapter or blank filler moves up into the cassette assembly). 2 1 1 Handle 2 Lock Chapter 3. Removal and Replacement Procedures 93 Ensure the right end of the cassette linkage rail moves up into the cassette linkage assembly as shown below: 1 1 Right End of Cassette Linkage h. Position the adapter and cassette assembly with the handle on the left (in the down position) and the top facing away from you. 1 2 1 Handle 2 Top of Cassette 94 D10 and D20 Service Guide i. Install the cover on the cassette assembly by placing the cassette cover on the cassette assembly as shown below. Slide the cover toward the handle until the hole in the cover aligns with the hole in the cassette assembly. 1 2 1 Top of Cassette 2 Handle Chapter 3. Removal and Replacement Procedures 95 j. Install the bezel assembly using the following procedure: 1) Carefully slide the bezel onto the cassette assembly. 1 1 Bezel 2) Align and insert the cover arm latch in the hooked notch in the bezel. 96 D10 and D20 Service Guide 3) Align the top of the bezel assembly into the grooves on the top of the cassette assembly and then push the bezel onto the cassette linkage until the tab on the top of the bezel is seated in the recess of the cassette assembly. 1 2 3 4 1 Recess 3 Tab 2 Groove 4 Bezel 4) Insert the two tabs on the bezel extension into the two slots on the cassette assembly. 1 1 Tabs Chapter 3. Removal and Replacement Procedures 97 k. Check for the following: 1) Ensure that the extension arm engages the three pins on the cassette. You should be able to see the pins in the holes in the arm. The bezel tab should be seated as shown in the following figure. 1 1 1 1 Pins 2) Ensure the cover arm latch is completely pressed into the hooked slot on the bezel as shown. 98 D10 and D20 Service Guide 3) Turn the cassette assembly over so the cover is up and check the cover latch to ensure it is holding the bezel to the cover as shown. 1 1 Cover Latch l. Position the adapter with the cover side up. Ensure the holes are aligned and insert the bushing as shown. 1 1 Bushing Chapter 3. Removal and Replacement Procedures 99 m. Insert the bushing lock pin into the hole in the bushing. Push on the pin until it seats. 1 1 Bushing Lock Pin n. By pulling on both sides, ensure the gray plastic locking bar on the handle is pulled into the unlocked position. Raise the handle on the cassette linkage until it locks into the up position (the blank filler or adapter moves downward). 1 Handle 2 Gray Plastic Locking Bar 100 D10 and D20 Service Guide o. To install the metal EMC shield on the adapter bracket, do the following: 1) Ensure that the shield slides up inside the top of the cassette. 1 Metal EMC Shield Chapter 3. Removal and Replacement Procedures 101 2) The metal shield has clips that slide over the top of the tailstock. Ensure that these clips are holding the EMC shield to the tailstock. 1 1 Top of Tailstock p. Press the lock on the handle and rotate the adapter handle until the handle is in the down position (adapter or blank filler moves up into the cassette assembly). 1 1 Handle q. Using your system documentation, determine if the adapter you are installing is hot-swappable. If the adapter is hot-swappable, move the slider on the color indicator to allow the orange to be 102 D10 and D20 Service Guide visible. If the adapter is not hot-swappable, the blue color is visible. 1 2 1 Orange for Hot-Swappable 2 Blue for Not Hot-Swappable r. The adapter is ready to be installed into a system or an I/O drawer. Refer to your system documentation for more information. Chapter 3. Removal and Replacement Procedures 103 D10 Fan Before performing the following procedure, read “Safety Considerations” on page 41. Hot-Swap Fan Assembly Removal To remove a hot-swap fan, do the following: 1. Locate the cooling fan assembly on the front of the I/O subsystem. 2. Unscrew the thumbscrew that holds the fan into the I/O subsystem. 3. Pull the fan assembly straight out until it is clear of the I/O subsystem. 2 1 1 Fan 2 Thumbscrew Hot-Swap Fan Assembly Replacement To replace your subsystem’s hot-swap fan, do the following: 1. Position the fan with the connector (located on the fan housing) facing in toward the I/O subsystem. 2. Insert the replacement fan assembly into the I/O subsystem as far as possible. 3. Screw in the thumbscrew to hold the fan in position. 104 D10 and D20 Service Guide D10 RIO Cable Removal 1. Turn off the power to the system. 2. Release the RIO cable retainers. 3. Disconnect the RIO cable. Replacement Replace in reverse order. D10 RIO Bus Adapter Assembly Removal Before performing the following procedure, read the “Safety Considerations” on page 41. 1. Turn off the system power. 2. Perform the procedure to put the I/O subsystem into the service position as described in “D10 Service Position” on page 43. 3. Remove the service access cover. (See “D10 Covers” on page 44). 4. Remove the screws that attach the RIO riser card to the I/O subsystem chassis. 5. Carefully pull the card straight up and out of the slot. Replacement Replace in reverse order. Chapter 3. Removal and Replacement Procedures 105 D10 I/O Backplane Assembly Before performing the following procedure, read the “Safety Considerations” on page 41. Note: The I/O backplane and the SPCN riser card are replaced as a pair. Removal 1. Turn off the system power. 2. Put the I/O subsystem into the service position as described in “D10 Service Position” on page 43. 3. Label and remove the PCI adapters. (See “D10 PCI Adapters” on page 45). 4. Remove the service access cover. (See “D10 Covers” on page 44). 5. Disconnect the fan cable from the I/O backplane. 6. Remove the power supplies. (See “D10 Power Supply” on page 107). 7. Remove the RIO bus adapter. (See “D10 RIO Bus Adapter Assembly” on page 105). 8. Remove the two screws that hold the SPCN connector card, and remove it from the I/O backplane. 9. Remove the screws that hold the PCI adapter mounting-guide, and remove the guides from the I/O backplane. 10. Remove the screws that secure the I/O backplane to the subsystem chassis. 11. Lift the I/O subsystem backplane straight up and out of the subsystem chassis. Note: If you are replacing the I/O subsystem backplane, mark and remove the VPD module (see “D10 I/O Backplane Locations” on page 12 for VPD module location) from the old backplane and move it to the replacement I/O subsystem backplane. Replacement Note: If the power supplies are not removed from the I/O subsystem chassis, remove them before replacing the I/O subsystem backplane. 1. Place the I/O subsystem backplane assembly into the chassis. Align the backplane with the mounting screw holes. Note: If you are replacing an I/O subsystem backplane with a new one, remove the VPD module from the old backplane and move it to the replacement I/O subsystem backplane. For VPD module location, see “D10 I/O Backplane Locations” on page 12. Put the module from the new backplane onto the old backplane. 2. Position the PCI adapter mounting-guides on the I/O backplane, and install the screws to secure the mounting-guides. 3. Secure the backplane with the remaining mounting screws removed earlier. 4. Reconnect the fan cable to the I/O backplane. 5. Install the new SPCN connector card on the new I/O subsystem backplane using the screws that you removed earlier. Note: The I/O backplane and the SPCN connector card are replaced as a pair. 6. Reinstall the RIO bus adapter card. (See “D10 RIO Bus Adapter Assembly” on page 105). 7. Reinstall the power supplies. (See “D10 Power Supply” on page 107). 8. Reinstall the cover. (See “D10 Covers” on page 44). 9. Reinstall all PCI adapters. (See “D10 PCI Adapters” on page 45). 10. Return the I/O subsystem to the operating position (see “D10 Operating Position” on page 43) and connect RIO, SPCN, and adapter cables. 11. Connect the power cables. 12. Power the system on. 106 D10 and D20 Service Guide D10 Power Supply Before performing the following procedure, read the “Safety Considerations” on page 41. Removal Attention: Do not remove two power supplies at the same time if performing hot-plug procedures. Power supplies are considered as hot-pluggable FRUs only if you remove one power supply at a time. The power supplies can be removed from the front of the I/O subsystem. 1. Disconnect the power cord from the power supply. 2. Unlatch the power supply handle and rotate the handle downward to unseat the power supply. Attention: Do not remove a power supply for more than four minutes. If you cannot replace the power supply in less than four minutes, shut down the system and then remove the power supply. 3. Pull the power supply straight out from the I/O subsystem. Replacement To replace the power supply, perform the steps in the removal procedure in reverse order. Chapter 3. Removal and Replacement Procedures 107 Removal and Replacement Procedures for the D20 I/O Subsystem Notes: 1. All of the parts in the D20 I/O Subsystem can be replaced with the subsystem installed in a rack. The power supplies, blowers, disk drives, and PCI adapters are hot-pluggable. Under normal conditions, during the swapping of these FRUs, there is no need to shut down or unplug the system. 2. Before performing any of the removal or replacement procedures in this chapter, read the danger and caution notices on “Safety Considerations” on page 41. D20 FRU Replacement Procedure List Field Replaceable Unit (FRU) FRU Procedure Name and Page Location Adapters “D20 PCI Adapters” on page 123 Bezel “D20 Front Bezel” on page 112 Cover “D20 Service Access Cover” on page 111 Disk Drives “D20 Hot-Plug Disk Drives” on page 137 Disk Drive Backplanes “D20 Disk Drive Backplane” on page 144 Disk Drive 12 Pack Cage “D20 Disk Drive Cage” on page 142 Disk Drive Configuration or Deconfiguration “Deconfiguring (Removing) or Configuring a Disk Drive” on page 137 Blowers “D20 Blowers” on page 115 I/O Backplane “D20 I/O Backplane Assembly” on page 151 Operating Position “D20 Operating Position” on page 110 Operator Panel “D20 Operator Panel” on page 113 Power Supplies “D20 Power Supplies” on page 146 Power Bulkhead “D20 Power Supply Bulkhead” on page 148 RIO Bus Adapter “D20 RIO Bus Adapter Assembly” on page 120 Service Position “D20 Service Position” on page 109 Static-Sensitive Devices “Handling Static-Sensitive Devices” on page 42 Stopping and Starting the System “Stopping and Starting the System” on page 42 108 D10 and D20 Service Guide D20 Service Position Attention: When placing your D20 subsystem into the service position, all stability plates must be firmly in position to prevent the rack from toppling. Ensure that only one system drawer is in the service position at a time. Before doing any service actions inside your D20 subsystem, put the D20 subsystem into the service position. To place the drawer into the service position, do the following: 1. Open the front rack door. 2. If your D20 subsystem is equipped with two blue thumbscrews securing it to the rack, remove the thumbscrews at this time. The screws are located on the bezel, just above each subsystem release latch. 1 Front Bezel 2 Transport Retaining Screw for System Drawer 3. Release the subsystem release latches located on the left and right side. 4. Pull the system drawer out from the rack until the rails are fully extended. Note: When the system rails are fully extended, safety latches on the slide rails lock into place. This action prevents the system from being accidentally pulled out too far and dropped. The following illustration shows a system drawer in the service position. Chapter 3. Removal and Replacement Procedures 109 1 D20 Subsystem 4 Stabilizing Bar 2 19-Inch Rack 5 Front Rack Door 3 Extension Rail Safety Release Latch 6 System Drawer Release Latch After completing the service actions, return the D20 subsystem to the operating position. D20 Operating Position To return your D20 subsystem to the operating position, do the following: 1. Release the rail release latches located on each rail. See the illustration on page 110 for the location of the release tabs. Note: To release the left rail, pull the release latch up. To release the right rail, push the right latch down. 2. Release latches must be in the open position. See the illustration on page 110 for the location of the subsystem release latches. 3. Push the system drawer straight back into the rack until both release latches on the subsystem have locked into position. 4. If you removed two thumbscrews from the front bezel, replace them at this time. 5. Close the front rack door. 110 D10 and D20 Service Guide D20 Service Access Cover Before performing the following procedure, read the “Safety Considerations” on page 41. Opening the Service Access Cover To open the service access cover, do the following: 1. Open the front rack door and place the D20 subsystem into the service position as described in “D20 Service Position” on page 109. 2. Loosen the three captive thumbscrews located on the rear of the cover. See the following illustration for thumbscrew locations. Note: The service access cover pivots on a piano hinge located directly behind the four cooling blowers. 3. To open the service access cover, lift the cover up from the back edge. The hinges allow the service access cover to swing open to about 170 degrees. Do not force the cover to come to a rest on top of the cooling blowers. Note: When you open the service access cover, ensure you have enough height clearance. 4. Close and then secure the service access cover with the three thumbscrews located on its back edge. Attention: For proper cooling and airflow, close the cover before turning on the system. Operating the system for extended periods of time (over 30 minutes) with the cover opened might damage the system components. 1 Thumbscrews 3 D20 Subsystem 2 Service Access Cover Service Access Cover Removal If you need to remove the service access cover from the subsystem, do the following: 1. Open the cover as described in the previous procedure. 2. Remove the four retaining screws that secure the cover hinge to the chassis. 3. Remove the cover. Service Access Cover Replacement To install the service access cover, do the following: 1. Open the hinge, located on the service access cover. 2. Align the four holes located on the hinge with the four screw holes located on the top of the chassis. 3. Start and then tighten the four retaining screws that secure the service access cover to the chassis. 4. Close and then secure the service access cover with the three thumbscrews located on its back edge. Chapter 3. Removal and Replacement Procedures 111 D20 Front Bezel Before performing the following procedure, read the “Safety Considerations” on page 41. Front Bezel Removal To remove the front bezel, do the following: 1. Open the front rack door. If necessary, place the D20 subsystem into the service position as described in “D20 Service Position” on page 109. 2. Simultaneously press in both bezel-release tabs. See the following illustration for bezel-release tab locations. 3. Pivoting the bezel from the bottom, swing the top of the bezel out. 4. Pull the bottom of the bezel up, and then away from the subsystem chassis. This action releases the two tabs located on the bottom of the bezel. 5. Put the bezel in a safe place. 1 Rack Door 3 Bezel-Release Tab (Located on each side of bezel) 2 Bezel 4 19-Inch Rack Front Bezel Replacement To replace the front bezel, do the following: 1. Open the front rack door. If necessary, put the D20 subsystem into the service position as described in “D20 Service Position” on page 109. 2. Insert the two tabs located on the bottom edge of the bezel into their locking slots, located on the chassis. 3. Pivot the front bezel up toward the top of the chassis. 4. Align the release tabs to the matching slots located on the front of the subsystem chassis. 5. Gently push the tabs into the slots until the bezel seats against the front of the subsystem. 6. If the subsystem is in the service position, put the subsystem back into the operating position as described in “D20 Operating Position” on page 110. 7. Close the rack door. 112 D10 and D20 Service Guide D20 Operator Panel Before performing the following procedure, read “Safety Considerations” on page 41. 1 Power LED (Green) 2 Attention LED (Amber) 3 Operator Panel 4 Snap Buttons Operator Panel Removal To remove the operator panel from the subsystem, do the following: 1. Open the rack front door. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. If necessary, put the D20 subsystem into the service position as described in “D20 Service Position” on page 109. 4. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. Remove the front bezel as described in “Front Bezel Removal” on page 112. 6. Locate the operator panel, as shown in the following illustration. 7. Pull out the two snap buttons located on each side of the operator panel. 8. Remove the operator panel by grasping its edges and slowly pulling it out of its bay. 9. Disconnect the power/signal cable connector from the back of the operator panel. 10. Put the operator panel in a safe place. Chapter 3. Removal and Replacement Procedures 113 1 Rack Front Door 4 Snap Buttons 2 D20 5 Front Bezel 3 Operator Panel Operator Panel Replacement To replace the operator panel, do the following: 1. Connect the power/signal cable connector to the back of the operator panel. 2. Carefully slide the operator panel back into the system bay. 3. Ensure that the two snap buttons are in the unlocked position. Note: Snap buttons are in the unlocked position when the button component is pulled back as far as possible. 4. Push the operator panel into the bay until it comes to rest against the subsystem chassis. 5. Push in each snap button, locking the operator panel against the subsystem chassis. 6. Replace the front bezel as described in “Front Bezel Replacement” on page 112. 7. If the system is in the service position, return it to the operating position as described in “D20 Operating Position” on page 110. 8. Reconnect the power source to the system. 9. Power on the system as described in “Stopping and Starting the System” on page 42. 10. Close the rack front door. 114 D10 and D20 Service Guide D20 Blowers Before performing the following procedure, read “Safety Considerations” on page 41. Each cooling blower assembly has a green LED and an amber LED. The green LED indicates when the cooling blower is receiving power. A blinking amber LED indicates when the cooling blower needs attention from a potential cooling problem. When you are standing in front of and facing the I/O subsystem, the cooling blowers are labeled from left to right, 1 through 4, across the top blower retaining bracket. Attention: If the I/O subsystem does not have two functional power supplies installed, do not remove a blower with the power turned on. If only one power supply is functional, removing of a blower will cause the system to turn power off. Hot-Swap Blower Removal To remove a hot-swap blower, do the following: 1. Put the D20 subsystem into the service position as described in “D20 Service Position” on page 109. 2. Locate the cooling blower assemblies, as shown in the following illustration. 3. Pull out the snap button located on the front flange of the failing blower. 4. Using the snap button as a handle, pull the front of the blower up, allowing the back edge to pivot on a recessed ledge located just below the back edge of the blower. Note: Using the snap button, lift the blower from the chassis. The blower connector will disconnect from its docking connector, which is located on the underside of the blower. 5. Pull the blower away from the system. Ensure that the two back tabs, located on the bottom back edge of the blower assembly, have cleared their retaining slots. Chapter 3. Removal and Replacement Procedures 115 1 Green LED 4 Blower Connector 2 Amber LED 5 Hot-Plug Blower Assembly 3 Snap Button 6 Docking Connector 116 D10 and D20 Service Guide Hot-Swap Blower Replacement Note: If a blower assembly is being replaced for a redundant failure, after the service repair action is completed, ask the customer to check the crontab file for any power/cooling warning messages. When a power or cooling error is encountered, AIX adds an entry to the crontab file to ²wall² a warning message every 12 hours, to alert or remind the customer of the problem. Replacing the faulty part does not clear this crontab entry, so unless the crontab file is edited to remove this entry, the customer continues to be reminded of the failure despite its having been repaired. The crontab -l command reads the crontab file to determine if an entry exists. The crontab -e command edits the file. To replace a hot-swap blower, do the following: 1. Grasp the blower by its snap button, and ensure that the snap button is in the unlocked position. Note: Snap buttons are in the unlocked position when the button component is pulled back as far as possible. 2. Install the two alignment tabs located on the lower back edge of the blower assembly into the two retaining slots located on the chassis ledge. 3. Pivoting from the back, lower the front of the blower onto its docking connector. 4. Push down on the front of the blower until the front flange of the blower is seated on top of the chassis ledge. 5. Push down on the snap button, locking the blower in place. 6. Return the subsystem back to the operating position as described in “D20 Operating Position” on page 110. 7. Close the rack door. Chapter 3. Removal and Replacement Procedures 117 D20 Cooling Blower Cable This procedure must be performed with the power turned off. Cooling Blower Cable Removal To remove the cooling blower cable, do the following: 1. Open the rack front door. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. Put the D20 subsystem into the service position as described in “D20 Service Position” on page 109. 4. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. Remove the four cooling blowers as described in “Hot-Swap Blower Removal” on page 115. 6. Open the service access cover as described in “Opening the Service Access Cover” on page 111. 7. Remove the power supplies and, if applicable, the power supply filler. 8. If necessary, remove the power supply bulkhead as described in “Power Supply Bulkhead Removal” on page 148. 9. Disconnect the blower cable from the I/O board. 10. Unclip the three cable ties from the chassis and the three cable ties from the blower connector-support bracket. 11. Remove the blower docking connectors from the chassis. To remove each blower docking connector from the chassis, do the following: a. Depress the connector release tab next to the blower docking connector. The connector release tab has a small tab retaining clip that locks the blower docking connector into the blower connector support bracket. Depress the connector release tab far enough so that the tab retaining clip clears the bottom of the blower connector-support bracket. b. With the connector release tab still depressed, slide the blower docking connector toward the depressed tab. c. Align the blower docking connector’s four retaining ears with the four small slots located on the blower connector-support bracket. d. Push down on the blower docking connector to free it from the blower connector-support bracket. 12. Remove the cable from the subsystem. 118 D10 and D20 Service Guide 1 Cooling Blowers 6 Connector Retaining Ears 2 Cable Ties 7 Blower Docking Connector 3 Power Supply 8 D20 4 Tab Retaining Clip 9 Blower Connector Support 5 Connector Release Tab Cooling Blower Cable Replacement Replace in reverse order. Chapter 3. Removal and Replacement Procedures 119 D20 RIO Bus Adapter Assembly Before performing this procedure, read “Safety Considerations” on page 41. In the following RIO bus adapter removal and replacement procedures, the RIO bus adapter assembly is referred to as the RIO adapter. RIO Bus Adapter Removal To remove the RIO adapter from the subsystem, do the following: 1. Open the rack front door. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. Put the D20 subsystem into the service position as described in“D20 Service Position” on page 109. 4. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. Open the service access cover as described in “Opening the Service Access Cover” on page 111. 6. Disconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem. 7. Identify, and then disconnect and label all cables that cross over the top of the RIO adapter. These cables might interfere with the removal and installation of the RIO adapter. 8. Release the release latches located on top of the RIO adapter. 9. Simultaneously lift both retention handles. 10. Pivot the release handles up until they are perpendicular (90 degrees) to the top of the RIO adapter. Note: By placing the handles perpendicular to the top of the RIO adapter, the base or hinged portion of each handle acts as a cam and will gently pry the RIO adapter up, disconnecting it from its docking connector. 11. Remove the RIO adapter from the subsystem chassis, and put it in a safe place. 120 D10 and D20 Service Guide 1 RIO Bus Adapter Release Handle 4 Service Access Cover 2 RIO Bus Adapter Release Latch 5 D20 Subsystem 3 RIO Bus Adapter Chapter 3. Removal and Replacement Procedures 121 RIO Bus Adapter Replacement To replace the RIO bus adapter, do the following: 1. Grasp the two RIO adapter release handles. 2. Pivot both handles upward to 90 degrees, ensuring that the handles are perpendicular to the RIO adapter. The handle cams have now been placed into the correct position to assist you when seating the RIO adapter into its docking connector. 3. Before inserting the RIO adapter into its bay, observe the alignment bracket. The alignment bracket is secured to the power bulkhead. 4. Insert the RIO adapter into its bay. Ensure that the power cable receptacle located on the back of the RIO adapter is facing the back of the subsystem chassis. 5. Lower the RIO adapter through the alignment bracket. The alignment bracket will catch the back edge of the RIO adapter closest to it. The RIO adapter should now be resting on the top of its docking connector. The docking connector has two large alignment pins located on each end. These alignment pins will ensure alignment of the RIO adapter to its docking connector when seated. 6. Lower the RIO adapter locking handles, carefully seating the RIO adapter into the docking connector. The plastic latch located beneath each handle clicks when the RIO adapter is fully seated. This click also indicates that the handle is locked in the closed position. 7. Reconnect the RIO-2 cables to the RIO adapter connectors located on the back of the chassis. 8. Reconnect the cables that were disconnected during the RIO adapter removal. 9. Reconnect the power source to the system. 10. Close and then secure the service access cover with the three thumbscrews located on its back edge. 11. Return the D20 to the operating position as described in “D20 Operating Position” on page 110. 12. Power on the system as described in “Stopping and Starting the System” on page 42. 13. Close the rack front door. 122 D10 and D20 Service Guide D20 PCI Adapters Before performing this procedure, read “Safety Considerations” on page 41. Notes: 1. Some PCI adapter cards are shipped from the manufacturer with a blue handle or support along the back edge of the card. In this system unit, you must remove the blue handle or support from the card. 2. With this system, you can install PCI adapters with the power on. These adapters are referred to as hot-pluggable PCI adapters. Some adapters are not hot-pluggable, and the system must be powered off for this type of adapter installation. Before you install the adapter, determine if the PCI adapter you are installing is hot-pluggable. See the PCI Adapter Placement Reference, order number SA23-2504. If you are removing a PCI adapter that is: v Non-hot-pluggable, go to “Non-Hot-Pluggable PCI Adapter Removal” on page 126. v Hot-pluggable, go to “Hot-Pluggable PCI Adapter Removal” on page 129. PCI Hot-Plug Manager Access The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is appropriate to perform them. Note: A PCI adapter is only hot-pluggable if the PCI adapter is supported for hot-plug applications. For complete information regarding your PCI adapter, see the PCI Adapter Placement Reference, order number SA23-2504. Accessing Hot-Plug Management Functions: Note: Removing or installing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. This publication is also contained on the AIX Documentation CD. The documentation is made accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM drive. Chapter 3. Removal and Replacement Procedures 123 To access the hot-plug menus, do the following: 1. Log in as root user. 2. At the command line, type smitty. 3. Select Devices. 4. Select PCI Hot Plug Manager and press Enter. 5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. The following section describes the menu options. PCI Hot-Plug Manager Menu: The following options are available from the PCI Hot Plug Manager menu: Note: For information about the PCI slot LED states, see “D20 PCI-X Slots” on page 16. List PCI Hot-Plug Slots Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the adapter in that slot. Add a PCI Hot-Plug Adapter Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot will go into the Action state and finally into the On state. Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure Devices Added After IPL option to configure the adapter. Replace/Remove a PCI Hot-Plug Adapter Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For this option to work, the adapter must be in the Defined state (see the “Unconfigure a Device” option). You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the Action state. Identify a PCI Hot-Plug Slot Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See “D20 PCI-X Slots” on page 16. 124 D10 and D20 Service Guide Unconfigure a Device Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use. This step must be completed successfully before starting any removal or replacement operation. If this step fails, the customer must take action to release the device. Configure a Defined Device Allows a new PCI adapter to be configured into the system if software support is already available for the adapter. The selected PCI slot will go into the On state. Install/Configure Devices Added After IPL The system attempts to configure any new devices and tries to find and install any required software from a user-selected source. The add, remove, and replace functions return information to the user indicating whether the operation was successful. If additional instructions are provided on the screen, complete the recommended actions. If the instructions do not resolve the problem, do the following: v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to configure the adapter. v If you receive a warning indicating that needed device packages are not installed, the system administrator must install the specified packages before you can configure or diagnose the adapter. v If you receive a failure message indicating a hardware error, the problem might be either the adapter or the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different adapter in the slot. If you determine that you have failing hardware, call your service representative. v Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering. Consult with your system administrator or software support to determine the correct method to configure the replacement device. Chapter 3. Removal and Replacement Procedures 125 Non-Hot-Pluggable PCI Adapter Removal To remove a PCI adapter, do the following: 1. Open the front and rear rack doors, and place the subsystem into the service position as described in “D20 Service Position” on page 109. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. Disconnect the power source to the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the system power source has been completely disconnected. 4. Open the service access cover as described in “Service Access Cover Removal” on page 111. 5. Determine which adapters you plan to remove, and then label and disconnect all cables attached to that adapter. 6. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the subsystem unit. 7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 1 Hot-Plug PCI Adapters 2 Service Access Cover 3 D20 Subsystem 126 D10 and D20 Service Guide 8. Rotate the adapter retainer clip counterclockwise, as shown in the following illustration. 9. Lift the adapter retaining seat that is resting on the adapter retention bracket. 10. Carefully grasp the PCI adapter by its top edge or upper corners, and remove it from the system. 11. Store the adapter in a safe place. 12. If you do not plan to install another adapter into the vacated slot, seal the expansion slot using an expansion-slot cover. If you are installing another adapter, go to “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 128. 13. If you have other options to install, refer to “D20 FRU Replacement Procedure List” on page 108. If you do not have other options to install, continue on to the next step. 14. Close and then secure the service access cover with the three thumbscrews located on its back edge. 15. Connect the adapter cables. 16. Reconnect the power source to the system. 17. Route the cables through the cable-management arm. 18. Push the system drawer back into the operating position as described in “D20 Operating Position” on page 110. 19. Power on the system as described in “Stopping and Starting the System” on page 42. 20. Close the rack doors. 1 D20 2 Service Access Cover 3 Retainer Clip 4 Retainer Seat 5 PCI Adapter Chapter 3. Removal and Replacement Procedures 127 Adding or Replacing a Non-Hot-Pluggable PCI Adapter To add or replace an adapter, do the following: 1. Open the front and rear rack doors, and place the system into the service position as described in “D20 Service Position” on page 109. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. Disconnect the power source to the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the system has been completely disconnected from its power source. 4. Open the service access cover as described in “Service Access Cover Removal” on page 111. 5. Refer to the PCI Adapter Placement Reference for information regarding slot restrictions for adapters that can be used in this subsystem. 6. If necessary, remove the adapter expansion slot shield. 7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 8. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold-edge connectors on the adapter. 9. Place the adapter, component-side up, on a flat, static-protective surface. 10. Set any jumpers or switches as instructed by the adapter’s manufacturer. 11. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the PCI riser card. 12. Press the adapter firmly into its connector. Attention: When you install an adapter into your subsystem, be sure that it is completely and correctly seated in its connector. 13. Lower the adapter retaining seat onto the PCI adapter EMC shield/connector faceplate. Rotate the adapter retainer clip over the adapter retaining seat until it comes to a stop. 14. Close and then secure the service access cover with the three thumbscrews located on its back edge. 15. Connect the adapter cables. 16. Reconnect the power source to the system. 17. Route the cables through the cable-management arm. 18. Push the system drawer back into the operating position as described in “D20 Operating Position” on page 110. 19. Power on the system as described in “Stopping and Starting the System” on page 42. 20. Close the rack doors. 128 D10 and D20 Service Guide Hot-Pluggable PCI Adapter Removal Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must also be taken offline. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. This publication is also contained on the AIX Documentation CD. The documentation is made accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM drive. To remove a hot-pluggable PCI adapter, do the following: 1. Open the front and rear rack doors, and place the system into the service position as described in “D20 Service Position” on page 109. 2. Open the service access cover as described in “Service Access Cover Removal” on page 111. 3. Determine which adapters you plan to remove. 4. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system unit. 5. Ensure that any processes or applications that might use the adapter are stopped. 6. Refer to “PCI Hot-Plug Manager Access” on page 123, and follow the steps in the access procedure to select PCI Hot Plug Manager. Then return here to continue. 7. Select Unconfigure a Device and press Enter. 8. Press F4 to display the Device Names menu. 9. Select the adapter you are removing. 10. Use the Tab key to answer NO to Keep Definition and YES to Unconfigure Child Devices. Press Enter. 11. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 12. Label and disconnect all cables attached to that adapter. 13. Press F3 to return to the PCI Hot-Plug Manager menu. Chapter 3. Removal and Replacement Procedures 129 14. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI Hot-Plug Adapter menu displays. Note: If the PCI Hot-Plug Adapter menu indicates that the slot for the just replaced adapter is reading empty, do the following: a. From the Replace/Remove a PCI Hot-Plug Adapter menu, select F3 to cancel. b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI slot is empty. c. On the display screen, select the indicated empty slot. Look at the adapter you intend to replace. A slow blinking amber LED located next to the adapter on the PCI riser card is indicating that the slot has been identified. Press Enter. The adapter, has been put into the action state. d. Completely remove the hot-plug adapter from the slot. The LED goes off. e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter. f. Select F3 to return to the PCI Hot-Plug Adapter Menu. g. Select Install/Configure Devices Added after IPL. h. At the display screen, identify the PCI hot-plug slot that your adapter is using. If it still shows as being empty, redo steps a through g. However, before you reinstall the adapter, test it. If the adapter fails the test, redo steps a through g with a new adapter. 15. Move the cursor to select the adapter that you are removing and press Enter. (The description entry displays as unknown). 16. Press the Tab key until the entry field displays the remove operation and then press the Enter key. Follow the instructions that display on the screen until you are instructed to remove the adapter. 17. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed. 18. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 1 Hot-Plug PCI Adapters 2 Service Access Cover 3 D20 Subsystem 130 D10 and D20 Service Guide 19. Turn the retainer clip and lift the adapter retaining seat off the adapter. 1 D20 2 Service Access Cover 3 Retainer Clip 4 Retainer Seat 5 PCI Adapter 20. Carefully grasp the adapter by the edges and pull it straight out from the PCI riser card. 21. If you are not installing another adapter in this slot, place an expansion slot cover in the adapter slot opening. 22. Lower the plastic retainer seat over the PCI adapter faceplate. 23. Rotate the locking latch clockwise until it clicks into the locked position. 24. Continue to follow the screen instructions until you receive a message that the adapter removal is successful. Successful removal is indicated by the OK message displayed next to the Command field at the top of the screen. 25. If you do not have other adapters to remove, continue with the next step. OR If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu, and then return to step 14 on page 130. 26. Press F10 to exit the Hot-Plug Manager. 27. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. Chapter 3. Removal and Replacement Procedures 131 28. If you have other options to install, refer to “D20 FRU Replacement Procedure List” on page 108. If you do not have other options to install, continue on to the next step. 29. Replace the service access cover as described in “Service Access Cover Replacement” on page 111. 30. Push the system drawer back into the operating position as described in “D20 Operating Position” on page 110. 31. Connect the adapter cables. 32. Route the cables through the cable-management arm. 33. Close the rack doors. Replacing a Hot-Pluggable PCI Adapter Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Notes: 1. Use this procedure only when you are replacing an adapter with an identical adapter. If you are replacing an adapter with an adapter that is not identical to the adapter removed, go to “Hot-Pluggable PCI Adapter Removal” on page 129 and “Installing a Hot-Pluggable PCI Adapter” on page 135. 2. Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. This publication is also contained on the AIX Documentation CD. The documentation is made accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM drive. To replace a hot-plug PCI adapter, do the following: 1. Open the front and rear rack doors, and place the system into the service position as described in “D20 Service Position” on page 109. 2. Remove the service access cover as described in “Service Access Cover Removal” on page 111. 3. Refer to the PCI Adapter Placement Reference, order number SA23-2504, for information regarding slot restrictions for adapters that can be used in this system. 4. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 5. Determine the slot from which you are removing the adapter. 6. Ensure that any processes or applications that might use the adapter are stopped. 7. Refer to “PCI Hot-Plug Manager Access” on page 123, and follow the steps in the access procedure to select PCI Hot Plug Manager. Then return here to continue. 132 D10 and D20 Service Guide 8. Select Unconfigure a Device and press Enter. Note: If the adapter or device has a problem that prevents it from being replaced by hot-plug, it must be replaced by shutting down the system. Go to “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 128. 9. Press F4 to display the Device Names menu. 10. From the menu, select the adapter you are removing. 11. Use the tab key to answer YES to Keep Definition and YES to set Unconfigure Child Devices. Press Enter. 12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press F3 to return to the PCI Hot-Plug Manager menu. 14. Select List PCI Hot-Plug Slots and press Enter. Note: If the List PCI Hot-Plug Slots menu indicates that the slot for the adapter you intend to replace or reseat is empty, do the following: a. Press F3 to cancel. b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI slot is empty. c. On the display screen, select the indicated empty slot. Examine at the adapter you intend to replace or reseat. A slow blinking amber LED located on the PCI riser card is indicating that the slot has been identified. Press Enter. The adapter is now in the action state. d. Completely remove the hot-plug adapter from the slot. The LED goes off. e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter. f. Press F3 to return to the ²PCI Hot-Plug Adapter Menu². g. Connect the appropriate cables and devices to the adapter. h. Select List PCI Hot-Plug Slots, then press Enter. i. Return to the error code or procedure that sent you here, if the ²List PCI Hot-Plug Slots² menu indicates that the slot for the just-replaced or reseated adapter is empty. 15. Press F3 to cancel. 16. Select Replace/Remove a PCI Hot-Plug Adapter, then press Enter. The ²Replace/Remove a PCI Hot-Plug Adapter menu displays. Note: If the PCI Hot-Plug Adapter menu indicates that the slot for the just replaced adapter is reading empty, do the following: a. From the Replace/Remove a PCI Hot-Plug Adapter menu, press F3 to cancel. b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI slot is empty. c. On the display screen, select the indicated empty slot. Examine at the adapter you intend to replace. A slow blinking amber LED located on the PCI riser card is indicating that the slot has been identified. Press Enter. The adapter has been put into the action state. d. Completely remove the hot-plug adapter from the slot. The LED goes off. e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter. f. Press F3 to return to the PCI Hot-Plug Adapter Menu. g. Select Install/Configure Devices Added after IPL. h. At the display screen, identify the PCI hot-plug slot that your adapter is using. If it still shows as being empty, redo steps a through g. However, before you reinstall the adapter, test it. If the adapter fails the test, redo steps a through g with a new adapter. Chapter 3. Removal and Replacement Procedures 133 17. Move the cursor to select the adapter that you are removing, and press Enter. 18. Press the Tab key until the entry field displays the replace operation, and then press Enter. Follow the instructions that display on the screen until you are instructed to remove the adapter. 19. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed. 20. Turn the retainer clip, and lift the adapter retaining seat off the top of the adapter. 21. Carefully grasp the adapter by the edges, and pull it straight up and out of the subsystem. 22. If necessary, remove the replacement adapter from the antistatic package. Attention: Avoid touching the components and gold-edge connectors on the adapter. 1 D20 2 Service Access Cover 3 Retainer Clip 4 Retainer Seat 5 PCI Adapter 23. Place the adapter, component-side up, on a flat, static-protective surface. 24. Set any jumpers or switches as instructed by the adapter manufacturer. 25. Replace the adapter into the adapter slot. Carefully grasp the adapter by the edges and align the adapter in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing a full-length adapter, ensure that both ends of the adapter engage the card guides. 26. Lower the adapter retaining seat over the PCI adapter faceplate. Rotate the retainer clip until it stops. 27. Connect appropriate cables and devices to the adapter. 28. Continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 29. Press F3 to return to the PCI Hot-Plug Manager menu. 134 D10 and D20 Service Guide 30. Select Install/Configure Devices Added After IPL and press Enter. Follow the instructions on the screen. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering. Consult with your system administrator or software support to determine the correct method to configure the replacement device. 31. If you do not have other adapters to replace, continue with the next step. OR If you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 14 on page 133. 32. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. 33. Close and then secure the service access cover with the three thumbscrews located on its back edge. 34. Connect the adapter cables. 35. Route the cables through the cable-management arm. 36. Push the system drawer back into the operating position as described in “D20 Operating Position” on page 110. 37. Close the rack doors. Installing a Hot-Pluggable PCI Adapter Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Note: This system supports PCI hot-pluggable adapters, so the system does not need to be shut down and the power cables removed before adding or replacing an adapter. To add or install an adapter, do the following: 1. Open the front and rear rack doors, and place the system into the service position as described in “D20 Service Position” on page 109. 2. Open the service access cover as described in “Service Access Cover Removal” on page 111. 3. Refer to the PCI Adapter Placement Reference, order number SA23-2504, for information regarding slot restrictions for adapters that can be used in this system. 4. If necessary, remove the adapter expansion slot shield. 5. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 6. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold-edge connectors on the adapter. 7. Place the adapter, component-side up, on a flat, static-protective surface. 8. Set any jumpers or switches as described by the adapter’s manufacturer. Chapter 3. Removal and Replacement Procedures 135 9. Refer to “PCI Hot-Plug Manager Access” on page 123, and follow the steps in the access procedure to select PCI Hot Plug Manager. Then return here to continue. 10. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The Add a Hot-Plug Adapter window displays. 11. See the PCI Adapter Placement Reference, order number SA23-2504, for adapter placement information. Then select an empty PCI slot for the adapter. 12. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter. 13. Rotate the retainer clip, lift the adapter retainer seat, and remove the expansion slot cover. 1 D20 2 Service Access Cover 3 Retainer Clip 4 Retainer Seat 5 PCI Adapter 14. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is set to the Action state. See “D20 PCI-X Slots” on page 16. 15. When you are instructed to install the adapter in the adapter slot, carefully grasp the adapter by the edges and align the adapter in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing a full-length adapter, ensure that both ends of the adapter engage the card guides. 16. Lower the adapter retainer seat onto the PCI adapter faceplate. Rotate the adapter retainer clip until it stops. 17. Close and then secure the service access cover with the three thumbscrews located on its back edge. 18. Connect the adapter cables. 19. Route the adapter cables through the cable-management arm. 20. Push the system drawer back into the operating position as described in “D20 Operating Position” on page 110. 21. Close the rack doors. 136 D10 and D20 Service Guide D20 Hot-Plug Disk Drives Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Before performing the following procedure, read “Safety Considerations” on page 41. Note: You do not have to power off the system to remove a hot-plug disk drive. Before you perform these procedures, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration. Physically removing a hot-plug disk drive from the system before it has been removed from the system configuration can cause unrecoverable data corruption. Deconfiguring (Removing) or Configuring a Disk Drive Use this procedure if you are removing or installing a drive while the system power is turned on. Deconfiguring (Removing) Before removing a disk drive from a volume group, do the following: 1. Log in as root user. 2. Before a disk can be removed from a volume group, all data must be backed up and the data removed from the disk to be replaced. Use AIX Logical Volume Manager to perform this task. 3. At the command line, type smitty. 4. Select System Storage Manager (Physical and Logical Storage) and press Enter. 5. Select Logical Volume Manager and press Enter. 6. Select Volume Groups and press Enter. 7. Select Set Characteristics of a Volume Group and press Enter. 8. Select Remove a Physical Volume from a Volume Group. 9. Press F4 to list the available volume groups, then select the volume group name and press Enter. 10. Press F4 to select a physical volume, and follow the instructions on the screen to select the physical volume. Then press Enter. 11. Press F3, Cancel, to go back to the first menu and select System Storage Management (Physical and Logical Storage). 12. Select Removable Disk Management. 13. Select Remove a Disk. 14. Select the desired disk from the list on the screen and press Enter. Chapter 3. Removal and Replacement Procedures 137 15. Follow the instructions on the screen to remove the drive. 16. When you are asked ²Are you sure?², press Enter. The power LED on the drive that you selected will remain on. 17. Remove the disk drive by pulling the disk drive lever toward you until it is completely open. 18. Remove the disk drive from the slot. The LED on the top of the slot will turn off when the disk drive is removed. 19. Press F10 to exit smitty. Configuring (Replacing) 1. Remove the disk drive from its protective packaging, and open the drive latch handle. 2. Install the disk drive in the drive slot. Align the disk drive with the drive slot rails, and slide the disk drive into the slot until it contacts the backplane at the rear of the drive bay. The drive should be in far enough for the latch handle to engage the latch. Push the disk drive lever up and to the rear to lock the disk drive. The LED above the slot will turn on. 3. Log in as root user. 4. At the command line, type smitty. 5. Select Devices. 6. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is indicated by the OK message displayed next to the Command field at the top of the screen 7. Press F3, Cancel, to go back to the first menu and select System Storage Management (Physical and Logical Storage) and press Enter. 8. Select Logical Volume Manager and press Enter. 9. Select Volume Groups and press Enter. 10. Select Set Characteristics of a Volume Group and press Enter. 11. Select Add a Physical Volume to a Volume Group. 12. Fill in the fields for the drive you are adding to the system. Press F4 for a list of selections. 13. See the AIX System Management Guide: Operating System and Devices to finish the drive configuration. 14. Press F10 to exit smitty. 138 D10 and D20 Service Guide Hot-Plug Disk Drive Removal Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Attention: Before you perform the following procedure, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration. Physically removing a hot-plug drive from the system before it has been removed from the system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or Configuring a Disk Drive” on page 137. To remove a hot-plug disk drive, do the following: 1. Before removing the disk, ensure that all data has been saved. 2. Deconfigure the drive that you are removing from the system. See “Deconfiguring (Removing) or Configuring a Disk Drive” on page 137. 3. Open the rack front door. 4. Remove the front bezel as described in “Front Bezel Removal” on page 112. 5. Locate the hot-plug disk drive you plan to remove. If you are removing a faulty disk drive, a lit amber LED located at the front right side of each disk drive will help you isolate the faulty drive. Before physically removing any disk drive, do the following: a. Log in as root user. b. At the command line, type smitty. c. Select System Storage Management (Physical and Logical Storage). d. Select Removable Disk Management. e. Select Remove a Disk. f. Select the desired disk from the list on the screen and press Enter. 6. Remove the disk drive by placing the handle on the disk drive into the open position (perpendicular to the drive). 7. Pull the hot-plug disk drive out of the disk drive bay. The lit LED on the disk drive will turn off when the disk drive is removed. 8. Press F10 to exit smitty. Chapter 3. Removal and Replacement Procedures 139 1 Rack Front Door 3 Disk Drive Release Handle 2 D20 4 Front Bezel 140 D10 and D20 Service Guide Hot-Plug Disk Drive Replacement Attention: The Linux operating system does not support these hot-plug procedures. Also, Linux does not support hot-plugging any hot-pluggable PCI adapters. Systems with Linux installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. Follow the non-hot-pluggable adapter procedures when replacing a PCI adapter in any partition with Linux installed. Attention: Before you perform these procedures, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration. Physically removing a hot-plug drive from the system before it has been removed from the system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or Configuring a Disk Drive” on page 137. Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering. Consult with your system administrator or software support to determine the correct method to configure the replacement disk. 1. To install a hot-plug disk drive, do the following: a. Ensure the tray handle is open (perpendicular to the drive). b. Align the drive/tray assembly so that it engages the guide rails in the bay. c. Push the drive assembly into the bay until the tray handle engages the lock mechanism. d. Push the tray handle in until it locks. The lit LED on the disk drive will turn on. 2. Log in as root user. 3. At the command line, type smitty. 4. Select Devices. 5. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is indicated by the OK message displayed next to the Command field at the top of the screen. 6. Press F10 to exit smitty. 7. Replace the front bezel as described in “Front Bezel Replacement” on page 112. 8. Close the rack doors. Chapter 3. Removal and Replacement Procedures 141 D20 Disk Drive Cage Before performing the following procedure, read “Safety Considerations” on page 41. Disk Drive Cage Removal To remove the disk drive cage, do the following: 1. Open the rack front door. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. Place the system into the service position as described in “D20 Service Position” on page 109. 4. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. Remove the front bezel as described in “Front Bezel Removal” on page 112. 6. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 7. Label and then remove all hot-plug disk drives as described in “Hot-Plug Disk Drive Removal” on page 139. 8. Remove the 8 retaining screws that secure the disk drive cage to the front of the system chassis. See the following illustration. 9. Pull the disk drive cage assembly out of the subsystem. Note: When pulling the disk drive cage out of the subsystem, use the center post and pull straight back toward you. Do Not grab, hold or pull on the light pipes. 142 D10 and D20 Service Guide 1 D20 5 Retaining Screws (Quantity 8) 2 Disk Drive Cage 6 Light Pipes 3 Disk Drives 7 Light Pipe Support Bracket 4 Front Bezel Disk Drive Cage Replacement To replace the disk drive cage, go the following: 1. Insert the disk drive cage assembly into the subsystem. 2. Insert the 8 retaining screws that secure the disk drive cage to the front of the system chassis. 3. Replace all hot-plug disk drives as described in “Hot-Plug Disk Drive Replacement” on page 141. 4. Replace the front bezel as described in “Front Bezel Replacement” on page 112. 5. Reconnect the power source to the system. If the system has two power supplies, two power cables need to be reconnected. 6. Put the system back into the operating position as described in “D20 Operating Position” on page 110. 7. Start the system as described in “Stopping and Starting the System” on page 42. 8. Close the rack door. Chapter 3. Removal and Replacement Procedures 143 D20 Disk Drive Backplane Before performing the following procedure, read “Safety Considerations” on page 41. Disk Drive Backplane Removal To remove the disk drive backplane, do the following: 1. Remove all hot-plug disk drives as described in “Hot-Plug Disk Drive Removal” on page 139. 2. Remove the four cooling blowers as described in “Hot-Swap Blower Removal” on page 115. Note: Removing the cooling blowers enables you to view the disk drive backplanes. 3. Disconnect the cables connected to the disk drive backplane. 4. Remove the 8 retaining screws that secure the disk drive cage as described in “Disk Drive Cage Removal” on page 142. 5. Pull the disk drive cage assembly out of the system. 6. Remove the 2 retaining screws that secure the disk drive backplane to the subsystem chassis. 7. Grasp the disk drive backplane from the bottom of one of the connectors and pull it out of the chassis, bottom first. 1 Cooling blowers 4 Disk Drive Cage 2 D20 5 Disk Drives 3 Disk Drive Backplane 6 Front Bezel 144 D10 and D20 Service Guide Disk Drive Backplane Replacement To replace the disk drive backplane, do the following: 1. Reconnect the power/signal cable to the connector located on the top edge of the disk drive backplane. 2. Grasp a bottom edge of one of the connectors located on the disk drive backplane, and insert the backplane into the disk drive bay. 3. Using the 3 supplied retaining screws, secure the disk drive backplane to the subsystem chassis. 4. Carefully insert the disk drive cage assembly into the system. 5. Align and then secure the disk drive cage to the subsystem chassis with the 7 previously removed retaining screws. 6. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 7. Replace all hot-plug disk drives as described in “Hot-Plug Disk Drive Replacement” on page 141. 8. Reconnect the power source to the system. 9. Replace the front bezel as described in “Front Bezel Replacement” on page 112. 10. Power on the system as described in “Stopping and Starting the System” on page 42. 11. Close and then secure the service access cover with the three thumbscrews located on its back edge. 12. Return the subsystem into the operating position as described in “D20 Operating Position” on page 110. 13. Close the rack front door. Chapter 3. Removal and Replacement Procedures 145 D20 Power Supplies DANGER Do not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit. D02 The D20 subsystem has power supplies that have hot-swap capabilities. To use this feature, you must have two power supplies in place. Power Supply Removal D20 subsystems configured at the factory with one power supply have a power supply filler panel. The filler panel protects the vacant power supply connector from dust or damage. If you plan to remove one of the power supplies from a subsystem configured with two power supplies, obtain the power supply filler panel. The part number for the power supply filler panel is 53P0233. Note: You need not shut down the system to change a subsystem power supply. Before performing any of the following procedure, read and understand all of the safety notices beginning with “Safety Considerations” on page 41. To remove a power supply from your subsystem, do the following: 1. Put the subsystem into the service position as described in “D20 Service Position” on page 109. 2. Open the service access cover as described in “D20 Service Access Cover” on page 111. 3. If your subsystem is configured with two power supplies, visually check the power supply LEDs. If one of the power supplies has an amber LED that is lit or blinking, suspect that power supply to be faulty. 4. Remove the power cable from the rear of the power supply. Wait at least 10 seconds before performing the next step. 5. Release, and then lift the power supply handles. 6. Lift the handles until they are perpendicular (90 degrees) to the top of the power supply. By placing the handles perpendicular to the top of the power supply, the base or hinged portion of each handle acts as a cam and will gently pry the power supply from its docking connector located on the subsystem system board. 146 D10 and D20 Service Guide 7. Carefully remove the power supply from the subsystem. 1 2 3 4 6 5 1 Power Supplies 4 Green DC Good LED 2 Green Power LED 5 Power Supply Filler 3 Amber Fault/Identify LED 6 D20 Subsystem Note: During normal operation, each power-supply bay must have either a power supply or filler panel installed for proper cooling. Power Supply Replacement To replace a power supply, do the following: 1. Ensure the handles are in the unlocked position and perpendicular to the power supply. 2. Carefully insert the new power supply into the power supply bay. 3. Press the power supply onto the docking connectors. 4. Put the power supply handles into the locked position. Note: The handles act as a cam and will correctly seat the power supply. 5. Reconnect the power source to the replaced power supply. The green LED located on the power supply comes on solid. 6. Close and then secure the service access cover with the three thumbscrews located on its back edge. 7. Put the D20 subsystem back into the operating position as described in “D20 Operating Position” on page 110. 8. If your system has successfully powered on, close the rack doors. Chapter 3. Removal and Replacement Procedures 147 D20 Power Supply Bulkhead Before performing the following procedure, read the “Safety Considerations” on page 41. Power Supply Bulkhead Removal To remove the power supply bulkhead, do the following: 1. Open the rack front door. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. Put the D20 subsystem into the service position. 4. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. Open the service access cover as described in “Opening the Service Access Cover” on page 111. 6. Disconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem. 7. Disconnect the power cables from the power supplies 8. Label and then disconnect all PCI adapter cables from their respective adapters. 9. Remove the power supplies or power supply filler as described in “Power Supply Removal” on page 146. 10. Remove the RIO bus adapter as described in “RIO Bus Adapter Removal” on page 120. 11. Remove all PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 126. Note: Alternately remove the PCI dividers as you are removing the PCI adapters. 12. Remove the three upper-retaining screws that secure the top edge of the power supply bulkhead to the subsystem chassis. 13. Remove the three lower-retaining screws that secure the bottom portion of the power supply bulkhead to the subsystem chassis. 14. Remove the power supply bulkhead by lifting it straight up and out of the subsystem. 148 D10 and D20 Service Guide 1 Power Supply Filler 5 PCI-X Adapters 2 Power Supply Bulkhead 6 D20 3 PCI Dividers 7 Power Supplies 4 RIO Bus Adapter Chapter 3. Removal and Replacement Procedures 149 Power Supply Bulkhead Replacement To replace the power supply bulkhead, do the following: 1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 2. Insert the power supply bulkhead into the subsystem. 3. Align the three screw holes located on the top of the power supply bulkhead with the three threaded screw holes located on the subsystem chassis. 4. Align the three screw holes located on the bottom of the power supply bulkhead with the three threaded screw holes located on the subsystem chassis base. 5. Tighten the six retaining screws for the power supply bulkhead. 6. Replace the power supplies or the power supply filler as described in “Power Supply Replacement” on page 147. 7. Reconnect the power cables to the power supplies. 8. Replace the RIO bus adapter as described in “RIO Bus Adapter Replacement” on page 122. 9. Reconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem. 10. Replace all of the previously removed PCI adapters as described in “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 128. Note: When you are reinstalling the PCI adapters, alternately reinstall the PCI dividers. 11. Reconnect all PCI adapter cables to their respective adapters. 12. Reconnect the power source to the system. The green LED located on the power supply comes on solid. 13. Close then secure the service access cover with the three thumbscrews located on its back edge. 14. Put the D20 subsystem back into the operating position as described in “D20 Operating Position” on page 110. 15. After the system has successfully powered on, close the rack doors. 150 D10 and D20 Service Guide D20 I/O Backplane Assembly Before performing the following procedure, read the “Safety Considerations” on page 41. I/O Backplane Removal To remove the I/O backplane from the subsystem, do the following: 1. Open the rack front door. 2. Shut down the system as described in “Stopping and Starting the System” on page 42. 3. Put the D20 subsystem into the service position. 4. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. Open the service access cover as described in “Opening the Service Access Cover” on page 111. 6. Disconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem. 7. Disconnect the power cables from the I/O subsystem power supplies. 8. Disconnect the SPCN connectors from the back of the subsystem chassis. 9. Label, and then disconnect all PCI adapter cables from their respective adapters. 10. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 11. Remove the RIO bus adapter as described in “RIO Bus Adapter Removal” on page 120. 12. Remove all of the power supplies or power supply filler as described in “Power Supply Removal” on page 146. 13. Remove all PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 126. Note: When you are removing the PCI adapters, alternately remove the seven PCI dividers. 14. Disconnect the following cables in the order listed from the connectors located on the front of the I/O backplane. See the following illustration for connector locations. v Operator panel cable v Disk drive SPCN cable v Disk drive power cable v Cooling blower cable 1 Operator Panel Connector 3 Disk Drive Power Connector 2 Disk Drive System Power Control Network (SPCN) Connector 4 Cooling Blower Connector Chapter 3. Removal and Replacement Procedures 151 15. Remove the plastic membrane from the top of the I/O backplane by first removing the five retaining screws that secure the plastic membrane to the I/O backplane. Note: When removing the plastic membrane, slide the membrane under the light pipes that identify the green and amber power LEDs. 16. Remove the three retaining screws that secure the front edge of the I/O backplane to the subsystem chassis. 17. Go to the rear of the subsystem, and remove the two retaining screws that secure the back edge of the I/O backplane to the back chassis bulkhead. Note: The two retaining screws that secure the back edge of the I/O backplane to the rear chassis bulkhead are located just below the left SPCN connector and the rack indicator connector. 18. Remove the two retaining screws that secure the SCSI cable bracket to the subsystem-chassis bracket. 19. Remove the SCSI cable bracket by wiggling it up and down, then pulling it straight out and away from the PCI adapter-headstock bracket and the subsystem chassis bracket. 20. Remove the four retaining screws that secure the PCI adapter-headstock bracket to the chassis and power supply bulkhead. 21. Remove the PCI adapter-headstock bracket by lifting it up, then out of the subsystem chassis. 22. Remove the four retaining screws that secure the power-supply bulkhead to the chassis 23. Remove the power-supply bulkhead by doing the following: a. Lift the slotted tab off the alignment pin. The slotted tab and alignment pin are located at the bottom edge of the power-supply bulkhead. b. Move the lower portion of the power-supply bulkhead out and away from the alignment pin. Notes: 1) Ensure that the back edge of the power-supply bulkhead clears the disk drive backplane’s plastic alignment pin. The disk drive backplane’s plastic alignment pin is located on the disk drive-backplane support bracket between the power-supply bulkhead and the PCI adapter-headstock bracket. 2) The disk drive backplane-support bracket is not shown in the following illustration; however, it is easily located. This is the bracket that secures the service access cover to the chassis. It also supports the disk drive backplanes, cooling blowers, power supply bulkhead, and PCI adapter headstock bracket. c. Push the power-supply bulkhead toward the center of the I/O subsystem chassis. Note: The upper-right corner of the power-supply bulkhead might get caught on the underside of the chassis support. If the power-supply bulkhead gets caught on the underside of the chassis support, wiggle the bulkhead until it is freed from under the chassis support. 24. Remove the I/O backplane by sliding the backplane toward the front of the I/O subsystem. 25. Slowly lift the I/O backplane straight up and out of the subsystem. 152 D10 and D20 Service Guide 26. Locate the VPD module. 1 I/O Backplane 2 VPD Module Attention: Note the location of Pin 1 before removing. 27. Note the location of pin 1 on the VPD module. 28. Gently remove the VPD module and put it in a safe place. Note: You will be installing the old VPD module onto the new I/O backplane. Chapter 3. Removal and Replacement Procedures 153 1 PCI Adapter Dividers (maximum quantity 7) 6 Plastic Membrane 2 RIO Bus Adapter 7 D20 3 PCI-X Adapters (maximum quantity 7) 8 SCSI Cable Bracket 4 Power Supplies 9 PCI Adapter-Headstock Bracket 5 I/O Backplane 10 Power-Supply Bulkhead 5a VPD Module 154 D10 and D20 Service Guide I/O Backplane Replacement To replace the I/O backplane, do the following: 1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 42. 2. Note the location of pin 1 on the VPD module. 3. Remove the VPD module that was installed on the new I/O backplane. 4. Install the VPD module from the old I/O backplane onto the new I/O backplane. Note: When installing the VPD module, ensure that pin 1 is in the correct orientation. 5. Carefully align the I/O backplane to the alignment pins located on the chassis base. 6. Replace the three front retaining screws that secure the I/O backplane to the chassis. 7. Correctly align and then place the plastic membrane that protects the I/O backplane over the I/O backplane. Secure it in place with five retaining screws. 8. Reconnect the following cables, in the order listed, to their respective connectors located on the front of the I/O backplane: v Cooling blower cable v Disk drive power cable v Disk drive SPCN cable v Operator panel cable 9. Go to the rear of the subsystem chassis, and replace the two rear retaining screws. Note: The two threaded screw holes used to secure the rear of the I/O backplane to the rear of the subsystem chassis are located just below the left SPCN connector and the rack-indicator connector. 10. Replace the power-supply bulkhead by doing the following: a. Rotate the power-supply bulkhead, placing the top right corner under the chassis support bracket. b. Align the slotted tab, located on the lower edge of the power supply bulkhead, with the alignment pin, located on the subsystem chassis. c. Align the four clearance holes located on the power-supply bulkhead with the four screw holes located on the upper and lower subsystem chassis. Note: When aligning the power-supply bulkhead, wiggle it until properly seated. d. Secure the power-supply bulkhead to the subsystem chassis with four retaining screws. 11. Replace the PCI adapter-headstock bracket. 12. Align the four clearance holes on the bracket with the four screw holes located on the chassis. 13. Secure the PCI adapter-headstock bracket to the chassis with four retaining screws. 14. Replace the SCSI cable bracket, aligning the two screw holes and securing it with two retaining screws to the chassis retaining bracket. 15. Reconnect the SPCN connectors from the back of the subsystem chassis. 16. Replace the power supplies or the power supply filler as described in “Power Supply Replacement” on page 147. 17. Replace the RIO bus adapter as described in “RIO Bus Adapter Replacement” on page 122. 18. Reconnect the RIO-2 cables from the RIO connectors located on the rear of the subsystem. Chapter 3. Removal and Replacement Procedures 155 19. Replace all of the previously removed PCI adapters as described in “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 128. Note: Alternately reinstall the PCI dividers, when reinstalling the PCI adapters. 20. Reconnect all PCI adapter cables to their respective adapters. 21. Reconnect the power cables to the power supplies. 22. Reconnect the power source to the system. The green LED located on the power supply comes on solid. 23. Close and then secure the service access cover with the three thumbscrews located on its back edge. 24. Put the D20 subsystem back into the operating position as described in “D20 Operating Position” on page 110. 25. After the system has successfully powered on, close the rack doors. 156 D10 and D20 Service Guide Chapter 4. Parts Information This chapter contains parts information for the D10 and D20 I/O drawers. D10 I/O Subsystem This section lists part numbers for the D10 I/O Subsystem. This I/O subsystem is used in conjunction with a system unit. Depending on your configuration, the quantities of parts listed may increase by the actual number of I/O subsystems connected to the system unit. D10 I/O Subsystems Parts 1 2 3 4 5 6 7 8 9 10 12 13 14 11 157 Index Part Number Units Per Assy Description 1 00P2752 1 Bracket, Bulkhead 2 09P4656 1 Tray, cable 3 09P5417 1 Cable, fan 4 09P3128 2 Guide, mounting 5 1624749 8 Screw, guide mounting 6 ******* See index number 9 7 1621829 2 Screw, tray mounting 8 1624743 6 Screw, backplane mounting 9 09P5921 1 I/O backplane assembly (includes SPCN connector card ) 10 1621829 1 Screw, tailstock cover mounting 11 00P4053 1 Dual port RIO bus adapter 00P4080 1 Dual port RIO-2 bus adapter 12 1621829 2 Screw, midplate cover mounting 13 00P2751 1 Drawer chassis assembly 14 09P4664 1 Fan assembly 158 D10 and D20 Service Guide D10 I/O Subsystem Parts (continued) 1 2 3 4 5 6 7 Chapter 4. Parts Information 159 Index Part Number Units Per Assy Description 1 00P2753 1 Cover, access 09P5159 Front bezel, white 2 1624743 4 Screw, guide mounting 3 00P2750 2 Guide, adapter cassette top 44P0239 Empty PCI adapter cassette kit 00P3842 PCI adapter cassette blank filler, long 00P3843 PCI adapter cassette blank filler, short 53P5450 PCI bracket (used in PCI adapter cassette to install Gigabit Ethernet PCI-X adapter) 44P2661 PCI clip (used in PCI adapter cassette to install Ultra SCSI adapter) 44P0321 PCI clip (used in PCI adapter cassette to install 10/100 Eternet adapter) 44P2675 PCI clip for (used in PCI adapter cassette to install FCAL 2GBS adapter) 44P2676 Clip for FC 2969 bracket (used in PCI adapter cassette to install Gigabit Ethernet PCI adapter) 4 09P3185 5 Guide, power cable support 5 09P2401 2 Cable, power 6 09P4665 1 Cover, power cable channel 7 09P3354 2 Power supply 09P4778 1 Front Bezel 8 09P5159 1 Front Bezel 160 D10 and D20 Service Guide D10 SPCN, RIO, and Rack Beacon Cables (These parts are not shown.) Index Part Number Units Per Assy Description 1 53P2231 Up to 12 Rack beacon junction box, 7-port mini connector (4693) 2 53P2237 Up to 12 Rack beacon connector cable, subsystem to junction box, 4-pin std to 4-pin mini connector, 3 meter (4691) 07H6655 As needed Hook-and-loop fastner 3 53P2854 Up to 12 Rack beacon connector cable, 4-pin mini to 4-pin mini 0.25 meter (4692) 4 87G6235 Up to 9 SPCN cable, 2 meter 09P1251 Up to 9 SPCN cable, 3 meter 21F9469 Up to 9 SPCN cable, 6 meter 21F9358 Up to 9 SPCN cable, 15 meter 5 21P5454 Up to 10 RIO-2 cable, 1.2 meter 53P2676 Up to 10 RIO-2 cable, 3.5 meter 21P5456 Up to 10 RIO-2 cable, 10 meter D10 Rack Mounting Enclosure (These parts are not shown.) Index Part Number Units Per Assy Description 1 09P4746 1 Two position I/O subsystem mounting assembly 2 03N3845 1 Left rail 03N3847 1 Right rail 3 00P2402 2 Cable support bracket 4 09P4911 1 Mounting bracket 5 09P4912 1 Rear filler for empty I/O subsystem space in mounting assembly Chapter 4. Parts Information 161 D20 I/O Subsystem This section lists part numbers for the D20 I/O Subsystem. This I/O subsystem is used in conjunction with a system unit. Depending on your configuration, the quantities of parts listed may increase by the actual number of I/O subsystems connected to the system unit. D20 I/O Backplane and Cabling 1 2 3 4 5 6 Index Part Number Units Per Assy Description 1 53P0416 1 DASD cable 2 53P0419 1 Blower cable 3 53P0414 1 Operator panel cable 4 53P0220 1 Chassis assembly 5 44H7366 5 Mounting screw, stiffener 6 53P3472 1 I/O Backplane 162 D10 and D20 Service Guide D20 PCI Slots 1 2 3 4 5 6 7 Index Part Number Units Per Assy Description 1 See Note 7 (maximum quantity) PCI adapter 2 53P0248 1 Power supply bulkhead 3 44H7366 4 Screw, power bulkhead mounting 4 44H7366 3 Screw, PCI plate mounting 5 53P0249 1 PCI adapter headstock bracket 6 53P2728 5 PCI dividers 7 53P2729 2 PCI dividers Note: See Diagnostic Information for Multiple Bus Systems for part numbers. Chapter 4. Parts Information 163 D20 Power, RIO Adapter, Cabling Index Part Number Units Per Assy Description 1 53P4832 2 (maximum quantity) Power supply 2 53P3800 1 RIO bus adapter 97H2459 1 RIO-2 bus adapter 3 53P2690 3 Cable bracket, SCSI 4 44H7366 1 Screw, SCSI cable 5 53P0639 5 RIO-2 cable bracket 6 44H7366 2 Screw, RIO-2 cable bracket 7 53P0417 Configuration dependent SCSI bus cable 8 53P0418 Configuration dependent SCSI bus cable 9 53P0233 1 Power supply filler 164 D10 and D20 Service Guide D20 Operator Panel Index Part Number Units Per Assy Description 1 53P0257 2 Disk-drive-backplane (disk drive cage included) 2 53P0321 4 Screw, disk-drive-backplane mounting 3 53P0330 1 Operator panel 4 53P0275 1 Processor cover 5 44H7366 4 Screw, cover-mounting Chapter 4. Parts Information 165 D20 Fans and Disk Drives Index Part Number Units Per Assy Description 1 53P0262 4 Blower 2 D20 3 53P0257 1 Disk-drive cage 12 pack (disk drive back planes included) 4 53P0319 8 Screw, DASD-cage mounting 5 Disk Drives 6 53P0234 4 Screw, filler-plate mounting 7 44H7366 1 (optional) Disk-drive filler plate 8 53P3451 1 Left Rail 9 53P3452 1 Right Rail 166 D10 and D20 Service Guide D20 Covers and Brackets Index Part Number Units Per Assy Description 1 53P1457 1 Front bezel 2 44H7366 2 Screw, latch bracket 3 53P2572 1 Latch bracket assembly left 4 53P0295 4 Cable arm bracket 5 44H7366 1 Screw, cable arm bracket 6 53P2573 1 Latch bracket assembly right D20 RIO, SPCN, and Rack Beacon Cables Index Part Number Units Per Assy Description 1 53P2231 Up to 12 Rack beacon cable junction box 2 53P2237 Up to 12 Rack beacon connector cable, 4-pin connector 3 53P2854 Up to 12 Rack beacon connector cable, 4-pin 87G6235 SPCN cable, 2 meter 09P1251 SPCN cable, 3 meter 21F9469 SPCN cable, 6 meter 21F9358 SPCN cable, 15 meter 4 53P2676 RIO-2 cable, 3.5 meter 21P5456 RIO-2 cable, 10 meter Chapter 4. Parts Information 167 Power Cables Index Part Number Country or Region 1 1838574 Bahamas, Barbados, Bolivia, Brazil, Canada, Costa Rica, Dominican Republic, El Salvador, Ecuador, Guatemala, Guyana, Haiti, Honduras, Jamaica, Japan, Netherlands Antilles, Panama, Peru, Philippines, Taiwan, Thailand, Trinidad, Tobago, U.S.A. (except Chicago), Venezuela 2 86G7648 Bahamas, Barbados, Bermuda, Bolivia, Brazil, Canada, Cayman Islands, Colombia, Costa Rica, Dominican Republic, Ecuador, El Salvador, Guatemala, Guyana, Haiti, Honduras, Jamaica, Japan, Korea (South), Mexico, Netherlands Antilles, Nicaragua, Panama, Peru, Philippines, Puerto Rico, Saudi Arabia, Suriname, Trinidad, Taiwan, U.S.A. (except Chicago), Venezuela 2 62X1045 Chicago, U.S.A. 3 6952311 Argentina, Australia, New Zealand 4 13F9979 Abu Dhabi, Austria, Belgium, Bulgaria, Botswana, China (Macau S.A.R), Egypt, Finland, France, Germany, Greece, Iceland, Indonesia, Korea (South), Lebanon, Luxembourg, Netherlands, Norway, Portugal, Saudi Arabia, Spain, Sudan, Sweden, Turkey, Yugoslavia 5 13F9997 Denmark 6 14F0015 Bangladesh, Burma, Pakistan, South Africa, Sri Lanka 7 14F0033 Bahrain, Bermuda, Brunei, Channel Islands, Cyprus, China (Hong Kong S.A.R), Ghana, India, Iraq, Ireland, Jordan, Kenya, Kuwait, Malawi, Malaysia, Nigeria, Oman, People’s Republic of China, Qatar, Sierra Leone, Singapore, Tanzania, Uganda, United Arab Emirates (Dubai), United Kingdom, Zambia 8 14F0051 Liechtenstein, Switzerland 9 14F0069 Chile, Ethiopia, Italy 10 14F0087 Israel 11 6952291 Paraguay, Colombia, Uruguay 42F6841 AC Power Cable, (Power distribution bus to AC power supply, 286/C4 only) 1 2 3 4 5 6 7 8 9 10 11 168 D10 and D20 Service Guide Chapter 5. Fault and Attention LEDs This chapter contains information about fault and attention LEDs, which assist in identifying failing components in your system. Component LEDs Individual LEDs are located on or near the failing components. The LEDs are located either on the component itself or on the carrier of the component (for example, PCI adapters and disk drives). LEDs are either green or amber. Green LEDs indicate one of two things: v Electrical power is present. v Activity is occuring on a link. (The system could be sending or receiving information.) Amber-colored LEDs indicate a fault or identify condition. If your system or one of the components on your system has an amber-colored LED turned on or blinking, identify the problem and take the appropriate action to restore the system to normal. The following table identifies the location, color and status of your system component LEDs. 169 Unit (FRU) LED Function Location Viewable Location LED Color OFF ON Blink Power and Packaging LEDs Rack Indicator Rack Summary Top of Rack External Amber Normal Fault Identify AC Input External Front Green No Input AC Good Fault I/O Subsystem Drawer Attention Operator Panel External Front Amber Normal Fault Identify Power External Front Green No AC System On Standby Fans Identify FRU External Amber Normal Identify N/A Power FRU External Green Power Off Power On N/A Power Supply Input Source FRU External Back Green No Input AC Input Good N/A Identify/ COM Fail FRU External Back Amber Normal N/A 1 Hz=Identify .25 Hz=Comm. Failed DC Output Good FRU External Back Green Standby Off System On Standby Disk Drives (DASD) Slot Power DASD Backplane External Front Green Power Off Power On N/A Activity DASD Backplane External Front Green Not Active N/A Activity Identify DASD Backplane External Front Amber Normal Action Remove Identify Media Slot Power Backplane External Front Green Power Off Power On N/A Activity/ Identify Backplane External Front Amber Normal Action Identify Electronic Component LEDs RIO Connector Card Identify FRU (Top) Internal Only Amber Normal Identify N/A RIO Extender Card Identify FRU (Top) Internal Only Amber Normal Identify N/A I/O Backplane Identify FRU Internal Only Amber Normal Identify N/A PCI Adapter Connectors (Slots) Slot Power Backplane External Back Green Power Off Power On N/A Action/ Identify Backplane External Back Amber Normal N/A Grn off - Action Grn on - Identify Disk Drive Backplane No LED Port LED Indicators RIOPorts Identify Top of Each External Connector External Back Amber Normal N/A Identify 170 D10 and D20 Service Guide Resetting the LEDs After the repair action is completed, do the following: 1. Log in as root user. 2. At the command line, type diag. 3. Select Task Selection. 4. Select Log Repair Action. 5. Select the device that was repaired. 6. Press F10 to exit diagnostics. If the attention LED remains on after you have completed the repair action and reset the LEDs, call for service support. Chapter 5. Fault and Attention LEDs 171 172 D10 and D20 Service Guide Appendix A. Environmental Notices Product Recycling and Disposal Components of the system unit, such as structural parts and circuit boards, can be recycled where recycling facilities exist. Companies are available to disassemble, reutilize, recycle, or dispose of electronic products. Contact your account representative for more information. This system unit contains parts such as circuit boards, cables, electromagnetic compatibility gaskets and connectors which may contain lead and copper/beryllium alloys that require special handling and disposal at end of life. Before this unit is disposed, these materials must be removed and recycled or discarded according to applicable regulations. This book contains specific information on each battery type where applicable. This product may contain a sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or lithium ion battery. Consult your user manual or service manual for specific battery information. The battery must be recycled or disposed of properly. Recycling facilities may not be available in your area. For information on disposal of batteries, contact your local waste disposal facility. 173 Acoustical Noise Emissions The equivalent continuous A-weighted sound pressure level at workstations (emission sound pressure level at the 1-meter bystander positions) does not exceed 70 dB(A). Der Geräuschpegel der Einheit ist kleiner oder gleich 70 db(A). D10 Declared Acoustical Noise Emissions Acoustical Characteristic Product Configuration Declared A-Weighted Sound Power Level, LWAd (B) Declared A-Weighted Sound Pressure Level, LpAm (dB) Operating Idle Operating Idle One D10 I/O subsystem 5.65 5.65 406 406 Two D10 I/O subsystems 5.95 5.95 436 436 Four D10 I/O subsystems 6.25 6.25 466 466 Notes: 1. Noise levels cited are for the typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media drawer, and two I/O drawers). 2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4 reduction. That is, the noise level of a single A-Frame with Slimline covers is about the same as the noise level of four A-Frames with acoustical covers. 3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the 1-meter bystander positions; 1 B = 10 dB. 4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296. D20 Declared Acoustical Noise Emissions Acoustical Characteristic Product Configuration Declared A-Weighted Sound Power Level, LWAd (B) Declared A-Weighted Sound Pressure Level, LpAm (dB) Operating Idle Operating Idle D20 I/O subsystem 6.15 6.05 446 436 Notes: 1. Noise levels cited are for the typical configuration (A-Frame: Bulk Power, CEC cage, battery option, media drawer, and two I/O drawers). 2. The 0.6-B (6-dB) reduction in noise emission levels with the acoustical rear door corresponds to a factor of 4 reduction. That is, the noise level of a single A-Frame with Slimline covers is about the same as the noise level of four A-Frames with acoustical covers. 3. LWAd is the upper-limit A-weighted sound power level; LpAm is the mean A-weighted sound pressure level at the 1-meter bystander positions; 1 B = 10 dB. 4. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296. 174 D10 and D20 Service Guide Appendix B. Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer’s representative for information on the products and services currently available in your area. Any reference to the manufacturer’s product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user’s responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any country where such provisions are inconsistent with local law: THIS MANUAL IS PROVIDED ²AS IS² WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions; therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Information concerning products made by other than the manufacturer was obtained from the suppliers of those products, their published announcements, or other publicly available sources. The manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products made by other than the manufacturer. Questions on the capabilities of products made by other than the manufacturer should be addressed to the suppliers of those products. 175 176 D10 and D20 Service Guide Index A acoustical noise emissions 174 adapter cassette removing 55 replacing 56 adapter, PCI removal and replacement, D10 45 adapters, PCI D20 123 AIX documentation 47, 53, 123, 129, 132, 135, 137, 138 AIX location codes 14, 31 attention LEDs 169 B backplane, I/O removal and replacement D20 151 battery disposal, recycling 173 bay locations 24 beacon, parts 161 bezel removal, D20 112 bezel, front 43, 112 blower locations 21 blower, cable removal D20 118 blower, cooling 18 blowers, cooling 115 blowers, hot-swap removal, D20 115 blowers, hot-swap replacement, D20 117 brackets, parts 159, 167 bulkhead, power supply removal and replacement, D20 148 C cables, external parts 167 cabling, parts 162 cage, disk drive D20 142 checkpoints 39, 40 code, error 40 codes, location 14, 31 component LEDs 39 cooling 5, 18 cooling blower cable replacement, D20 119 cooling blowers cable, D20 118 cooling, parts 159, 163 cover removal and replacement D10 44 cover, service access 111 covers 44 covers, parts 159, 167 D D10 location code table 15 D10 location codes 14 D10 locations 10 codes, D10 14 D10 locations (continued) D10 15 D10 I/O subsystem, front 10 D10 I/O subsystem, rear 11 I/O backplane 12 PHB 13 D10 replacement procedure list 42 D20 location code table 32 D20 location codes 31 D20 locations 19 blowers 21 codes, D20 31 D20 32 D20 I/O subsystem, front 19 D20 I/O subsystem, rear 19 I/O backplane 22 power supply 20 SCSI bay locations 24 D20 replacement procedure list 108 definitions, LED 2 diagnostics overview 39 disk drive backplane 144 disk drive backplane replacement, D20 145 disk drive cage removal D20 142 disk drive cage replacement, D20 143 disk drive, backplane removal D20 144 disk drive, configuring 138 disk drive, deconfiguring 137 disk drive, hot-plug removal D20 139 disk drive, hot-plug replacement, D20 141 disk drive, parts 166 disk drive, removing 137 disk drive, replacing 138 disk drives, hot-plug 137 documentation AIX 47, 53, 123, 129, 132, 135, 137, 138 E electrical safety vii laser compliance statement ix error code to FRU index 40 external cables, parts 167 F fan, cooling 5 fan, parts 166 fan, removal and replacement D10 104 features, subsystem 1 cooling 5 LED definitions 2 operator indicators 2 pci slot 2 PCI-X slots 2 ports, input/output 5 ports, serial 5 177 features, subsystem (continued) power 4 features, system 16 cooling 18 disk drives 17 operator panel 16 PCI-X slots 16 ports, input/output 18 ports, serial 18 power 18 slots, PCI-X 16 front bezel removal, D10 43 front bezel replacement, D10 43 front bezel replacement, D20 112 front, bezel 43 FRU index 40 FRU isolation 39 H handling static-sensitive devices 42 highlighting xiii hot-plug management functions 123 hot-plug manager 123 hot-plug manager menu 124 hot-pluggable PCI adapter installation overview 53 hot-pluggable PCI adapter removal and replacement, D10 46 hot-swap fan assembly D10 104 hot-swap fan replacement, D10 104 I I/O backplane assembly replacement, D10 106 I/O backplane assembly, removal and replacement, D10 106 I/O backplane locations 12, 22 I/O backplane removal D20 151 I/O backplane replacement, D20 155 I/O backplane, parts 157, 162 index, error code to FRU 40 information, reference 1 input/output ports 5, 18 IO backplane assembly removal, D10 106 isolation, FRU 39 L laser compliance statement ix laser safety information ix LED definitions 2 LED definitions, PCI-X 16 LED, D10 power supply 5 LEDs 169 attention LEDs 169 component 39, 169 component LEDs 169 fault LEDs 169 resetting 171 resetting LEDs 171 location codes 14, 31 location codes, physical 14, 31 locations, D10 10 locations, D20 19 M MAP rules 40 N noise emissions acoustical 174 non-hot-pluggable PCI adapter, removal D20 126 O operating position 110 operating position, D10 43 operator panel 23, 113 operator panel removal, D20 113 operator panel replacement, D20 114 operator panel, parts 165 overview hot-pluggable PCI adapter 53 overview, D10 1 overview, D20 16 overview, diagnostics 39 P parts information covers and brackets 167 external cables 167 fans and disk drives 166 I/O backplane PCI slots 157 I/O backplane and cabling 162 operator panel 165 PCI slots 163 power and cooling 159 power, RIO 164 rack mount and beacon connect 161 RIO cables 167 SPCN cables 167 subsystem 157 parts, beacon 161 PCI adapter cassette removing 55 replacing 56 PCI adapter removal and replacement, D10 45 PCI adapter, hot-plug removal D20 129 PCI-X definitions, LED 16 PHB 13 physical location codes 14, 31 ports, input/output 5, 18 power and lightpath LEDs 39 power cables 37 power supplies, removal and replacement 146 power supply 4, 18 178 D10 and D20 Service Guide power supply bulkhead removal D20 148 power supply bulkhead replacement, D20 150 power supply LED 5 power supply locations 20 power supply removal D20 146 power supply removal, D10 107 power supply replacement, D10 107 power supply, hot-swap replacement, D20 147 power supply, removal and replacement, D10 107 Power-On Self-Test 34 power, D10 4 power, D20 18 power, parts 159, 163, 164 powering on and off 34 product disposal 173 R recycling 173 reference information 1 related publications xiv removal and replacement procedures 41 adapter, adding or replacing a non-hot-plug 128 adapter, hot-plug management functions 123 adapter, hot-plug manager 123 adapter, install hot-plug 135 adapter, replace hot-plug 132 adapters, PCI D20 123 bezel 43 bezel, front 112 blowers, cooling 115 cooling blowers cable, D20 118 cover, service access 111 covers 44 devices, handling static-sensitive 42 disk drive backplane 144 disk drive cage, D20 142 disk drive, configuring 137 disk drive, deconfiguring 137 disk drives, hot-plug 137 fan, D10 104 hot-plug manager menu 124 hot-pluggable PCI adapter, 46 I/O backplane assembly, D10 106 I/O backplane D20 151 operating position 110 operator panel 113 PCI adapter, D10 45 power supplies 146 power supply bulkhead D20 148 power supply, D10 107 RIO bus adapter, D10 105 RIO bus adapter, D20 120 RIO cable, D10 105 safety considerations 41 service position 109 stopping the system 42 removal procedure list D10 42 removal procedure list D20 108 removal procedures 41 adapter, PCI D20 126 removal procedures (continued) bezel, D20 112 blower cable, D20 118 blowers, hot-swap, D20 115 caution and danger 41 disk drive backplane, D20 144 disk drive cage D20 142 front bezel, D10 43 hot-plug adapter, D20 129 hot-plug disk drive, D20 139 hot-swap fan assembly D10 104 I/O backplane, D20 151 IO backplane assembly, D10 106 operator panel, D20 113 power supply bulkhead, D20 148 power supply D20 146 power supply, D10 107 RIO bus adapter, D10 105 RIO bus adapter, D20 120 RIO cable, D10 105 service access cover, D10 44 service access cover, D20 111 removing a PCI adapter cassette 55 replacement procedures 41 blowers, hot-swap, D20 117 cooling blower cable, D20 119 disk drive backplane, D20 145 disk drive cage, D20 143 disk drive, hot-plug, D20 141 front bezel, D10 43 front bezel, D20 112 hot-swap fan, D10 104 I/O backplane, D20 155 operator panel, D20 114 power supply bulkhead, D20 150 power supply, D10 107 power supply, hot-swap, D20 147 RIO bus adapter, D10 105 RIO bus adapter, D20 122 RIO cable, D10 105 service access cover, D10 44 service access cover, D20 111 rereplacing a PCI adapter cassette 56 resetting the LEDs 171 RIO bus adapter removal D20 120 RIO bus adapter removal, D10 105 RIO bus adapter replacement, D10 105 RIO bus adapter replacement, D20 122 RIO bus adapter, D10 removal and replacement 105 RIO bus adapter, D20 120 RIO cable removal D10 105 RIO cable replacement, D10 105 RIO cable, removal and replacement D10 105 RIO-2 cables 5 RIO, parts 164 S safety considerations 41 safety notices vii SCSI IDs 24 Index 179 Self-Test, Power-On 34 serial port configurations 18 service access cover removal, D10 44 service access cover removal, D20 111 service access cover replacement, D10 44 service access cover replacement, D20 111 service inspection guide 38 service position 109 service position, D10 43 SPCN cables 5 SPCN cables, parts 167 specifications 35 specifications, power cables 37 stopping the system 42 subsystem cables 5 subsystem, power 4, 18 system stopping 42 system management services 40 system, powering on and off 34 T trademarks xiv W Web sites AIX documentation 47, 53, 123, 129, 132, 135, 137, 138 180 D10 and D20 Service Guide Vos remarques sur ce document / Technical publication remark form Titre / Title : Bull D10 and D20 I/O Drawers Service Guide Nº Reférence / Reference Nº : 86 A1 38EG 01 Daté / Dated : June 2003 ERREURS DETECTEES / ERRORS IN PUBLICATION AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION Vos remarques et suggestions seront examinées attentivement. Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète. Your comments will be promptly investigated by qualified technical personnel and action will be taken as required. If you require a written reply, please furnish your complete mailing address below. NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : Remettez cet imprimé à un responsable BULL ou envoyez-le directement à : Please give this technical publication remark form to your BULL representative or mail to: BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Technical Publications Ordering Form Bon de Commande de Documents Techniques To order additional publications, please fill up a copy of this form and send it via mail to: Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à : BULL CEDOC ATTN / Mr. L. CHERUBIN 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Phone / Téléphone : +33 (0) 2 41 73 63 96 FAX / Télécopie +33 (0) 2 41 73 60 19 E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr Or visit our web sites at: / Ou visitez nos sites web à: http://www.logistics.bull.net/cedoc http://www–frec.bull.com http://www.bull.com CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] [ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : PHONE / TELEPHONE : FAX : E–MAIL : For Bull Subsidiaries / Pour les Filiales Bull : Identification: For Bull Affiliated Customers / Pour les Clients Affiliés Bull : Customer Code / Code Client : For Bull Internal Customers / Pour les Clients Internes Bull : Budgetary Section / Section Budgétaire : For Others / Pour les Autres : Please ask your Bull representative. / Merci de demander à votre contact Bull. BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 38EG 01 ORDER REFERENCE PLACE BAR CODE IN LOWER LEFT CORNER Utiliser les marques de découpe pour obtenir les étiquettes. Use the cut marks to get the labels. 86 A1 38EG 01 D10 and D20 I/O Drawers Service Guide 86 A1 38EG 01 D10 and D20 I/O Drawers Service Guide 86 A1 38EG 01 Bull ESCALA PL420T and PL420R Service Guide 86 A1 42EG 00 ORDER REFERENCE Bull ESCALA PL420T and PL420R Service Guide Hardware February 2003 BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 42EG 00 ORDER REFERENCE The following copyright notice protects this book under the Copyright laws of the United States of America and other countries which prohibit such actions as, but not limited to, copying, distributing, modifying, and making derivative works. Copyright Bull S.A. 1992, 2003 Printed in France Suggestions and criticisms concerning the form, content, and presentation of this book are invited. A form is provided at the end of this book for this purpose. To order additional copies of this book or other Bull Technical Publications, you are invited to use the Ordering Form also provided at the end of this book. Trademarks and Acknowledgements We acknowledge the right of proprietors of trademarks mentioned in this book. AIX is a registered trademark of International Business Machines Corporation, and is being used under licence. UNIX is a registered trademark in the United States of America and other countries licensed exclusively through the Open Group. Linux is a registered trademark of Linus Torvalds. The information in this document is subject to change without notice. Groupe Bull will not be liable for errors contained herein, or for incidental or consequential damages in connection with the use of this material. Contents Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii Laser Safety Information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii Laser Compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii Data Integrity and Verification . . . . . . . . . . . . . . . . . . . . . . . . . . . xv About This Book. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii Highlighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii ISO 9000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii Related Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xviii Chapter 1. Reference Information . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 System Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 PCI-X Slots and Integrated Adapters. . . . . . . . . . . . . . . . . . . . . . . . . 4 I/O Expansion Drawer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Hardware Management Console (HMC). . . . . . . . . . . . . . . . . . . . . . . . . 6 Powering the System on and Off . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Console Strategy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Power-On Self-Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 POST Indicators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 POST Keys . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 System Unit Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 286/C4 Rear View. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 Power Supply Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 Fan Locations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 CEC Backplane Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 PCI Riser Card Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 Memory DIMM Location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 Reading the Operator Panel and System LEDs . . . . . . . . . . . . . . . . . . . . 19 SCSI IDs and Bay Locations . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 System Logic Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Systems Configured with a GP Processor and Four-Slotted PCI Riser Card . . . . . . . . . . 21 Systems Configured with a GQ Processor and Six-Slotted PCI Riser Card . . . . . . . . . . 22 Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Physical Location Codes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Location Code Format. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 AIX Location Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 Mapping AIX and Physical Location Codes . . . . . . . . . . . . . . . . . . . . . . . 28 Four-Slotted PCI Riser Card Location Codes . . . . . . . . . . . . . . . . . . . . . 29 Six-Slotted PCI Riser Card Location Codes . . . . . . . . . . . . . . . . . . . . . . 30 Disk Drive Backplane Location Codes . . . . . . . . . . . . . . . . . . . . . . . . 30 AIX and Physical Location Code Table. . . . . . . . . . . . . . . . . . . . . . . . 31 System Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Subsystem Positioning and Cabling. . . . . . . . . . . . . . . . . . . . . . . . . . 39 SPCN Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 RIO Cabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Specifications. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 External AC Power Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 Service Inspection Guide. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 iii Chapter 2. Diagnostic Overview . . . . . . . . . . . . . . . . . . . . . . . . . . 45 Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . . . . . . 45 Power and Attention LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Resetting the LEDs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 System Attention LED. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 FRU Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 Service Focal Point. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 Getting Started . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 Testing Error Reporting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 Service Focal Point Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 Working With Serviceable Events . . . . . . . . . . . . . . . . . . . . . . . . . 51 Hardware Service Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 53 System Error Interrogation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 The System is Attached to an HMC. . . . . . . . . . . . . . . . . . . . . . . . . 55 The System is not Attached to an HMC and AIX is Installed. . . . . . . . . . . . . . . . 55 The System is Attached to an HMC and Linux is Installed . . . . . . . . . . . . . . . . 55 Resetting the System Attention LED . . . . . . . . . . . . . . . . . . . . . . . . . 56 Systems Attached To an HMC. . . . . . . . . . . . . . . . . . . . . . . . . . . 56 Systems not Attached to an HMC with AIX Installed. . . . . . . . . . . . . . . . . . . 56 Systems not Attached to an HMC with Linux Installed . . . . . . . . . . . . . . . . . . 56 Chapter 3. Maintenance Analysis Procedures (MAPs) . . . . . . . . . . . . . . . . . . 57 Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 Quick Entry MAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58 MAP 1020: Problem Determination . . . . . . . . . . . . . . . . . . . . . . . . . . 64 MAP 1240: Memory Problem Resolution . . . . . . . . . . . . . . . . . . . . . . . . 68 MAP 1320: Service Focal Point Procedures. . . . . . . . . . . . . . . . . . . . . . . 75 MAP 1321: Quick Entry MAP for Systems with Service Focal Point . . . . . . . . . . . . . . 76 MAP 1322: End of Call MAP for Systems with Service Focal Point . . . . . . . . . . . . . . 84 MAP 1520: Power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 MAP 1521: Base System Drawer With Power Failure and No Error Codes . . . . . . . . . . . 91 Map 1522: Cannot Power On the I/O Drawer, Power Present LED Does Not Come On. . . . . . . 98 Map 1523: The SPCN Error Codes Table Directed You Here and the Error Code is 1011 1B0x . . . 105 MAP 1540: Problem Isolation Procedures . . . . . . . . . . . . . . . . . . . . . . . 108 FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 MAP 1541: JTAG Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . . 110 MAP 1542: I/O Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . . . 111 MAP 1548: Processor Subsystem Problem Isolation . . . . . . . . . . . . . . . . . . . 123 MAP 1549: Attention Problem Isolation . . . . . . . . . . . . . . . . . . . . . . . . 137 MAP 154C: SPCN Cable Problem Isolation . . . . . . . . . . . . . . . . . . . . . . 139 Chapter 4. Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141 IPL Flow with an HMC Attached to the System . . . . . . . . . . . . . . . . . . . . . 141 IPL Flow Without an HMC Attached to the System. . . . . . . . . . . . . . . . . . . . 142 Service Processor Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . 143 Firmware Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150 Boot Problems/Concerns . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173 Chapter 5. Error Code to FRU Index . . . . . . . . . . . . . . . . . . . . . . . . 179 Performing Slow Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179 Four-Character Checkpoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179 Replacing the Operator Panel . . . . . . . . . . . . . . . . . . . . . . . . . . . 179 iv Service Guide Replacing the Network Adapter . . . . . . . . . . . . . . . . . . . . . . . . . . . 180 Determining Location Code . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180 FRU Identify LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180 Checkpoint and Error Code Index . . . . . . . . . . . . . . . . . . . . . . . . . . 181 Operator Panel Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182 SPCN Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183 Firmware Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192 Service Processor Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . 211 Memory DIMM Present Detect Bits (PD-Bits) . . . . . . . . . . . . . . . . . . . . . . 285 Error Log Utility Procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . 285 System Firmware Update Messages . . . . . . . . . . . . . . . . . . . . . . . . . 286 Scan Dump Messages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 286 Common Firmware Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . 286 Scan Log Dump Progress Codes . . . . . . . . . . . . . . . . . . . . . . . . . . 294 Problem Determination Generated Error Codes . . . . . . . . . . . . . . . . . . . . . 295 Chapter 6. Using the Online and Standalone Diagnostics . . . . . . . . . . . . . . . . 297 Operating Considerations for Online and Standalone Diagnostics . . . . . . . . . . . . . . 297 Identifying the Terminal Type to the Diagnostics. . . . . . . . . . . . . . . . . . . . 298 Undefined Terminal Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . 298 Resetting the Terminal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298 Online Diagnostics Modes of Operation. . . . . . . . . . . . . . . . . . . . . . . . 298 Service Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298 Concurrent Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 299 Maintenance Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 300 Standalone Diagnostics Operations . . . . . . . . . . . . . . . . . . . . . . . . . 301 Performing Slow Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301 Considerations for Running Standalone Diagnostics on a Partitioned Systems . . . . . . . . 301 Considerations for Running Standalone Diagnostics from CD-ROM. . . . . . . . . . . . . 301 NIM Server Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302 Client Configuration and Booting Standalone Diagnostics from the NIM Server . . . . . . . . 303 Running the Online Diagnostics with No HMC Attached . . . . . . . . . . . . . . . . . . 304 Chapter 7. Using the Service Processor. . . . . . . . . . . . . . . . . . . . . . . 305 Service Processor Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305 Accessing the Service Processor Menus Locally . . . . . . . . . . . . . . . . . . . 305 Accessing the Service Processor Menus Remotely. . . . . . . . . . . . . . . . . . . 306 Saving and Restoring Service Processor Settings . . . . . . . . . . . . . . . . . . . 306 Menu Inactivity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 306 General User Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 306 Privileged User Menus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307 Main Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307 Service Processor Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . 309 Passwords . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 309 System Power Control Menu. . . . . . . . . . . . . . . . . . . . . . . . . . . 314 System Information Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317 Language Selection Menu. . . . . . . . . . . . . . . . . . . . . . . . . . . . 324 Call-In/Call-Out Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 325 Modem Configuration Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 326 Serial Port Selection Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 326 Serial Port Speed Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . 327 Telephone Number Setup Menu. . . . . . . . . . . . . . . . . . . . . . . . . . 327 Call-Out Policy Setup Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . 328 Customer Account Setup Menu. . . . . . . . . . . . . . . . . . . . . . . . . . 329 Call-out Test Menu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329 Service Processor Parameters in Service Mode (Full System Partition) . . . . . . . . . . . . 330 Contents v System Power-On Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330 Service Processor Reboot/Restart Recovery . . . . . . . . . . . . . . . . . . . . . . 331 Boot (IPL) Speed . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 331 Failure During Boot Process . . . . . . . . . . . . . . . . . . . . . . . . . . . 331 Failure During Normal System Operation . . . . . . . . . . . . . . . . . . . . . . 331 Service Processor Reboot/Restart Policy Controls . . . . . . . . . . . . . . . . . . . 331 System Firmware Updates. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 332 General Information on System Firmware Updates. . . . . . . . . . . . . . . . . . . 333 Determining the Level of Firmware on the System . . . . . . . . . . . . . . . . . . . 334 Updating System Firmware From the Service Processor Menus . . . . . . . . . . . . . . 334 Updating System Firmware from a NIM Server . . . . . . . . . . . . . . . . . . . . 335 Recovery Mode. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 335 Configuring and Deconfiguring Processors or Memory . . . . . . . . . . . . . . . . . . 335 Run-Time CPU Deconfiguration (CPU Repeat Gard) . . . . . . . . . . . . . . . . . . 336 Service Processor System Monitoring - Surveillance . . . . . . . . . . . . . . . . . . . 336 System Firmware Surveillance . . . . . . . . . . . . . . . . . . . . . . . . . . 336 Operating System Surveillance . . . . . . . . . . . . . . . . . . . . . . . . . . 336 Call-Out (Call-Home). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 337 Console Mirroring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338 System Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338 Service Processor Error Logs . . . . . . . . . . . . . . . . . . . . . . . . . . . 339 LCD Progress Indicator Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340 Service Processor Operational Phases . . . . . . . . . . . . . . . . . . . . . . . . 341 Pre-Standby Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 341 Standby Phase. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 341 Bring-Up Phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 342 Runtime Phase. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 343 Chapter 8. Using System Management Services . . . . . . . . . . . . . . . . . . . 345 Select Language . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 346 Change Password Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347 View Error Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347 Setup Remote IPL (Initial Program Load) . . . . . . . . . . . . . . . . . . . . . . . 348 Change SCSI Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351 Select Console . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351 Select Boot Options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 352 Select Boot Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 354 Display Current Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 355 Restore Default Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 356 Multiboot Startup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 356 Exiting System Management Services . . . . . . . . . . . . . . . . . . . . . . . . 356 Chapter 9. Removal and Replacement Procedures. . . . . . . . . . . . . . . . . . . 357 Procedure List . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 358 Handling Static-Sensitive Devices . . . . . . . . . . . . . . . . . . . . . . . . . . 359 Stopping the System without an HMC Attached . . . . . . . . . . . . . . . . . . . . . 359 Stopping the System with an HMC Attached and AIX Installed . . . . . . . . . . . . . . . 359 Stopping the System with an HMC Attached and Linux Installed. . . . . . . . . . . . . . . 360 Starting the System without an HMC Attached . . . . . . . . . . . . . . . . . . . . . 360 Starting the System with an HMC Attached . . . . . . . . . . . . . . . . . . . . . . 361 286/C4 Drawer-Release Latch and Brackets . . . . . . . . . . . . . . . . . . . . . . 362 Drawer-Release Latch Removal. . . . . . . . . . . . . . . . . . . . . . . . . . 362 Drawer-Release Latch Replacement . . . . . . . . . . . . . . . . . . . . . . . . 363 Drawer-Release Latch Bracket Removal . . . . . . . . . . . . . . . . . . . . . . 363 Drawer-Release Latch Bracket Replacement . . . . . . . . . . . . . . . . . . . . . 364 Placing the 286/C4 into the Service Position . . . . . . . . . . . . . . . . . . . . . . 364 vi Service Guide Returning the 286/C4 to the Operating Position . . . . . . . . . . . . . . . . . . . . . 365 Covers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 366 Service Access Cover Removal (286/C4) . . . . . . . . . . . . . . . . . . . . . . 366 Service Access Cover Replacement (286/C4). . . . . . . . . . . . . . . . . . . . . 366 Top Cap Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . . 367 Top Cap Replacement (286/E4). . . . . . . . . . . . . . . . . . . . . . . . . . 367 Top Cover Removal (286/E4). . . . . . . . . . . . . . . . . . . . . . . . . . . 368 Top Cover Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . 369 Outer Side Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . 370 Outer Side Cover Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . 370 Service Access Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . 371 Service Access Cover Replacement (286/E4). . . . . . . . . . . . . . . . . . . . . 372 Back Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . 372 Back Cover Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . 373 Disk Drive Access Cover Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . 373 Disk Drive Access Cover Replacement (286/E4). . . . . . . . . . . . . . . . . . . . 373 Right Side Cover (System Base) Removal (286/E4) . . . . . . . . . . . . . . . . . . 374 Right Side Cover (System Base) Replacement (286/E4). . . . . . . . . . . . . . . . . 375 Bezel Door (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376 Bezel Door Removal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376 Bezel Door Replacement . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376 Bezels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 377 Front Bezel Removal (286/C4) . . . . . . . . . . . . . . . . . . . . . . . . . . 377 Front Bezel Replacement (286/C4) . . . . . . . . . . . . . . . . . . . . . . . . 377 Front Bezel Removal (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . . . 378 Front Bezel Replacement (286/E4) . . . . . . . . . . . . . . . . . . . . . . . . 378 PCI Adapters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 379 PCI Hot-Plug Manager Access . . . . . . . . . . . . . . . . . . . . . . . . . . 379 Non-Hot-Pluggable PCI Adapter Removal . . . . . . . . . . . . . . . . . . . . . . 381 Adding or Replacing a Non-Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . 383 Hot-Pluggable PCI Adapter Removal . . . . . . . . . . . . . . . . . . . . . . . . 385 Replacing a Hot-Pluggable PCI Adapter. . . . . . . . . . . . . . . . . . . . . . . 388 Installing a Hot-Pluggable PCI Adapter . . . . . . . . . . . . . . . . . . . . . . . 392 Fans. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 395 Removing a Processor Fan in a Non-Hot-Swap Environment . . . . . . . . . . . . . . . 395 Replacing a Processor Fan in a Non-Hot-Swap Environment . . . . . . . . . . . . . . . 396 Hot-Swap Fan Assembly Removal. . . . . . . . . . . . . . . . . . . . . . . . . 397 286/C4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 398 286/E4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 399 Hot-Swap Fan Assembly Replacement . . . . . . . . . . . . . . . . . . . . . . . 399 Media Drives (Diskette, CD-ROM, Tape, or Disk Drive) . . . . . . . . . . . . . . . . . . 401 Media Removal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 401 286/C4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 402 286/E4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 403 Media Replacement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 404 Hot-Plug Disk Drives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 405 Deconfiguring (Removing) or Configuring a Disk Drive . . . . . . . . . . . . . . . . . 405 Deconfiguring (Removing). . . . . . . . . . . . . . . . . . . . . . . . . . . . 405 Configuring (Replacing). . . . . . . . . . . . . . . . . . . . . . . . . . . . . 406 Hot-Plug Disk Drive Removal . . . . . . . . . . . . . . . . . . . . . . . . . . 406 Hot-Plug Disk Drive Replacement . . . . . . . . . . . . . . . . . . . . . . . . . 408 PCI Riser Card. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409 PCI Riser Card Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409 PCI Riser Card Replacement. . . . . . . . . . . . . . . . . . . . . . . . . . . 413 Disk Drive Backplane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415 Disk Drive Backplane Removal . . . . . . . . . . . . . . . . . . . . . . . . . . 415 Contents vii Disk Drive Backplane Replacement . . . . . . . . . . . . . . . . . . . . . . . . 418 Operator Panel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419 Operator Panel Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419 286/C4. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419 286/E4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 420 Operator Panel Replacement. . . . . . . . . . . . . . . . . . . . . . . . . . . 420 System Vital Product Data (VPD) Update Procedure . . . . . . . . . . . . . . . . . . . 421 Processor Card. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 423 Processor Card Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 423 Adding or Replacing a Processor Card . . . . . . . . . . . . . . . . . . . . . . . 426 Memory DIMMs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 427 Memory DIMM Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 427 Memory DIMM Replacement. . . . . . . . . . . . . . . . . . . . . . . . . . . 431 Replacing the Battery . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 432 Power Supplies. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 436 Non-Hot-Swap Power Supply Removal . . . . . . . . . . . . . . . . . . . . . . . 436 Non-Hot-Swap Power Supply Replacement . . . . . . . . . . . . . . . . . . . . . 438 Hot-Swap Power Supply Removal. . . . . . . . . . . . . . . . . . . . . . . . . 439 Hot-Swap Power Supply Replacement . . . . . . . . . . . . . . . . . . . . . . . 441 Central Electronics Complex (CEC) Backplane . . . . . . . . . . . . . . . . . . . . . 443 CEC Backplane Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 443 CEC Backplane Replacement . . . . . . . . . . . . . . . . . . . . . . . . . . 448 Chapter 10. Parts Information. . . . . . . . . . . . . . . . . . . . . . . . . . . 451 System Parts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 452 System Parts (continued) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 454 286/E4 Cover Set and Brackets. . . . . . . . . . . . . . . . . . . . . . . . . . . 456 System Internal Cables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 459 Power Cable. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 462 Keyboards and Mouse (White) . . . . . . . . . . . . . . . . . . . . . . . . . . . 463 Chapter 11. Fault and Attention LEDs. . . . . . . . . . . . . . . . . . . . . . . . 465 Operator Panel Display. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 465 Component LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 466 Resetting the LEDs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 469 Appendix A. Environmental Notices . . . . . . . . . . . . . . . . . . . . . . . . 471 Product Recycling and Disposal. . . . . . . . . . . . . . . . . . . . . . . . . . . 471 Acoustical Noise Emissions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 472 Declared Acoustical Noise Emissions. . . . . . . . . . . . . . . . . . . . . . . . . 472 Appendix B. Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 473 Appendix C. Service Processor Setup and Test. . . . . . . . . . . . . . . . . . . . 475 Service Processor Setup Checklist. . . . . . . . . . . . . . . . . . . . . . . . . . 475 Testing the Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 476 Testing Call-In . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 476 Testing Call-Out . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 476 Serial Port Configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . 477 Appendix D. Setting Up the System Rack . . . . . . . . . . . . . . . . . . . . . . 479 Rack Specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 479 Model 14T/0 and 14T/4 Rack . . . . . . . . . . . . . . . . . . . . . . . . . . 479 Noise Emission Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 480 System Service Clearances . . . . . . . . . . . . . . . . . . . . . . . . . . . 481 286/C4 Rack Installation Inventory. . . . . . . . . . . . . . . . . . . . . . . . . . 481 viii Service Guide Read the Rack Safety Instructions. . . . . . . . . . . . . . . . . . . . . . . . . . 484 Rack Safety Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 484 Attaching the Mounting Hardware to the Rack Enclosure . . . . . . . . . . . . . . . . . 485 System Rail Assembly (Type 1). . . . . . . . . . . . . . . . . . . . . . . . . . 490 System Rail Assembly (Type 2). . . . . . . . . . . . . . . . . . . . . . . . . . 493 286/C4 Removal and Installation into a Rack. . . . . . . . . . . . . . . . . . . . . . 496 System Drawer Removal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 496 Installing the 286/C4 into the Rack Enclosure. . . . . . . . . . . . . . . . . . . . . 497 Installing the Cable-Management Arm . . . . . . . . . . . . . . . . . . . . . . . . 501 Appendix E. Installing a Hardware Management Console (HMC) . . . . . . . . . . . . . 503 Position the HMC and Monitor . . . . . . . . . . . . . . . . . . . . . . . . . . . 503 Connect the Cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 504 Connect the 8-Port Adapter Cables . . . . . . . . . . . . . . . . . . . . . . . . . 506 Connect the External Modem . . . . . . . . . . . . . . . . . . . . . . . . . . . 506 Check the Microswitch Setting on the Modem . . . . . . . . . . . . . . . . . . . . 507 Connect the LAN Cable (If Applicable) . . . . . . . . . . . . . . . . . . . . . . . . 508 Plug in the HMC Power Cable . . . . . . . . . . . . . . . . . . . . . . . . . . . 508 Configure the Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 508 Configure Inventory Scout Services . . . . . . . . . . . . . . . . . . . . . . . . . 508 Collect Vital Product Data Information . . . . . . . . . . . . . . . . . . . . . . . 509 Configure the Service Agent . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510 Appendix F. Modem Configurations . . . . . . . . . . . . . . . . . . . . . . . . 513 Sample Modem Configuration Files . . . . . . . . . . . . . . . . . . . . . . . . . 513 Configuration File Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . 513 Transfer of a Modem Session . . . . . . . . . . . . . . . . . . . . . . . . . . . 517 Modem Configuration Sample Files . . . . . . . . . . . . . . . . . . . . . . . . . 519 Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 535 Contents ix x Service Guide Safety Notices A danger notice indicates the presence of a hazard that has the potential of causing death or serious personal injury. Danger notices appear on the following pages: v xii v 89 v 90 v 357 v 436 A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor personal injury. Caution notices appear on the following pages: v xii v xiii v 89 v 357 Note: For a translation of these notices, see System Unit Safety Information, order number SA23-2652. Rack Safety Instructions v Do not install this unit in a rack where the internal rack ambient temperatures will exceed 35 degrees C. v Do not install this unit in a rack where the air flow is compromised. Any side, front or back of the unit used for air flow through the unit must not be in direct contact with the rack. v Care should be taken to ensure that a hazardous condition is not created due to uneven mechanical loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before installing or removing this unit. v Consideration should be given to the connection of the equipment to the supply circuit so that overloading of circuits does not compromise the supply wiring or overcurrent protection. To provide the correct power connection to the rack, refer to the rating labels located on the equipment in the rack to determine the total power requirement for the supply circuit. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. xi Electrical Safety Observe the following safety instructions any time you are connecting or disconnecting devices attached to the workstation. In the system you are about to setup or service: v The ac power interface connector is considered the main power disconnect device. v This system has redundant power supply capabilities, meaning that it has the ability to have two power supplies running simultaneously in the same system unit. When instructed to disconnect the power source, ensure that all power cables have been unplugged. DANGER To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before relocating the system. D01 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock. C01 DANGER To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before relocating the system. D01 xii Service Guide Laser Safety Information CAUTION: This product may contain a CD-ROM, DVD-ROM, or laser module on a PCI card, which are class 1 laser products. C30 Laser Compliance All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with the IEC 825 (first edition 1984) as a class 1 laser product. Consult the label on each part for laser certification numbers and approval information. CAUTION: All laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation, user maintenance, or prescribed service conditions. Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels. For this reason, never look into the end of an optical fiber cable or open receptacle. Only trained service personnel should perform the inspection or repair of optical fiber cable assemblies and receptacles. C25, C26 Safety Notices xiii xiv Service Guide Data Integrity and Verification These computer systems contain mechanisms designed to reduce the possibility of undetected data corruption or loss. This risk, however, cannot be eliminated. Users who experience unplanned outages, system failures, power fluctuations or outages, or component failures must verify the accuracy of operations performed and data saved or transmitted by the system at or near the time of the outage or failure. In addition, users must establish procedures to ensure that there is independent data verification before relying on such data in sensitive or critical operations. Users should periodically check our support websites for updated information and fixes applicable to the system and related software. xv xvi Service Guide About This Book This book provides maintenance information that is specific to the 286/C4 and 286/E4, as well as adapters and attached devices that do not have their own service information. It also contains Maintenance Analysis Procedures (MAPs) that are not common to other systems. In this book, the 286/C4 and 286/E4 are hereafter referred to as the system. MAPs that are common to all systems are contained in the Diagnostic Information for Multiple Bus Systems. This book is used by the service representative to repair system failures. This book assumes that the service representative has had training on the system. Highlighting The following highlighting conventions are used in this book: Bold Identifies commands, subroutines, keywords, files, structures, directories, and other items whose names are predefined by the system. Also identifies graphical objects such as buttons, labels, and icons that the user selects. Italics Identifies parameters whose actual names or values are to be supplied by the user. Monospace Identifies examples of specific data values, examples of text similar to what you might see displayed, examples of portions of program code similar to what you might write as a programmer, messages from the system, or information you should actually type. ISO 9000 ISO 9000 registered quality systems were used in the development and manufacturing of this product. Related Publications The following publications provide additional information about your system: v The 286/C4 and 286/E4 Installation Guide, order number 86 A1 40EG, contains information on how to set up and cable the system, install and remove options, and verify system operation. v The 286/C4 and 286/E4 User’s Guide, order number 86 A1 41EG, contains information to help users use the system, use the service aids, and solve minor problems. v The D20 Installation Guide, order number 86 A1 39EG contains information on how to set up, cable, install and remove options, and verify the subsystem operations. v The D10 and D20 Service Guide, order number 86 A1 38EG, contains reference information, maintenance analysis procedures (MAPs), error codes, removal and replacement procedures, and a parts catalog. v The Diagnostic Information for Multiple Bus Systems, order number 86 A1 26HX, contains diagnostic information, service request numbers (SRNs), and failing function codes (FFCs). v The Adapters Information for Multiple Bus Systems, order number 86 A1 27HX, contains information about adapters for your system. This manual is intended to supplement the service information found in the Diagnostic Information for Multiple Bus Systems. v The Hardware Management Console Installation and Operations Guide, order number 86 A1 83EF, contains information on how to set up a hardware management console and how to use it to manage a system. v The Site Preparation for Rack Systems, order number 86 A1 30PX, contains information to help you plan your installation. xvii v The System Unit Safety Information, order number 86 X1 11WD, contains translations of safety information used throughout this book. xviii Service Guide Notes: In this manual, 286/C4 stand for PL420R and 286/E4 stands for PL420T. The PL420 systems do not have the 4-slotted PCI riser card, but only the 6-slotted one. The PL420 systems have only the POWER4+ 1.2 and 1.45 GHz processors. Chapter 1. Reference Information This chapter provides an overview of the system, including a logical description and a physical overview. The following additional details pertaining to the system are also provided: v Memory overview and placement v General description of the operator panel v Cabling rules v System location rules and descriptions v Power flow v Data flow Overview The 286/C4 and 286/E4 have similar operational capabilities but are packaged differently. The 286/E4 is a deskside system while the 286/C4 is a 4 EIA-unit drawer designed to be housed in a 19-inch rack. Both models can be configured as a 1-way to 4-way symmetric multiprocessing (SMP) system; each can have up to 4 hot-plug disk drives and 6 hot-plug PCI adapters. Both models support redundant hot-plug power supplies, cooling fans, and PCI adapters. Power is provided through ac power cords. Either model can be set up to use 100-127 V ac or 200-240 V ac power. The two media bays can be used to accommodate any of the following drives: v SCSI DVD-RAM drive v IDE DVD-ROM drive v IDE CD-ROM drive v Diskette drive (special mounting carriage needed) v Tape drive Note: A CD-ROM is recommended for use in the base configuration of either model. 1 System Features Bus Architecture Depending on your system configuration, four or six PCI-X slotted riser cards are available. The slots are 64-bit capable at 133 MHz, 3.3 volts. If your system has four PCI-X slots, use slots 3 and 4 with PCI adapters that require high performance. Use slots 1 and 2 with PCI adapters that do not require high performance. On the six-slotted PCI riser cards, all slots are equal. All slots are capable of using the full-length PCI-X adapters. The PCI-X slots are numbered on the rear of the chassis, from left to right. When equipped with POWER4+ processors and a six-slotted PCI riser card, the 286/C4 and 286/E4 include six hot-plug PCI-X slots, dual integrated Ultra3 SCSI controllers, dual 10/100 Mbps integrated Ethernet controllers, and four front-accessible disk bays supporting hot-swappable disks. These disk bays can accommodate up to 587.2 GB of disk storage using 146.8 GB Ultra3 SCSI disk drives. Two media bays are used for a CD-ROM, DVD-RAM, DVD-ROM, or another optional media device, such as a tape or diskette drive. The Converged Service Processor 2 (CSP), including system power control, is also integrated, along with the native I/O functions such as serial ports, keyboard, and mouse. Also included are an enhanced I/O subsystem with the implementation of the remote I/O (RIO) interconnect and PCI-X bus protocols. The 286/C4 can support up to two high-density D20 I/O drawers to provide additional PCI-X slots and disk drive bays. A fully configured system with two I/O drawers has 20 PCI-X slots and 4.1 TB of disk space. Optionally redundant hot-plug cooling fans and power supplies are available. All PCI-X slots can run either 64-bit or 32-bit adapters. However, a 32-bit adapter operates in a 32-bit mode and shows no performance advantage while running in a 64-bit slot. The following illustration shows the PCI adapter slot locations when viewing from the rear of the system. 1 286/C4 Using a 4-Slotted PCI Riser Card 3 286/E4 Using a 4-Slotted PCI Riser Card 2 286/C4 Using a 6-Slotted PCI Riser Card 4 286/E4 Using a 6-Slotted PCI Riser Card 2 Service Guide Processors v One-way and 2-way processor cards with GP 1.0 GHz POWER4 microprocessors with 32 MB of L3 cache per processor card. Only a single 1-way card is allowed per server at a time. A 3-way configuration is not supported. v One-way and 2-way processor cards with GQ 1.45 GHz POWER4+ microprocessors with 8 MB of L3 cache per processor card. Only a single 1-way card is allowed per server at a time. A 3-way configuration is not supported. POWER4 and POWER4+ cards cannot be intermixed. v A 1-way to a 2-way upgrade is available by feature conversion. v A POWER4 to POWER4+ upgrade is available by feature conversion and requires the redundant cooling option, a 6-slot riser card, and a RIO-enabled planar. v L3 cache capacity per 1.0 GHz GP processor configuration: – One-way configuration = 1 L3 chip with a 32 MB memory capacity – Two-way configuration = 1 L3 chip with a 32 MB memory capacity – Four-way configuration = 2 L3 chips per processor, each with 32 MB memory (64 MB memory capacity) v L3 cache capacity per 1.45 GHz GQ processor configuration: – One-way configuration = 1 L3 chip with an 8 MB memory capacity – Two-way configuration = 1 L3 chip with an 8 MB memory capacity – Four-way configuration = 2 L3 chips per processor, each with 8 MB memory (32 MB memory capacity) Memory v 1 GB to 32 GB ECC (error code correction) DDR (double data rate) SDRAM (synchronous dynamic random-access memory). v Memory DIMMs plug into the processor cards (8 DIMM slots per card). v DIMMs must be populated in quads (four DIMMs). A memory feature consists of a quad. Additional quads can consist of any memory size. v A system with a single processor card (1-way or 2-way) can have a maximum of 16 GB of memory. Media Drives Typical system configuration consists of two media bays. One of the media bays can be configured with either IDE CD-ROM or IDE DVD-ROM. v Media bay 1 can accommodate an IDE CD-ROM or IDE DVD-ROM. v Media bay 2 can accommodate a SCSI DVD-RAM, diskette drive, or tape drive. Note: The SCSI DVD-RAM can read CD-ROM installation media. The following media drives are available for the 286/C4 and 286/E4. – Diskette drive – DVD-RAM drive 4.7 GB capacity – 650 MB IDE 48x CD-ROM drive – 4.7 GB IDE 16x/48x DVD-ROM drive – 80/160 GB VXA tape drive – 8 mm 60/150 GB tape drive – 4 mm 20/40 GB tape drive Chapter 1. Reference Information 3 Hot-Pluggable Disk Drives Four hot-plug disk-drive bays: v 18.2 GB to 587.2 GB of disk storage v The following disk-drive sizes and speeds are available: – 18.2 GB Ultra3 10K RPM 1 inch – 36.4 GB Ultra3 10K RPM 1 inch – 73.4 GB Ultra3 10K RPM 1 inch – 146.8 GB Ultra3 10K RPM 1 inch – 36.4 GB Ultra3 15K RPM 1 inch – 73.4 GB Ultra3 15K RPM 1 inch PCI-X Slots and Integrated Adapters The following devices are included in the 286/C4 and 286/E4: v Six hot-plug PCI-X slots, 64-bit, 133 MHz, 3.3 volt on POWER4+ processor models, four slots on POWER models. v Integrated ports: – Two 10/100 Ethernet (IEEE 802.3 compliant). – Two Ultra3 SCSI (one external Ultra3 SCSI (with VHDCI 4 mini 68-pin port), and one internal Ultra3 SCSI disk drive backplane). VHDCI may require a mini 68-pin connector or FC 2118 mini 68-pin to 68-pin 0.3 meter cable as an additional feature. – Three serial. Serial port 1 (S1) has two physical connectors, one RJ-48 connector located in front on the operator panel and a 9-pin D-shell connector located on the rear of the chassis. The use of the front port disables the rear S1 port. – One parallel (the parallel port is not accessible when in partitioned mode). – Keyboard and mouse. I/O Expansion Drawer Expansion is provided by adding additional 11D/20 I/O drawers (up to two) for the 286/C4. The D20 provides additional PCI-X slots and disk drive bays for the 286/C4. v Seven hot-plug PCI-X slots, 64-bit, 133 MHz, 3.3 volt v Up to 12 hot-pluggable disk drive bays The optional disk backplane consists of two 6-pack disk bays and requires SCSI cables to connect a required Ultra3 SCSI PCI adapter or Ultra3 SCSI RAID adapter, in slot 7 to each of the 6-packs, or two SCSI adapters (one in slot 4 and one in slot 7 of the I/O drawer). Power Supplies Up to two power supplies maximum per system; the second power supply is for redundant power. v AC input type 100-127 V ac or 200-240 V ac v Single phase v Frequency = 47 Hz - 63 Hz Keyboard v Standard: 101-key enhanced keyboard v 101/102 or 106-key enhanced keyboard is also available Mouse v Three-button 4 Service Guide Operator Panel v 32-character LED diagnostics display v LEDs for power on, attention, SCSI activity, and LAN activity v Buttons for power on, system reset, and service processor reset switch: System Reset Button Dumps AIX (if dump is enabled) and reboots Service Processor Reset Switch Resets the service processor (into standby mode). The service processor reset switch is the pinhole located on the operator panel above the system reset button. Input/Output Ports v One 25-pin parallel v Keyboard v Mouse v Two Ultra3 SCSI (one to internal 4-pack disk drive backplane, one to external VHDCI mini 68-pin connector) v Integrated Drive Electronics (IDE) v Two System Power Control Network (SPCN) Connectors v Two RIO Connectors v Two 10/100 Ethernet (IEEE 802.3 compliant) v Three serial ports. Serial port 1 (S1) has two physical connectors, one RJ-48 connector located in front on the operator panel, and a 9-pin D-shell connector located on the rear of the chassis. The other two serial ports are 9-pin D-shell connectors located on the rear of the system chassis. 1 Serial Port 1 Connector 3 Serial Port 3 Connector 1a Serial Port 1 Front Connector (RJ-48) 4 Parallel Connector 2 Serial Port 2 Connector Chapter 1. Reference Information 5 Current usage for the serial port connectors are as follows: Serial Port Number Location Examples of Applicable Usage Serial Port 1 (S1 Front) Operator Panel Service Agent, PDA system management applications (for example, handheld devices, laptop systems), Service Processor menus, ASCII terminal for the operating system console, and modems Serial Port 1 (S1 Rear) Rear of the System Service Processor menus, Service Agent, PDA system management applications (interface cable required), ASCII terminal for the operating system console, and modems Serial Port 2 (S2) Rear of the System Service Processor menus, HACMP, ASCII terminal for the operating system console, and modems Serial Port 3 (S3) Rear of the System HACMP, UPS (uninterruptible power supply), ASCII terminal for the operating system console, and modems Notes: 1. Serial port 1 is never used to run HACMP or attach a UPS. If you are configuring your system to run HACMP with a UPS attached, you must connect the HACMP cable to serial port 2 and the UPS cable to serial port 3. Do not run UPS connected to serial port 2. If you disconnect HACMP, you must reset the service processor using the pinhole reset switch before running another application. The service processor pinhole reset switch is located on the operator panel. 2. A modem can be attached to any of the serial ports. Security Features On the 286/C4 and 286/E4, you can set the following types of passwords to limit access to these systems: v General-access password - set only from the service processor menus. It provides limited access to the service processor menus and is usually available to all users who are allowed to power on the system. v Privileged-access password - Set from the service processor menus or from System Management Services (SMS) utilities. Used by the system administrator or root user and allows access to all service processor functions. Hardware Management Console (HMC) The Hardware Management Console (HMC) is an optional feature that allows you to manage configuration and operation of partitions in a system, as well as add and remove hardware without interrupting system operation. In this book, a system that is managed by the HMC is referred to as the managed system. The HMC uses its serial connection to the managed system to perform various functions. The HMC’s main functions include the following: v Detecting, reporting, and storing changes in hardware conditions v Acting as a service focal point for service representatives to determine an appropriate service strategy Service representatives use the Service Focal Point application on the HMC to start and end their service calls. Service Focal Point provides service representatives with serviceable event information, vital product data (VPD), and diagnostic information. The HMC is a closed system. Additional applications cannot be loaded on the HMC. All the tasks needed to maintain the platform, the underlying operating system, and the HMC application code are available by using the HMC’s management applications. 6 Service Guide Powering the System on and Off For procedures to power off and power on the system for various system configurations and operating systems, see “Stopping the System without an HMC Attached” on page 359 and “Starting the System without an HMC Attached” on page 360. Console Strategy Note: This section applies to a system that does not have an HMC attached. The firmware starts a console-selection sequence at system boot time if any of the following is true: v A console has not yet been selected. v A previous console-selection sequence timed out. v A change in the system configuration affects the console (for example, keyboard installed or removed, mouse installed or removed, graphics adapter installed/removed or moved to another PCI slot). The console-selection sequence allows you to select (from the appropriate input device) any one of the available console devices. If no console is selected within approximately 60 seconds, serial port 1 (S1) is selected as the console and the selection sequence times out. Attention: If an ASCII terminal is attached to serial port 1 (S1), and there is any interaction with this terminal: v After OK displays in the operator panel AND v Before the power-on sequence is initiated the firmware will use this terminal as the console, regardless of the previous console selection. After a console has been selected, the console-selection sequence is started at boot time only if there is a change in the system configuration (as previously described), or the contents of the system’s nonvolatile memory (NVRAM) are lost. Note: Moving an ASCII terminal from one serial port to another (from S1 to S2) cannot be detected by the firmware, so it does not constitute a configuration change. You can also initiate a system console-selection sequence from the SMS menus. Power-On Self-Test After power is turned on and before the operating system is installed, the system does a power-on self-test (POST). This test performs checks to ensure that the hardware is functioning correctly before the operating system is installed. During the POST, a POST screen displays, and POST indicators appear on the firmware console (if one is connected). The next section describes the POST indicators and functions that can be accessed during the POST. POST Indicators POST indicators indicate tests that are being performed as the system is preparing to load the operating system. The POST indicators are words that display on the system console. Each time that the system starts a different step in the POST, a POST indicator word appears on the console. Each word is an indicator of the tests that are being performed. Chapter 1. Reference Information 7 The POST screen displays the following words: Memory Memory test Keyboard Initialize the keyboard and mouse. The time period for pressing a key to access the System Management Services, or to initiate a service mode boot is now open. See “POST Keys” for more information. Network Self-test on network adapters SCSI Adapters are being initialized Speaker Sounds an audible tone at the end of POST POST Keys The POST keys, if pressed after the keyboard POST indicator displays and before the last POST indicator speaker displays, cause the system to start services or to initiate service mode boots used for configuring the system and diagnosing problems. The keys are described below: Note: The program function keys (F1-F12) on a keyboard attached to the CEC drawer are not used and will be ignored. After the keyboard POST indicator displays, you must use the numeric number keys. 1 Key The numeric 1 key, when pressed during POST, starts the System Management Services (SMS) interface. 5 Key The numeric 5 key, when pressed during POST, initiates a system boot in service mode using the default service mode boot list. This mode attempts to boot from the first device of each type found in the list. It does not search for other bootable devices of that type if the first device is not bootable. Instead, it continues to the next device type in the list. The firmware supports up to five entries in the boot list. Note: This is the preferred method of loading standalone diagnostics from CD-ROM. The default boot sequence is: 1. Diskette (if installed) 2. CD-ROM (if installed) 3. Hard file 4. Tape drive (if installed) 5. Network a. Token ring b. Ethernet 6 Key The numeric 6 key works like the numeric 5 key, except that firmware uses the customized service mode bootlist that was set up in the AIX service aids or the service processor menus. Note: This is the preferred method of loading online diagnostics from the boot hard disk. 8 Service Guide System Unit Locations 286/C4 Front View 1 Operator Panel 4 Media Bay 2 Front Serial Port 5 Hot-Plug Disk Drives 3 CD-ROM Drive (optional) 286/C4 Rear View 1 PCI-X Slot Access 11a RIO-G0 Connector 2 Redundant Power Cable Receptacle 11b RIO-G1 Connector 3 Primary Power Cable Receptacle 12 Serial Connector 2 4 System Power Control Network Connector SPCN2 13 Serial Connector 3 5 System Power Control Network Connector SPCN1 14 Serial Connector 1 6 Parallel Connector 15 Mouse Connector 7 Hardware Management Console Connector HMC2 16 Keyboard Connector 8 Hardware Management Console Connector HMC1 17 Rack Indicator 9 Ethernet Connector 1 18 External SCSI Connector 10 Ethernet Connector 2 Chapter 1. Reference Information 9 286/E4 Front View 1 Operator Panel 4 Media bay 2 Front Serial Connector 5 Hot-Plug Disk Drives 3 CD-ROM Drive 10 Service Guide Rear View 1 PCI-X Slot Access 10 Ethernet Connector 2 2 Redundant Power Cable Receptacle 11 RIO Connector Cover 3 Primary Power Cable Receptacle 12 Serial Connector 2 4 System Power Control Network Connector SPCN2 13 Serial Connector 3 5 System Power Control Network Connector SPCN1 14 Serial Connector 1 6 Parallel Connector 15 Mouse Connector 7 Hardware Management Console Connector HMC2 16 Keyboard Connector 8 Hardware Management Console Connector HMC1 17 Rack Indicator 9 Ethernet Connector 1 18 External SCSI Connector Chapter 1. Reference Information 11 Power Supply Locations 286/C4 1 Power Supply #1 (Primary) 3 AC Good LED (Green) 2 Power Supply #2 (Redundant) 4 DC Good LED (Green) 5 Fault LED (Amber) 12 Service Guide 286/E4 1 Power Supply #2 (Redundant) (Front View) 5 AC Good LED (Green) 2 Power Supply #1 (Primary) (Front View) 6 Redundant Power Supply (Rear View) 3 Fault LED (Amber) 7 Primary Power Supply (Rear View) 4 DC Good LED (Green) Note: The preceding illustration does not show the LEDs located on the top of the power supplies. For location references, see the illustration on page 12 showing the power supplies for the 286/C4. Chapter 1. Reference Information 13 Fan Locations The following illustration identifies the system cooling fans. The fans have an amber LED located on top of the housing. A lit amber LED indicates that the fan is not operating correctly. The power supplies have two groups of three LEDs located on the front and top. The amber LED on the power supply indicates a problem with the power supply. If a power supply is diagnosed with a cooling problem, the entire power supply must be replaced. 1 Processor Card Cooling Fan (Primary Fan #1 ) 4 PCI Adapter Cooling Fan (Fan #3) 2 Processor Card Cooling Fan (Redundant Fan #2) 5 Stacked Power Supply Cooling Fans 3 PCI Adapter Cooling Fan (Fan #4) Note: The preceding illustration does not show the two cooling fans integrated into each power supply. 14 Service Guide CEC Backplane Locations The following illustration of the CEC backplane identifies the primary connectors used in your system. 1 CEC Backplane 17 Serial Port #3 Connector 2 Power Supply #1 (Primary) Connectors 18 Mouse 3 Power Supply #2 (Redundant) Connectors 19 Keyboard 4 SPCN1 Connector 20 Rack Indicator Connector 5 SPCN2 Connector 21 PCI Riser Card Connectors 6 HMC2 Connector 22 Battery Connector 7 Parallel Port Connector 23 For Manufacturing Use Only 8 HMC1 Connector 24 PCI Cooling Fan #3 and #4 Connector 9 Ethernet #1 Connector 25 IDE CD-ROM Signal Connector 10 Ethernet #2 Connector 26 Diskette Drive Signal Connector 11 Connector RIO-G1 Connector 27 Diskette Drive Power Connector 12 Processor Cooling Fan #1 Connector (Primary Fan) 28 Media Power Connector 13 Processor Cooling Fan #2 Connector (Redundant Fan) 29 Disk Drive Backplane Power Connector 14 Connector RIO-G0 Connector 30 Processor Card #2 Connector 15 Serial Port #2 Connector 31 For Manufacturing Use Only 16 Serial Port #1 Connector 32 Operator Panel Connector 33 Processor Card #1 Connector Chapter 1. Reference Information 15 PCI Riser Card Locations Depending on your system configuration, four-slotted or six-slotted PCI-X riser cards are available. The slots are 64-bit capable at 133 MHz, 3.3 volts. If your system has four PCI-X slots, use slots 3 and 4 with PCI adapters that require high performance. Use slots 1 and 2 with PCI adapters that do not require high performance. On the six-slotted PCi riser cards, all slots are equal. All slots are capable of using the full-length PCI-X adapters. The PCI-X slots are numbered on the rear of the chassis, from left to right. The PCI card has two integrated SCSI controllers. One SCSI controller (PCI, 64-bit) is a SYM53C1010 that operates at 66MHz and has dual LVD (low voltage differential) ports. The other is a 53C875, SE-SCSI controller and is only available on the 6 slotted PCI riser card. The internal port from SCSI controller SYM53C1010 connects to the DASD board through a 68-pin SCSI cable. The port from the SE-SCSI controller is available only on the six–slotted PCI riser card, and connects to the media bay through a 68-pin SCSI cable. All PCI-X slots can run either 64-bit or 32-bit adapters. However, a 32-bit adapter operates in a 32-bit mode and shows no performance advantage while running in a 64-bit slot. The following illustrations show the PCI adapter slot locations when viewing from the rear of the system. 1 286/C4 Using a 4-Slotted PCI-X Riser Card 3 286/E4 Using a 4-Slotted PCI-X Riser Card 2 286/C4 Using a 6-Slotted PCI-X Riser Card 4 286/E4 Using a 6-Slotted PCI-X Riser Card 16 Service Guide The following illustrations show the connector and slot locations on the four-slotted PCI riser card and six-slotted PCI riser card, respectively. Four-Slotted PCI Riser Card 1 PCI Slot 1 7 4 Slotted PCI Riser Card 2 PCI Slot 2 8 ²Push² Labels 3 PCI Slot 3 9 Internal Disk Drive SCSI Connector (for internal disk drive use only) 4 PCI Slot 4 10 External SCSI Connector 5 Manufacturing Use Only 11 Removal Sling 6 Riser Card to CEC Backplane Standoff Connectors (Quantity 4) located on the underside of the PCI Riser Card 12 Riser Card to Disk Drive Backplane Signal Connector Six-Slotted PCI Riser Card 1 PCI Slot 1 9 6-Slotted PCI Riser Card 2 PCI Slot 2 10 ²Push² Labels 3 PCI Slot 3 11 Internal Disk Drive SCSI Connector (for internal disk drive use only) 4 PCI Slot 4 12 Single-Ended SCSI to Media Connector 5 PCI Slot 5 13 External SCSI Connector 6 PCI Slot 6 14 Removal Sling 7 Manufacturing Use Only 15 Riser Card to Disk Drive Backplane Signal Connector 8 Riser Card to CEC Backplane Standoff Connectors (Quantity 4) located on the underside of the PCI Riser Card Chapter 1. Reference Information 17 Memory DIMM Location The memory DIMMs used in the 286/C4 and 286/E4 are located on the processor card. Your system contains either one or two processor cards, each card with 8 memory DIMM slots. Associated with each memory DIMM slot is an LED, which is located on the edge of the processor card. A lit LED indicates a problem with a specific memory DIMM. The following rules must be followed when removing or installing memory DIMMs: v Memory DIMMs must be installed in quads. However, you do not need to install four new DIMMs in a quad when you have a memory problem. Swap out each DIMM, starting the system after each change, until the faulty DIMM has been replaced. Note: A quad is a group of four memory DIMMs. Each memory DIMM in a quad must be the same type and size. However, both quads on the same processor card are not required to be the same. v The system must have at least four memory DIMMs installed (one quad). v The quad slot combinations are 1, 3, 6, and 8 (or 2, 4, 5, and 7). Note: 1, 3, 6, and 8 (or 2, 4, 5, and 7) combinations are the only possible slot combinations recognized by the system. v On a two-processor-card system, either processor card can be populated with memory. Note: A processor card containing no memory is still functional. (For example, a system containing two processor cards has all of its memory installed on processor card 1, and no memory installed on processor card 2. Processor card 2 has no memory-storage capacity, but functions as a processor card.) The following are valid memory configurations for one and two processor cards: Number of Processor Cards Allowable Memory Configurations 1 Processor Card 1 GB through 16 GB installed in quads of 256 MB DIMMs to 2 GB DIMMs 2 Processor Cards 1 GB through 32 GB installed in quads of 256 MB DIMMs to 2 GB DIMMs Note: The maximum amount of system memory varies by model. The following processor card illustration shows the memory DIMM slot locations, as well as the LED location associated with each memory DIMM slot. 1 Memory Module Slot 1 (J2A) 6 Memory Module Slot 6 (J1A) 2 Memory Module Slot 2 (J2B) 7 Memory Module Slot 7 (J0B) 3 Memory Module Slot 3 (J3A) 8 Memory Module Slot 8 (J0A) 4 Memory Module Slot 4 (J3B) 9 Processor Card 5 Memory Module Slot 5 (J1B) 10 Memory Module Fault LEDs 18 Service Guide Reading the Operator Panel and System LEDs The following illustration shows the component location on the operator panel. 1 Power-On Button 6 Operator Panel Display 2 Power LED 7 (FS1) Front Serial Connector (RJ-48 Connector) 3 Attention LED 8 Service Processor Reset Switch (Pinhole) 4 SCSI Port Activity 9 System Reset Button 5 Ethernet Port Activity Index Component Name Component Description 1 Power-On Button Turns the system power on and off. 2 Power LED Blinking - When connected to the power source (System is in standby mode). Solid - When power button has been pressed. Note: There is approximately a 30 second transition period from the time the power button is pressed to when the power LED goes from blinking to on solid. During the transition period, you may observe the blinking intervals speed up. 3 Attention LED Normal State - LED is off. 4 SCSI Port Activity Normal State - LED is on when there is SCSI activity. 5 Ethernet Port Activity Normal State - LED is on when there is Ethernet activity. 6 Operator Panel Display Displays current status of system startup, or diagnostic information in the event of a hardware problem. 7 Front Serial Connector (FS1) Serial port uses RJ-48 connector. Use to plug in external devices at the front of the system unit. 8 Service Processor Reset Switch (Pinhole) Service Personnel Use Only 9 System Reset Button Resets the system For more information about other system LEDs, see “Component LEDs” on page 466. Chapter 1. Reference Information 19 SCSI IDs and Bay Locations Note: To show the disk drive and disk drive bay locations, the 286/C4 and 286/E4 are shown with their bezels removed. Index Bay Location Drive Name SCSI ID 1 Operator Panel 2 D01 IDE CD-ROM (Default) or IDE DVD-ROM (Optional) IDE (Non-SCSI) SCSI ID 1 3 D02 Tape Drive (Optional) SCSI ID 0 SCSI DVD-RAM (Optional) SCSI ID 1 Diskette Drive (Optional) Not Applicable 4 D10 Disk Drive SCSI ID 8 5 D11 Disk Drive SCSI ID 9 6 D12 Disk Drive SCSI ID 10 7 D13 Disk Drive SCSI ID 11 Notes: 1. The SCSI bus IDs are the recommended values and indicate how the IDs are set when the system is shipped from the factory. Field installations might not comply with these recommendations. 2. Media bay location D02 is designed to house one of the following optional devices: v Tape Drive v SCSI DVD-RAM v Diskette Drive 20 Service Guide System Logic Flow When viewing the following system logic flow illustrations, select the diagram that is closest to your system configuration. Systems Configured with a GP Processor and Four-Slotted PCI Riser Card Chapter 1. Reference Information 21 Systems Configured with a GQ Processor and Six-Slotted PCI Riser Card 22 Service Guide Location Codes This system unit uses physical location codes in conjunction with AIX location codes to provide mapping of the failing field replaceable units. The location codes are produced by the system unit’s firmware and AIX. Note: When the system is running under the Linux operating system no ²Linux location codes² exist, but physical location codes might display in the Linux error logs. Physical Location Codes Physical location codes provide a mapping of logical functions in a platform (or expansion sites for logical functions, such as connectors or ports) to their specific locations within the physical structure of the platform. Location Code Format The location code is an alphanumeric string of variable length, consisting of a series of location identifiers, separated by a dash (-), or slash (/), or a pound sign (#) character. The series is hierarchical; that is, each location identifier in the string is a physical or logical child of the one preceding it. v The - (dash) separator character represents a normal structural relationship where the child is a separate physical package and it plugs into (or is connected to) the parent. For example, P1-C1 is a processor card (C1) plugged into a planar (P1), or P1-M1 is a memory card (M1) plugged into a planar (P1). v The / (slash) separator character separates the base location code of a function from any extended location information. A group of logical devices can have the same base location code because they are all on the same physical package, but may require extended location information to describe the connectors they support. For example, P2/S1 describes the location of the serial port 1 controller and its connector (S1), which is located on planar P2 (its base location code), but the / indicates that further devices can be connected to it at the external S1 serial connector. The keyboard controller and its connector likewise have location code P2/K1, which means they have the same base location code (P2) as serial port 1, but a different external connector. In contrast, the location code P2-K1 actually points to the device connected to connector K1; that is, the keyboard. The location code P2/Z1 indicates an integrated SCSI controller which drives connector Z1, while location codes of P2-Z1-... point to the actual SCSI bus and devices. v The # (pound sign) separator character indicates a cable connection between a connector and parent. The following are examples: v U0.1-P1-C1 identifies processor card C1 plugged into the CEC backplane P1. v U0.1-P1-C2-M1 identifies memory card M1 plugged into slot 1 on processor card C2 connected to the CEC backplane P1. v U0.1-P1/S1 identifies serial port 1 controller on the CEC backplane P1 or the connector for serial port 1. v U0.1-P1-K1 identifies a keyboard K1, attached to connector K1 on CEC backplane P1. v U0.1-P2/Z1 identifies an integrated SCSI port controller on PCI riser card P2 that drives internal SCSI drive connector Z1. v U0.1-P2-Z1-A8... points to the actual SCSI bus and devices attached to Z1; in this case, a hot-plug disk drive in bay location D10 with a SCSI ID of 8. Chapter 1. Reference Information 23 AIX Location Codes Note: When the system is running under the Linux operating system, AIX location codes are not available. The basic formats of the AIX location codes are as follows: v For non-SCSI devices/drives: – AB-CD-EF-GH v For SCSI devices/drives: – AB-CD-EF-G,H Non-SCSI Devices/Drives For planars, cards, and non-SCSI devices, the location code is defined as follows: AB-CD-EF-GH | | | | | | | Device/FRU/Port ID | | Connector ID | devfunc Number, Adapter Number or Physical Location Bus Type or PCI Parent Bus v The AB value identifies a bus type or PCI parent bus as assigned by the firmware. v The CD value identifies adapter number, the adapter’s devfunc number, or physical location. The devfunc number is defined as the PCI device number times 8, plus the function number. v The EF value identifies a connector. v The GH value identifies a port, address, device, or FRU. Adapters and cards are identified only with AB-CD. The possible values for AB are: 00 Processor bus 01 ISA bus 02 EISA bus 03 MCA bus 04 PCI bus used in the case where the PCI bus cannot be identified 05 PCMCIA buses xy For PCI adapters where x is equal to or greater than 1. The x and y are characters in the range of 0-9, A-H, J-N, P-Z (O, I, and lowercase are omitted) and are equal to the parent bus’s ’ibm, aix-location’ open firmware property. The possible values for CD depend on the adapter/card: v For pluggable PCI adapters/cards, CD is the device’s devfunc number (PCI device number times 8, plus the function number). The C and D are characters in the range of 0-9, and A-F (hex numbers). Location codes therefore uniquely identify multiple adapters on individual PCI cards. v For pluggable ISA adapters, CD is equal to the order of the ISA cards defined/configured either by SMIT or the ISA Adapter Configuration Service Aid. v For integrated ISA adapters, CD is equal to a unique code identifying the ISA adapter. In most cases, this code is equal to the adapter’s physical location code. In cases where a physical location code is not available, CD will be FF. EF is the connector ID. It is used to identify the adapter’s connector to which a resource is attached. GH is used to identify a port, device, or FRU. For example: v For ASYNC devices, GH defines the port on the fanout box. The values are 00 to 15. v For a diskette drive, H identifies either diskette drive 1 or 2. G is always 0. v For all other devices, GH is equal to 00. 24 Service Guide For an integrated adapter, EF-GH is the same as the definition for a pluggable adapter. For example, the location code for a diskette drive is 01-D1-00-00. A second diskette drive is 01-D1-00-01. SCSI Devices/Drives For SCSI devices, the location code is defined as follows: AB-CD-EF-G,H | | | | | | | | | Logical Unit address of the SCSI Device | | | Control Unit Address of the SCSI Device | | Connector ID | devfunc Number, Adapter Number or Physical Location Bus Type or PCI Parent Bus Where AB-CD-EF are the same as non-SCSI devices. G defines the control unit address of the device. Values of 0 to 15 are valid. H defines the logical unit address of the device. Values of 0 to 255 are valid. A bus location code is also generated as ’00-XXXXXXXX’ where XXXXXXXX is equivalent to the node’s unit address. Examples of physical location codes displayed by AIX are as follows: v Processor card C1 plugged into CEC backplane P1: U0.1-P1-C1 v Processor card C2 attached to CEC backplane P1: U0.1-P1-C2 v Memory DIMM in slot 2 of processor card C1 attached to CEC backplane P1: U0.1-P1-C1-M2 Examples of AIX location codes displayed are as follows: v Integrated PCI adapter: 10-80 Ethernet 10-60 Integrated SCSI Port 1 (internal) 10-88 Integrated SCSI Port 2 (external) Chapter 1. Reference Information 25 v Pluggable PCI adapters: Note: The number of PCI-X slots available is dependent on the number of slots your PCI riser card has available. Your system can be configured with either a four or six slotted PCI riser card. 20-58 to 20-5F Any PCI card in slot 1 20-60 to 20-67 Any PCI card in slot 2 10-68 to 10-6F Any PCI card in slot 3 10-70 to 10-77 Any PCI card in slot 4 10-78 to 10-7F Any PCI card in slot 5 10-80 to 10-87 Any PCI card in slot 6 v Integrated ISA adapters: 01-D1 Diskette adapter 01-R1 Parallel port adapter 01-S1 Serial port 1 adapter 01-S2 Serial port 2 adapter 01-S3 Serial port 3 adapter 01-K1 Keyboard adapter v Device attached to SCSI controller: 10-60-00-4,0 Device attached to integrated SCSI Port 1 Multiple FRU Callout Instructions If an eight-digit error code appears in the operator panel display or in Chapter 5, “Error Code to FRU Index”, on page 179, a location code for a failing part might also be specified. If the location code includes a blank space followed by a lowercase x followed by a number, this is an error code with multiple FRU callouts. This error can typically happen with memory DIMMs or processors, and might involve mixed types of parts. In this case, check the system’s configuration for FRU part numbers to determine the appropriate set of FRUs. To determine the FRU part numbers of the electronic assemblies in the entire system, use the service processor menus. From the General User menu, select Read VPD Image from Last System Boot, then enter 90 to display detailed Vital Product Data (VPD). To determine the FRU part numbers of the electronic assemblies in a partition by using the lscfg -vp | pg command on the AIX command line to display the detailed VPD of all assemblies. The FRU part number information for processors and memory DIMMs might be located at the bottom of the command output. 26 Service Guide FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to the following: v Rack identify LED v CEC drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – CEC subsystem FRUs – I/O subsystem FRUs v I/O adapter identify LED v DASD identify LED The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to locate the failing FRU more easily. Any identify LED in the system can be flashed when the system is in the failed state with power on by using the service processor LED Control Menu contained in the “System Information Menu” on page 317. Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is outlined in the Diagnostic Information for Multiple Bus Systems. If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of following: v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED, type FRU-LED-MENU on the command line. v If the system is logically partitioned, the HMC must be attached. For instructions on activating and deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management Console Installation and Operations Guide, order number SA38-0590. Chapter 1. Reference Information 27 Mapping AIX and Physical Location Codes Note: When the system is running under the Linux operating system, AIX location codes are not available. Use the following illustration to aid you in the mapping of a location code to its connector on the system backplane. 28 Service Guide Use the following illustration to aid you in the mapping of a location code to its DIMM slot on the processor cards. Use the following illustration to aid you in the mapping of a location code to its slot on the PCI riser card. Four-Slotted PCI Riser Card Location Codes Chapter 1. Reference Information 29 Six-Slotted PCI Riser Card Location Codes Disk Drive Backplane Location Codes Use the following illustration to aid you in the mapping of a location code to its slot on the disk drive backplane. 30 Service Guide AIX and Physical Location Code Table Note: When the system is running under the Linux operating system, AIX location codes are not available. This section covers the AIX and Physical Location Code tables for the 286/C4 and 286/E4, as well as the 11D/20 I/O Drawer. 286/C4 and 286/E4 Location Code Table Component Name Physical Location Code AIX Location Code Central Electronics Complex (CEC) Rack Frame U0 CEC Drawer U0.1 CEC Backplane U0.1-P1 PCI Riser Card U0.1-P2 Disk Drive Backplane U0.1-P3 Processor Cards Processor Card 1 U0.1-P1-C1 Processor Card 2 U0.1-P1-C2 Memory DIMMs on Processor Card 1 Memory DIMM 1 U0.1-P1-C1-M1 Memory DIMM 2 U0.1-P1-C1-M2 Memory DIMM 3 U0.1-P1-C1-M3 Memory DIMM 4 U0.1-P1-C1-M4 Memory DIMM 5 U0.1-P1-C1-M5 Memory DIMM 6 U0.1-P1-C1-M6 Memory DIMM 7 U0.1-P1-C1-M7 Memory DIMM 8 U0.1-P1-C1-M8 Memory DIMMs on Processor Card 2 Memory DIMM 1 U0.1-P1-C2-M1 Memory DIMM 2 U0.1-P1-C2-M2 Memory DIMM 3 U0.1-P1-C2-M3 Memory DIMM 4 U0.1-P1-C2-M4 Memory DIMM 5 U0.1-P1-C2-M5 Memory DIMM 6 U0.1-P1-C2-M6 Chapter 1. Reference Information 31 Component Name Physical Location Code AIX Location Code Memory DIMM 7 U0.1-P1-C2-M7 Memory DIMM 8 U0.1-P1-C2-M8 Integrated Devices Debug Connector (Manufacturing Use Only) Labeled Debug Connector Parallel Port U0.1-P1/R1 01-R1 ISA Bus U0.1-P1 1G-18 HMC2 Connector U0.1-P1/S5 HMC1 Connector U0.1-P1/S4 SPCN2 Connector U0.1-P1/Q5 SPCN1 Connector U0.1-P1/Q4 Keyboard Connector U0.1-P1/K1 01-K1-00 Keyboard U0.1-P1/K1-K1 01-K1-00-00 Mouse Connector U0.1-P1/O1 01-K1-01 Mouse U0.1-P1/O1-O1 01-K1-01-00 Ethernet Connector 1 U0.1-P1/E1 14-08 Ethernet Connector 2 U0.1-P1/E2 1L-08 RIO-G1 Connector U0.1-P1/Q3 RIO-G0 Connector U0.1-P1/Q2 Rack Indicator USB Connector U0.1-P1/Q1 Serial Connector 1 U0.1-P1/S1 01-S1 Serial Connector S1F (Located on Operator Panel) U0.1-P1/S1 01-S1 Serial Connector 2 U0.1-P1/S2 01-S2 Serial Connector 3 U0.1-P1/S3 01-S3 Diskette Port U0.1-P1/D1 01-D1 Diskette U0.1-P1-D1 01-D1-00-00 CD-ROM (IDE) U0.1-P1/Q6-A0 1G-19-00 Base CD-ROM (IDE) Controller U0.1-P1/Q6 1G-19 Pluggable Adapters PCI Host Bridge 0 U0.1-P2 PCI Host Bridge 1 U0.1-P2 PCI Slot 1 U0.1-P2/I1 PCI Slot 1 Card U0.1-P2-I1 1V-08 PCI Slot 2 U0.1-P2/I2 PCI Slot 2 Card U0.1-P2-I2 1H-08 PCI Host Bridge 2 U0.1-P2 PCI Slot 3 U0.1-P2/I3 PCI Slot 3 Card U0.1-P2-I3 1n-08 PCI Slot 4 U0.1-P2/I4 32 Service Guide Component Name Physical Location Code AIX Location Code PCI Slot 4 Card U0.1-P2-I4 1Z-08 PCI Slot 5 (six-slotted PCI riser card only) U0.1-P2/I5 PCI Slot 5 Card (six-slotted PCI riser card only) U0.1-P2-I5 1D-08 PCI Slot 6 (six-slotted PCI riser card only) U0.1-P2/I6 PCI Slot 6 Card (six-slotted PCI riser card only) U0.1-P2-I6 11-08 SCSI Devices Internal SCSI Port (for internal disk drive) U0.1-P2/Z1 1S-08 External SCSI Port U0.1-P2/Z2 1S-09 Single-Ended SCSI (six-slotted PCI riser card only) U0.1-P2/Z3 1c-08 Media Device #1 (for example, a tape drive) U0.1-P2-Ix/Zn-A0 Dependent on Ix Media Device #2 (for example, a DVD drive) U0.1-P2-Ix/Zn-A1 Dependent on Ix Hot-swap Disk Drive bay 1 U0.1-P2/Z1-A8 1S-08-00-8,0 Hot-swap Disk Drive bay 2 U0.1-P2/Z1-A9 1S-08-00-9,0 Hot-swap Disk Drive bay 3 U0.1-P2/Z1-AA 1S-08-00-10,0 Hot-swap Disk Drive bay 4 U0.1-P2/Z1-AB 1S-08-00-11,0 Fans Fan 1 (Primary Processor Fan) U0.1-F1 Fan 2 (Redundant Processor Fan) U0.1-F2 Fan 3 (PCI Fan) U0.1-F3 Fan 4 (PCI Fan) U0.1-F4 Operator Panel Operator panel U0.1-L1 Power Supply Power supply 1 (Primary) U0.1-V1 Power supply 2 (Redundant) U0.1-V2 Battery Battery U0.1-P1-V3 Platform Firmware Platform Firmware U0.1-P1/Y1 System VPD module System VPD module U0.1-L1-N1 Chapter 1. Reference Information 33 Component Name Physical Location Code AIX Location Code Notes: 1. The physical location code for the PCI slots, when empty, uses the P1/Ix notation, where the ’/’ identifies an integrated device (in this case the empty slot). A PCI device plugged into the slot uses the P1-Ix notation, where the ’-’ identifies a plugged device. 2. SCSI bus IDs are the recommended values. SCSI IDs shown for media devices indicate how the devices are set when they are shipped from the factory. Field installations may not comply with these recommendations. I/O Subsystem Locations This section covers the AIX and physical location code tables for the 11D/20 I/O drawer. The tables in this section contain the location codes for I/O subsystems when they are attached to the processor-subsystem drawer. A separate table is shown for each I/O subsystem. In the tables, the drawer number for the I/O subsystem is represented by dd. The first time that a system is powered on, the I/O subsystems are numbered according to the order in which they are configured. For example, if a system is first installed with eight I/O subsystems connected, the dd values for the subsystems should number from 2 through 9. If at a later time, an I/O subsystem is removed from the system and another I/O subsystem is substituted, the substitute is assigned the next higher number available for its value of dd. In the earlier example, this is 10. Also, if an I/O backplane is replaced inside an I/O subsystem and power is applied, the system will configure the I/O subsystem with the new backplane in the order following the other I/O subsystems. Its value for dd will not be the same as the one it replaced. In the case of the earlier example, the next higher value for dd is 10, so a drawer number of 10 is assigned to the new I/O backplane. Note: If the I/O backplane VPD module is moved to the new I/O backplane, the I/O subsystem retains its original dd value. If the original I/O subsystem or I/O backplane is reinstalled in the system, the system uses the original dd value for the I/O subsystem. The following diagram defines each part of a location code: Ux.dd-xx-yy | | | | | | | | | | | yy This code is used for the next component (yy, yy, yy, ...) | | xx This code is used for the next component (Px, Fx, Vx, ...) | dd This code is the drawer number for the I/O drawer (2 to 63) Ux This code normally identifies the rack in which a drawer is installed. The x will always be zero (0) for this system. Determining the AIX Location Code of a Physical Slot: Every slot of every I/O subsystem has a unique AIX location code assigned to it. For instance, the PCI adapter in slot 1 of I/O subsystem 1 is 29-08. There is no obvious way to determine the value of the dd number for the I/O subsystem that is interpreted by the system as the first I/O subsystem. If you have only the AIX location code, you might determine the associated I/O subsystem dd number by performing the following steps: 1. On the AIX command line type lsdev -Ccadapter |pg The resulting list gives the logical AIX name of all resources and the corresponding AIX location code. Look for the appropriate AIX location code, and record the corresponding AIX logical name of the resource. 34 Service Guide 2. On the AIX command line type lsslot -c pci |pg This list relates the AIX logical name of all resources to location code. Look for the AIX logical name that you recorded in the first step and read the location code of the resource. You can then determine the physical location of the resource by flashing the FRU identify LED (see “FRU Identify LEDs” on page 27) that corresponds to the location code. FRU Name Physical Location Code D20 I/O Drawer U0.dd (dd = any number from 2 through 63) Rack frame U0 I/O drawer backplane U0.dd-P1 I/O subsystem drawer U0.dd RIO Bus Adapter RIO bus adapter U0.dd-P1.1 RIO port 1 upper connector U0.dd-P1.1/Q1 RIO port 1 cable U0.dd-P1.1/Q1# RIO port 0 lower connector U0.dd-P1.1/Q2 RIO port 0 cable U0.dd-P1.1/Q2# PCI Bus Controller PCI bus controller PHB0 U0.dd-P1 PCI to PCI bridge EADS-X U0.dd-P1 PCI Bus Controller PHB2 PCI bus controller PHB2 U0.dd-P1 PCI to PCI bridge EADS-X U0.dd-P1 Pluggable Adapter Slots PCI slot 1 U0.dd-P1/I1 PCI card in slot 1 U0.dd-P1-I1 PCI slot 2 U0.dd-P1/I2 PCI card in slot 2 U0.dd-P1-I2 PCI slot 3 U0.dd-P1/I3 PCI card in slot 3 U0.dd-P1-I3 PCI slot 4 U0.dd-P1/I4 PCI card in slot 4 U0.dd-P1-I4 PCI slot 5 U0.dd-P1/I5 PCI card in slot 5 U0.dd-P1/I5 PCI slot 6 U0.dd-P1/I6 PCI card in slot 6 U0.dd-P1/I6 PCI slot 7 U0.dd-P1/I7 PCI card in slot 7 U0.dd-P1/I7 Power Supply Power supply 1 (no internal blower) U0.dd-V1 Power supply 2 (no internal blower) U0.dd-V2 Cooling Blowers Cooling blower 1 U0.dd-F1 Chapter 1. Reference Information 35 FRU Name Physical Location Code Cooling blower 2 U0.dd-F2 Cooling blower 3 U0.dd-F3 Cooling blower 4 U0.dd-F4 SPCN Connectors SPCN connector 1 U0.dd-P1/Q3 SPCN connector 2 U0.dd-P1/Q4 SPCN connector 3 U0.dd-P1/Q5 SPCN connector 4 U0.dd-P1/Q6 Rack Indicator Rack beacon (4-pin) U0.dd-P1/Q7 Operator Panel Operator panel U0.dd-L1 Temperature Sensor Temperature sensor (on operator panel) U0.dd-L1 Disk Drive Backplanes Disk drive backplane 1 U0.dd-P3 Disk drive backplane 2 U0.dd-P4 Disk Drive Locations Hot-swap disk drive bay 1 U0.dd-P1-Ix/Zn-A8 Hot-swap disk drive bay 2 U0.dd-P1-Ix/Zn-A9 Hot-swap disk drive bay 3 U0.dd-P1-Ix/Zn-AA Hot-swap disk drive bay 4 U0.dd-P1-Ix/Zn-AB Hot-swap disk drive bay 5 U0.dd-P1-Ix/Zn-AC Hot-swap disk drive bay 6 U0.dd-P1-Ix/Zn-AD Hot-swap disk drive bay 7 U0.dd-P1-Iy/Zn-A8 Hot-swap disk drive bay 8 U0.dd-P1-Iy/Zn-A9 Hot-swap disk drive bay 9 U0.dd-P1-Iy/Zn-AA Hot-swap disk drive bay 10 U0.dd-P1-Iy/Zn-AB Hot-swap disk drive bay 11 U0.dd-P1-Iy/Zn-AC Hot-swap disk drive bay 12 U0.dd-P1-Iy/Zn-AD SCSI SES0 U0.dd-P1-Ix/Zn-AF SES1 U0.dd-P1-Iy/Zn-AF 36 Service Guide System Cables The following diagrams show the system cable connections. The following diagram illustrates the routing of the internal power cables. Internal Power Cable Routing Diagram 1 Operator Panel 2 IDE CD-ROM 3 SCSI Media Device 4 CEC Backplane to Operator Panel Signal/Power Cable 5 2-drop Power Cable from CEC Backplane to IDE CD-ROM and other SCSI Media Devices 6 CEC Backplane to Disk Drive Backplane Power Cable 7 Disk Drive Backplane 8 Diskette Drive to CEC Backplane Power Cable 9 PCI Cooling Fans #3 and #4 Power Cable 10 Diskette Drive 11 CEC Backplane Chapter 1. Reference Information 37 Internal Signal Cable Routing Diagram The following diagram illustrates the routing of the signal cables. 38 Service Guide 1 Operator Panel 2 SCSI Media Device (50-pin) 3 SCSI Media Device (68-pin) 4 SCSI Media Device (68-pin) 5 CEC Backplane to Operator Panel Signal Cable 6 SCSI Media (68-pin) to 50-pin Converter Cable 7 SCSI Media Device Signal Cable to Integrated Single-Ended SCSI Connector on the PCI Riser Card 8 CEC Backplane Connector to Diskette Drive Signal Cable 9 CEC Backplane to IDE CD-ROM Signal Cable 10 Diskette Drive 11 IDE CD-ROM 12 Disk Drive Backplane 13 PCI Riser Card to Disk Drive Backplane Signal Cable 14 PCI Riser Card to Disk Drive Backplane Signal Cable 15 6-Slotted PCI Riser Card 15a 4-Slotted PCI Riser Card (Used on 4-slotted models only 16 CEC Backplane 17 PCI Riser Card to External SCSI Signal Cable 18 External SCSI Port 19 CEC Backplane to Rack Indicator Signal Cable 20 Rack Indicator Port 21 CEC Backplane to Debug Port Signal Cable 22 Debug Parallel Port Subsystem Positioning and Cabling The I/O subsystem drawer can be installed in a standard 19-inch EIA rack in any location near to the system. The cables that connect the drawers allow some flexibility in drawer placement, but the I/O drawers should be located as close to the system unit as possible. The 286/C4 can have a total of two I/O drawers connected. Each drawer is connected to the system unit using a system power control network (SPCN) cable loop and a remote I/O (RIO) cable loop. Although the SPCN cables are connected in a loop, only one loop is needed for both of the I/O drawers. SPCN Cabling I/O drawer SPCN cables are connected to the system using a loop configuration. The SPCN cables are connected in a loop so that the system has two paths to each I/O drawer. The 286/C4 can have a total of two I/O drawers connected in one SPCN loop. The location codes for the I/O drawers are assigned during the system power-on sequence. Each drawer’s location is determined by its position in the SPCN loop. The first drawer in the SPCN loop (the drawer connected to SPCN 0) is given the location U0.2. If there is a second I/O drawer in the SPCN loop, it is assigned the location U0.3. The I/O drawers are numbered sequentially until the end of the loop is reached at SPCN 1. For cabling examples, see the following illustrations. Chapter 1. Reference Information 39 Cabling SPCN Cables to One or Two I/O Subsystems The following illustration shows the SPCN cabling for either one or two I/O drawer configurations. One I/O Drawer Attached to 286/C4: 1 286/C4 3 SPCN 2 to SPCN 2 Cable 2 11D/20 4 SPCN 1 to SPCN 1 Cable Two I/O Drawers Attached to 286/C4: 1 286/C4 4 SPCN 1 to SPCN 1 Cable 2 11D/20 I/O Drawer 5 SPCN 2 to SPCN 2 Cable 3 11D/20 I/O Drawer 6 SPCN 2 to SPCN 1 Cable 40 Service Guide RIO Cabling I/O drawers are connected to the system through remote I/O (RIO-G) cable loops. The cable loops are connected to ports located on the rear of the system. The RIO-G cables are connected in loops so that the system has two paths to each I/O drawer. Up to two I/O drawers can be connected to a system using one RIO loop. For cabling examples, see the following illustrations. Cabling RIO-G Cables to One or Two I/O Subsystems The following illustrations show the RIO-G cabling for either one or two I/O drawer configurations. One I/O Drawer Attached to 286/C4: 1 286/C4 3 RIO-G 0 to RIO-G 0 Cable 2 11D/20 4 RIO-G 1 to RIO-G 1 Cable Two I/O Drawers Attached to 286/C4: 1 286/C4 4 RIO-G 0 to RIO-G 0 Cable 2 11D/20 I/O Drawer 5 RIO-G 1 to RIO-G 1 Cable 3 11D/20 I/O Drawer 6 RIO-G 1 to RIO-G 0 Cable Chapter 1. Reference Information 41 Specifications This section contains system specifications for both 286/C4 and 286/E4. Note: Refer to the 14T/0 and 14T/4 Installation and Service Guide, order number SA38-0577, for information on cabling power from the rack to the I/O subsystem. Dimensions Rack (286/C4) Tower (286/E4) Height 176 mm 6.9 in. 544.0 mm (21.42 in.) 4 EIA Units Width 448 mm 17.5 in. 308.0 mm (12.13 in.) 191 mm without legs Depth 816 mm 32.13 in. - Includes 145 mm for cable management arm 789.0 mm (31 in.) - Includes 70 mm for rear acoustic cover Weight Minimum configuration 32.0 kg 70.4 lbs. 36.0 kg 79.2 lbs. Maximum configuration 47.3 kg 104.0 lbs. 51.0 kg 112.2 lbs. Electrical Power source loading (maximum in kVA) 1.20 Power source loading (typical in kVA) 0.75 Voltage range (V ac) 100-127 Vac or 200-240 Vac (single phase) Frequency (hertz) 50 / 60 Thermal output (maximum) 2540 Btu/hr Thermal output (typical) Btu/hr Power Requirements (minimum load) 350 watts Power Requirements (maximum load) 670 watts Power factor - U.S., World Trade, Japan 0.95 Inrush currentå 75/amps (maximum at <10ms) 25/amps (maximum at 10ms - 150ms) Note: The above amps are held for the full input range of 180 V/ac to 259 V/ac and 47 to 63 Hz. Maximum Altitudeæ, ç 2135 m (7000 ft.) Temperature Requirements Operating 5 to 35°C (41 to 95°F) Storage 1 to 60°C (34 to 140°F) Humidity Requirements Operating Storage (Noncondensing) 8 to 80% 5 to 80% Wet Bulb 27°C (80°F) 27°C (80°F) 286/E4 Noise Emissions4 Operating Idle LWAd 6.0 bels 5.9 bels m 42 dBA 41 dBA 286/C4 Noise Emissions4 Operating Idle LWAd 6.1 bels 6.0 bels m 44 dBA 43 dBA 42 Service Guide Dimensions Rack (286/C4) Tower (286/E4) Install/Air Flow Maintenance of proper service clearance should allow proper airflow. Service When installing a rack unit, refer to “System Service Clearances” on page 481 for the rack correct clearances. 1. Inrush currents occur only at initial application of power, no inrush occurs during normal power off-on cycle. 2. The upper limit of the dry bulb temperature must be derated 1 degree C per 137 m (450 ft.) above 915 m (3000 ft.). 3. The upper limit of the wet bulb temperature must be derated 1 degree C per 274 m (900 ft. ) above 305 m (1000 ft.). 4. Levels are for a single system installed in a T00 32-EIA rack with the center of the unit approximately 1500 mm (59 in.) off the floor. External AC Power Cables To avoid electrical shock, a power cable with a grounded attachment plug is provided. Use only properly grounded outlets. Power cables used in the United States and Canada are listed by Underwriter’s Laboratories (UL) and certified by the Canadian Standards Association (CSA). These power cords consist of the following: v Electrical cables, Type SVT or SJT. v Attachment plugs complying with National Electrical Manufacturers Association (NEMA) 5-15P, that is: ²For 115 V operation, use a UL listed cable set consisting of a minimum 14 AWG, Type SVT or SJT three-conductor cord a maximum of 15 feet in length and a parallel blade, grounding type attachment plug rated at 15 A, 125 V.² ²For 230 V operation in the United States use a UL listed cable set consisting of a minimum 18 AWG, Type SVT or SJT three-conductor cable a maximum of 15 feet in length, and a tandem blade, grounding type attachment plug rated at 15 A, 250 V.² v Appliance couplers complying with International Electrotechnical Commission (IEC) Standard 320, Sheet C13. Power cables used in other countries consist of the following: v Electrical cables, Type HD21. v Attachment plugs approved by the appropriate testing organization for the specific countries where they are used. ²For units set at 230 V (outside of U.S.): use a cable set consisting of a minimum 18 AWG (1.0 mm2 nominal cross sectional area). Outside of North America , a more common cordage reference is in mm2 and for a HAR (harmonized) cord, you may not find a AWG equivalent. To find the power cables that are available, refer to Chapter 10, “Parts Information”, on page 451. Chapter 1. Reference Information 43 Service Inspection Guide Perform a service inspection on the system when: v The system is inspected for a maintenance agreement. v Service is requested and service has not recently been performed. v An alterations and attachments review is performed. v Changes have been made to the equipment that may affect the safe operation of the equipment. v External devices with their own power cables have those cables attached. If the inspection indicates an unacceptable safety condition, the condition must be corrected before anyone can service the machine. Note: The owner of the system is responsible to correct any unsafe conditions. Perform the following checks: 1. Check the covers for sharp edges and for damage or alterations that expose the internal parts of the system. 2. Check the covers for proper fit to the system. They should be in place and secure. 3. Gently rock the system from side to side to determine if it is steady. 4. Set the power button of the system to Off. 5. Remove the covers. 6. Check for alterations or attachments. If there are any, check for obvious safety hazards, such as broken wires, sharp edges, or broken insulation. 7. Check the internal cables for damage. 8. Check for dirt, water, and any other contamination within the system. 9. Check the voltage label on the back of the system to ensure that it matches the voltage at the outlet. 10. Check the external power cable for damage. 11. With the external power cable connected to the system, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame. 12. Perform the following checks on each device that has its own power cables: a. Check for damage to the power cable. b. Check for the correctly grounded power cable. c. With the external power cable connected to the device, check for 0.1 ohm or less resistance between the ground lug on the external power cable plug and the metal frame of the device. 13. Install the covers. 44 Service Guide Chapter 2. Diagnostic Overview The system uses an integrated set of software diagnostic procedures to help isolate failing components and system maintenance. This book, along with the Diagnostic Information for Multiple Bus Systems, is the basis of the diagnostic procedures for the system. In particular, Chapter 4, “Checkpoints”, on page 141, Chapter 5, “Error Code to FRU Index”, on page 179, Chapter 6, “Using the Online and Standalone Diagnostics”, on page 297, and Chapter 10, “Parts Information”, on page 451, in this book are important for the trained service representative to understand and use when isolating a failure on the system. The manufacturer recommends that systems configured with 4 GB of memory or greater have access to a 4-mm or 8-mm tape drive for submission of system dump information if required. This function can be accomplished through locally attached or network-attached devices, as appropriate. Maintenance Analysis Procedures (MAPs) Maintenance Analysis Procedures (MAPs) guide the trained service representative through the system. These MAPs are the entry point for all isolation and error recovery procedures. The MAPs are consistent with existing procedures and methods. The system uses a set of integrated procedures, mentioned earlier, to which the MAPs are the primary entry point. The MAPS are as follows: v Entry MAP v Quick Entry MAP v MAP 1020: Problem Determination MAP v MAP 1240: Memory Problem Resolution v MAP 1321: Quick Entry MAP for Systems with Service Focal Point. This MAP contains information for systems not running under the AIX operating system. v MAP 1322: End of Call MAP for Systems with Service Focal Point v MAP 1520: Power MAP v MAP 1521: The Base System Drawer will not Power On and No Error Codes are Available v MAP 1522: Cannot Power On the I/O Drawer, Power Present LED Does Not Come On v MAP 1523: The SPCN Error Codes Table Directed You Here and the Error Code is 1011 1B0x v MAP 1540: Problem Isolation Procedures v MAP 1541: JTAG Problem Isolation v MAP 1542: I/O Problem Isolation v MAP 1548: Processor Subsystem Problem Isolation The Entry Map is the starting point for problem determination. The purpose of this MAP is to quickly point to the appropriate MAP or service reference information either in this book, or in the common book set, which includes the Diagnostic Information for Multiple Bus Systems and PCI Adapter Placement Reference. The Quick Entry MAP is a subset of the Entry MAP and helps to save time for some types of problems. The Problem Determination MAP provides a structured analysis method to get an error code if one is not provided by the customer, or if diagnostics cannot be loaded. The Power MAP deals with isolation of components to diagnose a power problem. Power problems can be related to powering on and powering off the system, or power failures that occur after power is turned on. 45 The Minimum Configuration MAP is used to locate defective components not found by normal diagnostics or error-isolation methods. This MAP provides a systematic method of isolation to the failing item. Power and Attention LEDs The Power and Attention LEDs provide a means to identify failing components in your system. When a failing component is detected in your system, the system’s Attention LED is turned on. Component LEDs To further help you identify the failing component, many system components have individual LEDs that indicate a failure when lit. The LEDs are either on the component itself or on the carrier of the component (memory card, fan, memory module, CPU). The LEDs are either green or amber in color. A lit green LED indicates that the system or component is receiving power. A lit amber LED indicates the system or component has a failure. Resetting the LEDs To reset the LEDs, do the following: 1. Replace the failing component with the new component. 2. Log in as root user. 3. At the command line, type diag. 4. Select Task Selection. 5. Select Log Repair Action. 6. Select the device that was repaired. (If the device is not listed, select sysplanar0.) 7. Press Enter, then select commit. System Attention LED The system attention LED on the operator panel turns on when an entry is made in the service processor error log, indicating that intervention or service is needed. The error entry is transmitted to the following: v The system-level error logs v The operating system error log v The service action event log (such as loss of surveillance from the HMC to a logical partition) in the Service Focal Point application Depending on your system configuration, refer to one of the following topics for more information: v “System Error Interrogation” on page 54 v “The System is not Attached to an HMC and AIX is Installed” on page 55 v “The System is Attached to an HMC and Linux is Installed” on page 55 Checkpoints The system uses various types of checkpoints, error codes, and SRNs, which are referred to throughout this book (primarily in Chapter 4, “Checkpoints”, on page 141, Chapter 5, “Error Code to FRU Index”, on page 179, Chapter 6, “Using the Online and Standalone Diagnostics”, on page 297, and Chapter 10, “Parts Information”, on page 451). These codes can appear in the service processor boot progress log, the operating system error log, and the operator panel display. Understanding the definition and relationships of these codes is important to the service personnel who are installing or maintaining the system. Codes that can appear in the service processor boot progress log, the operating system error log, the virtual terminal, and the operator panel display are as follows: Checkpoints Checkpoints display in the operator panel from the time ac power is connected to the 46 Service Guide system until the operating system login prompt is displayed after a successful operating system boot. These checkpoints have the following forms: 8xxx 8xxx checkpoints are displayed from the time ac power is connected to the system until the OK prompt is displayed by the service processor on the operator panel display. (Several 9xxx checkpoints are displayed just before the OK prompt displays.) 9xxx 9xxx checkpoints are displayed by the service processor after the power-on sequence is initiated. A system processor takes control when 91FF displays on the operator panel display. Note: Certain checkpoints may remain in the display for long periods of time. A spinning cursor is visible in the upper-right corner of the display during these periods to indicate that system activity is continuing. Exxx Exxx checkpoints indicate that a system processor is in control and is initializing the system resources. Control is being passed to the operating system when E105 displays on the operator panel display. Location code information may also display on the operator panel during this time. 0xxx and 2xxx 0xxx and 2xxx codes are AIX progress codes and configuration codes. AIX progress codes will not appear when Linux is the operating system. Location codes may also be shown on the operator panel display during this time. Error Codes If a fault is detected, an 8-digit error code is displayed in the operator panel display. A location code may be displayed at the same time on the second line. SRNs Service request numbers, in the form xxx-xxx, xxx-xxxx, xxxx-xxx, or xxxx-xxxx may also be displayed on the operator panel display and be noted in the AIX error log. SRNs will not appear in the Linux system error log when Linux is the operating system. Checkpoints can become error codes if the system fails to advance past the point at which the code is presented. For a list of the checkpoints, see Chapter 4, “Checkpoints”, on page 141. Each entry provides a description of the event and the recommended action if the system fails to advance. SRNs are listed in the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. FRU Isolation For a list of error codes and recommended actions for each code, see Chapter 5, “Error Code to FRU Index”, on page 179. These actions can refer to Chapter 10, “Parts Information”, on page 451, Chapter 3, “Maintenance Analysis Procedures (MAPs)”, on page 57, or provide informational message and directions. If a replacement part is indicated, direct reference is made to the part name. The respective AIX and physical location codes are listed for each occurrence as required. For a list of locations codes, see “Location Codes” on page 23. Note: When the system is running under the Linux operating system, AIX location codes are not available. To look up part numbers and view component diagrams, see Chapter 10, “Parts Information”, on page 451. The beginning of that chapter provides a parts index with the predominant field replaceable units (FRUs) listed by name. The remainder of the chapter provides illustrations of the various assemblies and components that make up the system. Chapter 2. Diagnostic Overview 47 FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to the following: v Rack identify LED v CEC drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – CEC subsystem FRUs – I/O subsystem FRUs v I/O adapter identify LED v DASD identify LED The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to locate the failing FRU more easily. Any identify LED in the system can be flashed when the system is in the failed state with power on by using the service processor LED Control Menu contained in the “System Information Menu” on page 317. Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is outlined in the Diagnostic Information for Multiple Bus Systems. If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of following: v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED, type FRU-LED-MENU on the command line. v If the system is logically partitioned, the HMC must be attached. For instructions on activating and deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management Console Installation and Operations Guide, order number SA38-0590. Service Focal Point The Service Focal Point application is used to help the service representative diagnose and repair problems on partitioned systems. Service representatives use the HMC as the starting point for all service issues. The HMC groups various system management issues at one control point, allowing service representatives to use the Service Focal Point application to determine an appropriate service strategy. Traditional service strategies become more complicated in a partitioned environment. Each partition runs on its own, unaware that other partitions exist on the same system. If one partition reports an error for a shared resource, such as a managed system power supply, other active partitions report the same error. The Service Focal Point application enables service representatives to avoid long lists of repetitive call-home information by recognizing that these errors repeat, and by filtering them into one error code. Errors that require service are reported to the HMC as serviceable events. Because the HMC stores these serviceable events for 90 days and then discards them, it is important to have the partition and HMC date and time set correctly. For instance, if the date on a partition’s software is set 90 days behind the HMC’s set time, the serviceable events reported from this partition are immediately discarded. For more 48 Service Guide information about setting the HMC’s date and time, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251. To set the partition’s date and time, see the documentation provided with the operating system that is running on that partition. Getting Started When you are setting up Service Focal Point, keep the following in mind: v If the time configured on a partition is 90 days older than time configured on the HMC, serviceable events cannot be reported. v Verify that the HMC host names are defined. For more information about using fully qualified and short host names, see the Hardware Management Console Installation and Operations Guide, order number SA23-1251. v If you need to add or change a partition name, see the Hardware Management Console Installation and Operations Guide. Testing Error Reporting To ensure that Service Focal Point is configured properly, generate a test error by doing the following: 1. In the partition, run diagnostics to test the managed system’s operator panel. 2. When the diagnostics window asks you if you see 0000 on the managed system’s operator panel, select NO. This action generates an error. 3. In the SRN window, press Enter to proceed. 4. When the system asks you if you want the error sent to Service Focal Point, select YES. 5. Type F3 to exit diagnostics. 6. Wait for one minute while the managed system sends the error to Service Focal Point. 7. Check the Serviceable Event window to ensure that the error was sent to Service Focal Point and that Service Focal Point reported the error. For more information about working with serviceable events, see “Working With Serviceable Events” on page 51. Service Focal Point Settings The Service Focal Point Settings task in the HMC Contents area allows you to configure your Service Focal Point application. Note: The Advanced Operator, Operator, and Viewer roles have read-only access to the following tasks. Automatic Call-Home Feature You can configure the HMC to automatically call an appropriate service center when it identifies a serviceable event. To enable or disable the call-home feature, you must be a member of one of the following roles: v System Administrator v Service Representative To enable or disable the call-home feature for one or more systems, do the following: Note: It is strongly recommended that you not disable the call-home feature. When you disable the call-home feature, serviceable events are not automatically reported to your service representative. 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Service Focal Point Settings. 4. The Service Focal Point Settings window opens. Select the CEC Call Home tab on the top of the window. Chapter 2. Diagnostic Overview 49 5. Click on the managed system you want to enable or disable. 6. Click Enable to enable call-home for the selected system, or click Disable to disable call-home for the selected system. 7. Click OK. Setting Up Surveillance Service Focal Point surveillance generates serviceable events when it detects communication problems between the HMC and its managed systems. You can configure how you want the HMC to survey the following: v The number of disconnected minutes considered an outage v The number of connected minutes you want the HMC to consider a recovery v The number of minutes between outages considered a new incident To set up surveillance, you must be a member of one of the following roles: v System Administrator v Service Representative To set up surveillance, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Service Focal Point Settings. 4. The Service Focal Point Settings window opens. Select the Surveillance Setup tab on the top of the window. 5. In the first field, select the number of minutes you want the HMC to wait before sending a disconnection error message. 6. In the second field, select the amount of connection time that the HMC is considered to be recovered. This amount is expressed in minutes. 7. In the third field, select the number of minutes between outages that you want the HMC to wait before sending a new incident report. 8. Select one or more managed systems from the table in the lower part of the window and then click Enable or Disable. Surveillance is then either enabled or disabled for the selected managed systems. Enabling Surveillance Notifications You can enable or disable surveillance-error notification from this HMC to connected managed systems. Enabling this notification causes errors to be passed to the Service Agent application for notification. Note: You must further configure Service Agent to handle notifications sent by Service Focal Point. For more information about Service Agent, refer to the Hardware Management Console Installation and Operations Guide, order number SA38-0590. To set up surveillance, you must be a member of one of the following roles: v System Administrator v Service Representative To set up surveillance-error notification, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Service Focal Point Settings. 4. The Service Focal Point Settings window opens. Select the Surveillance Notification tab on the top of the window. 50 Service Guide 5. Select one or more managed systems from the list and then click Enable or Disable. Surveillance notification is then either enabled or disabled for the selected managed systems. Working With Serviceable Events You can view, add, or update serviceable event information, including error details. Viewing Serviceable Events To view serviceable events, you must be a member of one of the following roles: v System Administrator v Service Representative v Advanced Operator v Operator v Viewer To view a serviceable event, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Select Serviceable Event. 4. Designate the set of serviceable events you want to view. When you are finished, click OK. 5. The Serviceable Event Overview window opens, and the entries displayed are ordered by time stamp. Each line in the Serviceable Event Overview window corresponds to one error within a serviceable event. On this window, designate the set of serviceable events you want to view by specifying your search criteria (such as event status or error class). Note: Only events that match all of the criteria that you specify are shown. 6. When you are finished, click OK. When you select a line in the Serviceable Event Overview window, all lines in the same serviceable event are selected. To open the Serviceable Event Details window for the selected event, select the event and click Event Details. Viewing Serviceable Event Details To view serviceable event details, do the following: 1. Perform the steps in “Viewing Serviceable Events”. 2. The Serviceable Event Details window opens, showing extended serviceable event information, including the following: v Status v Earliest original time stamp of any managed object v AIX error log. (The Linux system error log does not place entries into Service Focal Point.) v Should this error ever get called home? v Error was called home v Pointer to extended error-data collection on the HMC The window’s lower table displays all of the errors associated with the selected serviceable event. The information is shown in the following sequence: v Failing device system name v Failing device machine type/model/serial v Error class v Descriptive error text Viewing Serviceable Event Error Details: To view serviceable event error details, do the following: Chapter 2. Diagnostic Overview 51 1. Perform the steps in “Viewing Serviceable Event Details” on page 51. 2. Select an error in the lower table, and click Error Details. Viewing Service Processor Error Details: To view service processor error details, do the following: 1. Perform the steps in “Viewing Serviceable Event Error Details” on page 51. 2. If the serviceable event error details you are viewing are for a service processor-class error, the lower table on the resulting window contains service processor errors. Select a service processor error from the lower table, and click Service Processor Error Details to see further details. Saving and Managing Extended Error Data To save extended error (EE) data, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 51. 2. Click Save EE Data. To save extended error data for only one error associated with the serviceable event (rather than for the entire serviceable event), select the error from the lower table and then click Error Details. In the next menu, click Manage EE Data. Viewing and Adding Serviceable Event Comments To add comments to a serviceable event, you must be a member of the Service Representative role. To add comments to a serviceable event, do the following: Note: You cannot edit or delete previous comments. 1. Perform the steps in “Viewing Serviceable Event Details” on page 51. 2. Select the error to which you want to add comments to and click Comments.... If you want to close the event and add comments, click Close Event from this window. The Serviceable Event Comments window opens. 3. Type your name and add comments as appropriate. You can also review previous comments, but you cannot edit this information. 4. If you clicked Comments on the Serviceable Event Details window, clicking OK commits your entry and returns you to the Serviceable Event Details window. If you clicked Close Event on the Serviceable Event Details window, clicking OK commits all changes and opens the Update FRU Information window. For more information about updating field replaceable unit information, see “Updating Field Replaceable Unit (FRU) Information”. Closing a Serviceable Event To close a serviceable event, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 51. 2. Select the error to which you want to add comments and click Comments.... 3. Click Close Event from this window. The Serviceable Event Comments window opens. 4. Click OK to commit your comments. The Update FRU Information window displays. See “Updating Field Replaceable Unit (FRU) Information” for information on completing this window. Click OK on the Update FRU Information window to close the serviceable event. Note: You must close a serviceable event after it has been serviced to ensure that if a similar error is reported later, it is called home. If an old problem remains open, the new similar problem is reported as a duplicate. Duplicate errors are neither reported nor called home to a service center. Close a serviceable event when the partition that reports the error is active. Closing the event causes the new status of the serviceable event to be properly sent to the partition. Updating Field Replaceable Unit (FRU) Information This task allows you to update the FRU information you changed or modified as a result of this serviceable event. From this panel, you can also activate and deactivate LEDs and search for other serviceable events that contain the same FRU entries. 52 Service Guide To update FRU information, do the following: 1. Perform the steps in “Viewing Serviceable Event Details” on page 51. 2. Click FRU Information... . The Update FRU Information window opens. The lower table shows any parts that you have replaced or added during your current update session but that have not been committed to the serviceable event. The changes from the lower table are committed by clicking OK or Apply. From this window, you can also activate and deactivate LEDs and search for other serviceable events that contain the same FRU entries. Replacing an Existing FRU To replace a part already listed for this serviceable event, do the following: 1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 52. 2. In the upper table, double-click the part you want to replace. 3. If the FRU has a new part number, type it in the New FRU Part Number field. 4. Click Replace FRU. The Update FRU Information window displays the FRU replacement information in the lower table. Click OK or Apply to commit the changes to the serviceable event. Adding a New FRU You can add a part to the serviceable event that was not listed in the upper table of the Update FRU Information window. To add a new FRU for this serviceable event, do the following: 1. Perform the steps in “Updating Field Replaceable Unit (FRU) Information” on page 52. 2. Click Add New FRU to List. 3. Type the FRU’s location code and its part number in the appropriate fields. 4. Click Add to List. The Update FRU Information window opens and displays the newly added FRU in the lower table. 5. Click OK or Apply to commit these changes to the serviceable event. Note: After you click OK or Apply, you cannot change this information. If you clicked the Close Event button in the Serviceable Event Details window, then clicking OK also completes the close dialog and changes the status of the serviceable event to Closed. Viewing Serviceable Event Partition Information You can view partition information associated with this serviceable event. This information includes each affected partition’s state and resource use. 1. Perform the steps in “Viewing Serviceable Event Details” on page 51. 2. Click Partition Info.... Hardware Service Functions Activating and Deactivating FRU LEDs This task allows you to activate or deactivate a managed system’s system attention LED or any FRU LED. FRU LEDs are helpful in determining which FRUs need servicing. To activate or deactivate a managed system’s System Attention LED, do the following: 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, select Hardware Service Functions. The LED Management window opens. 4. In the LED Management window, select one or more managed systems from the table. 5. Select either Activate LED or Deactivate LED. The associated System Attention LED is then either turned on or off. To activate or deactivate a FRU associated with a particular managed system, do the following: Chapter 2. Diagnostic Overview 53 1. In the Navigation area, click the Service Applications icon. 2. In the Navigation area, double-click the Service Focal Point icon. 3. In the Contents area, click Hardware Service Functions. The LED Management window opens. 4. In the LED Management window, select one managed system from the table. 5. Click the List FRUs... button. The list of FRU slot indexes and their respective current LED states display. 6. Select one or more FRU slot indexes. 7. Click either the Activate LED or the Deactivate LED button. The associated FRU LEDs are now either enabled (blinking) or off. System Error Interrogation The following sections explain how to interrogate a system. To identify which section you should use to interrogate your system, answer the following: v Is your system attached to an HMC? v If your system is not attached to an HMC, what operating system is installed ? Depending on your response, select one of the following sections and follow the instructions given in that section to interrogate your system. 54 Service Guide The System is Attached to an HMC On systems attached to an HMC, when the system attention LED comes on, open the Service Action Event Log in the Service Focal Point application on the HMC. Check the open service events using the procedure described in ²Working with Serviceable Events² in the Hardware Management Console Installation and Operations Guide. Note: For information on these Service Focal Point settings, see ²Setting Up Surveillance and Enabling Surveillance Notifications² in the Hardware Management Console Installation and Operations Guide. Events requiring customer intervention are marked Call Home Candidate? NO. For all of these events, examine the description in the serviceable event error details. If actions are listed in the description, perform those actions. If the error indicates a loss of surveillance between the HMC and a partition, check the status of the partition, the network, and the cabling between the HMC and the partition. If a problem is still indicated, call service support. If the service action event is labeled Call Home Candidate? YES, service support is required to correct the faulty condition. If the system is configured to automatically call home on error, the request for service will be placed and the service representative will be dispatched. If the system is not enabled for call home capability, the customer should call service support to report the error condition. Whether a customer intervention error or an service support error is detected, the system attention LED can be reset by following the procedures described in “Resetting the System Attention LED” on page 56. The preferred method to reset the attention LED is by following the HMC Attached Systems procedures. To reset the attention LED, use either the HMC Attached Systems procedure or the Non-HMC Attached Systems procedure. The System is not Attached to an HMC and AIX is Installed When the system attention LED comes on, examine the AIX error log to see if user intervention is required. If the system is powered off, examine the service processor error log. If the AIX error log indicates service support is required, or there are entries in the service processor error log, go to “Entry MAP” on page 57 with the information noted from the error log. The System is Attached to an HMC and Linux is Installed When the system attention light comes on, perform the following: v If you performed “Step 1321-34” on page 81 in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 76, return to “Step 1321-34” on page 81 to process any other error codes or symptoms. v If you have not performed any of the steps in “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 76, go to step “Step 1321-1” on page 76 to process any other error codes or symptoms. The system attention LED can be reset by following one of the Resetting the System Attention LED procedures. Chapter 2. Diagnostic Overview 55 Resetting the System Attention LED Choose one of the following procedures to reset your attention LED. Systems Attached To an HMC To reset the system attention LED on HMC-attached systems, do the following: 1. On the HMC graphical interface, click Service Applications. 2. Double-click Service Focal Point. 3. In the Contents area of the screen, select Hardware Service Functions. The LED Management window opens. 4. In the LED Management window, select one or more managed systems from the table. 5. Select Deactivate LED. The associated system attention LED is turned off. For more information about the virtual operator panel on the HMC, see the Hardware Management Console Installation and Operations Guide . Systems not Attached to an HMC with AIX Installed As a user with root authority, type diag on the AIX command line, and do the following: 1. Select Task Selection. 2. On the Task Selection Menu, select Identify and Attention Indicators. 3. When the list of LEDs displays, use the cursor to highlight Set System Attention Indicator to Normal. 4. Press Enter, and then press F7 to commit. This action turns off the LED. If the system is powered off, access the service processor menus. From the service processor main menu, do the following: 1. Select System Information Menu. 2. Select LED Control Menu. 3. Select Clear System Attention Indicator. This action turns off the LED. Systems not Attached to an HMC with Linux Installed The system must be powered off to access the service processor menus. From the service processor main menu, do the following: 1. Select System Information Menu. 2. Select LED Control Menu. 3. Select Clear System Attention Indicator. This action turns off the LED. 56 Service Guide Chapter 3. Maintenance Analysis Procedures (MAPs) This chapter contains Maintenance Analysis Procedures (MAPs) for the 286/C4 and 286/E4. Entry MAP When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. This ensures that the error state of the system that was captured in NVRAM is available for your use in fixing the problem. The AIX error log and SMIT are only available when diagnostics are run from the hard drive. Always check Service Focal Point for any open service events in Service Action Event log. Attention: AIX diagnostics from the hard drive is not available when Linux is the operating system. If this system has an Hardware Management Console Installation and Operations Guide (HMC) with Service Focal Point, go to “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 76. Otherwise, use the following table to help determine your next step. Notes: 1. Licensed programs frequently rely on system information stored on the VPD module on the operator panel assembly. If the MAPs indicate that the operator panel assembly should be replaced, refer to “Operator Panel Replacement” on page 420. If the old VPD module has to be replaced, call technical support for recovery instructions. If recovery is not possible, notify the system owner that new keys for licensed programs may be required. 2. If a network adapter is replaced, the network administrator must be notified so that the client IP addresses used by the system can be changed. In addition, the operating system configuration of the network controller may need to be changed in order to enable system startup. Also check to ensure that any client or system that addresses this system is updated. 3. If more than eight digits are displayed in the operator panel, use only the first eight digits to find the error in the tables. The digits that display beyond the first eight digits are location codes that can assist you in diagnosing the problem. See “Location Codes” on page 23. 4. If you are not able to isolate the problem, try loading standalone diagnostics from the CD-ROM or a NIM server. Note: The various codes that might display on the HMC are all listed as error codes by Service Focal Point (SFP). To assist you in identifying the types of error data in this guide, use the following table. Number of Digits in Error Code Error code Used in this Service Guide Any Contains # (pound sign) Menu Goal Any Contains - (hyphen) SRN 5 Does not contain # or - SRN 6 Does not contain # or - Error Code 8 Does not contain # or - Error Code, Service Reference Code (SRC) 57 Symptom Starting Point You have a problem that does not prevent the system from booting and the operator panel is functional and the rack indicator LED operates as expected. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. You do not have a symptom. Go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems You have an SRN. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. The system stops and a 3-digit number is displayed in the operator panel display. Record SRN 101-xxx, where xxx is the 3-digit number displayed in the operator panel display, then go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. The system stops and a 4-digit number beginning with 0 or 2 is displayed in the operator panel display. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. All other symptoms. Go to “Quick Entry MAP”. Quick Entry MAP The following is a Quick Entry MAP table of contents. Table 1. Problem Description Page No. Service actions 59 The rack indicator LED does not operate as expected. 59 The system attention LED on the operator panel is on. 59 OK does not appear in the operator panel display before pressing the power-on button. Other symptoms appear in the operator panel display or LEDs before the power-on button is pressed. 59 8-digit error codes 60 System stops with an 8-digit number displayed 60 System stops with a 4-digit number displayed that does not begin with 0 or 2. 60 System stops with a 3-digit number displayed (see below for 888 sequence). 60 Hardware Management Console (HMC) Problem 60 There appears to be a display problem (distortion, blurring, etc.) 60 Power and cooling problems 61 888 sequence in operator panel display 61 Other symptoms or problems 61 Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1540: Problem Isolation Procedures” on page 108 unless you were directed to MAP 1540 by the error code. Otherwise, call for support if the actions for an error code do not resolve the problem. If you replace FRUs or perform an action, and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Note: If the only service actions you perform are on the HMC, do not use MAP 0410 to do the repair checkout. 58 Service Guide Symptom Action Service Actions You have an open service event in the service action event log. Go to “Service Focal Point” on page 48. You have parts to exchange or a corrective action to perform. 1. Go to Chapter 9, “Removal and Replacement Procedures”, on page 357. 2. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. You need to verify that a part exchange or corrective action corrected the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. You need to verify correct system operation. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. The rack indicator LED does not operate as expected. Symptom: The rack indicator LED does not turn on, but a drawer identify LED is on. Action: 1. Make sure the rack indicator LED is properly mounted to the rack. 2. Make sure that the rack identify LED is properly cabled to the bus bar on the rack and to the drawer identify LED connector. 3. Replace the following parts one at a time: v rack LED to bus bar cable. v LED bus bar to drawer cable. v LED bus bar. v Call support. The system attention LED on the operator panel is on. The system attention LED on the operator panel is on. Go to “System Attention LED” on page 46 and perform the listed actions. OK does not appear in the operator panel display before pressing the power-on button. Other symptoms appear in the operator panel display or LEDs before the power-on button is pressed. Symptom: A bouncing or scrolling ball remains on the operator panel display, or the operator panel display is filled with dashes or blocks. Action: Verify the operator panel cable from the operator panel to the CEC backplane is connected and properly seated at both ends. If an ASCII terminal is available, connect it to the system through serial port 1. 1. If the service processor menu is displayed: a. Replace the operator panel assembly, location U0.1-L1. Refer to “Operator Panel” on page 419. b. Replace the CEC backplane, location U0.1-P1. (See notes on page 57). 2. If the service processor menu is not displayed, replace the CEC backplane. (See notes on page 57). If an ASCII terminal is not available, replace the following one at a time. 1. Operator panel assembly, location U0.1-L1. Refer to “Operator Panel” on page 419. 2. CEC backplane, location U0.1-P1. (See notes on page 57). You have a blank display on the operator panel. Other LEDs on the operator panel appear to behave normally. 1. Replace the operator panel assembly, Location: U0.1-L1. Refer to “Operator Panel” on page 419. 2. Replace the CEC backplane, Location: U0.1-P1. (See notes on page 57). You have a blank display on the operator panel. Other LEDs on the operator panel are off. Go to “MAP 1520: Power” on page 89. Chapter 3. Maintenance Analysis Procedures (MAPs) 59 Symptom Action 8-Digit Error Codes You have an 8-digit error code displayed Look up the error code in the table in “Checkpoint and Error Code Index” on page 181. System stops with an 8-digit number displayed The system stops with an 8-digit error code displayed when booting. Look up the error code in the table in “Checkpoint and Error Code Index” on page 181. System stops with a 4-digit number displayed The system stops with a 4-digit number displayed that does not begin with 0 or 2. Go to “Checkpoint and Error Code Index” on page 181. If you do not find the error code there, go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. System stops with a 3-digit number displayed (see below for 888 sequence) The system stops with a 3-digit error code. Add 101- to the left of the three digits to create an SRN. Go to the Diagnostic Information for Multiple Bus Systems. If there is a location code displayed under the 3-digit error code, then look at the location to see if it matches the failing component that the SRN pointed to. If they do not match, perform the action from Diagnostic Information for Multiple Bus Systems first. If the problem still exists, then replace the failing component from that location code. Hardware Management Console (HMC) Problem Symptom: Hardware Management Console (HMC) cannot be used to manage a managed system, or the connection to the managed system is failing. Action: If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the managed system might be damaged or incorrectly cabled. Do the following: 1. Check the connections (serial cable) between the HMC and the managed system. Correct any cabling errors if found. If another serial cable is available, connect it in place of the existing cable and refresh the HMC graphical user interface. You may have to wait up to 30 seconds for the managed system to reconnect. 2. Verify that any connected HMC is connected to the managed system by checking the Management Environment of the HMC. Note: The managed system must have power connected and the system running, or waiting for a power-on instruction (the OK prompt is in the operator panel.) If the managed system does not appear in the Navigation area of the HMC Management Environment, then the HMC or the connection to the managed system might be failing. 3. Go to the ²Entry MAP² in the Hardware Management Console Maintenance Guide, order number SA23-1273. 4. If you cannot fix the problem using the HMC tests in the Hardware Management Console Maintenance Guide, there might be a problem with the CEC backplane; replace the CEC backplane. Symptom: Hardware management console (HMC) cannot call out using the attached modem and the customer’s telephone line. Action: If the managed system is operating normally (no error codes or other symptoms), the HMC might have a problem, or the connection to the modem and telephone line may have a problem. Do the following: 1. Check the connections between the HMC and the modem and telephone line. Correct any cabling errors if found. 2. Go to the Entry MAP in the Hardware Management Console Maintenance Guide, order number SA23-1273. There is a Display Problem (Distortion, Blurring, Etc. 60 Service Guide Symptom Action Symptom: All display problems. Action: 1. If you are using the HMC, go to the Hardware Management Console Maintenance Guide, order number SA23-1273. 2. If you are using a graphics display: a. Go to the problem determination procedures for the display. b. If you do not find a problem: v replace the display adapter, location U0.1-P2-In or U0.dd-P1-In. v replace the backplane in which the card is plugged. 3. If you are using an ASCII terminal: a. Make sure that the ASCII terminal is connected to S1. b. If problems persist, go to the problem determination procedures for the terminal. c. If you do not find a problem, replace the CEC backplane, location U0.1-P1. (See notes on page 57). Power and Cooling Problems The system will not power on and no error codes are available. Go to “MAP 1520: Power” on page 89. The power LEDs on the operator panel and the power supply do not come on or stay on. 1. Check the service processor error log. 2. Go to “MAP 1520: Power” on page 89. The power LEDs on the operator panel and the power supply come on and stay on, but the system does not power on. 1. Check the service processor error log. 2. Go to “MAP 1520: Power” on page 89. A rack or a rack-mounted unit will not power on. 1. Check the service processor error log. 2. Go to “MAP 1520: Power” on page 89. The cooling fan(s) do not come on, or come on but do not stay on. 1. Check the service processor error log. 2. Go to “MAP 1520: Power” on page 89. The system attention LED on the operator panel is on and there is no error code displayed. 1. Check the service processor error log. 2. Go to “MAP 1520: Power” on page 89. 888 Sequence in Operator Panel Display An 888 sequence is displayed in the operator panel display. Go to the Fast Path MAP in the Diagnostic Information for Multiple Bus Systems. Other Symptoms or Problems The operator panel displays OK and the fans are off. The service processor is ready. The system is waiting for power-on. Boot the system. If the boot is unsuccessful, and the system returns to the OK, go to “MAP 1020: Problem Determination” on page 64. The operator panel displays STBY. The service processor is ready. The system was shut down by the operating system and is still powered on. This condition can be requested by a privileged system user with no faults. See the service processor error log for possible operating system fault indications. All of the system POST indicators are displayed on the system console, the system pauses and then restarts. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the system console during the power-on self-test (POST). Go to “Boot Problems/Concerns” on page 173. Chapter 3. Maintenance Analysis Procedures (MAPs) 61 Symptom Action The system stops and all of the POST indicators are displayed on the system console. The term POST indicators refers to the device mnemonics (the words memory, keyboard, network, scsi, and speaker) that appear on the system console during the power-on self-test (POST). Go to “Boot Problems/Concerns” on page 173. The system stops and the message STARTING SOFTWARE PLEASE WAIT... is displayed on the firmware console. Go to Chapter 4, “Checkpoints”, on page 141. Symptom: The system does not respond to the password being entered or the system login prompt is displayed when booting in service mode. Action: 1. If the password is being entered from the hardware management console (HMC), go to the Hardware Management Console Maintenance Guide. 2. If the password is being entered from a keyboard attached to the system, then the keyboard or its controller may be faulty. In this case, replace these parts in the following order: a. Keyboard b. CEC backplane, location U0.1-P1. (See notes on page 57). 3. If the password is being entered from an ASCII terminal, then use the problem determination procedures for the ASCII terminal. Make sure the ASCII terminal is connected to S1. If the problem persists, replace the CEC backplane, location U0.1-P1. (See notes on page 57). If the problem is fixed, go to ²MAP 0410: Repair Checkout², in the Diagnostic Information for Multiple Bus Systems. If the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. Symptom: No codes are displayed on the operator panel within a few seconds of turning on the system. The operator panel is blank before the system is powered on. Action: Reseat the operator panel cable. If the problem is not resolved, replace in the following order: 1. Operator panel assembly, location U0.1-L1. Swap the VPD module from the old operator panel to the new operator panel. 2. CEC backplane, location U0.1-P1. (See notes on 57). If the problem is fixed, go to ²MAP 0410: Repair Checkout², in the Diagnostic Information for Multiple Bus Systems. If the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. Symptom: The SMS configuration list or boot sequence selection menu shows more SCSI devices attached to a controller/adapter than are actually attached. Action: A device may be set to use the same SCSI bus ID as the control adapter. Note the ID being used by the controller/adapter (this can be checked and/or changed through an SMS utility), and verify that no device attached to the controller is set to use that ID. If settings do not appear to be in conflict: 1. Go to “MAP 1542: I/O Problem Isolation” on page 111. 2. Replace the SCSI cable. 3. Replace the device. 4. Replace the SCSI adapter Note: In a ²twin-tailed² configuration where there is more than one initiator device (normally another system) attached to the SCSI bus, it may be necessary to use SMS utilities to change the ID of the SCSI controller or adapter. 62 Service Guide Symptom Action Symptom: The System Management Services menu is displayed. Action: If you are loading diagnostics from the CD-ROM, you may not have pressed the correct key when you were trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step. If you are sure you pressed the correct key, the device or media you are attempting to boot from may be faulty. 1. If you are using the hardware management console (HMC) or an ASCII terminal: v Select View Error Log. v If an error is logged, check the time stamp. v If the error was logged during the current boot attempt, record it. v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179 and do the listed action. v If no recent error is logged in the error log, go to step 3. 2. If you are using a keyboard and a graphic display attached to the system: v Select View Error Log. v If an error is logged, check the time stamp. v If the error was logged during the current boot attempt, record it. v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179 v If no recent error is logged in the error log, go to step 3. 3. Try to boot from an alternate boot device connected to the same controller as the original boot device. If the boot succeeds, replace the original boot device (for removable media devices, try the media first). 4. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. You suspect a cable problem. Go to Devices, and Cable Information for Multiple Bus Systems. All other problems. Go to “MAP 1020: Problem Determination” on page 64. Chapter 3. Maintenance Analysis Procedures (MAPs) 63 MAP 1020: Problem Determination Purpose of This MAP Use this MAP to get an error code if you were not provided one by the customer or you are unable to load diagnostics. If you are able to load the diagnostics, go to MAP 0020 in the Diagnostic Information for Multiple Bus Systems. The service processor may have recorded one or more symptoms in its error log. Examine this error log before proceeding (see “System Information Menu” on page 317). The system may have been set up by using the HMC. Check the Service Action Event (SAE) log in the Service Focal Point. The SAE log may have recorded one or more symptoms in the Service Focal Point. To avoid unnecessary replacement of the same FRU for the same problem, check the SAE log for evidence of prior service activity on the same subsystem. The service processor may have been set by the user to monitor system operations and to attempt recoveries. You can disable these actions while you diagnose and service the system. If the system was set up according to the recommendations of the 286/C4 and 286/E4 User’s Guide, all the settings of the service processor (except language) were saved by using the SAVE/RESTORE HARDWARE MAINTENANCE POLICIES service aid. You can use the same service aid to restore the settings at the conclusion of your service action. In case the service processor settings were not saved by the user, if you disable them, make notes of their current settings for restoration before you leave. In addition to the parameters in the following table, you can disconnect the modem to prevent incoming signals that could cause the system to power on. Following are the service processor settings. The service processor menus are described in Chapter 7, “Using the Service Processor”, on page 305. Surveillance From the service processor Setup Menu, go to the Surveillance Setup Menu and disable surveillance. Unattended Start From the service processor System Power Control Menu, disable unattended start mode. Reboot Policy From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: 1. Number of reboot attempts to 0 (zero) 2. Use OS-Defined restart policy to No 3. Enable supplemental restart policy to No. Call-Out From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. 64 Service Guide Another feature that could disrupt a service action by powering the system on is an AIX function called Timed Power-On (not available when Linux is the operating system). For more information about timed power-on, see “System Power-On Methods” on page 330. Be prepared to record code numbers and use those numbers in the course of analyzing a problem. Go to “Step 1020-1”. Step 1020-1 The following steps analyze a failure to load the diagnostic programs. Note: Be prepared to answer questions regarding the operator panel display and to perform certain actions based on displayed POST indicators. Be observant of these conditions. 1. Power off the system. Refer to “Stopping the System without an HMC Attached” on page 359. 2. Select slow boot mode (select disable fast boot) on the system power control menu from the service processor main menu. 3. Power on the system. Refer to “Starting the System without an HMC Attached” on page 360. 4. Insert the diagnostic CD-ROM into the CD-ROM drive. 5. When the keyboard indicator is displayed (the word keyboard), press the 5 key on the system console. 6. Enter a password, if requested. 7. Wait until the diagnostics are loaded or the system appears to stop. If you receive an error code or if the system stops before diagnostics are loaded, find your symptom in the following table, then follow the instructions given in the Action column. If no fault is identified, continue to the next step. 8. Run the standalone diagnostics on the entire system. Find your symptom in the following table, then follow the instructions given in the Action column. If no fault is identified, call service support for assistance. Chapter 3. Maintenance Analysis Procedures (MAPs) 65 Symptom Action The system stopped and a code is displayed on the operator panel. Go to the “Quick Entry MAP” on page 58. The rack identify LED does not operate properly. Go to the “Entry MAP” on page 57. The system stops with a prompt to enter a password. Enter the password. You cannot continue until a correct password has been entered. When you have entered a valid password, go to the beginning of this table and wait for one of the other conditions to occur. The diagnostic operating instructions are displayed. Go to ²MAP 0020: Problem Determination Procedure² in the Diagnostic Information for Multiple Bus Systems. Symptom:The system login prompt is displayed. Action: If you are loading the diagnostics from a CD-ROM, you may not have pressed the correct key or you may not have pressed the key soon enough when you were trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step. Note: Perform the system shutdown procedure before turning off the system. If you are sure you pressed the correct key in a timely manner, go to “Step 1020-2” on page 67. If you are loading diagnostics from a NIM server, check for the following: v The bootlist on the client may be incorrect. v Cstate on the NIM server may be incorrect. v There may be network problems preventing you from connecting to the NIM server. Verify the settings and the status of the network. If you continue to have problems refer to “Boot Problems/Concerns” on page 173 and follow the steps for network boot problems. The system does not respond when the password is entered. Go to “Step 1020-2” on page 67. Symptom: The system stopped. A POST indicator is displayed on the system console and an eight-digit error code is not displayed. Action: If the POST indicator represents: 1. Memory, go to “MAP 1240: Memory Problem Resolution” on page 68. 2. Keyboard a. Replace the keyboard. b. Replace the CEC backplane, location U0.1-P1. (See notes on page 57). c. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 3. Network, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 4. SCSI, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 5. Speaker a. Replace the operator panel, location U0.1-L1. Refer to “Operator Panel” on page 419. b. Replace the CEC backplane, location: U0.1-P1. (See notes on page 57). c. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 66 Service Guide Symptom Action Symptom: The System Management Services menu is displayed. Action: If you are loading diagnostics from the CD-ROM, you may not have pressed the correct key when you were trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step. If you are sure you pressed the correct key, the device or media you are attempting to boot from may be faulty. 1. If you are using the hardware management console (HMC) or an ASCII terminal: v Select View Error Log. v If an error is logged, check the time stamp. v If the error was logged during the current boot attempt, record it. v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179 and do the listed action. v If no recent error is logged in the error log, go to step 3. 2. If you are using a keyboard and a graphic display attached to the system: v Select View Error Log. v If an error is logged, check the time stamp. v If the error was logged during the current boot attempt, record it. v Look up the error in Chapter 5, “Error Code to FRU Index”, on page 179 and do the listed action. v If no recent error is logged in the error log, go to step 3. 3. Try to boot from an alternate boot device connected to the same controller as the original boot device. If the boot succeeds, replace the original boot device (for removable media devices, try the media first). 4. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. All other symptoms. If you were directed here from the Entry MAP, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. Otherwise, find the symptom in the “Quick Entry MAP” on page 58. Step 1020-2 There is a problem with the keyboard. Find the type of keyboard you are using in the following table; then follow the instructions given in the Action column. Keyboard Type Action Type 101 keyboard (U.S.). Identified by the size of the Enter key. The Enter key is in only one horizontal row of keys. v Replace the type 101 keyboard. v Replace the CEC backplane, location U0.1-P1. (See notes on page 57.) Type 102 keyboard (W.T.). Identified by the size of the Enter key. The Enter key extends into two horizontal rows. v Replace the type 102 keyboard. v Replace the CEC backplane, location U0.1-P1. (See notes on page 57.) Type 106 keyboard. (Identified by the Japanese characters.) v Replace the type 106 keyboard. v Replace the CEC backplane, location U0.1-P1. (See notes on page 57.) ASCII terminal keyboard Go to the documentation for this type of ASCII terminal and continue with problem determination. If the problem was not corrected by one of the previous actions and you were not already directed to go to MAP 1548, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. Chapter 3. Maintenance Analysis Procedures (MAPs) 67 MAP 1240: Memory Problem Resolution Note: The firmware or error code that sent you here could be one of the following: v Service Processor Checkpoints 910B, 910C, 910E or 95xx v Firmware Checkpoints E134 or E701 v System Power Control Network Error Code 1011840C v Firmware Error Codes 25Cyy001, 25Cyy002, 25B00001 or 25B00004 v Service Processor Error Codes 406x001x, 406x010x, 406x013x, 406x0CA6 or 450x30xx v Common Firmware Error Codes B1xx466x v Problem Determination Error Code M0MEM002 These checkpoints and error codes are referred to as a memory code in this MAP. Purpose of This MAP This MAP is used to troubleshoot a problem when the system stops with a memory checkpoint or memory error code on the operator panel display or console. Notes: 1. If the symptom changes while you are using this MAP, check for loose cards, cables, and obvious problems. If you do not find a problem, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 2. The service processor may have recorded one or more symptoms in its error log. It is a good idea to examine that error log before proceeding (see Service Processor System Information Menu). 3. The service processor may have been set by the user to monitor service operations and to attempt recoveries. You might want to disable these actions while you diagnose and service the system. If you disable them, make note of their current settings so that you can restore them before you leave. The following settings may be of interest to you. Surveillance From the Service Processor Setup Menu, go to the Surveillance Setup menu and disable surveillance. Unattended Start From the Service Processor System Power Control Menu, disable unattended start mode. Reboot Policy From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: 1. Number of reboot attempts to 0 (zero) 2. Use OS-Defined restart policy to No. 3. Enable supplemental restart policy to No. Call Out From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. General Memory Information Be sure to unplug the power cables before removing or installing the processor cards or memory DIMMs to avoid damage to them. It is acceptable for the memory DIMMs to partially populate each processor card in a multi-processor card system. For example, 4 memory DIMMs can be placed onto processor card 1, and another 4 memory DIMMs can be placed onto processor card 2. The system uses both sets of memory DIMMs indiscriminately. 68 Service Guide Memory DIMMs, however, must be installed in matched (size and speed) quads. Refer to “Memory DIMMs” on page 427 for labeling of the processor card and instructions on memory DIMM removal and installation. Do not, however, replace the covers as directed while troubleshooting this problem. Memory DIMM quads can only be installed in one of the following two memory DIMM slot combinations: v 1, 3, 6 and 8 v 2, 4, 5 and 7 There is no requirement that memory DIMM slots 1, 3, 6 and 8 be populated before memory DIMM slots 2, 4, 5 and 7. Attention: Go to the service processor menus. In the System Information Menu, select Memory Configuration/ Deconfiguration. Verify on the first screen that memory repeat gard is disabled. If it is enabled, disable it for the duration of this MAP. When exiting this MAP to go to MAP 0410 in the Diagnostic Information for Multiple Bus Systems; also return the setting to enabled if necessary. Step 1240-1 1. Record the error code that sent you to this map. 2. Go to the service processor menus and disable memory repeat gard if it is not already disabled. 3. Turn off the power. Remove the power cord. 4. Remove and reinstall the processor cards. 5. Reattach the power cord. Wait for OK to display in the operator panel. 6. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 7. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded earlier in this step still appear? No Reseating the processor card(s) has corrected the problem. Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Yes If there is only 1 processor card installed, tag it as ²suspect bad² and go to “Step 1240-6” on page 71. If there are 2 processor cards installed, go to “Step 1240-2”. Step 1240-2 1. Turn off the power. Remove the power cord. 2. Remove processor card 2. 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” still appear? No Go to “Step 1240-3” on page 70. Yes Go to “Step 1240-6” on page 71. Chapter 3. Maintenance Analysis Procedures (MAPs) 69 Step 1240-3 1. Turn off the power. Remove the power cord. 2. Remove processor card 1. 3. Reinstall processor card 2 in slot 1. 4. Reattach the power cord. Wait for OK to display in the operator panel. 5. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 6. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? No Go to “Step 1240-4”. Yes Go to “Step 1240-6” on page 71. Step 1240-4 1. Turn off the power. Remove the power cord. 2. Install processor card 1 in slot 2. 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Have any DIMMs been deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? No Reseating the processor cards has corrected the problem. Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Yes Go to “Step 1240-5” on page 71. 70 Service Guide Step 1240-5 One of the FRUs remaining in the system unit is defective. 1. Turn off the power. Remove the power cord. 2. Exchange the following FRUs one at a time in the order listed: v CEC backplane v Processor cards 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? No Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Yes Reinstall the original FRU. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all FRUs have been exchanged, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. Step 1240-6 1. Turn off the power. Remove the power cord. 2. Reseat the DIMMs. Reinstall the processor card. 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? NO Reseating the DIMMs has corrected the problem. Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES If there is only one quad of DIMMs on the card, go to “Step 1240-10” on page 73. If there are two quads of DIMMs on the card, go to “Step 1240-7” on page 72. Chapter 3. Maintenance Analysis Procedures (MAPs) 71 Step 1240-7 1. Turn off the power. Remove the power cord. 2. Remove one quad of DIMMs. Record the positions of the DIMMs as they are removed. When instructed to reinstall the DIMMs, install them back into the same slots from which they were removed. 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? NO Go to “Step 1240-8”. YES Go to “Step 1240-10” on page 73. Step 1240-8 1. Turn off the power. Remove the power cord. 2. Move the quad of DIMMs located on the card to the other quad’s slots. 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? NO Go to “Step 1240-9”. YES Replace the processor card. Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1240-9 The failing DIMM(s) is in the quad of DIMMs that was removed in “Step 1240-7”. Do the following: 1. Turn off the power. Remove the power cord. 2. Remove the quad of DIMMs that is on the processor card and reinstall the quad of DIMMs that had been removed. The DIMMs can be reinstalled in either quad. Go to “Step 1240-11” on page 73. 72 Service Guide Step 1240-10 1. Turn off the power. Remove the power cord. 2. Move the quad of DIMMs that is on the processor card to the other quad’s location. 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? NO Replace the processor card. Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES One or more of the DIMMs on the card is defective. Go to “Step 1240-11”. Step 1240-11 Attention: If only one quad of DIMMs is on the processor card, you must have at least one replacement DIMM of the same size and speed as the DIMMs that are on the card. If there were two quad of DIMMs on the processor card, but they are not the same size, your must have at least one replacement DIMM of the same size and speed as the DIMMs in the quad that is failing. If there were two quad of DIMMs on the processor card, and they are the same size and speed, DIMMs from the good quad can be used when isolating the failing DIMM in the suspect quad. 1. Turn off the power. Remove the power cord. 2. Remove one of the DIMMs on the processor card and install the replacement DIMM or another known good DIMM. 3. Reattach the power cord. Wait for OK to display in the operator panel. 4. Turn on the power. Attempt to boot to the SMS menus. (The system may stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white button. After the OK displays in the operator panel, look at the memory configuration/deconfiguration menu. Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 69 still appear? NO The DIMM that was just removed was the failing DIMM; replace it. Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1240-12” on page 74. Chapter 3. Maintenance Analysis Procedures (MAPs) 73 Step 1240-12 Have all the DIMMs in the quad located on the processor card been replaced? NO Go to “Step 1240-11” on page 73 to test the next DIMM of the quad on the processor card. YES Go to “Step 1240-13”. Step 1240-13 Did the processor originally have only one quad of DIMMs at the start of this map? NO Call service support. YES Replace the processor card. This ends the procedure. Enable memory repeat gard if necessary. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. 74 Service Guide MAP 1320: Service Focal Point Procedures These procedures define the steps to be taken when servicing a machine equipped with Service Focal Point (SFP). Note: The various codes that might display on the HMC are all listed as error codes by Service Focal Point (SFP). To assist you in identifying the types of error data in this guide, use the following table. Number of Digits in Error Code Error code Service Guide Name Any Contains # (pound sign) Menu Goal Any Contains - (hyphen) SRN 5 Does not contain # or - SRN 6 Does not contain # or - Error Code 8 Does not contain # or - Error Code, Service Reference Code (SRC) For steps to be taken when beginning service on a machine with SFP, see the “MAP 1321: Quick Entry MAP for Systems with Service Focal Point” on page 76. For steps to be taken after service is complete on a machine with SFP, see the “MAP 1322: End of Call MAP for Systems with Service Focal Point” on page 84. Chapter 3. Maintenance Analysis Procedures (MAPs) 75 MAP 1321: Quick Entry MAP for Systems with Service Focal Point Step 1321-1 For future reference, record the error code and the location code (that brought you to this MAP) of the FRU to be replaced. Record any other symptoms if an error code is not available. Does the system have an HMC attached? No Go to “Step 1321-22” on page 78. Yes Go to “Step 1321-2”. Step 1321-2 At the HMC console, start Service Focal Point (SFP) and examine the service action event log for any open service action events. For SFP operating instructions, see “Service Focal Point” on page 48. Does the HMC appear to operate correctly? No Go to “Quick Entry MAP” on page 58, look up the HMC symptom, and follow the directions there. Yes Go to “Step 1321-3”. Step 1321-3 Are there any service action events that are open? No Go to “Step 1321-40” on page 82. Yes Go to “Step 1321-4”. Step 1321-4 Record the list of open service action events. Step 1321-5 From the list of serviceable events recorded in “Step 1321-4”, open the details of each service action event and examine the FRU list and error code(s) for each. Step 1321-6 Are there any serviceable actions events with error codes of the form #xxxxxx in the list of error codes found in “Step 1321-5”? No Go to “Step 1321-7”. Yes Go to “Step 1321-18” on page 78. Step 1321-7 Are there any serviceable events with FRU location codes reported in “Step 1321-5”? No Go to “Step 1321-14” on page 77. Yes Go to “Step 1321-8”. Step 1321-8 From the list of open service action events with FRUs reported, examine the details of each service action event and record the error code. 76 Service Guide Step 1321-9 Examine the list of error codes you obtained from “Step 1321-8” on page 76 and look for any of the form 4xxB xxxx, A0D-34x, or A1D-34x. Step 1321-10 Are there any error codes from step“Step 1321-8” on page 76 of the form 4xxB xxxx, A0D-34x, or A1D-34x? No Go to “Step 1321-13”. Yes Go to “Step 1321-11”. Step 1321-11 Do any of the error codes from “Step 1321-8” on page 76 of the form 4xxB xxxx, A0D-34x, or A1D-34x have the same first two characters on the left as the first two characters on the left of the error code recorded in “Step 1321-1” on page 76 that sent you to this MAP? No Go to “Step 1321-13”. Yes Record the error codes from “Step 1321-8” on page 76 of the form 4xxB xxxx, A0D-34x, or A1D-34x that have the same first two characters on the left as the first two characters on the left of the error code recorded in “Step 1321-1” on page 76 that sent you to this MAP. Also, record the service action event associated with each error code. Go to “Step 1321-12”. Step 1321-12 From the list of open service action events with error codes of the form 4xxB xxxx, A0D-34x, or A1D-34x recorded in “Step 1321-11”, choose the service action event with the earliest time stamp (the one that occurred first), record its error code and FRU location(s), and proceed to “Step 1321-14” with this new code. When future steps refer to the error code recorded in “Step 1321-1” on page 76, use this new error code and its location code(s) instead of the original error code that sent you to this MAP. Step 1321-13 From the list of open service action events with FRUs reported, choose the service action event with the earliest time stamp (the one that occurred first), record its error code and FRU location(s), and proceed to “Step 1321-14” with this new error code. When future steps refer to the error code recorded in “Step 1321-1” on page 76, use this new error code and its location code(s) instead of the original error code that sent you to this MAP. Step 1321-14 At the HMC console, open Service Focal Point (SFP) and examine the service action event log for any closed service action events. For SFP operating instructions, refer to “Service Focal Point” on page 48. Step 1321-15 Are there any service action events that are closed? No Go to the table in “Entry MAP” on page 57. Yes Go to “Step 1321-16” on page 78. Chapter 3. Maintenance Analysis Procedures (MAPs) 77 Step 1321-16 Examine the details of the ten most-recently closed service action events. Are there any error codes or symptoms in the ten latest closed service action events that are the same as the error code or symptom recorded in “Step 1321-1” on page 76? No Go to the table in “Entry MAP” on page 57. Yes Go to “Step 1321-17”. Step 1321-17 Examine the FRUs of the same error codes or symptoms. Are there any FRUs in the same error codes or symptoms of the closed service action events left to be replaced? No Go to the table in “Entry MAP” on page 57. Yes Replace the next FRU of same error codes or symptoms of the closed service action events. Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. Step 1321-18 From the list of serviceable events with error code(s) of the form #xxxxxx, examine the details of each serviceable event to display the description field of the error code(s) of the form #xxxxxx. Step 1321-19 Double click on the description field to view the complete text. Step 1321-20 Perform any actions that may be listed in the descriptions of all error codes of the form #xxxxxx before proceeding. Step 1321-21 Did you find a problem? No Go to “Step 1321-7” on page 76. Yes Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. Step 1321-22 Is Linux installed on all of the system’s partitions? No Go to “Step 1321-2” on page 76. Yes Look for and record any additional error information or software messages on the operator panel or the console (serial TTY, virtual terminal, or HMC console). Go to “Step 1321-23” on page 79. 78 Service Guide Step 1321-23 Is Linux functional in the partition with the problem? No Go to “Step 1321-34” on page 81. Yes Go to “Step 1321-24”. Attention: “Step 1321-24” through “Step 1321-40” on page 82 must be performed on a partition running the Linux operating system. Step 1321-24 If your system is functional, do the following: Examine the Linux system log by logging in to the system as the root user and entering the following command: cat /var/log/messages |grep RTAS |more Linux run-time error messages are logged in the messages file under /var/log. The following is an example of the Linux system error log. Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan begin -------- Aug 27 18:13:41 rasler kernel: RTAS: Location Code: U0.1-P1-C1 Aug 27 18:13:41 rasler kernel: RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL Aug 27 18:13:41 rasler kernel: RTAS: initiator: UNKNOWN target: UNKNOWN Aug 27 18:13:41 rasler kernel: RTAS: Status: predictive new Aug 27 18:13:41 rasler kernel: RTAS: Date/Time: 20020827 18134000 Aug 27 18:13:41 rasler kernel: RTAS: CPU Failure Aug 27 18:13:41 rasler kernel: RTAS: CPU id: 0 Aug 27 18:13:41 rasler kernel: RTAS: Failing element: 0x0000 Aug 27 18:13:41 rasler kernel: RTAS: A reboot of the system may correct the problem Aug 27 18:13:41 rasler kernel: RTAS: -------- event-scan end ---------- Step 1321-25 If present, record any additional error information recorded in the Linux system log. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-24” and “Step 1321-25” for all logical partitions that have Linux installed. Chapter 3. Maintenance Analysis Procedures (MAPs) 79 Step 1321-26 Examine the Linux boot (IPL) log by logging in to the system as the root user and enter the following command: cat /var/log/boot.msg |grep RTAS |more Linux boot (IPL) error messages are logged into the boot.msg file under /var/log. The following is an example of the Linux boot error log. RTAS daemon started RTAS: -------- event-scan begin -------- RTAS: Location Code: U0.1-F3 RTAS: WARNING: (FULLY RECOVERED) type: SENSOR RTAS: initiator: UNKNOWN target: UNKNOWN RTAS: Status: bypassed new RTAS: Date/Time: 20020830 14404000 RTAS: Environment and Power Warning RTAS: EPOW Sensor Value: 0x00000001 RTAS: EPOW caused by fan failure RTAS: -------- event-scan end ---------- Step 1321-27 If present, record any additional error information reported in the Linux boot (IPL) log. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-26” and “Step 1321-27” for all logical partitions that have LINUX installed. Step 1321-28 Examine the Linux extended error data log. The following is an example of the Linux extended error log. <3>RTAS daemon started <3>RTAS: -------- event-scan begin -------- <3>RTAS: Location Code: U0.1-P1-C2 <4>RTAS: Log Debug: 04 4b2726fb04a00011702c0014000000000000000000000000f1800001001801d3ffffffff010000000000000042343138 20202020383030343236464238454134303030303 030303030303030 <4>RTAS: Log Debug: D2 5046413405020d0a000001000271400100000033434d502044415441000001000000000000010000f180000153595 320444154410000000000000000200216271501050920021627150105092002063715010509535243204441544170 2c001400000000000000020018820201d382000000000000000000000000000000000000000000000000000000000 0000000000000000000000000000000000000000000000000000000000000000280048ea400000000000000000000 000000000000000000004350542044415441702cff08000000001c000000702cf0080000000080000000702cf10070 2cf200702c000400000800702c01040bf2002e702c02040c1fffbf702c0300702c1000702c11040bf2002e702c12040 c1fffbf702c1300702ca000702ca108000000000000a03c702ca208000000000000effc702cb000702cb10800000000 0000a03c702cb208000000000000effc702cc000702cc108000000000000a03c702cc208000000000000effc702c3 000702c31080000000000000003702c3208000000000000007b702c8000702c81080000000020e27a39702c820800 000000fffeffff702cd000702cd1080000000010004010702cd208000000007777f3ffffffffffffffffffffffffffffffffffffff ffffffffffffffffffffffffffff <3>RTAS: WARNING: (FULLY RECOVERED) type: INTERN_DEV_FAIL <3>RTAS: initiator: UNKNOWN target: UNKNOWN <3>RTAS: Status: unrecoverable new <3>RTAS: Date/Time: 20020905 15372200 <3>RTAS: CPU Failure <3>RTAS: Internal error (not cache) <3>RTAS: CPU id: 0 <3>RTAS: Failing element: 0x0000 <3>RTAS: -------- event-scan end --------- 80 Service Guide Step 1321-29 If present, record any additional error information in the Linux extended data error log. Note: The line(s) in the Linux extended error log that begin with ²<4>RTAS: Log Debug: 04² contain the error code listed in the next 8 hex characters. In the example above, ²4b27 26fb² is an error code. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-28” on page 80 and “Step 1321-29” for all logical partitions that have Linux installed. Step 1321-30 Use the lscfg command to list all resources assigned to all partitions. Record the adapter and the partition for each resource. Step 1321-31 Compare the list of found resources and partition assignments to the customer’s known configuration to determine if any device(s) or adapter(s) are missing. Record any missing adapters or devices. If the system is configured with more than one logical partition with Linux installed, repeat “Step 1321-30” and “Step 1321-31” for all logical partitions that have Linux installed. Step 1321-32 Go to “Step 1321-33”. Step 1321-33 Go to “Step 1321-34”. Step 1321-34 Were any error codes, checkpoints, or symptoms recorded in Steps 1321-1, 1321-22, 1321-25, 1321-27, 1321-29, or 1321-31? No Go to “Step 1321-35”. Yes Go to the “Quick Entry MAP” on page 58 with each recorded error code or symptom. Perform the indicated actions one at a time for each error code until the problem has been corrected. If all recorded error codes have been processed and the problem has not been corrected, go to “Step 1321-35”. Step 1321-35 If no additional error information is available and the problem has not been corrected, shut down the system. If an HMC is not attached, you need a laptop (and cable, part number 62H4857) capable of connecting to serial port 1 on the processor subsystem. The Linux login prompt cannot be seen on a laptop connected to serial port 1. 1. If there is no HMC, attach the laptop and cable to serial port 1 on the processor subsystem. 2. When OK displays in the operator panel, press a key on the virtual terminal on the laptop. The service processor menus are available on the attached laptop or on the HMC. 3. Examine the service processor (SP) error log. Record all error codes and messages written to the SP error log. Go to “Step 1321-36” on page 82. 4. If the service processor menus are not available, go to “Step 1321-37” on page 82. Chapter 3. Maintenance Analysis Procedures (MAPs) 81 Step 1321-36 Boot the system to the system management services (SMS) menus. Record all error codes and messages written to the SMS error log. Step 1321-37 Were any error codes recorded in steps 1321-35 or 1321-36 No Go to “Step 1321-41” on page 83. Yes Go to the “Quick Entry MAP” on page 58 with each error code or symptom you have recorded. Perform the indicated actions, one at a time, until the problem has been corrected. If all recorded error codes have been processed and the problem is has not been corrected, go to “Step 1321-41” on page 83. Step 1321-38 Does the system appear to stop or hang before reaching the login prompt or did you record any missing resources in “Step 1321-31” on page 81? No Go to “Step 1321-39”. Yes There may be a problem with an I/O device. Go to “MAP 1542: I/O Problem Isolation” on page 111. In MAP 1542, when instructed to boot the system, boot a full system partition. Step 1321-39 Boot the AIX standalone diagnostics and run diagnostics on all resources. Be sure to boot a full system partition. Ensure that AIX diagnostics was run on all known resources. You may need to select each resource individually and run diagnostics on each resource one at a time. Check to make sure that AIX diagnostics was run on all known resources. You may need to select each resource individually and run diagnostics on each resource one at a time. Did AIX standalone diagnostics find a problem? No You may have a software problem, call service support. Yes Go to the “Entry MAP” on page 57 with each error code or SRN you have recorded. Until the problem has been corrected, perform the indicated actions, one at a time, for each SRN or error code not already processed in “Step 1321-37”. If all recorded SRNs or error codes have been processed and the problem has not been corrected, call service support. Step 1321-40 Does the system have Linux installed on one or more partitions? No Go to “Step 1321-14” on page 77. Yes Go to “Step 1321-22” on page 78. 82 Service Guide Step 1321-41 Have you replaced all parts whose location codes were recorded in Steps 1321-25, 1321-27,1321-29, or 1321-31? No Replace, one at a time, all parts whose location code was recorded in Steps 1321-25, 1321-27, 1321-29, and 1321-31 that have not been replaced. Power off the system before replacing a part. After replacing the part, power on the system to check if the problem has been corrected. Go to “Step 1321-42” when the problem has been corrected, or all parts in the location codes list have been replaced. Yes Go to “Step 1321-38” on page 82. Step 1321-42 Was the problem corrected in Step 1321-41? No Go to “Step 1321-38” on page 82. Yes Return the system to its original configuration. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures (MAPs) 83 MAP 1322: End of Call MAP for Systems with Service Focal Point Step 1322-1 For future reference, record the SRC or symptom and the location code of the FRU you replaced. Step 1322-2 For service focal point operating instructions, refer to “Service Focal Point” on page 48. At the HMC console, open Service Focal Point and examine the service action event log for any open service action events. Step 1322-3 Are there any service action events that are open? No If the System Attention LED is still on, use the HMC to turn off the LED as described in “Systems Attached To an HMC” on page 56. This completes the repair, return the system to the customer. Yes Go to “Step 1322-4”. Step 1322-4 Record the list of open service action events. Step 1322-5 From the list of serviceable events recorded in “Step 1322-4”, perform “Step 1322-6” through “Step 1322-33” on page 87 for each open service action event. Step 1322-6 Determine the error class of the serviceable event. Record for future use. Step 1322-7 Examine the details of the open service action event. Is the error code associated with this service action event the same as recorded in ²Step 1322-1²? No Go to “Step 1322-8”. Yes Go to “Step 1322-11” on page 85. Step 1322-8 Examine the FRU list of the service action event. Are there any FRUs listed for the service action event? No Go to “Step 1321-11” on page 77. Yes Go to “Step 1321-9” on page 77. 84 Service Guide Step 1322-9 Is the FRU list identical (same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in “Step 1322-1” on page 84? No Go to “Step 1322-10”. Yes Go to “Step 1322-11”. Step 1322-10 The FRU list is different. Is the FRU you replaced and recorded in “Step 1322-1” on page 84 in the list of FRUs for this service action event? No Go to “Step 1322-33” on page 87. Note: There are service action events that will remain open when you leave this MAP. Further service actions may be required to complete the repair. Yes Go to “Step 1322-11”. Step 1322-11 Examine the details of this service action event, and record the partition(s) involved in this service action event for use in a later step. Step 1322-12 Is the error code associated with this service action event of the form A11-xxx or A01-xxx? No Go to “Step 1322-17” on page 86. Yes Go to “Step 1322-13”. Step 1322-13 Have you begun a list of “Axx” partitions from prior service action events that you processed in this MAP? No Go to “Step 1322-14”. Yes Go to “Step 1322-15”. Step 1322-14 Begin a new list of “Axx” partitions by copying the list of partitions obtained in “Step 1322-11”. Go to “Step 1322-16” on page 86. Step 1322-15 Add the partition list obtained in “Step 1322-11” to the existing list of “Axx” partitions obtained from processing previous service action events in this MAP. Chapter 3. Maintenance Analysis Procedures (MAPs) 85 Step 1322-16 Remove all entries in the list of all partition(s) you recorded in “Step 1322-11” on page 85. If you are referred to the list of partition(s) obtained in “Step 1322-11” on page 85 in future steps, the list is empty. Go to “Step 1322-17”. Step 1322-17 Select and highlight the service action event from the “Error Associated With This Serviceable Event” window. Step 1322-18 Select the Close Event button. Step 1322-19 Add comments for the serviceable event. Include any unique additional information. Select the OK button. Step 1322-20 The following steps will add or update FRU information. Step 1322-21 Did you replace, add, or modify a FRU of the open service action event? No Go to “Step 1322-23”. Yes Go to “Step 1322-22”. Step 1322-22 From the FRU list, select a FRU that you need to update. Double-click on the FRU and update the FRU information. Go to “Step 1322-24”. Step 1322-23 Select the No FRU Replaced for this Serviceable Event option. Step 1322-24 Select OK to close the service action event. Step 1322-25 Is the list of all partition(s) you recorded in “Step 1322-11” on page 85 empty? No Go to “Step 1322-26”. Yes Go to “Step 1322-33” on page 87. Step 1322-26 Does the list of all partition(s) you recorded in “Step 1322-11” on page 85 contain more than one entry? No Go to “Step 1322-33” on page 87. Yes Go to “Step 1322-27” on page 87. 86 Service Guide Step 1322-27 Is the error class recorded in “Step 1322-6” on page 84 AIX? No Go to “Step 1322-33”. Yes Go to “Step 1322-28”. Step 1322-28 Perform the following steps for each entry in the list of all partition(s) you recorded in “Step 1322-11” on page 85, except the partition you were using to debug the original problem. Step 1322-29 From the HMC virtual terminal window of a partition in the list of all partitions, type diag at the AIX command prompt. Step 1322-30 1. When the diagnostic operating instructions are displayed, press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click commit after you have made your selection. Step 1322-31 Exit from diagnostics in this partition and return to the AIX prompt. Step 1322-32 Have all the partitions in the list of all partition(s) you recorded in “Step 1322-11” on page 85 been processed? No Go to “Step 1322-28” to process the next partition in the list you recorded in “Step 1322-11” on page 85. Yes Go to “Step 1322-33”. Step 1322-33 Have all the serviceable events recorded in “Step 1322-4” on page 84 been processed? No Go “Step 1322-5” on page 84 and process the next service action event in the list of serviceable events recorded in the list of serviceable events recorded in “Step 1322-4” on page 84. Yes Go to “Step 1322-34” on page 88. Chapter 3. Maintenance Analysis Procedures (MAPs) 87 Step 1322-34 While processing all service action events, were you directed to “Step 1322-14” on page 85? No If the System Attention LED is still on, use the HMC to turn off the LED as described in “Systems Attached To an HMC” on page 56. This completes the repair, return the system to the customer. Note: If during the processing of the list of open service action events, some service actions events were left open, further service actions may be required to complete the repair. Yes Go to “Step 1322-35”. Step 1322-35 Perform the following steps for each entry in the list of “Axx” partitions you began recording in “Step 1322-14” on page 85, except the partition you were using to debug the original problem. Step 1322-36 From the HMC virtual terminal window of a partition in the list of “Axx” partitions, type diag at the AIX command prompt. Step 1322-37 1. When the diagnostic operating instructions are displayed, press Enter. 2. Select the Task Selection option. Note: If the terminal type is not defined, you are prompted to define it before you can proceed. 3. Select the Log Repair option. 4. Select the resource associated with the repair action. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0. 5. Click commit after you have made your selection. Step 1322-38 Exit from diagnostics in this partition and return to the AIX prompt. Step 1322-39 Have all the partitions in the list of “Axx” partitions(s) you began recording in “Step 1322-14” on page 85 been processed? No Go to “Step 1322-35” to process the next partition in the list you recorded in “Step 1322-14” on page 85. Yes If the System Attention LED is still on, use the HMC to turn off the LED as described in “Systems Attached To an HMC” on page 56. This completes the repair, return the system to the customer. Note: If during the processing of the list of open service action events, some service actions events were left open, further service actions may be required to complete the repair. 88 Service Guide MAP 1520: Power Notes: 1. This is not a start-of-call MAP. Use this Power MAP only if you have been directed here from another MAP, an SRN in the Diagnostic Information for Multiple Bus Systems, or an error code. 2. This procedure is used to locate power problems in system units. If a problem is detected, this procedure helps you isolate the problem to a failing unit. 3. The 286/C4 and 286/E4 can accommodate two power supplies installed with separate power receptacles on the rear of the power supply. There are three LEDs located on the front and top of the power supply: v AC Good LED (green) v DC Good LED (green) v Power Supply Fault Indicator LED (amber) The LEDs indicate the condition of the power supply. Refer to “Component LEDs” on page 466 for additional information. 4. The condition of two power supplies being present, but only one working, can be determined by the service processor and a warning level EPOW. 5. Observe the following safety notices during service procedures. DANGER An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Before installing or removing signal cables, ensure that the power cables for the system unit and all attached devices are unplugged. When adding or removing any additional devices to or from the system, ensure that the power cables for those devices are unplugged before the signal cables are connected. If possible, disconnect all power cables from the existing system before you add a device. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines. D05 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock. C01 Chapter 3. Maintenance Analysis Procedures (MAPs) 89 DANGER To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before relocating the system. D01 FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to the following: v Rack identify LED v CEC drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – CEC subsystem FRUs – I/O subsystem FRUs v I/O adapter identify LED v DASD identify LED The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to locate the failing FRU more easily. Any identify LED in the system can be flashed when the system is in the failed state with power on by using the service processor LED Control Menu contained in the “System Information Menu” on page 317. Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is outlined in the Diagnostic Information for Multiple Bus Systems. If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of following: v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED, type FRU-LED-MENU on the command line. v If the system is logically partitioned, the HMC must be attached. For instructions on activating and deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management Console Installation and Operations Guide, order number SA38-0590. Symptom Action The base system drawer will not power on and no error codes are available. Go to “MAP 1521: Base System Drawer With Power Failure and No Error Codes” on page 91. The I/O drawer does not power on and the power LED is not lit. Go to “Map 1522: Cannot Power On the I/O Drawer, Power Present LED Does Not Come On” on page 98. Error code 1011 1B0x in the SPCN error codes table directed you here. Go to “Map 1523: The SPCN Error Codes Table Directed You Here and the Error Code is 1011 1B0x” on page 105. 90 Service Guide MAP 1521: Base System Drawer With Power Failure and No Error Codes Step 1521-1 For 286/C4, do the following: 1. Place the drawer into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366. For 286/E4, do the following: 1. Place the deskside unit into a comfortable position for you to work on. 2. Remove the service access cover as described in “Service Access Cover Removal (286/E4)” on page 371. You may have been directed to this MAP for one or more of the following reasons: v The ac power supply power LEDs are not on. The power LEDs on the operator panel and the power LEDs on the rear of the system unit are not blinking. The operator panel display is blank. Do the following: – 286/C4, go to “Step 1521-2” on page 92. – 286/E4, go to “Step 1521-5” on page 93. v The ac power supply power LEDs are on. If equipped with dc power, the dc power supply power LEDs are blinking. Power LEDs, for either ac or dc power supplies, on the operator panel and the rear of the system unit are not blinking, and the operator panel display is blank. Go to “Step 1521-7” on page 94. v The ac power supply power LEDs are on. If equipped with dc power, the dc power LEDs are blinking. Power LEDs located on the operator panel and the rear of the system unit are blinking, and OK is displayed on the operator panel. When the power button, located on the operator panel, is pressed: – If equipped with dc power, the dc power LEDs, located on the power supplies, do not come on. – The power LED (for either ac or dc powered systems), located on the operator panel, does not come on. – None of the cooling fans start to turn. Go to “Step 1521-7” on page 94. v The ac power supply power LEDs are on. If equipped with dc power, the dc power supply power LEDs are blinking. The power LED, located on the operator panel, is blinking and OK is displayed on the operator panel. When the power button on the operator panel is pressed: – If equipped with dc power, the dc power LEDs on the power supplies come on. – The power LED (for either ac or dc powered systems), on the operator panel, comes on and all the fans start to turn. However, the dc power LEDs located on the power supplies, and the power LED located on the operator panel and the cooling fans do not stay on. Go to “Step 1521-7” on page 94. v An SRN referenced in the Diagnostic Information for Multiple Bus Systems is listed in MAP 1521 in the ²Actions/Descriptions² column. Go to “Step 1521-11” on page 96. Chapter 3. Maintenance Analysis Procedures (MAPs) 91 Step 1521-2 Is the green power light on the power distribution panel on? NO Go to “Step 1521-3”. YES Go to “Step 1521-5” on page 93. Step 1521-3 Do the following: 1. Unplug the main power cable from the wall outlet. 2. If you are servicing a drawer unit that is mounted in a rack, unplug the main power cable from the power distribution panel. 3. Check that the power cable has continuity. 4. Check that the wall outlet has been wired correctly with the correct voltage. Did you find a problem? NO Go to “Step 1521-4”. YES Correct the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1521-4 If you are servicing a drawer unit that is mounted in a rack, do the following: 1. Unplug the the system unit power cables from the power distribution panel. 2. Plug in the main power cable into the power distribution panel. 3. Plug in the main power cable from the wall outlet. Is the green power light on the power distribution panel on? NO Replace the rack-mounted power distribution panel. YES Go to “Step 1521-5” on page 93. 92 Service Guide Step 1521-5 Turn off the power and do the following: For 286/C4: 1. Unplug the the system unit power cables from the power distribution panel. 2. Unplug the power cables from the system unit. 3. Check that the power cables have continuity. 4. Check that the power distribution panel outlets have the correct voltage. For 286/E4: 1. Unplug the power cables from the wall outlet. 2. Unplug the power cables from the system unit. 3. Check that the power cables have continuity. 4. Check that the wall outlet has been wired correctly with the correct voltage. Did you find a problem? NO Go to “Step 1521-6”. YES Correct the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1521-6 1. One of the following may be defective: v Power supplies v CEC backplane v Memory DIMMs v Processor cards 2. To test each FRU, exchange the FRUs that have not already been exchanged in the following order. v Primary power supply v Redundant power supply (if installed) v CEC backplane v Memory DIMMs v Processor card(s) 3. Turn off the power. 4. Unplug the power cables from the system unit. 5. Exchange one of the FRUs in the list. 6. Connect the power cables to the system unit. Do the ac power LEDs on the power supplies come on within 30 seconds after applying ac power? NO Reinstall the original FRU. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures (MAPs) 93 Step 1521-7 1. Unplug the power cables from the system unit. 2. Exchange the operator panel assembly. 3. Plug the power cables into the system unit and wait for the OK prompt located on the operator panel display. 4. Turn on the power. Does the power LED on the operator panel come on and stay on? NO Reinstall the original operator panel assembly, go to “Step 1521-8”. YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1521-8 1. Turn off the power. 2. Unplug the power cables from the system unit. 3. Record the slot numbers of all the adapters. Label and record the location of any cables attached to the adapters. Disconnect any cables attached to the adapters and remove all the adapters 4. Remove all but one quad of memory DIMMs from the first processor card. 5. Remove processor card #2 (if installed). 6. Unplug the signal and power cables from the DASD backplane. 7. Unplug the disk drives from the DASD backplane. 8. Unplug the signal and power cables from any devices in the media bays. 9. Remove all the fans. 10. Plug the power cables into the system unit. 11. Turn on the power. Does the power LED on the operator panel come on and stay on? NO Go to “Step 1521-9”. YES Go to “Step 1521-10” on page 95. Step 1521-9 Note: Either the processor card, PCI riser card, CEC backplane, or the power supplies may be defective. To test each FRU, exchange the FRUs that have not already been exchanged in the following order: 1. Primary power supply 2. Redundant power supply (if installed) 3. CEC backplane. 4. Processor card 5. Memory 6. PCI riser card 94 Service Guide Do the following as you exchange each of the above FRUs in the order listed: 1. Turn off the power. 2. Unplug the power cable from the system unit. 3. Exchange one of the FRUs in the list. 4. Connect the power cable to the system unit. 5. Turn on the power. Does the power LED on the operator panel come on and stay on? NO Reinstall the original FRU. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1521-10 One of the parts that was removed or unplugged is causing the problem. Install or connect the parts in the following order: 1. Memory DIMMs on processor card #1 2. Processor card #2 (if removed) 3. DASD backplane 4. Disk drives 5. Media bay devices 6. PCI adapters, lowest slot to highest slot 7. Fans Turn on the power after each part is installed or connected. If the system does not power on or the power LED located on the operator panel does not stay on, the most recently installed or connected part is causing the failure. Do the following as you exchange each of the above FRUs in the order listed. 1. Turn off the power. 2. Unplug the power cable from the system unit. 3. Install or connect one of the parts in the list. 4. Plug the power cable into the system unit. 5. Turn on the power. Does the power LED on the operator panel come on and stay on? NO Replace the last part installed. v If memory DIMMs on the first processor card were the last parts installed, exchange the memory DIMMs one at a time to isolate the failing module. If all memory DIMMs have been exchanged and the problem is not resolved, replace the processor card. v If the second processor card was just installed, exchange the memory DIMMs one at a time to isolate a possible failing module. If all modules have been exchanged and the problem is not, resolved replace the processor card. Repeat these steps until all the parts have been installed. Go to “Step 1521-11” on page 96. YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures (MAPs) 95 Step 1521-11 Does the system contain two power supplies? NO Go to “Step 1521-12”. YES Go to “Step 1521-14” on page 97. Step 1521-12 Shut down the system and remove all power cables from the rear of the system unit. Exchange the following FRUs in the order listed: 1. Power supply 2. CEC backplane 3. PCI riser card Restart the system and perform error log analysis. Do you get an SRN indicating a voltage sensor is out of range? NO The last FRU exchanged is defective. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Reinstall the original FRU. Repeat the FRU replacement steps until a defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all FRUs have been exchanged, go to “Step 1521-13”. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. Step 1521-13 Check that the wall outlet is properly wired and is providing the correct voltage. Did you find a problem? NO Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. YES Correct the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. 96 Service Guide Step 1521-14 Because the 286/C4 and 286/E4 accommodate redundant power supplies, it is not necessary to power off the system when replacing a power supply. The power supplies are symmetrical, so replacement starts with the unit closest to the system wall of the system drawer (power supply 1). For instructions on replacing a power supply, refer to “Power Supplies” on page 436. Notes: 1. Always service first the power supply whose green LED, located on the front of the power supplies, is out. 2. Before removing a power supply, be sure the redundant power supply is operational by observing the green LED. The green LED must be on solid. Replace the following FRUs in order: 1. Power supply #1 (primary) 2. Power supply #2 (redundant) Perform error log analysis. Do you receive an SRN indicating a voltage sensor out of range? NO The last FRU exchanged is defective. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Reinstall the original FRU. Repeat the FRU replacement steps until a defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all FRUs have been exchanged, go to “Step 1521-15”. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. Step 1521-15 The problem is within the system unit hardware or with the line voltage or wiring. Shut down the system and remove the power cable from the system unit. Exchange the following FRUs in the order listed. 1. PCI riser card 2. CEC backplane Restart the system and perform error log analysis. Do you get an SRN indicating a voltage sensor is out of range? NO The last FRU exchanged is defective. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Reinstall the original FRU. Repeat the FRU replacement steps until a defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all FRUs have been exchanged, go to “Step 1521-13” on page 96. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. Chapter 3. Maintenance Analysis Procedures (MAPs) 97 Map 1522: Cannot Power On the I/O Drawer, Power Present LED Does Not Come On Step 1522-1 Attempt to power on the I/O drawer (see “Starting the System without an HMC Attached” on page 360) Does the I/O drawer power on? NO Go to “Step 1522-2”. YES This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1522-2 Examine the power supply LEDs at the rear of the I/O drawer. Are the green AC Good LEDs of both power supplies of the I/O drawer on solid? NO Go to “Step 1522-8” on page 99. YES Go to “Step 1522-3”. Step 1522-3 Is the amber fault LED of any power supply of the I/O drawer on solid? NO Go to “Step 1522-4”. YES Go to “Step 1522-21” on page 102. Step 1522-4 Is the amber fault LED of any power supply of the I/O drawer blinking? NO Go to “Step 1522-5”. YES Go to “Step 1522-23” on page 103. Step 1522-5 Is the green DC good LED of any power supply of the I/O drawer off? NO Go to “Step 1522- 25” on page 103. YES Go to “Step 1522-6”. Step 1522-6 Are the green DC good LEDs of both power supplies of the I/O drawer off? NO Go to “Step 1522-7” on page 99. YES Go to “Step 1522- 25” on page 103. 98 Service Guide Step 1522-7 Replace the items in the following list one at a time. Turn the power off before removing the item, and turn the power back on after replacing the item to determine if the drawer will power on. 1. Replace the power supply with the green DC good LED off. 2. Replace the other power supply with the one just removed from above. 3. Replace the I/O drawer backplane. Does the I/O drawer power on? NO Have all the items in the list above been replaced? NO Go to beginning of this step and replace the next item. YES Call service support. YES The item just replaced was defective. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1522-8 Is this machine connected to dc power? NO Go to “Step 1522-12” on page 100. YES Go to “Step 1522-9”. Step 1522-9 Are all circuit breakers reset at the rack power distribution panel? NO Reset the breaker switch by first turning off, then turning on. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-10”. Step 1522-10 Are all the dc power cables seated firmly and connected correctly from the rack power distribution panel to the rack drawers? NO Fix the connection. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-11” on page 100. Chapter 3. Maintenance Analysis Procedures (MAPs) 99 Step 1522-11 Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this procedure. Perform the following: 1. Disconnect the dc power cable from each power supply of the I/O drawer. 2. Use a multimeter to measure the dc voltage at the drawer end of each dc power cable. Is the dc voltage from -40 V dc to -60 V dc? NO There is no power to the drawer. Refer to the rack documentation to isolate the dc power problem. The 14T/0 and 14T/4 Installation and Service Guide is appropriate for most systems. This ends the procedure. YES Go to “Step 1522-17” on page 101. Step 1522-12 Are all circuit breakers reset at the rack power distribution unit(s)? NO Reset the breaker switch. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-13”. Step 1522-13 Are all the ac power cables seated firmly and connected correctly from the rack power distribution unit(s) to the rack drawers? NO Fix the connection. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-14”. Step 1522-14 Follow the ac power cable(s) from the I/O drawer to the appropriate power distribution unit. Is the power distribution unit power LED on? NO Go to “Step 1522-18” on page 102. YES Go to “Step 1522-15” on page 101. 100 Service Guide Step 1522-15 Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this procedure. Perform the following: 1. Disconnect the ac power cable(s) from the power distribution unit that supplies power to the I/O drawer. 2. Use a multimeter to measure the ac voltage at the power distribution unit socket(s) where the ac power cable(s), just removed, was (were) attached. Is the ac voltage from 200 V ac to 240 V ac? NO Replace the power distribution unit This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-16”. Step 1522-16 Reconnect the ac power cable(s) to the power distribution unit. Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this procedure. Perform the following: 1. Disconnect the ac power cable from each power supply of the I/O drawer. 2. Use a multimeter to measure the ac voltage at the drawer end of each ac power cable. Is the ac voltage from 200 V ac to 240 V ac? NO Replace the ac power cable(s). This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-17”. Step 1522-17 Replace the following in the I/O drawer, one at a time, disconnecting the ac power cable (dc power cable for a dc-powered system) to each power supply before replacement and reconnecting the ac power cable (dc power cable for a dc-powered system) to each supply after replacement: v I/O power supply 1. v I/O power supply 2. v I/O drawer backplane Does the operator panel show the OK prompt? NO Contact your next level of support for assistance. YES This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures (MAPs) 101 Step 1522-18 Is the ac power cable from the rack power distribution unit to the customer’s ac power outlet connected and seated correctly at both ends? NO Connect the ac power cable correctly at both ends. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-19”. Step 1522-19 Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this procedure. Perform the following: 1. Disconnect the mainline ac power cable from the power distribution unit that supplies power to the I/O drawer. 2. Use a multimeter to measure the ac voltage at the power distribution unit end of the mainline ac power cable. Is the ac voltage from 200 V ac to 240 V ac? NO Go to “Step 1522-20”. YES Replace the power distribution unit. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1522-20 Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this procedure. Perform the following: 1. Disconnect the mainline ac power cable from the customer’s ac power outlet. 2. Use a multimeter to measure the ac voltage at the customer’s ac power outlet. Is the ac voltage from 200 V ac to 240 V ac? NO Inform the customer that the ac voltage at the power outlet is not correct. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Replace the mainline ac power cable. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1522-21 Are the amber fault LEDs of both power supplies of the I/O drawer on solid? NO Go to “Step 1522-22” on page 103. YES Go to “Step 1522- 25” on page 103. 102 Service Guide Step 1522-22 Replace the items in the following list one at a time. Turn the power off before removing the item, and turn the power back on after replacing the item to determine if the drawer will power on. 1. Replace the power supply with the amber fault LED on solid. 2. Replace the other power supply with the one just removed. 3. Replace the I/O drawer backplane. Does the I/O drawer power on? NO Have all the items in the list above been replaced? NO Go to “Step 1522-22” and replace the next item. YES Call service support. YES The item just replaced was defective. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1522-23 Are the amber fault LEDs of both power supplies of the I/O drawer blinking? NO Go to “Step 1522-24”. YES Go to “Step 1522- 25”. Step 1522-24 Perform the action items in the following list one at a time. Turn the power off before removing the item, and turn the power back on after replacing the item to determine if the drawer will power on. 1. Check all SPCN external cabling. 2. Replace the power supply with the amber fault LED blinking. 3. Replace the other power supply with the one just removed. 4. Replace the I/O drawer backplane. Does the I/O drawer power on? NO Have all the items in the list above been replaced? NO Go to “Step 1522-24”, and replace the next item. YES Call service support. YES The item just replaced was defective. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1522- 25 Disconnect the ac power cable (dc power cable for a dc-powered system) to each power supply of the I/O drawer and remove the following components from the I/O drawer, one at a time, in the sequence listed: 1. RIO adapter card 2. PCI adapter card(s) Reconnect the ac power cables (dc power cable for a dc-powered system) to each power supply of the I/O drawer and attempt to power on the I/O drawer. Does the I/O drawer power-on? NO Go to “Step 1522-28” on page 104. YES Go to “Step 1522-26” on page 104. Chapter 3. Maintenance Analysis Procedures (MAPs) 103 Step 1522-26 Reinsert the following components in the I/O drawer, one card at a time, disconnecting the ac power cable (dc power cable in a dc-powered system) to each power supply before reinsertion and reconnecting the ac power cable (dc power cable for a dc-powered system) to each power supply after reinsertion: 1. RIO adapter card 2. PCI adapter card(s) Does the I/O drawer power-on? NO Replace the defective component just reinserted. Reinsert any remaining components. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1522-27”. Step 1522-27 Have all components in step “Step 1522-26” been reinserted? NO Go to “Step 1522-26” to reinsert the next component. YES The symptom has changed. Call your next level of support for assistance. This ends the procedure. Step 1522-28 Replace each I/O drawer, one at a time, disconnecting the power cable to each power supply before replacement and reconnecting the power cable to each power supply after replacement: Does the I/O drawer power on? NO Go to “Step 1522-29”. YES Reinsert the components removed in “Step 1522- 25” on page 103. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1522-29 Have all components in the previous step been replaced? NO Go to “Step 1522-28” and replace the next component. YES Call your next level of support for assistance. This ends the procedure. 104 Service Guide Map 1523: The SPCN Error Codes Table Directed You Here and the Error Code is 1011 1B0x A base system power supply is reporting a load fault. Note: Read the danger and caution notices under “Safety Notices” on page xi before continuing with this procedure. Perform the following steps to isolate the problem. Step 1523-1 1. Turn off the power. 2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the base system drawer. 3. At the top of the base system drawer, remove the following cards, in the sequence listed: a. Processor card(s) b. RIO adapter card(s) c. PCI adapter card(s) d. DASD device e. Media bay device(s) 4. Reconnect the ac power cables (dc power cables in a dc-powered system) to the base system drawer. 5. Turn on the power. Is the error code 1011 1B0x displayed? NO Go to “Step 1523-7” on page 107. YES Go to “Step 1523-2”. Step 1523-2 1. Turn off the power. 2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the base system drawer. 3. Remove base system power supply 2. 4. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power supply 1. 5. Turn on the power. Is the error code 1011 1B0x displayed? NO Go to “Step 1523-3” on page 106. YES Go to “Step 1523-4” on page 106 Chapter 3. Maintenance Analysis Procedures (MAPs) 105 Step 1523-3 1. Turn off the power. 2. Reinstall base system power supply 2 that was removed in “Step 1523-2” on page 105. 3. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power supply 2. 4. Turn on the power. Is the error code 1011 1B0x displayed? NO The symptom has changed. This ends the procedure. YES Replace base system power supply 2, location: U0.1-V2. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1523-4 1. Turn off the power. 2. Reinstall base system power supply 2 that was removed in “Step 1523-2” on page 105. 3. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power supply 2. 4. Disconnect the ac cable (dc power cable in a dc-powered system) to base system power supply 1. 5. Remove base system power supply 1. 6. Turn on the power. Is the error code 1011 1B0x displayed? NO Replace base system power supply 1, location: U0.1-V1. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1523-5”. Step 1523-5 1. Turn off the power. 2. Reinstall base system power supply 2 that was removed in “Step 1523-4”. 3. Reconnect the ac power cable (dc power cable in a dc-powered system) to the base system power supply 2. 4. Remove base system fan 3. Disregard a base system fan error code if it occurs during this step. 5. Turn on the power. Is the error code 1011 1B0x displayed? NO Replace base system fan 3, location: U0.1-F3.This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1523-6” on page 107. 106 Service Guide Step 1523-6 1. Turn off the power. 2. Reinstall base system fan 3 that was removed in “Step 1523-5” on page 106. 3. Remove base system fan 4. Disregard a base system fan error code if it occurs during this step. 4. Turn on the power. Is the error code 1011 1B0x displayed? NO Replace base system fan 4, location: U0.1-F4. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Replace the base system backplane, location: U0.1-P1. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1523-7 1. Turn off the power. 2. Disconnect the ac power cables (dc power cables in a dc-powered system) from the base system drawer. 3. Reinstall one of the cards that was removed in “Step 1523-1” on page 105, in the sequence listed: a. Processor cards b. RIO adapter card(s) c. PCI adapter card(s) d. DASD device(s) e. Media bay device(s) 4. Reconnect the ac power cables (dc power cables in a dc-powered system) to the base system drawer. 5. Turn on the power. Is the error code 1011 1B0x displayed? NO Go to “Step 1523-8”. YES The last card you installed is defective. Replace the last card you installed. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1523-8 Have all the cards that were removed in “Step 1523-7” been reinstalled? NO Go to “Step 1523-7” and reinstall the next card in the list. YES Replace the base system backplane, location: U0.1-P1. This ends the procedure. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Chapter 3. Maintenance Analysis Procedures (MAPs) 107 MAP 1540: Problem Isolation Procedures These procedures are used to locate problems in the processor subsystem, I/O subsystem(s) or rack. If a problem is detected, these procedures help you isolate the problem to a failing unit. Find the symptom in the following table; then follow the instructions given in the Action column. FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to the following: v Rack identify LED v CEC drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – CEC subsystem FRUs – I/O subsystem FRUs v I/O adapter identify LED v DASD identify LED The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to locate the failing FRU more easily. Any identify LED in the system can be flashed when the system is in the failed state with power on by using the service processor LED Control Menu contained in the “System Information Menu” on page 317. Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is outlined in the Diagnostic Information for Multiple Bus Systems. If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of following: v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED, type FRU-LED-MENU on the command line. v If the system is logically partitioned, the HMC must be attached. For instructions on activating and deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management Console Installation and Operations Guide, order number SA38-0590. 108 Service Guide This system is configured with an arrangement of LEDs that help identify various components of the system. These include but are not limited to: Problem Isolation Procedures Symptom/Reference Code/Checkpoint Action You were sent here by the Diagnostic Information for Multiple Bus Systems book. Go to “MAP 1542: I/O Problem Isolation” on page 111. 406x 00B7 Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 406x 0EB1, 406x 0EB2, 406x 0EB3, 406x 0EB4, 406x 0EB5, 406x 0EB6, 406x 0EB7, 406x 0EB8, 406x 0EB9, 406x 0EBA, 406x 0EBB, 406x 0EBC Go to “MAP 1549: Attention Problem Isolation” on page 137. 406x 0EB0 Go to “MAP 1541: JTAG Problem Isolation” on page 110. 1011 8400, 1011 8401, 1011 8409, 1011 840A, 1011 840B, 406x 0CA5, B1xx 4643, B1xx 4644, B1xx 4645, B1xx 4646, B1xx 4648 Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E101, E102, E10A, E10B, E111, E120, E121, E122, E130, E131, E132, E133, E134, E135, E138, E139, E13A, E149, E14C, E191, E19A, E19B, E19D, E1A0, E1A1, E1A2, E1A3, E1A4, E1A5, E1A6, E1A7, E1A8, E1A9, E1AA, E1AB, E1AC, E1AD, E1AE, E1AF, E1B1, E1C4, E1C5, E1C6, E1D0, E1D3, E1D4, E1DB, E1F0, E1F1, E1F4, E1F5, E1F6, E1F8, E1F9, E1FA, E1FB, E1FD, E1FE, E5E0, E5E2, E5E3, E5E4, E5E5, E5E6, E5E7, E5E8, E5E9, E5EE, E5EF, B1xx 4601, B1xx 4602, B1xx 4650, B1xx 469E, B1xx 469F B1xx 4660, B1xx 4661, B1xx 4662 Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E500, E501, E502, E503, E504, E505, E506, E507, E508, E509, E50A, E50B, E50C, E50D, E50E, E50F, E51F, E520, E521, E522, E523, E524, E525, E526, E527, E52F, E530, E531, E532, E533, E534, E535, E536, E537, E538, E539, E53A, E53B, E53C, E540, E541, E560, E56F, E57F, E580, E581, E582, E583, E584, E585, E586, E587, E588, E589, E58A, E58B, E58C, E58D, E58E, E58F E590, E591, E592, E593, E594, E595, E59E, E708, E70C, E70F, E7FF, E852, E861, E871 Go to “MAP 1542: I/O Problem Isolation” on page 111. B1xx 4634 Go to “MAP 154C: SPCN Cable Problem Isolation” on page 139. Chapter 3. Maintenance Analysis Procedures (MAPs) 109 MAP 1541: JTAG Problem Isolation Step 1541-1 Record the error code and location code(s) that sent you to this MAP. Step 1541-2 Turn off the power. Step 1541-3 Replace the following, if present, one at a time and in the order listed: v First location code item recorded, if any, “Step 1541-1”. v Second location code item recorded, if any, in “Step 1541-1”. v Third location code item recorded, if any, in “Step 1541-1”. v Fourth location code item recorded, if any, in “Step 1541-1”. v Replace the CEC backplane at U0.1-P1. v Processor card at location U0.1-P1-C1 if not recorded in “Step 1541-1”. v All memory DIMMS on processor card 1 at location U0.1-P1-C1-Mn (n equals 1 to 8), if not recorded in “Step 1541-1”. v Processor card at location U0.1-P1-C2 if not recorded in “Step 1541-1”. v All memory DIMMS on processor card 2 at location U0.1-P1-C2-Mn (n equals 1 to 8), if not recorded in “Step 1541-1”. v PCI riser card at U0.1-P2. Step 1541-4 Turn on the power. Step 1541-5 Did the system stop with the same error code as recorded in “Step 1541-1”? Yes Go to “Step 1541-6”. No The part just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1541-6 Have all the cards listed in “Step 1541-3” been replaced? Yes Go to “Step 1541-7”. No Go to “Step 1541-2”. Step 1541-7 Turn off the power. Call for support. 110 Service Guide MAP 1542: I/O Problem Isolation This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. This I/O problem-determination MAP isolates I/O card and I/O subsystem failures. When I/O problem isolation is complete, all cables and cards exhibiting a failure will have been replaced or reseated. Notes: 1. This MAP assumes that a CD-ROM drive is installed in the base system and connected to an adapter in the base system, and that a Diagnostics CD-ROM is available. 2. If a general-access password or privileged-access password is installed, you are prompted to enter the password before the diagnostic CD-ROM can load. 3. The term POST indicators refers to the device mnemonics (words memory, keyboard, network, SCSI, and speaker) that appear on the system console during the power-on self-test (POST). 4. The service processor may have recorded one or more symptoms in its error log. It is a good idea to examine that error log before proceeding (see “System Information Menu” on page 317). 5. The service processor may have been set by the user to monitor system operations and to attempt recoveries. You can disable these actions while you diagnose and service the system. If you disable them, make notes of their current settings so that you can restore them easily. The following table lists the settings: Surveillance From the service processor Setup Menu, go to the Surveillance Setup Menu and disable surveillance. (Operating System Surveillance is disabled in LPAR mode.) Unattended Start From the service processor System Power Control Menu, disable unattended start mode. Reboot Policy From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: a. Number of reboot attempts to 0 (zero) b. Use OS-Defined restart policy to No c. Enable supplemental restart policy to No. Call Out From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. The steps in this procedure will attempt to slow boot the system into service mode diagnostics. The following procedure describes booting a nonpartitioned system with an attached 3151 console. The procedure for booting a partitioned system with an attached HMC is slightly different and is described in “Standalone Diagnostics Operations” on page 301. “Standalone Diagnostics Operation with an HMC Attached to the System” describes the procedure for booting online disgnostics (6 key between the keyboard and speaker indicators) and the following procedure requires booting standalone diagnostics (5 key between the keyboard and speaker indicators). Therefore, on a partitioned system, make sure the partition you selected to boot in the following procedure has the CD-ROM allocated to it, as described in Chapter 6, “Using the Online and Standalone Diagnostics”, on page 297. Chapter 3. Maintenance Analysis Procedures (MAPs) 111 Step 1542-1 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 3. Select slow boot mode (select Disable Fast Boot) on the System Power Control menu from the service processor main menu. 4. Turn on the power. 5. Insert the diagnostic CD-ROM into the CD-ROM drive. Does the CD-ROM appear to operate correctly? NO Go to “Boot Problems/Concerns” on page 173. YES Go to “Step 1542-2”. Step 1542-2 1. When the keyboard indicator is displayed (the word keyboard), press the 5 key on the firmware console. 2. Enter the appropriate password when you are prompted to do so. Is the Please define system console screen displayed? NO Go to “Step 1542-3”. YES Go to “Step 1542-4” on page 113. Step 1542-3 The system is unable to boot standalone diagnostics. Check the service processor error log and the operator panel for additional error codes resulting from the slow boot in “Step 1542-1”. Did the slow boot generate a different error code from the one that originally sent you to MAP 1542? NO It appears you have a processor subsystem problem. Call for support. This ends the procedure. YES Restore fast-boot mode (select Enable Fast Boot) on the System Power Control menu from the service processor main menu. Go to “Checkpoint and Error Code Index” on page 181 and follow the actions for the new error code. 112 Service Guide Step 1542-4 The system stopped with the Please define system console prompt appearing on the system console. Standalone diagnostics can be booted. Perform the following: 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. If the terminal type has not been defined, you must use the Initialize Terminal option of the FUNCTION SELECTION menu to initialize the AIX operating system environment before you can continue with the diagnostics. This is a separate and different operation from selecting the firmware console display. 4. Select Advanced Diagnostic Routines. 5. When the DIAGNOSTIC MODE SELECTION menu displays, select System Verification to run diagnostics on all resources. Did running diagnostics produce a different symptom? NO Go to substep 6. YES Go to “Entry MAP” on page 57. Use the new symptom. 6. Record any devices missing from the list of all adapters and devices. Continue with this MAP. When you have fixed the problem, use this record to verify that all devices appear when you run System Verification. Are there any devices missing from the list of all adapters and devices? NO Reinstall all remaining adapters, if any, and reconnect all devices. Return the system to its original configuration. Be sure to select fast-boot mode (select Enable Fast Boot) on the System Power Control menu from the service processor main menu. Go to ²Map 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES The boot attempts that follow will attempt to isolate any remaining I/O drawer problems with missing devices. Ignore any codes that may display on the operator panel unless stated otherwise. Go to “Step 1542-5”. Step 1542-5 Examine the base system RIO port 0 of RIO connector card 1 (U0.1-P1-H1/Q1). Are there any I/O subsystems attached to the base system? NO Go to “Step 1542-18” on page 119. YES Go to “Step 1542-6”. Step 1542-6 There may be missing devices associated with one or more I/O subsystems. Note: Before continuing, check the cabling from the base system to the I/O subsystem(s) to ensure that the system is cabled correctly. Refer to “Subsystem Positioning and Cabling” on page 39 for valid configurations. Record the current cabling configuration and then continue with the following steps: 1. Turn off the power. 2. At the base system, disconnect the cable connection at RIO-G port 0 (U0.1-P1/Q2). 3. At the other end of the RIO-G cable referred to in substep 2, disconnect the I/O subsystem port connector 0 (U0.x-P1/Q2). The RIO cable that was connected to RIO-G port 0 in the base system should now be loose and can be removed. Record the location of this I/O subsystem. Chapter 3. Maintenance Analysis Procedures (MAPs) 113 4. Examine the connection at the I/O port connector 1 (U0.x-P1/Qu) of the I/O subsytem recorded in substep 3 on page 113. If the RIO cable attached to the I/O port connector 1 of the I/O subsytem connects to the I/O port connector 0 of another I/O subsystem, record that the I/O subsystem identified in substep 3 on page 113 is ²looped one² and record the location of the next I/O subsystem that is connected to I/O port connector 1 of the ²looped one² I/O subsystem, then go to substep 9. 5. If the RIO cable attached to the I/O port connector 1 (U0.x-P1.1/Q2) of the I/O subsytem recorded in substep 3 on page 113 connects to RIO-G port 1 (U0.1-P1/Q3) of the base system, record that the I/O subsystem is ²not looped². 6. At the base system, disconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3) and reconnect it to RIO-G port 0 (U0.1-P1/Q2). 7. At the I/O subsystem recorded in substep 3 on page 113, disconnect the I/O port connector 1 (U0.x-P1/Qu) and reconnect to the I/O port connector 0 (U0.x-P1/Q2). 8. Verify that a single RIO cable connects base system RIO-G port 0 (U0.1-P1/Q2) to the I/O subsystem recorded in substep 4 port connector 0 (U0.x-P1/Q2). Go to substep 13. 9. Record that the next I/O subsystem. This will be the I/O subsystem that is connected to I/O port connector 1 of the ²looped one² I/O subsystem. 10. At the base system, disconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3) and reconnect it to RIO-G port 0 (U0.1-P1/Q2). 11. At the ²looped two² I/O subsystem recorded in substep 14, disconnect the I/O port connector 1 (U0.x-P1/Qu) and reconnect to the I/O port connector 0 (U0.x-P1/Q2) of the ²looped one² I/O subsystem identified in substep 4. 12. Verify that a single RIO cable connects base system RIO-G port 0 (U0.1-P1/Q2) to the I/O subsystem recorded in substep 3 on page 113 port connector 0 (U0.x-P1/Qu). Go to substep 13. 13. If an I/O subsystem was ²looped two², verify that the ²looped one² I/O subsystem I/O port connector 1 (U0.x-P1/Qu) is connected to the ²looped two² I/O subsystem I/O port connector 0 (U0.x-P1/Q2). 14. At this point, a chain of one or two I/O subsystems is attached to RIO-G port 0 (U0.1-P1/Q2) of the base system. 15. Turn on the power to boot standalone diagnostics from CD. 16. If the Please define the System Console screen is displayed, follow directions to select the system console. 17. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, if necessary). 18. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-7” on page 115. YES The RIO cable that was removed in substep 3 on page 113 is defective. Replace the RIO cable. If two I/O subsystems are chained to RIO-G port 0 (U0.1-P1/Q2) of the base system, connect the new RIO cable from the ²looped two² I/O subsystem I/O port connector 1 (U0.x-P1/Q1) to base system RIO-G port 1 (U0.1-P1/Q1). If the I/O subsystem was ²not looped² connect the new RIO cable from the I/O subsystem recorded in substep 4 I/O port connector 1 (P1/Q1) to base system RIO-G port 1 (U0.1-P1/Q3). Reconfigure the system to its original condition. Go to “Map 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 114 Service Guide Step 1542-7 The 286/C4 and 286/E4 server may have up to two I/O subsystems attached to its RIO ports. The following steps reduce the system I/O to only the I/O of the base system. 1. Turn off the system power. 2. At the base system, disconnect the cable connection at RIO-G port 0 (U0.1-P1/Q2). Record all connections. 3. At the base system, disconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3). Record all connections. 4. Disconnect the power cables from the I/O subsystem(s) that were attached to the base system. All I/O subsystems should now be physically disconnected from the base system. Go to “Step 1542-8”. Step 1542-8 The base system is running from the integrated I/O or from I/O attached to an adapter installed into one of the integrated PCI slots. 1. Turn on the power to boot standalone diagnostics from CD. 2. If the Please define the System Console screen is displayed, follow directions to select the system console. 3. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 4. Check that all attached devices and adapters are listed. If the Please define the System Console screen did not display or all attached devices and adapters are not listed, the problem is in the base system. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-18” on page 119. YES Go to “Step 1542-17” on page 119. Step 1542-9 For any I/O subsystem attached to the RIO-G ports ot the base system: If the I/O subsystem was recorded as ²looped one², are there any adapters in slots 1, 2, 3, 4, 5, 6, or 7 (location codes U0.x-P1-I1 through I7)? NO Go to “Step 1542-10”. YES Go to “Step 1542-11” on page 116. Step 1542-10 For any I/O subsystem attached to the RIO-G ports of the base: If the I/O subsystem was recorded as ²looped two², are there any adapters in slots 1, 2, 3, 4, 5, 6, or 7 (location codes U0.x-P1-I1 through I7)? NO Go to “Step 1542-15” on page 118. YES Go to “Step 1542-12” on page 116. Chapter 3. Maintenance Analysis Procedures (MAPs) 115 Step 1542-11 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 1, 2, 3, 4, 5, 6, and 7 (location codes Ux.y-P1-I1 through I7) from the I/O drawer. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays, enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-10” on page 115. YES Go to “Step 1542-13” on page 117. Step 1542-12 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 1, 2, 3, 4, 5, 6, and 7 (location codes Ux.y-P1-I1 through I7) from the I/O drawer. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary) 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-15” on page 118. YES Go to “Step 1542-13” on page 117. 116 Service Guide Step 1542-13 If the Please define the System Console screen does display and all attached devices and adapters are listed, the problem is with one of the adapter cards or devices that was removed or disconnected from the I/O subystem. 1. Turn off the power. 2. Reinstall one adapter or device that was removed. Use the original adapter cards in their original slots when reinstalling adapters. 3. Turn on the power to boot standalone diagnostics from CD. 4. If the Please define the System Console screen is displayed, follow directions to select the system console. 5. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 6. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-14”. YES Reinstall the next adapter or device and return to the beginning of this step. Repeat this process until an adapter or device causes the Please define the System Console screen to not display or all attached devices and adapters to not be listed. After installing all of the adapters and the Please define the System Console screen does display and all attached devices and adapters are listed, return the system to its original configuration. Go to “Map 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. Step 1542-14 Replace the adapter you just installed with a new adapter and retry the boot to standalone diagnostics from CD. 1. If the Please define the System Console screen is displayed, follow directions to select the system console. 2. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 3. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-25” on page 122. YES The adapter was defective. Go to “Step 1542-16” on page 118. Chapter 3. Maintenance Analysis Procedures (MAPs) 117 Step 1542-15 1. Turn off the power. 2. Disconnect the I/O subsystem power cables. 3. Replace the following parts, one at a time, in the sequence listed: a. I/O subsystem backplane of any ²looped one² I/O subsystem attached to the base system. b. I/O subsystem backplane of any ²looped two² I/O subsystem attached to the base system. 4. Reconnect the I/O subsystem power cables. 5. Turn on the power. 6. Boot standalone diagnostics from CD. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769 if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Replace the next part in the list and return to the beginning of this step. Repeat this process until a part causes the Please define the System Console screen to be displayed and all attached devices and adapters to be listed. If you have replaced all the items listed above and the Please define the System Console screen does not display or all attached devices and adapters are not listed, check any external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to “Step 1542-16”. Step 1542-16 1. Turn off the power. 2. The item just replaced fixed the problem. 3. If a display adapter with keyboard and mouse were used, reinstall the display adapter card, keyboard, and mouse. 4. Reconnect the diskette drive cable to the diskette drive connector on the base system. 5. Reconnect the tape drive (if previously installed) to the internal SCSI bus cable. 6. Plug in all adapters that were previously removed but not reinstalled. 7. Reconnect the I/O subsystem power cables that were previously disconnected. Reconfigure the system to its original condition. Go to “Map 0410: Repair Checkout” in the Diagnostic Information for Multiple Bus Systems. 118 Service Guide Step 1542-17 1. Turn off the power. 2. At the base system, reconnect the cable connection at RIO-G port 0 (U0.1-P1/Q2) recorded in “Step 1542-7” on page 115. 3. At the base system, reconnect the cable connection at RIO-G port 1 (U0.1-P1/Q3) recorded in “Step 1542-7” on page 115. 4. Reconnect the power cables to the I/O subsystems that were found attached to the base system RIO-G ports mentioned in substeps 2 and 3 above. All I/O subsystems that were attached to the base system RIO-G port 0 (U0.1-P1/Q2) and RIO-G port 1 (U0.1-P1/Q3) should now be physically reconnected to the base system. 5. Make sure the I/O subsystem(s) are cabled correctly as shown in “Subsystem Positioning and Cabling” on page 39. 6. Turn on the power to boot standalone diagnostics from CD. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-9” on page 115 to isolate I/O problems attached to the base system RIO-G connectors. YES The symptom has changed. No items were replaced. Go to “Step 1542-16” on page 118. Step 1542-18 Are there any adapters in slots 1, 2, 3, or 4 (location codes U0.1-P1-I1 through I4) in the base system? NO Go to “Step 1542-19”. YES Go to “Step 1542-21” on page 120. Step 1542-19 Replace the CEC backplane, location U0.1-P1 Step 1542-20 1. Boot standalone diagnostics from CD. 2. If the Please define the System Console screen is displayed, follow directions to select the system console. 3. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 4. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-24” on page 121. YES Go to “Step 1542-16” on page 118. Chapter 3. Maintenance Analysis Procedures (MAPs) 119 Step 1542-21 1. If it is not already off, turn off the power. 2. Remove all adapters from slots 1, 2, 3, and 4 (location codes U0.1-P1-I1 through I4) in the base system that are not attached to the boot device. 3. Label and record the location of any cables attached to the adapters. 4. Record the slot number of the adapters. 5. Turn on the power to boot standalone diagnostics from CD. 6. If the TTY screen displays Enter 0 to select this console, press the 0 key on the TTY terminal’s keyboard. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-19” on page 119. YES Go to “Step 1542-22”. Step 1542-22 If the Please define the System Console screen does display and all attached devices are adapters are listed, the problem is with one of the adapter cards or devices that was removed or disconnected from the I/O subsystem. 1. Turn off the power. 2. Reinstall the adapter or device that was removed. Use the original adapter cards in their original slots when reinstalling adapters. 3. Turn on the power to boot standalone diagnostics from CD. 4. If the Please define the System Console screen is displayed, follow directions to select the system console. 5. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 6. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-23” on page 121. YES Reinstall the next adapter or device and return to the beginning of this step. Continue repeating this process until an adapter or device causes the Please define the System Console screen to not display or all attached devices and adapters to not be listed. After installing all of the adapters and the Please define the System Console screen displays and all attached devices and adapters are listed, go to “Step 1542-16” on page 118. 120 Service Guide Step 1542-23 Replace the adapter you just installed with a new adapter and retry the boot to standalone diagnostics from CD. 1. If the Please define the System Console screen is displayed, follow directions to select the system console. 2. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 3. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Go to “Step 1542-19” on page 119. YES Go to “Step 1542-16” on page 118. Step 1542-24 1. Turn off the power. 2. Disconnect the base system power cables. 3. Replace the following parts, one at a time, in the sequence listed: a. Internal SCSI cable. b. DASD backplane. c. Media backplane. 4. Reconnect the base system power cables. 5. Turn on the power. 6. Boot standalone diagnostics from CD. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Replace the next part in the list and return to the beginning of this step. Repeat this process until a part causes the Please define the System Console screen to be displayed and all attached devices and adapters to be listed. If you have replaced all the items listed above and the Please define the System Console screen does not display or all attached devices and adapters are not listed, check any external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to “Step 1542-16” on page 118. Chapter 3. Maintenance Analysis Procedures (MAPs) 121 Step 1542-25 1. Turn off the power. 2. Disconnect the I/O subsystem power cables. 3. Replace the following parts, one at a time, in the sequence listed: a. Attached external device cable. b. Attached external device(s), one at a time, if more than one. c. I/O subsystem backplane (U0.x-P1). 4. Reconnect the I/O subsystem power cables. 5. Turn on the power. 6. Boot standalone diagnostics from CD. 7. If the Please define the System Console screen is displayed, follow directions to select the system console. 8. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the Diagnostic Information for Multiple Bus Systems, order number SA23-2769, if necessary). 9. Check that all attached devices and adapters are listed. Did the Please define the System Console screen display and are all attached devices and adapters listed? NO Replace the next part in the list and return to the beginning of this step. Repeat this process until a part causes the Please define the System Console screen to be displayed and all attached devices and adapters to be listed. If you have replaced all the items listed above and the Please define the System Console screen does not display or all attached devices and adapters are not listed, check any external devices and cabling. If you do not find a problem, contact your next level of support for assistance. YES Go to “Step 1542-16” on page 118. 122 Service Guide MAP 1548: Processor Subsystem Problem Isolation Notes: 1. Be sure to unplug the power cords before removing or installing processor cards, the PCI riser card, or the CEC backplane to avoid damage to it. 2. This MAP assumes that a CD-ROM drive is installed and connected to the integrated EIDE adapter, and a diagnostic CD-ROM disc is available. 3. If a power-on password or privileged-access password is installed, you are prompted to enter the password before the diagnostic CD-ROM can load. 4. The term POST indicators refers to the device mnemonics that appear during the power-on self-test (POST). 5. The support leg located on the bottom of the media support shelf is used to deactivate a microswitch. The microswitch is located on the CEC backplane and when deactivated cuts off power to some of the components and connectors located on the CEC backplane. This support shelf must be correctly seated and secured in position after any new FRU is added to the system and you perform a system boot. 6. The service processor might have recorded one or more symptoms in its error log. v If you arrived here after performing a slow boot, look for a possible new error that occurred during the slow boot. If there is a new error, and its actions call for a FRU replacement, perform those actions. If this does not resolve the problem, go to “Step 1548-1” on page 124. v If an additional slow boot has not been performed, or if the slow boot did not yield a new error code, look at the error that occurred just before the original error. Perform the actions associated with that error. If the problem is not resolved, go to “Step 1548-1” on page 124. v If a slow boot results in the same error code, and there are no error codes before the original error code, go to “Step 1548-1” on page 124. Before proceeding, it is a good idea to examine that error log (see “System Information Menu” on page 317). 7. The service processor might have been set by the user to monitor system operations and to attempt recoveries. You might want to disable these actions while you diagnose and service the system. If you disable them, make notes of their current settings so that you can restore them before you leave. The following settings may be of interest to you. Surveillance From the Service Processor Setup Menu, go to the Surveillance Setup Menu and disable surveillance. Unattended Start Mode From the Service Processor System Power Control Menu, disable unattended start mode. Reboot Policy From the System Power Control Menu, go to the Reboot/Restart Policy Setup Menu and set: a. Number of reboot attempts to 0 (zero) b. Use OS-Defined restart policy to No c. Enable supplemental restart policy to No. Call Out From the Call-In/Call-Out Setup Menu, go to the Serial Port Selection Menu and disable call-out on both serial ports. 8. Verify that the system has not been set to boot to the SMS menus or to the open firmware prompt. From the System Power Control Menu, select Boot Mode Menu. Verify that all four options are set to disabled. Chapter 3. Maintenance Analysis Procedures (MAPs) 123 Purpose of this MAP This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs. The failure is then isolated to the failing FRU. Step 1548-1 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn on the power. 3. Insert the diagnostic CD-ROM into the CD-ROM drive. Note: If you cannot insert the diagnostic CD-ROM, go to “Step 1548-2”. 4. When the keyboard indicator is displayed: v On an ASCII terminal or hardware management console (HMC), the word keyboard, press the number 5 key. v On a graphical display, the word keyboard, press the number 5 key on the directly attached keyboard. 5. If you are prompted to do so, enter the appropriate password. Is the ²Please define the System Console² screen displayed? NO Go to “Step 1548-2”. YES Go to “Step 1548-17” on page 133. Step 1548-2 1. Turn off the power. 2. If you have not already done so, configure the service processor with the instructions in note 7 on page 123. Then return here and continue. 3. Exit the service processor menus and remove the power cords. 4. Disconnect all external cables (parallel, serial port 1, serial port 2, serial port 3, keyboard, mouse, Ethernet, and so on). 5. Remove the service access cover (286/E4) or place the drawer (286/C4) into the service position and remove the service access cover. 6. Record the slot numbers of the PCI adapters. Label and record the locations of any cables attached to the adapters. Disconnect any cables attached to the adapters and remove all the adapters. 7. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay 2. 8. Remove the media drives as described in “Media Removal” on page 401. 9. Remove the media support shelf by loosening the thumbscrew and grasping the back edge of the shelf, sliding it back, then lifting it up and out of the system chassis. Note: The support leg located on the bottom of the media support shelf is used to deactivate a microswitch. The microswitch is located on the CEC backplane and when deactivated, cuts off all power to the CEC backplane. This support shelf must be correctly seated and secured in position after any new FRU is added to the system. 124 Service Guide 10. Remove processor card #2 (if installed). If processor card #2 is removed, ensure processor card #1 is installed and contains at least one quad of memory DIMMs. The processor card that is left in the system must contain at least one quad of memory DIMMs. For example, a system configuration having two processor cards; processor card #2 contains all of the memory DIMMs, processor card #1 contains no memory DIMMs. Remove processor card #1 that contains no memory DIMMs, then remove all but one quad of memory DIMMs from processor card #2. Place processor card #2 into processor slot #1. 11. Record the slot numbers of the memory DIMMs on processor card #1. Remove all memory DIMMs except for one quad from processor card #1. Notes: a. Place the memory DIMM locking tabs in the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads and in the correct connectors. Refer to “Memory DIMM Location” on page 18 for complete information on memory DIMMs. 12. Disconnect the cables attached to the PCI riser card. 13. Disconnect the IDE cable from the IDE connector on the CEC backplane. 14. If your system is equipped with a diskette drive, disconnect the diskette drive cable from the diskette drive connector on the CEC backplane. 15. Disconnect the signal and power connectors from the disk drive backplane. 16. Disconnect the disk drives from the disk drive backplane connectors. 17. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 18. Turn on the power. Does the system stop with code E1F2, E1F3, STBY or 20EE000B, 4BA00830 displayed on the operator panel? NO Go to “Step 1548-6” on page 126. YES Go to “Step 1548-3”. Step 1548-3 Were any more memory DIMMs removed from processor card #1? NO Go to “Step 1548-5” on page 126. YES Go to “Step 1548-4”. Step 1548-4 1. Turn off the power, and remove the power cords. 2. Replug the memory DIMMs that were removed from processor #1 in “Step 1548-2” on page 124 in their original locations. Notes: a. Place the memory DIMM locking tabs into the locked (upright) position to prevent damage to the tabs. b. Memory DIMMs must be installed in quads in the correct connectors. Refer to “Memory DIMM Location” on page 18 for complete information on memory DIMMs. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power. Chapter 3. Maintenance Analysis Procedures (MAPs) 125 Does the system stop with code E1F2, E1F3, STBY, 20EE000B or 4BA00840 displayed in the operator panel? NO A memory DIMM in the quad you just replaced in the system is defective. Turn off the power, remove the power cords, and exchange the memory DIMMs in that quad, one at a time with new or previously removed memory DIMMs. Repeat this step until the defective memory DIMM is identified, or all memory DIMMs have been exchanged. If your symptom did not change and all the memory DIMMs have been exchanged, call your service support person for assistance. If the symptom changed, check for loose cards and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow instructions for the new symptom. YES Go to “Step 1548-5”. Step 1548-5 Was processor card #2 removed from the system? NO Go to “Step 1548-8” on page 127. YES Go to “Step 1548-7” on page 127. Step 1548-6 One of the FRUs remaining in the system unit is defective. Note: If a memory DIMM is exchanged, ensure that the new memory DIMM is the same size and speed as the original memory DIMM. 1. Turn off the power, remove the power cords, and exchange the following FRUs in the order listed: a. Memory DIMMs. Exchange one at a time with new or previously removed memory DIMMs. b. Processor card #1 c. CEC backplane (See the notes on page 57.) d. PCI riser card e. Power supplies. 2. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 3. Turn on the power. Does the system stop with code E1F2, E1F3, STBY or 20EE000B, 4BA00830 displayed on the operator panel? NO Reinstall the original FRU. Repeat the FRU replacement steps until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. 126 Service Guide Step 1548-7 No failure was detected with this configuration. 1. Turn off the power and remove the power cords. 2. Reinstall processor card #2. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power. Does the system stop with code E1F2, E1F3, STBY or 20EE000B, 4BA00830 displayed on the operator panel? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. Memory DIMMs, if present on processor card #2, exchange one at a time with new or previously removed memory DIMMs. 2. Processor card #2 (last one installed) 3. CEC backplane (see notes on page 57) Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call your service support person for assistance. If the symptom changed, check for loose cards and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES Go to “Step 1548-8”. Step 1548-8 1. Turn off the power. 2. Reconnect the system console. Notes: a. If an HMC has been defined as the firmware console, attach the HMC cable to the HMC1 connector on the rear of the system unit. b. If an ASCII terminal has been defined as the system console, attach the ASCII terminal cable to the S1 connector on the rear of the system unit. c. If a display attached to a display adapter has been defined as the system console, install the display adapter and connect the display to the adapter. Plug the keyboard into the keyboard connector on the rear of the system unit. 3. Turn on the power. 4. If the ASCII terminal or graphics display (including display adapter) is connected differently than it was before, the Console Selection screen appears and requires that a new console be selected. 5. When the word keyboard is displayed, press the number 1 key on the directly attached keyboard, an ASCII terminal or HMC. This activates the system management services (SMS). 6. If you are prompted to do so, enter the appropriate password. Chapter 3. Maintenance Analysis Procedures (MAPs) 127 Is the SMS screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. If you are using the hardware management console (HMC) go to the ²Entry MAP² in the Hardware Management Console Maintenance Guide, order number SA23-1273. If you cannot fix the problem using the HMC tests, there may be a problem with the CEC backplane. If you do not find a problem, do the following: v Replace the CEC backplane, location U0.1-P1. (See notes on page 57.) 2. If you are using a graphics display, go to the problem determination procedures for the display. If you do not find a problem, do the following: v Replace the display adapter. v Replace the backplane in which the graphics adapter is plugged. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES Go to “Step 1548-9”. Step 1548-9 1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive. 2. Turn off the power and remove the power cords. 3. Plug the IDE cable into the IDE connector on the CEC backplane. 4. Connect the signal and power connectors to the IDE CD-ROM drive. 5. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 6. Turn on the power. 7. Insert the diagnostic CD-ROM into the CD-ROM drive. 8. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 9. If you are prompted to do so, enter the appropriate password . Is the Please define the System Console screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. IDE cable 2. CD-ROM drive 3. CEC backplane (see notes on page 57) 4. Processor card Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES Go to “Step 1548-10” on page 129. 128 Service Guide Step 1548-10 The system is working correctly with this configuration. One of the SCSI devices that you disconnected may be defective. 1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive. 2. Turn off the power and remove the power cords. 3. Plug the SCSI cable into the SCSI connector on the CEC backplane. 4. Connect the signal and power connectors to one of the SCSI devices (for example: CD-ROM drive, tape drive, disk drive, and so on). Do not connect the signal and power connectors to the disk drive backplane at this time. 5. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 6. Turn on the power. 7. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. If you are prompted to do so, enter the appropriate password. Is the Please define the System Console screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. SCSI cable 2. Last SCSI device connected (for example: disk drive, tape drive) 3. CEC backplane (See notes on page 57.) 4. Processor card Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the return to “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES Repeat this step, adding one SCSI device at a time, until all the SCSI devices that were attached to the integrated SCSI adapter, except the disk drive backplane, are connected and tested. Go to “Step 1548-11”. Step 1548-11 The system is working correctly with this configuration. The disk drive backplane may be defective. 1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive. 2. Turn off the power and remove the power cords. 3. Connect the signal and power connectors to the disk drive backplane. 4. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 5. Turn on the power. 6. If the Console Selection screen is displayed, choose the system console. 7. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. If you are prompted to do so, enter the appropriate password. Chapter 3. Maintenance Analysis Procedures (MAPs) 129 Is the Please define the System Console screen displayed? NO One of the FRUs remaining in the system unit is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. SCSI cable 2. Disk drive backplane Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES Go to “Step 1548-12”. Step 1548-12 The system is working correctly with this configuration. One of the disk drives that you removed from the disk drive may be defective. 1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive. 2. Turn off the power and remove the power cords. 3. Install a disk drive in the disk drive backplane. 4. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 5. Turn on the power. 6. After the word keyboard is displayed, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. If you are prompted to do so, enter the appropriate password. Is the Please define the System Console screen displayed? No Exchange the FRUs that have not been exchanged, in the following order: 1. Last disk drive installed 2. Disk drive backplane Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. Yes Repeat this step with the second disk drive (if it was installed). If the system has a diskette drive, go to “Step 1548-13” on page 131. Go to “Step 1548-14” on page 131. 130 Service Guide Step 1548-13 The system is working correctly with this configuration. The diskette drive may be defective. 1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive. 2. Turn off the power and remove the power cords. 3. Plug the diskette drive cable into the diskette drive connector on the CEC backplane. 4. Connect the signal and power connectors to the diskette drive. 5. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 6. Turn on the power. 7. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. If you are prompted to do so, enter the appropriate password. Is the Please define the System Console screen displayed? NO One of the FRUs remaining in the system is defective. Exchange the FRUs that have not been exchanged, in the following order: 1. Diskette drive 2. Diskette drive cable 3. CEC backplane (see notes on page 57) Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem return, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES Go to “Step 1548-14”. Step 1548-14 The system is working correctly with this configuration. One of the devices that you disconnected from the CEC backplane may be defective. 1. Turn off the power and remove the power cords. 2. Attach a CEC backplane device (for example: parallel, serial port 1, serial port 2, serial port 3, keyboard, mouse, Ethernet, Ultra-2 SCSI) that had been removed. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power. 5. If the Console Selection screen is displayed, choose the system console. 6. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. If you are prompted to do so, enter the appropriate password. Chapter 3. Maintenance Analysis Procedures (MAPs) 131 Is the Please define the System Console screen displayed? NO The last device or cable that you attached is defective. To test each FRU, exchange the FRUs in the following order: 1. Device and cable (last one attached) 2. CEC backplane (see notes on page 57). If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Repeat this step until all of the devices are attached. Go to “Step 1548-15”. Step 1548-15 The system is working correctly with this configuration. One of the FRUs (adapters) that you removed may be defective. 1. Turn off the power and remove the power cords. 2. Install a FRU (adapter) and connect any cables and devices that were attached to the FRU. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power. 5. If the Console Selection screen is displayed, choose the system console. 6. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 7. If you are prompted to do so, enter the appropriate password. Is the Please define the System Console screen displayed? NO Go to “Step 1548-16” on page 133. YES Repeat this step until all of the FRUs (adapters) are installed. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. 132 Service Guide Step 1548-16 The last FRU installed or one of its attached devices is probably defective. 1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive. 2. Turn off the power and remove the power cords. 3. Starting with the last installed adapter, disconnect one attached device and cable. 4. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 5. Turn on the power. 6. If the Console Selection screen is displayed, choose the system console. 7. After the word keyboard displays, press the number 5 key on either the directly attached keyboard or on an ASCII terminal keyboard. 8. Enter the appropriate password if you are prompted to do so. Is the Please define the System Console screen displayed? NO Repeat this step until the defective device or cable is identified or all devices and cables have been disconnected. If all the devices and cables have been removed, then one of the FRUs remaining in the system unit is defective. To test each FRU, exchange the FRUs in the following order: 1. Adapter (last one installed) 2. PCI riser card 3. CEC backplane (see notes on page 57) If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. YES The last device or cable that you disconnected is defective. Exchange the defective device or cable. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1548-17 1. Follow the instructions on the screen to select the system console. 2. When the DIAGNOSTIC OPERATING INSTRUCTIONS screen is displayed, press Enter. 3. Select Advanced Diagnostics Routines. 4. If the terminal type has not been defined, you must use the Initialize Terminal option on the FUNCTION SELECTION menu to initialize the AIX diagnostic environment before you can continue with the diagnostics. This is a separate operation from selecting the console display. 5. If the NEW RESOURCE screen displays, select an option from the bottom of the screen. Note: Adapters or devices that require supplemental media are not shown in the new resource list. If the system has adapters or devices that require supplemental media, select option 1. 6. When the DIAGNOSTIC MODE SELECTION screen is displayed, press Enter. 7. Select All Resources (if you were sent here from “Step 1548-21” on page 134, select the adapter/device that you loaded from the supplemental media). Did you get an SRN? NO Go to “Step 1548-19” on page 134. YES Go to “Step 1548-18” on page 134. Chapter 3. Maintenance Analysis Procedures (MAPs) 133 Step 1548-18 Look at the FRU part numbers associated with the SRN. Have you exchanged all the FRUs that correspond to the failing function codes (FFCs)? NO Exchange the FRU with the highest failure percentage that has not been changed. Repeat this step until all the FRUs associated with the SRN have been exchanged or diagnostics run with no trouble found. Run diagnostics after each FRU is exchanged. If the CEC backplane or a network adapter is removed, see notes on page 57. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. Step 1548-19 Does the system have adapters or devices that require supplemental media? NO Go to “Step 1548-20”. YES Go to “Step 1548-21”. Step 1548-20 Consult the PCI adapter configuration documentation for your operating system to verify that all installed adapters are configured correctly. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. Step 1548-21 1. Select Task Selection. 2. Select Process Supplemental Media and follow the onscreen instructions to process the media. Supplemental media must be loaded and processed one at a time. Did the system return to the TASKS SELECTION SCREEN after the supplemental media was processed? NO Go to “Step 1548-22” on page 135. YES Press F3 to return to the FUNCTION SELECTION screen. Go to “Step 1548-17” on page 133, substep 4. 134 Service Guide Step 1548-22 The adapter or device is probably defective. If the supplemental media is for an adapter, replace the FRUs in the following order: 1. Adapter 2. CEC backplane (see notes on page 57) If the supplemental media is for a device, replace the FRUs in the following order: 1. Device and any associated cables 2. The adapter to which the device is attached Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 58 and follow the instructions for the new symptom. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Step 1548-23 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 3. If you have not already done so, configure the service processor with the instructions in note 7 on page 123 and then return here and continue. 4. Exit the service processor menus and remove the power cords. 5. Remove the service access cover (286/E4), or place the drawer (286/C4) into the service position and remove the service access cover. 6. Record the slot numbers of the PCI adapters. Label and record the locations of any cables attached to the adapters. Disconnect any cables attached to the adapters and remove all the adapters. 7. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 8. Turn on the power. Does the system stop with the same error code displayed on the operator panel that directed you to this MAP step? NO Go to “Step 1548-25” on page 136. YES Go to “Step 1548-24”. Step 1548-24 One of the FRUs remaining in the system unit is defective. 1. Turn off the power, remove the power cords and exchange the CEC backplane. See the notes on page 57. 2. Plug in the power cable and wait for the OK prompt to display on the operator panel display. 3. Turn on the power. Chapter 3. Maintenance Analysis Procedures (MAPs) 135 Does the system stop with the same error code displayed on the operator panel that directed you to this MAP step? NO Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Reinstall the original FRU. If the symptom did not change and all the FRUs have been exchanged, return to “Step 1548-2” on page 124. Step 1548-25 The system is working correctly with this configuration. One of the FRUs (adapters) that you removed is probably defective. 1. Turn off the power and remove the power cable from the power outlet. 2. Install a FRU (adapter) and connect any cables and devices that were attached to it. 3. Plug in the power cable and wait for the OK prompt to display on the operator panel display. 4. Turn on the power. 5. If the Console Selection screen is displayed, choose the firmware console. 6. Enter the appropriate password if you are prompted to do so. Does the system stop with the same error code displayed on the operator panel that directed you to this MAP step? NO Repeat this step until all of the FRUs (adapters) are installed, then go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Go to “Step 1548-26”. Step 1548-26 The last FRU installed or one of its attached devices is probably defective. 1. Turn off the power and remove the power cables from the power outlet. 2. Starting with the last installed adapter, disconnect one attached device and cable. 3. Plug in the power cords and wait for the OK prompt to display on the operator panel display. 4. Turn on the power. 5. If the Console Selection screen is displayed, choose the firmware console. 6. Enter the appropriate password if you are prompted to do so. Does the system stop with the same error code displayed on the operator panel that directed you to this MAP step? NO The last device or cable that you disconnected is defective. Exchange the defective device or cable. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. YES Repeat this step until the defective device or cable is identified or all of the devices and cables have been disconnected. If all of the devices and cables have been removed, then one of the FRUs remaining in the system unit is defective. To test each FRU, exchange the FRUs in the following order: 1. Adapter (last one installed) 2. CEC backplane (see notes on page 57) Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. If the symptom did not change and all the FRUs have been exchanged, return to “Step 1548-2” on page 124. 136 Service Guide MAP 1549: Attention Problem Isolation Step 1549-1 Record the error code and the location code(s) that sent you to this MAP. Step 1549-2 Turn off the power. Step 1549-3 Examine the following table and locate the error code that sent you to this MAP to determine your next step. Error Code Action 406x 0EB1 Go to “Step 1549-4”. 406x 0EB2 Go to “Step 1549-4”. 406x 0EB3 Go to “Step 1549-4”. 406x 0EB4 Go to “Step 1549-7” on page 138. 406x 0EB5 Go to “Step 1549-7” on page 138. 406x 0EB6 Go to “Step 1549-7” on page 138. 406x 0EB7 Go to “Step 1549-7” on page 138. 406x 0EB8 Go to “Step 1549-7” on page 138. 406x 0EB9 Go to “Step 1549-7” on page 138. 406x 0EBA Go to “Step 1549-7” on page 138. 406x 0EBB Go to “Step 1549-7” on page 138. 406x 0EBC Go to “Step 1549-7” on page 138. Step 1549-4 Replace the following cards if present, one at a time: v First location code item recorded, if any, in “Step 1549-1”. v Second location code item recorded, if any, in “Step 1549-1”. v Third location code item recorded, if any, in “Step 1549-1”. v Processor card 1 at U0.1-P1-C1, if not recorded in “Step 1549-1”. v Processor card 2 at U0.1-P1-C2, if not recorded in “Step 1549-1”. Attention: Before replacing the system backplane, call support. v CEC backplane at U0.1-P1, if not recorded in “Step 1549-1”. Step 1549-5 Turn on the power. Chapter 3. Maintenance Analysis Procedures (MAPs) 137 Step 1549-6 Did the system stop with the same error code as recorded in “Step 1549-1” on page 137? No The card just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Yes If all of the cards listed in “Step 1549-4” on page 137 have been replaced, go to “Step 1549-10”, otherwise continue with “Step 1549-4” on page 137 and replacing the next card on the list. Step 1549-7 Replace, one at a time the following cards (if present): v First location code item recorded, if any, in “Step 1549-1” on page 137. v Second location code item recorded, if any, in “Step 1549-1” on page 137. v Third location code item recorded, if any, in “Step 1549-1” on page 137. Attention: Before replacing the system backplane, call support. v CEC backplane at U0.1-P1, if not recorded in “Step 1549-1” on page 137. Step 1549-8 Turn on the power. Step 1549-9 Did the system stop with the same error code as recorded in “Step 1549-1” on page 137? No The card just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Yes If all of the cards listed in “Step 1549-7” have been replaced, go to “Step 1549-10”, otherwise continue with “Step 1549-7” and replacing the next card on the list. Step 1549-10 Call support. 138 Service Guide MAP 154C: SPCN Cable Problem Isolation Step 154C-1 Record the error code, location code(s), and word 13 that sent you to this MAP. Step 154C-2 Turn off the power. Step 154C-3 Locate the word 13 value that you recorded in Step 154C-1 in the following table and perform the indicated action. Word 13 Value Action A780 xxxx Replace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer. A781 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 2. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer. A782 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 2. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 3. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer. A783 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 2. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 3. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 4. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer. A784 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q12 of the base system drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 2. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 3. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 4. Trace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q4. 5. Replace the SPCN cable attached to connector U0.dd-P1/Q5 of this I/O drawer. A785 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer back to an attached I/O drawer connector U0.dd-P1/Q5. 2. Trace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q5. 3. Trace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q5. 4. Replace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer. Chapter 3. Maintenance Analysis Procedures (MAPs) 139 Word 13 Value Action A786 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer back to an attached I/O drawer connector U0.dd-P1/Q5. 2. Trace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer back to an attached I/O drawer connector U0.dd-P1/Q5. 3. Replace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer. A787 xxxx 1. Trace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer back to an attached I/O drawer connector U0.dd-P1/Q5. 2. Replace the SPCN cable attached to connector U0.dd-P1/Q4 of this I/O drawer. A78F xxxx Replace the SPCN cable attached to U0.1-P2/Q11 of the base system drawer. Step 154C-4 Turn on the power. Step 154C-5 Did the system stop with the same error code as recorded in “Step 154C-1” on page 139? No The SPCN cable just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Yes Call service support. 140 Service Guide Chapter 4. Checkpoints Checkpoints display on the media subsystem operator panel and the virtual terminal while the system is powering on and going through the initial program load (IPL). This chapter explains the IPL flow of the system and provides a table that lists checkpoints that you might see in the operator panel and in a virtual terminal window on the HMC. IPL Flow with an HMC Attached to the System The IPL process starts when ac power is connected to the system. The IPL process has the following phases: v Phase 1: Service Processor Initialization Phase 1 starts when ac power is connected to the system and ends when OK is displayed in the processor subsystem operator panel. 8xxx checkpoints are displayed during this phase. Several 9xxx codes may also be displayed. Service processor menus are available at the end of this phase by pressing any key on the console keyboard. v Phase 2: Hardware Initialization by the Service Processor Phase 2 starts when system power-on is initiated by pressing the power on button on the processor subsystem operator panel. 9xxx checkpoints are displayed during this time. 91FF, the last checkpoint in this phase, indicates the transition to phase 3 is taking place. v Phase 3: System Firmware initialization On a full system partition, at phase 3, a system processor takes control and continues initializing partition resources. During this phase, checkpoints in the form of Exxx are displayed. E105, the last checkpoint in this phase, indicates that control is being passed to the operating system boot program. On a partitioned system, there is a global system-wide initialization phase 3 during which a system processor continues the initialization process. Checkpoints in this phase are of the form Exxx. This global phase 3 ends with a LPAR... on the operator panel. As a logical partition begins a partition initialization phase 3, one of the system processors assigned to that partition continues initialization of resource assigned to that partition. Checkpoints in this phase are also of the form Exxx. This partition phase 3 ends with an E105 displayed on the partition’s virtual operator panel on the HMC, indicating control has been passed to that logical partition’s operating system boot program. For both the global and partition phase 3, location codes may also be displayed on the physical operator panel and the partition’s virtual terminal. v Phase 4: Operating System Boot When the operating system starts to boot, checkpoints in the form 0xxx and 2xxx are displayed. This phase ends when the operating system login prompt displays on the operating system console. 141 IPL Flow Without an HMC Attached to the System The IPL process starts when ac power is connected to the system. The IPL process has the following phases: v Phase 1: Service Processor Initialization Phase 1 starts when ac power is connected to the system and ends when OK is displayed in the media subsystem operator panel. 8xxx checkpoints are displayed during this phase. Several 9xxx codes may also be displayed. Service processor menus are available at the end of this phase by striking any key on the console keyboard. v Phase 2: Hardware Initialization by the Service Processor Phase 2 starts when system power-on is initiated by pressing the power on button on the processor subsystem operator panel. 9xxx checkpoints are displayed during this time. 91FF, the last checkpoint in this phase, indicates the transition to phase 3 is taking place. v Phase 3: System Firmware initialization At phase 3, a system processor takes over control and continues initializing partition resources. During this phase, checkpoints in the form Exxx are displayed. E105, the last checkpoint in this phase, indicates that control is being passed to the operating system boot program. v Phase 4: Operating System Boot When the operating system starts to boot, checkpoints in the form 0xxx and 2xxx are displayed. This phase ends when the operating system login prompt displays on the operating system console. The following table outlines the IPL phases from power-on to the operating system login prompt, matching the phases to corresponding operator-panel checkpoints for a typical partition boot. Also listed are the approximate boot times for each phase, along with their dependencies for each phase. Note: The following table lists the major checkpoints only. Phase # Checkpoints on Operator Panel or HMC Time in Phase (minutes) Major Contributors to Time in Phase Phase 1 8xxx => OK Approx. 1 Phase 2 9xxx => 91FF 3 to 6 Number of I/O drawers Phase 3 Exxx => E105 1 to 3 Number of bootable adapters Phase 4 0xxx or 2xxx => blank 2 to 30+ Number of SSA drives Number of SCSI drives Number of Async sessions Number of processors Number of adapters Amount of memory Checkpoints enable users and service personnel to know what the server is doing, with some detail, as it initializes. These checkpoints are not intended to be error indicators, but in some cases a server could hang at one of the checkpoints without displaying an 8-character error code. It is for these hang conditions, only, that any action should be taken with respect to checkpoints. The most appropriate action is included with each checkpoint. 142 Service Guide Before taking actions listed with a checkpoint, check for additional symptoms in the service processor error log. See the “System Information Menu” on page 317 for information on how to access the service processor error log. Notes: 1. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 for any of the following checkpoint hang conditions: v A four-digit code in the range of 8xxx-E4FF, E5Ex, or E700-EFFF that is not listed in the checkpoint tables. v A four-digit code in the range of 8xxx-E4FF, E5Ex, or E700-EFFF that is listed in the checkpoint tables but does not contain a repair action or FRU listing. v A four-digit code in the range of 8xxx-E4FF, E5Ex, or E700-EFFF, in which all of the FRUs listed have been replaced or service actions have been performed and the problem has not been corrected. 2. Go to “MAP 1542: I/O Problem Isolation” on page 111 for any of the following checkpoint hang conditions: v A four-digit code in the range of E500 through E6FF (except E5Ex - see note 1 above) that is not listed in the checkpoint tables. v A four-digit code in the range of E500 through E6FF (except E5Ex - see note 1 above) that is listed in the checkpoint tables but does not contain a repair action or FRU listing. v A four-digit code in the range of E500 through E6FF (except E5Ex - see note 1 above), in which all of the FRUs listed have been replaced or service actions have been performed and the problem has not been corrected. Checkpoints listed in the following tables are in the format 8xxx, 9xxx, Axxx, Bxxx, or Exxx, where x is any hexadecimal digit from 1-9 or A-F. If your system hangs with a checkpoint displayed that begins with anything other than 8, 9, A, B, or E, go to the Diagnostic Information for Multiple Bus Systems. Use the following table to determine where to find the checkpoint. Symptom Action You have a code that is of the form 8xxx, 9xxx, Axxx, or Bxxx. Go to “Service Processor Checkpoints”. You have a code of the form Exxx. Go to “Firmware Checkpoints” on page 150. You have a four-character code that is not of the form 8xxx, 9xxx, Axxx, Bxxx, or Exxx. Refer to the Diagnostic Information for Multiple Bus Systems manual. Service Processor Checkpoints Service processor checkpoints are in the range 8xxx to Bxxx. The message OK indicates successful service processor testing and initialization. Firmware checkpoints are listed in “Firmware Checkpoints” on page 150. Note: A spinning (or twirling) slash displays in the upper-right corner of the operator panel display while some checkpoints are being displayed. If the slash stops spinning, a hang condition is indicated. Checkpoint Description Action/ Possible Failing FRU 8000 Test of the service processor DRAM See note 1 on page 149. 8004 Verify base code checksum See note 1 on page 149. 8008 Verify base code mirror checksum See note 1 on page 149. Chapter 4. Checkpoints 143 Checkpoint Description Action/ Possible Failing FRU 8009 Set NVRAM configuration in TITAN See note 1 on page 149. 800F Start base code See note 1 on page 149. 8010 Start supervisor in base code See note 1 on page 149. 8012 Initialize hardware interrupts See note 1 on page 149. 8014 Allocate and initialize area in DRAM See note 1 on page 149. 8016 Initialize debugger and build VPD See note 1 on page 149. 8018 Initialize service processor interfaces See note 1 on page 149. 801A Initialize external FLASH load See note 1 on page 149. 801B Initialize and test ISA hub chip See note 1 on page 149. 801C Initialize and test SUPER I/O chip See note 1 on page 149. 801F Initialize diskette, verify VDASD and start operational loader See note 1 on page 149. 809x Related to recovery mode system firmware update from diskette See note 3 on page 149. 8092 Diskette initialization during recovery mode system firmware update See note 3 on page 149. 8098 Firmware image being read during recovery mode system firmware update See note 3 on page 149. 809C Firmware image being written during recovery mode system firmware update See note 3 on page 149. 809F Diskette error during recovery mode system firmware update See note 3 on page 149. 8100 Start operational loader See note 1 on page 149. 81xx Load code into DRAM from VDASD (where xx = number of loads) See note 1 on page 149. 81FE Load service processor operating system See note 1 on page 149. 81FF Start service processor operating system See note 1 on page 149. 9000 Primary service processor task initializing See note 1 on page 149. 9009 System processor IPL has started See note 1 on page 149. 9010 Primary service processor thread waiting for response See note 1 on page 149. 9020 Service processor state set to pre-standby See note 1 on page 149. 9021 Service processor state set to standby See note 1 on page 149. 9022 Service processor state set to IPL See note 1 on page 149. 9023 Service processor state set to runtime See note 1 on page 149. 9024 Service processor state set to terminate See note 1 on page 149. 9025 Service processor state set to DPO See note 1 on page 149. 9030 Waiting for secondary thread acknowledgement See note 1 on page 149. 9033 I/O base initialization See note 1 on page 149. 9034 I2C initialization See note 1 on page 149. 9035 ISA initialization See note 1 on page 149. 9036 Mailbox initialization See note1 on page 149. 144 Service Guide Checkpoint Description Action/ Possible Failing FRU 9037 SLLA initialization (VPD) See note 1 on page 149. 9038 NVRAM initialization See note 1 on page 149. 9039 Serial port initialization See note 1 on page 149. 903A Serial I/O initialization See note 1 on page 149. 903B SPCN initialization See note 1 on page 149. 9051 End of reset/reload operation: service processor dump has completed See note 1 on page 149. 9055 Start menus thread See note 1 on page 149. 9060 Checking/configuring modem on serial port 1 See note 4 on page 149. 9061 Checking/configuring modem on serial port 2 See note 4 on page 149. 906A Create mailbox processing thread See note 1 on page 149. 906B Create menu interface thread See note 1 on page 149. 906C Create general thread See note 1 on page 149. 906D Create SPCN thread See note 1 on page 149. 9070 Create thread to control LEDs on I/O subsystems See note 1 on page 149. 9071 Initialize thread to control LEDs on I/O subsystems See note 1 on page 149. 9080 Start VPD collection program See note 1 on page 149. 9081 Start SC VPD data collection See note 1 on page 149. 9082 I2C/SPCN VPD data collection See note 1 on page 149. 9083 End SC VPD data collection See note 1 on page 149. 9084 End I2C/SPCN VPD data collection See note 1 on page 149. 90FD Waiting for IBIST to complete See note 1 on page 149. 9101 Setup load source See note 1 on page 149. 9102 Open load source See note 1 on page 149. 9103 Verify open status of load source See note 1 on page 149. 9104 Waiting for SPCN VPD collection to be completed See note 1 on page 149. 9105 Load IPL LIDs See note 1 on page 149. 9106 Initialize MOPs See note 1 on page 149. 9107 System flush See note 1 on page 149. 9108 Processor scan interface BATs and LBIST See note 1 on page 149. 9109 ABIST See note 1 on page 149. 910A Non-processor scan interface BATs and LBIST See note 1 on page 149. 910B Wire test See note 1 on page 149. 910C Initialize main storage (system memory) See note 1 on page 149. 910D Configure processors See note 1 on page 149. 910E Configure and initialize I/O See note 1 on page 149. Chapter 4. Checkpoints 145 Checkpoint Description Action/ Possible Failing FRU 910F Test and initialize main storage (system memory) See note 1 on page 149. 9110 Free IPL LIDs See note 1 on page 149. 9111 Load run-time LIDS See note 1 on page 149. 9112 Load dump LIDs See note 1 on page 149. 9113 Enable attention See note 1 on page 149. 9114 Enable function 22 See note 1 on page 149. 9115 Load system IPL program See note 1 on page 149. 9116 Load system data areas (NACA/PACA) See note 1 on page 149. 9117 Start system IPL program See note 1 on page 149. 9119 Waiting for VPD collection to complete See note 1 on page 149. 9184 Firmware image being loaded into service processor’s control storage during firmware update. 1. Remove, then reapply power to the system. If the service processor enters recovery mode, (indicated by A1FD 0000 in the display), attempt to update the firmware again in recovery mode. If the service processor comes up to the OK prompt, try again to update the firmware using the service processor menus. 2. Call service support. 91C4 Flash update pending, either a firmware update or hypervisor dump. Note: This checkpoint may be displayed during the firmware update process. The checkpoint may be displayed for a long period of time during the process. If process does not complete, the system may be hung. See note 1 on page 149. 91FF Control being handed to system processor from service processor See note 1 on page 149. 9200 Scan interface BATs See note 1 on page 149. 9300 Logic BIST See note 1 on page 149. 9301 Scanned ABIST See note 1 on page 149. 9302 Dedicated ABIST See note 1 on page 149. 9303 Dedicated ABIST array fuse repair calculation See note 1 on page 149. 9380 Built-in-self-test (BIST) See note 2 on page 149. 9400 Service processor is requesting system flush See note 1 on page 149. 9410 Service processor is issuing request to start instruction execution See note 1 on page 149. 9411 Service processor is issuing request to stop instruction execution See note 1 on page 149. 9420 Service processor is issuing request to start system clocks See note 1 on page 149. 9421 Service processor is issuing request to stop system clocks See note 1 on page 149. 146 Service Guide Checkpoint Description Action/ Possible Failing FRU 94B0 Wire test faulty driver status refcode See note 2 on page 149. 94B1 Wire test shorted net status refcode See note 2 on page 149. 94B2 Wire test elastic interface test See note 2 on page 149. 94BB Wire test starting See note 2 on page 149. 9501 IPL diagnostic initialization See note 2 on page 149. 9502 IPL diagnostic L3 cache march test See note 2 on page 149. 9503 IPL diagnostic L3 connections test See note 2 on page 149. 9504 IPL diagnostic L2 cache march test See note 2 on page 149. 9505 IPL diagnostic RIO wrap test See note 2 on page 149. 9506 IPL diagnostic cleanup See note 2 on page 149. 9507 IPL diagnostic test See note 2 on page 149. 9508 IPL diagnostic main storage march test See note 2 on page 149. 9509 IPL diagnostic main storage connections test See note 2 on page 149. 950A IPL diagnostic elastic interface slack test See note 2 on page 149. 950B IPL diagnostic fast initialization to zeros See note 2 on page 149. 96C2 IPL MOPs processor configuration See note 2 on page 149. 96C3 IPL MOPs main storage size See note 2 on page 149. 96C4 IPL MOPs main storage configuration See note 2 on page 149. 96C6 IPL MOPs I/O configuration See note 2 on page 149. 96C7 IPL MOPs: enable chip machine checks See note 2 on page 149. 96E1 Initialize run-time PRD objects and memory See note 2 on page 149. 96E2 Run-time initialization: enable attention handling See note 2 on page 149. 99FD Service processor receiving firmware update module See note 2 on page 149. Attention: This checkpoint may stay in the operator panel for some period of time during a firmware update or hypervisor dump with no indication of activity. Do not assume that the service processor is hung unless this checkpoint has been in the operator panel for at least 15 minutes. with no indication of activity. Attention: Do not remove power from the system, or reset the service processor, during a firmware update or hypervisor dump. Chapter 4. Checkpoints 147 Checkpoint Description Action/ Possible Failing FRU 99FF Service processor writing firmware update module See note 2 on page 149. Attention: This checkpoint may stay in the operator panel for some period of time during a firmware update or hypervisor dump with no indication of activity. Do not assume that the service processor is hung unless this checkpoint has been in the operator panel for at least 15 minutes. with no indication of activity. Attention: Do not remove power from the system, or reset the service processor, during a firmware update or hypervisor dump. A800 Start service processor dump process See note 1 on page 149. A801 Start dump to NVRAM See note 1 on page 149. A802 Start dump to debug port See note 1 on page 149. A803 NVRAM not usable See note 1 on page 149. A804 NVRAM dump done See note 1 on page 149. A805 Start dump to flash See note 1 on page 149. A806 Flash dump area done See note 1 on page 149. A807 Flash area not usable See note 1 on page 149. A808 Flash error log to debug port See note 1 on page 149. A809 Flash erase start See note 1 on page 149. A80A Flash erase end See note 1 on page 149. A80B Reserved; not used See note 1 on page 149. A80C Reserved; not used See note 1 on page 149. A80D Reserved; not used See note 1 on page 149. A80E Reserved; not used See note 1 on page 149. A80F Service processor dump done See note 1 on page 149. A810-A8FF Description: Scan log dump in progress Action/Possible Failing FRU Informational message. (See “Scan Log Dump Policy” on page 312.) The last two characters of the checkpoints will change as the scan log dump progresses. If the last two characters do not change after 5 minutes, the service processor is hung and must be reset. Follow the procedure outlined in Note 1 on page 149 following this table. When the scan log dump is complete, depending on how the reboot policy is set, the system will either: v Go to the standby state (and the service processor menus will be available), indicated by OK or STBY in the operator panel, OR v Attempt to reboot. B0F5 Delayed power-off sequence has begun See note 1 on page 149. 148 Service Guide Checkpoint Description Action/ Possible Failing FRU B0FA Delayed power-off acknowledged See note 1. B0FF Power-off requested program executing See note 1. Notes: 1. If the system fails to progress after this checkpoint is put in the display, do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel. b. If the hang repeats, check with service support to see if there is a firmware update that fixes the problem. c. If not, or the update does not fix the problem, replace the CEC backplane, location: U0.1-P1. d. If this procedure does not fix the problem, call service support. 2. If the system fails to progress after this checkpoint is put in the operator display, do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel. b. Reboot the system in slow mode. (Slow/fast IPL is set using a service processor menu.) If the hang repeats, check with service support to see if there is a system firmware update that fixes this problem. c. If not, or the update does not fix the problem, call service support. 3. Recovery mode flash update has failed. Do the following: a. Activate the service processor pinhole reset switch on the system’s operator panel and try the recovery mode flash update again. b. If the system hangs again, make a new set of firmware update diskettes, using new, blank diskettes. Retry the flash update with the new diskettes. c. If the same hang occurs with the new diskettes, replace the following FRUs one at a time in this order: 1) Diskette drive 2) CEC backplane, location: U0.1-P1. d. If this procedure does not fix the problem, call service support. 4. The service processor was unable to configure the modem(s) attached to the serial port(s) on the CEC backplane (a problem with the modem attached to the HMC is not indicated). Do the following: a. Check the modem, modem cable, and serial port for loose or unconnected cables. Reset the service processor using the pinhole reset switch on the system’s operator panel. b. If the hang repeats, check with service support to see if there is a firmware update that fixes the problem. c. If there is not, or the update does not fix the problem, replace the following FRUs, one at a time, in the following order: 1) Modem cable 2) Modem d. If this procedure does not fix the problem, call service support. Chapter 4. Checkpoints 149 Firmware Checkpoints Firmware uses checkpoints (progress codes) in the range of Exxx to EFFF. These checkpoints occur during system startup and can be useful in diagnosing certain problems. Service processor checkpoints are listed in “Service Processor Checkpoints” on page 143. If you have a checkpoint with no location code associated with it, see “Determining Location Code” on page 180. If a checkpoint has a location code associated with a FRU replacement, see “Mapping AIX and Physical Location Codes” on page 28 for physical location information. If you replace FRUs and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables. Note: If you receive a four-digit code in the range of E1xx to EFFF that is not listed in the following table, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. Checkpoint Description Action/ Possible Failing FRU E101 Create RTAS node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E102 Load/Init RTAS Go to “MAP 1540: Problem Isolation Procedures” on page 108. E105 Transfer control to operating system (normal boot). See “Boot Problems/Concerns” on page 173. E10A Load RTAS device tree Go to “MAP 1540: Problem Isolation Procedures” on page 108. E10B Set RTAS device properties Go to “MAP 1540: Problem Isolation Procedures” on page 108. E111 GOOD CRC - jump to composite image Go to “MAP 1540: Problem Isolation Procedures” on page 108. E120 Initialize I/O and early memory block Go to “MAP 1540: Problem Isolation Procedures” on page 108. E121 Initialize service processor Go to “MAP 1540: Problem Isolation Procedures” on page 108. E122 RTAS sensor setup complete Go to “MAP 1540: Problem Isolation Procedures” on page 108. E130 Build device tree Go to “MAP 1540: Problem Isolation Procedures” on page 108. E131 Build device tree Go to “MAP 1540: Problem Isolation Procedures” on page 108. E132 Build device tree Go to “MAP 1540: Problem Isolation Procedures” on page 108. E133 Build device tree Go to “MAP 1540: Problem Isolation Procedures” on page 108. E134 Create memory node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E138 Create options node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E139 Create node aliases and system aliases Go to “MAP 1540: Problem Isolation Procedures” on page 108. E13A Create packages node Go to “MAP 1540: Problem Isolation Procedures” on page 108. 150 Service Guide Checkpoint Description Action/ Possible Failing FRU E140 Operating system load See “Boot Problems/Concerns” on page 173. E149 Create boot mgr node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E14C Create terminal-emulator node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E14D Loading boot image See “Boot Problems/Concerns” on page 173. E150 Create host (primary) PCI controller node If a location code is specified, replace the adapter at that location code. If no location code is specified, go to “MAP 1540: Problem Isolation Procedures” on page 108. E151 Probing primary PCI bus If a location code is specified, replace the adapter at that location code. If no location code is specified, go to “MAP 1540: Problem Isolation Procedures” on page 108. E152 Probing for adapter FCODE, evaluate if present Follow the repair action listed for checkpoint E151. E153 End adapter FCODE, probe/evaluate Follow the repair action listed for checkpoint E151. E154 Create PCI bridge node Follow the repair action listed for checkpoint E151. E155 Probing PCI bridge secondary bus Follow the repair action listed for checkpoint E151. E156 Create PCI Ethernet node If a location code is specified, replace the adapter at that location code. If no location code is specified, go to “MAP 1540: Problem Isolation Procedures” on page 108. E158 System firmware waiting for a virtual terminal to be opened on an HMC for SMS menus to be displayed. Open a virtual terminal window on the HMC. E15B Transferring control to operating system (service mode boot) See “Boot Problems/Concerns” on page 173. E15F Adapter VPD probe Follow the repair action listed for checkpoint E151. E170 Start of PCI Bus Probe Follow the repair action listed for checkpoint E151. E172 First pass of PCI device probe Follow the repair action listed for checkpoint E151. Chapter 4. Checkpoints 151 Checkpoint Description Action/ Possible Failing FRU E174 Description: Establishing host connection Action/Possible Failing FRU: If the system is not connected to an active network, or if the target server is inaccessible (which can result from incorrect IP parameters being specified), the system still attempts to boot and because time-out durations are necessarily long to accommodate retries, the system may appear to hang. Restart the system and get to the SMS utilities menu. Under Select Boot Options, verify: v Is the intended boot device correctly specified in the boot list? v Are the IP parameters correct? v Look at the server configuration for this client. v Attempt to ping the target server using the ping utility in SMS. E175 BootP request Follow the repair actions for checkpoint E174. E176 TFTP file transfer Follow the repair actions for checkpoint E174. E177 Transfer failure due to TFTP error condition See “Boot Problems/Concerns” on page 173. E17B Processor clock speed measurement Call service support. E183 Service processor POST 1. Replace the CEC backplane, location: U0.1-P1. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E18D A system reset request has been sent to the service processor. 1. Check for system firmware updates. 2. Call service support. E191 Create ISA reserved node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E19A NVRAM auto-boot? variable not found - assume FALSE Go to “MAP 1540: Problem Isolation Procedures” on page 108. E19B NVRAM menu? variable not found - assume FALSE Go to “MAP 1540: Problem Isolation Procedures” on page 108. E19D Create NVRAM node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E19E Real-time clock (RTC) initialization Refer to error code 28030xxx in “Checkpoint and Error Code Index” on page 181. E1A0 User requested boot to SMS menus via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A1 User requested boot to open firmware prompt via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A2 User requested boot using default service mode boot list via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A3 User requested boot using customized service mode boot list via keyboard entry Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A4 User requested boot to SMS menus via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 108. 152 Service Guide Checkpoint Description Action/ Possible Failing FRU E1A5 User requested boot to open firmware prompt via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A6 User requested boot using default service mode boot list via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A7 User requested boot using customized service mode boot list via HMC or CSP command Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A8 System booting to open firmware prompt Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1A9 Description: System booting to SMS menus Action/Possible Failing FRU: 1. If a graphics adapter is driving the console display, connect a laptop (and cable PN 62H4857) to serial port 1 on the processor subsystem. If the SMS menus appear on the laptop you may have a problem with the display. Go to the “Quick Entry MAP” on page 58 with the display problem symptom. 2. If the SMS menus do not appear on the laptop in step 1 or a graphics adapter is not driving the console display, go to “MAP 1540: Problem Isolation Procedures” on page 108. E1AA System NVRAM settings during boot Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1AB System booting using default service mode boot list Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1AC System booting using customized service mode boot list Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1AD System booting to operating system Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1AE System booted to multiboot menu via NVRAM settings Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1AF System booted to utilities menu via NVRAM settings Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1B1 Create serial node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1B6 Probe for (ISA) keyboard Replace the CEC backplane, Location: U0.1-P1. E1BD Probe for (ISA) mouse Replace the CEC backplane, Location: U0.1-P1. E1C4 Build ISA timer chip node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1C5 Create ISA interrupt controller (pic) node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1C6 Create DMA node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1D0 Create PCI SCSI node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1D3 Create SCSI block device node (SD) Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1D4 Create SCSI byte device node (ST) Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 4. Checkpoints 153 Checkpoint Description Action/ Possible Failing FRU E1DB Create floppy controller (FDC) node Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1DC Description: Dynamic console selection. Action/Possibly Failing FRU 1. If a console is attached, but nothing is displayed on it, go to ²All display problems² in the Entry MAP. 2. If selection screens can be seen on the terminals and the appropriate key on the input device is pressed within 60 seconds but there is no response to the keystroke: a. If selecting the console with a keyboard attached to the system, replace the keyboard, then replace the CEC backplane, Location: U0.1-P1. b. If selecting the console with an ASCII terminal, suspect the terminal. Use the problem determination procedures for the terminal. Note: Terminal setting should be: v 9600 baud v no parity v 8 data bits v 1 stop bit E1F0 Start O.B.E. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1F1 Begin selftest sequence on boot devices. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1F2 Power-on password prompt The prompt should be visible on the firmware console (maybe a virtual terminal on the HMC). If a console is attached but nothing is displayed on it, go to the “Quick Entry MAP” on page 58 with the symptom ²All display problems.² E1F3 Privileged-access password prompt The prompt should be visible on the firmware console (maybe a virtual terminal on the HMC). If a console is attached but nothing is displayed on it, go to the “Quick Entry MAP” on page 58 with the symptom ²All display problems.² E1F4 End self-test sequence on boot devices; begin SMS. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1F5 Build boot device list Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1F6 Determine boot device sequence Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1F7 No boot image located Go to “Boot Problems/Concerns” on page 173. E1F8 Building boot device list for SCSI adapters. (The location of the SCSI adapter being scanned is also displayed.) 1. If a location code is available, follow the Repair actions listed for error code 21A0 00XX. 2. If no location code is available, go to “MAP 1540: Problem Isolation Procedures” on page 108. 154 Service Guide Checkpoint Description Action/ Possible Failing FRU E1F9 Building boot device list for fibre-channel adapters. (The location of the SAN adapter being scanned is also displayed.) 1. If a location code is available, follow the Repair actions listed for error code 21A0 00XX. 2. If no location code is available, go to “MAP 1540: Problem Isolation Procedures” on page 108. E1FA Building boot device list for SCSI adapters. (The device ID and the device LUN being scanned are also displayed.) Go to “MAP 1540: Problem Isolation Procedures” on page 108. E1FB Scan SCSI bus for attached devices 1. If a location code is available, follow the Repair actions listed for error code 21A0 00XX. 2. If no location code is available, go to “MAP 1540: Problem Isolation Procedures” on page 108. E1FD The operator panel alternates between the code E1FD and another Exxx code, where Exxx is the point at which the error occurred. 1. If a location code is available, follow the actions listed for error code 21A0 00xx. 2. If no location code is available, go to “MAP 1540: Problem Isolation Procedures” on page 108. E1FE Building boot device list for fibre-channel adapters. (The WWPN of the SAN adapter being scanned is also displayed.) 1. If a location code is available, follow the Repair actions listed for error code 21A0 00XX. 2. If no location code is available, go to “MAP 1540: Problem Isolation Procedures” on page 108. E1FF Building boot device list for fibre-channel adapters. (The WWPN of the SAN adapter being scanned is also displayed.) Go to “MAP 1542: I/O Problem Isolation” on page 111. E20F System firmware has exited to open firmware prompt 1. Replace the CEC backplane, Location: U0.1-P1. 2. Call service support. E440 Validate NVRAM, initialize partitions as needed 1. Check for system firmware updates. 2. Replace the CEC backplane, Location: U0.1-P1. 3. Call service support. E441 Generate /options node NVRAM configuration variable properties 1. Check for system firmware updates. 2. Replace the CEC backplane, Location: U0.1-P1. 3. Call service support. E442 Validate NVRAM partitions 1. Check for system firmware updates. 2. Replace the CEC backplane, Location: U0.1-P1. 3. Call service support. E443 Generate NVRAM configuration variable dictionary words Suspect a system firmware problem if problem persists. If a system firmware update is available, install it. E500 Begin I/O drawer configuration 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 4. Checkpoints 155 Checkpoint Description Action/ Possible Failing FRU E501 Initialize I/O data structures 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E502 Set up I/O data structure pointers 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E503 Initialize location code data structures 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E504 Initialize memory map data structures 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E505 Enable the time base for all RIO hubs 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E506 Reset time base of processors 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E507 RIO hub assignment 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E508 Begin initializing RIO hubs 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E509 Begin configuring EADS 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E50A Check for missing RIO interface chips 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E50B Save RIO hub information 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E50C Copy all of the memory ranges 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E50D Copy all of the I/O data 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 156 Service Guide Checkpoint Description Action/ Possible Failing FRU E50E Copy all of the RIO hub information 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E50F Clean up the error registers 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E51F End of I/O configuration 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E520 Set up the RIO architecture registers 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E521 Set up interrupts for the current RIO hub 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E522 Set up miscellaneous registers for the current RIO hub 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E523 Initialize data structures and enable all RIO hub ports 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E524 Determine I/O configuration of current RIO hub 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E525 Set up the ETE of the current RIO hub 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 4. Checkpoints 157 Checkpoint Description Action/ Possible Failing FRU E526 Reassign the I/O subsystems to the closest RIO hub 1. Replace the RIO-G cable specified by the location code. 2. Replace the RIO riser card that the failing drawer is attached to. 3. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 4. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E527 Enable the memory MAP and I/O MAP for the current RIO hub port 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E52F End of configuration of the RIO hub 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E528 Begin search for the next available I/O drawer from a current RIO port 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the CEC backplane, U0.1-P1. 4. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. E529 Enable the control port and driver of the current RIO hub 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the CEC backplane, U0.1-P1. 4. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. 158 Service Guide Checkpoint Description Action/ Possible Failing FRU E52A Search for the next available I/O drawer from a current RIO port 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the CEC backplane, U0.1-P1. 4. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. E52B The RIO cable connected to the RIO port has been detected 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the CEC backplane, U0.1-P1. 4. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. E530 Checking RIO hub ports on current RIO hub 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E531 Get RIO hub node ID 1. Replace the RIO-G cable specified by the location code. 2. Replace the RIO riser card that the failing drawer is attached to. 3. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 4. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E532 RIO cable detected; enable the RIO hub port 1. Replace the RIO-G cable specified by the location code. 2. Replace the RIO riser card that the failing drawer is attached to. 3. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 4. Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 4. Checkpoints 159 Checkpoint Description Action/ Possible Failing FRU E533 Set up the RIO hub routing table 1. Replace the RIO-G cable specified by the location code. 2. Replace the RIO riser card that the failing drawer is attached to. 3. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 4. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E534 Get the node ID of the node on the other end of the RIO cable 1. Replace the RIO-G cable specified by the location code. 2. Replace the RIO riser card that the failing drawer is attached to. 3. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 4. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E535 Valid node ID detected at other end of the RIO cable 1. This error code may indicate that the RIO-G cables are not properly connected to the I/O subsystems. Check the RIO cabling. 2. Replace the RIO-G cable specified by the location code. 3. Replace the RIO riser card that the failing drawer is attached to. 4. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 5. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E536 I/O subsystem detected at other end of the RIO cable 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E537 Configuring the I/O subsystem 1. Replace the RIO-G cable specified by the location code. 2. Replace the RIO riser card that the failing drawer is attached to. 3. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 4. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E538 Detected the RIO hub on the other end of the RIO cable 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 160 Service Guide Checkpoint Description Action/ Possible Failing FRU E539 Enabling the RIO hub ports 1. Check for system firmware updates. 2. Replace the RIO riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E53A Set up the RIO port speed of the exit port of the I/O subsystem connected to the RIO hub port 1. Replace the RIO-G cable specified by the location code. 2. Replace the RIO riser card that the failing drawer is attached to. 3. Replace the I/O subsystem planar that the RIO-G cable in step 1 is attached to. 4. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E53B End of looking for next node port 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E53C End of looking for next RIO hub port 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E53D Disable the anynode mode Replace the I/O subsytem backplane that the RIO cable specified by the location code is attached to. E540 Begin initialization of I/O drawer 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E541 Initialize the routing table 1. Replace the I/O subsystem planar specified by the location code. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E544 Enable the RIO ports Replace the I/O subsystem planar specified by the location code. E545 Set up the device routing address for this new node ID Replace the I/O subsystem planar specified by the location code. E546 Set up the RIO interface chip new node ID Replace the I/O subsystem planar specified by the location code. E547 Set up the RIO interface chip routing table Replace the I/O subsystem planar specified by the location code. E548 Disable any node mode Replace the I/O subsystem planar specified by the location code. E549 Adjust RIO hub table Replace the I/O subsystem planar specified by the location code. Chapter 4. Checkpoints 161 Checkpoint Description Action/ Possible Failing FRU E54A Configure the I/O drawer 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. E54B Update the I/O drawer routing table of RIO ports 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem. 2. Replace the RIO cables specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, replace the RIO cables going into and out of the I/O subsystem. 3. Replace the I/O subsystem backplane specified by the location code, or the I/O subsystem backplane that the RIO cables in step 1 are attached to. E550 Begin to initialize RIO interface chip Replace the device specified by the location code. E551 Set up the RIO speed Replace the device specified by the location code. E552 Set up RIO interface chip registers Replace the device specified by the location code. E553 Set up the RIO interface chip base address Replace the device specified by the location code. E554 Set up more RIO interface chip registers Replace the device specified by the location code. E555 Set up the RIO interface chip RIO link Replace the device specified by the location code. E556 Set up and initialize RIO interface chips under current RIO interface chip Replace the device specified by the location code. E55F End of I/O drawer initialization Replace the device specified by the location code. E560 Begin initializing the I/O drawer 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 162 Service Guide Checkpoint Description Action/ Possible Failing FRU E561 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E562 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E563 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E564 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E565 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. Chapter 4. Checkpoints 163 Checkpoint Description Action/ Possible Failing FRU E566 Enable device arbitration on the RIO interface chip If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E567 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E568 Set up the external interrupt vector register If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E569 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E56A Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. 164 Service Guide Checkpoint Description Action/ Possible Failing FRU E56B Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E56C Set up the RIO interface chip PCI bus If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E56D Identify devices on current bus If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E56E Begin initializing the service processor control chip If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2. 2. Replace the CEC backplane: location U0.1-P1. Otherwise, replace the device specified by the location code. E56F End initializing the service processor control chip 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E570 Initialize the PCI-ISA bridge chip Replace the device specified by the location code. E571 Initialize the integrated Ethernet (if present) Replace the device specified by the location code. E572 Initialize the integrated SCSI Replace the device specified by the location code. E57F End of RIO interface chip initialization 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 4. Checkpoints 165 Checkpoint Description Action/ Possible Failing FRU E580 Begin initializing the EADS module. 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E581 Read the PCI bus configuration header. 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E582 EADS has been detected. 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E583 Running BIST on the current EADS 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E584 Checking function of current EADS 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E585 Valid function of current EADS detected 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E586 Set up EADS function 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E587 Set up EADS BIST 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E588 Set up EADS function register 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 166 Service Guide Checkpoint Description Action/ Possible Failing FRU E589 Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E58A Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E58B Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E58C Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E58D Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E58E Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E58F Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E590 Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E591 Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 4. Checkpoints 167 Checkpoint Description Action/ Possible Failing FRU E592 Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E593 Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E594 Begin probing slots 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E595 Detected valid adapter in slot 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E596 Read the EADS revision ID and class code. 1. Replace the RIO cable specified by the location code. 2. Replace the I/O subsystem planar that the RIO cable in step 1 is attached to. E59E End of walking bus of current EADS 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E5E0 Initialize system processor array 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E5E1 Reserved E5E2 Initialization of the hypervisor, phase 1 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5E3 Not in quick-restart: load and set up open firmware code in memory 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 168 Service Guide Checkpoint Description Action/ Possible Failing FRU E5E4 Activate slave processors 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5E5 Initialize the hypervisor, phase 2 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5E6 Set L2 and processor status 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5E7 Save I/O hub information for RTAS and open firmware 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5E8 Set up registers 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5E9 Reset the quick restart path 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5EE Hang state in main control code 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5EF End of low-level firmware 1. Check for system firmware updates. 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. E5FE Get the system’s licensed memory quantity from the service processor 1. Remove, then reapply AC power to the system using the UEPO switch. 2. Verify that the CUoD capacity card is installed and properly seated. 3. Call service support. E600 SSA PCI adapter open firmware has run successfully. Replace the adapter. E601 SSA PCI adapter BIST has started but failed to complete after 4 seconds. Replace the adapter. E602 SSA PCI open firmware has started. Replace the adapter. E603 SSA PCI adapter BIST has completed with an error. Replace the adapter. E604 SSA PCI adapter BIST and subsequent POSTs have completed successfully. Replace the adapter. Chapter 4. Checkpoints 169 Checkpoint Description Action/ Possible Failing FRU E605 SSA PCI adapter BIST has completed successfully but subsequent POSTS have failed. Replace the adapter. E60E SSA PCI open firmware about to exit (no stack corruption). Replace the adapter. E60F SSA PCI open firmware has run unsuccessfully. Replace the adapter. E6FF SSA PCI open firmware about to exit (with stack corruption). Replace the adapter. E700 Create system firmware VPD System firmware may be corrupted. Reinstall system firmware. E701 Create memory VPD. 1. Check for system firmware updates. 2. Call for support. E702 Create processor card VPD. 1. Check for system firmware updates. 2. Call for support. E703 Create operator panel VPD. 1. Check for system firmware updates. 2. Call for support. E704 Create CEC VPD. 1. Check for system firmware updates. 2. Call for support. E705 Create VPD. 1. Check for system firmware updates. 2. Call for support. E706 Create VPD. 1. Check for system firmware updates. 2. Call for support. E707 Create system VPD. 1. Check for system firmware updates. 2. Call for support. E708 Create VPD. Go to “MAP 1540: Problem Isolation Procedures” on page 108. E709 Create VPD via SPCN. 1. Check for system firmware updates. 2. Call service support. E70A Create service processor VPD. 1. Check for system firmware updates. 2. Call for support. E70B Create L3 module VPD 1. Check for system firmware updates. 2. Call service support. E70C Create I/O backplane VPD (I2C problem) Go to “MAP 1540: Problem Isolation Procedures” on page 108. E70D Create VPD 1. Check for system firmware updates. 2. Call service support. E70F Collect CEC VPD Go to “MAP 1540: Problem Isolation Procedures” on page 108. E7EE Delaying while waiting for power subsystem VPD collection to be completed; informational only 1. Check for system firmware updates. 2. Call service support. 170 Service Guide Checkpoint Description Action/ Possible Failing FRU E7FF Successful end of VPD creation Informational message. If the system hangs here, go to “MAP 1540: Problem Isolation Procedures” on page 108. E800 RTAS initialization problem 1. Check for system firmware updates. 2. Call service support. E841 Initializing RTAS/open firmware device tree interface 1. Check for system firmware updates. 2. Call service support. E842 Initializing RTAS/service processor interface 1. Check for system firmware updates. 2. Replace the CEC backplane Location: U0.1-P1. 3. Call service support. E843 Initializing RTAS/sensor interface 1. Check for system firmware updates. 2. Call service support. E844 Initializing RTAS/time-of-day clock interface 1. Check for system firmware updates. 2. Replace the CEC backplane Location: U0.1-P1. 3. Call service support. E845 Initializing interface/sensor access 1. Check for system firmware updates. 2. Call service support. E846 Initializing interface/sensor access 1. Check for system firmware updates. 2. Call service support. E84F Initializing RTAS/operator panel interface 1. Check for system firmware updates. 2. Call service support. E850 Initializing RTAS/cache interface 1. Check for system firmware updates. 2. Call service support. E851 Initializing RTAS/hardware access 1. Check for system firmware updates. 2. Call service support. E852 Initializing RTAS/PCI bus interface 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 3. Call service support. E853 Start cable ID request process 1. Check the SPCN and RIO cabling to the I/O subystems. 2. Remove, then reapply power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 111. 4. Call service support. Chapter 4. Checkpoints 171 Checkpoint Description Action/ Possible Failing FRU E854 Description: Start collecting cable IDs Action: 1. Check the SPCN and RIO cabling to the I/O subystems. 2. Remove, then reapply power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 111. 4. Call service support. E855 Description: Start setting RIO link speed Action: 1. Check the SPCN and RIO cabling to the I/O subystems. 2. Remove, then reapply power to the system. 3. Go to “MAP 1542: I/O Problem Isolation” on page 111. 4. Call service support. E860 Initializing RTAS/power subsystem interface 1. Check for system firmware updates. 2. Call service support. E861 Initializing RTAS/PCI bus interface 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 3. Call service support. E870 Initializing RTAS/logs interface 1. Check for system firmware updates. 2. Call service support. E871 Initializing RTAS/EEH handling interface 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 3. Call service support. E872 Initializing RTAS/error inject interface 1. Check for system firmware updates. 2. Call service support. E873 Initializing RTAS/error handling interface 1. Check for system firmware updates. 2. Call service support. E880 Initializing RTAS/debug interface 1. Check for system firmware updates. 2. Call service support. E881 Initializing RTAS/utility interface 1. Check for system firmware updates. 2. Call service support. E882 Initializing RTAS/softpatch register interface 1. Check for system firmware updates. 2. Call service support. E890 Initializing RTAS/hot-plug interface 1. Check for system firmware updates. 2. Call service support. E8C1 Initializing interface/client interface access 1. Check for system firmware updates. 2. Call service support. E8D1 Initializing interface/memory access 1. Check for system firmware updates. 2. Call service support. EAA1 Probe PCI-PCI bridge bus Follow the repair actions for checkpoint E151. 172 Service Guide Boot Problems/Concerns Depending on the boot device, a checkpoint may be displayed on the operator panel for an extended period of time while the boot image is retrieved from the device. This is particularly true for tape and network boot attempts. If booting from CD-ROM or tape, watch for activity on the drive’s LED indicator. A blinking LED indicates that the loading of either the boot image or additional information required by the operating system being booted is still in progress. If the checkpoint is displayed for an extended period of time and the drive LED is not indicating any activity, there might be a problem loading the boot image from the device. Note: For network boot attempts, if the system is not connected to an active network or if the target server is inaccessible (this can also result from incorrect IP parameters being supplied), the system will still attempt to boot. Because time-out durations are necessarily long to accommodate retries, the system may appear to be hung. This procedure assumes that a diagnostic CD-ROM is available and that an IDE CD-ROM or IDE DVD-RAM drive is connected to the internal IDE connector. If your CD-ROM or DVD-RAM drive is SCSI note the following: v If your system is configured with a four slotted riser card, the CD-ROM or DVD RAM drive must be connected to a SCSI adapter. v If your system is configured with a four slotted riser card, for peek performance load the SCSI adapter into either PCI-X slot 3 or PCI-X slot 4 on the PCI riser card. See “PCI Riser Card Locations” on page 16 for riser card description and connector locations. v If your system is configured with a six slotted riser card the CD-ROM or DVD-RAM can be connected to the integratted SE-SCSI connector located on the PCI riser card. See “PCI Riser Card Locations” on page 16 for riser card description and connector locations. Booting the diagnostics image from CD-ROM or a NIM server is referred to as running standalone diagnostics. Step 1 Without an HMC Attached to the System: Restart the system and access the firmware SMS main menu. Select View Error Log. Look for any errors with time stamps that are close to the boot during which the error was received that sent you here. Perform the actions associated with these error code(s) if present. If the problem is resolved, go to ²MAP 410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. If the problem is not resolved, restart the system and access the firmware SMS menu. Select Select Boot Options. 1. Check to see if the intended boot device is correctly specified in the boot list. If it is in the boot list: a. Remove all removable media from devices in the boot list from which you do not want to boot. b. If attempting to boot from the network, go to “Step 2” on page 174. c. If attempting to boot from a disk drive or CD-ROM, go to “Step 3” on page 175. Chapter 4. Checkpoints 173 2. If the intended boot device is not correctly identified in the boot sequence, add it to the boot sequence using the SMS menus. If the device can be added to the boot sequence, reboot the system, then go to MAP 0410: Repair Checkout in the Diagnostic Information for Multiple Bus Systems. If the intended boot device cannot be added to the boot list, go to “Step 3” on page 175. With an HMC Attached to the System: Attention: If the system is running partitions from partition standby (LPAR), the following procedure addresses the problem in which one partition will not boot while other partitions boot successfully and run the operating system successfully. This procedure also addresses the problem in which a system does not boot to a full system partition. All of the system’s resources are available to the diagnostics; the same general procedure applies, with the exception of moving resources between partitions. Attention: It is the customer’s responsibility to move devices between partitions. If a device must be moved to another partition to run standalone diagnostics, contact the customer or system administrator. (If a SCSI CD-ROM or DVD-RAM must be moved to another partition, all SCSI devices connected to that SCSI adapter must be moved because moves are done at the slot level, not at the device level. In addition, if a CD-ROM or DVD-RAM drive is connected to the internal IDE adapter, and the CD-ROM or DVD-RAM drive must be moved to another partition, all of the integrated I/O devices will move with it.) Both partitions must be rebooted if a device is moved between partitions. Step 2 This procedure assumes that a CD-ROM or DVD-RAM drive is connected to the system, or that the diagnostics can be run from a NIM (Network Installation Management) server. 1. Look at the service action event-error log in the Service Focal Point on the HMC (see “Service Focal Point” on page 48 for instructions on accessing the service action event log). If there are open entries that affect devices in the boot path of the partition, perform the actions to correct those errors. If there are open entries that indicate problems with I/O cabling, perform the actions to correct those problems. Try to reboot the partition. If the partition hangs with a 4-character checkpoint in the display, the partition must be deactivated, then reactivated before attempting to reboot. If a 20EE xxxx error code is reported, the partition is already deactivated and in the error state. Reboot by activating the partition. If the system is running in full system partition, the power must be cycled before the system can be rebooted. If the reboot is still not successful, continue to step substep 2. 2. Boot to the SMS main menu as follows: v If you are rebooting a partition from partition standby (LPAR), go to the properties of the partition and select Boot to SMS, then activate the partition. v If you are rebooting the system in full system partition, start the service processor menus, select option 2, System Power Control Menu, then select option 7, Boot Mode Menu, then select option 1, Boot to SMS menu. Enter 98 to return to the system power control menu, then select option 4, Power-on System. Enter y to confirm. At the SMS main menu, select Select Boot Devices. a. Check to see if the intended boot device is correctly specified in the boot list. If it is in the boot list: 1) Remove all removable media from devices in the boot list from which you do not want to boot. 2) If attempting to boot from a network, go to step “Step 2”. If attempting to boot from a disk drive or CD-ROM, go to step “Step 3” on page 175. 174 Service Guide b. If the intended boot device is not correctly listed in the boot sequence, add it to the boot sequence using the SMS menus. If the device can be added to the boot sequence, reboot the partition. If the intended boot device cannot be added to the boot list, ask the customer or system administrator to verify that the device you are trying to boot from is assigned to the correct partition. If it is, go to step “Step 3”. If attempting to boot from the network: 1. Verify that IP parameters are correct. 2. Attempt to ²ping² the target server using the SMS Ping utility. If the ping is not successful, have the network administrator verify the server configuration for this client. 3. Check with the network administrator to ensure that the network is up. 4. Check the network cabling to the adapter. 5. Turn the power off, then on and retry the boot operation. Step 3 Try to boot and run standalone diagnostics against the system, particularly against the intended boot device. Standalone diagnostics can be run from CD-ROM or a NIM server. To boot standalone diagnostics, follow the detailed procedures in “Considerations for Running Standalone Diagnostics from CD-ROM” on page 301 or “Running Standalone Diagnostics from a Network Installation Management (NIM) Server with an HMC Attached to the System” on page 302. Attention: Standalone diagnostics may be run on systems configured for either full system partition or partition standby. When attempting to boot diagnostics on a partition from partition standby, the device from which you are booting standalone diagnostics must be made available to the partition that is not able to boot, if it is not already in that partition. Contact the customer or system administrator if a device must be moved between partitions in order to boot standalone diagnostics. If devices are moved between partitions, both partitions must be rebooted. For more information, see “Standalone Diagnostics Operations” on page 301 or “Running the Standalone Diagnostics with No HMC Attached” on page 302. If diagnostics do not boot successfully, and a SCSI boot failure (cannot boot from a SCSI attached device) is also occurring, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. If diagnostics do not boot successfully, and a SCSI boot failure (cannot boot from a SCSI attached device) is not occurring: 1. Check IDE cabling to boot device. 2. Check device configuration jumpers. If no problem is found with the cabling or the jumpers, go to “Step 4” on page 176. If diagnostics boot successfully and ²No Trouble Found² was the result when diagnostics were run against the intended boot device, go to substep 4. If the diagnostics boot successfully, but the intended boot device was not present in the output of the ²Display Configuration and Resource List², which is run from the Task Selelction Menu: 1. If you booted standalone diagnostics from IDE CD-ROM, follow these steps. After each action, do the following: Restart the system. Stop at the SMS menus and select Select Boot Options. If the device is present in the boot list, and it was present previously, go to MAP 0410: Repair Checkout in the Diagnostic Information for Multiple Bus Systems. Chapter 4. Checkpoints 175 If the device still is not present, continue with the actions shown in the following list. a. Check the SCSI cables b. Remove all hot-swap disk drives except the intended boot device if its a hot-swap drive. If the boot device is present in the boot list when the system is booted to the SMS menus, add the hot-swap disk drives back in, one at a time, until the failing device is isolated. c. Disconnect all other internal SCSI devices. If the boot device is present in the boot list when the system is booted to the SMS menus, reconnect the internal SCSI devices, one at a time, until the failing device or cable is isolated. d. Replace the SCSI cables. e. Replace the PCI riser card. f. Replace the SCSI backplane. g. Replace the intended boot device. h. Replace the CEC backplane. 2. Go to the Task Selection Menu and select Display Configuration and Resource List. If the intended boot device is not listed, go to ²MAP 0290: Missing Resource Problem Resolution² in Diagnostic Information for Multiple Bus Systems. 3. If an SRN, not an 8-digit error code, is reported, go to Diagnostic Information for Multiple Bus Systems. 4. Choose one of the following, then go to step 5. v If the diagnostics are successful, and no other devices have been disconnected, it may be necessary to perform an operating system-specific recovery process, or reinstall the operating system. v If the diagnostics are successful, and devices have been removed, reinstall them one at a time. After each device is reinstalled, reboot the system. Continue this procedure until the failing device is isolated. Replace the failing device. 5. If you replaced the indicated FRUs and the problem is not corrected, or the above descriptions did not address your particular situation, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. If the problem has been corrected, go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. Step 4 It is possible that another installed adapter is causing the problem. Do the following: 1. Remove all installed adapters except the one the CD-ROM drive is attached to and the one used for the console. 2. Try to boot the standalone diagnostics again. 3. If unable to load standalone diagnostics, go to “Step 5” on page 177. 4. If standalone diagnostics load, reinstall adapters (and attached devices as applicable) one at a time and retry the boot operation until the problem recurs. Then replace the adapter or device that caused the problem. Go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. 176 Service Guide Step 5 The CD-ROM drive, IDE cable, graphics adapter (if installed), or the system board is most likely defective. A TTY terminal attached to the serial port also can be used to determine if the graphics adapter is causing the problem. This is done by removing the graphics adapter, attaching a TTY to the serial port, and retrying standalone diagnostics. If the standalone diagnostics load, replace the graphics adapter. 1. Replace the CD-ROM drive. 2. Replace the IDE cable. 3. Replace the system board. 4. If you replaced the indicated FRUs and the problem is still not corrected, or the above descriptions did not address your particular situation, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. 5. Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. Chapter 4. Checkpoints 177 178 Service Guide Chapter 5. Error Code to FRU Index Attention: Start your activity in this chapter by going to “Checkpoint and Error Code Index” on page 181. In this section, you may be asked to perform additional isolation procedures before looking up the actions for a 4-character checkpoint or 8-character error code. The Error Code to FRU Index lists fault symptoms and possible causes. The most likely cause is listed first. Use this index to help you decide which FRUs to replace when servicing the system. v If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables. v If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Note: Do not shut down the system if the FRU was replaced by hot swap. Performing Slow Boot On this system, the error code displayed in the operator panel may not indicate the root cause of the problem. To fully analyze all of the available information, perform the following steps before doing a hardware repair or replacement: 1. Record the 8-character error code (and location code if present) in the operator panel. 2. Power off the system using the white button. Refer to “Stopping the System without an HMC Attached” on page 359. 3. Select slow boot mode (select disable fast boot) on the system power control menu from the service processor main menu. 4. Do a slow-mode boot to the SMS menus. This boot can be specified using the System Power Control Menu on the service processor main menu. Refer to “System Power Control Menu” on page 314. (A fast-mode boot skips much of the built-in diagnostic testing.) A slow-mode boot may yield a new 8-character error code on the operator panel and new errors in the service processor error log. Power on the system. Refer to “Starting the System without an HMC Attached” on page 360. v If a new error code is reported, use it in subsequent steps to continue problem analysis. v If the system hangs with the same error code, continue with the next step in the Checkpoints and Error Codes Index. v If no 8-character error code is displayed in the operator panel, use the original error code and continue with the next step in the Checkpoints and Error Codes Index. 5. Power off the system. 6. At this point, the service processor menus should be available to examine the service processor error log. Four-Character Checkpoints Four-character progress codes and checkpoints in the form 8xxx, 9xxx, Axxx, Bxxx, and Exxx are listed in “Firmware Checkpoints” on page 150. If you have a four-character code that begins with any character other than 8, 9, A, B, or E, refer to the Diagnostic Information for Multiple Bus Systems. Replacing the Operator Panel Licensed programs frequently rely on system information that is stored in the VPD module on the operator panel. If the MAPs indicate that the operator panel should be replaced, the system VPD must be written into the VPD module on the new operator panel. See “System Vital Product Data (VPD) Update Procedure” on page 421. 179 Replacing the Network Adapter If a network adapter is replaced, the network administrator must be notified so that the client IP addresses used by the system can be updated. In addition, the operating system configuration of the network controller may need to be changed in order to enable system startup. Also check to ensure that any client or server that accesses this system is updated. Determining Location Code If you have an error code in the form 2xxx xxxx or Exxx, and no location code associated with it, the location code may possibly be found in three places, depending on the state of the system: v For systems with AIX installed; if the system will boot into AIX, the location code will be reported when error log analysis is run under AIX diagnostics. v If the system will boot into the system management services (SMS) menus, the SMS error log will contain the location code associated with the error code. See “View Error Log” on page 347. v If the system halted when the error occurred, the error code and location can be found in the LCD progress indicator log, which can be accessed using the service processor menus. See “System Information Menu” on page 317, then select Read Progress Indicators From Last System Boot. Find the error code in the list that directed you to this note, and the location code should be a part of the entry in the log. FRU Identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to the following: v Rack identify LED v CEC drawer identify LED v I/O drawer identify LED v RIO port identify LED v FRU identify LED – Power subsystem FRUs – CEC subsystem FRUs – I/O subsystem FRUs v I/O adapter identify LED v DASD identify LED The identify LEDs are arranged hierarchically, with the FRU identify LED at the bottom of the hierarchy, followed by the corresponding CEC or I/O drawer identify LED, and the corresponding rack identify LED to locate the failing FRU more easily. Any identify LED in the system can be flashed when the system is in the failed state with power on by using the service processor LED Control Menu contained in the “System Information Menu” on page 317. Any identify LED in the system can also be flashed by using the ²Identify and Attention Indicators² task through diagnostics. The procedure to operate the ²Identify and Attention Indicators² task in diagnostics is outlined in the Diagnostic Information for Multiple Bus Systems. 180 Service Guide If the service processor menus and the AIX diagnostics are not available, the FRU identify LEDs can be flashed by one of following: v If the system is configured as a full system partition, boot to the open firmware prompt. See the “System Power Control Menu” on page 314 for instructions on setting up the boot mode to enable the boot to the open firmware prompt. To display a menu that will allow you to turn on the desired FRU identify LED, type FRU-LED-MENU on the command line. v If the system is logically partitioned, the HMC must be attached. For instructions on activating and deactivating a FRU identify LED, see the Service Focal Point chapter of the Hardware Management Console Installation and Operations Guide, order number SA38-0590. Checkpoint and Error Code Index The following table is an index to the checkpoints or error codes that you may receive to help diagnose a failure. Locate the code you received, and follow the instructions to determine your next step. Checkpoint or Error Code What You Should Do 0000 xxxx Operator Panel Error Codes Go to “Operator Panel Error Codes” on page 182 and follow the instructions in the Operator Panel Error Code table. 1xxx xxxx System Power Control Network (SPCN) Error Codes Go to “SPCN Error Codes” on page 183 and follow the instructions in the SPCN Error Code table. 2xxx xxxx Firmware Error Codes Go to “Firmware Error Codes” on page 192 and follow the instructions in the Firmware Error Code table. 4xxx xxxx Service Processor Error Codes 1. Perform slow boot, if not already performed earlier. See “Performing Slow Boot” on page 179. 2. If the error codes are 450x xxxx, and if their actions call out memory DIMMs, go to “MAP 1240: Memory Problem Resolution” on page 68. 3. Go to “Service Processor Error Codes” on page 211 and follow the instructions in the Service Processor Error Codes table. 8xxx 9xxx Axxx Bxxx Service Processor Checkpoints Go to “Service Processor Checkpoints” on page 143 for more information on these checkpoints. A1FD 000x The system has entered recovery mode. Go to “Recovery Mode” on page 335. B006 xxxx B1xx xxxx Common Firmware Error Codes Go to “Common Firmware Error Codes” on page 286 and follow the instructions in the Common Firmware Error Codes table. Exxx Firmware Checkpoints Go to “Firmware Checkpoints” on page 150 for more information on these checkpoints. Mxxx xxxx Problem Determination-Generated Error Codes Go to “Problem Determination Generated Error Codes” on page 295 and follow the instructions in the Problem Determination Generated Error Codes table. Chapter 5. Error Code to FRU Index 181 Operator Panel Error Codes Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Table 2. Operator Panel Error Codes. Error Code Description Action / Possible Failing FRU 0000 1111 At power on, failure of power-good signal from power supply. 1. CEC backplane Location: U0.1-P1 2. CEC backplane power supply Locations: U0.1-V1 U0.1-V2 0000 1112 At power off, failure of power-good signal from power supply. 1. CEC backplane Location: U0.1-P1 2. CEC backplane power supply Locations: U0.1-V1 U0.1-V2 0000 1113 While power on, failure of power-good signal from power supply. 1. CEC backplane Location: U0.1-P1 2. CEC backplane power supply Locations: U0.1-V1 U0.1-V2 0000 1114 Operator panel and service processor have been reset after main power disturbance. Information message 0000 2222 Operator panel not communicating with service processor 1. If system cannot be powered off, reset the service processor (via the pinhole reset or by disconnecting then reconnecting power). 2. CEC backplane Location: U0.1-P1 0000 3333 SPCN communications error CEC backplane Location: U0.1-P1 0000 3334 I2C repeater error CEC backplane Location: UO.1-P1 0000 BBBB TOD battery malfunction TOD battery Location: U0.1-P1-V3 182 Service Guide SPCN Error Codes Attention: If a power supply or fan is hot-swapped, wait five minutes, then recheck the operating system error log. If the system has to be shut down to replace a power supply or fan, reboot the system, wait five minutes after the operating system login prompt appears, then recheck the operating system error log. If the detailed information for the most recent error with the same identifier indicates that the original problem has not been fixed by performing the first action in the action column, continue with the next action in the action column. Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Table 3. SPCN Error Codes for the CEC (system unit) Error Code Description Action/ Possible Failing FRU 1011 00AC AC loss or brownout detected Informational 1011 00EF Room emergency power off detected Informational 1011 1510 Power supply 1 power source failure 1. Check input power to the power supply 2. System power supply 1 Location: U0.1-V1 3. CEC backplane Location: U0.1-P1 1011 1511 Power supply 1 fault 1. Verify that the media support shelf is installed and properly seated. 2. System power supply 1 Location: U0.1-V1 3. CEC backplane Location: U0.1-P1 1011 1512 Power supply 1 non-power fault 1. System power supply 1 Location: U0.1-V1 2. CEC backplane Location: U0.1-P1 1011 1513 Power supply 1 domain fault 1. Verify that the media support shelf is installed and properly seated. 2. System power supply 1 Location: U0.1-V1 3. CEC backplane Location: U0.1-P1 1011 1520 Power supply 2 power source failure 1. Check input power to power supply 2 2. System power supply 2 Location: U0.1-V2 3. CEC backplane Location: U0.1-P1 1011 1521 Power supply 2 fault 1. System power supply 2 Location: U0.1-V2 2. CEC backplane Location: U0.1-P1 Chapter 5. Error Code to FRU Index 183 Table 3. SPCN Error Codes for the CEC (system unit) (continued) Error Code Description Action/ Possible Failing FRU 1011 1522 Power supply 2 non-power fault 1. System power supply 2 Location: U0.1-V2 2. CEC backplane Location: U0.1-P1 1011 1523 Power supply 2 domain fault 1. System power supply 2 Location: U0.1-V2 2. CEC backplane Location: U0.1-P1 1011 1B0A +12V/-12V load fault 1. Replace power supply 1 Location: U0.1-V1 2. If power supply 2 is present, use removed power supply 1 to replace power supply 2 3. CEC backplane Location: U0.1-P1 1011 1B0B 1.8V load fault 1. Replace power supply 1 Location: U0.1-V1 2. If power supply 2 is present, use removed power supply 1 to replace power supply 2 3. CEC backplane Location: U0.1-P1 1011 1B0C 3.3V/1.5V load fault 1. Replace power supply 1 Location: U0.1-V1 2. If power supply 2 is present, use removed power supply 1 to replace power supply 2 3. CEC backplane Location: U0.1-P1 1011 1B0D 2.5V load fault 1. Replace power supply 1 Location: U0.1-V1 2. If power supply 2 is present, use removed power supply 1 to replace power supply 2 3. CEC backplane Location: U0.1-P1 1011 1B0E +5V load fault 1. Replace power supply 1 Location: U0.1-V1 2. If power supply 2 is present, use removed power supply 1 to replace power supply 2 3. CEC backplane Location: U0.1-P1 1011 1B0F 1.5V/1.3V load fault 1. Replace power supply 1 Location: U0.1-V1 2. If power supply 2 is present, use removed power supply 1 to replace power supply 2 3. CEC backplane Location: U0.1-P1 1011 1F01 TMS log limit reached Call service support. 184 Service Guide Table 3. SPCN Error Codes for the CEC (system unit) (continued) Error Code Description Action/ Possible Failing FRU 1011 2600 Pgood fault CEC backplane Location: U0.1-P1 1011 2603 Pgood fault CEC backplane Location: U0.1-P1 1011 2612 Optical converter fault CEC backplane Location: U0.1-P1 1011 3100 I2C controller fault CEC backplane Location: U0.1-P1 1011 3105 I2C port 5 fault CEC backplane Location: U0.1-P1 1011 3114 PCI riser card fault 1. PCI riser card Location: U0.1-P2 2. CEC backplane Location: U0.1-P1 1011 3116 I2C VPD interface CEC backplane Location: U0.1-P1 1011 3118 SPCN VPD and NVS interface CEC backplane Location: U0.1-P1 1011 7610 System processor fan 1 failure 1. Processor fan 1 Location: U0.1-F1 2. CEC backplane Location: U0.1-P1 1011 7611 Processor card cooling fan missing from primary location 1. Processor fan 1 Location: U0.1-F1 2. CEC backplane Location: U0.1-P1 1011 7620 System processor fan 2 failure 1. Processor fan 2 Location: U0.1-F2 2. CEC backplane Location: U0.1-P1 1011 7621 System processor fan 2 missing 1. Processor fan 2 Location: U0.1-F2 2. CEC backplane Location: U0.1-P1 1011 7630 Slow system PCI fan 3 failure 1. PCI fan 3 Location: U0.1-F3 2. CEC backplane Location: U0.1-P1 1011 7631 System PCI fan 3 missing 1. PCI fan 3 Location: U0.1-F3 2. CEC backplane Location: U0.1-P1 1011 7640 Slow system PCI fan 3 failure 1. PCI fan 3 Location: U0.1-F3 2. CEC backplane Location: U0.1-P1 Chapter 5. Error Code to FRU Index 185 Table 3. SPCN Error Codes for the CEC (system unit) (continued) Error Code Description Action/ Possible Failing FRU 1011 7641 System PCI fan 3 missing 1. PCI fan 3 Location: U0.1-F3 2. CEC backplane Location: U0.1-P1 1011 7650 Slow system PCI fan 4 failure 1. PCI fan 4 Location: U0.1-F4 2. CEC backplane Location: U0.1-P1 1011 7651 System PCI fan 4 missing 1. PCI fan 4 Location: U0.1-F4 2. CEC backplane Location: U0.1-P1 1011 7660 System PCI fan 4 failure 1. PCI fan 4 Location: U0.1-F4 2. CEC backplane Location: U0.1-P1 1011 7661 System PCI fan 4 missing 1. PCI fan 4 Location: U0.1-F4 2. CEC backplane Location: U0.1-P1 1011 8400 VPD configuration Ignore all location codes. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 1011 8401 VPD access Ignore all location codes. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 1011 8402 VPD access-concentrator CEC backplane, U0.1-P1 1011 8403 VPD level mismatch 1. System VPD module, U0.1-L1-N1 2. CEC backplane, U0.1-P1 1011 8404 PU 2 mismatch 1. Processor card 2, Location: U0.1-P1-C2 2. CEC backplane Location: U0.1-P1 186 Service Guide Table 3. SPCN Error Codes for the CEC (system unit) (continued) Error Code Description Action/ Possible Failing FRU 1011 8409 No valid processor card VPD found 1. Verify that the processor card(s) is of the right type for your system and is properly installed. 2. If only one processor card is installed, replace it. If there are two processor cards installed, remove the second processor card. If the problem is resolved, replace that processor card. If it does not resolve the problem, swap the first processor card with the second, just removed, processor card. If this resolves the problem, replace the processor card that was originally in the first slot. If it does not resolve the problem, go to “MAP 1540: Problem Isolation Procedures” on page 108. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 1011 840A VPD +5 stuck high Ignore all location codes. Go to “MAP 1540: Problem Isolation Procedures” on page 108 and run minimum configuration. 1011 840B VPD +5 stuck low Ignore all location codes. Go to “MAP 1540: Problem Isolation Procedures” on page 108 and run minimum configuration. 1011 840C Memory DIMM misplugged. Inspect system memory for proper installation of memory DIMMs. 1011 840D SPCN configuration 1. Check cabling 2. Set the I/O type. See Power Control Network Utilities Menu on page 321. 3. CEC backplane Location: U0.1-P1 1011 840E SPCN default configuration loaded 1. Check cabling 2. Set the I/O type. See Power Control Network Utilities Menu on page 321. 3. CEC backplane Location: U0.1-P1 1011 840F I/O type not recognized 1. Check cabling 2. Set the I/O type. See Power Control Network Utilities Menu on page 321. 3. CEC backplane Location: U0.1-P1 1011 8413 Invalid processor 1 VPD Processor card Location: U0.1-P1-C1 1011 8414 Invalid processor 2 VPD Processor card Location: U0.1-P1-C2 1011 8423 No processor 1 VPD Processor card Location: U0.1-P1-C1 1011 8424 No processor 2 VPD Processor card Location: U0.1-P1-C2 Chapter 5. Error Code to FRU Index 187 Table 3. SPCN Error Codes for the CEC (system unit) (continued) Error Code Description Action/ Possible Failing FRU 1011 9280 SPCN interface failure 1. Check the SPCN cables going to the CEC drawer. 2. Check for system firmware updates. Apply the update if it is available. 3. Replace the SPCN cables going to the CEC drawer, locations: U0.1-P1/Q4#, U0.1-P1/Q5# 4. Replace the system backplane, location: U0.1-P1. 1011 9281 SPCN interface failure 1. Check the SPCN cables going to the CEC drawer. 2. Check for system firmware updates. Apply the update if it is available. 3. Replace the SPCN cables going to the CEC drawer, locations: U0.1-P1/Q4#, U0.1-P1/Q5# 4. Replace the system backplane, location: U0.1-P1. 1011 9282 SPCN interface failure 1. Check the SPCN cables going to the CEC drawer. 2. Check for system firmware updates. Apply the update if it is available. 3. Replace the SPCN cables going to the CEC drawer, locations: U0.1-P1/Q4#, U0.1-P1/Q5# 4. Replace the system backplane, location: U0.1-P1. Attention: If you replace FRUs or perform an action on an I/O subsystem and the problem is still not corrected, go to “MAP 1542: I/O Problem Isolation” on page 111 unless you were already directed to any MAP 154x by the error code. Otherwise call support if the action(s) for an error code do not resolve the problem. Attention: If the error code that brought you to this chapter originated in the operating system error log, please be aware that an error may require an additional 15 minutes to be propagated to the operating system error log. If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. The “nn” in “1nn1 xxxx”, where nn is between 2 and 3F hexadecimal, inclusive, is replaced by the I/O subsystem drawer number (in hexadecimal) in the error code that is reported by the system. The I/O drawer number in the location code of a failing FRU will be converted to decimal (which is shown as U0.dd in the following table). The “dd” value ranges from 02 to 63, inclusive. For example, the error code 11F1 1510 will have location codes U0.31-V1 and U0.31-P1 listed. 188 Service Guide Table 4. SPCN Error Codes for I/O Subsystems Error Code Description Action/ Possible Failing FRU 1nn1 00AC AC loss or brownout detected Informational message. 1nn1 00EF Remote emergency power off occurred Informational message. 1nn1 1510 Power supply 1 source failure 1. Check the input voltage to the power supply. 2. I/O subsystem power supply 1, U0.dd-V1. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 1511 Power supply 1 fault 1. I/O subsystem power supply 1, U0.dd-V1. 2. I/O subsystem backplane, U0.dd-P1. 1nn1 1512 Power supply 1 non-power fault 1. I/O subsystem power supply 1, U0.dd-V1. 2. I/O subsystem backplane, U0.dd-P1. 1nn1 1513 Power supply 1 domain fault 1. I/O subsystem power supply 1, U0.dd-V1. 2. I/O subsystem backplane, U0.dd-P1. 1nn1 1517 Power supply 1 thermal fault 1. Check for obstructions to the air flow around the I/O subsystem. 2. I/O subsystem power supply 1, U0.dd-V1. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 1520 Power supply 2 source failure 1. Check the input voltage to the power supply. 2. I/O subsystem power supply 2, U0.dd-V2. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 1521 Power supply 2 fault 1. I/O subsystem power supply 2, U0.dd-V2. 2. I/O subsystem backplane, U0.dd-P1. 1nn1 1522 Power supply 2 non-power fault 1. I/O subsystem power supply 2, U0.dd-V2. 2. I/O subsystem backplane, U0.dd-P1. 1nn1 1523 Power supply 2 domain fault 1. I/O subsystem power supply 2, U0.dd-V2. 2. I/O subsystem backplane, U0.dd-P1. 1nn1 1527 Power supply 2 thermal fault 1. Check for obstructions to the air flow around the I/O subsystem. 2. I/O subsystem power supply 2, U0.dd-V2. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 1B01 +12 V load fault 1. Replace power supply 1, U0.dd-V1. 2. If power supply 2 is present, use power supply 1 that was removed to replace power supply 2. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 1B02 +5 V load fault 1. Replace power supply 1, U0.dd-V1. 2. If power supply 2 is present, use power supply 1 that was removed to replace power supply 2. 3. I/O subsystem backplane, U0.dd-P1. Chapter 5. Error Code to FRU Index 189 Table 4. SPCN Error Codes for I/O Subsystems (continued) Error Code Description Action/ Possible Failing FRU 1nn1 1B03 +3.3 V load fault 1. Replace power supply 1, U0.dd-V1. 2. If power supply 2 is present, use power supply 1 that was removed to replace power supply 2. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 1B04 -12 V load fault 1. Replace power supply 1, U0.dd-V1. 2. If power supply 2 is present, use power supply 1 that was removed to replace power supply 2. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 2600 Pgood fault I/O subsystem backplane, U0.dd-P1. 1nn1 2603 Power domain fault I/O subsystem backplane, U0.dd-P1. 1nn1 2605 Pgood 1.8 V fault I/O subsystem backplane, U0.dd-P1. 1nn1 2606 Pgood 2.5 V fault I/O subsystem backplane, U0.dd-P1. 1nn1 3100 I2C controller fault I/O subsystem backplane, U0.dd-P1. 1nn1 3101 I2C bus fault I/O subsystem backplane, U0.dd-P1. 1nn1 3102 I2C interface fault I/O subsystem backplane, U0.dd-P1. 1nn1 3103 I2C interface fault I/O subsystem backplane, U0.dd-P1. 1nn1 3104 I2C interface fault I/O subsystem backplane, U0.dd-P1. 1nn1 3110 SES module fault on SCSI backplane 1 Replace the SCSI backplane, location: U0.dd-P3 1nn1 3111 SES module fault on SCSI backplane 2 Replace the SCSI backplane, location: U0.dd-P4 1nn1 3113 PCI bridge chip fault I/O subsystem backplane, U0.dd-P1. 1nn1 3114 I2C fault I/O subsystem backplane, U0.dd-P1. 1nn1 3115 Operator panel fault I/O subsystem backplane, U0.dd-P1. 1nn1 3118 SPCN VPD fault I/O subsystem backplane, U0.dd-P1. 1nn1 3121 I2C VPD fault I/O subsystem backplane, U0.dd-P1. 1nn1 7610 Blower 1 fault Blower 1, U0.dd-F1. 1nn1 7611 Blower 1 missing Blower 1, U0.dd-F1. 1nn1 7620 Blower 2 fault Blower 2, U0.dd-F2. 1nn1 7621 Blower 2 missing Blower 2, U0.dd-F2. 1nn1 7630 Blower 3 fault Blower 3, U0.dd-F3. 1nn1 7631 Blower 3 missing Blower 3, U0.dd-F3. 1nn1 7640 Blower 4 fault Blower 4, U0.dd-F4. 1nn1 7641 Blower 4 missing Blower 4, U0.dd-F4. 1nn1 8400 Configuration ID conflict 1. Check the RIO and the SPCN cabling. 2. Set the I/O type using the service processor menus. 3. I/O subsystem backplane, U0.dd-P1. 190 Service Guide Table 4. SPCN Error Codes for I/O Subsystems (continued) Error Code Description Action/ Possible Failing FRU 1nn1 840D I/O type mismatch 1. Check the RIO and the SPCN cabling. 2. Set the I/O type using the service processor menus. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 840E I/O type mismatch 1. Check the RIO and the SPCN cabling. 2. Set the I/O type using the service processor menus. 3. I/O subsystem backplane, U0.dd-P1. 1nn1 8A00 The power controlling the CEC changed Informational message 1nn1 C62E Polling error I/O subsystem backplane, U0.dd-P1. Chapter 5. Error Code to FRU Index 191 Firmware Error Codes Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Table 5. Firmware Error Codes. Error Code Description Action/ Possible Failing FRU 20A8 0000 Insufficient information to boot. Verify the IP address. 20A8 0001 The client IP address is already in use by some other network device. Change IP address. 20A8 0002 Cannot get gateway IP address. Refer to 150 using code E174. 20A8 0003 Cannot get system hardware address. Refer to 150 using code E174. 20A8 0004 Bootup failed. Refer to 150 using code E175. 20A8 0005 File transmission (TFTP) failed. Check network connection, try again. 20A8 0006 Boot image too large Contact your next level of support for assistance. 20D0 0001 I/O subsystem has changed locations This is an informational message. The I/O subsystem at the location code associated with this message was found at another location code during a prior boot of the system. If an I/O subsystem has not been moved, do the following: 1. Boot the system and match the I/O subsystem location codes reported by the operating system to the subsystems attached to the CEC. Check the SPCN and RIO cabling to the I/O subsystems. If an cabling problem is found, correct it. 2. If no cabling problem is found, call your next level of support. 20D0 000F Self-test failed on a device, no error and/or location code information available. Refer to “Mapping AIX and Physical Location Codes” on page 28 to see if a location code is available for the device the error was reported against. 1. Replace device indicated by the location code (if available). 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108 20D0 0010 Self-test failed on a device, cannot locate package Contact your service support representative for assistance. 20D0 0011 Firmware RTAS attempt to allocate memory failed. Contact your next level of support for assistance. 20D0 0800 Firmware update failure Retry the flash update. 20D0 0801 System firmware update failure Retry the firmware update. 20D0 0997 Operating system terminated with error string Call service support. 192 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 20D0 0998 Operating system terminated with error string Call service support. 20D0 0999 Operating system terminated with error string Call service support. 20D0 0A01 Error initializing RTAS NVRAM 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A02 Error initializing RTAS debugging token 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A03 Error initializing interface/service processor access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A04 Error initializing interface/operator panel access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A05 Error initializing hardware access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A06 Error initializing interface/PCI access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A07 Error initializing interface/operating system access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A08 Error initializing interface/client_interface access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A09 Error initializing interface/NUMA access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0A Error initializing interface/interrupt access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. Chapter 5. Error Code to FRU Index 193 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 20D0 0A0B Error initializing interface/OFDT access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0C Error initializing interface/sensor access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0D Error initializing interface/sensors/led_table access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0E Error initializing interface/timebase access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A0F Error initializing interface/time-of-day access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A10 Error initializing interface/cache access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A11 Error initializing interface/power access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A12 Error initializing interface/hot-plug access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A13 Error initializing interface/log access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A14 Error initializing interface/EEH access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A15 Error initializing interface/error injection access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 194 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 20D0 0A16 Error initializing interface/handling access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A17 Error initializing interface/utility access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 0A18 Error initializing register softpatch access 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D0 9001 Subsystem information not found in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D0 9002 Subsystem machine type and model information not found in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D0 9003 Subsystem serial number information not found in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D0 9004 Subsystem machine type and model field is empty in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D0 9005 Subsystem serial number field is empty in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D0 9006 Duplicate subsystem serial number found in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D0 9007 Invalid subsystem machine type and model information in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D0 9008 Invalid subsystem serial number found in VPD 1. Power off, then power on the system. 2. If the error persists, call service support. 20D1 0001 Failed to send state-change message to service processor 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0002 Failed to disable heartbeat alarm 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0003 Failed to send boot status 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. Chapter 5. Error Code to FRU Index 195 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 20D1 0004 Failed to locate service processor device tree node 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0005 Failed to reboot system 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0006 Failed to reboot a partition 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0007 Failed to set PCI read/write permissions array 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0008 Failed to set PCI read/write permissions array for slot 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0009 Description: Missing or invalid subsystem serial number Action: 1. Check for system firmware updates. Apply update if available. 2. If a location code is shown with this error code, replace the part at that location. 3. Contact your next level of support for assistance. Note: The location code identifies the failing subsystem. 20D1 000A Failed to send boot failed message to service processor 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 000B Failed to reset PCI read/write permissions array 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 000C Failed to send LMB-TABLE-INFO mailbox repair 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 000D Function to get time-of-day failed. 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 196 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 20D1 000E Function to get time-of-day failed. 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 000F Hypervisor call to reset PHB or to check for drawer existence failed 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0010 Stack underflow from fiber channel adapter 1. Check for microcode updates for the fiber channel adapter. Apply an update if available. 2. Check for system firmware updates. Apply an update if available. 3. Contact your next level of support for assistance. 20D1 0011 PHB recovery was not successful. Open firmware received failure in phb-recovery-buff 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 20D1 0012 Attention: One of the I/O subsystems has been disconnected from the system. This is indicates that an I/O subsystem that was previously in the configuration was not found during the current boot. If an I/O subsystem has been configuration has not changed, do the following: 1. If not already booted, boot the system and match the I/O subsystem location codes reported by the operating system to the subsystems attached to the CEC. Check the SPCN and RIO cabling to the I/O subsystems. If an cabling problem is found, correct it. 2. Check for system firmware updates. Apply the update if available. 3. Contact your next level of support for assistance. 20E0 0001 Privileged-access password entry error. The password has been entered incorrectly. Retry installing the password. 20E0 0004 Battery drained or needs replacement 1. Replace battery. Location: U0.1-P1-V3 2. If problem persists, replace the CEC backplane Location: U0.1-P1 20E0 0009 Invalid password entered - system locked The password has been entered incorrectly 3 times. Turn off, then turn on the system unit, then enter the password correctly. Chapter 5. Error Code to FRU Index 197 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 20EE 0003 IP parameter requires 3 dots ².² Enter valid IP parameter. Example: 000.000.000.000 20EE 0004 Invalid IP parameter Enter valid IP parameter. Example: 000.000.000.000 20EE 0005 Invalid IP parameter (>255) Enter valid IP parameter. Example: 255.192.002.000 20EE 0007 Keyboard not found 1. Plug in keyboard. 2. Replace the CEC backplane. Location: U0.1-P1 20EE 0008 No configurable adapters were found in the system by the remote IPL menu in the SMS utilities. This warning occurs when the remote IPL menu in the SMS utilities cannot locate any LAN adapters that are supported by the remote IPL function. If a supported device is installed: 1. Replace the adapter. 2. Go to “MAP 1542: I/O Problem Isolation” on page 111. 20EE 0009 Unable to communicate with the service processor. Replace the CEC backplane Location: U0.1-P1. 20EE 000B The system was not able to find an operating system boot record on the devices listed in the boot list. This can be caused by the wrong devices being in the boot list, a problem that prevents the system from communicating with the device that contains the boot record, or the boot record being corrupted. Go to “Boot Problems/Concerns” on page 173. 20EE 000C Description: Pointer to the operating system not found in non-volatile storage. Action: Values normally found in non-volatile storage that point to the location of an operating system were not found. This can happen for two reasons, either your installed operating system does not support storing the values or some event occurred that caused the system to lose non-volatile storage information (drainage or replacement of the battery). If you are running AIX, this information can be reconstructed by running the bootlist command specifying the device that the operating system is installed on. Refer to your AIX documentation for the exact syntax and usage of the bootlist command. To boot the operating system so that the above-mentioned values can be reconstructed, power off the system and power it on again. This should cause the system to look for the operating system in the devices contained in the custom boot list or in the default boot list, depending on the condition of the system. If this is not successful, use the System Management Services Menus to modify the boot sequence (also known as boot list) to include devices that are known to contain a copy of the operating system. For example, select a hard disk known to have a copy of the operating system as the first and only device in the boot sequence (boot list) and attempt to boot again. If the problem persists, replace the CEC backplane. Location: U0.1-P1. 20EE 000E Unable to execute the open firmware method. 1. Reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 198 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 20EE 000F Unable to find the package node. 1. Reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0010 Unable to execute the root VPD method. 1. Reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0011 Unable to execute the VPD method. 1. Reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0012 Unable to execute the mem-dimm-VPD method 1. Reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0013 Unable to allocate memory for VPD. 1. Reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0014 Unable to allocate memory for location codes. The system firmware may be corrupted. 20EE 0100 Fcode resident on an I/O adapter is not supported on this system Informational message. The adapter’s Fcode driver is not supported on this system. Additional information may be available from service support. This error code only affects boot-time operation, not the adapter’s functionality under the operating system. 20FC 0020 Failed to retrieve power, packaging, and/or cooling VPD via SPCN for the corresponding subsystem. Service processor mailbox failed to get VPD. 1. Check the RIO cabling. 2. Check for system firmware updates. 3. Replace the part specified by the location code. 20FC 0021 Failed to retrieve power, packaging, and/or cooling VPD via SPCN for the corresponding subsystem. Service processor mailbox failed to get power, packaging, and cooling device VPD. 1. Check the RIO cabling. 2. Check for system firmware updates. 3. Replace the part specified by the location code. 20FC 0030 Corrupted power, packaging, and cooling device VPD format in the SPCN configuration table. 1. Check for system firmware updates. 2. Replace the part specified by the location code. 20FF 0001 Problem with VPD 1. Replace the part indicated by the location code. 2. If no location code is specified, call service support. Chapter 5. Error Code to FRU Index 199 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 2100 0xxx SCSI controller fault 1. Before replacing any system components, refer to the notes in error code 21A00001. 2. Replace the SCSI device. 2101 0010 The IDE device remained busy longer than time-out period allows. 1. Check for system firmware updates. 2. Call service support 21A0 0001 Description: SCSI DASD - test unit ready failed - hardware error Action: Note: 1. Before replacing any system components: a. Ensure that the controller and each device on the SCSI bus is assigned a unique SCSI ID. b. Ensure SCSI bus is properly terminated. c. Ensure SCSI signal and power cables are securely connected and not damaged. 2. The location code information is required to identify the ID of SCSI device failures as well as to indicate the location of the controller to which the device is attached. Check the system error logs to determine the location code information associated with the error code. a. Replace the SCSI device. b. Replace the SCSI cable. c. Replace the SCSI controller. 21A0 0002 SCSI DASD - test unit ready failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the media (removable media devices). 2. Replace the SCSI device. 21A0 0003 SCSI DASD - send diagnostic failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21A0 0004 SCSI DASD - send diagnostic failed - DevOfl cmd Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21E0 0001 SCSI tape - test unit ready failed - hardware error Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21E0 0002 SCSI tape - test unit ready failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the media. 2. Replace the SCSI device. 21E0 0003 SCSI tape - send diagnostic failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21E0 0004 SCSI tape - send diagnostic failed - DevOfl cmd Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 200 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 21ED 0001 SCSI changer - test unit ready failed - hardware error Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21ED 0002 SCSI changer - test unit ready failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the media. 2. Replace the SCSI device. 21ED 0003 SCSI changer - send diagnostic failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21ED 0004 SCSI changer - send diagnostic failed - DevOfl cmd Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21EE 0001 Undetermined SCSI device type - test unit ready failed - hardware error Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. If the missing SCSI devices are connected to the same backplane, replace the SCSI backplane. 4. Replace the SCSI controller. 21EE 0002 Undetermined SCSI device type - test unit ready failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the media (removable media devices). 2. Replace the SCSI device. 21EE 0003 Undetermined SCSI device type - send diagnostic failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21EE 0004 Undetermined SCSI device type - send diagnostic failed - DevOfl cmd Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21F0 0001 SCSI CD-ROM - test unit ready failed - hardware error Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21F0 0002 SCSI CD-ROM - test unit ready failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the media. 2. Replace the SCSI device. Chapter 5. Error Code to FRU Index 201 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 21F0 0003 SCSI CD-ROM - send diagnostic failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21F0 0004 SCSI CD-ROM - send diagnostic failed - DevOfl cmd Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21F2 0001 SCSI read/write optical - test unit ready failed - hardware error Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the SCSI device. 2. Replace the SCSI cable. 3. Replace the SCSI controller. 21F2 0002 SCSI read/write optical - test unit ready failed - sense data available Before replacing any system components, refer to the notes in error code 21A0 0001. 1. Replace the media. 2. Replace the SCSI device. 21F2 0003 SCSI read/write optical - send diagnostic failedsense data available Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 21F2 0004 SCSI read/write optical - send diagnostic failed- DevOfl cmd Before replacing any system components, refer to the notes in error code 21A0 0001. Replace the SCSI device. 2200 0001 PCI Ethernet BNC/RJ-45 or PCI Ethernet AUI/RJ-45 adapter internal wrap test failure Replace the adapter. See “Checkpoint and Error Code Index” on page 181 for location code information related to this error. Also see “Replacing the Network Adapter” on page 180. 2200 1001 10/100 Mbps Ethernet PCI Adapter internal wrap test failure Replace the adapter. See “Checkpoint and Error Code Index” on page 181 for location code information related to this error. Also see “Replacing the Network Adapter” on page 180. 2200 1002 10/100 Mbps Ethernet PCI adapter failure Replace the adapter. See “Checkpoint and Error Code Index” on page 181 for location code information related to this error. Also see “Replacing the Network Adapter” on page 180. 2201 0001 PCI Auto LANstreamer token ring adapter failed to complete hardware initialization. Replace the adapter. See “Checkpoint and Error Code Index” on page 181 for location code information related to this error. Also see “Replacing the Network Adapter” on page 180. 2201 1001 PCI token ring adapter failed to complete hardware initialization. Replace the adapter. See “Checkpoint and Error Code Index” on page 181 for location code information related to this error. Also see “Replacing the Network Adapter” on page 180. 202 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 25A0 0001 Cache L2 controller failure 1. Replace the processor card. Location: U0.1-P1-C1 2. Replace the CEC backplane assembly. Location: U0.1-P1. 25A1 0001 Cache L2 SRAM failure 1. Replace the processor card. Location: U0.1-P1-C1 25A8 0xxx Description: NVRAM problems Action: Errors reported against NVRAM can be caused by low battery voltage and (more rarely) power outages that occur during normal system usage. With the exception of the 25A80000 error, these errors are warnings that the NVRAM data content had to be re-established and do not require any FRU replacement unless the error is persistent. When one of these errors occurs, any system customization (for example, boot device list) information has been lost, the system may need to be reconfigured. If the error is persistent, replace CEC backplane Location: U0.1-P1 25A8 0000 Initialization failed, device test failed Refer to Action under error code 25A80xxx. 25A8 0100 NVRAM data validation check failed. Turn off, then turn on system unit and retry the operation before replacing any system component. Refer to Action under error code 25A80xxx. 25A8 0201 Unable to expand target partition - saving configuration variable. Refer to Action under error code 25A80xxx. 25A8 0202 Unable to expand target partition - writing error log entry. Refer to Action under error code 25A80xxx. 25A8 0203 Unable to expand target partition - writing VPD data. Refer to Action under error code 25A80xxx. 25A8 0210 Setenv/$Setenv parameter error - name contains a null character. Refer to Action under error code 25A80xxx. 25A8 0211 Setenv/$Setenv parameter error - value contains a null character. Refer to Action under error code 25A80xxx. 25A8 0998 Description: NVRAMRC script evaluation error - command line execution error Action: Execution of a command line within the nvram configuration variable “nvramrc” (script) resulted in a “throw” being executed. This script can be modified by the system firmware SMS utilities, the operating system, PCI adapter ROM code or utility, or an operator (via the open firmware script editing command nvedit). It may not be possible to resolve the problem without a detailed analysis of the nvram script, the current system configuration, and device tree contents. 1. The problem can be caused by a SCSI adapter, whose SCSI bus ID has been changed from the default setting, no longer appearing in the system. This can be caused either by removing a SCSI adapter, or a problem with a SCSI adapter. a. Select the SCSI ID utility from the SMS menu(s). 1) Verify the list of SCSI controllers/adapters. If the list is not correct, suspect a problem with the adapter(s) installed but not listed. 2) Select the option to Save the configuration information. 3) Restart the system. b. If the problem persists, boot the operating system and verify the SCSI bus IDs of any installed/available SCSI controllers (change as necessary), and restart the system. 2. Contact support support for further assistance. Chapter 5. Error Code to FRU Index 203 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 25A8 0999 NVRAMRC script evaluation error - stack unbalanced on completion This is a firmware debug environment error. There is no user action or FRU replacement for this error. 25A8 0A00 Unable to retrieve NVRAM LPAR table information Refer to the actions column for error code 25A8 0xxx. Memory Error Notes Use the location code obtained from the SMS Error Log utility (described in “Error Log Utility Procedure” on page 285) to identify which memory DIMM (or memory DIMM quad) the error is reported against. Memory error codes are in the form 25Cyyxxx where yy indicates memory DIMM present detect-bits (PD-bits) and xxx indicates the last 3 digits of the error code. See “Memory DIMM Present Detect Bits (PD-Bits)” on page 285 for information on how to interpret PD bits. 25B0 0001 No memory DIMMs detected. Go to “MAP 1240: Memory Problem Resolution” on page 68. 25B0 0004 Multiple memory DIMMs failed a memory test. Go to “MAP 1240: Memory Problem Resolution” on page 68. 25Cy y001 DIMM is not supported Replace unsupported DIMM(s). Note: Memory DIMMs must be replaced with a supported type of memory DIMM. If an unsupported type of memory DIMM is replaced with the same unsupported type, the error persists. See the ²Memory Error Notes² on page 204 for more information. 25Cy y002 Memory DIMM fails memory test. Go to “MAP 1240: Memory Problem Resolution” on page 68. 2602 0001 Description: Invalid PCI adapter vendor ID Action: Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the slot that the adapter is being moved to is available to the partition in which the adapter presently resides. 1. Move adapter to another slot (behind a different PCI bus). 2. Check for available firmware updates for adapter. Apply update if available. 3. Run AIX diagnostics on adapter. Replace if indicated. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was originally installed, location: U0.dd-P1. 204 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 2602 0002 Description: Invalid PCI adapter device ID Action: Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the slot that the adapter is being moved to is available to the partition in which the adapter presently resides. 1. Move adapter to another slot (behind a different PCI bus). 2. Check for available firmware updates for adapter. Apply update if available. 3. Run AIX diagnostics on adapter. Replace if indicated. 4. Replace adapter. 5. Check for system firmware updates. Apply if update available. 6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was originally installed, location: U0.dd-P1. 2602 0007 Description: PCI device I/O or memory space request could not be satisfied Action: 1. If the location code identifies a slot: a. Reseat the adapter and reboot the system. b. Replace the adapter. c. Check for system firmware update. Apply update if available. 2. If the location code identifies an I/O planar: a. Check for system firmware updates. Apply if available. b. Replace the I/O planar. 3. Call service support. 2602 0008 Description: PCI device Fcode evaluation error Action: 1. If the location code identifies a slot: a. Check for adapter firmware updates. Apply if available. b. Replace the adapter. c. Check for system firmware updates. Apply if available. 2. If the location code identifies an I/O planar: a. Check for system firmware updates. Apply if available. b. Replace the I/O planar. 3. Call service support. Chapter 5. Error Code to FRU Index 205 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 2602 0010 Description: PCI probe error, bridge in freeze state Action: If the location code identifies a slot, do the following: 1. Check for system firmware updates. Apply if available. 2. Check the cabling to the adapter, in particular, the adapters that have serial ports. Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. 3. Is the system running Linux? YES Go to The System is Running Linux on page 206. NO Check the PCI Adapter Placement Reference to see if the adapter is hot-pluggable. If it is not hot-pluggable, go to The System is Running Linux on page 206 because the system must be powered off to continue. If it is hot-pluggable, continue. 4. Use the hot plug service aid to reseat the card specified by the location code. See “Replacing a Hot-Pluggable PCI Adapter” on page 388. If reseating the adapter fixes the problem, go to MAP 410: ²Repair Checkout² in the Diagnostic Information for Multiple Bus Systems manual. If the problem is not resolved, continue. Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the slot that the adapter is being moved to is available to the partition in which the adapter presently resides. 5. Is there an empty slot available into which the adapter could be plugged? YES Use the hot plug task to move the adapter card to another slot behind a different PCI bridge. The “System Logic Flow” on page 21 identifies the PCI bridges and the associated slots. See “Hot-Pluggable PCI Adapter Removal” on page 385 when removing the adapter, and “Installing a Hot-Pluggable PCI Adapter” on page 392 when installing the card in the new slot. If the card is successfully reconfigured in the new slot (behind a different PCI bridge), the slot in which the card was originally plugged is bad. Replace the PCI riser card or I/O planar in which the card was installed, location: U0.1-P2 or U0.dd-P1. If the adapter does not reconfigure successfully in the new slot, replace the adapter. NO Replace the adapter. If the problem is not resolved, replace the PCI riser card or I/O planar in which the card was installed, location: U0.1-P2 or U0.dd-P1. 6. Go to “MAP 1542: I/O Problem Isolation” on page 111. If the location code does not identify a slot or PCI adapter, or no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 111. The System is Running Linux 206 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 1. Power off the system. See “Non-Hot-Pluggable PCI Adapter Removal” on page 381 and “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 383 for instructions on reseating the card. If reseating the adapter fixes the problem, go to ²MAP 410, Repair Checkout² in the Diagnostic Information for Multiple Bus Systems manual. Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the slot that the adapter is being moved to is in the partition in which the adapter presently resides. 2. Is there an empty slot available into which the adapter could be plugged? YES Move the adapter to another slot behind a different PCI bridge. The “System Logic Flow” on page 21 identifies the PCI bridges and the associated slots. See “Non-Hot-Pluggable PCI Adapter Removal” on page 381 when removing the adapter, and “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 383 when installing the card in the new slot. If the card is successfully reconfigured in the new slot (behind a different PCI bridge), the slot in which the card was originally plugged is bad. Replace the PCI riser card or I/O planar in which the card was installed, location: U0.1-P2 or U0.dd-P1. If the adapter does not reconfigure successfully in the new slot, replace the adapter. NO Replace the adapter. If the problem is not resolved, replace the PCI riser card or I/O planar in which the card was installed, location: U0.1-P2 or U0.dd-P1. 3. Go to “MAP 1542: I/O Problem Isolation” on page 111. If the location code does not identify a slot or PCI adapter, or no location code is specified, go to “MAP 1542: I/O Problem Isolation” on page 111. 2602 0011 Description: PCI probe error, bridge is not usable Action: If the location code identifies a slot: 1. Check the cabling to the adapter, in particular, the adapters that have serial ports. Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the slot that the adapter is being moved to is available to the partition in which the adapter presently resides. 2. Move adapter to another slot (behind a different PCI bus). 3. Check for available firmware updates for adapter. Apply update if available. 4. Replace the adapter. 5. Check for system firmware updates. Apply update if available. 6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was originally installed, location: U0.dd-P1. Chapter 5. Error Code to FRU Index 207 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 2602 0012 Description: PCI device run-time error, bridge in freeze state Action: If the location code identifies a slot: 1. Check the cabling to the adapter, in particular, the adapters that have serial ports. Serial ports may require null modems or special cabling configurations to avoid connecting driver outputs together. This may create a PCI slot power problem and force the adapter to be deconfigured. Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the slot that the adapter is being moved to is available to the partition in which the adapter presently resides. 2. Move adapter to another slot (behind a different PCI bus). 3. Check for available firmware updates for adapter. Apply update if available. 4. Replace the adapter. 5. Check for system firmware updates. Apply if update available. 6. If the card is in the system drawer, replace the PCI riser card, location: U0.1-P2. If the card is in an I/O subsystem, replace the I/O backplane that contains the slot in which the adapter was originally installed, location: U0.dd-P1. 2680 0Axy MX-PCI bridge BIST failure The problem is associated with bridge ²y² in I/O drawer ²x.² Replace the I/O backplane in I/O drawer ²x.² Location U0.x-P1 See “Determining Location Code” on page 180 or location code information related to this error. 2680 0Cxx Machine check occurred 1. Replace the part identified by the location code associated with this error code. 2. If the location code associated with this error code identifies an adapter, and replacing the adapter does not correct the problem, replace the PCI riser card Location: U0.1-P2 or U0.dd-P1 2680 0Dxx Machine check occurred; unable to isolate to a single device Go to “MAP 1540: Problem Isolation Procedures” on page 108 27A0 0001 Error obtaining external RIO cable IDs 1. Check the SPCN and RIO cabling going to the I/O subsystem specified by the location code. 2. Check for system firmware updates. 3. Replace the I/O subsystem planar specified by the location code. 208 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 2803 0xxx Description: Real-time-clock (RTC) errors Action: 1. Errors reported against the real-time clock can be caused by low battery voltage and (more rarely) power outages that occur during normal system usage. These errors are warnings that the real-time clock data content needs to be re-established and do not require any FRU replacement unless the error is persistent. When one of these errors occurs, the time and date information has been lost. To set or restore the time and date, use the operating system facility. 2. If the error is persistent, replace the battery. Location: U0.1-P1-V3 3. If the error is persistent after the battery is replaced, replace the CEC backplane. Location: U0.1-P1 2803 0001 RTC not updating - RTC initialization required 1. Set time and date. 2. Refer to Action under error code 2803 0xxx. 2803 0002 Bad time/date values 1. Set time and date. 2. Refer to Action under error code 2803 0xxx. 2803 0003 RTC not updating - not correctable Replace CEC backplane Location: U0.1-P1 2803 0004 RTC operating mode parameters (eg. data mode) changed 1. Set time and date. 2. Refer to Action under error code 28030001. 2803 0005 RTC battery error 1. Replace the battery. Note: Password, time, and date need to be set. 2. Refer to Action under error code 2803 0001. 2803 0010 The real-time-clock cannot be protected in the LPAR environment 1. Check for system firmware updates. Apply update if available. 2. Contact your next level of support for assistance. 2803 F003 Real-time-clock not updating - not correctable (detected by hypervisor) Replace the CEC backplane Location: U0.1-P1 2900 0002 Keyboard/mouse controller failed self-test Replace CEC backplane Location: U0.1-P1 29A0 0003 Keyboard not present/detected 1. Keyboard 2. Replace CEC backplane Location: U0.1-P1 29A0 0004 Keyboard stuck key detected 1. Keyboard 2. Replace CEC backplane Location: U0.1-P1 29B0 0004 Mouse not present/detected 1. Mouse 2. Replace CEC backplane Location: U0.1-P1 2B20 0011 Processor is manually disabled Use the service processor menus to re-enable the processor and reboot the system Chapter 5. Error Code to FRU Index 209 Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 2B20 0022 Processor is deconfigured by the system The processor has previously failed BIST or POST. Replace the processor card indicated by the location code associated with the error code in the error log. 2B20 0031 Processor is failing POST. Replace the processor card indicated by the location code associated with the error code in the error log. 2B20 0042 Unknown processor card Remove the unknown/unsupported processor card. 2B20 8880 Unrecoverable internal hardware error Replace the FRU indicated in the service processor error log entry. If the system has been rebooted, run AIX diagnostics and look for an operating system error log entry in which the description says cache or cache controller error. 2B20 8881 Software error Call service support. 2B20 8882 Software error Call service support. 2B20 8883 Software error Call service support. 2B20 8884 Unrecoverable processor subsystem error Replace the FRU indicated in the service processor error log entry. If the system has been rebooted, run AIX diagnostics and look for an operating system error log entry in which the description says cache or cache controller error. Replace the FRUs listed in this entry. 2B20 8885 Unrecoverable internal hardware error Run AIX diagnostics. Look for an operating system error log entry in which the description says I/O bus time out, access, or other error or I/O bridge/device internal error. Replace the FRUs listed in this entry. 2B20 8888 Operating system termination request received Informational message. 2B2x 00EE Description: An exception error has been reported on a global processor Action: The type of interrupt that caused the exception is specified by the x as follows: 0: Unknown interrupt 1: System reset interrupt (SRI) 2. Machine check interrupt (MCI) 3. Data storage interrupt (DSI) 4. Instruction storage interrupt (ISI) 5. External interrupt (EXI) 6. Alignment interrupt (ALI) 7. Program interrupt (SRI) 8. Floating unavailable interrupt (FUI) 1. Check for system firmware updates. Apply if available. 2. Call service support. 2BA0 0000 Service processor POST failure Replace CEC backplane Location: U0.1-P1 2BA0 0012 Service processor reports self test failure Replace CEC backplane Location: U0.1-P1 210 Service Guide Table 5. Firmware Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 2BA0 0013 Service processor reports bad NVRAM CRC Refer to the actions for error code 25A8 0xxx. 2BA0 0017 Service processor reports bad or low battery. 1. Replace the battery. 2. Refer to the actions for error code 25A8 0xxx. 2BA0 0021 Open firmware mailbox call to activate attention LED failed Call service support. 2BA0 0041 Service processor VPD is corrupted. Replace CEC backplane Location: U0.1-P1 2BA0 0071 VPD data is corrupted for processor card 1. Replace the processor card Location: U0.1-P1-C1 2BA0 0073 VPD data is corrupted for processor card 2. Replace the processor card Location: U0.1-P1-C2 2BA0 0080 Service processor hardware reset request failure 1. Check for system firmware updates. Apply if available. 2. Call service support. Service Processor Error Codes Attention: Follow the procedure defined in the “Checkpoint and Error Code Index” on page 181. If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Table 6. Service Processor Error Codes. Error Code Description Action/ Possible Failing FRU 4064 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 4064 0002 CEC backplane fault CEC backplane Location: U0.1-P1 4064 0003 Scan interface basic assurance tests detected processor fault Processor card Location: U0.1-P1-C1 4064 0004 Processor fault Processor card Location: U0.1-P1-C2 4064 000E Processor capacity card Operator panel Location: U0.1-L1 4064 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 4064 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 4064 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 4064 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 Chapter 5. Error Code to FRU Index 211 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4064 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 4064 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 4064 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 4064 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 4064 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 4064 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 4064 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 4064 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 4064 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 4064 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 4064 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 4064 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 4064 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4064 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4064 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 4064 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 4064 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 00D1 Informational message Informational message 4064 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 4064 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 212 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4064 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 4064 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 4064 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4064 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4064 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 4064 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 4064 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4064 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4064 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 4064 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4064 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4064 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4064 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 5. Error Code to FRU Index 213 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4064 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 40640 CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 4064 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 4064 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 4064 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 4064 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 4064 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 4064 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 4064 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 4064 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 214 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4064 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4064 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 4064 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4064 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 4065 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 4065 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 4065 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 4065 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 4065 000E Processor capacity card Operator Panel Location: U0.1-L1 4065 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 4065 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 4065 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 4065 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 4065 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 4065 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 4065 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 4065 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 4065 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 4065 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 4065 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 Chapter 5. Error Code to FRU Index 215 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4065 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 4065 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 4065 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 4065 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 4065 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 4065 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4065 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4065 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 4065 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 4065 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 00D1 Informational message Informational message 4065 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 4065 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 4065 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 4065 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 4065 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4065 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4065 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 216 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4065 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 4065 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4065 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4065 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 4065 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4065 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4065 0C11 Proc 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4065 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 4065 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 4065 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 4065 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. Chapter 5. Error Code to FRU Index 217 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4065 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 4065 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 4065 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 4065 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 4065 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 4065 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4065 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 4065 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4065 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 218 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4066 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 4066 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 4066 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 4066 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 4066 000E Processor capacity card Operator Panel Location: U0.1-L1 4066 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 4066 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 4066 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 4066 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 4066 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 4066 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 4066 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 4066 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 4066 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 4066 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 4066 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 4066 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 4066 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 4066 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 4066 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 4066 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 Chapter 5. Error Code to FRU Index 219 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4066 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4066 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4066 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 4066 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 4066 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 00D1 Informational message Informational message 4066 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 4066 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 4066 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 4066 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 4066 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4066 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4066 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 4066 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 4066 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 220 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4066 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4066 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 4066 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4066 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4066 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4066 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 4066 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 4066 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 4066 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 4066 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 4066 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 4066 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 4066 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. Chapter 5. Error Code to FRU Index 221 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4066 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 4066 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4066 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 4066 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4066 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 4067 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 4067 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 4067 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 4067 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 222 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4067 000E Processor capacity card Operator Panel Location: U0.1-L1 4067 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 4067 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 4067 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 4067 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 4067 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 4067 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 4067 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 4067 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 4067 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 4067 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 4067 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 4067 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 4067 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 4067 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 4067 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 4067 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 4067 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4067 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4067 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 Chapter 5. Error Code to FRU Index 223 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4067 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 4067 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 00D1 Informational message Informational message 4067 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 4067 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 4067 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 4067 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 4067 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4067 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4067 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 4067 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 4067 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4067 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4067 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 4067 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 224 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4067 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4067 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4067 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 4067 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 4067 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 4067 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 4067 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 4067 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 4067 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 4067 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 4067 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 4067 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 5. Error Code to FRU Index 225 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4067 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4067 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 4067 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4067 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 4068 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 4068 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 4068 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 4068 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 4068 000E Processor capacity card Operator Panel Location: U0.1-L1 4068 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 4068 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 4068 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 4068 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 4068 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 226 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4068 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 4068 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 4068 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 4068 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 4068 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 4068 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 4068 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 4068 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 4068 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 4068 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 4068 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 4068 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4068 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4068 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 4068 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 4068 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 00D1 Informational message Informational message 4068 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 4068 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 4068 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 Chapter 5. Error Code to FRU Index 227 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4068 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 4068 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4068 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4068 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 4068 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 4068 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4068 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4068 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 4068 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4068 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4068 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4068 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 228 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4068 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 4068 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 4068 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 4068 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 4068 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 4068 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 4068 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 4068 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 4068 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4068 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 5. Error Code to FRU Index 229 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4068 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 4068 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4068 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 4069 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 4069 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 4069 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 4069 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 4069 000E Processor capacity card Operator Panel Location: U0.1-L1 4069 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 4069 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 4069 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 4069 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 4069 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 4069 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 4069 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 4069 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 4069 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 4069 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 4069 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 4069 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 230 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4069 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 4069 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 4069 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 4069 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 4069 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4069 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 4069 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 4069 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 4069 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 00D1 Informational message Informational message 4069 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 4069 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 4069 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 4069 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 4069 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4069 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 4069 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 4069 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 Chapter 5. Error Code to FRU Index 231 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4069 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4069 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4069 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 4069 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 4069 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4069 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 4069 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 4069 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 4069 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 4069 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 4069 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 232 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4069 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 4069 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 4069 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 4069 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 4069 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 4069 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 4069 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 4069 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 406A 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 406A 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 Chapter 5. Error Code to FRU Index 233 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406A 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 406A 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 406A 000E Processor capacity card Operator Panel Location: U0.1-L1 406A 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 406A 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 406A 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 406A 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 406A 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 406A 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 406A 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 406A 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 406A 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 406A 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 406A 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 406A 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 406A 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 406A 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 406A 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 406A 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 406A 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 234 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406A 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406A 00B3 MCM/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 406A 00B4 MCM/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 406A 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 00D1 Informational message Informational message 406A 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 406A 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 406A 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 406A 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 406A 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406A 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406A 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 406A 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 406A 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406A 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. Chapter 5. Error Code to FRU Index 235 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406A 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 406A 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406A 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406A 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406A 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 406A 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 406A 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 406A 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406A 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 406A 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 406A 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 406A 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 406A 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 406A 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 236 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406A 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406A 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 406A 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406A 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 406B 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 406B 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 406B 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 406B 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 406B 000E Processor capacity card Operator Panel Location: U0.1-L1 406B 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 Chapter 5. Error Code to FRU Index 237 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406B 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 406B 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 406B 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 406B 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 406B 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 406B 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 406B 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 406B 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 406B 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 406B 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 406B 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 406B 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 406B 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 406B 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 406B 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 406B 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406B 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406B 00B3 MCM/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 406B 00B4 MCM/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 406B 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 238 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406B 00D1 Informational message Informational message 406B 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 406B 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 406B 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 406B 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 406B 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406B 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406B 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 406B 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 406B 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406B 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406B 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 406B 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406B 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406B 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. Chapter 5. Error Code to FRU Index 239 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406B 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 406B 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 406B 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 406B 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406B 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 406B 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 406B 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 406B 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 406B 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 406B 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 240 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406B 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406B 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 406B 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406B 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 406C 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 406C 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 406C 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 406C 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 406C 000E Processor capacity card Operator Panel Location: U0.1-L1 406C 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 406C 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 406C 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 406C 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 406C 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 406C 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 406C 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 406C 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 Chapter 5. Error Code to FRU Index 241 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406C 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 406C 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 406C 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 406C 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 406C 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 406C 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 406C 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 406C 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 406C 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406C 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406C 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 406C 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 406C 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 00D1 Informational message Informational message 406C 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 406C 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 406C 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 406C 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 406C 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 242 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406C 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406C 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 406C 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 406C 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406C 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406C 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 406C 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406C 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406C 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406C 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 406C 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 406C 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. Chapter 5. Error Code to FRU Index 243 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406C 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406C 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 406C 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 406C 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 406C 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 406C 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 406C 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406C 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 406C 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 244 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406C 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 406D 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 406D 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 406D 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 406D 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 406D 000E Processor capacity card Operator Panel Location: U0.1-L1 406D 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 406D 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 406D 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 406D 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 406D 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 406D 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 406D 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 406D 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 406D 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 406D 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 406D 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 406D 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 406D 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 406D 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 406D 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 Chapter 5. Error Code to FRU Index 245 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406D 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 406D 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406D 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406D 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 406D 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 406D 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 00D1 Informational message Informational message 406D 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 406D 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 406D 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 406D 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 406D 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406D 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406D 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 406D 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 406D 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 246 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406D 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406D 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 406D 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406D 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406D 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406D 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 406D 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 406D 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 406D 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406D 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 406D 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 406D 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 406D 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. Chapter 5. Error Code to FRU Index 247 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406D 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 406D 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406D 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 406D 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406D 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 406E 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 406E 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 406E 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 406E 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 248 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406E 000E Processor capacity card Operator Panel Location: U0.1-L1 406E 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 406E 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 406E 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 406E 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 406E 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 406E 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 406E 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 406E 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 406E 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 406E 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 406E 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 406E 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 406E 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 406E 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 406E 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 406E 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 406E 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406E 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406E 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 Chapter 5. Error Code to FRU Index 249 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406E 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 406E 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 00D1 Informational message Informational message 406E 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 406E 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 406E 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 406E 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 406E 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406E 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406E 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 406E 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 406E 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406E 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406E 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 406E 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 250 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406E 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406E 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406E 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 406E 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 406E 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 406E 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406E 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 406E 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 406E 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 406E 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 406E 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 406E 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 5. Error Code to FRU Index 251 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406E 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406E 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 406E 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406E 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 406F 0001 Scan interface basic assurance tests detected a CEC backplane fault CEC backplane Location: U0.1-P1 406F 0002 CEC backplane fault 1. Check JTAG cable connections 2. CEC backplane Location: U0.1-P1 406F 0003 Scan interface basic assurance tests detected processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C1 406F 0004 Processor fault 1. Check JTAG cable connections. 2. Processor card Location: U0.1-P1-C2 406F 000E Processor capacity card Operator Panel Location: U0.1-L1 406F 0010 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M8 406F 0011 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M6 406F 0012 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M1 406F 0013 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M3 406F 0014 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M7 252 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406F 0015 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M5 406F 0016 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M2 406F 0017 Memory DIMM fault Memory DIMM Location: U0.1-P1-C1-M4 406F 0018 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M8 406F 0019 Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M6 406F 001A Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M1 406F 001B Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M3 406F 001C Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M7 406F 001D Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M5 406F 001E Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M2 406F 001F Memory DIMM fault Memory DIMM Location: U0.1-P1-C2-M4 406F 00B0 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406F 00B1 Bus controller failure 1. Check for system firmware updates 2. PCI riser card Location: U0.1-P2 406F 00B3 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 1 Location: U0.1-P1-C1 406F 00B4 Processor/VPD problem detected 1. Check for system firmware updates 2. Processor card 2 Location: U0.1-P1-C2 406F 00B7 Insufficient hardware resources were detected to continue IPL of the system after processor subsystem testing and initialization was completed Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 00D1 Informational message Informational message 406F 0100 Memory extender 0 on processor card 1 Processor card, Location: U0.1-P1-C1 406F 0101 Memory extender 2 on processor card 1 Processor card, Location: U0.1-P1-C1 406F 0130 Memory extender 0 on processor card 2 Processor card, Location: U0.1-P1-C2 Chapter 5. Error Code to FRU Index 253 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406F 0131 Memory extender 2 on processor card 2 Processor card, Location: U0.1-P1-C2 406F 0A03 Reboot message on VPD channel 3 processor card 1. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406F 0A04 Reboot message on VPD channel 4 processor card 2. Reboot is necessary for run-time array gard to take effect. Reboot the system in slow mode. Run diagnostics, then go to ²MAP 0235: System Array Self-Repair Problem Resolution² in the Diagnostic Information for Multiple Bus Systems. 406F 0B00 Problem with internal RIO interface (I/O slot 0, I/O port 0) CEC backplane, U0.1-P1 406F 0B01 Problem with internal RIO interface (I/O slot 0, I/O port 1) CEC backplane, U0.1-P1 406F 0B02 Problem with internal RIO interface (I/O slot 0, I/O port 2) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406F 0B03 Problem with internal RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406F 0B40 Problem with controller chip on CEC backplane 1. CEC backplane, U0.1-P1. 2. Call service support. 406F 0BF0 Problem with RIO interface (I/O slot 0, I/O port 3) 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems. 3. CEC backplane, U0.1-P1 4. Call service support. 406F 0C10 Processor 0 configuration error: VPD mismatch for processor card 0. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406F 0C11 Processor 1 configuration error: VPD mismatch for processor card 1. (Service processor code problem.) 1. Check for system firmware updates. 2. Call service support. 406F 0CA5 Invalid system configuration 1. Check that the processor cards and memory DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time. 3. Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0CA6 No usable memory left in the system. The system must be rebooted. Go to “MAP 1240: Memory Problem Resolution” on page 68. 254 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406F 0CA9 Incorrect/incompatible memory DIMMs are installed. Check that the memory DIMMs are properly configured, and are the right type for the system. 406F 0CAA System memory exceeds the limit for this system type and model. Remove the memory DIMMs until the memory does not exceed the limit for this system type and model. 406F 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406F 0EA1 Operating system code on main processor 1. Check for operating system code updates. 2. If there are other symptoms, go (one symptom at a time) to the “Quick Entry MAP” on page 58 and follow the instructions. 3. Call service support. 406F 0EA2 PRD code on service processor error. 1. Check for system firmware updates. 2. Call service support. 406F 0EA4 The service processor detected a problem, but was unable to isolate the source. Contact next level of support. 406F 0EAF Service processor code and system type are unknown 1. Check for system firmware updates. 2. Call service support. 406F 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 110. 406F 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB2 Attention line 1 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB3 Attention line 2 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB6 Attention line 5 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB7 Attention line 6 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB8 Attention line 7 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EBB Attention line 10 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. 406F 0EBC Attention line 11 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. Chapter 5. Error Code to FRU Index 255 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 406F 0EE1 Service processor firmware: invalid MRU to FRU mapping 1. Check for system firmware updates. 2. Call service support. 406F 0EE2 Service processor firmware fault 1. Check for system firmware updates. 2. Call service support. 406F 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 4503 26F6 Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4503 26F7 Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4503 26F8 Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 256 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 26FA Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4503 26FB Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4503 26FD Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. Chapter 5. Error Code to FRU Index 257 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 26FE Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4503 26FF Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4503 301A Description: 128 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 258 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 301B Description: 1 GB DIMM fault Action:This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 301C Description: 256 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 301D Description: 512 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. Chapter 5. Error Code to FRU Index 259 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 301E Description: 1 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 302E Description: 2 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 302F Description: 256 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 260 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 303B Description: 1 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 303C Description: 1 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 303D Description: 2 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. Chapter 5. Error Code to FRU Index 261 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 309A Description: 128 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 309B Description: 256 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 309C Description: 256 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 262 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 309D Description: 512 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 309E Description: 1 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4503 309F Description: 1 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. Chapter 5. Error Code to FRU Index 263 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4503 30AA Description: 2 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1. Go to the memory configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the DIMMs. If any of the DIMMs have a status of manually deconfigured, reconfigure the DIMMs and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the DIMMs. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If all of the DIMMs are still configured in the memory configuration/deconfiguration menu, the problem has been resolved. If any of the DIMMs have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the quad of memory containing the DIMM specified by the location code. The entire quad of memory must be replaced. 4504 26F6 1-way processor card fault Replace the processor card specified by the location code. 4504 26F7 2-way processor card fault Replace the processor card specified by the location code. 4504 26F8 1-way processor card fault Replace the processor card specified by the location code. 4504 26FA 2-way processor card fault Replace the processor card specified by the location code. 4504 26FB 2-way processor card fault Replace the processor card specified by the location code. 4504 26FD 1-way processor card fault Replace the processor card specified by the location code. 4504 26FE 2-way processor card fault Replace the processor card specified by the location code. 4504 26FF 1-way processor card fault Replace the processor card specified by the location code. 4504 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 264 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4504 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4504 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 26F6 1-way processor card fault Replace the processor card specified by the location code. 4505 26F7 2-way processor card fault Replace the processor card specified by the location code. 4505 26F8 1-way processor card fault Replace the processor card specified by the location code. 4505 26FA 2-way processor card fault Replace the processor card specified by the location code. 4505 26FB 2-way processor card fault Replace the processor card specified by the location code. 4505 26FD 1-way processor card fault Replace the processor card specified by the location code. 4505 26FE 2-way processor card fault Replace the processor card specified by the location code. 4505 26FF 1-way processor card fault Replace the processor card specified by the location code. 4505 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. Chapter 5. Error Code to FRU Index 265 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4505 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4505 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 26F6 1-way processor card fault Replace the processor card specified by the location code. 4506 26F7 2-way processor card fault Replace the processor card specified by the location code. 4506 26F8 1-way processor card fault Replace the processor card specified by the location code. 4506 26FA 2-way processor card fault Replace the processor card specified by the location code. 4506 26FB 2-way processor card fault Replace the processor card specified by the location code. 4506 26FD 1-way processor card fault Replace the processor card specified by the location code. 4506 26FE 2-way processor card fault Replace the processor card specified by the location code. 4506 26FF 1-way processor card fault Replace the processor card specified by the location code. 4506 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 266 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4506 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4506 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 26F6 1-way processor card fault Replace the processor card specified by the location code. 4507 26F7 2-way processor card fault Replace the processor card specified by the location code. 4507 26F8 1-way processor card fault Replace the processor card specified by the location code. 4507 26FA 2-way processor card fault Replace the processor card specified by the location code. 4507 26FB 2-way processor card fault Replace the processor card specified by the location code. 4507 26FD 1-way processor card fault Replace the processor card specified by the location code. 4507 26FE 2-way processor card fault Replace the processor card specified by the location code. 4507 26FF 1-way processor card fault Replace the processor card specified by the location code. 4507 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. Chapter 5. Error Code to FRU Index 267 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4507 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4507 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 26F6 1-way processor card fault Replace the processor card specified by the location code. 4508 26F7 2-way processor card fault Replace the processor card specified by the location code. 4508 26F8 1-way processor card fault Replace the processor card specified by the location code. 4508 26FA 2-way processor card fault Replace the processor card specified by the location code. 4508 26FB 2-way processor card fault Replace the processor card specified by the location code. 4508 26FD 1-way processor card fault Replace the processor card specified by the location code. 4508 26FE 2-way processor card fault Replace the processor card specified by the location code. 4508 26FF 1-way processor card fault Replace the processor card specified by the location code. 268 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4508 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4508 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 26F6 1-way processor card fault Replace the processor card specified by the location code. 4509 26F7 2-way processor card fault Replace the processor card specified by the location code. 4509 26F8 1-way processor card fault Replace the processor card specified by the location code. 4509 26FA 2-way processor card fault Replace the processor card specified by the location code. 4509 26FB 2-way processor card fault Replace the processor card specified by the location code. 4509 26FD 1-way processor card fault Replace the processor card specified by the location code. Chapter 5. Error Code to FRU Index 269 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4509 26FE 2-way processor card fault Replace the processor card specified by the location code. 4509 26FF 1-way processor card fault Replace the processor card specified by the location code. 4509 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4509 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 26F6 1-way processor card fault Replace the processor card specified by the location code. 450A 26F7 2-way processor card fault Replace the processor card specified by the location code. 450A 26F8 1-way processor card fault Replace the processor card specified by the location code. 450A 26FA 2-way processor card fault Replace the processor card specified by the location code. 270 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 450A 26FB 2-way processor card fault Replace the processor card specified by the location code. 450A 26FD 1-way processor card fault Replace the processor card specified by the location code. 450A 26FE 2-way processor card fault Replace the processor card specified by the location code. 450A 26FF 1-way processor card fault Replace the processor card specified by the location code. 450A 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450A 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 26F6 1-way processor card fault Replace the processor card specified by the location code. 450B 26F7 2-way processor card fault Replace the processor card specified by the location code. Chapter 5. Error Code to FRU Index 271 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 450B 26F8 1-way processor card fault Replace the processor card specified by the location code. 450B 26FA 2-way processor card fault Replace the processor card specified by the location code. 450B 26FB 2-way processor card fault Replace the processor card specified by the location code. 450B 26FD 1-way processor card fault Replace the processor card specified by the location code. 450B 26FE 2-way processor card fault Replace the processor card specified by the location code. 450B 26FF 1-way processor card fault Replace the processor card specified by the location code. 450B 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450B 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 272 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 450C 26F6 1-way processor card fault Replace the processor card specified by the location code. 450C 26F7 2-way processor card fault Replace the processor card specified by the location code. 450C 26F8 1-way processor card fault Replace the processor card specified by the location code. 450C 26FA 2-way processor card fault Replace the processor card specified by the location code. 450C 26FB 2-way processor card fault Replace the processor card specified by the location code. 450C 26FD 1-way processor card fault Replace the processor card specified by the location code. 450C 26FE 2-way processor card fault Replace the processor card specified by the location code. 450C 26FF 1-way processor card fault Replace the processor card specified by the location code. 450C 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. Chapter 5. Error Code to FRU Index 273 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 450C 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450C 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 26F6 1-way processor card fault Replace the processor card specified by the location code. 450D 26F7 2-way processor card fault Replace the processor card specified by the location code. 450D 26F8 1-way processor card fault Replace the processor card specified by the location code. 450D 26FA 2-way processor card fault Replace the processor card specified by the location code. 450D 26FB 2-way processor card fault Replace the processor card specified by the location code. 450D 26FD 1-way processor card fault Replace the processor card specified by the location code. 450D 26FE 2-way processor card fault Replace the processor card specified by the location code. 450D 26FF 1-way processor card fault Replace the processor card specified by the location code. 450D 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 274 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 450D 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450D 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 26F6 1-way processor card fault Replace the processor card specified by the location code. 450E 26F7 2-way processor card fault Replace the processor card specified by the location code. 450E 26F8 1-way processor card fault Replace the processor card specified by the location code. 450E 26FA 2-way processor card fault Replace the processor card specified by the location code. 450E 26FB 2-way processor card fault Replace the processor card specified by the location code. 450E 26FD 1-way processor card fault Replace the processor card specified by the location code. 450E 26FE 2-way processor card fault Replace the processor card specified by the location code. 450E 26FF 1-way processor card fault Replace the processor card specified by the location code. 450E 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. Chapter 5. Error Code to FRU Index 275 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 450E 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450E 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 26F6 1-way processor card fault Replace the processor card specified by the location code. 450F 26F7 2-way processor card fault Replace the processor card specified by the location code. 450F 26F8 1-way processor card fault Replace the processor card specified by the location code. 450F 26FA 2-way processor card fault Replace the processor card specified by the location code. 450F 26FB 2-way processor card fault Replace the processor card specified by the location code. 450F 26FD 1-way processor card fault Replace the processor card specified by the location code. 450F 26FE 2-way processor card fault Replace the processor card specified by the location code. 450F 26FF 1-way processor card fault Replace the processor card specified by the location code. 450F 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 276 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 450F 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 450F 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 68. 4604 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 4604 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 4605 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 4605 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 4606 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 4606 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 4607 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 4607 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 4608 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 4608 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 4609 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 4609 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 460A 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 460A 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 460B 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 Chapter 5. Error Code to FRU Index 277 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 460B 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 460C 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 460C 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 460D 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 460D 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 460E 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 460E 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 460F 287A CEC backplane Replace the CEC backplane, Location: U0.1-P1 460F 28C4 CEC backplane fault Replace the CEC backplane, Location: U0.1-P1 4B23 26F6 Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power down the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4B23 26F7 Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 278 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4B23 26F8 Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4B23 26FA Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4B23 26FB Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. Chapter 5. Error Code to FRU Index 279 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4B23 26FD Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4B23 26FE Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4B23 26FF Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1. Go to the processor configuration/deconfiguration menu, located under the system information menu in the service processor menus. Look at the status of the processors. If any of the processors have a status of manually deconfigured, reconfigure the processor(s) and boot the system to the SMS menus. Power off the system using the white power button and look again at the status of the processors. If all of the processors are still configured in the processor configuration/deconfiguration menu, the problem has been resolved. If any of the processors have a status of deconfigured by the system, either before or after the boot to the SMS menus, go to step 2. 2. Replace the processor card (on which the deconfigured processor resides) specified by the location code. 4B24 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B24 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B24 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B24 26FA 2-way processor card fault Replace the processor card specified by the location code. 280 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4B24 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B24 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B24 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B24 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B25 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B25 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B25 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B25 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B25 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B25 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B25 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B25 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B26 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B26 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B26 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B26 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B26 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B26 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B26 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B26 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B27 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B27 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B27 26F8 1-way processor card fault Replace the processor card specified by the location code. Chapter 5. Error Code to FRU Index 281 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4B27 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B27 26FB Description: 2-way processor card fault Action: 1. If the system has one 2-way processor card or two 2-way processor cards, look at word 13 in the service processor error log entry detail data. If word 13 is not 7xxxD023 or 7xxxD0E1, replace the processor card specified by the location code. If word 13 is 7xxxD023 or 7xxxD0E1, look for an earlier entry in the service processor error log of the form 4067xxxx, 4507xxxx, 4607xxxx, or 4B2726FB that has not already been corrected by a parts replacement. If you find an earlier entry, replace the FRU identified by that error, not the latest occurrence of 4B2726FB. 2. If this does not resolve the problem, call service support. 4B27 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B27 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B27 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B28 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B28 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B28 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B28 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B28 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B28 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B28 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B28 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B29 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B29 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B29 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B29 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B29 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B29 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B29 26FE 2-way processor card fault Replace the processor card specified by the location code. 282 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4B29 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B2A 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B2A 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B2A 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B2A 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B2A 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B2A 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B2A 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B2A 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B2B 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B2B 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B2B 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B2B 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B2B 26FB Description: 2-way processor card fault Action: 1. Replace the processor card at location U0.1-P1-C2. 2. If the problem is not resolved, remove the processor card at U0.1-P1-C1 and replace it with the processor card that was removed in the previous step. 3. If the problem is still not resolved, replace the CEC backplane, at location U0.1-P1. 4B2B 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B2B 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B2B 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B2C 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B2C 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B2C 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B2C 26FA 2-way processor card fault Replace the processor card specified by the location code. Chapter 5. Error Code to FRU Index 283 Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4B2C 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B2C 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B2C 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B2C 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B2D 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B2D 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B2D 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B2D 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B2D 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B2D 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B2D 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B2D 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B2E 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B2E 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B2E 26F8 1-way processor card fault Replace the processor card specified by the location code. 4B2E 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B2E 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B2E 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B2E 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B2E 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B2F 26F6 1-way processor card fault Replace the processor card specified by the location code. 4B2F 26F7 2-way processor card fault Replace the processor card specified by the location code. 4B2F 26F8 1-way processor card fault Replace the processor card specified by the location code. 284 Service Guide Table 6. Service Processor Error Codes. (continued) Error Code Description Action/ Possible Failing FRU 4B2F 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B2F 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B2F 26FD 1-way processor card fault Replace the processor card specified by the location code. 4B2F 26FE 2-way processor card fault Replace the processor card specified by the location code. 4B2F 26FF 1-way processor card fault Replace the processor card specified by the location code. Memory DIMM Present Detect Bits (PD-Bits) The following table expands on the Memory Error Notes found on page 204, where yy is the PD value in the table below and xxx are the last three digits of the error code. Use these values to identify the type of memory that generated the error. If you replace FRUs or perform an action, and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the Memory Error Notes found on page 204. PD value Size 28 512 MB 38 128 MB 68 256 MB 78 1024 MB 88 2048 MB Note: Memory DIMMs must be installed in quads. Error Log Utility Procedure When the System Management Services menu appears, check the error log for any errors. Use the location code obtained from the SMS error log utility to identify the component the error is reported against. To obtain the location code from the error log utility do the following: 1. From the SMS main menu, choose View Error Log. 2. If an error is logged, check the time stamp. 3. If the error was logged during the current boot attempt, record it. 4. Look up the error in the Chapter 5, “Error Code to FRU Index”, on page 179 and perform the listed action. 5. If no recent error is logged in the error log, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123. Chapter 5. Error Code to FRU Index 285 System Firmware Update Messages Error Code Description Action/ Possible Failing FRU A1FD 0000 Recovery mode entered Information message A1FD 0001 Service processor flash recovery Insert Diskette #1 A1FD 0002 Service processor flash recovery Insert Diskette #2 A1FD 0003 Service processor flash recovery Insert Diskette #3 A1FD 0004 Service processor flash recovery Insert Diskette #4 Scan Dump Messages Error Code Description Action/ Possible Failing FRU A100 3000 Successful completion of scan dump Informational message; may or may not be seen during a scan log dump. A1FF 3000 Scan dump has been disabled Informational message; may or may not be seen during a scan log dump. Common Firmware Error Codes Attention: Follow the procedure defined in “Checkpoint and Error Code Index” on page 181. If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 123 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to ²MAP 0410: Repair Checkout² in the Diagnostic Information for Multiple Bus Systems. Error Code Description Action/ Possible Failing FRU B006 1403 Missing service processor resource 1. Update system firmware 2. CEC backplane Location: U0.1-P1 B006 1404 Service processor firmware fault 1. Update system firmware 2. CEC backplane Location: U0.1-P1 B006 1405 Service processor firmware fault 1. Update system firmware 2. CEC backplane Location: U0.1-P1 B006 1406 Service processor program error 1. Update system firmware 2. Check for operating system updates 3. CEC backplane Location: U0.1-P1 B006 1407 Service processor resource allocation problem 1. Update system firmware 2. Check for operating system updates 3. CEC backplane Location: U0.1-P1 286 Service Guide Error Code Description Action/ Possible Failing FRU B006 1408 Service processor program error 1. Update system firmware 2. Check for operating system updates 3. CEC backplane Location: U0.1-P1 B006 1409 Service processor program error 1. Update system firmware 2. CEC backplane Location: U0.1-P1 B10F 1000 Service processor DRAM failure CEC backplane Location: U0.1-P1 B10F 1002 Service processor failure CEC backplane Location: U0.1-P1 B10F 1005 Service processor failure CEC backplane Location: U0.1-P1 B10F 1007 Service processor failure CEC backplane Location: U0.1-P1 B10F 1008 Service processor failure CEC backplane Location: U0.1-P1 B10F 100E Service processor boot flash corrupted CEC backplane Location: U0.1-P1 B10F 100F Service processor base flash corrupted CEC backplane Location: U0.1-P1 B10F 1300 Service processor failure CEC backplane Location: U0.1-P1 B10F 1370 Service processor flash update failure 1. Retry flash update 2. CEC backplane Location: U0.1-P1 B10F 1380 Service processor program failure 1. Update system firmware 2. CEC backplane Location: U0.1-P1 B10F 1381 Service processor flash error 1. Update system firmware 2. CEC backplane Location: U0.1-P1 B10F 1384 Service processor failure CEC backplane Location: U0.1-P1 B10F 1387 Service processor failure CEC backplane Location: U0.1-P1 B10F 1400 Service processor failure CEC backplane Location: U0.1-P1 B10F 1401 Service processor failure CEC backplane Location: U0.1-P1 B10F 1670 Service processor flash update failure 1. Retry flash update 2. CEC backplane Location: U0.1-P1 B10F 1672 Service processor firmware update invalid Call service support B10F 1675 Service processor failure CEC backplane Location: U0.1-P1 Chapter 5. Error Code to FRU Index 287 Error Code Description Action/ Possible Failing FRU B10F 1681 Service processor firmware update invalid Call service support B10F 1682 Service processor firmware update invalid Call service support B10F 1683 Service processor firmware update invalid Call service support B1xx 0013 A firmware update has failed because the update image is invalid. The update was stopped before any changes were made to the currently installed firmware. Install a valid firmware image for the specific type and model. B1xx 4600 Service processor failure CEC backplane Location: U0.1-P1 B1xx 4601 System processor failure Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4602 Service processor firmware failure 1. Do a slow boot (see “Performing Slow Boot” on page 179), then look at the service processor error log. Look for the first new error after the B1xx 4602. Go to the “Checkpoint and Error Code Index” on page 181 with the new error code and perform the action indicated. If there are no new error codes, go to “MAP 1540: Problem Isolation Procedures” on page 108. 2. Call service support. B1xx 4603 Service processor firmware corrupted 1. Reload system firmware 2. CEC backplane Location: U0.1-P1 B1xx 4606 Firmware failure 1. Check for system firmware updates 2. Operator panel Location: U0.1-L1 3. CEC backplane Location: U0.1-P1 B1xx 4608 Service processor early termination 1. Reset the system by disconnecting and reconnecting system power 2. Reload system firmware 3. CEC backplane Location: U0.1-P1 B1xx 460A TOD clock has been reset 1. Set TOD clock 2. CEC backplane Location: U0.1-P1 B1xx 460B TOD battery failure TOD battery Location: U0.1-P1-V3 B1xx 4611 Service processor failure 1. Processor card Location: U0.1-P1-C1 2. CEC backplane Location: U0.1-P1 288 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 4620 Modem configuration failure 1. Check modem configuration files 2. Check for system firmware update 3. Modem 4. CEC backplane Location: U0.1-P1 B1xx 4621 Modem disconnect failure 1. Check modem configuration files 2. Check for system firmware update 3. Modem 4. CEC backplane Location: U0.1-P1 B1xx 4622 Service processor failure 1. Operator panel Location: U0.1-L1 2. Check for system firmware update 3. CEC backplane Location: U0.1-P1 B1xx 4633 System power control network failure 1. Check for system firmware update 2. CEC backplane Location: U0.1-P1 B1xx 4634 SPCN loop fault 1. Check SPCN cabling 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4643 I2C card/flex cable detection error Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4644 Processor VPD system fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4645 I2C (Non-processor) VPD System Fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4646 General VPD fault Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4647 Write TM field in system VPD Replace the operator panel, location: U0.1-L1. Do not swap the VPD module from the old operator panel onto the new operator panel; leave the new, unprogrammed VPD module on the new operator panel. Call service support for instructions. B1xx 4648 CPU Backplane VPD or Power Source Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4649 Power supply VPD failure CEC power supply Location: U0.1-V1, U0.1-V2 B1xx 4650 A system processor failure has been detected by the service processor. All system processors have been reset in an attempt to recover operations. Information only. If the problem persists, check the error logs for other fault indications. If no other fault indications are found, go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4651 CPU VPD fault Processor card Location: U0.1-P1-C1, U0.1-P1-C2 Chapter 5. Error Code to FRU Index 289 Error Code Description Action/ Possible Failing FRU B1xx 4660 Memory subsystem failure Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4661 Memory card VPD failure Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4662 Memory DIMM VPD failure Go to “MAP 1540: Problem Isolation Procedures” on page 108. B1xx 4670 Processor drawer cooling 1. Check ambient temperature 2. Check air supply to fans 3. Check fans for proper installation and operation 4. Check for any unsupported cards (that may be too hot) 5. CEC backplane Location: U0.1-P1 B1xx 4671 Critical temperature in CEC drawer 1. Check ambient temperature. 2. Check air supply to fans. 3. Check fans for proper installation and operation. 4. Check for unsupported cards (which may be too hot) 5. CEC planar, U0.1-P1 B1xx 4672 Critical temperature in I/O subsystem 1. Check ambient temperature. 2. Check air supply to fans. 3. Check fans for proper installation and operation. 4. Check for unsupported cards (which may be too hot) 5. I/O subsystem planar, U0.dd-P1 B1xx 4681 JTAG scan interface 1. JTAG cabling 2. CEC backplane Location: U0.1-P1 3. Processor card Location: U0.1-P1-C1 B1xx 4682 Problem with system VPD on operator panel. 1. Replace the operator panel, location: U0.1-L1. Do not swap the old VPD module onto the new operator panel. Call service support for instructions on programming the new VPD module. 2. Call service support. 290 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 4690 Description: Operating system surveillance time out (AIX to service processor interface failure). Action: 1. This error code indicates that the operating system terminated early (which usually implies an operating system crash). This error code may appear in the service processor error log by itself. However, in the AIX error log, another error points to the cause of the operating system crash. Use the other error as the starting point for your service action. If the system will not boot to AIX, power off and look at the service processor error log. Look for error codes (with time stamps that correspond to the boot that was just attempted) that indicate hardware failures. If these error codes are found, record them, find the error code listed in the Chapter 5, “Error Code to FRU Index”, on page 179, and perform the indicated actions. 2. The other possibility is that the operating system was not found during a prior boot attempt. To determine if this occurred, do the following: a. Look at the AIX error log entry containing B1xx4690. This is a ²SCAN_ERROR_CHRP² error with an identifier of BFEC0425. Look at the AIX error log entry containing B1xx4690. This is a ²SCAN_ERROR_CHRP² error with an identifier of BFEC0425. b. In the detail data, find the string ²B1xx4690² (If present, it will be at byte 60 of the detail data.) c. Go forward 8 bytes after the ²B1² to byte 68, and look at bytes 68 and 69. If the values of bytes 68 and 69 are A2B0, this indicates that the firmware was unable to find a bootable device in the boot list that is set in the SMS menus. If the system is up, the boot list problem has been corrected and the B1xx 4690 can be treated as an informational message with no actions required. 3. Call service support. B1xx 4691 Description: System firmware to service processor interface failure. (System firmware surveillance time out) Action: 1. Check for system firmware updates. 2. Go to the service processor main menu and select System Information Menu. Then select Read Progress Indicators From Last System Boot. Begin your repair action with the error code or checkpoint immediately preceding B1xx 4691: If a location code displays with the error code or checkpoint, replace the part at that location. Note: If the location code is U0.1-P1/E1 or U0.1-P1/E2, change the PCI riser card (U0.1-P2) first, then change the CEC backplane (U0.1-P1). If no location code is specified, and you have an 8-character error code, go to the “Checkpoint and Error Code Index” on page 181. If no location code is specified, and you have a 4-character checkpoint (for example, Exxx), go to “Firmware Checkpoints” on page 150. 3. If your checkpoint is 91FF, perform the following: a. Replace the RIO cable attached to RIO-G port 0 (U0.1-P1/Qx) of the base system. b. Replace the RIO adapter card of the I/O subsystem attached to the other end of the RIO cable attached to RIO-G port 0 (U0.1-P1/Qx) of the base system. c. Replace the CEC backplane, location U0.1-P1. 4. If the problem is not resolved, call service support. B1xx 4692 Operator panel task interface failure 1. Check for system firmware updates. 2. Call service support. B1xx 4693 Service processor firmware error 1. Check for system firmware updates. 2. Call service support. B1xx 4694 System processor firmware failure 1. Check for system firmware updates. 2. CEC backplane Location: U0.1-P1 Chapter 5. Error Code to FRU Index 291 Error Code Description Action/ Possible Failing FRU B1xx 4695 System power control network to service processor interface failure 1. Check for system firmware updates. 2. CEC backplane Location: U0.1-P1 B1xx 4696 System firmware interface failure 1. Check for system firmware updates. 2. Call service support. B1xx 4698 Firmware problem 1. Check for system firmware updates. 2. Call service support. B1xx 4699 Service processor firmware: This is usually an indication of a problem in the communication path between the HMC and the service processor. It may only be an informational message. If the managed system is down, go to the service processor error log and find the error log entry containing B1xx 4699. Look at the first two bytes of word 13 of the detailed entry information. If the managed system is running, look at the AIX error log entry containing B1xx 4699. This is a ²SCAN_ERROR_CHRP² error with an identifier of BFE4C025. In the detail data, find the string ²B1xx 4699². (If present, it will be at byte 60 of the detail data.) Go forward 8 bytes after the ²B1² to byte 68 and look at bytes 68 and 69. Perform the following actions based on the following values of bytes 68 and 69 from the AIX error log entry, or on the first two bytes of word 13 from the service processor error log entry: 2306 No processor card is detected in slot one (U0.1-P1-C1); a processor card is required in the first slot for the system to boot. Actions: 1. If a processor card is not plugged into slot one (U0.1-P1-C1), plug one in. 2. If a processor card is plugged into slot one (U0.1-P1-C1), reseat it. If reseating the processor card does not fix the problem, replace it. A205: Machine type and model fields are not valid in the VPD module. Obtain an operator panel. Do not swap the old VPD module onto the new operator panel. Call service support for instructions on how to write the machine type and model into the new VPD module. A20B: Error requesting trace buffer for service processor. Actions: 1. Reset the service processor, if possible. 2. Check for system firmware updates. Apply the updates if they are available. A218: Unknown return code detected. Actions: Check for system firmware updates. Apply the updates if they are available. A21A: Error allocating an internal service processor memory space. Actions: 1. Reset the service processor, if possible. 2. Check for system firmware updates. Apply the updates if they are available. A800: HMC/service processor initialization failure. Actions: 1. Check for system firmware updates. 2. Replace the service processor, location: U0.1-P1. A801: HMC wrap failure. Actions: Replace the service processor, location: U0.1-P1. 292 Service Guide Error Code Description Action/ Possible Failing FRU B1xx 4699 (continued) (Continued from the previous page.) A806: Loss of the surveillance heartbeat between the HMC and the service processor. Actions: 1. Make sure that the HMC is booted and operational. 2. Check the serial cables that go from the HMC to the service processor, location: U0.1-P1. If there are no other error codes or indications of a problem, the A806 (loss of surveillance heartbeat) was a temporary condition and has been resolved; the B1xx 4699 code is then an informational message only. If the problem persists: 1. Check the serial cables connecting the HMC to the CEC backplane, location: U0.1-P1. 2. Run diagnostics on the serial port on the HMC. 3. Run diagnostics on the serial ports on the service processor. Values of A009 and A719 of bytes 68 and 69 in the AIX error log entry or the first two bytes of word 13 in the service processor error log entry are also informational entries: A009: The system received a power-off request at run time from the HMC. A719: Primary power failed; the system switched to battery backup power. For all other values of bytes 68 and 69, or the first two bytes of word 13, do the following: 1. Check for system firmware updates. 2. Reset the service processor by activating the pinhole reset switch on the operator panel. 3. Call service support. B1xx 469A System firmware interface 1. Check for system firmware updates. 2. Call service support. B1xx 469B Firmware to I/O interface 1. Check for system firmware updates. 2. CEC backplane Location: U0.1-P1 B1xx 469C Firmware to I/O interface 1. Check for system firmware updates. 2. CEC backplane Location: U0.1-P1 B1xx 469E Service processor firmware failure 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108, CEC minimum configuration 3. CEC backplane Location: U0.1-P1 B1xx 469F Service processor failure 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 108, CEC minimum configuration 3. CEC backplane Location: U0.1-P1 B1xx 8FF0 Informational message No action required. B1FD 0013 A firmware update has failed because the update image is invalid. The update was stopped before any changes were made to the currently installed firmware. Reset the service processor (by activating the pinhole reset switch or by disconnecting, then reconnecting power). Install a valid firmware update image for the specific system type and model. Chapter 5. Error Code to FRU Index 293 Error Code Description Action/ Possible Failing FRU B1FD 0015 Too many firmware update diskettes inserted (or wrong or faulty update image) 1. Ensure the firmware update image is correct for the specific system type and model. 2. Replace the update diskettes with a new set of firmware update diskettes. B1FD 0016 Diskette read error 1. The firmware diskette may be faulty. Try another diskette. 2. Replace the diskette drive. 3. Replace the CEC backplane Location: U0.1-P1 B1FD 001A Service processor recovery mode update fault Reset service processor (by activating pinhole reset switch or by disconnecting, then reconnecting power) B1FD 001E Service processor recovery mode update fault Wrong firmware update diskette (Insert correct diskette) B1FD 001F Bad firmware update diskette Replace diskette Scan Log Dump Progress Codes A scan dump is the collection of chip data that the service processor gathers after a system malfunction, such as a checkstop or hang. The scan dump data may contain chip scan rings, chip trace arrays, and SCOM contents. For more information on scan dumps, refer to “Scan Log Dump Policy” on page 312. The following table shows the error code produced by a scan dump. Error Code Description Action/Possible Failing FRU A1FF 3000 Scan log dump request rejected This is an informational message. The service processor firmware was not able to perform a scan log dump even though one was requested. D130 xxxx Scan log dump in progress Informational message only. As the dump progresses, the xxx characters in the display changes. When completed, the system may reboot, depending on the reboot policy settings. 294 Service Guide Problem Determination Generated Error Codes Table 7. Problem Determination Generated Error Codes Error Code Description Action/ Possible Failing FRU M0BT 0000 The system hung during speaker POST. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Go to “MAP 1540: Problem Isolation Procedures” on page 108. M0BT 0001 The system hung during “Starting Software...”. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Go to “MAP 1540: Problem Isolation Procedures” on page 108. M0KB D000 The system hung during keyboard POST. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. 1. Keyboard 2. CEC backplane Location: U0.1-P1 M0KB D001 The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Type 101 keyboard 1. Keyboard 2. CEC backplane Location: U0.1-P1 M0KB D002 The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Type 102 keyboard 1. Keyboard 2. CEC backplane Location: U0.1-P1 M0KB D003 The system did not respond to a keyboard entry. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. 1. Keyboard 2. CEC backplane Location: U0.1-P1 M0ME M002 The system hung during memory POST. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Go to “MAP 1540: Problem Isolation Procedures” on page 108. M0NE T000 The system hung during network POST. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Go to “MAP 1540: Problem Isolation Procedures” on page 108. M0PS 0000 Power failure. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Go to “MAP 1520: Power” on page 89. M0SC SI00 Unable to load diagnostics. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Go to “Boot Problems/Concerns” on page 173. M0SC SI01 Unable to load diagnostics. This error code is generated by the customer performing “Problem Determination” in the 286/C4 and 286/E4 User’s Guide. Go to “Boot Problems/Concerns” on page 173. Chapter 5. Error Code to FRU Index 295 296 Service Guide Chapter 6. Using the Online and Standalone Diagnostics The AIX diagnostics consist of online diagnostics and standalone diagnostics. Attention: To run online diagnostics on a partition, the AIX operating system must be installed on that partition. If the AIX operating system is not installed, use the standalone diagnostic procedures. Online diagnostics, when they are installed, reside with AIX in the file system. They can be booted: v In single user mode (referred to as service mode) v To run in maintenance mode (referred to as maintenance mode) v To run concurrently with other applications (referred to as concurrent mode) Standalone diagnostics must be booted before they can be run. If booted, the diagnostics have no access to the AIX error log or the AIX configuration data. Does the system have AIX diagnostics preinstalled? Yes If there is an HMC attached to the system, go to “Operating Considerations for Online and Standalone Diagnostics”. If an HMC is not attached to the system, go to “Running the Online Diagnostics with No HMC Attached” on page 304. No If there is an HMC attached to the system, go to “Standalone Diagnostics Operations” on page 301. If an HMC is not attached to the system, go to “Running the Standalone Diagnostics with No HMC Attached” on page 302. If standalone diagnostics will be run from a NIM server, go to “Running Standalone Diagnostics from a Network Installation Management (NIM) Server with an HMC Attached to the System” on page 302. If an HMC is not attached to the system, follow the directions for a full system partition. Operating Considerations for Online and Standalone Diagnostics Note: When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. Running online diagnostics in service mode ensures that the error state of the system that has been captured in NVRAM is available for your use in analyzing the problem. The AIX error log and certain SMIT functions are available only when diagnostics are run from the disk drive. Consider the following items before using the diagnostics: v Standalone diagnostics can run on systems configured for either a full (or single) system partition or a multiple partitioned system. When running in a multiple partitioned system, the device from which you are booting standalone diagnostics must be made available to the partition dedicated to running standalone diagnostics. This might require moving the device from the partition that currently contains the boot device (for example, the CD-ROM or network adapter connected to the NIM server that has a standalone diagnostic image) to the partition used to run standalone diagnostics. Unless the system supports dynamic logical partitioning, reboot both partitions if you move a device. For more information, see “Standalone Diagnostics Operations” on page 301. v When diagnostics are installed, the device support for some devices might not get installed. If this is the case, that device does not display in the diagnostic test list when running disk-based diagnostics. v When running diagnostics in a partitioned system, diagnostics will work only with the resources that were assigned to that partition. You must run diagnostics in the partition containing the resource that you want to test. 297 Identifying the Terminal Type to the Diagnostics When you run diagnostics, you must identify which type of terminal you are using. If the terminal type is not known when the FUNCTION SELECTION menu is displayed, the diagnostics do not allow you to continue until a terminal is selected from the DEFINE TERMINAL option menu. Undefined Terminal Types If you specify an undefined terminal type from the DEFINE TERMINAL option menu, the menu prompts the user to enter a valid terminal type. The menu redisplays until either a valid type is entered or you exit the DEFINE TERMINAL option. For a system attached to an HMC use VT320 emulation. If the AIX operating system is being used, from the command line, type: export term=vt320 Resetting the Terminal If you enter a terminal type that is valid (according to the DEFINE TERMINAL option menu) but is not the correct type for the Hardware Management Console (HMC) virtual terminal window being used, you may be unable to read the screen, use the function keys, or use the Enter key. To pass these difficulties, press Ctrl-C to reset the terminal. The screen display that results from this reset depends on the mode in which the system is being run: v Online Normal or Maintenance Mode - The command prompt displays. v Standalone Mode or Online Service Mode - The terminal type is reset to dumb, the Diagnostic Operating Instruction screen displays, and you are required to go through the DEFINE TERMINAL process again. Online Diagnostics Modes of Operation Notes: 1. When running online diagnostics on a partition in a partitioned system, diagnostics can be run only on resources that are allocated to that partition. 2. Online diagnostics can only be run in a partition with AIX installed. The online diagnostics can be run in the following modes: v Service Mode v Concurrent Mode v Maintenance Mode Service Mode Service mode provides the most complete checkout of the system resources. This mode also requires that no other programs be running on the partition or on a full system partition. All partition on a full system partition resources, except the SCSI adapter and the disk drives used for paging, can be tested. However, note that the memory and processor are only tested during POST, and the results of the POST tests are reported by diagnostics. Error-log analysis is done in service mode when you select Problem Determination on the DIAGNOSTIC MODE SELECTION menu. Running Online Diagnostics with an HMC Attached to the System Consider the following when you run the online diagnostics from a server or a disk: v The diagnostics cannot be loaded and run from a disk until the AIX operating system has been installed and configured. v The diagnostics cannot be loaded on a system (client) from a server if that system is not set up to boot from a server over a network. When the system is set up to boot from a server, the diagnostics are run in the same manner as they are from disk. 298 Service Guide v On a full system partition, if the diagnostics are loaded from disk or a server, you must shut down the AIX operating system before turning off the system unit to prevent possible damage to disk data. Do this in either of the following ways: – If the diagnostics were loaded in standalone mode, press the F3 key until DIAGNOSTIC OPERATING INSTRUCTIONS displays. Then press the F3 key once again to shut down the AIX operating system. – If the diagnostics were loaded in maintenance or concurrent mode, type the shutdown -F command. v Under some conditions, the system might stop, with instructions displayed on attached displays and terminals. Follow the instructions to select a console display. Running Online Diagnostics in Service Mode with an HMC Attached to the System To run the online diagnostics in service mode from the boot hard disk, do the following: 1. From the HMC, select Partition Manager. 2. Right-click on the mouse and select Open Terminal Window. 3. From the Service Processor Menu on the virtual terminal, select Option 2 System Power Control. 4. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot automatically enables slow boot. 5. Select Option 98 to exit the system power control menu. 6. Use the HMC to power on the managed system in a full system partition by selecting the managed system in the Contents area. 7. Right-click or select the desired system in the Contents area. Next, on the menu, choose Selected. 8. Select Power On. 9. Select the Power on Diagnostic Stored Boot list option. For more information about full system partitions, see “Full System Management Tasks” in the Hardware Management Console Maintenance Guide, order number SA38-0590. 10. Make sure that there are no media in the devices in the media subsystem. 11. Enter any passwords, if requested. Concurrent Mode Use concurrent mode to run online diagnostics on some of the system resources while the system is running normal activity. Because the system is running in normal operation, the following resources cannot be tested in concurrent mode: v SCSI adapters connected to paging devices v Disk drive used for paging v Some display adapters and graphics related devices v Memory (tested during POST) v Processor (tested during POST) The following levels of testing exist in concurrent mode: v The share-test level tests a resource while the resource is being shared by programs running in the normal operation. This testing is mostly limited to normal commands that test for the presence of a device or adapter. v The sub-test level tests a portion of a resource while the remaining part of the resource is being used in normal operation. For example, this test could test one port of a multiport device while the other ports are being used in normal operation. v The full-test level requires the device not be assigned to or used by any other operation. This level of testing on a disk drive might require the use of the varyoff command. The diagnostics display menus to allow you to vary off the needed resource. Chapter 6. Using the Online and Standalone Diagnostics 299 Error-log analysis is done in concurrent mode when you select Problem Determination on the DIAGNOSTIC MODE SELECTION menu. To run the online diagnostics in concurrent mode, you must be logged in to the AIX operating system and have proper authority to issue the commands (if help is needed, see the system operator). The diag command loads the diagnostic controller and displays the online diagnostic menus. Running the Online Diagnostics in Concurrent Mode To run online diagnostics in concurrent mode, do the following: 1. Log in to the AIX operating system as root user or use CE Login. 2. Enter the diag command. 3. When the DIAGNOSTIC OPERATING INSTRUCTIONS display, follow the instructions to check out the desired resources. 4. When testing is complete, press the F3 key to return to the DIAGNOSTIC OPERATING INSTRUCTIONS. Press the F3 key again to return to the AIX operating system prompt. Be sure to vary on any resource that you had varied to off. 5. Press Ctrl-D to log off from root user or CE Login. Maintenance Mode Maintenance mode runs the online diagnostics using the customer’s version of the AIX operating system. This mode requires that all activity on the partition running the AIX operating system be stopped so that the online diagnostics have most of the resources available to check. All of the system resources, except the SCSI adapters, memory, processor, and the disk drive used for paging, can be checked. Error log analysis is done in maintenance mode when you select Problem Determination on the DIAGNOSTIC MODE SELECTION menu. Use the shutdown -m command to stop all activity on the AIX operating system and put the AIX operating system into maintenance mode. The diag command is then used to invoke the diagnostic controller so you can run the diagnostics. After the diagnostic controller is loaded, follow the normal diagnostic instructions. Running the Online Diagnostics in Maintenance Mode To run the online diagnostics in maintenance mode, do the following: 1. Stop all programs running on the partition except the AIX operating system (if help is needed, see the system operator). 2. Log in to the AIX operating system as root user or use CE Login. 3. Type the shutdown -m command. 4. When a message indicates the system is in maintenance mode, enter the diag command. Note: It might be necessary to set TERM type again. 5. When DIAGNOSTIC OPERATING INSTRUCTIONS screen displays, follow the displayed instructions to check out the desired resources. 6. When testing is complete, press the F3 key to return to DIAGNOSTIC OPERATING INSTRUCTIONS. Press the F3 key again to return to the AIX operating system prompt. 7. Press Ctrl-D to log off from root user or CE Login. 300 Service Guide Standalone Diagnostics Operations Use standalone diagnostics to test the system when the online diagnostics are not installed and as a method of testing the disk drives that cannot be tested by the online diagnostics. Note: No error log analysis is done by the standalone diagnostics. The CD-ROM drive and the controller that controls it cannot be tested by the standalone diagnostics. Standalone diagnostics: v Are resident on CD-ROM or a Network Installation Management (NIM) server v Provide a method to test the system when the online diagnostics are not installed or cannot be loaded from the disk drive v Allow testing of the disk drives and other resources that cannot be tested by the online diagnostics v Do not have access to the AIX configuration data v Do not have access to the AIX error log v Do not allow for running of error log analysis Performing Slow Boot To fully analyze all of the available information, perform the following steps before doing a hardware repair or replacement: 1. Record the 8-character error code (and location code if present) in the operator panel or that was reported by the customer. 2. Do a slow-mode boot in service mode. This boot can be specified using the System Power Control Menu on the service processor main menu. (A fast-mode boot skips much of the built-in diagnostic testing.) A slow-mode boot may yield a new 8-character error code on the operator panel and new errors in the service processor error log. If a new error code is reported, use this code to continue problem analysis. Considerations for Running Standalone Diagnostics on a Partitioned Systems To run standalone diagnostics on a full system partition, you must reboot the entire system. However, for a partition in a partitioned system, you can boot standalone diagnostics either in a given partition or on the entire system (which is the same procedure as a full system partition). When running in a multiple partitioned system, the device from which you are booting standalone diagnostics must be made available to the partition dedicated to running standalone diagnostics. This might require moving the device from the partition that currently contains the boot device (for example, the CD-ROM or network adapter connected to the NIM server that has a standalone diagnostic image) to the partition used to run standalone diagnostics. Reboot both partitions if you move a device. Devices on a partitioned system are moved on an I/O-slot basis. The CD-ROM is attached to an integrated adapter in the base system. All integrated devices must be moved together from partition to partition. Considerations for Running Standalone Diagnostics from CD-ROM Consider the following when you run standalone diagnostics: v The diagnostic CD must remain in the CD-ROM drive for the entire time that diagnostics are running. v The diagnostic CD cannot be ejected from the CD-ROM drive after the diagnostics have loaded. The CD can only be ejected after the system has been turned off and then turned on (standalone mode) or after the diagnostics program has terminated (online concurrent mode). v The CD-ROM drive from which diagnostics were loaded cannot be tested. v The adapter (or circuitry) controlling the CD-ROM drive from which diagnostics were loaded cannot be tested. Chapter 6. Using the Online and Standalone Diagnostics 301 Loading the Standalone Diagnostics from CD-ROM Using an HMC To run standalone diagnostics in service mode from CD-ROM, do the following: 1. Stop all programs, including the operating system (get help if needed). 2. Remove all tapes, diskettes, and CD-ROMs. 3. Power off the system. For more information about full system partitions, see “Full System Management Tasks” in the Hardware Management Console Maintenance Guide, order number SA38-0590. 4. Right-click the mouse and select Open Terminal Window. 5. From the service processor menu on the virtual terminal, select option 2, System Power Control Menu. 6. Select option 6. Verify that the state changes to currently disabled. Disabling fast system boot automatically enables slow boot. 7. Select option 98 to exit the system power control menu. 8. Use the HMC to power on the managed server in a full system partition. Select the Power on Diagnostic Default Boot List. For more information about full system partitions, see “Full System Management Tasks” in the Hardware Management Console Maintenance Guide, order number SA38-0590. 9. Insert the CD-ROM into the system’s CD-ROM drive (not into the HMC’s CD-ROM drive). Running the Standalone Diagnostics with No HMC Attached To load standalone diagnostics from the default boot list, perform the following procedure: 1. Verify with the system administrator and users that all programs may be stopped, then do so. 2. Turn off the system. Note: If AIX is running, type the shutdown command. If Linux is running, type the shutdown now -h command. 3. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn on the system. 4. Immediately insert the diagnostic CD-ROM into the CD-ROM drive. 5. When or after the word keyboard appears during startup, press the numeric 5 key on the keyboard. 6. Enter any requested passwords. Running Standalone Diagnostics from a Network Installation Management (NIM) Server with an HMC Attached to the System A client system connected to a network with a Network Installation Management (NIM) server can boot standalone diagnostics from the NIM server if the client-specific settings on both the NIM server and client are correct. Notes: 1. All operations to configure the NIM server require root user authority. 2. If you replace the network adapter in the client, the network-adapter hardware-address settings for the client must be updated on the NIM server. 3. The Cstate for each standalone diagnostics client on the NIM server should be kept in the diagnostic boot has been enabled state. 4. On the client system, the NIM server network adapter should be put in the bootlist after the boot disk drive. This allows the system to boot in standalone diagnostics from the NIM server if there is a problem booting from the disk drive. For information about setting the bootlist, see the Multiboot section under “SMS” in the client system’s service guide. NIM Server Configuration Refer to the “Advanced NIM Configuration Tasks” chapter of the AIX Installation Guide and Reference, order number SC23-4389, for information on doing the following: 302 Service Guide v Registering a client on the NIM server v Enabling a client to run diagnostics from the NIM server To verify that the client system is registered on the NIM server and the diagnostic boot is enabled, run the lsnim -a Cstate -Z ClientName command from the command line on the NIM server. Refer to the following table for system responses. Note: The ClientName is the name of the system on which you want to run standalone diagnostics. System Response Client Status #name:Cstate: ClientName:diagnostic boot has been enabled: The client system is registered on the NIM server and enabled to run diagnostics from the NIM server. #name:Cstate: ClientName:ready for a NIM operation: or #name:Cstate: ClientName:BOS installation has been enabled: The client is registered on the NIM server but not enabled to run diagnostics from the NIM server. Note: If the client system is registered on the NIM server but Cstate has not been set, no data will be returned. 0042-053 lsnim: there is no NIM object named "ClientName" The client is not registered on the NIM server. Client Configuration and Booting Standalone Diagnostics from the NIM Server To run standalone diagnostics on a client from the NIM server, do the following: 1. Remove any removable media (tape or CD-ROM disc). 2. Stop all programs including the operating system (get help if needed). 3. If you are running standalone diagnostics in a full system partition, verify with the system administrator and system users that the system unit can shut down. Stop all programs, including the operating system. Refer to the operating system documentation for shutdown command information. In a partitioned system, make the CD-ROM drive available to the partition used to run standalone diagnostics (refer to the Hardware Management Console Maintenance Guide for more information). Verify with the system administrator and system users using that partition that all applications on that partition must be stopped, and that the partition will be rebooted. Stop all programs on that partition, including the operating system. 4. If you are in a full system partition, power on the system unit to run standalone diagnostics. In a partitioned system, reboot the partition to run standalone diagnostics. 5. When the keyboard indicator is displayed (the word keyboard), press the number 1 key on the keyboard to display the SMS menu. 6. Enter any requested passwords. 7. Select Setup Remote IPL (Initial Program Load). 8. Enter the client address, server address, gateway address (if applicable), and subnet mask. Exit to the Network Parameters screen. 9. If the NIM server is set up to allow pinging from the client system, use the ping utility in the RIPL utility to verify that the client system can ping the NIM server. Under the ping utility, choose the network adapter that provides the attachment to the NIM server to do the ping operation. If the ping returns with an OK prompt, the client is prepared to boot from the NIM server. If ping returns with a FAILED prompt, the client cannot proceed with the NIM boot. Chapter 6. Using the Online and Standalone Diagnostics 303 Note: If the ping fails, refer to “Boot Problems/Concerns” on page 173 and follow the steps for network boot problems. To do a one-time boot of the network adapter attached to the NIM server network, do the following: 1. Exit to the SMS Main screen. 2. Select Select Boot Options. 3. Select Install or Boot a Device. 4. On the Select Device Type screen, select Network. 5. Set the network parameters for the adapter from which you want to boot. 6. Exit completely from SMS. The system starts loading packets while doing a bootp from the network. Follow the instructions on the screen to select the system console. v If Diagnostics Operating Instructions Version x.x.x displays, standalone diagnostics have loaded successfully. v If the operating system login prompt displays, standalone diagnostics did not load. Check the following items: – The network parameters on the client may be incorrect. – Cstate on the NIM server may be incorrect. – Network problems might be preventing you from connecting to the NIM server. Verify the settings and the status of the network. If you continue to have problems, refer to “Boot Problems/Concerns” on page 173 and follow the steps for network boot problems. Running the Online Diagnostics with No HMC Attached Online diagnostics can be loaded by booting from a hard file in the custom boot list by pressing the numeric 6 key on the keyboard. To boot from the devices listed in the service mode boot list, perform the following procedure: 1. Verify with the system administrator and users that all programs may be stopped, then do so. 2. Turn off the system. Note: If AIX is running, type the shutdown command. 3. Wait until the Power-on LED stops flashing and remains off, then press the power button to turn the system on. 4. When the keyboard indicator appears during startup, press the numeric 6 key on the keyboard. 5. Enter any requested passwords. After any requested passwords have been entered, the system attempts to boot from the first device of each type found on the list. If no bootable image is found on the first device of each type on the list, the system does not search through the other devices of that type for a bootable image; instead, it polls the first device of the next type. If all types of devices in the boot list have been polled without finding a bootable image, the system restarts. This action gives the user the opportunity to start the System Management Services (by pressing the number 1 key on a directly attached keyboard or the number 1 on an ASCII terminal) before the system attempts to boot again. 304 Service Guide Chapter 7. Using the Service Processor Notes: 1. On some of the system management services (or service processor) screens, you will see the term LPAR. LPAR is equivalent to the term logical partitioned system or partitioned system. 2. The information in this chapter regarding the configuring of serial ports (and modems attached to those serial ports), applies only to the serial ports (S1 and S2) on the CEC backplane (location U0.1-P1). This information is not applicable to the serial ports (or modems attached to those serial ports), on the HMC. The service processor runs on its own power boundary and continually monitors hardware attributes and the environmental conditions within the system. The service processor is controlled by firmware and does not require the operating system to be operational to perform its tasks. The service processor menus allow you to configure service processor options, as well as enable and disable functions. Service processor menus are available using an ASCII terminal or an HMC virtual terminal window when OK is displayed on the operator panel or when the service processor has detected a system problem (such as a surveillance failure). Service Processor Menus The service processor menus are divided into the following groups: v General user menu - the user must know the general-access password. v Privileged user menus - the user must know the privileged-access password. If the system is powered off, the service processor menus can be accessed locally or remotely on the following: v Serial port 1 (S1) v Serial port 2 (S2) v HMC Accessing the Service Processor Menus Locally Service processor menus can be accessed by opening a virtual terminal window on the HMC, or by attaching an ASCII terminal to serial port 1 (S1) or serial port 2 (S2). After OK displays in the operator panel, press any key on the keyboard to signal the service processor. Note: The service processor menus cannot be accessed simultaneously on a virtual terminal window on the HMC and on an ASCII terminal. Accessing the menus on the HMC prevents access to the ASCII terminals and vice versa. When you gain access, the service processor prompts you for a password (if one is set), and when verified, displays the service processor menus. The service processor menu prompt, represented by 0> or 1>, indicates the serial port to which the terminal is connected. v An ASCII terminal can have the following prompts: – 0> indicates serial port 1 (S1) – 1> indicates serial port 2 (S2) v The HMC always displays a prompt of 0> 305 Accessing the Service Processor Menus Remotely If your system has a modem connected to serial port 1 or serial port 2 and is configured for call-in (see “Modem Configuration Menu” on page 326), the service processor menus can be accessed remotely as follows: 1. With the system powered off, call in from a remote terminal. 2. The service processor detects ring-indicate and prompts you for a password (if one is set). When verified, the service processor menus display remotely. Saving and Restoring Service Processor Settings All the settings that you make (except language) from the service processor menus can be backed up either for recovering from a fault that may corrupt these settings, or for replicating these settings to other servers that include a service processor. The service aid ²Save or Restore Hardware Management Policies² can be used to save your settings after initial setup or whenever the settings must be changed for system operation purposes. It is strongly recommended that you use this service aid for backing up service processor settings to protect the usefulness of the service processor and the availability of the system. Refer to “Save or Restore Hardware Management Policies,” in the “Introduction to Tasks and Service Aids” section of the Diagnostic Information for Multiple Bus Systems, order number SA23-2769. Menu Inactivity The service processor exits menu mode after ten minutes of inactivity and displays a message indicating that it has done so. Pressing any key on the virtual terminal window causes the main menu to display. General User Menu The menu options presented to the general user are a subset of the options available to the privileged user. The user must know the general-access password, if one is set, to access this menu. GENERAL USER MENU 1. Power-on System 2. Power-off System 3. Read VPD Image from Last System Boot 4. Read Progress Indicators from Last System Boot 5. Read Service Processor Error Logs 6. Read System POST Errors 99. Exit from Menus 0> v Power-on System Allows the user to start the system using the current virtual terminal window as the active console. v Power-off System This option is not available on this system. v Read VPD Image from Last System Boot Displays manufacturer vital product data, such as serial numbers, part numbers, and so on, that were stored from the system boot prior to the one in progress now, for the entire system. 306 Service Guide v Read Progress Indicators from Last System Boot Displays a number of the boot progress indicators, which may include service processor checkpoints, IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be useful in diagnosing system faults. Note: If you are running one or more logical partitions, enter the partition ID (0-15) to display progress indicators for that partition since the last system boot. If your system is running in Full System Partition mode, this option automatically displays details from partition 0. The progress indicator codes are listed from top (latest) to bottom (oldest). This information is not stored in nonvolatile storage. If the system is powered off using the power-on button on the operator panel, this information is retained. If the ac power is disconnected from the system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 340. v Read Service Processor Error Logs Displays the service processor error logs. For an example, refer to “Service Processor Error Logs” on page 339. v Read System POST Errors Displays additional error log information (this option is only for service personnel). v Exit from Menus Selecting this option will exit the service processor menus. You can re-enter the menus by pressing any key on the console. Privileged User Menus The following menus are available to privileged users only. The user must know the privileged-access password, if one is set, to access these menus. Main Menu A listing at the top of the main menu contains the following: v Your system’s current firmware version v The firmware copyright notice v The system name given to your system during setup You need the firmware version for reference when you either update or repair the functions of your service processor. The system name, an optional field, is the name that your system reports in problem messages. This name helps your support team (for example, your system administrator, network administrator, or service representative) to more quickly identify the location, configuration, and history of your system. Set the system name, from the main menu, using option 6. Chapter 7. Using the Service Processor 307 Note: The information under the Service Processor Firmware heading in the following Main Menu illustration is example information only. Service Processor Firmware VERSION: RH011007 Copyright 2001 IBM Corporation SYSTEM NAME MAIN MENU 1. Service Processor Setup Menu 2. System Power Control Menu 3. System Information Menu 4. Language Selection Menu 5. Call-In/Call-Out Setup Menu 6. Set System Name 99. Exit from Menus 0> v Service Processor Setup Menu See “Service Processor Setup Menu” on page 309 for more information. v System Power Control Menu See “System Power Control Menu” on page 314 for more information. v System Information Menu See “System Information Menu” on page 317 for more information. v Language Selection Menu See “Language Selection Menu” on page 324 for more information. v Call-In/Call-Out Setup Menu See “Call-In/Call-Out Setup Menu” on page 325 for more information. v Set System Name Allows setting of the system name. 308 Service Guide Service Processor Setup Menu The following Service Processor Setup Menu is accessed from the Main Menu: SERVICE PROCESSOR SETUP MENU 1. Change Privileged Access Password 2. Change General Access Password 3. Enable/Disable Console Mirroring: Currently Enabled 4. Start Talk Mode 5. OS Surveillance Setup Menu NOT supported in LPAR mode 6. Reset Service Processor 7. Reprogram Flash EPROM Menu 8. Serial Port Snoop Setup Menu NOT supported in LPAR mode 9. Scan Log Dump Setup Menu: Currently As Needed 98. Return to Previous Menu 99. Exit from Menus 0> Note: Unless otherwise stated in menu responses, settings become effective when a menu is exited using option 98 or 99. Passwords Passwords can be any combination of up to eight alphanumeric characters. You can enter longer passwords, but the entries are truncated to include only the first eight characters. The privileged-access password can be set from service processor menus or from System Management Services (SMS) utilities (see Chapter 8, “Using System Management Services”, on page 345). The general-access password can be set only from service processor menus. For security purposes, the service processor counts the number of attempts to enter passwords. The results of not recognizing a password within this error threshold are different, depending on whether the attempts are being made locally (at the system) or remotely (through a modem). The error threshold is three attempts. If the error threshold is reached by someone entering passwords at the system, the service processor commands the system to resume the initial program load (IPL). This action is taken based on the assumption that the system is in an adequately secure location with only authorized users having access. Such users must still successfully enter a login password to access the operating system. If the error threshold is reached by someone entering passwords remotely, the service processor commands the system to power off to prevent potential security attacks on the system by unauthorized remote users. The following table lists what you can access with the privileged-access password and the general-access password. Chapter 7. Using the Service Processor 309 Privileged Access Password General Access Password Resulting Menu None None Service processor MAIN MENU displays. Set None Users with the password see the service processor MAIN MENU. Users without password cannot log in. Set Set Users see menus associated with the entered password. If you forget the password, you must remove the battery for at least 30 seconds to disable the password. v Change Privileged-Access Password Set or change the privileged-access password. It provides the user with the capability to access all service processor functions. This password is usually used by the system administrator or root user. v Change General-Access Password Set or change the general-access password. It provides limited access to service processor menus, and is usually available to all users who are allowed to power on the system, especially remotely. Note: The general-access password can only be set or changed after the privileged-access password is set. v Enable/Disable Console Mirroring Note: Console mirroring is disabled in partitioned systems. Console mirroring is supported on serial port 1 (S1) and serial port 2 (S2). When console mirroring is enabled, the service processor sends information to all serial ports. The serial port from which console mirroring is enabled is referred to as the active port. The mirror port is determined when keyboard input is detected from one of the other ports. From this point on, the service processor sends information only to the active port and the mirror port. This capability can be enabled by local or remote users, providing local users with the capability to monitor remote sessions. Console mirroring can be enabled for the current session only. For more information, see “Console Mirroring” on page 338. v Start Talk Mode In a console-mirroring session, it is useful for those who are monitoring the session to be able to communicate with each other. Selecting Start Talk Mode activates the keyboards and displays for such communications while console mirroring is established. This is a full duplex link, so message interference is possible. Alternating messages between users works best. v OS Surveillance Setup Menu Note: This option is disabled in partitioned systems. This menu can be used to set up operating system (OS) surveillance. OS Surveillance Setup Menu 1. Surveillance: Currently Enabled 2. Surveillance Time Interval: 2 minutes 3. Surveillance Delay: 2 minutes 98. Return to Previous Menu 0> 310 Service Guide – Surveillance Can be set to Enabled or Disabled. – Surveillance Time Interval Can be set to any number from 2 through 255. – Surveillance Delay Can be set to any number from 0 through 255. Refer to “Service Processor System Monitoring - Surveillance” on page 336 for more information about surveillance. v Reset Service Processor If this option is selected, entering Y causes the service processor to reboot. v Reprogram Flash EPROM Menu This option requires a diskette drive to be installed in the system. If this option is selected, and no diskette drive is present in the system, the service processor will indicate ²Not Supported² on the screen. This option updates the system EPROMs. After entering Y to indicate that you want to continue, you are prompted to enter the update diskettes. Follow the instructions on the screen. When the update is complete, the service processor reboots. All system EPROMs that can be reprogrammed are updated at the same time and are as follows: – System power control network programming – Service processor programming – System firmware programming – Run-Time Abstraction Services v Serial Port Snoop Setup Menu Note: This option is disabled in partitioned systems. This menu can be used to set up serial port snooping, in which the user can configure serial port 1 as a ²catch-all² reset device. From the service processor main menu, select option 1, service processor setup menu, then select option 8 (Serial Port Snoop Setup Menu). SERIAL PORT SNOOP SETUP MENU 1. System reset string: Currently Unassigned 2. Snoop Serial Port: Currently Unassigned 98. Return to Previous Menu 1> Use the Snoop Serial Port option to select the serial port to snoop. Note: Only serial port 1 is supported. Chapter 7. Using the Service Processor 311 Use the system reset string option to enter the system reset string, which resets the machine when it is detected on the main console on Serial Port 1. After serial port snooping is correctly configured, at any point after the system is booted to the operating system, whenever the reset string is typed on the main console, the system uses the service processor reboot policy to restart. Because pressing Enter after the reset string is not required, make sure that the string is not common or trivial. A mixed-case string is recommended. v Scan Log Dump Policy A scan dump is the collection of chip data that the service processor gathers after a system malfunction, such as a checkstop or hang. The scan dump data may contain chip scan rings, chip trace arrays, and SCOM contents. The scan dump data are stored in the system control store. The size of the scan dump area is approximately 4 MB. During the scan log dump, A8xx (in the range A810 to A8FF) displays in the operator panel. The xx characters will change as the scan log dump progresses. If the xx characters do not change after five minutes, the service processor is hung and must be reset. When the scan log dump is complete, depending on how the reboot policy is set, the system will either: – Go to the standby state (and the service processor menus will be available), indicated by OK or STBY in the operator panel OR – Attempt to reboot. Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0> 1 Select from the following options: (As Needed=2, Always=3) Enter New Option: 0> The scan log dump policy can be set to the following: 2 = As Needed The processor run-time diagnostics record the dump data based on the error type. This is the default value. 3 = Always Selecting this option allows the service processor to record a scan log dump for all error types. The scan log dump policy can also be set from the Tasks menu in the AIX service aids. 312 Service Guide Option 2 displays the following screen: Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0> 2 Select from the following options: (As Requested=1, Optimum=2, Complete=3, Minimum=4) Enter New Option: 0> The scan log dump content can be set to the following: 1 = As Requested The processor run-time diagnostics will select the contents of the dump file based on the type of error that occurs. This is the default. 2 = Optimum The dump will include the smallest amount of information to diagnose a hardware error. 3 = Complete The dump will include as much information as possible to allow the complete analysis of hardware and software errors. 4 = Minimum The dump will include the smallest amount of information possible (a minimum number of hardware scan log rings). The complete dump will take the longest time to finish; it may take as long as 1.5 hours on a fully configured system. Option 3, Immediate Dump, can only be used when the system is in the standby state with power on. It is used to dump the system data after a checkstop or machine check occurs when the system firmware is running, or when the operating system is booting or running. The scan log dump content can also be set from the Tasks menu in the AIX diagnostic service aids. If a valid dump file already exists, the dump control code will stop because the contents of the prior dump must be protected. Chapter 7. Using the Service Processor 313 System Power Control Menu This menu is used to set power control options. Other menus that control boot options are available from this menu. SYSTEM POWER CONTROL MENU 1. Enable/Disable Unattended Start Mode: Currently Enabled 2. Ring Indicate Power-On Menu 3. Reboot/Restart Policy Setup Menu 4. Power-On System 5. Power-Off System 6. Enable/Disable Fast System Boot Currently Fast Boot 7. Boot Mode Menu 98. Return to Previous Menu 99. Exit from Menus 0> v Enable/Disable Unattended Start Mode Use this option to instruct the service processor to restore the power state of the system after a temporary power failure. Unattended start mode can also be set through the System Management Services (SMS) menus. This option is intended to be used on systems that require automatic power-on after a power failure. For more information, see “System Power-On Methods” on page 330. v Ring Indicate Power-On Menu RING INDICATE POWER-ON MENU 1. Ring indicate power-on : Currently Enabled 2. Number of rings: Currently 3 30. Refresh Modem Settings 98. Return to Previous Menu Ring indicate power-on is enabled by default on both serial port 1 (S1) and serial port 2 (S2). When ring indicate power-on is enabled, call-in is disabled. If ring indicate power-on is enabled and call-in is already enabled, you will be asked to confirm your choice. Refer to the message displayed on your screen. If the ring indicate power-on setting is changed, you must select option 30, Refresh Modem Settings to update the modem settings. If Refresh Modem Settings is selected, and the modem(s) have not been configured, you will be asked to configure the modems first. See “Call-In/Call-Out Setup Menu” on page 325 for information on configuring modems. Option 2 is used to set the number of rings. 314 Service Guide v Reboot/Restart Policy Setup Menu The following menu controls the Reboot/Restart Policy: Reboot/Restart Policy Setup Menu 1. Number of reboot attempts: Currently 1 2. Use OS-Defined restart policy? Currently No 3. Enable supplemental restart policy? Currently Yes 4. Call-Out before restart: Currently Disabled 98. Return to Previous Menu 0> Reboot is the process of bringing up the system hardware; for example, from a system reset or power on. Restart is activating the operating system after the system hardware is reinitialized. Restart must follow a successful reboot. – Number of reboot attempts - If the system fails to successfully complete the boot process, it attempts to reboot the number of times specified. Entry values equal to or greater than 0 are valid. Only successive failed reboot/restart attempts are counted. – Use OS-Defined restart policy - In a full system partition, this allows the service processor to react in the same way that the operating system does to major system faults by reading the setting of the operating system parameter Automatically Restart/Reboot After a System Crash. This parameter might already be defined, depending on the operating system (or its version or level). If the operating system automatic restart setting is defined, it can be set to respond to a major fault by restarting or by not restarting. See your operating system documentation for details on setting up operating system automatic restarts. The default value is No. – Enable supplemental restart policy - The default setting is Yes. When set to Yes in a full system partition, the service processor restarts the system when the system loses control as detected by service processor surveillance, and either: The Use OS-Defined restart policy is set to No. OR The Use OS-Defined restart policy is set to Yes, and the operating system has no automatic restart policy. – Call-Out before restart (Enabled/Disabled) - If a restart is necessary due to a system fault, and you are running a full system partition, you can enable the service processor to call out and report the event. This option can be valuable if the number of these events becomes excessive, which might signal a bigger problem. This setting is ignored on a partitioned system. Chapter 7. Using the Service Processor 315 v Power-On System Allows immediate power-on of the system. v Power-Off System This option is not available on this system. v Enable/Disable Fast System Boot Allows the user to select the IPL type, mode, and speed of the system boot. Attention: Selecting the fast IPL results in several diagnostic tests being skipped and a shorter memory test being run. v Boot Mode Menu Note: This option is disabled in partitioned systems. The Boot Mode Menu allows you to select a boot mode. Boot Mode Menu 1. Boot to SMS Menu: Currently Disabled 2. Service Mode Boot from Saved List: Currently Disabled 3. Service Mode Boot from Default List: Currently Disabled 4. Boot to Open Firmware Prompt: Currently Disabled 98. Return to Previous Menu 0> To select a boot mode, select a number and press Enter. The item corresponding to the selected number toggles between Disabled to Enabled. If a boot mode is Enabled, the boot mode selected is performed, and the Disabled/Enabled selection is reset to Disabled. The following describes each boot mode: – Boot to SMS Menu When this selection is enabled, the system boots to the System Management Services (SMS) Menu. – Service Mode Boot from Saved List This selection causes the system to perform a service mode boot using the service mode boot list saved in NVRAM. If the system boots AIX from the disk drive and AIX diagnostics are loaded on the disk drive, AIX boots to the diagnostics menu. Using this option to boot the system is the preferred way to run online diagnostics. – Service Mode Boot from Default List This selection is similar to Service Mode Boot from Saved List, except the system boots using the default boot list that is stored in the system firmware. This is normally used to try to boot customer diagnostics from the CD-ROM drive. Using this option to boot the system is the preferred way to run standalone diagnostics. – Boot to Open Firmware This option should only be used by service personnel to obtain additional debug information. When this selection is enabled, the system boots to the open firmware prompt. 316 Service Guide System Information Menu This menu provides access to system configuration information, error logs, system resources, and processor configuration. SYSTEM INFORMATION MENU 1. Read VPD Image from Last System Boot 2. Read Progress Indicators from Last System Boot 3. Read Service Processor Error Logs 4. Read System POST Errors 5. Read NVRAM 6. Read Service Processor Configuration 7. Processor Configuration/Deconfiguration Menu 8. Memory Configuration/Deconfiguration Menu 9. Power Control Network Utilities Menu 10. LED Control Menu 11. MCM/L3 Interposer Plug Count Menu Not Supported 12. Performance Mode Setup Menu 98. Return to Previous Menu 99. Exit from Menus 0> v Read VPD Image from Last System Boot Displays manufacturer’s vital product data (VPD), such as serial numbers, part numbers, and so on, that was stored from the system boot prior to the one in progress now. VPD from all devices in the system is displayed. v Read Progress Indicators from Last System Boot Displays a number of the boot progress indicators, which may include Service Processor checkpoints, IPL checkpoints, or AIX configuration codes, from the previous system boot. This information can be useful in diagnosing system faults. Note: If you are running from a partitioned system, enter the partition ID (0-15) to display progress indicators for that partition since the last system boot. In a full system partition, this option automatically displays details from partition 0. The progress indicator codes are listed from top (latest) to bottom (oldest). This information is not stored in nonvolatile storage. If the system is powered off using the power-on button on the operator panel, this information is retained. If the ac power is disconnected from the system, this information will be lost. For an example, refer to “LCD Progress Indicator Log” on page 340. v Read Service Processor Error Logs Displays error conditions detected by the service processor. Refer to “Service Processor Error Logs” on page 339 for an example of this error log. Chapter 7. Using the Service Processor 317 v Read System POST Errors This option should only be used by service personnel to obtain additional debug information. v Read NVRAM Displays Non Volatile Random Access Memory (NVRAM) content. v Read Service Processor Configuration Displays current service processor configuration. v Processor Configuration/Deconfiguration Menu Enable/Disable CPU Repeat Gard CPU repeat gard will automatically deconfigure a CPU during a system boot if a processor has failed BIST (power-on self-test), caused a machine check or check stop, or has reached a threshold of recoverable errors. The processor will remain deconfigured until repeat gard is disabled or the processor is replaced. The default is enabled. For more information, see “Configuring and Deconfiguring Processors or Memory” on page 335. Processor Hot Sparing This option is not available on this system. This menu allows the user to change the system processor configuration. If it is necessary to take one of the processors offline, use this menu to deconfigure a processor, and then reconfigure the processor at a later time. An example of this menu follows: PROCESSOR CONFIGURATION/DECONFIGURATION MENU 77. Enable/Disable CPU Repeat Gard: Currently Enabled 78. Enable/Disable Processor Hot Sparing (if available): Currently Disabled 1. 0 3.0 (00) Configured by system 2. 1 3.1 (00) Deconfigured by system 3. 24 4.0 (00) Configured by system 4. 25 4.1 (00) Configured by system 98. Return to Previous Menu 0> Note: This table is built from vital product data collected during the last boot sequence. The first time the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this table may be empty. The table is rebuilt during the next boot into the operating system. The fields of the previous table represent the following: Column 1 (1.) Menu selection index. Column 2 (0) Logical processor device number assigned by AIX. If the system is running AIX, you can display these logical device numbers by issuing the following command on the AIX command line: lsdev -C | grep proc Column 3 (3.0) Processor address list used by the service processor. Column 4 (00) Error status of the processors. 318 Service Guide The error status of each processor is indicated by AB, where B indicates the number of errors and A indicates the type of error according to the following: 1. Bring-up failure 2. Run-time non-recoverable failure 3. Run-time recoverable failure 4. Group integrity failure 5. Non-repeat-gardable error. The resource may be reconfigured on the next boot. A status of 00 indicates that the CPU has not had any errors logged against it by the service processor. To enable or disable CPU repeat gard, use menu option 77. CPU repeat gard is enabled by default. If CPU repeat gard is disabled, processors that are in the ²deconfigured by system² state will be reconfigured. These reconfigured processors are then tested during the boot process, and if they pass, they remain online. If they fail the boot testing, they are deconfigured even though CPU repeat gard is disabled. The failure history of each CPU is retained. If a processor with a history of failures is brought back online by disabling repeat gard, it remains online if it passes testing during the boot process. However, if repeat gard is enabled, the processor is taken offline again because of its history of failures. Notes: 1. The processor numbering scheme used by the service processor is different from the numbering scheme used by the operating system. Consult the operating system documentation before configuring or deconfiguring a processor to ensure that the correct processor is selected. 2. To determine the number of processors available to AIX, run the following command on the AIX command line: bindprocessor -q Note: The number of processor available to Linux can be determined by issuing the following command on the Linux command line: cat /proc/cpuinfo | grep POWER v Memory Configuration/Deconfiguration Menu Enable/Disable Memory Repeat Gard Memory repeat gard will automatically deconfigure a quad of memory during a system boot if a memory DIMM has failed BIST (power-on self-test), caused a machine check or check stop, or has reached a threshold of recoverable errors. The quad will remain deconfigured until repeat gard is disabled or the memory is replaced. The default is enabled. For more information, see “Configuring and Deconfiguring Processors or Memory” on page 335. Runtime Recoverable Error Repeat Gard The runtime recoverable error repeat gard flag controls the deallocation of the memory if a recoverable error occurs during runtime. If a recoverable memory error occurs, and runtime recoverable error repeat gard is disabled, the system will continue running with no change in the memory configuration. If a recoverable memory error occurs, and runtime recoverable error repeat gard is enabled, the memory quad in which the error occurred will be garded out (taken offline). The default is disabled. Chapter 7. Using the Service Processor 319 These menus allow the user to change the system memory configuration. If it is necessary to take one of the memory DIMMs offline, this menu allows you to deconfigure a DIMM, and then reconfigure the DIMM at a later time. When this option is selected, a menu displays. The following is an example of this menu: MEMORY CONFIGURATION/DECONFIGURATION MENU 77. Enable/Disable Memory Repeat Gard: Currently Enabled 78. Runtime Recoverable Error Repeat Gard: Currently Disabled 1. Memory card 98. Return to Previous Menu After you select the memory card option by entering 1, a menu displays, allowing the selection of a memory DIMM. The following is an example of this menu. MEMORY CONFIGURATION/DECONFIGURATION MENU 1. 3.16(00) Configured by system 2. 3.17(00) Configured by system 3. 3.18(00) Configured by system 4. 3.19(00) Configured by system 5. 4.24(00) Configured by system 6. 4.25(00) Configured by system 7. 4.26(00) Configured by system 8. 4.27(00) Configured by system 98. Return to Previous Menu Note: This table is built from vital product data collected during the last boot sequence. The first time the system is powered on, or after the system’s nonvolatile RAM (NVRAM) has been erased, this table may be empty. The table is rebuilt during the next boot into the operating system. The fields in the previous table represent the following: Column 1 1. Menu selection index/DIMM number Column 2 xx.xx : DIMM address used by service processor Column 3 (00) Error status The error status of the each memory DIMM is indicated by AB, where B indicates the number of errors and A indicates the type of error according to the following table: 1. Bring-up failure 2. Run-time non-recoverable failure 3. Run-time recoverable failure 4. Group integrity failure 5. Non-repeat-gardable error. The resource may be reconfigured on the next boot. An error status of 00 (for example, 3.16(00)) indicates that the memory DIMM has not had any errors logged against it by the service processor. 320 Service Guide To change the memory configuration, select the number of the memory DIMM. The memory DIMM state will change from configured to deconfigured or from deconfigured to configured. In the previous example menu, each line shows two DIMMs and indicates whether they are configured. To enable or disable Memory Repeat Gard, use menu option 77 of the Memory Configuration/Deconfiguration Menu. To enable or disable runtime recoverable error repeat gard, use option 78 of the memory configuration/deconfiguration menu. The failure history of each DIMM is retained. If a DIMM with a history of failures is brought back online by disabling Repeat Gard, it remains online if it passes testing during the boot process. However, if Repeat Gard is enabled, the DIMM is taken offline again because of its history of failures. v Power Control Network Utilities Menu POWER CONTROL NETWORK UTILITIES MENU 1. Lamp Test for all Operator Panels 2. Display I/O Type 3. Change I/O Type 98. Return to Previous Menu 0> – Lamp Test for All Operator Panels Selecting this option tests the media drawer operator panel’s indicators by causing them to blink on and off for approximately 30 seconds. – Display I/O Type This option is not available on this system. – Change I/O Type Use this option to change the I/O type after a service action or configuration change, if the I/O type is incorrect. If this option is chosen, you will be asked to make the following entries: 1. For the I/O drawer address, type 1. 2. If a 4-slot PCI riser has been replaced, or is present in the system, type A6 for the I/O type. If a 6-slot PCI riser has been replaced, or is present in the system, type A3 for the I/O type. If either value is not valid, a failure message displays on the console. Press Enter to return to the Power Control Network Utilities Menu. Chapter 7. Using the Service Processor 321 v LED Indicator Menu This menu displays the state of the processor subsystem disturbance/system attention LED and the fault/identify LED on the I/O subsystem(s). Use this menu to toggle the attention/fault LEDs between identify (blinking) and off. Option 1 is only available when the system is in the error state (the CEC is powered on and the service processor menus are available). It is not available when the system is in standby. An example of this menu follows: LED Indicator 1. Set/Reset Identify LED state 2. Clear System Attention Indicator 98. Return to Previous Menu Enter LED index 0 > If Option 1 is selected, a list of location codes of the I/O subsystems and the system unit drawer is shown. The screen is similar to the following: 1. U0.2-P1 2. U0.3-P1 Enter number corresponding to the location code, or press Return to continue, or ’x’ to return to the menu If one of the devices is selected using the index number, the present state of its LED is displayed, and you are given the option to toggle it as shown in the following example screen. The final state of the LED is then displayed whether or not its state was changed. U0.2-P1 is currently in the off state Select from the following (1=IDENTIFY ON, 2=IDENTIFY OFF) 0>2 Please wait U0.2-P1 is currently in the OFF state (Press Return to continue) Option 2, Clear System Attention Indicator, clears the attention indicator on the operator panel. The processor subsystem disturbance/LED is on the operator panel. The I/O drawer fault/identify LED is located on the front of each I/O subsystem. v MCM/L3 Interposer Plug Count Menu This option is not available on this system. 322 Service Guide v Performance Mode Setup Menu If certain types of processor cards are installed in the system, this menu is not available. For other types of processor cards, this menu is active after the first boot. Note: The first time the system is booted after NVRAM is cleared, Not Applicable displays under Performance Mode Setup Menu on the screen. This might also happen if the service processor is replaced, or the processor cards are upgraded. If Performance Mode Setup Menu is selected when Not Applicable displays the screen, the system responds with Not Applicable and redisplays the system information menu. The setup menu can be displayed after the performance mode is set, which occurs when the system is rebooted for the first time. The default performance mode is set by the firmware during IPL. The default mode provides the optimum performance for the hardware configuration of the system. The performance mode is systemwide; it cannot be set on a per-partition basis. You can override the default setting by using the Default Performance Mode menu, which will be similar to the following: Default Performance Mode: Standard Operation 1. Current Performance Mode: Standard Operation 98. Return to Previous Menu 0>1 Selecting option 1 displays the following performance modes: Select from the following options: 1. Large Commercial System optimization 2. Standard Operation 3. Turbo Database Mode 0> Notes: 1. Some processor card changes cause the default performance mode to change. This new setting is not reflected in the menu until after the system is rebooted with the new configuration. 2. If the NVRAM has been cleared, the default performance mode and the current performance mode will indicate uninitialized. Chapter 7. Using the Service Processor 323 If you want to override the default setting, a brief description of each performance mode follows: Large Commercial System Optimization Is the setting for systems that do not fall into the other two categories, standard operation and turbo database mode. This setting provides the optimum performance for most applications. Standard Operation Optimizes the system for large memory bandwidth applications where minimal sharing of data occurs and the likelihood of significant hardware data-prefetching exists. Turbo Database Mode Optimizes system operation for environments where there is a large amount of data sharing among processes running concurrently on the system. Language Selection Menu The service processor menus and messages are available in various languages. This menu allows selecting languages in which the service processor and system firmware menus and messages are displayed. LANGUAGE SELECTION MENU 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol 98. Return to Previous Menu 99. Exit from Menus 0> Note: Your virtual terminal window must support the ISO-8859 character set to correctly display languages other than English. 324 Service Guide Call-In/Call-Out Setup Menu Note: The information in this section regarding the configuring of serial ports, and modems attached to those serial ports, applies only to the serial ports (S1 and S2) on the CEC backplane (location U0.1-P1). These serial ports are normally used for call-in and call-out. Call-out is disabled in partitioned systems. The call-out function is normally handled by the Service Focal Point application running on the HMC. None of these menus are applicable to the serial ports, or modems attached to those serial ports, on the HMC. CALL-IN/CALL-OUT SETUP MENU 1. Modem Configuration Menu 2. Serial Port Selection Menu 3. Serial Port Speed Setup Menu 4. Telephone Number Setup Menu 5. Call-Out Policy Setup Menu 6. Customer Account Setup Menu 7. Call-Out Test NOT supported in LPAR mode 98. Return to Previous Menu 99. Exit from Menus 0> v Modem Configuration Menu, see “Modem Configuration Menu” on page 326. v Serial Port Selection Menu, see “Serial Port Selection Menu” on page 326. v Serial Port Speed Setup Menu, see “Serial Port Speed Setup Menu” on page 327. v Telephone Number Setup Menu, see “Telephone Number Setup Menu” on page 327. v Call-Out Policy Setup Menu, see “Call-Out Policy Setup Menu” on page 328. v Customer Account Setup Menu, see “Customer Account Setup Menu” on page 329. v Call-Out Test tests the configuration after the modem is installed and configured correctly (disabled in partitioned systems). Chapter 7. Using the Service Processor 325 Modem Configuration Menu Note: This option applies only to a modem attached to serial port 1 (S1) or serial port 2 (S2) on the CEC backplane. The first two lines of the Modem Configuration Menu contain status of the current selections. Selections are made in the sections labeled Modem Ports and Modem Configuration File Name. Select the serial port that you want to activate and then select the modem configuration file for the modem on the port. If you want to set up all of the serial ports with modems, make your selections one port at a time. Modem Configuration Menu Port 1 Modem Configuration File Name: Port 2 Modem Configuration File Name: To make changes, First select the port and then the configuration file name Modem Ports: 1. Serial port 1 2. Serial port 2 Modem Configuration File Name: 5. none9. modem_z_sp 6. modem_f_sp 10. modem_m1_sp 7. modem_f0_sp 11. modem_m0_sp 8. modem_f1_sp 12. modem_m1_sp 30. Save configuration to NVRAM and Configure modem 98. Return to Previous Menu 0> For information on choosing a modem configuration file, see “Sample Modem Configuration Files” on page 513 and “Transfer of a Modem Session” on page 517. Serial Port Selection Menu This menu allows you to enable or disable the call-in and call-out functions of each serial port in any combination. Note: Call-out is disabled in partitioned systems. Serial Port Selection Menu 1. Serial Port 1 Call-Out: Currently Disabled NOT supported in LPAR mode 2. Serial Port 2 Call-Out: Currently Disabled NOT supported in LPAR mode 3. Serial Port 1 Call-In: Currently Disabled 4. Serial Port 2 Call-In: Currently Disabled 98. Return to Previous Menu 0> 326 Service Guide Call-in and ring indicate power-on cannot be enabled at the same time. If ring-indicate power-on is already enabled and you try to enable call-in, a message prompts you for confirmation. Refer to the message displayed on the screen. Serial Port Speed Setup Menu This menu allows you to set serial port speed to enhance terminal performance or to accommodate modem capabilities. Serial Port Speed Setup Menu 1. Serial Port 1 Speed: Currently 9600 2. Serial Port 2 Speed: Currently 9600 98. Return to Previous Menu 0> A serial port speed of 9600 baud or higher is recommended. The following are valid serial port speeds: 50 600 4800 75 1200 7200 110 1800 9600 134 2000 19200 150 2400 38000 300 3600 57600 115200 Telephone Number Setup Menu Note: These menus are available, however, the function of calling out to report a system failure is normally handled by the Service Focal Point application running on the HMC. Use this menu to set or change the telephone numbers for reporting a system failure. Telephone Number Setup Menu 1. Service Center Telephone Number: Currently Unassigned 2. Customer Administration Center Telephone Number: Currently Unassigned 3. Digital Pager Telephone Number: Currently Unassigned 4. Customer Voice Telephone Number: Currently Unassigned 5. Customer System Telephone Number: Currently Unassigned 98. Return to Previous Menu 0> Chapter 7. Using the Service Processor 327 v Service Center Telephone Number is the number of the service center computer. The service center usually includes a computer that takes calls from servers with call-out capability. This computer is referred to as the catcher. The catcher expects messages in a specific format to which the service processor conforms. For more information about the format and catcher computers, refer to the readme file in the AIX /usr/samples/syscatch directory. Contact your service provider for the correct service center telephone number to enter. Until you have that number, leave this field unassigned. v Customer Administration Center Telephone Number is the number of the System Administration Center computer (catcher) that receives problem calls from servers. Contact your system administrator for the correct telephone number to enter here. Until you have that number, leave this field unassigned. v Digital Pager Telephone Number is the number for a numeric pager carried by someone who responds to problem calls from your system. Contact your administration center representative for the correct telephone number to enter. For test purposes, use a test number, which you can change later (see the note on page 337). Note: If the system is running in partitioned mode, call-out is disabled. However, if the system is booted in full system partition mode, at least one of the preceding three telephone numbers must be assigned in order for the call-out test to complete successfully. v Customer Voice Telephone Number is the telephone number of a phone near the system or answered by someone responsible for the system. This is the telephone number left on the pager for callback. For test purposes, use a test number, which you can change later. v Customer System Telephone Number is the telephone number to which your system’s modem is connected. The service or administrative center representatives need this number to make direct contact with your system for problem investigation. This is also referred to as the call-in phone number. Call-Out Policy Setup Menu Note: Call-out is disabled on partitioned systems. Call out settings can be set using the following menu: CALL-OUT POLICY SETUP MENU 1. Call-Out policy (First/All): Currently First 2. Remote timeout, (in seconds): Currently 120 3. Remote latency, (in seconds): Currently 2 4. Number of retries: Currently 2 98. Return to Previous Menu 0> 328 Service Guide v Call-Out policy can be set to first or all. If call-out policy is set to first, the service processor stops at the first successful call-out to one of the following numbers in the order listed: 1. Service Center 2. Customer Administrative Center 3. Pager If call-out policy is set to all, the service processor attempts a call-out to all of the following numbers in the order listed: 1. Service Center 2. Customer Administrative Center 3. Pager v Remote timeout and remote latency are functions of your service provider’s catcher computer. Either use the defaults or contact your service provider for recommended settings. v Number of retries is the number of times you want the system to retry calls that failed to complete. Customer Account Setup Menu This menu allows users to enter information that is specific to their account. Customer Account Setup Menu 1. Customer Account Number: Currently Unassigned 2. Customer RETAIN Login userid: Currently Unassigned 3. Customer RETAIN login password: Currently Unassigned 98. Return to Previous Menu 0> v Customer Account Number is assigned by your service provider for record-keeping and billing. If you have an account number, enter it. Otherwise, leave this field unassigned. v Customer RETAIN Login User ID and Customer RETAIN Login Password apply to a service function to which your service provider might have access. If your service provider does not use RETAIN, leave these fields unassigned. Call-out Test Menu Note: Call-out test is disabled in partitioned systems. Use this menu to test the configuration after the modem is installed and configured. To execute this test successfully, at least one of the following numbers must be assigned: v Service center telephone number v Customer administration center telephone number v Digital pager telephone number Chapter 7. Using the Service Processor 329 Service Processor Parameters in Service Mode (Full System Partition) When the system is in service mode, the following service-processor parameters are suspended: v Unattended Start Mode v Reboot/Restart Policy v Call-Out v Surveillance When service mode is exited, the service processor functions are reactivated. System Power-On Methods This section discusses the following system power-on methods: v Power-on Switch v Service Processor Menus Privileged users can power on the system by selecting the System Control Power Menu option from the main menu and then selecting the Power-on System option from the System Power Control Menu. General users should select Power-on System on the General User Menu. v Remote Power-on via Ring-Indicate Signal The system automatically powers on when it detects a ²ring indicate² signal from a modem attached to serial port 1 (S1) or serial port 2 (S2). A remote user can call the system to activate ring detection by the modem. Listen for a few more rings than the threshold number for starting the system. The system powers on without answering the call. v Unattended start mode - refer to Enable/Disable Unattended Start Mode on page 314. The service processor can be enabled to recover from the loss of ac power (see Enable/Disable Unattended Power-On Mode in the SYSTEM POWER CONTROL MENU). When ac power is restored, the system returns to the power state at the time ac loss occurred. For example, if the system was powered on when ac loss occurred, it reboots/restarts when power is restored. If the system was powered off when ac loss occurred, it remains off when power is restored. v Timed power-on - refer to the shutdown -t command on servers using AIX. Working in conjunction with AIX, the service processor in your system can operate a timer, much like the wake-up timer on your clock radio. You can set the timer so that your system powers on at a certain time after shutting down. The timer is battery-operated, so power interruptions that occur while the system is off do not affect its accuracy. Refer to the AIX shutdown -t command for details on setting the timer. Note: If an ac power loss is in progress when the timed power-on attempt occurs, the system cannot power on when ac power is restored. v Follow-up to a Failed Boot Attempt The service processor initiates a power-on sequence if a failed boot attempt is detected (due to a hardware or software failure). v Fast or Slow Boot (IPL) Using the service processor menus, you can select the IPL type, mode, and speed of your system. Attention: Selecting fast IPL results in several diagnostic tests being skipped and a shorter memory test being run. 330 Service Guide Service Processor Reboot/Restart Recovery Reboot describes bringing the system hardware back up; for example, from a system reset or power-on. The boot process ends when control passes to the operating system process. Restart describes activating the operating system after the system hardware is reinitialized. Restart must follow a successful reboot. Boot (IPL) Speed When the system enters reboot recovery, slow IPL is automatically started, which gives the POST an opportunity to locate and report any problems that might otherwise be unreported. Failure During Boot Process During the boot process, either initially after system power-on or upon reboot after a system failure, the service processor monitors the boot progress. If progress stops, the service processor can reinitiate the boot process (reboot) if enabled to do so. The service processor can re-attempt this process according to the number of retries selected in the Reboot/Restart Policy Setup Menu. Failure During Normal System Operation When the boot process completes and control transfers to the operating system (OS), the service processor can monitor operating system activity (see the Set Surveillance Parameters option in the SERVICE PROCESSOR SETUP MENU). If OS activity stops due to a hardware- or software-induced failure, the service processor can initiate a reboot/restart process based on the settings in the Service Processor Reboot/Restart Policy Setup Menu and the OS automatic restart settings (see the operating system documentation), if you are running in a full system partition. Operating system surveillance is disabled in partitioned systems. If you are using the AIX operating system, the menu item under SMIT for setting the restart policy is Automatically Reboot After Crash. The default is false. When the setting is true, and if the service processor parameter ²Use OS-Defined Restart Policy² is yes (the default), the service processor takes over for AIX to reboot/restart after a hardware or surveillance failure. Service Processor Reboot/Restart Policy Controls The operating system’s automatic restart policy (see operating system documentation) indicates the operating system response to a system crash. The service processor can be instructed to refer to that policy by the Use OS-Defined Restart Policy setup menu. If the operating system has no automatic restart policy, or if it is disabled, then the service processor-restart policy can be controlled from the service processor menus. Use the Enable Supplemental Restart Policy selection. Use OS-Defined restart policy - The default setting is no. In a full-system partition, this causes the service processor to refer to the OS Automatic Restart Policy setting and take action (the same action the operating system would take if it could have responded to the problem causing the restart). When this setting is no, or if the operating system did not set a policy, the service processor refers to enable supplemental restart policy for its action. Chapter 7. Using the Service Processor 331 Enable supplemental restart policy - The default setting is yes. When set to yes in a full system partition, the service processor restarts the server when the operating system loses control and either: v The Use OS-Defined restart policy is set to No. OR v The Use OS-Defined restart policy is set to Yes and the operating system has no automatic restart policy. If set to Yes on a full-system partition, the service processor restarts the system when the system loses control and it is detected by service processor surveillance. Refer to “Service Processor Reboot/Restart Recovery” on page 331. The following table describes the relationship among the operating system and service processor restart controls in a full system partition: OS Automatic reboot/restart after crash setting Service processor to use OS-Defined restart policy? Service Processor Enable supplemental restart policy? System response None No1 No None No1 Yes1 Restarts None Yes No None Yes Yes1 Restarts False2 No1 No False2 No1 Yes1 Restarts False2 Yes No False2 Yes Yes1 True No1 No True No1 Yes1 Restarts True Yes No Restarts True Yes Yes1 Restarts 1 Service processor default 2 AIX default Note: In a partitioned system, the service processor’s supplemental restart policy is the only setting that is used. System Firmware Updates This section provides information and instructions for updating the system firmware. You may need to perform these steps if you are installing an option or if your support representative has instructed you to update your firmware. 332 Service Guide If you cannot download from the Web, do the following: v If the system cannot be powered on, but the service processor menus are available, see “Updating System Firmware From the Service Processor Menus” on page 334. v If the service processor programming has been corrupted, the service processor will automatically enter recovery mode when power is applied to the system. To check the level of firmware that is currently on the system, see “Determining the Level of Firmware on the System” on page 334. General Information on System Firmware Updates All the system firmware types that can be reprogrammed are updated at the same time. They are: v System power control network programming v Service processor programming v IPL programming v Run-time abstraction services Retain and store the latest firmware diskettes each time the firmware gets updated in the event that the firmware becomes corrupted and must be reloaded. Chapter 7. Using the Service Processor 333 Determining the Level of Firmware on the System The firmware level is denoted by XXYYMMDD, where XX = model designation, YY = year, MM = month, and DD = day of the release. If your system is running AIX, the platform firmware level can be determined by either of the following methods: v On the AIX command line, by typing: lscfg -vp|grep -p Platform A line that begins with ROM level (alterable).. displays the firmware level that is currently on the system. v Looking at the top of the service processor main menu. If the system is running Linux, the platform firmware level can be determined by either of the following methods: v On the Linux command line, type: /usr/sbin/ibmras/nvram --print-vpd | grep RR A line similar to Alterable ROM Level RR021114 displays the firmware level that is currently on the system. v Looking at the top of the service processor menu main menu. Updating System Firmware From the Service Processor Menus This procedure requires a diskette drive to be installed in the system. This procedure also requires a set of firmware update diskettes in backup format. The service processor menus are available while the system is powered off. As a privileged user, from the service processor main menu, select Service Processor Setup, then select Reprogram Flash EPROM Menu. The update process requests update diskettes as needed. 334 Service Guide Updating System Firmware from a NIM Server Refer to “Running Standalone Diagnostics from a Network Installation Management (NIM) Server with an HMC Attached to the System” on page 302. Recovery Mode Contact your marketing representative for information about obtaining and installing system firmware update packages. Code Action A1FD 0000 System firmware has been corrupted and must be reflashed. A1FD 0001 Insert update diskette 1. A1FD 0002 Insert update diskette 2. A1FD 0003 Insert update diskette 3. A1FD 000n Insert update diskette n. Notes: 1. If the wrong diskette is inserted at any time, or if the diskette is left in the drive after it has been read, B1FD 001F is displayed, indicating that the wrong diskette is in the drive. 2. If B1FD 001A is displayed at any time during the process, the service processor must be reset by activating the pinhole reset switch on the primary drawer operator panel. Configuring and Deconfiguring Processors or Memory All failures that crash the system with a machine check or check stop, even if intermittent, are reported as a diagnostic callout for service repair. To prevent the recurrence of intermittent problems and improve the availability of the system until a scheduled maintenance window, processors and memory DIMMs with a failure history are marked ²bad² to prevent their being configured on subsequent boots. This function is called repeat gard. A processor or memory DIMM is marked ²bad² under the following circumstances: v A processor or memory DIMM fails built-in self-test (BIST) or power-on self-test (POST) testing during boot (as determined by the service processor). v A processor or memory DIMM causes a machine check or check stop during runtime, and the failure can be isolated specifically to that processor or memory DIMM (as determined by the processor runtime diagnostics in the service processor). v A processor or memory DIMM reaches a threshold of recovered failures that results in a predictive callout (as determined by the processor run-time diagnostics in the service processor). During boot time, the service processor does not configure processors or memory DIMMs that are marked “bad.” Chapter 7. Using the Service Processor 335 If a processor or memory DIMM is deconfigured, the processor or memory DIMM remains offline for subsequent reboots until it is replaced or memory repeat gard is disabled. The repeat gard function also provides the user with the option of manually deconfiguring a processor or memory DIMM, or re-enabling a previously deconfigured processor or memory DIMM. For information about configuring or deconfiguring a processor, see the Processor Configuration/Deconfiguration Menu on page 318. For information on configuring or deconfiguring a memory DIMM, see the Memory Configuration/Deconfiguration Menu on page 319. Both of these menus are submenus under the System Information Menu. You can enable or disable CPU Repeat Gard or Memory Repeat Gard using the Processor Configuration/Deconfiguration Menu. Run-Time CPU Deconfiguration (CPU Repeat Gard) L1 instruction cache recoverable errors, L1 data cache correctable errors, and L2 cache correctable errors are monitored by the processor runtime diagnostics (PRD) code running in the service processor. When a predefined error threshold is met, an error log with warning severity and threshold exceeded status is returned to AIX. At the same time, PRD marks the CPU for deconfiguration at the next boot. AIX will attempt to migrate all resources associated with that processor to another processor and then stop the defective processor. Service Processor System Monitoring - Surveillance Surveillance is a function in which the service processor monitors the system, and the system monitors the service processor. This monitoring is accomplished by periodic samplings called heartbeats. Surveillance is available during the following phases: v System firmware bringup (automatic) v Operating system runtime (optional) Note: Operating system surveillance is disabled in partitioned systems. System Firmware Surveillance System firmware surveillance is automatically enabled during system power-on. It cannot be disabled by the user, and the surveillance interval and surveillance delay cannot be changed by the user. If the service processor detects no heartbeats during system IPL (for a set period of time), it cycles the system power to attempt a reboot. The maximum number of retries is set from the service processor menus. If the fail condition persists, the service processor leaves the machine powered on, logs an error, and displays menus to the user. If Call-out is enabled, the service processor calls to report the failure and displays the operating-system surveillance failure code on the operator panel. Operating System Surveillance Note: Operating system surveillance is disabled on partitioned systems. Operating system surveillance provides the service processor with a means to detect hang conditions, as well as hardware or software failures, while the operating system is running. It also provides the operating system with a means to detect a service processor failure caused by the lack of a return heartbeat. Operating system surveillance is not enabled by default, allowing you to run operating systems that do not support this service processor option. 336 Service Guide For operating system surveillance to work correctly, you must set these parameters: v Surveillance enable/disable v Surveillance interval The maximum time the service processor should wait for a heartbeat from the operating system before timeout. v Surveillance delay The length of time to wait from the time the operating system is started to when the first heartbeat is expected. Surveillance does not take effect until the next time the operating system is started after the parameters have been set. If desired, you can initiate surveillance mode immediately from service aids. In addition to the previously discussed options, another option allows you to select immediate surveillance, and rebooting of the system is not necessarily required. If operating system surveillance is enabled (and system firmware has passed control to the operating system), and the service processor does not detect any heartbeats from the operating system, the service processor assumes the system is hung and takes action according to the reboot/restart policy settings. See “Service Processor Reboot/Restart Recovery” on page 331. If surveillance is selected from the service processor menus that are only available at bootup, surveillance is enabled by default as soon as the system boots. From service aids, the selection is optional. Call-Out (Call-Home) Note: Call-out is disabled on partitioned systems. The call-out function is handled by the Service Focal Point application on the Hardware Management Console (HMC). The service processor can call out (call-home) when it detects one of the following conditions: v System firmware surveillance failure v Operating system surveillance failure (if supported by operating system) v Restarts v Critical hardware failure v Abnormal operating system termination To enable the call-out feature, do the following: 1. Connect a modem to any serial port. 2. Set up the following using the service processor menus or diagnostic service aids: v Enable call-out for the serial port where the modem is connected. v Enter the modem configuration file name. v Set up site-specific parameters (phone numbers for call-out, call-out policy, number of call-out retries, and so on). Chapter 7. Using the Service Processor 337 3. To call out before restart, set Call-out before restart to ENABLED from the Reboot/Restart Policy Setup menu. Note: Some modems, such as IBM 7857-017, are not designed for the paging function. Although they can be used for paging, they will return an error message when they do not get the expected response from another modem. Therefore, even though the paging was successful, the error message will cause the service processor to retry, continuing to place pager calls for the number of retries specified in the call-out policy setup menu. These retries result in redundant pages. Console Mirroring Console mirroring allows a user on one serial port to monitor the service processor activities on another serial port. This can be done on the locally attached HMC virtual terminal window or windows remotely attached through modems. Console mirroring is supported on any combination of locally or remotely attached monitors. The serial port from which console mirroring is enabled is called the primary port. The mirror port is determined when keyboard input is detected from one of the other two serial ports. From this point forward, the service processor sends information to the active port and the mirror port only. Console mirroring ends when the service processor releases control of the serial ports to the system firmware. Console mirroring is supported on serial port 1 (S1), serial port 2 (S2), and serial port 3 (S3). Remote attachment through modem is supported on serial port 1 (S1) and serial port 2 (S2). System Configuration The following describes the configuration for console mirroring: v Service processor v Modem connected to one serial port and enabled for incoming calls v Local HMC virtual terminal window connected to the other serial port. This local terminal can be connected directly to your system or connected through another modem. Console mirroring can be started by either of the following methods: v Remote session first, then local session added: 1. Remote session is already in progress. 2. Remote user uses service processor menus to enable console mirroring, allowing both consoles to be active. v Local session first, then remote session added: 1. Local session is already in progress. 2. The service processor receives a call from the remote user. 3. The local user selects the option to enable console mirroring. The service processor immediately begins mirroring service processor menus. 338 Service Guide Service Processor Error Logs The service processor error logs, an example of which follows, contain error conditions detected by the service processor. Error Log 1. 11/30/99 19:41:56 Service Processor Firmware Failure B1004999 Enter error number for more details. Press Return to continue, or ’x’ to return to menu. Press "C" to clear error log, any other key to continue. > Note: The time stamp in this error log is coordinated universal time (UTC), which is also referred to as Greenwich mean time (GMT). Operating system error logs have additional information available and can time stamp with local time. Entering an error number provides nine words of system reference code (SRC) data; an example menu follows. Detail: 6005 SRC - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - word11:B1004999 word12:0110005D word13:00000000 word14:00000000 word15:00001111 word16:00000000 word17:B1004AAA word18:0114005D word19:A4F1E909 B1004999 Press Return to continue, or ’x’ to return to menu. If Return is pressed, the contents of NVRAM will be dumped 320 bytes at a time, starting at address 0000. Chapter 7. Using the Service Processor 339 LCD Progress Indicator Log The following is an example of the LCD progress indicator log. It shows the types of entries that can appear in the log, and is for example purposes only. The progress indicator codes are listed from top (latest) to bottom (oldest). LCD Progress Indicator Log B0FF 0539..17 0538..17 0539..17 0538..17 0539..17 0581 0538..17 0539..12 0538..12 0539.. 0821..01-K1-00 0539.. 0728..01-R1-00-00 0539.. 0664..40-60-00-1,0 0539.. 0777..U0.1-P2-I1/E1 0539.. 0742..U0.1-P2-I2/E1 0539.. 0776..U0.1-P2-I3/T1 E139 E1FB E139 E183 Press Return to continue, or ’x’ to return to menu. > EAA1..U0.1-P1-I4 E172..U0.1-P1 E172..U0.1-P1-I4 E172..U0.1-P1 94BB 9109 9380 9108 9107 9106 9105 9118 9104 9103 9102 90FD 340 Service Guide Service Processor Operational Phases This section provides a high-level flow of the phases of the service processor. Pre-Standby Phase This phase is entered when the system is connected to a power source. The system may or may not be fully powered on. This phase is exited when the power-on self-tests (POSTs) and configuration tasks are completed. The pre-standby phase components are: v Service Processor Initialization - Service processor performs any necessary hardware and software initialization. v Service Processor POST - Service processor conducts Power-on self-tests on its various work and code areas. v Service Processor Unattended Start Mode Checks - To assist fault recovery. If unattended start mode is set, the service processor automatically reboots the system. The service processor does not wait for user input or power-on command, but moves through the phase and into the bring-up phase. Access the SMS menus or the service processor menus to reset the unattended start mode. Standby Phase The standby phase can be reached in either of the following ways: v With the system off and power connected (the normal path), recognized by OK in the LCD display. OR v With the system on after an operating system fault, recognized by an 8-digit code in the LCD display. In the standby phase, the service processor takes care of some automatic duties and is available for menus operation. The service processor remains in the standby phase until a power-on request is detected. Chapter 7. Using the Service Processor 341 The standby phase components are as follows: v Modem Configuration The service processor configures the modem (if installed) so that incoming calls can be received, or outgoing calls can be placed. v Dial In Monitor incoming phone line to answer calls, prompt for a password, verify the password, and remotely display the standby menu. The remote session can be mirrored on the local HMC virtual terminal window if the system is so equipped and if the user enables this function. v Menus The service processor menus are password-protected. Before you can access them, you need either the general user-password or privileged-user password. Service processor menus are available on ASCII terminals attached to the serial ports on the system, and on terminal emulators. On HMC-managed systems, service processor menus are also available on the HMC interface. Bring-Up Phase On a system that is powered-on to full system partition, this phase is entered upon power-on, and exited upon loading of the operating system. On a system that is powered-on to partition standby, this phase is entered upon power-on, and exited when the partition manager and hypervisor have been loaded and become fully operational in system memory. The end of this phase is indicated by LPAR on the operator panel. At this point, the server can start multiple logical partitions that would be activated through the HMC interface. The bring-up phase components are as follows: v Retry Request Check The service processor checks to see if the previous boot attempt failed. If the specified number of failures are detected, the service processor displays an error code and places an outgoing call to notify an external party if the user has enabled this option. v Dial Out The service processor can dial a preprogrammed telephone number in the event of an IPL failure. The service processor issues an error report with the last reported IPL status indicated and any other available error information. v Update Operator Panel The service processor displays operator panel data on the HMC virtual terminal window if a remote connection is active. v Environmental Monitoring The service processor provides expanded error recording and reporting. v System Firmware Surveillance (Heartbeat Monitoring) The service processor monitors and times the interval between system firmware heartbeats. v Responding to System Processor Commands The service processor responds to any command issued by the system processor. 342 Service Guide Runtime Phase This phase includes the tasks that the service processor performs during steady-state execution of the operating system. v Environmental Monitoring The service processor monitors voltages, temperatures, and fan speeds (on some servers). v Responding to System Processor Commands The service processor responds to any command issued by the system processor. v Run-Time Surveillance (not supported on partitioned systems) If the device driver is installed and surveillance enabled, the service processor monitors the system heartbeat. If the heartbeat times out, the service processor places an outgoing call. This is different from the bring-up phase scenario, where the specified number of reboot attempts are made before placing an outgoing call. v HMC surveillance On an HMC-managed system, the service processor monitors the communication link between the managed system and the HMC. If the service processor detects that this communication link has been broken, it will post an error to the operating system running on the managed system. Chapter 7. Using the Service Processor 343 344 Service Guide Chapter 8. Using System Management Services Use the system management services menus to view information about your system or partition, and to perform tasks such as setting a password, changing the boot list, and setting the network parameters. Notes: 1. On some of the system management services (or service processor) screens, you will see the term LPAR. LPAR is equivalent to the term logically partitioned system or partitioned system. 2. In a partitioned system, only those devices that are assigned to the partition that is being booted display in the SMS menus. In a full system partition, all devices in the system display in the SMS menus. To start the system management services, do the following: 1. For a partitioned system, use the Hardware Management Console (HMC) to restart the partition. For a full system partition, restart the system. 2. For a partitioned system, watch the virtual terminal window on the HMC. For a full system partition, watch the firmware console. 3. Look for the POST indicators memory, keyboard, network, scsi, speaker, which appear across the bottom of the screen. Press the numeric 1 key after the word keyboard appears, and before the word speaker appears. For more information on the POST indicator words, refer to “POST Indicators” on page 7. After the system management services starts, the following screen displays: Main Menu 1 Select Language 2 Change Password Options NOT available in LPAR mode 3 View Error Log 4 Setup Remote IPL (Initial Program Load) 5 Change SCSI Settings 6 Select Console NOT available in LPAR mode 7 Select Boot Options -------------------------------------------------------------------------------------------------- Navigation keys: X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Note: The system management services can also be started using the Service Processor Boot Mode Menu. See page 316. 345 On all menus except the Main Menu, there are several navigation keys: M Return to the main menu. ESC Return to the previous menu. X Exit the system management services and start the operating system. If X is entered, you are asked to confirm your choice to exit the SMS menus and start the operating system. When there is more than one page of information to display, there are two additional navigation keys: N Display the next page of the list. P Display the previous page of the list. Note: The lowercase navigation key has the same effect as the uppercase key that is shown on the screen. For example, m or M returns you to the main menu. On each menu screen, you are given the option of choosing a menu item and pressing Enter (if applicable), or selecting a navigation key. Select Language Note: Your TTY must support the ISO-8859 character set to properly display languages other than English. This option allows you to change the language used by the text-based System Management Services menus. SELECT LANGUAGE 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ 346 Service Guide Change Password Options The Change Password Options menu enables you to select from password utilities. Note: This option is not available in partitioned systems. Password Utilities 1 Set Privileged-Access Password 2 Remove Privileged-Access Password -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The privileged-access password protects against the unauthorized starting of the system programs. Note: If the privileged-access password has been enabled, you are asked for the privileged-access password at startup every time you boot your system. If you previously had set a privileged-access password and want to remove it, select Remove Privileged-Access Password. View Error Log Use this option to view or clear your system’s error log. A menu similar to the following displays when you select this option. Error Log Date Time Error Code Location Entry 1. 01/04/96 12:13:22 25A80011 00-00 Entry 2. no error logged 1. Clear error log -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Note: The time stamp in his error log is coordinated universal time (UTC), which is also referred to as Greenwich mean time (GMT). The operating system error logs have more information available and can time stamp with your local time. Chapter 8. Using System Management Services 347 Setup Remote IPL (Initial Program Load) This option allows you to enable and set up the remote startup capability of your system or partition. You must first specify the network parameters. Network Parameters 1. IP Parameters 2. Adapter Parameters 3. Ping Test -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Selecting IP Parameters displays the following menu. IP Parameters 1. Client IP Address [000.000.000.000] 2. Server IP Address [000.000.000.000] 3. Gateway IP Address [000.000.000.000] 4. Subnet Mask [255.255.255.000] -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ To change IP (Internet Protocol) parameters, type the number of the parameters for which you want to change the value. Attention: If the client system and the server are on the same subnet, set the gateway IP address to [0.0.0.0]. 348 Service Guide Selecting Adapter Parameters allows you to view an adapter’s hardware address, as well as configure network adapters that require setup. A menu similar to the following displays. Attention: In a partitioned system, only those network adapters that have been assigned to the partition being booted display in the Adapters Parameters menu. In a full system partition, all network adapters in the system are listed in the adapter parameters menu. Device Slot Hardware Address 1. 10/100 Mbps Ethernet PCI Adapter II 4:U0.1-P1-I4/E1 000629aca72d 2. 10/100 Ethernet Adapter Integrated:U0.1-P1/E1 0020357A0530 3. High Speed Token-Ring Adapter 1:U0.1-P1-I1/T1 000629be04e1 4. Gigabit Ethernet PCI Adapter 3:U0.1-P1-I3/E1 0004ac7c9ec7 -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Entering adapter parameters on these screens will automatically update the parameters on the ping test screen. Selecting an adapter on this menu displays configuration options for that adapter. The current setting is indicated by <===. Adapter Parameters 10/100 Mbps Ethernet PCI Adapter II 1. ethernet, 10, rj45, full 2. ethernet, 10, rj45, half 3. ethernet, 100, rj45, full 4. ethernet, 100, rj45, half 5. ethernet, auto, rj45, auto <=== -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ To change the adapter parameters, select an option from the list. Chapter 8. Using System Management Services 349 To test a connection to a remote system unit, select Ping Test from the Network Parameters Menu. After selecting the Ping Test option, you must select which adapter communicates with the remote system. Adapter Parameters Device Slot Hardware Address 1. 10/100 Mbps Ethernet PCI Adapter II 4:U0.1-P1-I4/E1 000629aca72d 2. 10/100 Ethernet Adapter Integrated:U0.1-P1/E1 0020357A0530 3. High Speed Token-Ring Adapter 1:U0.1-P1-I1/T1 000629be04e1 4. Gigabit Ethernet PCI Adapter 3:U0.1-P1-I3/E1 0004ac7c9ec7 -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ After selecting an adapter, you will be asked to set the parameters for the adapter; a menu similar to the following displays: Adapter Parameters 10/100 Mbps Ethernet PCI Adapter II 1. ethernet, 10, rj45, full 2. ethernet, 10, rj45, half 3. ethernet, 100, rj45, full 4. ethernet, 100, rj45, half <=== 5. ethernet, auto, rj45, auto 6. Execute Ping Test -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ 350 Service Guide After selecting which adapter and its parameters to use to ping the remote system, you must provide the addresses needed to communicate with the remote system. Ping Test 1. Client IP Address [129.132.4.20] 2. Server IP Address [129.132.4.10] 3. Gateway IP Address [129.132.4.30] 4. Subnet Mask [255.255.255.0] 5. Execute Ping Test -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Notes: 1. After the ping test is initiated, it may take up to 60 seconds to return a result. 2. If the ping test passes or fails, the firmware will stop and wait for a key to be pressed before continuing. Change SCSI Settings This option allows you to view and change the addresses of the SCSI controllers attached to your system. SCSI Utilities 1. Hardware Spin Up Delay 2. Change SCSI Id -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select Console Note: This option is not available on partitioned systems. The Select Console utility allows the user to select which console to use to display the SMS menus. This selection is only for the SMS menus and does not affect the display used by the operating system. Follow the instructions that display on the screen. The firmware automatically returns to the SMS main menu. Chapter 8. Using System Management Services 351 Select Boot Options Use this menu to view and set various options regarding the installation devices and boot devices. 1. Select Install or Boot a Device 2. Select Boot Devices 3. Multiboot Startup -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Select Install or Boot a Device Allows you to select a device from which to boot or install the operating system. This option is for the current boot only. Select Boot Devices Allows you to set the boot list. Multiboot Startup Toggles the multiboot startup flag, which controls whether the multiboot menu is invoked automatically on startup. If Select Install or Boot a Device is selected, the following menu displays: Select Device Type 1. Diskette 2. Tape 3. CD/DVD 4. IDE 5. Hard Drive 6. Network 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If a device is selected that is not in the system, a menu with the following message displays: .---------------------------------------------------------. | THE SELECTED DEVICES WERE NOT DETECTED IN THE SYSTEM ! | | Press any key to continue. | Á---------------------------------------------------------’ 352 Service Guide If Hard Drive is selected, the following menu displays: Select Hard Drive Type 1. SCSI 2. SSA 3. SAN 4. None 5. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If List All Devices is selected, a menu similar to the following displays, depending on the devices that are installed in the system: Select Device Device Current Device Number Position Name 1. 1 SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 2. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A9,0) 3. - SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-Aa,0) 4. None 5. List all devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The appropriate device can then be selected for this boot or installation. When a device is selected for installing the operating system, or to boot from, the Select Task menu allows you to get more information about the device, or to boot from that device in normal mode or service mode. The following is an example of this menu. Select Task SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 1. Information 2. Normal Mode Boot 3. Service Mode Boot -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If either Normal Mode Boot or Service Mode Boot is selected, the next screen will ask, Are you sure?. If you answer yes, the device will be booted in the appropriate mode. If you answer no, the firmware will return to the Select Task menu. Chapter 8. Using System Management Services 353 Select Boot Devices Attention: In a partitioned system, only those devices from which an operating system can be booted that are assigned to the partition that is being booted display on the select boot devices menu. In a full system partition, devices from which an operating system can be booted display on the Select Boot Devices menu. Select this option to view and change the customized boot list, which is the sequence of devices read at startup. Configure Boot Device Order 1. Select 1st Boot Device 2. Select 2nd Boot Device 3. Select 3rd Boot Device 4. Select 4th Boot Device 5. Select 5th Boot Device 6. Display Current Setting 7. Restore Default Setting -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When any of the options 1-5 is selected, the Select Device Type screen will be displayed, which is similar to the following: Select Device Type 1. Diskette 2. Tape 3. CD/DVD 4. IDE 5. Hard Drive 6. Network 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When a device type is selected, such as item 5, a Select Task menu is displayed. The following is an example of that menu for a hard disk. 354 Service Guide Select Task SCSI 18200 MB Harddisk (loc=U0.1-P2/Z1-A8,0) 1. Information 2. Set Boot Sequence: Configure as 1st Boot Device -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Selecting Information displays a menu similar to the following for a hard disk. Device Information /pci@3fffdf0a000/pci@2,4/scsi@1/sd@8,0 : (Integrated) (Bootable) DEVICE : SCSI 18200 MB Harddisk ( loc=U0.1-P2/Z1-A8,0) NAME : sd DEVICE-TYPE : block -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ The Set Boot Sequence option allows you to set the location of the device in the boot list. Display Current Settings This option displays the current setting of the customized boot list. An example of this menu, with one device in the boot list, follows. Current Boot Sequence 1. SCSI 18200 MB Harddisk (loc=U0.1-P1/Z1-A8,0) 2. None 3. None 4. None 5. None -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ Chapter 8. Using System Management Services 355 Restore Default Settings This option restores the boot list to the default boot list. The default boot list will vary depending on the devices that are installed in the system. The default boot list is as follows: 1. Primary diskette drive (if installed) 2. CD-ROM drive (if installed) 3. Tape drive (if installed) 4. Hard disk drive (if installed) 5. Network adapter Multiboot Startup Multiboot Startup toggles the multiboot startup flag, which controls whether the multiboot menu is invoked automatically on startup. Exiting System Management Services After you have finished using the system management services, type x (for exit) to boot your system or partition. 356 Service Guide Chapter 9. Removal and Replacement Procedures Notes: 1. Servicing of the 286/E4 can be performed with the system in either the vertical or horizontal position. Servicing of the 286/C4 is performed with the system in the rack and placed into the service position. The service position for the 286/C4 is described in “Placing the 286/C4 into the Service Position” on page 364. 2. Before performing any of the removal or replacement procedures in this chapter, read the following danger and caution notices. In the system you are about to set up or service: v The ac power interface connector is considered the main power disconnect device. v This system has redundant power supply capabilities, meaning that it has the ability to have two power supplies running simultaneously in the same system unit. When instructed to disconnect the power source, ensure that all power cables have been unplugged. DANGER An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. Before installing or removing signal cables, ensure that the power cables for the system unit and all attached devices are unplugged. When adding or removing any additional devices to or from the system, ensure that the power cables for those devices are unplugged before the signal cables are connected. If possible, disconnect all power cables from the existing system before you add a device. Use one hand, when possible, to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials. During an electrical storm, do not connect cables for display stations, printers, telephones, or station protectors for communications lines. D05 CAUTION: This product is equipped with a three-wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock. C01 CAUTION: This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect two power supply cords before servicing. C21 357 Procedure List Procedure Procedure Name and Page Location Adapters “PCI Adapters” on page 379 Battery Replacement “Replacing the Battery” on page 432 Bezels “Bezels” on page 377 Bezel Door “Bezel Door (286/E4)” on page 376 Cable Management Arm “Installing the Cable-Management Arm” on page 501 CD-ROM Drive “Media Drives (Diskette, CD-ROM, Tape, or Disk Drive)” on page 401 CEC Backplane “Central Electronics Complex (CEC) Backplane” on page 443 Covers “Covers” on page 366 Disk Drive “Hot-Plug Disk Drives” on page 405 Disk Drive Backplane “Disk Drive Backplane” on page 415 Disk Drive Configuration or Deconfiguration “Deconfiguring (Removing) or Configuring a Disk Drive” on page 405 Diskette Drive “Media Drives (Diskette, CD-ROM, Tape, or Disk Drive)” on page 401 Fans “Fans” on page 395 HMC Appendix E, “Installing a Hardware Management Console (HMC)”, on page 503 I/O Card (Refer to PCI Riser Card) Media Drives “Media Drives (Diskette, CD-ROM, Tape, or Disk Drive)” on page 401 Memory DIMMs “Memory DIMMs” on page 427 Operating Position “Returning the 286/C4 to the Operating Position” on page 365 Operator Panel “Operator Panel” on page 419 PCI Riser Card “PCI Riser Card” on page 409 Power Supply “Power Supplies” on page 436 Processor Card “Processor Card” on page 423 Rack Installation “286/C4 Removal and Installation into a Rack” on page 496 Release Latch and Brackets “286/C4 Drawer-Release Latch and Brackets” on page 362 Service Position “Placing the 286/C4 into the Service Position” on page 364 Starting the System “Starting the System without an HMC Attached” on page 360 Static-Sensitive Devices “Handling Static-Sensitive Devices” on page 359 Stopping the System “Stopping the System without an HMC Attached” on page 359 System Board (Refer to CEC Backplane) Rack Hardware “Attaching the Mounting Hardware to the Rack Enclosure” on page 485 VPD Update “System Vital Product Data (VPD) Update Procedure” on page 421 358 Service Guide Handling Static-Sensitive Devices Attention: Electronic boards, diskette drives, and disk drives are sensitive to static electricity discharge. These devices are wrapped in antistatic bags to prevent this damage. Take the following precautions: v If you have an antistatic wrist strap available, use it while handling the device. v Do not remove the device from the antistatic bag until you are ready to install the device in the system. v With the device still in its antistatic bag, touch it to a metal frame of the system. v Grasp cards and boards by the edges. Hold drives by the frame. Avoid touching the solder joints or pins. v If you need to lay the device down while it is out of the antistatic bag, lay it on the antistatic bag. Before picking it up again, touch the antistatic bag and the metal frame of the system at the same time. v Handle the devices carefully to prevent permanent damage. Stopping the System without an HMC Attached Attention: When shutting down your system to install options, shut down all applications first and then shut down the operating system. The system power turns off and the system goes into standby mode when the operating system is shut down. Before removing power from the system, ensure that the shutdown process is complete. Failure to do so can result in the loss of data. Some option-installation procedures do not require the system to be stopped for installation. The option-installation procedures in this chapter will direct you here if stopping the system is required. 1. Log in to the system as root user. 2. Have your system administrator stop all applications that are running on the system. 3. At the command line, ask the administrator to type one of the following commands: v If your system is running AIX, type shutdown v If your system is running Linux, type shutdown now -h 4. After you shut down the operating system, set the power switches of any attached devices to Off. Stopping the System with an HMC Attached and AIX Installed The HMC user interface provides a power-off function to turn off the power to the system. If the system is logically partitioned, AIX in each partition may be stopped individually to make repairs. If the system power must be turned off, all of the logical partitions must be stopped. AIX provides hot-swap procedures for adapters and devices that support hot-swap removal and installation. You do not have to shut down a partition that controls an adapter or device that supports hot-swap. Individual power components and fans can be serviced with the power on for power systems equipped with redundant power and cooling. Before servicing the system, check removal and installation procedures. If the system is operating in a full system partition under AIX, typing the shutdown command causes the system to shut down and power off. The -F flag shuts down the system without warning system users. Check with the system administrator before using this command. If you cannot use this method, you can power off the system by pressing the power button on the operator panel or typing the appropriate command at the HMC. Attention: Using the power button on the operator panel or commands at the HMC to power off the system can cause unpredictable results in the data files. The next IPL will also take longer to complete if all applications in all partitions are not stopped beforehand. Chapter 9. Removal and Replacement Procedures 359 Stopping the System with an HMC Attached and Linux Installed The HMC user interface provides a power-off function to turn off the power to the system. If the system is logically partitioned, Linux in each partition may be stopped individually to make repairs. If the system power must be turned off, all of the logical partitions must be stopped. Linux does not provide hot-swap procedures for adapter and device removal and installation. You must shut down all partitions and the system power must be turned off to service an adapter or device. Individual power components and fans may be serviced with power on for power systems equipped with redundant power and cooling. Before servicing this system, check removal and installation procedures. If the system is operating in a full system partition under Linux, typing the shutdown now -h command causes the system to shut down and power off. The -F flag shuts down the system without warning system users. Check with the system administrator before using this command. If you cannot use this method, you can power off the system by pressing the power button on the operator panel or typing the appropriate command at the HMC. Attention: Using the power-on button on the operator panel or commands at the HMC to power off the system can cause unpredictable results in the data files. The next IPL will also take longer to complete if all applications in all partitions are not stopped beforehand. Starting the System without an HMC Attached Power on the system only after all of the following steps are completed: v All I/O drawer cables are connected v Base system cables are connected v All PCI cables to supported subsystems are connected v The hardware management console (HMC) is connected (if required) v Power is connected to the system To power on the system, do the following: 1. Open any system or rack doors necessary so that you can view the operator panel. 2. Connect the power source to the system unit. Before you press the power-on button on your operator panel, observe the following: v The power LED is slowly blinking. v An OK prompt is visible in the operator panel display. 3. Press the power-on button on the operator panel, and observe the following: a. The power LED begins to visibly blink faster. b. The system cooling fans are activated and begin to accelerate to operating speed. Note: There is approximately a 30-second transition period between the time the power button is pressed and the power LED remains on solid (no longer blinking). 360 Service Guide c. The power LED stays on solid and progress indicators, also referred to as checkpoints, are visible on the operator panel display. 1 Power-On Button 6 Operator Panel Display 2 Power LED 7 (FS1) Front Serial Connector (RJ-48 Connector) 3 Attention LED 8 Service Processor Reset Switch (Pinhole) 4 SCSI Port Activity 9 System Reset Button 5 Ethernet Port Activity Starting the System with an HMC Attached Power on the system only after all of the following steps are completed: v All I/O drawer cables are connected v Base system cables are connected v All PCI cables to supported subsystems are connected v The hardware management console (HMC) is connected (if required) v Power is connected to the system After the required cables are installed and the power cables are connected, the HMC user interface provides a power-on function to turn on the power to the system. If a system operator panel is available, the power-on button on the operator panel can be pushed to initialize the system but the preferred method is to use the HMC if you are booting logical partitions. Progress indicators, also referred to as checkpoints, are visible on the operator panel display (if available) as the system power is turned on. The power LED on the base system stops blinking and stays on, indicating the system power is on. The base system and I/O subsystems are powered on through the system power control network (SPCN). When power is applied, the power LEDs on the base system go from blinking to continuous, and the power LEDs on the I/O subsystem(s) come on and stay on, which indicates that power levels are satisfactory in the subsystems. After the system is powered on, all logical partitions will be in a standby or powered on state. Each logical partition must then be booted through the HMC. The state of all logical partitions that have been booted will be running. Chapter 9. Removal and Replacement Procedures 361 286/C4 Drawer-Release Latch and Brackets The following procedure covers the removal and replacement of the system’s drawer-release latch. Drawer-Release Latch Removal To remove one of the 286/C4 drawer-release latches, do the following: 1. Open the front rack door and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 2. Remove the retaining screw located on the side of the release latch. 3. Pull the release latch off the latch bracket post. 1 System Drawer-Release Latch 5 System Drawer-Release Latch 2 System Drawer-Release Latch Bracket (Right) 6 Retaining Screws 3 286/C4 7 Release Latch Retaining Screw 4 System Drawer-Release Latch Bracket (Left) 8 Latch Bracket Post 362 Service Guide Drawer-Release Latch Replacement To replace one of the 286/C4 drawer-release latches, do the following: 1. If you have not already done so, open the front rack door and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 2. Push the release latch onto the latch bracket post. 3. Secure the drawer-release latch onto the latch bracket post with a release latch retaining screw. 4. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. Drawer-Release Latch Bracket Removal To remove the 286/C4 drawer-release latches brackets, do the following: 1. Open the front rack door and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 2. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377. 3. Remove the release latch as described in “Drawer-Release Latch Removal” on page 362. 4. Remove the retaining screws that secure the latch bracket to the side of the 286/C4. 1 System Drawer-Release Latch 5 System Drawer-Release Latch 2 System Drawer-Release Latch Bracket (Right) 6 Retaining Screws 3 286/C4 7 Release Latch Retaining Screw 4 System Drawer-Release Latch Bracket (Left) 8 Latch Bracket Post Chapter 9. Removal and Replacement Procedures 363 Drawer-Release Latch Bracket Replacement To replace the 286/C4 drawer-release latches brackets, do the following: 1. If you have not already done so, open the front rack door and place the system into the service position as described in “Placing the 286/C4 into the Service Position”. 2. Using the supplied retaining screws, secure the latch brackets to the side of the 286/C4. Note: The latch brackets are right-side and left-side dependent. 3. Replace the release latch as described in “Drawer-Release Latch Replacement” on page 363. 4. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377. 5. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. Placing the 286/C4 into the Service Position Attention: When placing the 286/C4 into the service position, it is essential that all stability plates must be firmly in position to prevent the rack from toppling. Ensure that only one system drawer at a time is in the service position. Before doing any service actions inside the 286/C4, you must put the system into the service position. To place the system into the service position, do the following: 1. Open the front rack door. 2. If your system is equipped with two thumbscrews securing it to the rack, remove the thumbscrews. The screws are located on the bezel, just above each system-release latch. 1 286/C4 Front Bezel 2 M5 16-mm Retaining Screw 3. Release the system-release latches on both the left and right sides. 4. Pull the system drawer out from the rack until the rails are fully extended. Note: When the system rails are fully extended, safety latches on the slide rails lock into place. This action prevents the system from being accidentally pulled out too far and dropped. The following illustration shows a system drawer in the service position. 364 Service Guide 1 286/C4 4 Stabilizing Bar 2 19-Inch Rack 5 Front Rack Door 3 Extension Rail Safety Release Tab 6 System Drawer-Release Latch After completing the service actions, return the system to the operating position. Returning the 286/C4 to the Operating Position To return the 286/C4 to the operating position, do the following: 1. Press in on both spring-loaded release tabs, located on each rail. See the illustration on page 365 for the location of the release tabs. 2. Ensure the system drawer-release latches, located on the system drawer, are in the open position. See the illustration on page 365 for the location of the system drawer-release latches. 3. Push the system drawer straight back into the rack until both system drawer-release latches have locked into position. 4. If you removed two thumbscrews from the bezel front, replace them at this time. 5. Close the front rack door. Chapter 9. Removal and Replacement Procedures 365 Covers Before performing the following procedures, read the “Safety Notices” on page xi. Service Access Cover Removal (286/C4) To remove the service access cover, do the following: 1. Ensure that the rack has been correctly stabilized before opening the front and rear rack doors. 2. Open the front rack door and place the 286/C4 into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 3. Loosen the three captive thumbscrews located on the rear of the cover. See the following illustration for thumbscrew locations. 4. Placing both hands on each side of the cover, slide the cover toward the rear of the system drawer. After the front of the service access cover has cleared the upper chassis ledge, lift the cover up and off the system drawer. Attention: For proper cooling and airflow, replace the cover before turning on the system. Operating the system for extended periods of time (over 30 minutes) with the cover removed might damage the system components. 1 Thumbscrews 4 Access Cover Locking Tab 2 Service Access Cover 5 Upper Chassis Ledge 3 286/C4 Service Access Cover Replacement (286/C4) To replace the service access cover, do the following: 1. Align the service access cover with the top of the system, about 25 mm (1 inch) from the front of the system. The flanges on the left and right sides of the cover should be on the outside of the system chassis. 2. Hold the service access cover against the system drawer, slide it toward the front of the system. The front edge of the service access cover slides beneath the upper chassis ledge. 3. Tighten the three thumbscrews located on the rear of the cover. 366 Service Guide Top Cap Removal (286/E4) To remove the top cap, do the following: 1. Remove the bezel door as described in “Bezel Door Removal” on page 376. 2. Remove the front bezel as described in “Front Bezel Removal (286/E4)” on page 378. 3. From the front of the top cap, grasp the two lower side edges and pry out to release the lower front tabs. 4. Pivot the cover up to release the two back tabs. 5. Lift the top cap up and off the system. 1 Top Cap 4 Bezel Door 2 Top Cap Rear Locking Tabs 5 Bezel 3 Top Cap Front Locking Tabs 6 286/E4 Top Cap Replacement (286/E4) To replace the top cap, do the following: 1. Align, and then insert the two tabs located on the back-bottom edge of the top cap with the two mating slots located on the top cover. 2. Rotate the cover down, snapping the two lower front tabs into place on the operator panel bracket. 3. Replace the front bezel as described in “Front Bezel Replacement (286/E4)” on page 378. 4. Replace the bezel door as described in “Bezel Door Replacement” on page 376. Chapter 9. Removal and Replacement Procedures 367 Top Cover Removal (286/E4) To remove the top cover, do the following: 1. Remove the bezel door as described in “Bezel Door Removal” on page 376. 2. Remove the front bezel as described in “Front Bezel Removal (286/E4)” on page 378. 3. Remove the top cap as described in “Top Cap Removal (286/E4)” on page 367. 4. Remove the operator panel as described in “Operator Panel Removal” on page 419. 5. Remove the two retaining screws that secure the operator-panel EMC bracket to the operator-panel bracket. 6. Remove the four retaining screws that secure the operator-panel bracket to the top cover. 7. Remove the four retaining screws that secure the top cover to the system chassis. 8. Remove the front screw that secures the top cover to the right panel assembly. This screw is located on the front inside edge of the system, just below the top cover. 9. Remove the back cover as described in “Back Cover Removal (286/E4)” on page 372. 10. From the back of the system, remove the rear screw that secures the top cover to the right panel assembly. This screw is located on the rear inside edge of the system, just below the top cover. 11. From the back of the system, remove the two retaining screws that secure the top cover to the system top-cover rear bracket. 12. Lift the top cover up and off the system chassis. 1 Top Cap 6 286/E4 2 Operator Panel EMC Bracket 7 Bezel Door 3 Operator Panel 8 Bezel 4 Operator Panel Bracket 9 Rear Top Cover Mounting Bracket 5 Top Cover 10 Rear Cover 368 Service Guide Top Cover Replacement (286/E4) To replace the top cover, do the following: 1. Ensure that the rear top cover bracket is in place. If the rear top cover bracket has not been installed, install it now with two retaining screws. 2. Mount, and then align the top cover onto the system chassis. 3. Replace the four retaining screws that secure the top cover to the system chassis. Note: Do not tighten the screws at this time 4. Replace the front screw that secures the top cover to the right panel assembly. This screw will be located on the front inside edge of the system, just below the top cover. 5. At the back of the system, replace the rear screw that secures the top cover to the right panel assembly. This screw will be located on the rear inside edge of the system, just below the top cover. 6. At the back of the system, replace the two retaining screws that secure the top cover to the top-cover rear bracket. 7. Tighten all replaced screws in step 3, then tighten the screws replaced in steps 4 through 6. 8. Replace the back cover as described in “Back Cover Replacement (286/E4)” on page 373. 9. Mount, and then align the operator-panel bracket, ensuring that the operator-panel bracket screw-clearance holes are exactly centered over their mating screw holes, located on the top cover. 10. With a retaining screw, secure one of the operator-panel bracket corners . Repeat this step, securing a second corner on the operator-panel bracket, diagonally across from the first. Note: Before going to the next step, ensure that the four remaining clearance holes on the operator-panel bracket are exactly centered over their mating screw holes. If not, loosen one of the screws and realign the clearance holes to the screw holes. 11. With two retaining screws, secure the two remaining corners of the operator-panel bracket. 12. Mount, then secure with two retaining screws, the operator-panel EMC bracket onto the operator-panel bracket. 13. Replace the operator panel as described “Operator Panel Replacement” on page 420. 14. Replace the top cap as described in “Top Cap Replacement (286/E4)” on page 367. 15. Replace the front bezel as described in “Front Bezel Replacement (286/E4)” on page 378. 16. Replace the bezel door as described in “Bezel Door Replacement” on page 376. Chapter 9. Removal and Replacement Procedures 369 Outer Side Cover Removal (286/E4) To remove the outer side cover, do the following: 1. Loosen the thumbscrew on the rear of the cover. See the following illustration for the thumbscrew location. 2. Slide the cover toward the rear of the system to release the four locking tab slots located on the top flange of the outer side cover. 3. Grasp the top back corner of the outer side cover. 4. Pull out the top edge of the outer side cover. Note: The top edge of the outer side cover must be clear of the top cover. 5. Pull up to release the bottom edge of the outer side cover. 6. Put the cover in a safe place. 1 286/E4 4 Cover Retaining Slot 2 Locking Tab Slot for Outer Side Cover 5 Thumbscrew 3 Outer Side Cover Outer Side Cover Replacement (286/E4) To replace the outer side cover, do the following: 1. Place the bottom edge retaining slot of the outer side cover over the outer edge of the system base. 2. Pivot the outer side cover up toward the top of the system. 3. Align the four top locking tab slots, located on the outer side cover, with the locking tabs, located on the bottom side of the top cover. 4. Push the cover in. When correctly seated, push the outer side cover forward, toward the front of the chassis. The four top locking tabs are now locked in place. 5. Align the thumbscrew, located on the rear of the outer side cover, with the screw hole located on the back of the chassis. 6. Tighten the thumbscrew. 370 Service Guide Service Access Cover Removal (286/E4) To remove the service access cover, do the following: 1. Remove the 286/E4’s outer side cover as described in “Outer Side Cover Removal (286/E4)” on page 370. 2. Loosen the three captive thumbscrews located on the rear of the cover. See the following illustration for thumbscrew locations. 3. Slide the cover toward the rear of the system drawer. After the front of the service access cover has cleared the front chassis ledge, lift the cover out of the system. Attention: For proper cooling and airflow, replace the cover before turning on the system. Operating the system for extended periods of time (over 30 minutes) with the cover removed might damage the system components. 1 286/E4 4 Service Access Cover 2 Service Access Cover Locking Tab 5 Outer Side Cover 3 Front Chassis Ledge 6 Thumbscrews Chapter 9. Removal and Replacement Procedures 371 Service Access Cover Replacement (286/E4) To replace the service access cover, do the following: 1. Align the service access cover with the side of the system, about 25 mm (1 inch) from the front of the system. The flanges on the top and bottom of the cover wrap around the system chassis. 2. Hold the service access cover against the system drawer and slide it toward the front of the system. The front edge of the service access cover slides beneath the chassis ledge. 3. Tighten the three thumbscrews located on the rear of the cover. Back Cover Removal (286/E4) To remove the back cover, do the following: 1. Grasp each side of the back cover and pull straight up and then away from the system. This action releases the two top and two bottom tabs that secure the cover to the back of the system. See the following illustration for the tab locations. 2. Put the back cover in a safe place. 1 Back Cover Top Retaining Tab (Quantity 2) 3 Back Cover Bottom Locking Slot (Quantity 2) 2 Back Cover Top Locking Slot (Quantity 2) 4 Back Cover Bottom Retaining Tab (Quantity 2) 372 Service Guide Back Cover Replacement (286/E4) To replace the back cover, do the following: 1. Ensure all cables have been connected and are correctly seated. 2. Align the back cover side edges with the side edges of the system chassis back. 3. Raise the back cover about 1 inch higher than the system, ensuring tab to slot clearance. 4. Push the back cover straight down. This action seats the tabs into their retaining slots. Disk Drive Access Cover Removal (286/E4) To remove the disk drive access cover, do the following: 1. Open the bezel door. 2. Locate the disk drive access cover on the front bezel of the system, as shown in the following illustration. 3. Press down on the release tab located at the top of the access cover. 4. Gently pull the top of the access cover out and away from the system. Note: The two alignments tabs that are located on the bottom front edge of the access cover must be lifted up and then out of their slots. 5. Put the cover in a safe place. 1 286/E4 4 Disk Drive Access Cover 2 Bezel Door 5 Alignment Tabs 3 Disk Drive Access Cover Release Tab Disk Drive Access Cover Replacement (286/E4) To replace the disk drive access cover, do the following: 1. Insert the two alignment tabs, located on the bottom front edge of the access cover, into the two slots located on the front bezel. 2. After inserting the alignment tabs into the slots, pivot the access cover up until the release tab clicks into the closed position. 3. Close the bezel door. Chapter 9. Removal and Replacement Procedures 373 Right Side Cover (System Base) Removal (286/E4) To remove the right side cover (system base), do the following: 1. Remove the bezel door as described in “Bezel Door Removal” on page 376. 2. Remove the front bezel as described in “Front Bezel Removal (286/E4)” on page 378. 3. Remove the top cap as described in “Top Cap Removal (286/E4)” on page 367. 4. Remove the operator panel as described in “Operator Panel Removal” on page 419. 5. Remove the back cover as described in “Back Cover Removal (286/E4)” on page 372. 6. Remove the outer service access cover as described in “Outer Side Cover Removal (286/E4)” on page 370. 7. Remove the top cover as described in “Top Cover Removal (286/E4)” on page 368. 8. Remove the four retaining screws that secure the system chassis to the system base. 9. Use three people to lift the system up and off the alignment pins. 10. To remove the system base plate, lay the system onto its side and remove the four retaining screws. 1 Top Cap 8 Bezel 2 Operator Panel EMC Bracket 9 Rear Top Cover Mounting Bracket 3 Operator Panel 10 Rear Cover 4 Operator Panel Bracket 11 System Base Plate 5 Top Cover 12 Operator Panel Extension Cable 6 286/E4 13 Right Side Cover (System Base) 7 Bezel Door 374 Service Guide Right Side Cover (System Base) Replacement (286/E4) To replace the right side cover (system base), do the following: 1. Ensure that the system base plate is secured to the system chassis. If the system base plate has not been secured to the system chassis, lay the system chassis onto its side and secure the system base plate with four retaining screws. 2. Using three people, lift the system up and seat the system chassis and the attached mounting plate over the two alignment pins. The alignment pins are located at each end of the system base. 3. Using four retaining screws, secure the system chassis to the system base . 4. Replace the top cover as described in “Top Cover Replacement (286/E4)” on page 369. 5. Replace the outer service access cover as described in “Outer Side Cover Replacement (286/E4)” on page 370. 6. Replace the back cover as described in “Back Cover Replacement (286/E4)” on page 373. 7. Replace the top cap as described in “Top Cap Replacement (286/E4)” on page 367. 8. Replace the operator panel as described in “Operator Panel Replacement” on page 420. 9. Replace the front bezel as described in “Front Bezel Replacement (286/E4)” on page 378. 10. Replace the bezel door as described in “Bezel Door Replacement” on page 376. Chapter 9. Removal and Replacement Procedures 375 Bezel Door (286/E4) The following procedure covers removal and replacement of the bezel door. Bezel Door Removal Refer to the following illustration while you perform the steps in this procedure. To remove the bezel door, do the following: 1. Open the bezel door by grasping the bezel door handle. Pull the bezel door out and away from the system. 2. Press down on the top back edge of the door. 3. Gently swivel the top back edge of the door forward, out past the top of the system. 4. Lift the door up to release it from the lower retaining post. 1 286/E4 4 Bezel Door 2 System Handle Assembly 5 Bezel Door Handle 3 Bezel Door Top Retaining Post 6 Bezel Door Lock Tab 7 Bezel Door Lower Retaining Post Bezel Door Replacement To replace the bezel door, do the following: 1. Place the bezel door on the lower retaining post. See the preceding illustration. 2. Rotate the bezel door toward the top of the system. 3. Press down on the back edge of the bezel door, and seat the top post into its mating hole. Note: The mating hole for the top post is located on the bottom side of the system handle assembly. 4. Close the bezel door. 376 Service Guide Bezels Before performing the following procedures, read the “Safety Notices” on page xi. Front Bezel Removal (286/C4) To remove the bezel, do the following: 1. Open the front rack door and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 2. Press in on both bezel-release tabs at the same time. See the following illustration for bezel-release tab locations. 3. Gently pull the bezel away from the system. 4. Put the bezel in a safe place. 1 Rack Door 3 Bezel-Release Tab (Located on each side of bezel) 2 Bezel 4 19-Inch Rack Front Bezel Replacement (286/C4) To replace the front bezel, do the following: 1. Open the front rack door and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 2. Seat the two tabs located on the bottom edge of the bezel into their mating slots located on the bottom edge of the system. 3. Pivot the bezel up, aligning the release tabs to the mating slots located on the front of the system. 4. Gently push the tabs into the slots until the bezel seats against the front of the system. 5. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 6. Close the rack door. Chapter 9. Removal and Replacement Procedures 377 Front Bezel Removal (286/E4) Refer to the following illustrations while you perform the steps in this procedure. To remove the front bezel, do the following: 1. Remove the bezel door as described in “Bezel Door (286/E4)” on page 376. 2. Press down on the two bezel-release levers. 3. Pull the top of the bezel out and away from the system chassis. 4. Gently pull the bezel up off the base, releasing the lower bezel locking tabs. 5. Put the bezel and bezel door in a safe place. 1 286/E4 4 Front Door 2 Bezel-release lever 5 Lower Bezel Locking Tab (Quantity 2) 3 Front Bezel 6 Lower Bezel Retaining Slots (Quantity 2) Front Bezel Replacement (286/E4) To replace the front bezel, do the following: 1. Place the two lower bezel locking tabs into the retaining slots located on the chassis base. 2. Rotate the bezel up toward the top of the chassis, ensuring that the aligning pins are aligned with their mating holes located on the chassis. 3. Gently push the bezel in until the two bezel release levers are seated in their respective slots. 4. Replace the bezel door as described in “Bezel Door (286/E4)” on page 376. 378 Service Guide PCI Adapters Before performing this procedure, read “Safety Notices” on page xi. Notes: 1. Some PCI adapter cards are shipped from the manufacturer with a blue handle or support along the back edge of the card. To use in this system unit, you must remove the blue handle or support from the card. 2. With this system, you can install PCI adapters with the power on. These adapters are referred to as hot-pluggable PCI adapters. Some adapters are not hot-pluggable, and the system must be powered off for this type of adapter installation. If you are removing a PCI adapter that is: v Non-hot-pluggable, go to “Non-Hot-Pluggable PCI Adapter Removal” on page 381. v Hot-pluggable, go to “Hot-Pluggable PCI Adapter Removal” on page 385. PCI Hot-Plug Manager Access The installation instructions for hot-pluggable PCI adapters refer you to these procedures when it is appropriate to perform them. Attention: The Linux operating system does not support any hot-pluggable procedures, nor does it support the hot-plugging of any hot-pluggable PCI adapters. A system with the Linux operating system installed on one or more partitions must be shut down and powered off before replacing any PCI adapter assigned to a Linux partition. When replacing a PCI adapter in any partition with the Linux operating system installed, follow the non-hot-pluggable adapter procedures. Note: A PCI adapter is only hot-pluggable if the PCI adapter is supported for hot-plug applications. See the PCI Adapter Placement Reference, order number SA38-0538, for complete information regarding your PCI adapter. Accessing Hot-Plug Management Functions Note: Removing or installing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline prior to performing the operation. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. This publication is also contained on the AIX Documentation CD. The documentation is made accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM drive. Chapter 9. Removal and Replacement Procedures 379 To access the hot-plug menus, do the following: 1. Log in as root user. 2. At the command line, type smitty. 3. Select Devices. 4. Select PCI Hot Plug Manager and press Enter. 5. The PCI Hot-Plug Manager menu displays. Return to the procedure that directed you here. The following section describes the menu options. PCI Hot-Plug Manager Menu The following options are available from the PCI Hot Plug Manager menu: Note: For information about the PCI slot LED states, see “Component LEDs” on page 466. List PCI Hot-Plug Slots Provides a descriptive list of all slots that support PCI hot-plug capability. If the listing for a slot indicates it holds an “Unknown” device, select the Install/Configure Devices Added after IPL to configure the adapter in that slot. Add a PCI Hot-Plug Adapter Allows the user to add a new PCI hot-plug-capable adapter to the slot with the system turned on. You will be asked to identify the PCI slot that you have selected prior to the actual operation. The selected PCI slot will go into the Action state and finally into the On state. Note: The system will indicate the slot holds an “Unknown” device until you perform the Install/Configure Devices Added After IPL option to configure the adapter. Replace/Remove a PCI Hot-Plug Adapter Allows the user to remove an existing adapter, or replace an existing adapter with an identical one. For this option to work, the adapter must be in the Defined state (see the “Unconfigure a Device” option). You will be asked to identify the PCI slot prior to the actual operation. The selected PCI slot will go into the Action state. Identify a PCI Hot-Plug Slot Allows the user to identify a PCI slot. The selected PCI slot will go into the Identify state. See “Component LEDs” on page 466. Unconfigure a Device Allows the user to put an existing PCI adapter into the Defined state if the device is no longer in use. This step must be completed successfully before starting any removal or replacement operation. If this step fails, the customer must take action to release the device. 380 Service Guide Configure a Defined Device Allows a new PCI adapter to be configured into the system if software support is already available for the adapter. The selected PCI slot will go into the On state. Install/Configure Devices Added After IPL The system attempts to configure any new devices and tries to find and install any required software from a user-selected source. The add, remove, and replace functions return information to the user indicating whether the operation was successful. If additional instructions are provided on the screen, complete the recommended actions. If the instructions do not resolve the problem, do the following: v If the adapter is listed as Unknown, perform the Install/Configure Devices Added After IPL option to configure the adapter. v If you receive a warning indicating that needed device packages are not installed, the system administrator must install the specified packages before you can configure or diagnose the adapter. v If you receive a failure message indicating a hardware error, the problem might be either the adapter or the PCI slot. Isolate the problem by retrying the operation in a different PCI slot, or trying a different adapter in the slot. If you determine that you have failing hardware, call your service representative. v Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering. Consult with your system administrator or software support to determine the correct method to configure the replacement device. Non-Hot-Pluggable PCI Adapter Removal To remove a PCI adapter, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Disconnect the power source to the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the system power source has been completely disconnected. 4. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. See the following illustration. 5. Determine which adapters you plan to remove, then label and disconnect all cables attached to that adapter. 6. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system unit. 7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 8. If you are removing a long adapter, make note of how the back edge of the adapter slides into the molded grooves located on the back wall of the PCI cooling fan bracket. Chapter 9. Removal and Replacement Procedures 381 1 Service Access Cover 3 System Unit 1a Outer Access Cover 4 PCI Cooling Fan Bracket 2 PCI Adapters 9. Rotate the adapter retainer clip counterclockwise, as shown in the following illustration. 10. The tab resting on the adapter retention bracket should spring to a vertical position, coming to rest next to the adapter retention latch. 11. Carefully grasp the PCI adapter by its top edge or upper corners, and remove it from the system. Store the adapter in a safe place. 12. If you do not plan to install another adapter into the vacated slot, seal the expansion slot using an expansion-slot cover. If you are installing another adapter, go to “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 383. 13. If you have other options to install, refer to “Procedure List” on page 358. If you do not have other options to install, continue on to the next step. 14. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 15. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 16. Connect the adapter cables. 17. Reconnect the power source to the system. 18. On a 286/C4, route the cables through the cable-management arm. 19. Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. If necessary, refer to “Starting the System without an HMC Attached” on page 360. After you have completed checking the system close, the rack or system doors. 382 Service Guide 1 286/C4 or 286/E4 5 PCI adapter faceplate (dotted lines) 2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked position) 3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position) 7 PCI Adapter 4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position) Adding or Replacing a Non-Hot-Pluggable PCI Adapter To add or install an adapter, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Disconnect the power source to the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the system has been completely disconnected from its power source. 4. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 5. Refer to the PCI Adapter Placement Reference for information regarding slot restrictions for adapters that can be used in this system. 6. If necessary, remove the adapter expansion slot shield. 7. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 8. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold-edge connectors on the adapter. Chapter 9. Removal and Replacement Procedures 383 9. Place the adapter, component-side up, on a flat, static-protective surface. 10. Set any jumpers or switches as instructed by the adapter’s manufacturer. 11. Carefully grasp the adapter by its top edge, and align the adapter with the expansion slot and its connector on the PCI riser card. 12. Press the adapter firmly into its connector. Attention: When you install an adapter into the system, be sure that it is completely and correctly seated in its connector located on the PCI riser card. Inserting and seating the adapter card improperly might cause damage to the PCI riser card or the adapter. 13. Lower the tab onto the PCI adapter EMC shield/connector faceplate. Rotate the adapter retainer clip clockwise until it covers the tab at approximately a 45-degree angle. See the following illustration. 1 System Unit 2 Adapter Retainer Assembly (in the unlocked position) 3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position) 4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position) 5 PCI adapter face plate (dotted lines) 14. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 15. If you are servicing a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 16. Connect the adapter cables. 17. Reconnect the power source to the system. 18. If you are servicing a 286/C4, route the cables through the cable-management arm. 19. Go to ²MAP 0410: Repair Checkout² in Diagnostic Information for Multiple Bus Systems. If necessary, refer to “Starting the System without an HMC Attached” on page 360. After you have completed checking the system, close the rack or system doors. 384 Service Guide Hot-Pluggable PCI Adapter Removal Note: Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must also be taken offline. This action prevents a service representative or user from causing an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. This publication is also contained on the AIX Documentation CD. The documentation is made accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM drive. To remove a hot-pluggable PCI adapter, do the following: 1. On a 286/C4, open the front and rear rack doors, and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 3. Determine which adapters you plan to remove. 4. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the rear of the system unit. 5. Ensure that any processes or applications that might use the adapter are stopped. 6. Refer to “PCI Hot-Plug Manager Access” on page 379, and follow the steps in the access procedure to select PCI Hot Plug Manager. Then return here to continue. 7. Select Unconfigure a Device and press Enter. 8. Press F4 to display the Device Names menu. 9. Select the adapter you are removing. 10. Use the Tab key to answer NO to Keep Definition and YES to Unconfigure Child Devices. Press Enter. 11. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 12. Label and disconnect all cables attached to that adapter. 13. Press F3 to return to the PCI Hot-Plug Manager menu. 14. Select List PCI Hot-Plug Slots. 15. If the List PCI Hot-Plug Slots menu indicates that the slot is not empty, go to step 16. Chapter 9. Removal and Replacement Procedures 385 16. If the List PCI Hot-Plug Slots menu indicates that the slot is empty, do the following: a. Select F3 to cancel. b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI slot is empty. c. On the display screen, select the indicated empty slot. Look at the adapter you intend to remove. A slow blinking amber LED located next to the adapter on the PCI riser card indicates that the slot has been identified. Press Enter. The adapter has been put into the action state. d. Disconnect any cables that are connected to the adapter being removed. e. Completely remove the hot-plug adapter from the slot. The LED goes off. f. Go to step 24 on page 387. 17. Move the cursor to select the adapter that you are removing and press Enter. (The description entry displays as unknown). 18. Press the Tab key until the entry field displays the remove operation and then press the Enter key. Follow the instructions that display on the screen until you are instructed to remove the adapter. 19. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed. 20. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 21. If you are removing a long adapter, make note of how the back edge of the adapter slides into the molded grooves located on the back wall of the PCI cooling fan bracket. 1 Service Access Cover 3 System Unit 1a Outer Side Cover 4 PCI Cooling Fan Bracket 2 PCI Adapters 22. Turn the locking latch, and lift the plastic retainer seat off the adapter. 386 Service Guide 1 System Unit 5 PCI adapter face plate (dotted lines) 2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked position) 3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position) 7 PCI Adapter 4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position) 23. Carefully grasp the adapter by the edges, and pull it straight out from the PCI riser card. 24. If you are not installing another adapter in this slot, place an expansion slot cover in the adapter slot opening. 25. Lower the plastic retainer seat over the PCI adapter faceplate. 26. Rotate the locking latch clockwise until it clicks into the locked position. 27. Continue to follow the screen instructions until you receive a message that the adapter removal is successful. Successful removal is indicated by the OK message displayed next to the Command field at the top of the screen. 28. If you do not have other adapters to remove, continue with the next step. OR If you have other adapters to remove, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 14 on page 385. 29. Press F10 to exit the Hot-Plug Manager. 30. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. 31. If you have other options to install, refer to “Procedure List” on page 358. If you do not have other options to install, continue on to the next step. 32. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 33. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 34. Connect the adapter cables. 35. On a 286/C4, route the cables through the cable-management arm. 36. On a 286/C4, close the rack doors. On a 286/E4, close the bezel door. Chapter 9. Removal and Replacement Procedures 387 Replacing a Hot-Pluggable PCI Adapter Notes: 1. Use this procedure only when you are replacing an adapter with an identical adapter. If you are replacing an adapter with an adapter that is not identical to the adapter removed, go to “Hot-Pluggable PCI Adapter Removal” on page 385 and “Installing a Hot-Pluggable PCI Adapter” on page 392. 2. Removing a hot-pluggable PCI adapter requires the system administrator to take the PCI adapter offline before performing any PCI adapter hot-plug procedures. Before taking an adapter offline, the devices attached to the adapter must be taken offline as well. This action prevents an unexpected outage for system users. For additional information about taking an adapter offline or removing it from the system configuration, see the AIX System Management Guide: Operating System and Devices. This publication is also contained on the AIX Documentation CD. The documentation is made accessible by loading the documentation CD onto the hard disk or by mounting the CD in the CD-ROM drive. To replace a hot-plug PCI adapter, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 3. Refer to the PCI Adapter Placement Reference, order number SA38-0538, for information regarding slot restrictions for adapters that can be used in this system. 4. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 5. Determine the slot from which you are removing the adapter. 6. Ensure that any processes or applications that might use the adapter are stopped. 7. Refer to “PCI Hot-Plug Manager Access” on page 379, and follow the steps in the access procedure to select PCI Hot Plug Manager. Then return here to continue. 8. Select Unconfigure a Device and press Enter. Note: If the adapter or device has a problem that prevents it from being replaced by hot-plug, it must be replaced by shutting down the system. Go to “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 383. 9. Press F4 to display the Device Names menu. 388 Service Guide 10. From the menu, select the adapter you are removing. 11. Use the tab key to answer YES to Keep Definition and YES to set Unconfigure Child Devices. Press Enter. 12. The ARE YOU SURE screen displays. Press Enter to verify the information. Successful unconfiguration is indicated by the OK message displayed next to the Command field at the top of the screen. 13. Press F3 to return to the PCI Hot-Plug Manager menu. 14. Select List PCI Hot-Plug Slots and press Enter. 15. If the List PCI Hot-Plug Slots menu indicates that the slot is not empty, go to substep 17. If the List PCI Hot-Plug Slots menu indicates that the slot is empty, do the following: a. Select F3 to cancel. b. Select Add PCI Hot-Plug Adapter, and press Enter. The display screen indicates that the PCI slot is empty. c. On the display screen, select the indicated empty slot. Look at the adapter you intend to replace or reseat. A slow blinking amber LED located on the PCI riser card is indicating that the slot has been identified. Press Enter. The adapter is now in the action state. d. Completely remove the hot-plug adapter from the slot. The LED goes off. e. Reinstall the adapter, ensuring that it is correctly seated. Press Enter. f. Select F3 to return to the PCI Hot-Plug Adapter Menu. g. Select Install/Configure Devices Added after IPL. h. Select F3 to return to the PCI Hot-Plug Adapter Menu. i. Connect the appropriate cables and devices to the adapter. j. Select List PCI Hot-Plug Slots, and press Enter. k. Return to the error code or procedure that sent you here, if the List PCI Hot-Plug Slots menu indicates that the slot for the just replaced or reseated adapter is empty. 16. Select F3 to cancel. 17. Select Replace/Remove a PCI Hot-Plug Adapter and press Enter. The Replace/Remove a PCI Hot-Plug Adapter menu displays. 18. Move the cursor to select the adapter that you are removing, and press Enter. 19. Press the Tab key until the entry field displays the replace operation, and then press Enter. Follow the instructions that display on the screen until you are instructed to remove the adapter. 20. When you are instructed to remove the adapter from the adapter slot, disconnect any cables that are connected to the adapter being removed. Chapter 9. Removal and Replacement Procedures 389 21. Turn the locking latch, and lift the plastic stop for the adapter. 1 System Unit 5 PCI adapter faceplate (dotted lines) 2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked position) 3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position) 7 PCI Adapter 4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position) 22. Carefully grasp the adapter by the edges, and pull it straight out from the PCI riser card. 23. If necessary, remove the replacement adapter from the antistatic package. Attention: Avoid touching the components and gold-edge connectors on the adapter. 24. Place the adapter, component-side up, on a flat, static-protective surface. 25. Set any jumpers or switches as instructed by the adapter manufacturer. 26. Replace the adapter into the adapter slot. Carefully grasp the adapter by the edges and align the adapter in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing a full-length adapter, ensure that both ends of the adapter engage the card guides. 27. Lower the plastic retaining seat over the PCI adapter face plate (dotted lines). Rotate the locking latch clockwise until it clicks into the locked position. Some full-length cards can be supported by rotating the blue adapter latch on the right end of the adapter counterclockwise. 390 Service Guide 1 System Unit 2 Adapter Retainer Assembly (in the unlocked position) 3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position) 4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position) 5 PCI adapter faceplate (dotted lines) 28. Connect appropriate cables and devices to the adapter. 29. Continue to follow the screen instructions until you receive a message that the replacement is successful. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. 30. Press the F3 key to return to the PCI Hot-Plug Manager menu. 31. Select Install/Configure Devices Added After IPL and press Enter. Follow the instructions on the screen. Successful replacement is indicated by the OK message displayed next to the Command field at the top of the screen. Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering. Consult with your system administrator or software support to determine the correct method to configure the replacement device. 32. If you do not have other adapters to replace, continue with the next step. OR If you have other adapters to replace, press the F3 key to return to the PCI Hot-Plug Manager menu and then return to step 14 on page 389. 33. Press F10 to exit the Hot-Plug Manager. If you have added, removed, or replaced any adapters, run the diag -a command. If the system responds with a menu or prompt, follow the instructions to complete the device configuration. Chapter 9. Removal and Replacement Procedures 391 34. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 35. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 36. Connect the adapter cables. 37. On a 286/C4, route the cables through the cable-management arm. 38. On a 286/C4, close the rack doors. On a 286/E4, close the bezel door. Installing a Hot-Pluggable PCI Adapter Note: This system supports PCI hot-pluggable adapters, so the system does not need to be shut down and the power cables removed before adding or replacing an adapter. To add or install an adapter, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 3. Refer to the PCI Adapter Placement Reference, order number SA38-0538, for information regarding slot restrictions for adapters that can be used in this system. 4. If necessary, remove the adapter expansion slot shield. 5. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 6. If necessary, remove the adapter from the antistatic package. Attention: Avoid touching the components and gold-edge connectors on the adapter. 7. Place the adapter, component-side up, on a flat, static-protective surface. 8. Set any jumpers or switches as described by the adapter’s manufacturer. 9. Refer to “PCI Hot-Plug Manager Access” on page 379, and follow the steps in the access procedure to select PCI Hot Plug Manager. Then return here to continue. 10. From the PCI Hot-Plug Manager menu, select Add a PCI Hot-Plug Adapter and press Enter. The Add a Hot-Plug Adapter window displays. 11. See the PCI Adapter Placement Reference, order number SA38-0538, for adapter placement information. Then select an empty PCI slot for the adapter. 12. Select the appropriate empty PCI slot from the ones listed on the screen, and press Enter. 392 Service Guide 13. Rotate the locking retainer clip, lift the plastic stop, and remove the expansion slot cover. 1 System Unit 5 PCI adapter faceplate (dotted lines) 2 Adapter Retainer Assembly 6 Adapter Retainer Assembly (in the unlocked position) 3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position) 7 PCI Adapter 4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position) 14. Follow the instructions on the screen to install the adapter until the LED for the specified PCI slot is set to the Action state. See “Component LEDs” on page 466. 15. When you are instructed to install the adapter in the adapter slot, carefully grasp the adapter by the edges and align the adapter in the slot guides. Insert the adapter fully into the adapter slot connector. If you are installing a full-length adapter, ensure that both ends of the adapter engage the card guides. Chapter 9. Removal and Replacement Procedures 393 16. Lower the tab onto the PCI adapter faceplate. Rotate the adapter retainer clip clockwise until it covers the tab at approximately a 45-degree angle. See the following illustration. 1 System Unit 2 Adapter Retainer Assembly (in the unlocked position) 3 Adapter Retainer Assembly (retainer seat down and the retainer clip in the unlocked position) 4 Adapter Retainer Assembly (retainer seat down and the retainer clip in the locked position) 5 PCI adapter faceplate (dotted lines) 17. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 18. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 19. Connect the adapter cables. 20. On a 286/C4, route the cables through the cable-management arm. 21. On a 286/C4, close the rack doors. On a 286/E4, close the bezel door. 394 Service Guide Fans Before performing the following procedures, read “Safety Notices” on page xi. Removing a Processor Fan in a Non-Hot-Swap Environment Note: A processor fan in a non-hot-swap environment means that your system is configured at the factory with one processor fan installed. To place your system into an environment that enables the hot-swapping of the processor fans, you must purchase a second processor fan and insert it into the unused docking connector. It is not necessary to shut down the system when adding the second processor fan. To remove a processor fan in a non-hot-swap environment, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 4. Continue with the procedure in “Hot-Swap Fan Assembly Removal” on page 397. 1 286/C4 4 Processor Fan Amber LED 2 Service Access Cover 5 Finger Holes 3 Processor Card Cooling Fan (Fan #1) 6 Processor Fan Bottom Connector 7 Processor Cooling Fans Access Cover Chapter 9. Removal and Replacement Procedures 395 1 2 8 3 4 7 5 6 1 286/E4 5 Finger Holes 2 Outer Side Cover 6 Processor Fan Bottom Connector 3 Service Access Cover 7 Processor Cooling Fans Access Cover 4 Processor Fan Amber LED 8 Processor Card Cooling Fan (Fan #1) Replacing a Processor Fan in a Non-Hot-Swap Environment To replace a processor fan in a non-hot-swap environment, do the following: 1. Lift the processor fan access cover. 2. Place your thumb and index finger into the two holes located on the top of the replacement fan. 3. Align the connector located on the fan assembly with the docking connector located on the CEC backplane. 4. Insert and firmly press the replacement fan into the docking connector closest to the back of the chassis. 5. Ensure the cooling fan access cover springs back to the closed position. 6. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 7. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 8. Reconnect the power cables. 9. Power on the system as described in “Starting the System without an HMC Attached” on page 360. 10. After the 286/C4 has successfully powered on, close the rack doors. After the 286/E4 has successfully powered on, close the bezel door. 396 Service Guide Hot-Swap Fan Assembly Removal This section contains procedures for removing processor cooling fans and PCI adapter cooling fans. Select the removal procedure according to the hot-swap fan assembly that you are servicing. To remove a hot-swap fan, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 3. Locate the cooling fan assemblies, as shown in the following illustration. 4. The failing fan assembly has the amber-colored LED that is lit. Note: All of the hot-swap fan assemblies used in your system have an amber-colored LED located on the top of the fan assembly. When lit, this LED indicates that its fan assembly has a problem. The fan might not be spinning or is not spinning fast enough to help cool the system unit. Hot-Swap Processor Fan Removal If you are servicing one of the processor cooling fan assemblies, do the following: 1. Lift the processor fan access cover. 2. Place your thumb and index finger into the two finger holes, located on the top of the faulty fan. 3. Lift the faulty fan out of the system. Notes: a. The processor cooling fans have a docking connector, located on the bottom of the fan assembly. The docking connectors automatically disconnect as the fan is being removed from the chassis. b. Unless you are performing a hot-swap removal or replacement of the processor fans, keep the processor fan access cover closed. Hot-Swap PCI Adapter Fan Removal Attention: The system will power off if the failing PCI cooling fan has not been exchanged within five minutes. If you are servicing one of the PCI adapter cooling fan assemblies, do the following: 1. Disconnect the cooling fan’s power cable from the connector, located on the front of the fan assembly. 2. Place your index finger below the front connector and slide the fan out of the fan cage. Note: The adapter cooling fan assemblies are arranged in stacked pairs. You cannot replace a single fan in the stacked pair. Chapter 9. Removal and Replacement Procedures 397 286/C4 1 286/C4 7 Processor Fan Bottom Connector 2 Service Access Cover 8 Processor Cooling Fans Access Cover 3 Processor Fan Amber LED 9 Adapter Cooling Fan Retaining Bracket 4 Finger Holes 10 PCI Adapter Cooling Fan Assembly (Fan #4) 5 Processor Card Cooling Fan (Fan #1) 11 PCI Adapter Cooling Fan Assembly (Fan #3) 6 Processor Card Cooling Fan (Fan #2) 12 PCI Adapter Fan Amber LED 13 Adapter Cooling Fan Front Connector 398 Service Guide 286/E4 1 286/E4 8 Processor Cooling Fans Access Cover 2 Outer Side Cover 9 Processor Card Cooling Fan (Fan #2) 3 Service Access Cover 10 Adapter Cooling Fan Assembly (Fan #4) 4 Processor Fan Amber LED 11 Adapter Cooling Fan Assembly (Fan #3) 5 Finger Holes 12 PCI Adapter Fan Amber LED 6 Processor Card Cooling Fan (Fan #1) 13 PCI Adapter Cooling Fan Front Connector 7 Processor Fan Bottom Connector Hot-Swap Fan Assembly Replacement Note: This note applies only to systems running the AIX operating system. If a fan assembly is being replaced for a redundant failure, after the service repair action is completed, ask the customer to check the crontab file for any power/cooling warning messages. When a power or cooling error is encountered, AIX adds an entry to the crontab file to ²wall² a warning message every 12 hours, to alert or remind the customer of the problem. Replacing the faulty part does not clear this crontab entry, so unless the crontab file is edited to remove this entry, the customer continues to be reminded of the failure despite its having been repaired. Use the crontab -l command to read the crontab file to determine if an entry exists. Use the crontab -e command to edit the file. The hot-swap fan removal section has a procedure for processor cooling fans and PCI adapter cooling fans. Select the removal procedure according to the hot-swap fan assembly you are servicing. To replace your system’s hot-swap cooling fans, do the following: Chapter 9. Removal and Replacement Procedures 399 Hot-Swap Processor Fan Replacement When installing a hot-swap processor cooling fan, do the following: 1. Lift the processor fan access cover. 2. Place your thumb and index finger into the two holes located on the top of the replacement fan. 3. Align the connector located on the fan assembly with the docking connector located on the CEC backplane. 4. Insert the replacement fan and firmly press it into its docking connector. 5. Verify that the replaced cooling fan’s LED is not lit. If the fan LED is lit, reseat the fan or the fan cable. 6. For proper cooling of the processor, ensure the processor cooling fan access cover springs back to the closed position. 7. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 8. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365 and close the rack doors. On a 286/E4, close the bezel door. Hot-Swap PCI Adapter Fan Replacement Attention: The system will power off if the failing PCI cooling fan has not been exchanged within five minutes. When installing a hot-swap PCI adapter cooling fan, do the following: 1. Before inserting the fan assembly into the fan cage, ensure that the fan connector located on the fan housing is facing the front of the system. 2. Insert the replacement adapter cooling fan assembly into the cooling fan cage. Note: When seated correctly, the top edge of the fan assembly and the fan cage are flush with each other. 3. Reconnect the power cable to the connector located on the front of the fan assembly. 4. Verify that the replaced cooling fan’s LED is not lit. If the fan LED is lit, reseat the fan or the fan cable. 5. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 6. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365 and close the rack doors. On a 286/E4, close the bezel door. 400 Service Guide Media Drives (Diskette, CD-ROM, Tape, or Disk Drive) Note: The two media bays can be used to house an IDE CD-ROM, IDE DVDROM, SCSI DVDRAM, tape drive, or diskette drive. See the illustration on 402 and 403 for the location of the two media bays. However, if your system has an IDE CD-ROM, it should be placed in media bay 1. Before performing this procedure, read the following: v “Safety Notices” on page xi v “Laser Safety Information” on page xiii Media Removal To remove any of the media drives located in either media bay 1 or media bay 2, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. If you are servicing a 286/E4, open the bezel door. 3. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 4. Disconnect the power source to the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 5. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 6. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 7. Disconnect the power and signal cables from the media device that you are servicing. 8. Grasping the top front of the media device, press the release tab located on the bottom center of the media drive, and slide the media drive out of the media bay. 9. Remove the four retaining screws holding the carrier tray assembly to the media drive. 10. Put the media drive and carrier tray assembly in a safe place. Chapter 9. Removal and Replacement Procedures 401 286/C4 1 Service Access Cover 6 Disk Drive (4 Pack) 2 286/C4 7 Optional Media Device (for example: Tape Drive, DVD, CD-ROM, Diskette Drive) 3 Media Carrier Tray 8 IDE CD-ROM 4 Diskette Drive (Optional) 9 Media Carrier Trays 5 Diskette Drive Cage (Optional) 10 Front Bezel 402 Service Guide 286/E4 1 286/E4 7 IDE CD-ROM 2 Disk Drive (4 Pack) 8 Diskette Drive Carrier (Optional) 3 Media Carrier Trays 9 Diskette Drive Cage (Optional) 4 Front Bezel 10 Diskette Drive (Optional) 5 Bezel Door 11 Outer Side Cover 6 Optional Media Device (for example: Tape Drive, DVD, CD-ROM, Diskette Drive) 12 Service Access Cover Chapter 9. Removal and Replacement Procedures 403 Media Replacement To replace a media drive, do the following: 1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 2. Set any jumpers or switches on the device according to the documentation provided with the drive. 3. Using four retaining screws, secure the media carrier tray to the bottom of the media drive. 4. Insert the media drive into the media bay, pushing the drive in until the release tab clicks into the locked position. 5. Connect the media signal and power cables to the drive. 6. Replace the bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 7. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 8. If you are servicing a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 9. Reconnect the power source to the system. 10. Power on the system as described in “Starting the System without an HMC Attached” on page 360. 11. After the 286/C4 has successfully powered on, close the rack doors. After the 286/E4 has successfully powered on, close the bezel door. 404 Service Guide Hot-Plug Disk Drives Before performing the following procedure, read “Safety Notices” on page xi. Attention: The Linux operating system does not support hot-pluggable procedures, nor does it support hot-plugging of any disk drives. A system with the Linux operating system installed on one or more partitions must be shut down and powered off before replacing any disk drives assigned to a Linux partition. Note: You do not have to power off the system to remove a hot-plug disk drive. Before you perform these procedures, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration. Physically removing a hot-plug disk drive from the system before it has been removed from the system configuration can cause unrecoverable data corruption. Deconfiguring (Removing) or Configuring a Disk Drive Use this procedure if you are removing or installing a drive while the system power is turned on. Deconfiguring (Removing) Before removing a disk drive from a volume group, do the following: 1. Log in as root user. 2. Before a disk can be removed from a volume group, all data must be backed up and the data removed from the disk to be replaced. Use AIX Logical Volume Manager to perform this task. 3. At the command line, type smitty. 4. Select System Storage Manager (Physical and Logical Storage) and press Enter. 5. Select Logical Volume Manager and press Enter. 6. Select Volume Groups and press Enter. 7. Select Set Characteristics of a Volume Group and press Enter. 8. Select Remove a Physical Volume from a Volume Group. 9. Press F4 to list the available volume groups, then select the volume group name and press Enter. 10. Press F4 to select a physical volume, and follow the instructions on the screen to select the physical volume. Then press Enter. 11. Press F3, then select Cancel, to go back to the first menu. 12. Select System Storage Management (Physical and Logical Storage). 13. Select Removable Disk Management. 14. Select Remove a Disk. 15. Select the desired disk from the list on the screen and press Enter. 16. Follow the instructions on the screen to remove the drive. 17. When you are asked ²Are you sure?,² press Enter. The power LED on the drive that you selected will remain on. 18. Remove the disk drive by pulling the disk drive lever toward you until it is completely open. Then remove the disk drive from the slot. The LED on the top of the slot will turn off when the disk drive is removed. 19. Press F10 to exit smitty. Chapter 9. Removal and Replacement Procedures 405 Configuring (Replacing) 1. Remove the disk drive from its protective packaging, and open the drive latch handle. 2. Install the disk drive in the drive slot. Align the disk drive with the drive slot rails, and slide the disk drive into the slot until it contacts the backplane at the rear of the drive bay. The drive should be in far enough for the latch handle to engage the latch. Push the disk drive lever up and to the rear to lock the disk drive. The LED above the slot will turn on. 3. Log in as root user. 4. At the command line, type smitty. 5. Select Devices. 6. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is indicated by the OK message displayed next to the Command field at the top of the screen. 7. Press F3s, Cancel, to go back to the first menu and select System Storage Management (Physical and Logical Storage) and press Enter. 8. Select Logical Volume Manager and press Enter. 9. Select Volume Groups and press Enter. 10. Select Set Characteristics of a Volume Group and press Enter. 11. Select Add a Physical Volume to a Volume Group. 12. Fill in the fields for the drive you are adding to the system. Press F4 for a list of selections. 13. To finish the drive configuration, see the AIX System Management Guide: Operating System and Devices. 14. Press F10 to exit smitty. Hot-Plug Disk Drive Removal Attention: Before you perform the following procedure, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration. Physically removing a hot-plug drive from the system before it has been removed from the system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or Configuring a Disk Drive” on page 405. To remove a hot-plug disk drive, do the following: 1. Before removing the disk, ensure all data has been saved. 2. Deconfigure the drive you are removing from the system. See “Deconfiguring (Removing) or Configuring a Disk Drive” on page 405. 3. On a 286/C4, open the rack front door. On a 286/E4, open the front bezel door. 4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or the disk drive access cover as described in “Disk Drive Access Cover Removal (286/E4)” on page 373. 5. Locate the hot-plug disk drive you plan to remove. If you are removing a faulty disk drive, a lit amber LED located at the front right side of each disk drive will help you isolate the faulty drive. 6. Log in as root user. 7. At the command line, type smitty. 406 Service Guide 8. Select System Storage Management (Physical and Logical Storage). 9. Select Removable Disk Management. 10. Select Remove a Disk. 11. Select the desired disk from the list on the screen and press Enter. 12. Remove the disk drive by placing the handle on the disk drive into the open position (perpendicular to the drive) and pulling the hot-plug disk drive away from the bay. The lit LED on the disk drive will turn off when the disk drive is removed. 1 286/C4 3 Disk Drive Release Handle 2 Disk Drive 4 Front Bezel 1 286/E4 3 Disk Drive Release Handle 2 Disk Drive 4 Disk Drive Access Cover 13. Press F10 to exit smitty. Chapter 9. Removal and Replacement Procedures 407 Hot-Plug Disk Drive Replacement Attention: Before you perform these procedures, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration. Physically removing a hot-plug drive from the system before it has been removed from the system configuration can cause unrecoverable data corruption. Refer to “Deconfiguring (Removing) or Configuring a Disk Drive” on page 405. Do not use Install/Configure Devices Added After IPL if your system is set up to run HACMP clustering. Consult with your system administrator or software support to determine the correct method to configure the replacement disk. To install a hot-plug disk drive, do the following: 1. Ensure the tray handle is open (perpendicular to the drive). 2. Align the drive/tray assembly so that it engages the guide rails in the bay. 3. Push the drive assembly into the bay until the tray handle engages the lock mechanism. 4. Push the tray handle in until it locks. The lit LED on the disk drive will turn on. 5. Log in as root user. 6. At the command line, type smitty. 7. Select Devices. 8. Select Install/Configure Devices Added After IPL and press Enter. Successful configuration is indicated by the OK message displayed next to the Command field at the top of the screen. 9. Press F10 to exit smitty. 10. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Disk Drive Access Cover Replacement (286/E4)” on page 373. 11. On a 286/C4, close the rack doors. On a 286/E4, close the bezel door. 408 Service Guide PCI Riser Card Note: To perform this procedure requires a considerable amount of force when removing or installing the riser card from its docking connectors. If you are servicing a 286/C4, it is strongly recommended that the system drawer be removed from the rack. If you are servicing a 286/E4, it is strongly recommended that the system be placed on its side, on a flat and stable surface. To avoid potential breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from resting on any surface. Before performing the following procedure, read “Safety Notices” on page xi. PCI Riser Card Removal To remove the PCI riser card, do the following: 1. If you are servicing a 286/E4, open the bezel door. If you are servicing a 286/C4 rack-mounted drawer, open the front and rear rack doors. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 4. Label and disconnect all adapter cables, located at the rear of the system unit. 5. If you are servicing a 286/C4, place the system drawer into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 6. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 7. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay 2. Then remove the media drives as described in “Media Removal” on page 401. 8. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 9. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of the shelf, sliding it back, then lifting it up and out of the system chassis. Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch. The microswitch is located on the CEC backplane and when deactivated, cuts off power to many of the components and connectors located on the CEC backplane. This support shelf must be correctly seated and secured in position after any new FRU is added to the system and you perform a system boot. 10. Record the slot number and location of each adapter. 11. Remove the processor assembly #2 as described in “Processor Card Removal” on page 423. If your system is equipped with a processor filler panel instead of a second processor card assembly, remove the processor filler panel. 12. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 13. Remove the PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 381. 14. Remove all power, signal, and SCSI cables from the PCI riser card. Chapter 9. Removal and Replacement Procedures 409 15. Remove all screws that secure the PCI riser card to the CEC backplane and chassis. 1 PCI Riser Card (The six slotted riser card shown above. The four slotted riser card mounts the same.) 2 Removal Sling 3 CEC Backplane 4 CEC Backplane Connectors to PCI Riser Card Connectors 5 ²Push² Labels 16. Grasp each end of the removal sling located toward the back of the system chassis. Slowly lift up the back end of the PCI riser card. You may need to rock the sling back and forth several times to aid in the removal of the PCI riser card connectors from their docking connectors located on the CEC backplane. As the connectors begin to separate, grasp the front end of the PCI riser card and pull it up and out of the system. Put the PCI riser card in a safe place. Notes: a. The PCI riser card may require up to 60 pounds of pulling force to free it from the docking connectors. Depending on the system drawer’s location in the rack, it is strongly recommended that the system drawer be removed from the rack to perform the PCI riser card removal and replacement procedures. b. When placing the 286/E4 on its side, ensure that the stabilizer foot closest to the desktop or floor surface is free from resting on any surface. 410 Service Guide 286/C4 1 Service Access Cover 6 Media Support Tray 2 PCI Adapters 7 Front Bezel 3 PCI Riser Card 8 IDE CD-ROM 4 286/C4 9 Processor Card Assembly #2 5 Media Device 10 Processor Filler Panel Chapter 9. Removal and Replacement Procedures 411 286/E4 1 Outer Side Cover 8 Media Device 2 Service Access Cover 9 IDE CD-ROM 3 PCI Adapters 10 Front Bezel 4 PCI Riser Card 11 Bezel Door 5 Back Cover 12 System Stabilizer Foot 6 286/E4 13 Processor Card Assembly #2 7 Media Device Support Tray 14 Processor Filler Panel 412 Service Guide PCI Riser Card Replacement To replace the PCI riser card, do the following: 1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 2. If necessary, remove the PCI riser card from the antistatic package. 3. Carefully grasp the PCI riser card along two edges, and align the four connectors located on the bottom side of the PCI riser card with the four standoff connectors located on the CEC backplane. 4. Ensure you have proper alignment of the PCI riser card connectors. Seat the PCI riser card into the CEC backplane connectors. a. To perform this procedure requires a considerable amount of force when removing or installing the PCI riser card from its docking connectors. If you are servicing a 286/C4, it is strongly recommended that the system drawer be removed from the rack. If you are servicing a 286/E4, it is strongly recommended that the system be placed on its side, on a flat and stable surface. To avoid potential breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from resting on any surface. 1 PCI Riser Card 3 CEC Backplane Connectors to PCI Riser Card Connectors 2 CEC Backplane 4 ²Push² Labels b. To ensure the PCI riser card docking connectors are fully seated, using your thumbs, alternately push down on the two ²push² labels that are mounted on the PCI riser card. Push until no more movement is felt and then push down hard again on both ²push² labels. This alternate pushing technique is allowed only for this card. Note: Mounted below the CEC backplane is an aluminum plate. The plate aids in stiffening the CEC backplane and keeps it from flexing or cracking when the PCI riser card is inserted. c. If you experience any problem when you boot the system, repeat substep 4, but the second time, you must push harder. Chapter 9. Removal and Replacement Procedures 413 5. Replace the screws that secure the PCI riser card to the CEC backplane and chassis. Note: For proper alignment, insert all screws before tightening. 6. Reconnect the power, signal, and SCSI cables. 7. Replace all adapters to their proper locations as described in “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 383. 8. Reconnect all of the labeled PCI adapter cables to their respective PCI adapters. 9. Replace the processor #2 assembly as described in “Adding or Replacing a Processor Card” on page 426. If you removed a processor filler panel, replace the processor filler panel now. 10. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf. Note: When replacing the support shelf, ensure that the lower support leg, located on the bottom side of the support shelf, activates the microswitch located on the CEC backplane. Failure to do so results in the system not being able to power on. 11. Replace all removed media drives, ensuring that the media drives removed are returned to the same media bay locations from which they were removed. Refer to “Media Replacement” on page 404. 12. Reconnect all power and signal cables to their respective media drives. 13. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 14. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 15. Reconnect the power cables. 16. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 17. If you are servicing a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 18. Power on the system as described in “Starting the System without an HMC Attached” on page 360. Note: If you experience any problem when you boot the system, repeat substep 4, but the second time, you must push harder. 19. If you are servicing a 286/C4, close the rack doors. If you are servicing a 286/E4, close the bezel door. 414 Service Guide Disk Drive Backplane To perform these procedures requires you to remove and install the PCI riser card. Removing or Installing a PCI riser card from its docking connectors, requires a considerable amount of force. If you are servicing a 286/C4, it is strongly recommended that the system drawer be removed from the rack. If you are servicing a 286/E4, it is strongly recommended that the system be placed on its side, on a flat and stable surface. To avoid potential breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from resting on any surface. Before performing the following procedure, read “Safety Notices” on page xi. Disk Drive Backplane Removal To remove the disk drive backplane, do the following: 1. If you are servicing a 286/E4, open the bezel door. If you are servicing a 286/C4 rack-mounted drawer, open the front and rear rack doors. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 4. Label and disconnect all adapter cables, located at the rear of the system unit. 5. If you are servicing a 286/C4, place the system drawer into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. 6. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 7. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 8. Disconnect and label all cables from the media drives, located in media bay 1 and media bay 2, then remove the media drives as described in “Media Removal” on page 401. Note the location of each removed media drive. 9. Remove the media device support shelf by loosening the captive thumbscrew. Grasp the back edge of the media device support shelf, slide the media device support shelf back, while lifting it up and out of the system chassis. 10. Remove the processor assembly #2 as described in “Processor Card Removal” on page 423. If your system is equipped with a processor filler panel instead of a second processor card assembly, remove the processor filler panel. 11. Record the slot number and location of each adapter. 12. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 13. Remove the PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 381. 14. Remove the PCI riser card as described in “PCI Riser Card Removal” on page 409. 15. Remove all hot-plug disk drives as described in “Hot-Plug Disk Drive Removal” on page 406. 16. Remove the PCI adapter cooling fans as described in “Hot-Swap Fan Assembly Removal” on page 397. 17. Remove the PCI adapter cooling fan stabilizing cage by loosening the retaining screw located along the bottom back edge of the stabilizing cage. Chapter 9. Removal and Replacement Procedures 415 18. Disconnect all power and signal cables connected to the disk drive backplane. 19. Remove the four retaining screws that secure the disk drive cage to the front of the system chassis. 20. Slide the disk drive cage back until it clears the upper chassis ledge. 21. Pull the disk drive cage assembly out of the system. 22. Remove the six retaining screws attached to the disk drive backplane. 286/C4 1 Service Access Cover (drawer) 9 286/C4 Chassis 2 PCI Adapters 10 Disk Drive (4 Pack) 3 PCI Riser Card 11 Media Device 4 PCI Cooling Fan 12 Front Bezel 5 Cable-Management Bracket 13 IDE CD-ROM Media 6 PCI Cooling Fan 14 Media Device Support Shelf 7 Disk Drive Backplane 15 Processor Assembly #2 8 Disk Drive Cage 16 Processor Filler 416 Service Guide 286/E4 1 Outer Side Cover 11 Disk Drive (4 Pack) 2 Service Access Cover 12 Media Device Support Shelf 3 PCI Adapters 13 Media Device 4 PCI Cooling Fan 14 Front Bezel 5 Cable-Management Bracket 15 Bezel Door 6 Back Cover 16 IDE CD-ROM Media 7 PCI Riser Card 17 System Stabilizer Foot 8 286/E4 18 Processor Assembly #2 9 Disk Drive Backplane 19 Processor Filler 10 Disk Drive Cage Chapter 9. Removal and Replacement Procedures 417 Disk Drive Backplane Replacement To replace the disk drive backplane, do the following: 1. Using the six supplied retaining screws, secure the backplane to the disk drive cage. 2. Carefully place the disk drive cage assembly into the system. 3. Slide the disk drive cage assembly toward the front of the system. 4. Ensure the six screw holes located on the cage assembly correctly align with the mating screw holes located on the system chassis. 5. Insert and finger-tighten the six retaining screws. After all screws have been started, finish tightening them. 6. Reconnect all power and signal cables to the disk drive backplane. 7. Replace the PCI adapter cooling fan stabilizing cage by installing the two tabs located on the bottom edge of the cage into their mating slots located on the chassis. Secure the cage to the chassis with a retaining screw. 8. Replace the PCI adapter cooling fans as described in “Hot-Swap Fan Assembly Replacement” on page 399. 9. Replace all hot-plug disk drives as described in “Hot-Plug Disk Drive Replacement” on page 408. 10. Replace the media device support shelf. 11. Replace all previously removed media drives, ensuring that the media drives are returned to the same media bay from which they were removed. Refer to “Media Replacement” on page 404, if necessary. 12. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 13. Replace the PCI riser card as described in “PCI Riser Card Replacement” on page 413. 14. Replace the PCI adapters as described in “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 383. 15. Reconnect all adapter cables, located at the rear of the system unit. 16. Reconnect the power source to the system. 17. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 18. Power on the system as described in “Starting the System without an HMC Attached” on page 360. 19. Replace the service access cover as described in “Covers” on page 366. 20. If you are servicing a 286/C4, place the system drawer into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 21. If you are servicing a 286/C4, close the front and rear rack doors. 22. If you are servicing a 286/E4, close the bezel door. 418 Service Guide Operator Panel Before performing the following procedure, read “Safety Notices” on page xi. Operator Panel Removal To remove the operator panel, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 4. If necessary, on a 286/E4, remove the bezel door as described in “Bezel Door Removal” on page 376. 5. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 6. If you are servicing a 286/E4, remove the top cap as described in “Top Cap Removal (286/E4)” on page 367. 7. If you are servicing a 286/C4, remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366. 8. Locate the operator panel, as shown in the following illustrations. 9. Remove the two retaining screws, located along the bottom edge of the operator panel. 10. Disconnect the power / signal cable from the back of the operator panel. 11. Remove the operator panel by grasping the edges of the panel and pulling it out of its bay. 286/C4 1 Service Access Cover (drawer) 4 Operator Panel Retaining Screws 2 System Unit 5 Operator Panel 3 Front Bezel Chapter 9. Removal and Replacement Procedures 419 286/E4 1 Top Cap 4 Front Bezel 2 Operator Panel 5 Bezel Door 3 Operator Panel Retaining Screws 6 286/E4 Operator Panel Replacement Attention: If you are replacing the operator panel with a new FRU, you must write the system VPD information (machine type, model number, and serial number) into the VPD module. You must contact the support center to obtain instructions and the necessary password to enter the information into the VPD module. Note: Refer to “System Vital Product Data (VPD) Update Procedure” on page 421 for the correct procedure. To replace the operator panel, do the following: 1. Carefully slide the operator panel back into the system bay. 2. Connect the signal and power cable to the connector on the new operator panel. 3. Secure the operator panel with the two retaining screws previously removed. 4. Replace the service access cover, as described in “Service Access Cover Replacement (286/C4)” on page 366. 5. If you are servicing a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 6. Reconnect all system power cables. 7. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 8. If necessary replace the bezel door as described in “Bezel Door Replacement” on page 376. 9. If you are servicing a 286/E4, replace the top cap as described in “Top Cap Replacement (286/E4)” on page 367. 10. Power on the system as described in “Starting the System without an HMC Attached” on page 360. 11. If the 286/C4 has successfully powered on, close the rack doors. If the 286/E4 has successfully powered on, close the bezel door. 420 Service Guide System Vital Product Data (VPD) Update Procedure If you replaced the operator panel with a new FRU, perform the following procedure to ensure that the VPD is correct. 1. After you install the new operator panel, wait for the OK message to display in the operator panel display. 2. Start the Service Processor Firmware menu. 1. Service Processor Setup 2. System Power Control Menu 3. System Information Menu 4. Language Selection 5. Call In/Call Out Setup 6. Set System Name 99. Exit from Menu Main Menu Service Processor Firmware Firmware Level: xxxxxxxx Copyright 1999, IBM Corporation 1> Command Prompt 3. At the command prompt, type the code that accesses the hidden menus. If necessary, call your local support center to obtain the code. This menu is for IBM Authorized use only. If you have not been authorized to use this menu, please discontinue use immediately. Press Return to continue, or X to return to menu 1. 4. Press Return to continue. VPD Serial Number is not programmed. Enter the VPD Serial Number (7 ASCII digits): xxxxxxx Chapter 9. Removal and Replacement Procedures 421 5. Type the VPD serial number. Note: The serial number must be entered correctly. Enter the last seven digits only. Do not include the dash (-) in the serial number as a digit. If the serial number is not entered correctly, a new operator panel must be ordered and installed. VPD Serial Number has been programmed successfully. The current TM field is: xxxx-xxx Do you want to change the TM field (y/n)? 6. Type y (yes) if the system unit’s type/model (TM) you are working on is different from the one listed on the screen. 7. Type the machine type and model number of the system unit. Enter the TM data (8 ASCII digits): xxxx-xxx TM has been programmed successfully The current MN field is 1980 Do you want to change the MN field (y/n)? 8. The MN field is for manufacturing use only. Enter n (no) in this field. 9. Enter 99 at the command line of the Main Menu to exit. 422 Service Guide Processor Card Before performing the following procedure, read “Safety Notices” on page xi. Processor Card Removal To remove the processor card, do the following: 1. On a 286/C4, open the front and rear rack doors and put the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 5. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay 2. Then remove the media drives as described in “Media Removal” on page 401. 6. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of the shelf, sliding it back, then lifting it up and out of the system chassis. Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch. The microswitch is located on the CEC backplane and when deactivated, cuts off power to many of the components and connectors located on the CEC backplane. This support shelf must be correctly seated and secured in position after any new FRU is added to the system and you perform a system boot. 7. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 8. Grasp the ends of both processor assembly release handles. Lift each handle until it is perpendicular (90 degrees) to the top of the processor assembly. By placing the handle perpendicular to the top of the processor assembly, the base or hinged portion of each handle acts as a cam and will gently pry the processor card connector from the docking connector located on the CEC backplane. 9. Remove the processor assembly from the system unit. Attention: To prevent damage to the card and the card connectors, open or close the retention latches at the same time. 10. Lay the processor assembly, cover-side up, on a flat, static-protective surface. Chapter 9. Removal and Replacement Procedures 423 11. From the bottom of the processor assembly, lift the side access cover. Four hinges hold the cover to the top of the processor assembly. Pivot the cover up to reveal the memory DIMMs or DIMM connectors. 12. Remove all memory DIMMs as described in “Memory DIMM Removal” on page 427. Put the DIMMs in a safe place (you will need to install these DIMMs onto the new processor card). 286/C4 1 Service Access Cover 7 286/C4 2 Processor Access Cover 8 Optional Media Device (for example: Diskette Drive or Tape Drive) 3 Processor Assembly Lifting and Release Handles 9 IDE CD-ROM Drive 4 Processor Assembly 10 Front Bezel 5 Processor Card 11 Media Device Support Shelf 6 Memory DIMMs 12 Processor Filler 424 Service Guide 286/E4 1 286/E4 9 Service Access Cover 2 Media Device Support Shelf 10 Outer Side Cover 3 Media Carrier Trays 11 Processor Access Cover 4 Optional Media Device (for example: Diskette Drive or Tape Drive) 12 Processor Assembly Lifting and Release Handles 5 IDE CD-ROM Drive 13 Processor Card 6 Front Bezel 14 Processor Assembly 7 Bezel Door 15 Memory DIMMs 8 Processor Filler Chapter 9. Removal and Replacement Procedures 425 Adding or Replacing a Processor Card Notes: 1. Adding an additional processor card or installing a new processor card into your system may require the updating of the system firmware. 2. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 3. If your system is configured with one processor card, place the card into location U0.1-P1-C1 on the CEC backplane. The remaining processor connector must have the processor filler panel in place. The filler panel protects the connector located on the CEC backplane from dust or damage and assists in airflow and cooling. 4. If a second processor is to be added to your system, remove the protective filler panel from the system, and install the new processor assembly into the secondary processor bay location U0.1-P1-C2. To add or replace a processor card, do the following: 1. From the bottom of the processor assembly, lift the processor access cover. Four hinges hold the cover to the top of the processor assembly. Pivot the cover up to reveal the memory DIMMs or DIMM connectors. Install the memory DIMMs as described in “Memory DIMMs” on page 427. 2. Pivot the processor access cover back into the closed position. 3. Grasp the release handles located on top of the processor assembly. Pivot the handle upward to 90 degrees. This action puts the handle cams into the correct position to help seat the processor card connector into the docking connector located on the CEC backplane. Attention: To prevent damage to the processor assembly and the processor card connector, open or close both release handles at the same time. 4. Align the processor card connector with the docking connector located on the CEC backplane. 5. Carefully insert the processor assembly into the system. 6. As the processor connector is seated into its CEC backplane docking connector, begin lowering the processor assembly release handles. This action correctly seats and locks the processor assembly to the CEC backplane. 7. If necessary, replace the processor filler panel. 8. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf. Note: When replacing the media device support shelf, ensure that the lower support leg, located on the bottom side of the support shelf, activates the microswitch located on the CEC backplane. Failure to do so results in the system not being able to power on. 9. Replace all removed media drives, ensuring that the media drives removed are returned to the same media bay locations from which they were removed. Refer to “Media Replacement” on page 404. 10. Reconnect all power and signal cables to their respective media drives. 11. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 12. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 13. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 14. Power on the system as described in “Starting the System without an HMC Attached” on page 360. 15. If the 286/C4 has successfully powered on, close the rack doors. If the 286/E4 has successfully powered on, close the bezel door. 426 Service Guide Memory DIMMs Before performing the following procedures, read “Safety Notices” on page xi. Memory DIMM Removal To remove the memory DIMMs, do the following: 1. On a 286/C4, open the front and rear rack doors and put the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 5. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay 2. Remove the media drives as described in “Media Removal” on page 401. 6. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of the shelf, sliding it back, then lifting it up and out of the system chassis. Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch. The microswitch is located on the CEC backplane and when deactivated, cuts off power to many of the components and connectors located on the CEC backplane. This support shelf must be correctly seated and secured in position after any new FRU is added to the system and you perform a system boot. 7. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 8. Grasp the end of both processor assembly release handles. Lift each handle until it is perpendicular (90 degrees) to the top of the processor assembly. By placing the handle perpendicular to the top of the processor assembly, the base or hinged portion of each handle acts as a cam and will gently pry the processor card connector from the docking connector located on the CEC backplane. 9. Remove the processor assembly from the system unit. 10. Lay the processor assembly, cover-side up, on a flat, static-protective surface. Chapter 9. Removal and Replacement Procedures 427 11. From the bottom of the processor assembly, lift the side access cover. Four hinges hold the cover to the top of the processor assembly. Pivot the cover up to reveal the memory DIMMs or DIMM connectors. 286/C4 1 Service Access Cover 7 286/C4 2 Processor Access Cover 8 Optional Media Device (for example: Diskette Drive, Tape Drive) 3 Processor Assembly Lifting and Release Handle 9 IDE CD-ROM Drive 4 Processor Assembly 10 Front Bezel 5 Processor Card 11 Media Device Support Shelf 6 Memory DIMMs 428 Service Guide 286/E4 1 286/E4 9 Service Access Cover 2 Media Device Support Shelf 10 Outer Side Cover 3 Media Carrier Trays 11 Processor Access Cover 4 Optional Media Device (for example: Diskette Drive or Tape Drive) 12 Processor Assembly Lifting and Release Handles 5 IDE CD-ROM Drive 13 Processor Card 6 Front Bezel 14 Processor Assembly 7 Bezel Door 15 Memory DIMMs 8 Processor Filler Chapter 9. Removal and Replacement Procedures 429 12. Locate the memory DIMMs, and determine which DIMM you want to remove. Note: Memory DIMMs must be installed in quads (groups of 4) and in the correct slot. The only two valid slot combinations that you can use are 1, 3, 6, and 8 OR 2, 4, 5, and 7. See the following illustration. Number Description Location Code M1 Memory slot 1 (J2A) U0.1-P1-C1-M1 M2 Memory slot 2 (J2B) U0.1-P1-C1-M2 M3 Memory slot 3(J3A) U0.1-P1-C1-M3 M4 Memory slot 4(J3B) U0.1-P1-C1-M4 M5 Memory slot 5(J1B) U0.1-P1-C1-M5 M6 Memory slot 6(J1A) U0.1-P1-C1-M6 M7 Memory slot 7(J0B) U0.1-P1-C1-M7 M8 Memory slot 8(J0A) U0.1-P1-C1-M8 Note: The location code column indicates the location codes for processor card 1. If your system is configured for processor card 2, substitute C2 in place of C1. Attention: To prevent damage to the memory DIMM and the memory DIMM connectors, open or close the retention latches at the same time. 13. Remove the memory DIMM by pushing the tabs out and then down. The tabs’ camming action forces the memory DIMM out of the connector. 14. Pull the memory DIMM out of the connector as shown in the following illustration. 1 2 1 1 Connector Tabs (unlocked position) 2 Memory DIMM 430 Service Guide Memory DIMM Replacement To replace a memory DIMM, do the following: 1. Before handling any card, board, or memory DIMM, touch any metal surface of the chassis with one hand to minimize static electricity discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 2. If necessary, remove the memory DIMM from its antistatic package. 3. Ensure that the connector locking tabs are pushed out in the unlocked position before installing a new memory DIMM, as shown in the following illustration. 4. Carefully grasp the memory DIMM along two edges and align the connector. 5. Insert the memory card firmly into the connector. 6. Secure the memory DIMM with the locking tabs located at each end of the DIMM connector, as shown in the following illustration. Note: Memory DIMMs must be installed in groups of 4 (quads). 1 Locking Tabs (unlocked position) 2 Locking Tabs (locked position) 7. Pivot the processor access cover back into the closed position. 8. Grasp the release handles located on top of the processor assembly. Pivot the handle upward to 90 degrees. This action puts the handle cams into the correct position to help seat the processor card connector into the docking connector located on the CEC backplane. Attention: To prevent damage to the processor assembly and the processor card connector, open or close both release handles at the same time. 9. Align the processor card connector with the docking connector located on the CEC backplane. 10. Carefully insert the processor assembly into the system. 11. As the processor connector is seated into its CEC backplane docking connector, begin lowering the processor assembly release handles. This action correctly seats and locks the processor assembly to the CEC backplane. 12. If necessary, replace the processor filler panel. 13. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf. Note: When replacing the support shelf, ensure that the lower support leg activates the microswitch located on the CEC backplane. Failure to do so results in the system not being able to power on. 14. Replace all removed media drives, ensuring that the media drives removed are returned to the same media bay locations from which they were removed. Refer to “Media Replacement” on page 404. 15. Reconnect all power and signal cables to their respective media drives. 16. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 17. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 18. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 19. Power on the system as described in “Starting the System without an HMC Attached” on page 360. 20. If the 286/C4 has successfully powered on, close the rack doors. If the 286/E4 has successfully powered on, close the bezel door. Chapter 9. Removal and Replacement Procedures 431 Replacing the Battery Before performing the following procedure, read “Safety Notices” on page xi. CAUTION: A lithium battery can cause fire, explosion, or severe burn. Do not recharge, disassemble, heat above 100°C (212°F), solder directly to the cell, incinerate, or expose cell contents to water. Keep away from children. Replace only with the part number specified for your system. Use of another battery may present a risk of fire or explosion. The battery connector is polarized; do not attempt to reverse polarity. Dispose of the battery according to local regulations. To replace the system’s battery, do the following: 1. On a 286/C4, open the front and rear rack doors and put the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 4. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 5. Disconnect and label all power and signal cables from the media drives, located in media bay 1 and media bay 2. Then remove the media drives as described in “Media Removal” on page 401. Note the location of each media drive removed. 6. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of the shelf, sliding it back, then lifting it up and out of the system chassis. Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch. The microswitch is located on the CEC backplane and when deactivated, cuts off power to many of the components and connectors located on the CEC backplane. This support shelf must be correctly seated and secured in position after any new FRU is added to the system and you perform a system boot. 432 Service Guide 7. Remove processor assembly 1 as described in “Processor Card Removal” on page 423. Note: Processor assembly 1 is the processor assembly that is closest to the power supplies. See the following illustration. 286/C4 1 Service Access Cover 6 Front Bezel 2 Processor Assembly 1 7 Battery (positive (+) side facing up) 3 286/C4 8 Media Device Support Shelf 4 Media Device (for example: Tape Drive or Diskette Drive) 9 Captive Thumbscrew 5 IDE CD-ROM Drive Chapter 9. Removal and Replacement Procedures 433 286/E4 1 286/E4 7 Media Device Support Shelf 2 Media Carrier Trays 8 Captive Thumbscrew 3 Media Device (for example: Tape Drive or Diskette Drive) 9 Battery (positive (+) side facing up) 4 IDE CD-ROM Drive 10 Service Access Cover 5 Front Bezel 11 Outer Side Cover 6 Bezel Door 12 Processor Assembly 434 Service Guide 8. Locate the battery on the CEC backplane as shown in the following illustration. 9. Use your fingernail to pry the battery out of its plastic mount. After the bottom of the battery has cleared the top edge of the plastic mount, pull it up and away from the CEC backplane. Note: The battery’s plastic mount connections are soldered to the CEC backplane. If the battery mount is damaged or removed, permanent damage can result to the CEC backplane. 1 CEC backplane 2 Battery 3 Battery Plastic Mount 10. Replace the battery with (FRU) part number 00P3903 or equivalent. 11. When installing the new battery, ensure that the battery polarity is correct. For this system, the positive (+) side of the battery should be facing up in its plastic mount. 12. Gently insert the new battery into its plastic mount. 13. Replace processor assembly 1 as described in “Adding or Replacing a Processor Card” on page 426. 14. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf. Note: When replacing the support shelf, ensure that the lower support leg activates the microswitch located on the CEC backplane. Failure to do so results in the system not being able to power on. 15. Replace all media drives, ensuring that the media drives removed are returned to the same media bay from which they were removed as described in “Media Replacement” on page 404. 16. Reconnect all power and signal cables to their respective media drives. 17. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 18. Replace the service access cover, as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 19. If you are servicing a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 20. Turn on the system power. 21. If you are running AIX, the bootlist command may need to be run while logged in as root user. This action rebuilds the customized boot list. IP (Internet protocol) parameters may also need to be reentered. Chapter 9. Removal and Replacement Procedures 435 Power Supplies DANGER Do not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit. D02 This system drawer has power supplies that have hot-swap capabilities. To use this feature, you must have two power supplies in place. Before performing any of the following procedures, read and understand all of the safety notices beginning with “Safety Notices” on page xi. Non-Hot-Swap Power Supply Removal If you are servicing a system that has only one power supply installed, you must follow the non-hot-swap power supply removal procedure. Systems that were configured with one power supply at the factory have a power supply filler panel. The filler panel protects the vacant power supply connector from dust or damage. To remove a non-hot-swap power supply, do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 4. Remove the power cable from the rear of the power supply. 5. Wait at least 10 seconds before performing the next step. 6. Grasp the ends of both power supply handles, and slide the retractable spring-activated portion of each handle toward its hinged base. This action releases the locking tab located on the bottom side of each release handle. See the following illustration. 1 Power Supply 2 Locking Tab Receiver Hole 3 Retractable Spring-Activated Portion of the Handle 7. Lift the handle until it is perpendicular (90 degrees) to the top of the power supply. By placing the handle perpendicular to the top of the power supply, the base or hinged portion of each handle acts as a cam and will gently pry the power supply from its connector located on the CEC backplane. 436 Service Guide 1a Outer Side Cover (286/E4 Only) 4 Power Supply Filler Panel 1 Service Access Cover 5a 286/C4 2 Power Supply #1 (Primary) 5b 286/E4 3 Power Supply Release Handle 8. After the power supply is released from its connector, pull the power supply out of the system. Note: During normal operation, each power-supply bay must have either a power supply or filler panel installed for proper cooling. Chapter 9. Removal and Replacement Procedures 437 Non-Hot-Swap Power Supply Replacement To replace a non-hot-swap power supply, do the following: 1. Grasp the handles located on top of the new power supply. Pivot the handles upward to 90 degrees. This action puts the handle cams into the correct position to help seat the power supply into its connector. 2. Carefully insert the new power supply into the bay originally used by the faulty power supply. 3. Lower the power supply release handles, carefully pressing the power supply into the connector. The spring latch located on the underside of each handle will snap into place, indicating that the power supply is seated correctly and locked into position. Note: Improper operation of the spring-activated portion of the power supply handles can result in the following: v The retractable spring remains compressed - This situation keeps the handle from locking the power supply in its seated position. To eliminate this problem, lower the handle into the locked position, and push the spring activated handle end toward the locking tab. This action forces the spring latch to mat with the locking tab in the spring-latch receiver hole. v The locking tab receiver is bent - Before lowering the handle to lock the power supply into the seated position, ensure that the locking tab located on top of the power supply in the receiver hole is not bent. Before the spring latch will seat onto the locking tab, the locking tab, if bent, must be straightened. 1 Spring Latch 4 Power Supply 2 Spring-Activated Handle End 5 Spring-Latch Receiver Hole 3 Power Supply Release Handle 6 Locking Tab 4. Plug the power cable into the back of the power supply. 5. Reconnect the power source to the system. 6. The green AC Good LED located on the power supply comes on solid. 7. Replace the service access cover as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. If you are servicing a 286/E4, go to step 10. If you are servicing a 286/C4, go to step 8. 8. Push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 9. Route the cables through the cable-management arm. 10. Power on the system drawer as described in “Starting the System without an HMC Attached” on page 360. 11. After the 286/C4 has successfully powered on, close the rack doors. After the 286/E4 has successfully powered on, close the bezel door. 438 Service Guide Hot-Swap Power Supply Removal Before performing this procedure, read the following “Safety Notices” on page xi. If you are servicing a system that has two power supplies installed, you can replace each power supply (one at a time) without shutting down the system. To remove a hot-swap power supply, do the following: Note: You do not need to turn off the power to the system to install hot-swap power supplies. 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Visually check the power supply LEDs. If the amber LED is lit, suspect that power supply to be faulty. 3. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 4. Remove the power cable from the rear of the faulty power supply. 5. Wait at least 10 seconds before performing the next step. 6. Grasp the ends of both power supply handles, and slide the retractable spring-activated portion of each handle toward its hinged base. This action releases the locking tab located on the bottom side of each release handle. See the following illustration. 1 Power Supply 2 Locking Tab Receiver Hole 3 Retractable Spring-Activated Portion of the Handle 7. Lift the handle until it is perpendicular (90 degrees) to the top of the power supply. By placing the handle perpendicular to the top of the power supply, the base or hinged portion of each handle acts as a cam and will gently pry the power supply from its connector located on the CEC backplane. 8. After the power supply is released from its connector, pull the power supply out of the system. 9. If you are not replacing the power supply, do the following: a. Install a power-supply filler panel. Note: During normal operation, each power-supply bay must have either a power supply or filler panel installed for proper cooling. b. Disconnect the removed power supply’s power cable from either the power distribution panel or electrical outlet. 10. Verify that the green DC Good LED on the remaining power supply is lit. The green LED indicates that the power supply is operating correctly. 11. Replace the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 12. On a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 13. On a 286/C4, close the rack doors. On a 286/E4, close the bezel door. Chapter 9. Removal and Replacement Procedures 439 1a Outer Side Cover (286/E4 Only) 4 Power Supply Filler Panel 1 Service Access Cover 5a 286/C4 2 Power Supply #1 (Primary) 5b 286/E4 3 Power Supply Release Handle 440 Service Guide Hot-Swap Power Supply Replacement Note: This note applies only to systems running the AIX operating system. If a power supply is being replaced for a redundant failure, after the service repair action is completed, ask the customer to check the crontab file for any power/cooling warning messages. When a power or cooling error is encountered, AIX adds an entry to the crontab file to ²wall² a warning message every 12 hours, to alert or remind the customer of the problem. Replacing the faulty part does not clear this crontab entry, so unless the crontab file is edited to remove this entry, the customer continues to be reminded of the failure despite its having been repaired. Use the crontab -l command to read the crontab file to determine if an entry exists. Use the crontab -e command to edit the file. To replace a hot-swap power supply, do the following: 1. If you are adding a new power supply to a system that was configured with only one power supply installed, remove the power-supply filler panel from power supply #2 bay. 2. Grasp the handle located on top of the new power supply. Pivot the handle upward to 90 degrees. This action puts the handle cams into the correct position to help seat the power supply into its connector. 3. Carefully insert the power supply into the same power supply bay from which the faulty power supply was just removed. 4. Lower the power supply handles, carefully pressing the power supply into the connector. The spring latch located on the underside of each handle will snap into place, indicating that the power supply is seated correctly and locked into position. Note: Improper operation of the spring-activated portion of the power supply handles can result in the following: v The retractable spring remains compressed - This situation keeps the handle from locking the power supply in its seated position. To eliminate this problem, lower the handle into the locked position, and push the spring activated handle end toward the locking tab. This action forces the spring latch to mat with the locking tab in the spring-latch receiver hole. v The locking tab receiver is bent - Before lowering the handle to lock the power supply into the seated position, ensure that the locking tab located on top of the power supply in the receiver hole is not bent. Before the spring latch will seat onto the locking tab, the locking tab, if bent, must be straightened. Chapter 9. Removal and Replacement Procedures 441 1 Spring Latch 4 Power Supply 2 Spring-Activated Handle End 5 Spring-Latch Receiver Hole 3 Power Supply Release Handle 6 Locking Tab 5. Plug the power cable for the added power supply into the power cable connector located on the back of the power supply. 6. Reconnect the power source to the system. The green AC Good LED located on the power supply should come on solid. 7. Verify that the green DC Good LED on the power supply is lit. The green LED indicates that the power supply is operating correctly. 8. Replace the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 9. If you are servicing a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 10. On a 286/C4, close the rack doors. On a 286/E4, close the bezel door. 442 Service Guide Central Electronics Complex (CEC) Backplane Before performing the following procedure, read the “Safety Notices” on page xi. To perform this procedure, you must remove and install the PCI riser card. To remove or install the PCI riser card from its docking connectors, requires a considerable amount of force. If you are servicing a 286/C4, it is strongly recommended that the system drawer be removed from the rack. If you are servicing a 286/E4, it is strongly recommended that the system be placed on its side, on a flat and stable surface. To avoid potential breakage, the 286/E4 stabilizer foot closest to the desktop or floor is free from resting on any surface. CEC Backplane Removal Attention: Before you remove or disconnect any components, record their location. Before removing the CEC backplane from the system for servicing or replacement, record the settings of any jumpers located on the CEC backplane. To remove the CEC backplane (system board), do the following: 1. On a 286/C4, open the front and rear rack doors and place the system into the service position as described in “Placing the 286/C4 into the Service Position” on page 364. On a 286/E4, open the bezel door. 2. Shut down the system as described in either “Stopping the System without an HMC Attached” on page 359 or “Stopping the System with an HMC Attached and AIX Installed” on page 359. 3. Disconnect the power source from the system. Note: This system may be equipped with a second power supply. Before continuing with this procedure, ensure that the power source to the system has been completely disconnected. 4. Label and disconnect all other cables located at the rear of the system unit. 5. Remove the front bezel as described in “Front Bezel Removal (286/C4)” on page 377 or “Front Bezel Removal (286/E4)” on page 378. 6. Remove the service access cover as described in “Service Access Cover Removal (286/C4)” on page 366 or “Service Access Cover Removal (286/E4)” on page 371. 7. Record the slot number and location of each adapter being removed. 8. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 9. Remove the PCI adapters as described in “Non-Hot-Pluggable PCI Adapter Removal” on page 381. 10. Disconnect all power and signal cables from the media drives, located in media bay 1 and media bay 2. Then remove the media drives as described in “Media Removal” on page 401. 11. Remove the media device support shelf by loosening the thumbscrew and grasping the back edge of the shelf, sliding it back, then lifting it up and out of the system chassis. Note: The support leg located on the bottom of the support shelf is used to deactivate a microswitch. The microswitch is located on the CEC backplane and when deactivated, cuts off power to many of the components and connectors located on the CEC backplane. This support shelf must be correctly seated and secured in position after any new FRU is added to the system and you perform a system boot. 12. Remove all processor cards or the processor filler panel as described in “Processor Card Removal” on page 423. Chapter 9. Removal and Replacement Procedures 443 13. Remove all power supplies or the power supply filler panel as described in “Non-Hot-Swap Power Supply Removal” on page 436 or “Hot-Swap Power Supply Removal” on page 439. 14. Remove the PCI riser card as described in “PCI Riser Card Removal” on page 409. 15. Remove the PCI cooling fans as described in “Hot-Swap Fan Assembly Removal” on page 397. 16. Remove the PCI cooling fan’s stabilizing bracket. Note: The bracket is secured to the CEC backplane with one hex-head screw and two alignment tabs. 17. Remove and label all remaining power, signal, and SCSI cables. 18. Remove the processor cooling fans as described in “Hot-Swap Fan Assembly Removal” on page 397. 19. Remove the processor cooling fan cage assembly by removing the two hex-head screws located at the base of the cage and the eight hex-head screws that screw in from the back of the system chassis. 20. Remove the 16 threaded standoffs that secure the rear connectors located on the CEC backplane to the back of the system chassis. 21. Remove the power supply and processor assembly front support bracket. Note: The bracket is secured to the system chassis with four hex-head screws. 22. Remove the plastic membrane that covers the CEC backplane. 23. Remove the CEC backplane retaining screws. Note: For the locations of the retaining screws, see the following CEC backplane illustration. Note: Use care when removing the CEC backplane. Standoffs attached to the chassis base may damage the components attached to the bottom of the CEC backplane. 444 Service Guide 24. Slide the CEC backplane toward the front of the system (about 7 mm). A strip of sponge rubber attached to the back (bottom) of the CEC backplane will compress against an aluminum plate that is secured to the system chassis, preventing the CEC backplane from traveling forward any farther. This action allows the connectors located on the back of the CEC backplane to have enough room to clear the rear chassis bulkhead. 1 CEC Backplane 2 286/EC or 286/E4 25. To remove the CEC from the chassis, lift the front edge of the CEC backplane and pull it out and away from the system chassis. 26. Put the CEC backplane in a safe place. Chapter 9. Removal and Replacement Procedures 445 1 Service Access Cover (drawer) 11 Front Bezel 2 Primary and Redundant PCI Cooling Fans 12 IDE CD-ROM Drive 3 PCI Cooling Fan and Cable Management Bracket 13 Operator Panel 4 Processor Cooling Fans 14 Front Power Supply and Processor Support Bracket 5 Processor Cooling Fan Cage 15 Media Device Support Shelf 6 PCI Riser Card 16 Processor Assemblies 1 and 2 7 PCI Adapters 16a Processor Filler Panel 8 CEC Backplane 17 Power Supply #1 (Primary) Power Supply #2 (Redundant) 9 286/C4 Chassis 17a Power Supply Filler Panel 10 Optional Media Device (for example diskette drive or tape drive) 18 Plastic Membrane 446 Service Guide 1 Outer Side cover 13 Optional Media Device (for example diskette drive or tape drive) 2 Service Access Cover 14 IDE CD-ROM Drive 3 PCI Cooling Fans 15 Bezel 4 PCI Cooling Fan and Cable Management Bracket 16 Bezel Door 5 Processor Cooling Fans 17 System Front Stabilizer Foot 6 Processor Cooling Fan Cage 18 Front Power Supply and Processor Support Bracket 7 PCI Riser Card 19 Power Supply #1 (Primary) Power Supply #2 (Redundant) 8 PCI Adapters 19a Power Supply Filler Panel 9 Back Cover 20 Processor Assemblies 1 and 2 10 CEC Backplane 20a Processor Filler Panel 11 286/E4 21 Plastic Membrane 12 Media Device Support Shelf Chapter 9. Removal and Replacement Procedures 447 CEC Backplane Replacement To replace the CEC backplane (system board), do the following: 1. Before handling any card, board, or memory DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling Static-Sensitive Devices” on page 359. 2. If necessary, remove the CEC backplane from the antistatic package. 3. Carefully grasp the CEC backplane along two edges, and align the retaining screw holes with its mating screw holes located on the system chassis. Note: Use care when replacing the CEC backplane. Standoffs attached to the chassis base may damage the components attached to the bottom of the CEC backplane. 4. Replace the CEC backplane and slide it toward the back of the system. Ensure that all of the back connectors located on the CEC backplane are positioned correctly through the rear chassis bulkhead. 5. Replace the retaining screws that secure the CEC backplane to the system chassis. Note: For proper alignment, insert all screws and finger-tighten only. Ensure the CEC backplane is not binding. 6. Tighten all CEC backplane retaining screws, including the 16 threaded standoffs that secure each connector to the back of the system chassis. 7. Reconnect all power, signal, and SCSI cables that attach to the CEC backplane. 8. Replace the plastic membrane over the CEC backplane. 9. Replace the front power supply and processor support bracket. 10. Replace the processor cooling fan cage assembly. 11. Replace the eight screws that secure the processor cooling fan cage assembly to the rear chassis. 12. Replace the 16 threaded standoffs that secure the CEC backplane rear connectors to the rear chassis. 13. Replace the processor cooling fans as described in “Hot-Swap Fan Assembly Replacement” on page 399. 14. Replace the PCI adapter cooling fan stabilizing cage by installing the two tabs located on the bottom edge of the cage into their mating slots located on the chassis. Secure the cage to the chassis with a retaining screw. 15. Replace the PCI cooling fans as described in “Hot-Swap Fan Assembly Replacement” on page 399. 16. Replace the PCI riser card as described in “PCI Riser Card Replacement” on page 413. 17. Replace all PCI Adapters as described in “Adding or Replacing a Non-Hot-Pluggable PCI Adapter” on page 383. 18. Replace the processor assembly and, if applicable, the processor assembly filler panel. 19. Replace the media device support shelf. Tighten the thumbscrew located on the media device support shelf. Note: When replacing the support shelf, ensure that the lower support leg activates the microswitch located on the CEC backplane. Failure to do so results in the system not being able to power on. 20. Replace all removed media drives, ensuring that the media drives removed are returned to the same media bay from which they were removed. Refer to “Media Replacement” on page 404, if necessary. 21. Reconnect all power and signal cables to their respective media drives. 22. Replace the power supplies and, if applicable, the power supply filler panel as described in “Non-Hot-Swap Power Supply Replacement” on page 438 or “Hot-Swap Power Supply Replacement” on page 441. 448 Service Guide 23. Replace the front bezel as described in “Front Bezel Replacement (286/C4)” on page 377 or “Front Bezel Replacement (286/E4)” on page 378. 24. Replace the service access cover, as described in “Service Access Cover Replacement (286/C4)” on page 366 or “Service Access Cover Replacement (286/E4)” on page 372. 25. If you are servicing a 286/C4, push the system drawer back into the operating position as described in “Returning the 286/C4 to the Operating Position” on page 365. 26. Reconnect all system power cables. 27. The service processor settings should have been set to their default values. If the service processor settings have not been set to their default values, the default values must be restored. To restore the default values settings, do one of the following: v Manually reset all settings. v If the service processor settings had previously been saved, The settings may be restored using the procedure in “Saving and Restoring Service Processor Settings” on page 306. 28. Check the firmware level that is on the new CEC backplane. Look at the top of the service processor main menu. If the firmware is not at the correct level, refer to “System Firmware Updates” on page 332. 29. Power on the system as described in “Starting the System without an HMC Attached” on page 360. 30. Reset the time and date. 31. If you are running AIX, the bootlist command may need to be run while logged in as root user. This rebuilds the customized boot list. Internet protocol (IP) parameters may also need to be reentered. 32. If you are servicing a 286/C4 and the system has successfully powered on, close the rack doors. If you are servicing a 286/E4 and the system has successfully powered on, close the bezel door. 33. Check the firmware level on the new backplane; see “Determining the Level of Firmware on the System” on page 334. If the firmware is back-level, download the latest firmware from CORE or the Web. (See “System Firmware Updates” on page 332.) Chapter 9. Removal and Replacement Procedures 449 450 Service Guide Chapter 10. Parts Information This chapter contains parts information for the 286/C4 and 286/E4. 451 System Parts 452 Service Guide Index Number FRU Part Number Units Per Assembly Description 1 N/A 1 Service Access Cover (drawer) 2 09P5865 Up to 2 Processor Cooling Fans 3 09P5887 (Up to 6) EMC PCI-Slot Shield 4 N/A 1 Processor Cooling Fan Cage and Rear Bulk Head Assembly 5 See Note Up to 6 PCI Adapters 6 09P5290 1 Six Slotted PCI Riser Card 09P2702 1 Four Slotted PCI Riser Card (not shown) 7 N/A 1 Chassis 8 00P4988 1 Right Rack Extension Rail (286/C4 Only) 9 09P5902 1 Diskette Drive Cage (Optional) 10 76H4091 93F2361 02K3488 1 Diskette Drive (Optional) 11 See Note 1 Media Device (example: Tape Drive) 12 09P5874 Up to 2 Bezel Media Slot Filler 13 09P5873 1 Front Bezel (286/C4 Only) 14 See Note 1 IDE CD-ROM (optional) 15 00P3210 1 Operator Panel 16 00P2797 2 Media Carrier Trays 17 00P2801 Up to 2 Media Bay Filler 18 00P4989 1 Left Rack Extension Rail (286/C4 Only) 19 00P4133 1 Media Device Support Shelf 20 See Note 4 Disk Drives (4 Pack) 21 00P4132 1 Disk Drive Cage/Backplane Assembly 22 00P2799 Up to 6 PCI Plastic Light Pipe and Air Flow Separators 23 00P2800 Up to 6 PCI Card Lock Assemblies Note: See Diagnostic Information for Multiple Bus Systems for part numbers. Chapter 10. Parts Information 453 System Parts (continued) 454 Service Guide Index Number FRU Part Number Units Per Assembly Description 1 21P4312 1 Cable Management Arm Assembly (286/C4 Only) 2 00P2806 1 Cable Carrier Support Bracket (286/C4 Only) 3 00P2807 1 Rack Flange Mounting Bracket (286/C4 Only) 4 00P2733 1 Processor Assembly, 1.45 GHz, 1-way 00P2736 Up to 2 Processor Assembly, 1.45 GHz, 2-way 5 53P3224 Up to 16 256 MB DIMMs 53P3226 Up to 16 512 MB DIMMs 53P3230 Up to 16 1 GB DIMMs 53P3232 Up to 16 2 GB DIMMs 6 00P3203 1 per RIO connector RIO Connector Cover (only available on 286/C4 that has not been upgraded to support an I/O subsystem) 7 00P4488 1 CEC Backplane 8 09P5866 2 Units PCI Cooling Fans (2 Fans per Unit) 9 N/A 1 Internal Cable Management bracket 10 N/A 1 Chassis 11 00P4134 1 Right Release Latch Bracket (286/C4 Only) 12 00P2804 2 Release Latch (286/C4 Only) 13 00P4135 1 Left Release Latch Bracket (286/C4 Only) 14 00P2805 2 Latch Mounting Bracket (286/C4 Only) 15 00P4342 Up to 2 Power Supply (Redundant Power Supply is Optional) 16 N/A 1 Front Power Supply and Processor Assembly Support Bracket 17 N/A 1 Plastic separator 18 09P5893 1 Processor Filler 19 09P5892 1 Power Supply Filler Not Shown 42F6841 Up to 2 AC Power Cable, (Power distribution bus to AC power supply, 286/C4 only) Note: See Diagnostic Information for Multiple Bus Systems for part numbers. Chapter 10. Parts Information 455 286/E4 Cover Set and Brackets Index Number FRU Part Number Units Per Assembly Description 1 00P3789 1 Top Cover Assembly (286/E4 Only) 2 00P3791 1 Operator Panel Extension Cable (286/E4 Only) 3 N/A 1 286/E4 4 00P3695 1 Top Cover Rear Mounting Bracket (286/E4 Only) 5 00P2810 1 System Bottom Mounting Plate (286/E4 Only) 6 00P3790 1 System Base and Right Cover Assembly (286/E4 Only) 7 00P3788 1 Power Cable Bracket (286/E4 Only) 456 Service Guide Chapter 10. Parts Information 457 Index Number FRU Part Number Units Per Assembly Description 1 00P3787 1 Operator Panel EMC Bracket (286/E4 Only) 2 00P2811 1 Operator Panel Bracket (286/E4 Only) 3 00P4637 1 Operator Panel EMC Filler (286/E4 Only) 4 00P2871 1 Top Cap (286/E4 Only) 5 00P2869 1 Disk Drive Access Cover (286/E4 Only) 6 00P2868 1 Front Door (286/E4 Only) 7 09P5885 1 Front Bezel (286/E4 Only) 8 N/A 1 286/E4 9 00P2593 1 Left Side Outer Access Cover (286/E4 Only) 10 00P2594 1 Rear Cover (286/E4 Only) 458 Service Guide System Internal Cables The following diagrams show the system cable connections. The first diagram illustrates the routing of the internal power cables. The second diagram illustrates the routing of the signal cables. Internal Power Cable Routing Diagram Index Number FRU Part Number Units Per Assembly Description 1 00P3210 1 Operator Panel 2 See Note 1 IDE CD-ROM 3 See Note Up to 2 SCSI Media Device 4 09P5870 1 CEC Backplane to Operator Panel Signal / Power Cable 5 09P5867 1 2-drop Power Cable from CEC Backplane to IDE CD-ROM and other SCSI Media Devices 6 09P5888 1 CEC Backplane to Disk Drive Backplane Power Cable 7 00P4132 1 Disk Drive Backplane 8 09P5864 1 Diskette Drive to CEC Backplane Power Cable 9 09P5900 1 PCI Cooling Fans #3 and #4 Power Cable 10 See Note 1 Diskette Drive 11 00P4488 1 CEC Backplane Notes: 1. For more information on power cables, go to “External AC Power Cables” on page 43. 2. See Diagnostic Information for Multiple Bus Systems for part numbers. Note: Chapter 10. Parts Information 459 Internal Signal Cable Routing Diagram 460 Service Guide Index Number FRU Part Number Units Per Assembly Description 1 00P3210 1 Operator Panel 2 See Note 1 SCSI Media Device (50-Pin) 3 See Note 1 SCSI Media Device (68-Pin) 4 See Note 1 SCSI Media Device (68-Pin) 5 09P5870 1 CEC Backplane to Operator Panel Signal / Power Cable 5 00P3791 1 Operator Panel Extension Cable (286/E4 Only) 5 00P3822 1 Operator Panel Interposer (286/E4 Only) 6 1 SCSI Media 50-pin to 68-pin Converter Cable 7 09P5862 1 Integrated SCSI Media Device Signal Cable 8 09P5863 1 CEC Backplane Connector to Diskette Drive Signal Cable 9 09P5868 1 CEC Backplane to IDE CD-ROM Signal Cable 10 See Note 1 Diskette Drive 11 See Note 1 IDE CD-ROM 12 00P4132 1 Disk Drive Backplane 13 09P5895 1 PCI Riser Card to Disk Drive Backplane Signal Cable (50 pin) 14 09P5889 1 PCI Riser Card to Disk Drive Backplane Signal Cable (68 pin) 15 09P5290 1 6 Slotted PCI Riser Card 15a 09P2702 1 4 Slotted PCI Riser Card 16 00P4488 1 CEC Backplane 17 09P5869 1 PCI Riser Card to External SCSI Signal Cable 18 N/A 1 External SCSI Port 19 09P5899 1 CEC Backplane to Rack Indicator Signal Cable 20 N/A 1 Rack Indicator Port 21 00P3200 1 CEC Backplane to Debug Port Signal Cable 22 N/A 1 Debug Parallel Port Note: See Diagnostic Information for Multiple Bus Systems for part numbers. Chapter 10. Parts Information 461 Power Cable Index Part Number Country or Region 1 1838574 Bahamas, Barbados, Bolivia, Brazil, Canada, Costa Rica, Dominican Republic, El Salvador, Ecuador, Guatemala, Guyana, Haiti, Honduras, Jamaica, Japan, Netherlands Antilles, Panama, Peru, Philippines, Taiwan, Thailand, Trinidad, Tobago, U.S.A. (except Chicago), Venezuela 2 86G7648 Bahamas, Barbados, Bermuda, Bolivia, Brazil, Canada, Cayman Islands, Colombia, Costa Rica, Dominican Republic, Ecuador, El Salvador, Guatemala, Guyana, Haiti, Honduras, Jamaica, Japan, Korea (South), Mexico, Netherlands Antilles, Nicaragua, Panama, Peru, Philippines, Puerto Rico, Saudi Arabia, Suriname, Trinidad, Taiwan, U.S.A. (except Chicago), Venezuela 2 62X1045 Chicago, U.S.A. 3 6952311 Argentina, Australia, New Zealand 4 13F9979 Abu Dhabi, Austria, Belgium, Bulgaria, Botswana, China (Macau S.A.R), Egypt, Finland, France, Germany, Greece, Iceland, Indonesia, Korea (South), Lebanon, Luxembourg, Netherlands, Norway, Portugal, Saudi Arabia, Spain, Sudan, Sweden, Turkey, Yugoslavia 5 13F9997 Denmark 6 14F0015 Bangladesh, Burma, Pakistan, South Africa, Sri Lanka 7 14F0033 Bahrain, Bermuda, Brunei, Channel Islands, China (Hong Kong S.A.R), Cyprus, Ghana, India, Iraq, Ireland, Jordan, Kenya, Kuwait, Malawi, Malaysia, Nigeria, Oman, People’s Republic of China, Qatar, Sierra Leone, Singapore, Tanzania, Uganda, United Arab Emirates (Dubai), United Kingdom, Zambia 8 14F0051 Liechtenstein, Switzerland 9 14F0069 Chile, Ethiopia, Italy 10 14F0087 Israel 11 6952291 Paraguay, Colombia, Uruguay 42F6841 AC Power Cable, (Power distribution bus to AC power supply, 286/C4 only) 1 2 3 4 5 6 7 8 9 10 11 462 Service Guide Keyboards and Mouse (White) Index Number FRU Part Number Units Per Assembly Description 1 93H8120 1 Keyboard, 101 United States English (ID 103P) 93H8123 1 Keyboard, 102 Spanish (ID 171) 93H8125 1 Keyboard, 102 Arabic (ID 238) 08L0904 1 Keyboard, 102 Belgium French (ID 120) 08L0905 1 Keyboard, 102 Belgium (ID 120) 93H8134 1 Keyboard, 102 Greek (ID 319) 93H8135 1 Keyboard, 101 Hebrew (ID 212) 08L0911 1 Keyboard, 102 Italy (ID 142) 08L0912 1 Keyboard, 102 Norwegian (ID 155) 93H8143 1 Keyboard, 101 Russian (ID 443) 08L0914 1 Keyboard, 102 Spanish (ID 172) 08L0915 1 Keyboard, 102 Sweden/Finland (ID 153) 08L0916 1 Keyboard, 105 Swiss F/G (ID 150) 08L0917 1 Keyboard, 102 UK English (ID 166) 93H8153 1 Keyboard, US English ISO9995 (ID 103P) 93H8154 1 Keyboard, 106 Japan (ID 194) 93H8155 1 Keyboard, 101 Chinese/US (ID 467) 93H8156 1 Keyboard, 103 Korea (ID 413) 2 76H5084 1 Three Button Mouse Chapter 10. Parts Information 463 464 Service Guide Chapter 11. Fault and Attention LEDs This chapter contains information about fault and attention LEDs, which assist in identifying failing components in your system. Operator Panel Display If a failing component is detected in your system, an amber-colored attention LED is turned on solid (not blinking). 1 Power-On Button 6 Operator Panel Display 2 Power LED 7 (FS1) Front Serial Connector (RJ-48 Connector) 3 Attention LED 8 Service Processor Reset Switch (Pinhole) 4 SCSI Port Activity 9 System Reset Button 5 Ethernet Port Activity Number Component Name Component Description 1 Power-On Button Turns the system power on and off. 2 Power LED Blinking - When connected to the power source (System is in standby mode). Solid - When power button has been pressed. Note: There is approximately a 30-second transition period from the time the power button is pressed until the power-on LED goes from blinking to on solid. During the transition period, you may observe the blinking intervals accelerate. 3 Attention LED Normal State - LED is off. 4 SCSI Port Activity Normal State - LED is on when there is SCSI activity. 5 Ethernet Port Activity Normal State - LED is on when there is Ethernet activity. 6 Operator Panel Display Displays current status of system startup, or diagnostic information in the event of a hardware problem. 7 Front Serial Connector (FS1) Serial port uses RJ-48 connector. Use to plug in external devices at the front of the system unit. 8 Service Processor Reset Switch (Pinhole) Service Personnel Use 9 System Reset Button Resets the system 465 Component LEDs Individual LEDs are located on or near the failing components. The LEDs are located either on the component itself or on the carrier of the component (for example, memory card, fan, memory module, or processor). LEDs are either green or amber. Green LEDs indicate either of the following: v Electrical power is present. v Activity is occurring on a link. (The system could be sending or receiving information.) Amber LEDs indicate a fault or identify condition. If your system or one of the components on your system has an amber LED turned on or blinking, identify the problem and take the appropriate action to restore the system to normal. The following table identifies the location, color and status of your system component LEDs. 466 Service Guide Unit (FRU) LED Function Location LED Color OFF ON Blink Power and Packaging LEDs Rack Indicator Drawer Sum Top of Rack Amber (1x) Normal Fault Identify AC Input Green (1x) No Input AC Good N/A System Unit Attention Operator Panel + Rear Amber (2x) Normal Fault Identify Power Operator Panel + Rear Green (2x) No AC System Powered On v Standby v Transition of Power from Standby to Power-On (Blinking intervals may accelerat before remaining on solid) Fans Identify Fan Top Amber (4x) Normal Fault N/A Power Supply (AC) Input Source Power Supply Front + Top Green (1x) No Input AC Input Good N/A Fault Power Supply Front + Top Amber (1x) Normal Fault Fast Blink - Identify Slow Blink - SPCN communication failure .25 Hz DC Output Good Power Supply Front + Top Green (1x) DC Off DC Good Standby Disk Drives (DASD) Activity DASD Board Green (4x) No Activity Activity N/A Remove Identify DASD Board Amber (4x) Normal Action Identify Optional Media No LED Electronics Component LEDs PCI Adapter Connectors (Slots) Activity Chassis Rear, Inside System Next to Adapter Green Refer to the PCI Adapter Table on page 469. Identify Chassis Rear, Inside System Next to Adapter Amber Refer to the PCI Adapter Table on page 469. Memory DIMMs Identify Processor Housing (Top) Amber (8x) Normal Identify CEC Backplane Identify CEC Backplane Amber (1x) Normal Identify Processor Board Identify Processor Housing (Top) Amber (1x) Normal Identify Chapter 11. Fault and Attention LEDs 467 Unit (FRU) LED Function Location LED Color OFF ON Blink Unit (FRU) LED Function Location LED Color OFF ON Blink PCI Riser Card Identify PCI Riser Card Amber (1x) Normal Identify Disk Drive Backplane No LED Port LED Indicators RIO Port Identify CEC Backplane Amber (2x) Normal N/A Identify SCSI Port Activity Operator Panel Green (1x) No Activity Activity Ethernet Port Ethernet 1, Ethernet 2 Mode Chassis Rear Green (2x) 10 Mbit Mode 100 Mbit Mode Activity Chassis Rear Green (2x) No Activity Activity Activity Operator Panel Green (1x) No Activity Activity See the following illustration for Ethernet connector LED locations on the rear of the 286/C4 and 286/E4. 1 Ethernet 1 Connector and LED 2 Ethernet 2 Connector and LED 3 Mode LED 10 Mbit - Off 100 Mbit - On 4 Activity LED On - Receiving Information Off - Not Receiving Information 468 Service Guide See the following illustration for RIO connector LED locations on the rear of the 286/C4 and 286/E4. 1 RIO-G 0 Identify LED 2 System Drawer Rear Power LED 3 System Drawer Rear Attention LED 4 RIO-G 1 Identify LED The following table explains the meaning of the on and off status of the PCI-X adapter LEDs. PCI Adapter Table LED Inactivity Active Identify Action Green + Amber (2 LEDs) Green - Off Amber - Off Green - On Amber - Off Green - On Amber - Blinking Green - Off Amber - Blinking Resetting the LEDs After the repair action is completed, do the following: 1. Log in as root user. 2. At the command line, type diag. 3. Select Task Selection. 4. Select Log Repair Action. 5. Select the device that was repaired. 6. Press F10 to exit diagnostics. If the Attention LED remains on after you have completed the repair action and reset the LEDs, call for service support. Chapter 11. Fault and Attention LEDs 469 470 Service Guide Appendix A. Environmental Notices Product Recycling and Disposal Components of the system unit, such as structural parts and circuit boards, can be recycled where recycling facilities exist. Companies are available to disassemble, reutilize, recycle, or dispose of electronic products. Contact your account representative for more information. This system unit contains parts such as circuit boards, cables, electromagnetic compatibility gaskets and connectors which may contain lead and copper/beryllium alloys that require special handling and disposal at end of life. Before this unit is disposed, these materials must be removed and recycled or discarded according to applicable regulations. This book contains specific information on each battery type where applicable. This product may contain a sealed lead acid, nickel cadmium, nickel metal hydride, lithium, or lithium ion battery. Consult your user manual or service manual for specific battery information. The battery must be recycled or disposed of properly. Recycling facilities may not be available in your area. For information on disposal of batteries, contact your local waste disposal facility. 471 Acoustical Noise Emissions The equivalent continuous A-weighted sound pressure level at workstations (emission sound pressure level at the 1-meter bystander positions) does not exceed 70 dB(A). Declared Acoustical Noise Emissions Product Configuration Declared A-Weighted Sound Power Level, LWAd (B) Declared A-Weighted Sound Pressure Level, (dB) at 1 meter Bystander Position Operating Idling Operating Idling 6E4 (Deskside) 6.0 5.9 42 41 6C4 (Rack Drawer) 6.1 6.0 44 43 Notes: 1. LWAd is the declared (upper limit) sound power level for a random sample of machines. (1B = 10dB) 2. LpAm is the mean value of the A-weighted sound pressure level at the 1-meter bystander positions for a random sample of machines. 3. 286/E4 levels apply to the following system configurations: 2 way 1 gigahertz processor, 2 disk drives, 2048 GB of RAM, non-redundant power supply (One power supply, 1 processor fan). 4. 286/C4 levels apply to the following hardware configurations: 2 way 1 gigahertz processor, 2 disk drives, 2048 GB of RAM, redundant power supply (Two 645 watt power supplies, 2 processor fans). 5. All measurements made in conformance with ISO 7779 and declared in conformance with ISO 9296. 472 Service Guide Appendix B. Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer’s representative for information on the products and services currently available in your area. Any reference to the manufacturer’s product, program, or service is not intended to state or imply that only that product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any intellectual property right of the manufacturer may be used instead. However, it is the user’s responsibility to evaluate and verify the operation of any product, program, or service. The manufacturer may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to the manufacturer. The following paragraph does not apply to the United Kingdom or any country where such provisions are inconsistent with local law: THIS MANUAL IS PROVIDED ²AS IS² WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions; therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. The manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Information concerning products made by other than the manufacturer was obtained from the suppliers of those products, their published announcements, or other publicly available sources. The manufacturer has not tested those products and cannot confirm the accuracy of performance, compatibility or any other claims related to products made by other than the manufacturer. Questions on the capabilities of products made by other than the manufacturer should be addressed to the suppliers of those products. 473 474 Service Guide Appendix C. Service Processor Setup and Test Attention: This procedure applies to modems attached to the serial ports (S1, S2, and S3) on the 286/C4 and 286/E4 systems. It does not affect the operation of the modem attached to the Hardware Management Console (HMC). The call-out function is normally handled by the Service Focal Point application running on the HMC. For your convenience, an example of a basic service processor setup checklist is included here. Your setup may include more or fewer of the available features, so you can adjust this checklist for your own application so that it is helpful to you. Service Processor Setup Checklist 1. Unplug the power cord from the system. 2. Attach a local terminal for this setup procedure. 3. Plug in the system and power on the local terminal. Attention: Make sure that system power remains off. 4. Start the service processor menus. See Chapter 7, “Using the Service Processor”, on page 305. 5. Set the system name. See “Privileged User Menus” on page 307. 6. Enable surveillance. See “Service Processor System Monitoring - Surveillance” on page 336. 7. Configure Call-In/Call-Out. See “Call-In/Call-Out Setup Menu” on page 325. 8. Exit the service processor menus. 9. Unplug the power cord from the system. 10. Attach modems (if needed). See “Accessing the Service Processor Menus Remotely” on page 306. 11. Plug in the system. Attention: Make sure that the system power remains off. 12. Test both of the following: v Call-In, see “Testing Call-In” on page 476 v Call-Out, see “Testing Call-Out” on page 476 13. Use the “Save or Restore Hardware Management Policies,” in the “Introduction to Tasks and Service Aids” section of the Diagnostic Information for Multiple Bus Systems to back up the service processor settings. Note: This step is strongly recommended to protect the usefulness of the service processor and the availability of the system. Your service processor is now ready for use. 475 Testing the Setup This section contains sample testing procedures to help ensure that your setup is working. These tests include communicating with the system operating system. Before you start, ensure that the necessary serial port(s) is configured. If you need assistance, refer to “Serial Port Configuration” on page 477. The system should be powered off as a result of the “Service Processor Setup Checklist” on page 475. Testing Call-In 1. At your remote terminal, call in to your system. Your system answers and offers you the Service Processor Main Menu after requesting your privileged access password. 2. Select System Power Control. 3. Select Power-On System. When you are asked if you wish to continue powering on the system, type Y. 4. After the system firmware and operating system have initialized the system, the login prompt displays at your remote terminal if you set up Seamless Modem Transfer (refer to “Transfer of a Modem Session” on page 517 for more information). This may take several minutes. When the login prompt displays, you have successfully called the service processor. 5. Type logout to disconnect from the operating system. The message No Carrier displays on your remote terminal. 6. Call your system again. The operating system answers and offers you the login prompt. If these tests are successful, call-in is working. 7. Log in and type shutdown to shut down your system. 8. The message No Carrier displays on your remote terminal. Testing Call-Out During the setup, you entered your phone numbers for the pager (on page 328) and customer voice (on page 328). These numbers are used for this test. 1. Your remote terminal is disconnected as a result of the Call-In test. 2. Call your system again. 3. At the service processor main menu, select Call-In/Call-Out Setup menu, then select Call-Out test. This action causes a simulated error condition for the purposes of this test. 4. After a few moments, a message displays, regarding an illegal entry. Press Enter to clear the message and return to the main menu. 5. When your telephone rings, answer the call. You should hear the sound of a telephone being dialed. Your computer is trying to page you. If this test is successful, call-out is working correctly. Return to the “Telephone Number Setup Menu” on page 327 to enter the actual telephone numbers your system will use for reporting problems. 476 Service Guide Serial Port Configuration To configure the serial port on an AIX system, run the following from an AIX console: 1. Log in as root user. 2. To determine if you have any serial ports already configured, type: lsdev -Cc tty If no serial ports are configured, none are listed. If you want to configure serial ports that are not listed, continue with the remaining steps. 3. Identify the serial port(s) with the modem(s). 4. Type smit tty 5. Select add tty 6. Select RS232 7. Select Baud rate 9600 or higher. 8. Select login enable and set the flow control to RTS. 9. Commit the selections and set up any other needed serial ports. 10. Exit SMIT. Appendix C. Service Processor Setup and Test 477 478 Service Guide Appendix D. Setting Up the System Rack Before setting up a rack check through the following specifications and clearances, then go to “286/C4 Rack Installation Inventory” on page 481. Rack Specifications This section contains specifications for the system rack. Model 14T/0 and 14T/4 Rack The mechanical packaging, cooling, power supply, and environmental specifications for the rack are shown in the following table: Dimensions Height 1804 mm 71.0 in. Capacity 36 EIA Units With PDP - DC only 1926 mm 75.8 in. Width without side panels 623 mm 24.5 in. With side panels 644 mm 25.4 in. Depth with rear door 1042 mm 41.0 in. RS/6000 T00 with both doors 1098 mm 43.3 in. pSeries T00 with both doors 1147 mm 45.2 in. Weight Base Rack 244 kg 535 lbs Full Rack 816 kg 1795 lbs. Electricalæ (sum specified values for drawers or enclosures in rack) AC Rack Power source loading maximum in kVA (per PDB)4 4.8 Voltage range (V ac) 200 to 240 Frequency (hertz) 50 or 60 Temperature Requirements (see specifications for drawers or enclosures) This table continues on the following page. 479 This table continued from the previous page. Humidity Requirements (see specifications for drawers or enclosures) Noise Emissions (see specifications for drawers or enclosures) Clearances Front Back Left Right Install/Air Flow Maintenance of a proper service clearance should allow proper airflow. Service 915mm(36 in) 915mm(36 in) 915mm(36 in) 915mm(36 in) 1. Configuration dependent, base rack weight plus the weight of the drawers mounted in the rack. The rack can support up to a maximum weight of 35 lbs/EIA Unit. 2. The total rack power should be derived from the sum of the power used by the drawers in the rack. 3. Each AC Power Distribution Bus (PDB) can supply 4.8 kVA. A rack can have up to four PDB’s as required by the drawers mounted in the rack. Noise Emission Notes 1. LWAd is the declared (upper limit) sound power emission level for a production series of machines. 2. LpAm is the mean value of the A-weighted sound pressure emission levels at the operator position (if any) for a production series of machines. 3. m is the mean value of the space-averaged A-weighted sound pressure emission levels at the one-meter positions for a production series of machines. 4. N/A = Not Applicable (no operator position). 5. All measurements are made in accordance with ISO DIS 779 and reported in conformance with ISO DIS 7574/4. 480 Service Guide System Service Clearances The following illustration shows the minimum space required. The amount of space needed by the units during service is indicated by the dotted line in the following figure. For multiple racks placed side by side, the left and right clearances apply only to the leftmost and rightmost rack. Rack Configuration 2474 mm (97.4 in.) 3564 mm 915 mm (141 in.) (36 in.) 915 mm (36 in.) 610 mm (24 in.) 610 mm (24 in.) Front Caster Location 121(4.8) (3.1) 80 Note: Rack units are large and heavy and are not easily moved. Because maintenance activities require access at both the front and back, allow for extra room. The footprint shows the radius of the swinging doors on the rack. 286/C4 Rack Installation Inventory h Books, CD-ROM and Other Media h ²About Your Machine² Document h Power Cables (1 standard, 2 optional) h 9-Pin to 25-Pin Serial Converters (2) (optional) Appendix D. Setting Up the System 481 h ASCII Terminal (optional) h Display, Cable (optional), and Cable Toroid (optional) h Mouse (optional) h Keyboard (optional), Wrist/Palm Rest (optional) h286/C4 h Rack-Mounting Template h 2 Slide Rail Assemblies h Cable Management Arm 482 Service Guide h Rack-Mounting Kit contains: 8 system to rail screws 8 rail to rack screws 2 nut clips 2 latch-mounting brackets 2 system-retaining thumbscrews alignment pin bracket kit 8 assorted alignment pins 1 strip of self-adhesive placement dots h RJ-48 to 9-Pin Converter Cable (1) Appendix D. Setting Up the System 483 Read the Rack Safety Instructions Review the following instructions for mounting the system drawer into the rack. Rack Safety Instructions v Do not install this unit in a rack where the ambient temperatures will exceed 35 degrees C. v Do not install this unit in a rack where the airflow is compromised. Any side, front or back of the unit used for air flow through the unit must not be in indirect contact with the rack. v Ensure that a hazardous condition is not created due to uneven mechanical loading when installing this unit in a rack. If the rack has a stabilizer it must be firmly attached before installing or removed this unit. v Consider the connection of the equipment to the supply circuit so that the overloaded circuits do not compromise the supply wiring or over-current protection. v An electrical outlet that is not correctly wired could place hazardous voltage on the metal parts of the system or the devices that attach to the system. It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock. When installing the 286/C4, you will need the following items: v Rack-Mounting Template v 2 Slide Rail Assemblies v Cable Management Arm v Rack-Mounting Kit Envelope v Screwdriver or similar tool, if available 484 Service Guide Attaching the Mounting Hardware to the Rack Enclosure Before performing this procedure, read through each step and study the illustrations. Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must read, and then perform each procedure step in the order given. Failure to perform each step in the order given may cause rail failure. To install the rack-mounting hardware into the rack, do the following: 1. Locate the rack-mounting hardware kit, rack-mounting template, and the system rails that were shipped with your system. 1 Rack-Mounting Template 2 Rack-Mounting Hardware Kit 3 System Rail Assemblies Notes: a. The system rails are front-to-back and left-to-right side dependent when you are standing in the front of and facing the rack. The rails are labeled on the front lower corner, left or right. The back of the rails can be identified by the two large alignment pins and the rail-length adjusting plate. The top of the rails can be identified by the screws located on the top of the front and rear rail flange. See the following illustration. b. The rack-mounting hardware kit has four changeable rail-alignment brackets and two sets of alignment pins. If the alignment pins that came adjoined to the rails do not fit your rack, remove the brackets from the rails. To remove the alignment-pin bracket from each of the rails, do the following: 1) Remove the screw located on top of the rail-bracket assembly. 2) Slide the top of the alignment pin bracket from beneath the front-rail bracket. 3) Lift the bottom tab out of the front-rail bracket slot. After determining the correct size of the alignment pin to use, mount the alignment pins onto the replacement brackets. Install the replacement brackets onto the rails. Rail bracket assembly procedure is the reverse of the removal procedure. c. Always mount the brackets that contain the short alignment pins onto the front of the rail. Appendix D. Setting Up the System 485 1 Left and Right Inner Rails 5 Top-Rail Flange Screw (Rear) 2 Left Rail Label 6 Rear-Rail Alignment Pins 3 Left Rail Assembly 7 Rail-Length Adjusting Plate 4 Right Rail Assembly 8 Inner-Rail Safety Release Latch 2. Remove the front rack door, and if necessary, remove the rear rack door. For 14T/0 and 14T/4 racks, refer to , order number SA23-2544, for information about removing the rack doors. 3. If you do not have a rack-mounting template, go to step 4. Using the rack-mounting template, determine where in the rack to place the system. Make note of the Electronics Industries Association (EIA) location number. Note: The rack-mounting template has printed illustrations located on the front of the template. Each illustration is designed to aid in identifying the EIA location holes used when planning to populate your rack. Do not use the rack-mounting template without reading and understanding the following substeps. Each step must be completed in its entirety. Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must read, and then perform each procedure step in the order given. Failure to perform each step in the order given may cause rail failure. 4. Note the following when using the rack-mounting template: v Each black or white unit on the template is equal to 1 EIA unit. v Each EIA unit consists of three holes. v The EIA units illustrated on the template must be aligned with an EIA unit located on the rack. v It is not necessary to align like-colored EIA units. For example, a black EIA unit illustrated on the rack-mounting template does not have to be aligned with a black EIA unit located on the rack. A black EIA unit on the rack-mounting template can be aligned with a white EIA unit located on the rack. See the following illustration. 486 Service Guide To use the rack mounting template, do the following: a. Remove the protective coating from each adhesive strip located on the back of the rack-mounting template. Lightly press the template into position on the rack. Ensure that both the left and right sides are at the corresponding EIA locations. b. Install the nut clips into the holes as indicated by the template. The nut clips aid in securing the system drawer to the rack while in transit. Note: Whenever a populated rack is being moved, secure your system drawers with two retaining thumbscrews threaded through the nut clips. This action secures the system front bezel and system chassis to the rack. 1 Nut Clip 2 Self-Adhesive Dots c. Locate the four dots, two printed on each side of the template. Place a self-adhesive dot directly across from the template’s printed dots on or near the rack’s EIA numbering strip. You will be using these dots to aid in correctly positioning the rail-alignment pins located on the front of each rail. d. Remove the rack-mounting template from the front of the rack. The front of your rack should now contain two nut clips and four self-adhesive dots. e. Mount the rack-mounting template to the rack’s rear EIA flange. Place the rack-mounting template at the same EIA numbered location that was used on the front of the rack. f. Wrap a self-adhesive dot directly across from the template’s printed dots. Ensure that a portion of the self-adhesive dot wraps around the rack’s flange so that it can be seen from the front of the rack. g. Do not install any nut clips on the rack’s rear flange as indicated by the rack mounting template. h. Remove the rack-mounting template from the rear of the rack. The rear of your rack should now contain four self-adhesive dots that have been partially wrapped around the rack’s rear flanges. i. Continue to step 5. Appendix D. Setting Up the System 487 The following illustration shows one EIA unit and four EIA units. Depending on the rack manufacturer, the EIA units may be separated either by color or by a line. Note the holes along the rail are not evenly spaced. If your rack has no color or line separation between EIA units, assume that each EIA unit begins where the hole spacing is closest together. 1 Self-Adhesive Dot Placement 3 Nut Clip 2 EIA Unit Hole Spacing 5. If you do not have a rack-mounting template, do the following: a. Determine where in the rack to place the system. The system you are about to install measures 4 EIA units high. Make note of the EIA location number. Note: An EIA unit on your rack consists of a grouping of three holes. b. Facing the front of the rack and working from the right side, place a supplied self-adhesive dot next to the top hole of the bottom EIA unit. Note: The self-adhesive dots are used to aid in identifying locations on the rack. If you no longer have any of the dots, use some other form of marking tool to aid you in identifying hole locations (for example, tape, a marker, or pencil). c. Place another self-adhesive dot next to the top hole of the above EIA unit. Note: If you are counting the holes, begin with the hole identified by the first dot and count up four holes. Place the second dot next to the fourth hole. d. Secure a nut clip to the rack, four holes up from the top dot (bottom hole of the top EIA unit). The nut clip aids in securing your system drawer to the rack while in transit. Notes: 1) If you are counting the holes, begin with the hole identified by the top dot and count up four holes. Place the nut clip next to the fourth hole. 2) Whenever a populated rack is being moved, secure the system drawers with two retaining thumbscrews threaded through the nut clips. This action secures the system front bezel and system chassis to the rack. 3) When counting from the nut clip to the bottom dot, there is an 8-hole span. e. Repeat substeps a through d to the corresponding holes located on the left side of the rack. 488 Service Guide 6. Before installing the system rail assemblies into your rack, you must first determine the type of system rail assembly shipped with your 286/C4. To determine the system rail assembly type, do the following: a. Unpack the system rail assemblies. b. Fully extend the inner and middle rail sections of each rail assembly. c. Does the middle rail section of each system rail assembly have a retaining screw clearance hole drilled through it as shown in the following illustration? If so, go to “System Rail Assembly (Type 1)” on page 490. 1 System Rail Assembly (Type 1) 3 Middle Rail Section 2 Rail to System Retaining Screw Clearance Hole d. If your system rail assemblies do not have a clearance hole drilled through the middle rail section, as shown in the following illustration, go to “System Rail Assembly (Type 2)” on page 493. 1 System Rail Assembly (Type 2) 3 Middle Rail Section 2 Outer Rail Section 4 Inner Rail Section Appendix D. Setting Up the System 489 System Rail Assembly (Type 1) Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must read, and then perform each procedure step in the order given. Failure to perform each step in the order given may cause rail failure. To install the system rail assembly (type 1), do the following: 1. Select the right rail assembly. 1 Right Rail Assembly (Type 1) 4 Middle Rail Section 2 Outer Rail Section 5 Inner Rail Section 3 Rail to System Retaining Screw Clearance Hole Note: The system rails are front-to-back and left-to-right side dependent when you are standing in the front of and facing the rack. The rails are labeled on the front lower corner, left or right. The back of the rails can be identified by the two large alignment pins and the rail-length adjusting plate. The top of the rails can be identified by the screws located on the top of the front and rear rail flange. 2. Position yourself at the front of the rack. 3. Facing the rack, insert the rail’s rear-alignment pins into the rack’s rear flange holes identified by the self-adhesive placement dots previously installed. 4. Insert the rail’s front-alignment pins into the rack front flange holes identified by the self-adhesive placement dots previously installed. Ensure that the EIA location is the same from front to back and the rails are level from front to back. 5. Loosely thread one of the rail-retaining screws into the rail’s bottom screw hole. The screw hole is located one position below the bottom alignment pin. 6. Locate one of the latch brackets, and place the bottom slot of the latch bracket over the rail-retaining screw. 490 Service Guide 1 Front Slide Rail Alignment Pin 2 Retaining Screws 3 Latch Bracket 7. Finger-tighten a second retaining screw through the top screw hole of the latch bracket. After the latch bracket is in place, ensure that both screws are finger-tight. Attention: Do not tighten any of the screws more than finger-tight until instructed to do so. Tightening the screws prevents the rails from self-aligning when the system drawer is attached. 8. Go to the back of the rack. 9. Loosen, but do not remove, the six retaining screws that secure the rail-length adjusting plate to the rail. Note: The rail-length adjusting plate should not move freely when loosened. 1 Rail-Length Adjusting Plate 2 Retaining Screws for Rail-Length Adjusting Plate 10. Facing the back of the rack, ensure that the right rail’s two alignment pins are positioned in the correct EIA unit’s holes. Use two rack-retaining screws to attach the rail brackets to the rack flanges. Finger-tighten both screws. Appendix D. Setting Up the System 491 11. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate. 1 Rail Assembly (Labeled ²Left² in the Front Lower Corner) 2 Rack to Cable Management Arm Bracket 3 Top Rear Alignment Pin 4 Bottom Screw 5 Top Screw 12. Locate the bracket that attaches the cable management arm to the rack flange. 13. Repeat substeps 1 through 11 when installing the left rail. 14. Position the bracket’s bottom hole over the rail’s top alignment pin. Note: The bracket’s bottom slot should now align with the top screw hole of the rail flange. 15. Attach a second retaining screw through the screw hole located below the top alignment pin. Finger-tighten the screw. 16. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate. 17. After both rails have been installed, ensure that none of the rail’s front retaining screws are more than finger-tight. The rails must be level from front to back and from left to right. 18. The rails are now completely installed. All screws used to attach the rails to the rack must be finger-tight only. Notes: a. Both slide rails must be completely retracted into the rack. b. The rails must be level from front to back and from left to right. 19. Read and understand the following caution notices, then go to “Installing the 286/C4 into the Rack Enclosure” on page 497. CAUTION: The stabilizer must be firmly attached to the bottom rear of the rack to prevent the rack from turning over when the drawers are pulled out of the rack. Do not pull out or install any drawer or feature if the stabilizer is not attached to the rack. C17 CAUTION: This unit weighs between 32 kg (70.5 pounds) and 55 kg (121.2 pounds). Three persons are required to safely move it. Using less than three persons to move it can result in injury. C05 492 Service Guide System Rail Assembly (Type 2) Attention: Mounting the rails is a complex procedure. To install the rack rails correctly, you must read, and then perform each procedure step in the order given. Failure to perform each step in the order given may cause rail failure. To install the system rail assembly (type 2), do the following: 1. Beginning with the right slide rail, remove the right inner rail, as shown in the following illustration. 1 Right Outer Rail Assembly 2 Right Inner Rail 2. Position yourself at the front of the rack. 3. Facing the rack, note the EIA number of the previously installed self-adhesive dots. Insert the rail’s rear-alignment pins into the corresponding EIA unit holes located on the rear flange. 4. Insert the rail’s front-alignment pins into the rack front flange holes that have been identified by the self-adhesive placement dots. Ensure that the EIA number is the same from front to back. 5. Loosely thread one of the rail-retaining screws into the rail’s bottom screw hole. The screw hole is located one position below the bottom alignment pin. 6. Locate one of the latch brackets, and place the bottom slot of the latch bracket over the rail-retaining screw. Finger-tighten the screw. 1 Front Slide Rail Alignment Pin 2 Retaining Screws 3 Latch Bracket Appendix D. Setting Up the System 493 7. Finger-tighten a second retaining screw through the top screw hole of the latch bracket. After the latch bracket is in place, ensure that both screws are finger-tight. Attention: Do not tighten any of the screws more than finger-tight until instructed to do so. Tightening the screws prevents the rails from self-aligning when the system drawer is attached. 8. Go to the back of the rack. 9. Loosen, but do not remove, the six retaining screws that secure the rail-length adjusting plate to the rail. Note: The rail-length adjusting plate should not move freely when loosened. 1 Rail-Length Adjusting Plate 2 Retaining Screws for Rail-Length Adjusting Plate 10. Facing the back of the rack, ensure that the right rail’s two alignment pins are positioned in the correct EIA unit’s holes. Use two rack-retaining screws to attach the rail brackets to the rack flanges. Finger-tighten both screws. 11. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate. 12. Locate the bracket that attaches the cable management arm to the rack flange. 13. Repeat substeps 1 through 11 when installing the left rail. 1 Rail Assembly (Labeled ²Left² in the Front Lower Corner) 2 Rack to Cable Management Arm Bracket 3 Top Rear Alignment Pin 4 Bottom Screw 5 Top Screw 14. Position the bracket’s bottom hole over the rail’s top alignment pin. Note: The bracket’s bottom slot should now align with the top screw hole of the rail flange. 494 Service Guide 15. Attach a second retaining screw through the screw hole located below the top alignment pin. Finger-tighten the screw. 16. Using a screwdriver, tighten the six screws that secure the rail-length adjusting plate. 17. After both rails have been installed, ensure that none of the rail’s front retaining screws are more than finger-tight. The rails must be level from front to back and from left to right. 18. The rails are now completely installed. All screws used to attach the rails to the rack must be finger-tight only. Notes: a. Both slide rails must be completely retracted into the rack. b. The rails must be level from front to back and from left to right. 19. Read and understand the following caution notices, then go to “Installing the 286/C4 into the Rack Enclosure” on page 497. CAUTION: The stabilizer must be firmly attached to the bottom rear of the rack to prevent the rack from turning over when the drawers are pulled out of the rack. Do not pull out or install any drawer or feature if the stabilizer is not attached to the rack. C17 CAUTION: This unit weighs between 32 kg (70.5 pounds) and 55 kg (121.2 pounds). Three persons are required to safely move it. Using less than three persons to move it can result in injury. C05 Appendix D. Setting Up the System 495 286/C4 Removal and Installation into a Rack Before performing these procedures, read “Safety Notices” on page xi. System Drawer Removal CAUTION: This unit weighs between 32 kg (70.5 pounds) and 55 kg (121.2 pounds). Three persons are required to safely move it. Using fewer than three persons to move it can result in injury. CAUTION: v Removing the system drawer requires three people. v Do not place more than one drawer in the service position at one time. v Ensure that the system cables are not tangled and will not bind when placing the system drawer into the service or operating positions. To remove the system drawer from the rack, do the following: 1. If you have not already done so, shut down the system as described in “Stopping the System without an HMC Attached” on page 359. 2. Open the front and rear rack doors. 3. Disconnect all cables from the rear of the system drawer. 4. Disconnect the cable-management arm by pulling out the two captive snap buttons from the rear of the system drawer. 5. From the front of the system drawer, lift the lower section of the system drawer release latches, which are located on each side of the system drawer. The system drawer release latch clicks as it is put into the release position. 6. Grasp the top section of both latches, and slowly pull the system drawer out into the service position. 7. Depress both safety release tabs located on each inner rail, and pull the system drawer out about 2 inches. 8. Put the system drawer into the service position. 9. Using three persons, grasp the system drawer handles located on each side of the system. 10. Depress the safety latches located on each rail. 11. Remove the system by pulling it completely out and away from the rack and outer rack rails. 12. Push the extended outer rails carefully back into the rack. Note: There is a gravity-activated locking tab at the back of each extended outer rail. The locking tab locks the outer rails into their fully extended position and is unlocked by the action of the inner rail being pushed back into the rack. To compensate for the separation of the inner rail from the outer rail, center the lock tab with your hand, then push the outer rail back into its retracted position. 13. Close the front and rear rack doors. 496 Service Guide Installing the 286/C4 into the Rack Enclosure Attention: When installing this system drawer into a rack, ensure that a hazardous condition is not created due to uneven mechanical loading. If your rack uses a stabilizer bar, it must be firmly attached before installing or removing this system. Choose one of the following system installation procedures. The installation procedure you choose depends on the type of system rail assembly that was installed into your rack. v If your system rail assemblies are type 1, go to “Install the 286/C4 onto a System Rail Assembly (Type 1)”. v If your system rail assemblies are type 2, go to “Install the 286/C4 onto a System Rail Assembly (Type 2)” on page 499. Install the 286/C4 onto a System Rail Assembly (Type 1) Attention: When installing this unit in a rack, ensure that a hazardous condition is not created due to uneven mechanical loading. If your rack uses a stabilizer bar, it must be firmly attached before installing or removing this system. To install the 286/C4 onto a system rail assembly (type 1), do the following: 1. Ensure both rails are in the retracted position in the rack. 2. Using three persons, grasp the two handles located on each side of the system drawer, and lift the system drawer. 3. Place the system drawer on top of the retracted slide rails. Note: Built-in ledges located on each side of the system prevent the system from falling past the slide rails. 4. Push the system all the way into the rack. 5. Pull the system out of the rack until you can see the second screw hole located on the inner rail. Notice that the inner rails followed the system drawer out of the rack. 6. Align the second screw hole, located on the inner rail, with the second threaded screw hole, located on the side of the system drawer. Secure the system drawer to the inner rail with M3 screws. 7. Align the front screw holes of each rail with the threaded screw holes on the system drawer. Secure each screw hole with M3 screws. 8. To secure the third and fourth M3 retaining screws to the system, do the following: a. Pull the system drawer from the rack. Attention: Do not pull the system drawer out past the safety latches. b. Align the rail to system retaining-screw access hole located on the middle rail to the third position retaining screw hole located on the inner rail. c. Secure an M3 screw into each rail’s third position retaining screw hole. d. Align the rail to system retaining-screw access hole located on the middle rail to the fourth position retaining screw hole located on the inner rail. e. Secure an M3 screw into each rail’s fourth position retaining screw hole. Appendix D. Setting Up the System 497 1 Rack Enclosure 3 Safety Release Latch 2 System Retaining-Screw Access Hole 4 286/C4 9. Tighten all eight M3 screws, securing the inner rail to the system drawer side. 10. Push the system drawer about halfway back into the rack. Use a screwdriver or similar tool to tighten the front four M7 16-mm screws that secure the system rail assemblies to the rack’s front flange. 11. Push the system drawer completely into the rack. Use a screwdriver or similar tool to tighten the rear four M7 16-mm screws that secure the system rail assemblies to the rack’s rear flange. Note: After the system rail assemblies are installed, do not extend each rail section past its safety release latch. The safety release latches stop the rails from overextending and separating. This action prevents the system drawer from being accidentally pulled out too far and dropped. 12. Tighten the two M3 screws that secure each rail-length adjusting plate. 13. Slowly slide the system drawer back and forth on the rails, watching for any signs of binding. 14. For additional stability needed when transporting the rack, fasten the system drawer to the rack enclosure by inserting an M8 16-mm thumbscrew through the bezel and chassis bracket, and screwing it to the rack flange with a screw clip nut. 1 286/C4 Front Bezel 2 M8 16-mm Thumbscrew 3 System Drawer Release Latch 498 Service Guide Install the 286/C4 onto a System Rail Assembly (Type 2) To install the 286/C4 onto a system rail assembly (type 2), do the following: 1. Using eight M3 rail to system retaining screws, four on each side, mount the inner rails to each side of the system drawer. Tighten each of the screws with a screwdriver or similar tool. 1 Outer Rail Section 4 M3 Rail to System Retaining Screws 2 Middle Rail Section 5 Safety Latch 3 Inner Rail Section 6 286/C4 Note: The safety release latch, located on each inner rail, is mounted onto the side of the system facing away from the system drawer. See the illustration on page 486. 2. Using three persons, grasp the two handles located on each side of the system drawer, and lift the system drawer. 3. Insert the inner rails located on the system drawer into the rails mounted in the rack. Attention: When matching the inner rails to the outer rails, ensure that both rail faceplates glide past each other. Do not, under any circumstances, force the rails. If the system drawer does not glide freely into the rack, completely remove the drawer from the outer rails. After the drawer is clear of the outer rails, reposition the system, then reinsert the inner rails. Repeat this process until the system drawer glides freely into the rack. 4. Simultaneously press the safety latches, located on each side of the system drawer, and push the system into the rack. Appendix D. Setting Up the System 499 1 Rack Enclosure 3 286/C4 2 Safety Release Latch 4 5. Slide the system drawer in and out of the rack two or three times. This action aligns the system drawer to the rails. The system drawer should glide on the rails. 6. Push the system drawer back into the rack. Using a screwdriver or similar tool, tighten the rear four retaining screws that secure the slide rails to the rack’s rear flange. 7. Slide the system drawer about halfway out of the rack. 8. Using a screwdriver or similar tool, tighten the front four retaining screws that secure the slide rails to the rack’s front flange. Note: After the system rails are installed, do not extend them past their safety latches. The safety release latches stop the rails from overextending and separating. This action prevents the system drawer from being accidentally pulled out too far and dropped. Attention: If any binding is detected, loosen the eight rail-retaining screws (front and rear), and repeat substeps 4 through 7. 9. For additional stability needed when transporting the rack, fasten the system drawer to the rack enclosure by inserting an M8 16-mm thumbscrew through the bezel and chassis bracket, and screwing it to the rack flange with a screw clip nut. 1 286/C4 Front Bezel 2 Thumbscrew 3 System Drawer Release Latch 500 Service Guide Installing the Cable-Management Arm To install the 286/C4 cable-management arm, do the following: 1. Attach the cable carrier support bracket to the back of the chassis with two screws. 2. Place the two captive snap buttons, located on the cable-management arm, into the unlocked (pulled out) position. 3. Align and insert the two captive snap buttons into the holes located on the cable carrier support bracket. Push in on the head of the snap buttons to lock them into position. 1 286/C4 4 Rack Flange Mounting Bracket 2 Cable Carrier Support Bracket 5 Cable Management Arm 3 Captive Snap Button (Qty. 2) 4. Open the cable-management arm so that the hinged mounting plate lies flat against the flange-mounting bracket. 5. Align the holes, located on the cable-management arm mounting plate, with the threaded holes on the rack-flange mounting bracket. Secure the bracket with two screws. Note: To avoid any binding of the cable-management arm, ensure that the cable-management arm is level. Appendix D. Setting Up the System 501 502 Service Guide Appendix E. Installing a Hardware Management Console (HMC) The Hardware Management Console (HMC) user interface provides the functions needed to create and maintain a single or multiple-partitioned environment. Position the HMC and Monitor Position the HMC and monitor at or near their desired location. v The HMC weighs between 34 kg (75 pounds) and 41 kg (90 pounds). Do not try to lift the HMC by yourself. v Monitors can weigh as much as 35 kg (77 pounds). Use caution when lifting or moving the monitor. v Leave enough space around the HMC to safely and easily complete the setup procedures. v Observe standard ergonomic guidelines while arranging your system unit so that you can work comfortably and safely. For more information on arranging your workstation, visit the Healthy Computing Web address at: http://www.pc.ibm.com/us/healthycomputing. v Be sure to maintain at least 51 mm (2 inches) of space on the sides of the system unit and 152 mm (6 inches) at the rear of the system unit to allow the system unit to cool properly. The front of the system requires a minimum of 76 mm (3 inches) of space. Blocking the air vents can cause overheating, which might result in a malfunction or permanent damage to the system unit. v Place the HMC in a location where all necessary power outlets and network connections can safely be reached. v Place the display in a stable and sturdy location. 503 Connect the Cables Use the following steps to connect the cables to your HMC. Look for the small icons on the back of your HMC that show where to attach the keyboard, mouse and display cables. 1. Attach the monitor cable to the monitor connector and tighten the screws. 2. If a label for the monitor was shipped with your system, attach the label to the bottom-right corner of the monitor. 3. Attach the power cables to the monitor as shown in the following illustration. If the HMC has a voltage switch, ensure that it is in the correct position for the supply voltage. Attention: Do not plug the power cables into an electrical outlet at this point. 4. Connect the mouse and keyboard to their connectors, as shown in the following illustration. 504 Service Guide 5. Connect the HMC serial cable into the HMC1 connector located on the back of your system unit. For two HMCs, connect the redundant HMC serial cable into the HMC2 connector located on the back of your system unit. The following illustration shows the location of the serial ports on the back of the HMC, as well as HMC1 and HMC2 connectors located on back of your system unit. 1 286/C4 or 286/E4 2 HMC1 Connector 3 HMC2 Connector 4 Hardware Management Console (HMC) The cable part numbers used for connecting the 286/C4 and 286/E4 to the HMC serial cables are as follows: 11P3955 6m cable 9Pos to 9Pos 11P3956 15m cable 9Pos to 9Pos 31L7196 15m cable 9Pos to 25Pos Appendix E. Install a HMC 505 Connect the 8-Port Adapter Cables If you are using any optional 8-port adapters, connect the cables to the appropriate connectors in slots 1 through 4. Connect the External Modem To connect the external HMC modem, do the following: 1. Connect the modem cable to the external HMC modem (1). 2. Connect the other end of the modem cable to serial port 2. 3. Connect the phone cable line port of the external modem (2). 4. Connect the other end of the phone cable to the analog jack on your wall. 1 External HMC Modem 2 External Modem Connector 506 Service Guide Check the Microswitch Setting on the Modem Check the microswitch settings on the modem. The settings should be set as follows: Switch Position Function 1 Up Force DTR 2 Up Flow Control &E4 3 Down Result Codes Enabled 4 Up Modem Emulation Disabled 5 *Down Auto Answer Enabled 6 Up Maximum Throughput Enabled 7 Up RTS Normal Functions 8 Down Enable Command Mode 9 Down Remote Digital Loopback Test Enabled 10 Up Dial-Up Line Enabled 11 Down AT Responses Enabled (Extended Responses Disabled) 12 *Down Asynchronous Operation 13 UP 28.8KB Line Speed 14 Up 15 Up CD and DSR Normal Functions 16 Up 2-Wire Leased Line Enabled Note: * Only switches 5 and 12 are changed from the factory default settings. Appendix E. Install a HMC 507 Connect the LAN Cable (If Applicable) The LAN cable is recommended because an active partition reports errors to the HMC’s Service Focal Point application through the LAN network. The HMC must be attached to the system’s LAN, and a partition should have a LAN adapter assigned as a resource. Connect the LAN to the adapters assigned to the partition and to the LAN connection on the HMC system hardware. Plug in the HMC Power Cable 1. Plug in the power cable as shown in the following illustration. 2. Plug in the power cables for the monitor, HMC, and modem into electrical outlets. Configure the Network To configure the network, do the following: v Log in to the (HMC) using the user ID hscroot and password abc123. Note: User IDs and passwords are case-sensitive. Enter the user ID and password exactly as shown. v Select the System Configuration Application in the Navigation area. v Select Customize Network Settings in the Contents area. v Type the appropriate network information in the IP Address, Name Services, and Routing tabs on the Network Configuration window. v Select OK. For more information on configuring the network, refer to the Hardware Management Console Maintenance Guide, order number SA23-1251. Configure Inventory Scout Services Inventory Scout Services is an AIX tool that surveys managed systems for hardware and software information. To set up Inventory Scout Services for each system managed by the HMC, and for the partition running an instance of AIX, the partition must be configured in order to conduct microcode surveys or collect Vital Product Data (VPD). To set up Inventory Scout Services for each managed system and partition, you must be a member of one of the following roles: v System Administrator v Advanced Operator v Service Representative 508 Service Guide Refer to the Hardware Management Console Maintenance Guide, order number SA23-1251 for more information on the previous roles. Note: v The Inventory Scout listening port is set by starting the invscoutd daemon on the AIX images. This port will default to 808. v If a system has been powered on using the Full System Partition power-on option, you must configure the Full System Partition to use Inventory Scout Services. To set up Inventory Scout Services, do the following: 1. In the Navigation area, double-click the Inventory Scout Services icon. 2. In the Contents area, select Inventory Scout Profile Configuration. 3. From the list, select a managed system. 4. Select Next. 5. From the list, select the partition you want to configure. 6. Select Next. 7. The next window identifies the selected partition. Type the following: v Partition password (Use the Inventory Scout user ID password) v Inventory Scout listening port v IP address of the AIX partition Click Finish if you have completed configuration of all partitions. OR Click Back to continue to configure additional partitions until all are configured. Collect Vital Product Data Information Use this task to collect the Vital Product Data (VPD) for the specified managed system into a file. To collect the managed system’s VPD, you must be a member of one of the following roles: v System Administrator v Advanced Operator v Service Representative To collect the managed systems VPD, do the following: 1. In the Navigation area, double-click the Inventory Scout Services icon. 2. In the Contents area, select Collect VPD Information. 3. From the list, select the name of the managed system for which you want to collect the Vital Product Data. 4. Select Next. 5. The wizard requests confirmation about the managed system, and then prompts you to insert a blank, DOS-formatted diskette into the HMC diskette drive. 6. Click Finish. This file is then copied to the diskette in the specified drive. To conduct a microcode survey, refer to the Hardware Management Console Maintenance Guide, order number SA23-1251. Appendix E. Install a HMC 509 Configure the Service Agent Note: You must configure the network before you configure the Service Agent. To start the Service Agent, click on the Service Agent icon on the HMC virtual terminal window. If you are configuring Service Agent for the first time, select Start Service Agent Processes to start the service agent processes. Select Service Agent UI. You are prompted to type specific information or parameters pertinent to Service Agent. To configure the Service Agent, do the following: 1. Read and accept the License Agreement. 2. For the initial password prompt, type: password The word password is the default password that has been supplied. Note: User IDs and passwords are case-sensitive. Enter the user ID and password exactly as shown. 3. Type the information as follows: Customer Contact Name The person that we can reach when an error is reported to us by the Service Agent. Customer Phone Number The phone number associated with the person whose name is placed in the Customer Contact Name field. Customer Email The e-mail address of the person whose name is placed in the Customer Contact Name field. Queue Country/Region The country or region in which the HMC is located. Click the drop-down list to expand the country or region choices. Type the first letter of the desired country or region to move to the first occurrence of a country or region whose first letter matches your choice. Gateway Type The machine type of the processor subsystem. Gateway Serial Number The serial number of the processor subsystem that you designate to be the machine to send Service Agent information to us. Gateway Model Number The model number of the processor subsystem that you designate to be the machine to send Service Agent information to us. 4. Select Continue. The Service Agent interface displays. 5. Select Network. Fill in the appropriate information. Note: Some of the fields are prefilled with information taken from the Service Agent initial configuration panel. 6. Select OK. 7. Expand the contents of the Gateway category by clicking the key next to your Gateway machine. 8. Select Dialer. 9. Select Location. Select the location phone number closest to your physical location. Several of the fields are then completed by Service Agent based on the location phone number you selected. 510 Service Guide 10. Select Modem. Select the modem that you will use for Service Agent communications. After your selection, several of the remaining fields are filled in by Service Agent. 11. Select OK to save the Dialer configuration. 12. Expand the contents of the Administration category by clicking the key to its left. 13. Select Register. 14. Select your Gateway machine by clicking it. 15. Select Register. The ²Would you like to IGN now² window displays. 16. Select Yes. The Service Agent is now attempting to connect to the Service Agent server, using the modem you have selected. 17. Select CallLog. Check the details pane to view the status of the just-attempted call. To determine the success or failure of the connection attempt, examine the Description column. For more information on the Service Agent, refer to the Hardware Management Console Maintenance Guide, order number SA23-1251. Appendix E. Install a HMC 511 512 Service Guide Appendix F. Modem Configurations Attention: This appendix applies only to modems attached to serial ports S1 and S2 located on the CEC backplane (location U0.1-P1). The service processor is designed to place little demand on an attached modem, thereby increasing the setup and connection success rates. Sample Modem Configuration Files This appendix contains information about several sample modem configuration files that either work directly with your modem or provide a good starting point for a custom setup. The sample modem configuration files are located in your service processor firmware in the /usr/share/modems subdirectory (if your system / server is running AIX) with the following names. A listing of each specific file is included in this appendix. Generic Modem Configuration Files AIX File Name Service Processor Firmware File Name modem_z.cfg modem_z_sp modem_z0.cfg modem_z0_sp modem_f.cfg modem_f_sp modem_f0.cfg modem_f0_sp modem_f1.cfg modem_f1_sp Specific Modem Configuration Files AIX File Name Service Processor Firmware File Name modem_m0.cfg modem_m0_sp modem_m1.cfg modem_m1_sp Use the following selection procedures and your modem manual to determine which of the configuration files is suitable for your use. Configuration File Selection Use the following steps to select a configuration file: 1. Is your modem an IBM 7852-400? If Yes, use modem configuration file modem_m0.cfg and go to step 7 on page 514. Note: The IBM 7852-400 modem has DIP switches on the right side of the unit. See “IBM 7852-400 DIP Switch Settings” on page 516 for the correct switch settings. If No, continue with step 2 on page 514. 513 2. Is your modem an IBM 7857-017? If Yes, use modem configuration file modem_m1.cfg and go to step 7. Note: The IBM 7857-017 modem has two telephone line connections on the back of the unit. One is marked LL (for Leased Line), and the other is marked PTSN (for Public Telephone Switched Network). The service processor expects to use the modem on the public network, so the telephone line should attach to the PTSN connector. If No, continue with step 3. 3. Does your modem respond to the extended command set (prefixed with &)? If Yes, go to step 5. If No, continue with step 4. 4. Does your modem respond to: v ATZ reset command or v ATZn reset commands, where n can be 0, 1, and so on? If ATZ, configuration file modem_z.cfg is recommended. If ATZn, configuration file modem_z0.cfg is recommended. Go to step 7. 5. Does your modem command set include a test for V.42 error correction at the remote modem (often called “Auto-Reliable Mode”)? If Yes, disable this test. You can use sample configuration files /usr/share/modem_m0.cfg or /usr/share/modem_m1.cfg as models to help you create a file for your particular modem. See “Customizing the Modem Configuration Files” on page 515 for more information. Go to step 7. If No, go to step 6. 6. Does your modem respond to: v AT&F reset command or v AT&Fn reset commands, where n can be 0, 1, and so on? If AT&F, configuration file modem_f.cfg is recommended. If AT&Fn, configuration file modem_f0.cfg or modem_f1.cfg is recommended, depending on which provides the hardware flow control profile. 7. You have completed selection of the configuration file. If your modem configuration selection is not available in the Service Processor Modem Configuration Menu, you must access it through the Configure Remote Maintenance Policy Service Aid. If you find it necessary to adjust any of these configuration files, use the manual provided with your modem to accomplish that task. It is recommended you select settings that enable hardware flow control and respond to DTR. Note: Some older modems do not respond to the X0 or &R1 commands. Edit out these commands from the modem configuration file if yours is such a modem. See your modem manual for more information. Some modems, such as the IBM 7857-017, are not designed for the paging function. Although they can be used for paging, they return an error message when they do not get the expected response from another modem. Therefore, even though the paging was successful, the error message causes the service processor to retry, continuing to place pager calls for the number of retries specified in the Call-Out Policy Setup Menu. These retries result in redundant pages. 514 Service Guide Examples for Using the Generic Sample Modem Configuration Files The following table contains information to help you determine which modem configuration file to use with various modems. Modem Setup Z Setup Z0 (Rare) Setup F Setup F0 Setup F1 AT&T DataPort 2001 (Ring interrupt only on first ring) X Bocamodem 1440E X Hayes Smart Modem 300 X IBM 5841 X IBM 5843 X IBM 7851 X IBM 7852-10 X IBM 7855 X USRobotics 36.6K Sportster X Zoom V.32 X Customizing the Modem Configuration Files You can create your own modem configuration files or modify the samples provided. After you customize your modem configuration files, you must access them through the Configure Remote Maintenance Policy Service Aid rather than from the service processor menus. Note: If you have already set up your serial ports, line speeds, authorizations, and telephone numbers from the service processor menus, use the service aid to specify your customized modem configuration files. If you have not already set up your serial ports, line speeds, authorizations, and telephone numbers from the service processor menus, use the service aids to set them while you specify your customized modem configuration files. To disable Auto-Reliable Mode testing of the remote modem, use the sample modem configuration file /usr/share/modems/modem_f.cfg as a model that you can modify, as follows: 1. Find the necessary command in your modem manual. 2. Copy the /usr/share/modems/modem_f.cfg file to a new file with a different name (for example, modem_fx.cfg). 3. In the new file (modem_fx.cfg), change the line Send "ATE0T\r" to Send "ATcccE0T\r" where ccc is the added command as specified in your modem manual, as follows: Change the third line of each of the following stanzas: v condout v condin v ripo 4. Save the changes. Appendix F. Modem Configurations 515 IBM 7852-400 DIP Switch Settings If you are using a 7852-400 modem to enable service processor communications, for proper operation, the dual inline package (DIP) switches must be set according to the following table: Switch Position Function 1 Up Force DTR 2 Up Flow Control &E4 3 Down Result Codes Enabled 4 Down Modem Emulation Disabled 5 Up Auto Answer Enabled 6 Up Maximum Throughput Enabled 7 Up RTS Normal Functions 8 Down Enable Command Mode 9 Down Remote Digital Loopback Test Enabled 10 Up Dial-Up Line Enabled 11 *Up AT Responses Enabled (Extended Responses Disabled) 12 *Down Asynchronous Operation 13 Up 28.8KB Line Speed 14 Up 15 Up CD and DSR Normal Functions 16 Up 2-Wire Leased Line Enabled * Only switches 11 and 12 are changed from the factory default settings. Xon/Xoff Modems Some early modems assume software flow control (Xon/Xoff) between the computer and the modem. Modems with this design send extra characters during and after the transmitted data. The service processor cannot accept these extra characters. If your configuration includes such a modem, your functional results may be unpredictable. The sample modem configuration files included in this appendix do not support these modems, so custom configuration files are necessary. Anchor Automation 2400E is an example of such a modem. If you experience unexplainable performance problems that may be due to Xon/Xoff characters, it is recommended that you upgrade your modem. Ring Detection Most modems produce an interrupt request each time they detect a ring signal. Some modems generate an interrupt only on the first ring signal that they receive. AT&T DataPort 2001 is an example of such a modem. The service processor uses the ring interrupt request to count the number of rings when Ring Indicate Power-On (RIPO) is enabled. If your modem produces an interrupt on only the first ring, set Ring Indicate Power-On to start on the first ring. Otherwise, you can choose to start Ring Indicate Power-On on any ring count. 516 Service Guide Terminal Emulators The service processor is compatible with simple ASCII terminals, and therefore compatible with most emulators. When a remote session is handed off from the service processor to the operating system, agreement between terminal emulators becomes important. The server’s operating system will have some built-in terminal emulators. You may also have a commercially available terminal emulation. It is important that the local and host computers select the same or compatible terminal emulators so that the key assignments and responses match, ensuring successful communications and control. For best formatting, choose line wrap in your terminal emulator setup. Recovery Procedures Situations such as line noises and power surges can sometimes cause your modem to enter an undefined state. When it is being used for dial-in, dial-out or ring indicate power-on, your modem is initialized each time one of these actions is expected. If one of these environmental conditions occur after your modem has been initialized, it might be necessary to recover your modem to a known state. If your modem communicates correctly with remote users, it is probably in control. It may be wise to occasionally change some of the functional settings and then change them back, just for the sense of security that the modem is communicating, and to ensure it has been initialized recently. If your system is particularly difficult to access physically, another strategy is to protect it with an Uninterruptible Power Source (UPS) and a phone-line surge protector. In case recovery becomes necessary, shut down your system using established procedures. Disconnect the power cable and press the power button to drain capacitance while power is disconnected. Disconnect and reconnect modem power, and then reconnect system power to completely reinitialize your system. Transfer of a Modem Session Because many modem command variations exist, the sample modem configuration files located at the end of this appendix have been written to capture the largest number of workable modem settings. The modem command &Dn (where n is a number) generally sets the modem response to the Data Terminal Ready (DTR) signal from the server’s serial port. The desired response is that the modem will hold a connection while DTR is enabled, and drop the connection when DTR is released. Using this mechanism, the server hangs up on a connection under normal conditions. Consult your modem manual for its specific response scheme for the &Dn command. Two strategies are available for dealing with the modem’s response to DTR: v Recovery v Prevention Before you use one of these strategies, determine if your server’s modem is set up correctly to respond to DTR. With the remote terminal connected to serial port 1 and defined as the Primary Console Device, there are two tests you can perform: 1. Will the modem drop the connection after the System initialization complete message displays at the remote terminal? If Yes, the modem is set up correctly. Appendix F. Modem Configurations 517 If No, try another &Dn setting for your server’s modem. See your modem manual for this information. The &Dn command appears in three places each in three of the sample modem configuration files, as follows: 2. Will the server’s modem disconnect when the power drops? You can make this observation at the remote terminal by commanding your server to shut down and power off. (Use the AIX command shutdown -F.) Watch for the message NO CARRIER on your remote terminal. If Yes, this is the correct response. The modem is set up correctly. If No, try another &Dn setting for your server’s modem. See your model manual for this information. The &Dn command appears in three places each in three of the sample modem configuration files. Only the following sample modem configuration files contain the &Dn command (in three places each): v modem_f.cfg v modem_f0.cfg v modem_f1.cfg If you are using modem_z.cfg or modem_z0.cfg, you cannot control DTR response. If your remote terminal does not disconnect after logging off, you must command the remote terminal emulator to hang up. This then breaks the connection. Recovery Strategy The recovery strategy consists of making two calls to establish a remote session. This solution is the easiest to implement and allows more freedom for configuring your server’s serial ports. To set up a remote terminal session, dial into the service processor and start the system. After the operating system is loaded and initialized, the connection will be dropped. At this point, call the server back and the operating system will answer and offer you the login prompt. Prevention Strategy The disconnect is caused by the operating system when it initializes the Primary Console. The tests listed in “Transfer of a Modem Session” on page 517 are conducted with the remote terminal selected as the primary console to manifest the modem’s response to DTR transitions. v If a local ASCII terminal or a graphics console is to be a permanent part of your server, then make one of them the primary console. Your remote terminal will no longer experience the connection loss. v If a local console is not a permanent part of your server, you can still assign either the unused graphics console or the unused serial port as the primary console. This gives you the desired seamless connection at your remote terminal. v If you choose to use the unused serial port as the primary console, some initialization traffic will be sent to any serial device attached to that port. As a result, that serial device’s connection and function could be affected. These impacts may make that port unattractive for devices other than a temporary local ASCII terminal. 518 Service Guide Modem Configuration Sample Files Sample File modem_m0.cfg ## COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP: modem_m0 ## FUNCTIONS: Modem configuration file specifically for IBM 7852-400 # modem with Auto-Reliable feature. This feature must be turned off # for Catcher calls. This example uses the AT&F reset command to # choose the factory defaults. ## (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM ## US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. ### The modem has configuration switches. They should be set to the # factory default settings, except switches 11 and 12. These must be # to UP ("AT" responses) and DOWN (Asynchronous operation), respectively. ICDelay 1 DefaultTO 10 CallDelay 120 ## %N Call-Out phone number %R Return phone number ### PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F&E2E0T\r" # Reset to factory defaults # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "AT&SF1&S0S9=1\r" # DSR independent of CD # Force DSR on. # CD respond time=100ms expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=0\r" # Numeric response code # Auto-Answer off expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or Appendix F. Modem Configurations 519 "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F&E2E0T\r" # Reset to factory defaults. # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "AT&SF1&S0S9=1\r" # DSR independent of CD. # Force DSR on. # CD respond time=100ms expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=2\r" # Numberic response code # Answer on 2nd ring expect "0\r" timeout 2 # Confirm commands successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pickup 2nd ring or timeout # Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number expect "0\r" timeout 60 # Confirm successful command. delay 2 # Wait before hanging up. send "ATH0\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&F&E2E0T\r" # Reset to factory defaults. # Reliable mode # Echo off ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command. send "AT&SF1&S0S9=1\r" # DSR independent of CD. # Force DSR on. # CD respond time=100ms 520 Service Guide expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. send "ATV0S0=0\r" # Numeric response code # Auto Answer OFF expect "0\r" timeout 2 # Confirm commands successful. done # error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Sample File modem_m1.cfg ## COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP modem_m1 ## FUNCTIONS: Modem configuration file specifically for IBM 7857-017 modem with # Auto-Reliable feature. This feature must be turned off for Catcher calls. # This example uses the AT&F reset command to choose the factory defaults. ## To allow dial commands for digital pagers, it is necessary to reduce # the number of characters in the dial command. Each comma (delay) has # been set to 6 seconds (S8=6) for that reason. ### (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM ## US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # ICDelay 1 DefaultTO 10 CallDelay 120 ## %N Call-Out phone number %R Return phone number ### PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2S8=6\r" # Trellis modulation disabled # Retrain with adaptive rate # Set ,=6second expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "ATV0X0S0=0\r" # Numeric response code # AT compatible messages # Auto-Answer disabled expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. expect "1\r" busy "7\r" timeout 60 # Expect a connection response. done Appendix F. Modem Configurations 521 retry: send "A/" # Repeat the previous command. expect "1\r" busy "7\r" timeout 60 # Expect a connection response. done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2\r" # Trellis modulation disabled # Retrain with adaptive rate expect "0\r" or "OK\r\n" timeout 2 # Confirm commands successful send "ATV0X0S0=2\r" # Numeric response code # AT compatible messages # Answer on 2nd ring expect "0\r" timeout 2 # Confirm commands successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "1\r" timeout 60 done page: send "ATD%N,%R\r" # %N = pager call center number # commas=6sec wait time to # enter paging number. # %R = return number expect "0\r" or "3\r" timeout 30 # Confirm successful command. delay 2 # Wait before hanging up. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&F*E0E0\r" # Reset to factory defaults. # *E0=data compression disabled # E0=echo disabled ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT#F0*Q2\r" # Trellis modulation disabled # Retrain with adaptive rate expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command. send "ATV0X0S0=0\r" # Numeric response code # AT compatible messages # Auto-Answer disabled expect "0\r" timeout 2 # Confirm commands successful. 522 Service Guide done # error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "4\r" or "3\r" delay 2 done Sample File modem_z.cfg ## COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z ## FUNCTIONS: Modem configuration file for many early Hayes* compatible modems. # This example uses the ATZ reset command to choose the factory defaults. # This setup will work for many modems, but it is required for early vintage # modems which respond to neither the ATZ0 reset command nor the extended (&) # commands. Refer to your modem manual. ## * Trademark of Hayes Microcomputer Products, Inc. ## (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM ## US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. ### If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # Z Reset to factory defaults Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. ## %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) ## Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK ## PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. Appendix F. Modem Configurations 523 done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "ATZQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "ATZQ0T\r" # Reset to factory defaults. 524 Service Guide ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Sample File modem_z0.cfg ## COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP Z0 ## FUNCTIONS: Modem configuration file for some early Hayes* compatible modems. # This example uses the ATZ0 reset command to choose the factory defaults. # This setup is recommended for modems that will respond to the ATZ0 command # and which do not respond to the extended (&) commands. Refer to your modem # manual. ## * Trademark of Hayes Microcomputer Products, Inc. ## (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM ## US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. ### If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # Z0 Reset. Restore Profile 0 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. ## %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) ## Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK ## PROGRAMMING NOTE: No blanks between double quote marks ("). Appendix F. Modem Configurations 525 condout: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number 526 Service Guide # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "ATZ0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Sample File modem_f.cfg ## COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F ## FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F reset command to choose the factory defaults. # This set up is preferred for modems with extended (&) commands. For early # vintage modems, setup Z or Z0 is recommended. If your modem responds to # the extended (&) commands and to factory default choices (&Fn), setup file # F0 or F1 is recommended. ## * Trademark of Hayes Microcomputer Products, Inc. ## (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM ## US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. ### If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F Reset to default profile Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. # Appendix F. Modem Configurations 527 # &C1 Detect CD &D2 Respond to DTR (often the default) ## %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) ## Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK ## PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&FQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&FQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. 528 Service Guide done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&FQ0T\r" # Reset to factory defaults. ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Sample File modem_f0.cfg ## COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F0 ## FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F0 reset command to choose the factory defaults. # This set up is preferred for modems with extended (&) commands. For early # vintage modems, setup Z or Z0 is recommended. If your modem responds to # the extended (&) commands and to factory default choices (&Fn), but doesn’t # work properly with this setup file, setup F1 is recommended. ## * Trademark of Hayes Microcomputer Products, Inc. ## (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM ##US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # Appendix F. Modem Configurations 529 ## If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F0 Reset. Restore profile 0 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. ## &C1 Detect CD &D2 Respond to DTR (often the default) # &R1 Ignore RTS (CTS) ## %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) ## Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK ## PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. 530 Service Guide delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done ripo: send "AT&F0Q0T\r" # Reset modem. Select profile 0 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done Appendix F. Modem Configurations 531 Sample File modem_f1.cfg ## COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP F1 ## FUNCTIONS: Modem configuration file for many recent Hayes* compatible modems. # This example uses the AT&F1 reset command to choose the factory defaults. # This set up is for modems with extended (&) commands and which do not work # properly with setup F0. For early vintage modems, setup Z or Z0 is # recommended. ## * Trademark of Hayes Microcomputer Products, Inc. ## (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM ## US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. ### If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code , Inserts delay in dialing commands # &F1 Reset. Restore profile 1 Q0 Turn on responses # E0 Turn echo off Q1 Turn off responses # V0 Use numeric responses S0=0 Automatic answer inhibit # +++ Escape to command mode S0=2 Answer on second ring # H0 Hang-up X0=0 Limit modem response codes # T = Tone mode. When used as T\r, it is a # no op to maintain program synchronization # when modem may/will echo the commands. ## &C1 Detect CD &D2 Respond to DTR (often the default) # &R1 Ignore RTS (CTS) ## %N Call-Out phone number %P Paging phone number # %S Modem speed (available to users) ## Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; 0=OK ## PROGRAMMING NOTE: No blanks between double quote marks ("). condout: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done 532 Service Guide connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up). ignore "0\r" or "OK\r" timeout 2 # Ignore modem response. send "ATE0Q1\r" # Initialize modem: Echo OFF, # Disable responses. ignore "0\r" timeout 1 done condin: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=2\r" # Set AutoAnswer ON expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done waitcall: ignore "2\r" timeout 1 # Ignore first ring. expect "2\r" timeout 10 # Pick up second ring # or timeout. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" timeout 60 done page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command. done Appendix F. Modem Configurations 533 ripo: send "AT&F1Q0T\r" # Reset modem. Select profile 1 ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "ATE0T\r" # Initialize modem: Echo OFF, expect "0\r" or "OK\r\n" timeout 2 # Enable responses (Numeric), send "ATQ0V0X0T\r" # Limit response codes. expect "0\r" timeout 2 # Confirm commands successful. send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done # RI Power On enabled. error: # Handle unexpected modem # responses. expect "8\r" or "7\r" or "6\r" or "4\r" or "3\r" delay 2 done 534 Service Guide Index A access cover (rack) 366 replacement 366 acoustical noise emissions 472 adapters 379 removal 381, 385 replacement 383, 392 adding FRUs 53 serviceable event comments 52 AIX codes 28 AIX location codes 24 attention LED 56 HMC 56 attention LEDs 465 automatic call home overview 49 B base system drawer will not power on and no error codes are available 45 battery 432 disposal, recycling 471 bezel (286/C4) 377 removal 377 replacement 377 bezels (286/E4) 378 removal 378 replacement 378 boot list 304 boot mode menu 316 boot problems 173 C cable management arm 501 cabling HMC 504 LAN 508 power HMC 508 call-in testing 476 call-in/call-out setup menu 325 call-out testing 476 call-out (call-home) 337 call-out policy setup menu 328 cannot power on the I/O drawer, power present LED does not come on 45 checklist, inventory 481 checkpoints 46, 141 boot problems 173 firmware 150 checkpoints (continued) service processor 143 clearances, service 481 code, error 179 common firmware error codes 286 component LEDs 46 configuration client 303 network 508 NIM server 302 configure memory 335 processor 335 configuring service agent 510 console mirroring 338 enable/disable 310 console strategy 7 covers 366 access cover (rack) 366 removal 371 replacement 372 customer account setup menu 329 D deconfigure memory 335 processor 335 deconfiguring disk drives 405 description, ipl flow 141 devices 510 attached device setup 510 setup 510 diagnostics considerations 297 loading 302 online 298 concurrent mode 299, 300 maintenance mode 300 running service mode 299 running online 298 service mode 298 standalone 301, 302 considerations 301 NIM server 302 partition considerations 301 using 297 diagnostics overview 45 diagram system logic flow 21 disk drive backplane 415 disk drives deconfiguring 405 documentation AIX 300 535 E electrical safety xi laser compliance statement xiii rack 484 emissions noise 472 end of call MAP for systems with service focal point 45 entry MAP 45, 57 error code to FRU index 179 error codes common firmware 286 firmware 192 operator panel 182 problem determination generated 295 service processor 211 SPCN 183 system firmware 286 error logs service processor 339 extended error data managing 52 F fan assemblies 395 removal 397 replacement 399 fan locations 14 fan removal, hot-swap 397 fan removal, non-hot-swap 395 fan replace, hot-swap 399 fan replace, non-hot-swap 396 features, system 2 architecture, bus 2 keyboard 4 media drives 3 memory 3 mouse 4 operator panel 5 ports, input/output 5 ports, serial 5 power supply 4 processors 3 security 6 firmware 334 error codes 192 level 334 update 334 firmware checkpoints 150 firmware updates system 332 flow, IPL 141 front door removal 376 replacement 376 FRU index 179 memory DIMM present bits 285 updating information 52 FRU isolation 47 FRU LEDs activating 53 deactivating 53 FRUs adding 53 replacing 53 G general user menu 306 H handling static–sensitive devices 359 hardware management console 6 hardware service functions 53 activating and deactivating FRU LEDs 53 highlighting xvii HMC 6, 508 attention LED 56 cabling 504, 505 external modem 506 installation 503 keyboard connection 504 monitor connection 504 mouse connection 504 power cords 508 resetting the system attention LED 56 HMC (hardware management console) system error interrogation 54 HMC attached system error interrogation 54 HMC system resetting attention LED 56 hot-plug disk drives 405 removal 406 replacement 408 hot-plug management functions 379 hot-plug manager 379 hot-plug manager menu 380 hot-pluggable PCI adapter installation overview 379 I I/O problem isolation 45 index, error code to FRU 179 indicators, POST 7 information, reference 1 install rack 497 installation HMC 503 instructions, multiple FRU 26 instructions, rack safety 484 inventory 481 inventory scout 508 IPL flow 141 phases 141 536 Service Guide IPL flow without an HMC 142 isolation, FRU 47 J JTAG problem isolation 45 K keyboards 463 keys numeric 1 key 8 L LAN cable 508 language selection menu 324 laser compliance statement xiii laser safety information xiii LCD progress indicator log 340 LEDs 465 attention LEDs 465 component 46, 466 fault LEDs 465 indicator menu 322 operator panel display LEDs 465 resetting 469 location codes 23 AIX 24 format 23 physical 23 locations fans 14 memory DIMMs 18 power supply 12 system board 15, 16 system unit 9 system unit front view 10 system unit rear view 9 logic diagram, GP 21 logic diagram, GQ 22 M maintenance analysis procedures 45 maintenance analysis procedures (MAPs) 57 minimum configuration Map 111 problem determination MAP 64 managing extended error data 52 MAP 1520, power 89 MAP 1521, power 91 MAP 1548, minimum configuration 123 MAP rules 57 quick entry MAP 58 media drives 401 removal 401 replacement 404 memory configure 335 deconfigure 335 module present bits 285 memory configuration/deconfiguration menu 320 memory DIMMs 427 memory DIMMs location 18 memory problem resolution 45 menu boot mode 316 call-in/call-out 325 call-out policy setup 328 customer account setup 329 general user 306 language selection 324 LED indicator 322 main 307 memory configuration/deconfiguration 320 modem configuration 326 OS surveillance setup 310 power control network utilities 321 privileged user 307 processor configuration/deconfiguration 318 reboot/restart policy setup 315 ring indicate 314 serial port selection 326 serial port speed setup 327 setup 309 system information 317 system power control 314 telephone setup 327 messages, service processor checkpoints 143 minimum configuration MAP 111 modem configuration file selection 513 configurations 513 external connection 506 transfer 517 modem_f.cfg, sample file 527 modem_f0.cfg, sample file 529 modem_f1.cfg, sample file 532 modem_m0.cfg, sample file 519 modem_m1.cfg, sample file 521 modem_z.cfg, sample file 523 modem_z0.cfg, sample file 525 N network configuration 508 NIM server configuration 302 client 303 standalone diagnostics 302 noise emissions 472 noise emission 480 Index 537 non-HMC attached system error interrogation 55 O online diagnostics concurrent mode 299, 300 maintenance mode 300 modes of operation 298 running 298 service mode 298 operating considerations online and standalone diagnostics 297 operational phases service processor 341 operator panel 19, 419 removal 419 replacement 420 operator panel display 465 operator panel error codes 182 overview hot-pluggable PCI adapter 379 service focal point 48 overview, diagnostics 45 P parameters service processor service mode 330 partitions considerations standalone 301 parts keyboard 463 passwords 309 general change 310 privileged change 310 phases, IPL 141 physical location codes 23, 28 POST indicators 7 POST keys 8 1 key 8 5 key 8 6 key 8 numeric 1 8 power and lightpath LEDs 46 power cables 43 power control network utilities menu 321 power cords 508 power MAP 45 power supply 439 removal 439 replacement 441 power supply locations 12 power-on methods 330 Power-On Self-Test 7 primary console 518 privileged user menus 307 problem determination generated error codes 295 problem determination MAP 45, 64 problem isolation procedures 45 processor configure 335 deconfigure 335 processor card 423 removal 423 replacement 426 processor configuration/deconfiguration menu 318 processor subsystem problem isolation 45 product disposal 471 Q quick entry MAP 45, 58 quick entry MAP for systems with service focal point 45 R rack setup 479 cable management arm 501 inventory 481 rack hardware 485 rack install 497 rack installation 479 rack safety 484 rack, configure 481 reboot/restart policy setup menu 315 recycling 471 reference information 1 related publications xvii removal 371 adapters 381, 385 bezel (286/C4) 377 bezel (286/E4) 378 caution and danger 357 front door 376 HMC installation 503 hot-plug disk drives 406 hot-swap fan assembly 397 operator panel 419 power supply 439 processor card 423 service access cover 371 static–sensitive devices 359 system board 443 removal and replacement procedures 357 access cover removal 286/E4 371 adapter removal, hot-plug 385 adapter removal, non-hot-plug 381 adapter, adding or replacing a non-hot-plug 383 adapter, hot-plug management functions 379 adapter, hot-plug manager 379 adapter, hot-plug manager menu 380 adapter, install hot-plug 392 adapter, replace hot-plug 388 adapters, PCI 379 back cover removal, 286/E4 372 back cover replacement, 286/E4 373 538 Service Guide removal and replacement procedures (continued) battery, replacing 432 bezel door removal, 286/E4 376 bezel door replacement, 286/E4 376 bezel door, 286/E4 376 bezel removal, 286/C4 377 bezel removal, 286/E4 378 bezel replacement, 286/C4 377 bezel replacement, 286/E4 378 bezels 377 bracket, release latch 363 CD-ROM drive 401 cover removal, disk drive access 373 cover replacement, disk drive access 373 covers 366 devices, handling static sensitive 359 disk drive drive 401 disk drive, configuring 405 disk drive, deconfiguring 405 disk drive, hot-plug removal 406 disk drive, hot-plug replacement 408 disk drives, hot-plug 405 fan removal, hot-swap 397 fan removal, non-hot-swap 395 fan replace, hot-swap 399 fan replace, non-hot-swap 396 fans 395 HMC adapter cabling 506 HMC installation 503 HMC modem connections 506 HMC network, configure 508 HMC power cabling 508 HMC to LAN cabling 508 HMC to modem switch settings 507 HMC, cabling 504 HMC, monitor position 503 inventory scout, configure 508 media drives 401 media removal 401 media replacement 404 memory DIMM removal 427 memory DIMM replacement 431 memory DIMMs 427 operating position, 286/C4 365 outer side cover removal, 286/E4 370 outer side cover replacement, 286/E4 370, 372 PCI adapters 379 power supplies 436 power supply, hot-swap removal 439 power supply, non-hot-swap removal 436 power supply, non-hot-swap replacement 438, 441 processor card 423 processor card removal 423 processor card replacing 426 processor card, adding 426 release latch 362 removal and installation procedure, 286/C4 496 removal, system drawer from rack 496 service access cover removal, 286/C4 366 service access cover replacement, 286/C4 366 service agent, configure 510 removal and replacement procedures (continued) service position, 286/C4 364 starting the system with an HMC 361 starting the system without an HMC 360 stopping the system with an HMC 359 stopping the system with an HMC and Linux 360 stopping the system without an HMC 359 vital product data, collect 509 replacement 372 adapters 383, 392 bezel (286/C4) 377, 378 front door 376 hot-plug disk drives 408 hot-swap fan assembly 399 media drives 401, 404 operator panel 420 power supply 441 processor card 426 service access cover 372 system board 448 top cover (rack) 366 replacing FRUs 53 resetting the system attention LED 56 HMC 56 restart recovery 331 RETAIN 329 retries 329 ring indicate power-on menu 314 S safety notices xi saving service processor settings 475 Self-Test, Power-On 7 serial port selection menu 326 snoop setup menu 311 speed setup menu 327 serial port configurations 5 service agent configuring 510 service clearance 481 service focal point activating and deactivating FRU LEDs 53 adding FRUs 53 call home overview 49 closing a serviceable event 52 enabling surveillance notifications 50 hardware service functions 53 overview 48 replacing FRUs 53 saving and managing extended error data 52 serviceable events 51 setting up surveillance 50 settings 49 testing error reporting 49 updating FRU information 52 viewing and adding serviceable event comments 52 viewing error details 51 viewing service processor error details 52 Index 539 service focal point (continued) viewing serviceable event details 51 viewing serviceable event partition information 53 viewing serviceable events 51 service inspection guide 44 service position 364 service processor 305 access local 305 remote 306 backup settings 475 boot mode menu 316 call-in/call-out setup menu 325 call-out call-home 337 call-out policy setup menu 328 checklist 475 checkpoints 143 console mirroring 338 customer account setup menu 329 error codes 211 error logs 339 general user menu 306 language selection menu 324 main menu 307 memory configuration/deconfiguration menu 320 menus 305 modem configuration menu 326 monitoring - surveillance 336 operational phases 341 OS surveillance setup menu 310 parameters service mode 330 passwords 309 power control network utilities menu 321 privileged user menus 307 procedures in service mode 330 processor configuration/deconfiguration menu 318 reboot/restart policy setup menu 315 reboot/restart recovery 331 ring indicate power-on menu 314 serial port selection menu 326 serial port snoop setup menu 311 serial port speed setup menu 327 settings saving and restoring 306 setup 475 setup checklist 475 setup menu 309 system information menu 317 system power control menu 314 telephone setup menu 327 test 475 service processor checkpoints 143 serviceable event viewing partition information 53 serviceable events adding comments 52 selecting and viewing 51 updating FRU information 52 viewing comments 52 viewing details 51 serviceable events (continued) viewing error details 51 viewing service processor error details 52 working with 51 setup menu 309 slow boot 301 SPCN error codes 183 spcn error codes table directed you here and the error code is 1011 1B0x 45 specifications 42, 479 noise emission notes 480 specifications, power cables 43 standalone diagnostics 301, 302 NIM server 302, 303 partition considerations 301 stopping the system with an HMC 359 stopping the system with an HMC and Linux 360 stopping the system without an HMC 359 strategy, console 7 surveillance monitoring 336 system attention LED 46 HMC attached system error interrogation 54 non-HMC attached system error interrogation 55 resetting attention LED 56 system board 443 removal 443 replacement 448 system board locations 15, 16 system cables 37, 459 system firmware update messages 286 system firmware updates 332 system information menu 317 system logic flow diagram 21 system management services 345 boot devices, select 354 boot options, select 352 error log, view 347 exiting SMS 356 IPL, remote setup 348 IPL, setup 348 multiboot startup 356 password, change options 347 password, privileged-access 347 SCSI settings, change 351 select console 351 select language 346 settings, display current 355 settings, restore default 356 system power-on methods 330 system specifications 479 system VPD update 421 system, stopping with an HMC 359 stopping with an HMC and Linux 360 stopping without an HMC 359 system, starting with an HMC 361 system, starting without an HMC 360 540 Service Guide T telephone setup menu 327 terminal type identifying 298 testing the setup call-in 476 call-out 476 trademarks xviii transfer of a modem session 517 U updating FRU information 52 V viewing serviceable event comments 52 serviceable event details 51 serviceable event error details 51, 52 serviceable event partition information 53 vital product data (VPD) 509 VPD update procedure 421 W web sites firmware yupdates 334 microcode updates 334 Index 541 542 Service Guide Vos remarques sur ce document / Technical publication remark form Titre / Title : Bull ESCALA PL420T and PL420R Service Guide Nº Reférence / Reference Nº : 86 A1 42EG 00 Daté / Dated : February 2003 ERREURS DETECTEES / ERRORS IN PUBLICATION AMELIORATIONS SUGGEREES / SUGGESTIONS FOR IMPROVEMENT TO PUBLICATION Vos remarques et suggestions seront examinées attentivement. Si vous désirez une réponse écrite, veuillez indiquer ci-après votre adresse postale complète. Your comments will be promptly investigated by qualified technical personnel and action will be taken as required. If you require a written reply, please furnish your complete mailing address below. NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : Remettez cet imprimé à un responsable BULL ou envoyez-le directement à : Please give this technical publication remark form to your BULL representative or mail to: BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Technical Publications Ordering Form Bon de Commande de Documents Techniques To order additional publications, please fill up a copy of this form and send it via mail to: Pour commander des documents techniques, remplissez une copie de ce formulaire et envoyez-la à : BULL CEDOC ATTN / Mr. L. CHERUBIN 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE Phone / Téléphone : +33 (0) 2 41 73 63 96 FAX / Télécopie +33 (0) 2 41 73 60 19 E–Mail / Courrier Electronique : srv.Cedoc@franp.bull.fr Or visit our web sites at: / Ou visitez nos sites web à: http://www.logistics.bull.net/cedoc http://www–frec.bull.com http://www.bull.com CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté CEDOC Reference # No Référence CEDOC Qty Qté _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] _ _ _ _ _ _ _ _ _ [ _ _ ] [ _ _ ] : no revision number means latest revision / pas de numéro de révision signifie révision la plus récente NOM / NAME : Date : SOCIETE / COMPANY : ADRESSE / ADDRESS : PHONE / TELEPHONE : FAX : E–MAIL : For Bull Subsidiaries / Pour les Filiales Bull : Identification: For Bull Affiliated Customers / Pour les Clients Affiliés Bull : Customer Code / Code Client : For Bull Internal Customers / Pour les Clients Internes Bull : Budgetary Section / Section Budgétaire : For Others / Pour les Autres : Please ask your Bull representative. / Merci de demander à votre contact Bull. BULL CEDOC 357 AVENUE PATTON B.P.20845 49008 ANGERS CEDEX 01 FRANCE 86 A1 42EG 00 ORDER REFERENCE PLACE BAR CODE IN LOWER LEFT CORNER Utiliser les marques de découpe pour obtenir les étiquettes. Use the cut marks to get the labels. 86 A1 42EG 00 Service Guide ESCALA PL420T and PL420R 86 A1 42EG 00 Service Guide ESCALA PL420T and PL420R 86 A1 42EG 00